Full Text
Full Text
Advisor(s)
Author(s)
Chen, Yongpin.; .
Citation
Issued Date
URL
Rights
2011
http://hdl.handle.net/10722/174463
by
Yongpin CHEN ()
October 2011
Yongpin CHEN
for the degree of Doctor of Philosophy
at The University of Hong Kong
in October 2011
Surface integral equation (SIE) method with the kernel of layered medium Green's
function (LMGF) is investigated in details from several fundamental aspects. A
novel implementation of discrete complex image method (DCIM) is developed to
accelerate the evaluation of Sommerfeld integrals and especially improve the far
field accuracy of the conventional one. To achieve a broadband simulation of thin
layered structure such as microstrip antennas, the mixed-form thin-stratified
medium fast-multipole algorithm (MF-TSM-FMA) is developed by applying
contour deformation and combining the multipole expansion and plane wave
expansion into a single multilevel tree. The low frequency breakdown of the
integral operator is further studied and remedied by using the loop-tree
decomposition and the augmented electric field integral equation (A-EFIE), both
in the context of layered medium integration kernel. All these methods are based
on the EFIE for the perfect electric conductor (PEC) and hence can be applied in
antenna and circuit applications. To model general dielectric or magnetic objects,
the layered medium Green's function based on pilot vector potential approach is
generalized for both electric and magnetic current sources. The matrix
representation is further derived and the corresponding general SIE is setup.
Finally, this SIE is accelerated with the DCIM and applied in quantum optics,
such as the calculation of spontaneous emission enhancement of a quantum
emitter embedded in a layered structure and in the presence of nano scatterers.
Declaration
I declare that the thesis and the research work thereof represents my own work,
except where due acknowledgement is made, and that it has not been previously
included in a thesis, dissertation or report submitted to this University or to any
other institution for a degree, diploma or other qualifications.
Signed
Yongpin CHEN
BY
YONGPIN CHEN
DISSERTATION
Submitted in partial fulfillment of the requirements
for the degree of Doctor of Philosophy in Electrical and Electronic Engineering
in the Graduate School of the
University of Hong Kong, 2011
Hong Kong
ABSTRACT
Surface integral equation (SIE) method with the kernel of layered medium
Greens function (LMGF) is investigated in details from several fundamental
aspects. A novel implementation of discrete complex image method (DCIM)
is developed to accelerate the evaluation of Sommerfeld integrals and especially improve the far field accuracy of the conventional one. To achieve a
broadband simulation of thin layered structure such as microstrip antennas,
the mixed-form thin-stratified medium fast-multipole algorithm (MF-TSMFMA) is developed by applying contour deformation and combining the multipole expansion and plane wave expansion into a single multilevel tree. The
low frequency breakdown of the integral operator is further studied and remedied by using the loop-tree decomposition and the augmented electric field
integral equation (A-EFIE), both in the context of layered medium integration kernel. All these methods are based on the EFIE for the perfect electric
conductor (PEC) and hence can be applied in antenna and circuit applications. To model general dielectric or magnetic objects, the layered medium
Greens function based on pilot vector potential approach is generalized for
both electric and magnetic current sources. The matrix representation is
further derived and the corresponding general SIE is setup. Finally, this
SIE is accelerated with the DCIM and applied in quantum optics, such as
the calculation of spontaneous emission enhancement of a quantum emitter
embedded in a layered structure and in the presence of nano scatterers.
ii
iii
ACKNOWLEDGMENTS
First and foremost, I would like to thank my mentor and thesis adviser,
Professor Weng Cho CHEW, for providing me this opportunity to study
with him. His patient and careful guidance during the last four years has
totally changed my way of thinking, re-shaped my knowledge structure, and
finally built my self-confidence. His enthusiasm, persistence, and diligent
work ethic has inspired me much and will continue to affect my career and
life.
I would also like to thank my co-adviser, Dr. Lijun JIANG, who always
encouraged me when I was frustrated, shared with me his working experience
in industry, and helped me a lot in technical details.
I feel grateful to my former supervisors Professor Zaiping NIE and Professor Jun HU at UESTC, who opened me the door of the fascinating CEM
world, encouraged and supported me to pursue study at HKU, and always
pay kind attention to my recent status.
Many thanks to my friends and colleagues with the Electromagnetics and
Optics Laboratory, to Dr. Wallace C. H. CHOY for providing me TA opportunities and inviting me to audit his course on organic devices, to Dr.
Sheng SUN for giving me suggestions on possible directions about circuit
simulation, to Dr. Wei SHA for sharing his knowledge on modern physics,
to Dr. Yang LIU for teaching me lots of mathematics, to Dr. Yat Hei LO for
sharing his expertise on computer and optics, to Dr. Shaoying HUANG, Dr.
Min TANG, Dr. Osman GONI, Dr. Bo ZHU, Phillip ATKINS, Peng YANG,
Qi DAI, Jun HUANG, Zuhui MA, Shiquan HE, Yumao WU, Yan LI, Nick
HUANG, Ping LI for their help and friendship.
I also wish to thank alumni from both UIUC and UESTC. A special thanks
to Dr. Yuan LIU for helping me a lot when I first came to HKU, though we
have never met with each other, to Dr. Liming XU for sharing his code and
teaching me the transmission line method, to Dr. Jie XIONG for sharing her
iv
TABLE OF CONTENTS
ix
DISCRETE
. . . . . . . .
. . . . . . . .
. . . . . . . .
. . . . . . . .
. . . . . . . .
. . . . . . . .
.
.
.
.
.
.
1
1
3
. 4
. 4
. 6
. 9
. 14
. 15
24
30
36
CHAPTER 4 REMEDIES FOR LOW FREQUENCY BREAKDOWN OF INTEGRAL OPERATOR IN LAYERED MEDIUM
4.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . .
4.2 Loop-Tree Decomposition and Frequency Normalization . .
4.3 Basis Rearrangement via Connection Matrix . . . . . . . .
4.4 Numerical Results . . . . . . . . . . . . . . . . . . . . . . .
4.5 Augmented Electric Field Integral Equation (A-EFIE) . .
37
37
39
42
45
45
vi
.
.
.
.
.
.
.
.
.
.
.
.
19
19
21
4.6
4.7
4.8
4.9
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
51
56
59
60
CHAPTER 5 A NEW LAYERED MEDIUM GREENS FUNCTION (LMGF) FORMULATION FOR GENERAL OBJECTS .
5.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . .
5.2 Dyadic Form of LMGF . . . . . . . . . . . . . . . . . . . .
5.3 Matrix Representation of LMGF . . . . . . . . . . . . . . .
5.4 Duality Principle of LMGF . . . . . . . . . . . . . . . . . .
5.5 Surface Integral Equation . . . . . . . . . . . . . . . . . .
5.6 Numerical Results . . . . . . . . . . . . . . . . . . . . . . .
5.7 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
63
63
65
73
82
84
85
89
.
.
.
.
.
.
.
.
90
90
91
95
. . 98
. . 101
. . 105
vii
LIST OF TABLES
viii
LIST OF FIGURES
. . 6
. . 11
. . 11
. . 13
. . 15
. . 16
. . 16
. . 17
. . 17
23
23
28
30
32
32
33
33
34
34
35
3.12 Memory requirement (solid line) and CPU time per iteration (dash line) versus number of unknowns. . . . . . . . . . 35
4.1
4.2
4.3
4.4
4.5
4.6
4.7
4.8
4.9
4.10
4.11
4.12
41
43
44
44
53
54
54
55
55
57
60
61
4.13 The capacitance of the circular parallel plate capacitor. AEFIE I represents the capacitance extracted from current,
while A-EFIE Q means the capacitance extracted from
charge. The A-EFIE current suffers from an inaccuracy
problem while the A-EFIE charge is stable. The result
agrees with the static solver. Both are further validated
by the analytic solution. When the frequency is below 1
MHz, the relative error of A-EFIE Q is around 0.1%. . . . . . 61
5.1 A homogeneous dielectric object is embedded in a layered
medium. The external excitation is either a plane wave or
a Hertzian dipole. Equivalent electric and magnetic currents are induced on the boundary which then generate the
scattered field. . . . . . . . . . . . . . . . . . . . . . . . . .
5.2 An electric or magnetic dipole is radiating in a seven-layer
medium (unit: m). The layered medium is both dielectric
and magnetic, and the layer constants are shown in the
figure. The source point is in Layer 2 and the observation
line is in Layer 5. . . . . . . . . . . . . . . . . . . . . . . .
5.3 The magnitude of electric field generated by an electric
dipole. The dipole polarization is ( = 20o , = 30o ) and
is working at f = 300 MHz. The result is validated by the
transmission line method (solid line). . . . . . . . . . . . .
5.4 The magnitude of electric field generated by a magnetic
dipole. The dipole polarization is ( = 20o , = 30o ) and
is working at f = 300 MHz. The result is validated by the
transmission line method (solid line). . . . . . . . . . . . .
5.5 Cases where testing line integral exists. The testing function is straddling the interface, all radiation from the RWG
or half-RWG basis functions (triangles) in color needs invoking testing line integral, while radiation from others
does not need this line integral. . . . . . . . . . . . . . . .
5.6 Line integral test. The testing function is at the top interface. Radiation from: basis function 1: no line integral
activated; basis function 2: testing line integral activated;
basis function 3: both testing and basis line integrals activated. (unit: m). . . . . . . . . . . . . . . . . . . . . . . .
5.7 A homogeneous capsule structure embedded in a 5-layer
medium, where h = 0.6, r = 0.15 (unit: m). . . . . . . . .
5.8 The scattered field inside the object with = 2 and = 2.
Since there is no contrast, the scattered field recovers the
incident field (solid line). . . . . . . . . . . . . . . . . . . .
xi
. . 65
. . 73
. . 74
. . 74
. . 80
. . 80
. . 86
. . 86
xii
. . 87
. . 87
. . 88
. . 88
. . 89
. . 96
. . 99
. . 100
. . 103
. . 103
. . 104
. . 104
. . 105
LIST OF ABBREVIATIONS
SIE
LMGF
PEC
CEM
Computational Electromagnetics
FDM
FEM
MoM
Method of Moments
GPR
Ground-Penetrating Radar
LED
Light-Emitting Diode
GMRES
DCIM
PML
RFFM
SBC
SIP
TE
Transverse Electric
TM
Transverse Magnetic
MLFMA
AIM
CG-FFT
MLMDA
ACA
RCS
EBG
CPU
PC
Personal Computer
A-EFIE
CAD
Computer-Aided Design
RWG
Rao-Wilton-Glisson
CCIE
SPIE
MPIE
CNE
LT
Loop-Tree
2D
Two-Dimensional
3D
Three-Dimensional
VIE
BOR
Body of Revolution
TL
Transmission Line
PMCHWT
Poggio-Miller-Chang-Harrington-Wu-Tsai
EFIE
MFIE
SE
Spontaneous Emission
SPPs
LSP
LDOS
EM
Electromagnetics
QED
Quantum Electrodynamics
PC
Photonic Crystal
SER
DOS
Density of States
OLED
MEMS
Microelectromechanical Systems
GIBC
xv
CHAPTER 1
INTRODUCTION
1.1 Motivation
With the development of computer technology and scientific computation,
numerical simulation becomes as important as theoretical study and experimental verification in understanding the nature of our world. Computational
electromagnetics (CEM) is one branch of numerical techniques to model all
electromagnetic phenomena governed by the Maxwells equations, covering
from circuit engineering, microwave and antenna engineering, to optical engineering.
The methods in CEM can be classified into different categories according to
different criteria, such as different computation domains (time or frequency),
or different equation forms (differential or integral) [1]. The time domain
methods solve the time domain EM fields directly, while the frequency domain methods solve the harmonic fields based on Fourier transform. On the
other hand, the differential equation methods solve the Maxwells equations
in differential form, such as finite difference method (FDM) [2], finite element method (FEM) [3], while the integral equation methods first extract
the point response (Greens function) of a differential equation, and solve the
integral equation formulated from equivalence principle (surface or volume),
such as the method of moments (MoM) [4].
Integral equation method in the frequency domain is one of the most powerful methods in analyzing electromagnetic radiation and scattering. The key
of this method is the Greens function, which describes the EM response of
a point source in a specific environment. It satisfies the boundary condition
automatically and hence does not require any domain truncation or absorption boundary condition in the computation. Also, if the surface equivalence
principle is applied, the surface integral equation (SIE) can be obtained,
1
where the unknowns are only associated with the boundary of the scatterers
and thus can be made minimum.
Although the SIE can always be derived mathematically, the Greens functions for arbitrary inhomogeneous medium is not trivial and can only be
obtained numerically, which may be as complex as solving the original problem. For this reason, most research on surface integral equation methods are
focused on homogeneous environment, such as aerospace applications. However, if the inhomogeneity is one dimensional piecewise, or so called layered
medium, the Greens function can be determined analytically in the spectral (Fourier) domain, and the spatial domain counterpart can be obtained
by simply inverse Fourier transforming it [5]. In fact, this scenario consists of a broad class of applications in both microwave and optical regimes,
such as microstrip antennas and microwave circuits, geophysical exploration,
ground-penetrating radar (GPR), solar cell, light-emitting diode (LED), and
lithography, etc.
This thesis is focused on the SIE method with layered medium Greens
function (LMGF), and addresses several fundamental problems of this method.
First, the spatial domain LMGF can only be obtained by numerically integrating the spectral domain counterpart, expressed as a Sommerfeld integral [6]. This integral is oscillatory and slowly convergent, which makes
the repeated evaluation in MoM extremely time expensive. An acceleration
technique at the Greens function level is necessary.
Second, the SIE leads to a full matrix, which requires an O(N 2 ) memory
and an O(N 3 ) or O(N 2 ) computational time if a direct inverse such as LU
decomposition or an iterative method such as the GMRES is applied [7] [8]. A
fast algorithm is indispensable for large scale computation, such as radiation
of an antenna array. The fast algorithm developed shall also cover a broad
frequency range to achieve a broadband simulation.
Third, when the structure under investigation is much smaller then the
wavelengthwhich is common in circuit simulationthe integral operator
may suffer from a low frequency breakdown. Remedies shall be proposed to
overcome this difficulty for the specific integration kernel of LMGF.
Fourth, though the commonly used metal is a good conductor at the radio
or microwave frequencies and the electric field integral equation (EFIE) [9]
for a perfect electric conductor (PEC) can be implemented in most applications, the ability of dielectric modeling is important in other situations such
2
CHAPTER 2
A NOVEL IMPLEMENTATION OF
DISCRETE COMPLEX IMAGE METHOD
(DCIM)
A novel implementation of discrete complex image method (DCIM) based
on the Sommerfeld branch cut is proposed to accurately capture the far-field
behavior of the layered medium Greens function (LMGF) as a complement
to the traditional DCIM. By contour deformation, the Greens function can
be naturally decomposed into branch-cut integration (radiation modes) and
pole contributions (guided modes). For branch-cut integration, matrix pencil
method is applied, and the alternative Sommerfeld identity in terms of kz
integration is utilized to get a closed-form solution. The guided modes are
accounted for with a pole-searching algorithm. Both one-branch-cut and
two-branch-cut cases are studied. Several numerical results are presented to
validate this method. For the sake of completeness, the LMGF formulation
will also be briefly summarized in this chapter.
2.1 Introduction
Sommerfeld first investigated a dipole radiating above a half space using
Hertzian potential [6]. This problem was further studied and extended to
general multilayered medium by various researchers [12][15], [5]. There are
several approaches to derive the layered medium Greens function (LMGF),
such as the popular transmission line analog [16], [17], Hertzian potential
approach [6], [18], Ez Hz formulation [19], [20], and pilot vector potential
approach [5], [21], etc. No matter which method is applied, however, the
LMGF can only be expressed as an infinite, oscillatory and slowly-convergent
integral, Sommerfeld integral, making the numerical evaluation process very
inefficient. Several methods have been developed to expedite the evaluation of the LMGF, such as the function approximation approach [22], [23],
the path deformation technique [5], [24], the tabulation and interpolation
4
method [25], [26], the perfectly matched layers (PML) method [27], [28], the
singularity subtraction method [29], and the fast all-modes combined with
numerical modified steepest descent path method [30], [31] etc.
Function approximation in the spectral domain is one of the most popular
methods. In this method, the integration kernel is first approximated by
certain simple functions, and the integral is then evaluated in a closed
form by applying relevant integration identities. Though lots of function
approximation techniques are available from a numerical analysis point of
view, those candidates with closed-form identities of the infinite integrals
in our context can finally be utilized. This leads to the following methods:
the complex discrete complex image method (DCIM) [22], [32] (based on the
complex exponential functions), the rational function fitting method (RFFM)
[23], [33] (based on the rational fraction functions), or their combination [34].
The popular DCIM has unpredictable errors when the interaction is in
the far field region ( 0). The original sampling path cannot effectively
1
2
knm
( z)( z)g
(2.1)
TM
(r, r )
2
where knm
= 2n m . The g TE/TM (r, r ) is expressed as a Sommerfeld inte-
gral,
dk
(1)
H0 (k )F (k, z, z )
(2.2)
kmz k
p 2
(1)
where F (k, z, z ) is the propagation factor [5], kmz = km
k2 , and H0 (k )
is the first kind Hankel function of order 0 (the time convention is assumed
to be eit ).
i
g(r, r ) =
8
(2.3)
mR
m,m+1 R
m,m1 eikmz (2dm+1 2dm +z z)
+ M
mR
m,m1 R
m,m+1 eikmz (2dm+1 2dm z +z)
+ M
where the generalized reflection coefficients and Fresnel reflection coefficients
[5] are
iki+1,z (2di+2 2di+1 )
Rij =
p=
(2.5)
pj kiz pi kjz
pj kiz + pi kjz
(2.6)
,
,
(2.7)
TE
TM
and
B. n > m
h
i1
m = 1 R
m,m1 R
m,m+1 e2ikmz (dm+1 dm )
M
(2.8)
The observation layer is above the source layer. There is only secondary
field. Then
h
i
ikmz (dm+1 z )
ikmz (dm+1 2dm +z )
F (z, z ) = e
+ Rm,m1 e
(2.9)
h
i
iknz (zdn )
iknz (2dn+1 dn z)
e
+ Rn,n+1 e
Mm Tmn
where
n1
Y
Tmn =
j=m+1
ik (d d )
e jz j+1 j Sj1,j Sn1,n
Tj1,j
.
Sj1,j =
(2.10)
(2.11)
C. n < m
The observation layer is below the source layer. There is also only secondary field. Then
h
i
m Tmn
e
+ Rn,n1e
M
F (z, z ) =
where
Tmn =
m1
Y
j=n+1
Sj+1,j =
ik (d d )
e jz j+1 j Sj+1,j Sn+1,n
Tj+1,j
(2.12)
(2.13)
(2.14)
the detailed derivation can be found in [21]. In the electric field integral
equation (EFIE) formulation, there are five scalar Greens functions in the
matrix-friendly formulation.
gss (r, r ) = k2 g TE (r, r)
(2.15)
2
gzz (r, r ) = kmn
g TM (r, r ) z z g TE (r, r)
n
gz1 (r, r) =
z g TM (r, r ) + z g TE (r, r)
m
m
gz2 (r, r ) =
z g TM (r, r ) + z g TE (r, r)
n
(2.16)
g (r, r) =
z z TM
g (r, r ) g TE (r, r)
2
knm
(2.17)
(2.18)
(2.19)
where the partial derivative with respect to z and z can be easily implemented in the spectral domain, namely, z = iknz and z = ikmz where
the signs are determined by the relative positions of the source and observation points.
dk
k (1)
H (k )
g (k ).
kz 0
(2.20)
M
X
ai eikz bi
(2.21)
i=1
dk
p
k (1)
H0 (k )eikz z , r = 2 + z 2
kz
9
(2.22)
M
X
i=1
ai
eikri
, ri =
4ri
2 + b2i .
(2.23)
The complex exponential series can be obtained by, for example, the matrix
pencil method [38], which approximates a function with real argument by
y(t) =
M
X
Ri eSi t .
(2.24)
i=1
(2.25)
iSi T0
, ai = Ri eikbi
k(1 iT0 )
(2.26)
The far field prediction of the traditional DCIM is poor, and various remedies
have been proposed [35][37], [40].
10
11
Z
k (1)
i
gbranch =
dk
H (k )
g (k )
8 SBC
kN z 0
1 X k,q (1)
gpole =
H (k,q )Res [
g (k,q )]
4 q kN z,q 0
(2.27)
(2.28)
(2.29)
H0 (k )
2 i(k 4 )
e
(k )
k
(2.30)
If k is real, we have
gpole
p
1/
(2.31)
The gbranch represents the radiation modes from the branch cut integration,
which can be obtained in closed form by the new DCIM. By transforming
the variable from k to kN z , (2.28) becomes
gbranch
i
=
8
(1)
dkN z H0 (k )
g (k )
with
dkN z =
k
dk
kN z
(2.32)
(2.33)
T0
, 0 t T0 .
2
(2.34)
Si
, ai = Ri eibi T0 /2
i
(2.35)
(1)
dkz H0 (k )eikz z , r =
p
2 + z 2
(2.36)
The radiation modes include spatial wave and the lateral wave, which have
the following asymptotic behavior respectively
gspatial 1/
(2.37)
glateral 1/2
(2.38)
From (2.31), (2.37), and (2.38), we can see that usually the surface wave
dominates in the far field. However, at the interface, the spatial waves of the
primary term and the secondary term cancel each other and the lateral wave
can be observed, if there are no pole contribution for certain cases. This
cancelation in DCIM was first analyzed in details in [40].
(2.39)
where gbranch,N and gpole are similar to those in (2.32) and (2.29), while
gbranch,1 has the form of
gbranch,1
i
=
8
(1)
dk1z H0 (k )
k1z
g(k ).
kN z
(2.40)
which is of 1/ .
The acceleration of DCIM lies in the fact that multiple sourceobservation
points can share the same image coefficients. For the numerical examples
shown in Figure 2.5 and Figure 2.6, we compare the computational time
for calculating 5,000 k0 samples. The central processing unit (CPU) time
is listed in Table 2.1. We can observe that the computation can be much
accelerated by using DCIM.
To validate the two-branch cut case, a 3-layer model with lossy material
14
4
Numerical
DCIM
log10|gss|
0
2
4
6
8
3
0
1
log10(k0)
Figure 2.5: The magnitude of gss versus k0 for the microstrip structure.
The asymptotic behavior is 1/2 , which agrees with the theoretical
prediction.
Table 2.1: CPU time comparison (seconds)
LMGF
Numerical
DCIM
gss
56.82
0.59
74.69
2.44
2.5 Summary
A novel implementation of the DCIM based on Sommerfeld branch cut is
proposed to improve the far field prediction of the LMGF. By contour deformation, the Greens function can be naturally decomposed into the radiation
modes and guided modes. The guided modes can be obtained by a robust
pole-searching algorithm and the radiation modes can be calculated in a
15
0
Numerical
DCIM
log10|g|
2
3
4
5
6
3
0
1
log10(k0)
asymptotic behavior of 1/ .
16
4
Numerical
DCIM
log10|gss|
0
2
4
6
8
3
0
1
log10(k0)
Figure 2.8: The magnitude of gss versus k0 for the 3-layer model. The
asymptotic behavior is 1/2 .
2
0
Numerical
DCIM
log10|g|
2
4
6
8
10
12
3
0
1
log10(k0)
Figure 2.9: The magnitude of g versus k0 for the 3-layer model. The
asymptotic behavior is 1/2 .
17
closed form, so that the evaluation can be made efficient compared to the
direct numerical integration. For small interaction, we simply switch back
to the traditional DCIM to capture the near field. One should note that
in this new implementation, when becomes small, the length of sampling
path in spectral domain increases, and it becomes harder to approximate the
kernel. Efforts can be made to improve this DCIM in the near field, such
as extracting the asymptotic behavior analytically. At the same time, for
cases when poles are very close to the branch cut, the accuracy of function
approximation may be affected and more careful treatment of the poles is
necessary. Such efforts shall also be carried out in the future to improve this
DCIM.
18
CHAPTER 3
MIXED-FORM THIN-STRATIFIED
MEDIUM FAST-MULTIPOLE
ALGORITHM (MF-TSM-FMA)
A mixed-form thin-stratified medium fast-multipole algorithm (MF-TSMFMA) is proposed for fast simulation of general microstrip structures at both
low and mid-frequencies. The newly developed matrix-friendly formulation
of layered medium Greens function (LMGF) is applied in this algorithm.
For well-separated interactions, the contour deformation technique is implemented to achieve a smoother and exponentially convergent integral. The
two-dimensional addition theorem is then incorporated into the integrand to
expedite the matrix-vector product. In our approach, multipole expansion
(low-frequency fast-multipole algorithm) as well as the plane wave expansion
(mid-frequency fast-multipole algorithm) of the translational addition theorem are combined into a single multilevel tree to capture quasi-static physics
and wave physics simultaneously. The outgoing wave is represented first in
terms of multipole expansion at leafy levels, and then switched to plane wave
expansion automatically at higher levels. This seamless connection makes the
algorithm applicable in simulations, where subwavelength interaction (circuit
physics) and wave physics both exist.
3.1 Introduction
The simulation of microstrip structures has attracted intensive study for
many years. Since this kind of structure usually involves a thin layered
medium, the LMGF can be applied to reduce the number of unknowns. All
the acceleration techniques introduced at the beginning in Chapter 2 can be
implemented to expedite the process during the matrix filling stage in the integral equation method. However, since the integral equation method leads
to a full matrix, the memory consumption is large and the matrix-vector
product is still time-consuming in iterative solvers. Fast algorithms are in19
simultaneously. Motivated by the idea of mixed-form fast multipole algorithm in free space [70], a mixed-form thin-stratified medium fast-multipole
algorithm (MF-TSM-FMA), based on the newly-formulated LMGF [21], is
developed in this chapter. In this MF-TSM-FMA, the multipole expansion
and the plane wave expansion are combined into one multilevel tree, where
different scales of interaction can be separated by the multilevel nature of
the MLFMA [71]. A transition equation can be derived to connect these two
expansions.
1
2
knm
( z)( z)g
(3.1)
TM
(r, r )
where
g
TE/TM
i
(r, r ) =
8
dk
(1)
H0 (k )F TE/TM (k , z, z )
kmz k
(3.2)
To obtain a TSM-FMA, the path deformation technique is necessary to improve the convergence of the Sommerfeld integral.
1
=
2
JI () =
P
X
djJ ()
JI ()Ii ()
(3.3)
Hp(1) (k JI )eip(J I 2 )
(3.4)
p=P
jJ () = eik jJ
Ii () = eik Ii
(3.5)
where J and I are box centers for the observation point j and the source
point i, respectively. Here k is in general complex. In implementing TSMFMA, the z-dependent propagation factor F (z, z ) can be easily factorized
and embedded into the radiation and receiving patterns [67].
H0 (k ) Ceik0 eu
(3.6)
2
23
Matrix-Friendly LMGF
The matrix element in the MoM procedure can be expressed in the matrixfriendly LMGF as [21]
z2
z1
ss
zz
+ Zji
+ Zji
Zji = Zji
+ Zji
+ Zji
(3.7)
ss
Zji
= im hfs (rj ), gss (rj , ri ), fs (ri )i
(3.8)
zz
Zji
= im h
z f(rj ), gzz (rj , ri ), z f(ri )i
(3.9)
where
z1
Zji
= im h
z f(rj ), gz1 (rj , ri ), f(ri )i
(3.10)
z2
Zji
= im h f(rj ), gz2 (rj , ri ), z f(ri )i
(3.11)
Zji
= im h f(rj ), g (rj , ri ), f(ri )i
(3.12)
(3.13)
2
gzz (rj , ri ) = kmn
g T M (rj , ri ) z z g T E (rj , ri )
n
gz1 (rj , ri ) =
z g T M (rj , ri ) + z g T E (rj , ri )
m
m
gz2 (rj , ri ) =
z g T M (rj , ri ) + z g T E (rj , ri )
n
(3.14)
g (rj , ri ) =
z z T M
g (rj , ri ) g T E (rj , ri )
2
knm
24
(3.15)
(3.16)
(3.17)
(3.18)
(3.19)
(3.20)
(3.21)
mR
m,m+1
I2 (m) = M
(3.22)
(3.23)
(3.24)
mR
m,m1
I3 (m) = M
(3.25)
(3.26)
(3.27)
(3.28)
m Tmn
I(m, n) = M
(3.29)
B. n > m
where
25
C. n < m
F (z, z ) = fv (z)I(m, n)fr (z )
(3.30)
(3.31)
(3.32)
m Tmn
I(m, n) = M
(3.33)
where
sign in the direct term F1 (z, z ) = eikmz |zz | is a problem, however, we can
show that this modulus sign can be removed in the context of vertical branch
cut integration [67]. After implementing factorization of the propagation
factor, the addition theorem can be applied to accelerate the computation.
However, there are totally ten scalar components that need to be factorized
separately, hence optimization of the patterns are necessary.
= ikmz e
+ Rm,m1 e
h
i
iknz (zdn )
iknz (2dn+1 dn z)
m
iknz e
Rn,n+1 e
Tmn M
(3.34)
Receiving A
Receiving B
g T E (r )
x f(r )
x f(r)g T E (r)
TE
TE
(r )
y f(r )
(r ) f(r )
z g T E (r )
z f(r )
g T M (r )
z f(r )
z g T M (r ) f(r )
y f(r)g
TE
f(r)g
(r)
TE
0
z f(r)z g T E (r)
(r)
f(r)g T E (r)
z f(r)g T M (r)
z f(r)g T M (r)
z f(r)z g T E (r)
f(r)z g T M (r)
f(r)z g T M (r)
z1
part of Zji
and Zji
can be accelerated at the same time. If we denote
g (r, r) = g (r)g (r )
(3.35)
then we can set one radiation pattern and two receiving patterns to be
Ir = f(r )g T E (r )
(3.36)
(3.37)
(3.38)
By doing this, two interactions can share the same radiation patterns,
outgoing waves and incoming waves. However, minor modification is needed
in the last stage, where incoming waves at the leafy level are disaggregated
into observation points by using different receiving patterns. The pattern
layout is listed in Table 3.1. By categorizing the patterns, one may only
need to calculate six kinds of interactions instead of ten.
XX
M
(3.39)
(1)
(3.40)
(3.41)
H0 (k ji ) =
XX
M
(3.42)
Though this multipole expansion is equivalent to the plane wave expansion mathematically, the former is stable in the low frequency regime since
normalization can be easily implemented to balance the and , to make
each numerical step error controllable. For the same setup as in Figure 3.1,
the relative error of the multipole expansion is shown in Figure 3.3. It is
obvious that the expansion is stable at low-frequencies. However, when the
frequency increases, the multipole expansion becomes very inefficient since
more and more expansion terms are required to capture the wave physics.
So it should be switched back to plane wave representation to describe the
dynamic field.
28
JI ()Ii ()ei(+ 2 )n
dei(+ 2 )m jJ ()
(3.43)
This equation combines the multiple expansion and plane wave expansion.
Substituting it into (3.39), we can construct a mixed-form TSM-FMA. At
leafy levels, the box size is much smaller compared to the wavelength, where
interaction is in the sub-wavelength regime, the outgoing wave is calculated
by multipole expansion. With the process of aggregation, the box size becomes large enough, where the interaction may enter the dynamic regime, the
outgoing wave is switched into the plane wave representation to capture the
wave physics. Similar idea can be applied in the process of disaggregation.
In matrix notation, the MF-TSM-FMA for a certain k is expressed as,
H0 (k ji ) = [jJ1 ]1P1 [J1 J2 ]P1 P2 [T ]P2 K3 [J2 J3 ]K3 K3
(1)
T = ei(+ 2 )n KP
(3.45)
d = 1.59 mm, and the relative permittivity and permeability are r = 2.2
and r = 1. The radiation patterns are calculated by feeding at the input
port at the working frequency of f = 9.42 GHz. The patterns in both Eplane ( = 0o ) and H-plane ( = 90o ) are shown in Figure 3.6. The results
agree well with the results extracted from [36]. Here, the box size at the
leafy level is 0.30 , and a five-level FMA is set up. Since the switch level is
below the leafy level, the MF-TSM-FMA applies the plane-wave expansion
for all levels. In order to activate the low-frequency FMA, we decrease the
frequency to f = 2.5 GHz, and calculate the radar cross section (RCS) of
this microstrip array. The results are shown in Figure 3.7. The leafy level
box size is 0.080 , and the switch level is 4. The number of unknown is 9,394,
and the memory consumption of the MF-TSM-FMA is 40 MB. Next, to show
the capability of modeling vertical structures, a dipole antenna mounted on
a 4 4 mushroom-type electromagnetic band gap (EBG) is simulated, with
its geometrical structure shown in Figure 3.8. The geometrical parameters
are: w = 29 mm, g = 1 mm, d = 1 mm, h = 2 mm, L = 63 mm, W = 1
mm, H = 2 mm. The FR4 is used as the substrate with = 4.4 and
tan = 0.02. The antenna is working at f = 2.2 GHz. The radiation pattern
in the E plane is calculated and shown in Figure 3.9 (in unit of dB). Due
to the layered medium assumption, there is no back lobe and side lobes in
the radiation pattern. A 30 30 microstrip array [47] shown in Figure 3.10
is then simulated at frequency of 2.2 GHz, 1.1 GHz, and 550 MHz, where
a = b = 6 cm, L = 3.66 cm, W = 2.6 cm, the thickness of the substrate is
10
20
30
40
E at = 90
E at = 0
MFTSMFMA
Ref.
50
60
80
60
40
20
0
( )
20
40
60
80
32
35
MoM
MFTSMFMA
40
45
50
55
60
65
70
75
80
50
100
150
200
()
250
300
350
Figure 3.7: The bistatic RCS of the microstrip array at f = 2.5 GHz and
(i , i ) = (60 , 0 ).
33
Figure 3.9: Radiation pattern of the dipole in E plane, due to the layered
medium assumption, the ground plane is infinitely large.
34
40
60
80
100
120
f=2.2 GHz
Ref.
f=1.1 GHz
f=0.55 GHz
140
160
10
20
30
40
50
60
70
80
90
()
300
200
100
10
0
12
4
x 10
Figure 3.12: Memory requirement (solid line) and CPU time per iteration
(dash line) versus number of unknowns.
35
3.5 Summary
A mixed-form thin-stratified medium fast multipole algorithm is developed in
this chapter. By path deformation, the LMGF can be expressed by severalterm summation of Hankel functions weighted by z-dependent propagation
factors. For each quadrature point, the Hankel function can be accelerated
by 2D MLFMA. The matrix-friendly formulation of the LMGF is applied,
and interaction is categorized into six components if we introduce two receiving patterns for one radiation pattern. Due to the numerical instability
of the TSM-FMA in the low frequency regime, a mixed-form TSM-FMA is
proposed for broadband simulation. By utilizing the multilevel nature of
the MLFMA, interactions with different scales are accelerated by different
FMA expansions, namely multipole expansion and plane wave expansion. A
transform equation is derived to embed the two expansions into one MLFMA
tree. Numerical results show the accuracy and efficiency of this algorithm. It
should be noted that in the very low frequency regime, the integral equation
in layered medium may suffer from another kind of low frequency breakdown,
where the existence of the null space of the operator plagues the numerical
procedure. Before modeling large problems in extremely low frequency, one
must develop an efficient algorithm accounting for the two breakdowns at
the same time.
36
CHAPTER 4
REMEDIES FOR LOW FREQUENCY
BREAKDOWN OF INTEGRAL
OPERATOR IN LAYERED MEDIUM
When the working frequency further decreases, the low frequency breakdown
of the integral operatorthe electric field integral equation (EFIE) occurs.
Two remediesthe loop-tree decomposition and the augmented electric field
integral equation (A-EFIE)which were first developed in free space, are
extended to layered medium in this chapter. In the loop-tree decomposition,
the current is decomposed into divergence-free part and non-divergence-free
part according to quasi-Helmholtz decomposition when frequency tends to
zero, in order to capture both capacitance and inductance physics. Frequency normalization and basis rearrangement are applied to stabilize the
matrix system. In the A-EFIE, the traditional EFIE can be cast into a
generalized saddle-point system, by separating charge as extra unknown list
and enforcing the current continuity equation. Frequency scaling for the
matrix-friendly layered medium Greens function (LMGF) is analyzed when
frequency tends to zero. Rank deficiency and the charge neutrality enforcement of the A-EFIE for LMGF is discussed in detail. The electrostatic limit
of the A-EFIE is also analyzed. Without any topological loop-searching algorithm, electrically small conducting structures embedded in a general layered
medium can be simulated by using this new A-EFIE formulation.
4.1 Introduction
Computational electromagnetics becomes indispensable as a computer-aided
design (CAD) methodology in various electrical engineering applications,
such as in integrated circuit and wireless communication device. The operating frequency of the electrical systems keeps on increasing to several
gigahertz, meanwhile fabrication process has achieved nanoscale. Hence, a
broadband simulation tool is badly needed for capturing circuit physics of
37
the tiny structures as well as wave physics for the whole package. Unfortunately, however, in the sub-wavelength regime, or so-called low frequency
regime, the electric field and magnetic field decouple, and the total current
in Maxwells equations decomposes into a divergence-free part and a curlfree part, with different frequency scaling properties. In this situation, the
commonly used EFIE method solved by the method of moments (MoM)
[4] with the Rao-Wilton-Glisson (RWG) basis function [73] suffers from a
low frequency breakdown problem, where vector potential gradually looses
its significance compared with the scalar potential part when the frequency
decreases, and the EFIE operator becomes singular [1]. Various approaches
have been proposed to overcome this problem in the last few years. One of the
most popular remedies is the loop-tree or loop-star decomposition [74], [75],
where the solenoidal and irrotational components of the unknown current can
be separated due to the quasi-Helmholtz decomposition (also known as Hodge
decomposition), to capture inductance physics and capacitance physics when
the frequency tends to zero. However, even after frequency normalization,
the matrix is still ill-conditioned. Preconditioning is necessary to improve
the convergence when iterative solvers are applied. Several effective preconditioners have been proposed, either based on the basis-rearrangement, where
the favorable property of electrostatic problems is utilized [76], or based on
the near-field interactions, where the incomplete factorization with a heuristic drop strategy is applied [77]. By using the Calderon identity and the dual
basis or Buffa-Christiansen basis function [78], [79], a more effective preconditioner has been constructed [80][82]. The loop-tree or loop-star method
has also been implemented with the LMGF, which is more versatile in the
simulation of printed antenna and planarly integrated circuit [83][85].
However, one big issue associated with the loop-tree or loop-star method
is the loop-search process. It is a bottleneck for complicated interconnecting geometries with increasing number of unknowns, where many entangled
global loops may exist. Situation becomes even worse when layered medium
with conducting ground plane is involved, where extra implicit global loops
are introduced because of the vias. To avoid the loop-search process, the
idea of separating current and charge to construct a stable formulation has
been studied recently. The current and charge integral equation (CCIE)
method [86] puts charges into the extra unknown list and manipulates the
equation system to be of the second kind. While in the separated potential
38
integral equation (SPIE) method [87], the scalar potential is included as the
unknowns, where resistive loss and dielectric loss are introduced to flatten
the condition number when the frequency is low. In the recently developed
augmented electric field integral equation (A-EFIE) method [88][90], the
similar idea of separating current and charge as independent unknowns is
applied. By enforcing the current continuity equation explicitly and implementing a proper frequency scaling, the EFIE can be cast into a generalized
saddle point system [91]. With the help of a constraint preconditioner for
the saddle point system and the mixed-form fast multipole algorithm, a real
package problem with more than one million unknowns has been successfully
solved on a personal computer [89].
In this chapter, we discuss both loop-tree decomposition and the A-EFIE
involving the LMGF.
N
X
i=1
where
fi (r) =
fi (r)Ii = f(r) I
+
i
,
2A+
i
2Ai ,
i
0,
r Ti+
r Ti
(4.1)
(4.2)
otherwise
The RWG basis function is defined on an adjacent triangular patch pair, and
A
i is the area of the two triangles associated with the i-th basis, and the i
is the vector pointing to a point r from the two vertices. Here, the function
is normalized by its edge length for convenience.
excitation. After applying the Galerkin procedure [4] to the EFIE, a matrix
equation can then be set up
I =V
Z
(4.4)
where the matrix elements in the matrix-friendly formulation [21] are shown
in (3.7)(3.17).
When 0, the matrix elements have different frequency scaling behaviors. The detailed analysis can be found in Section 4.5 and here we only list
the scaling for the lossless case.
Z ss O(), Z zz O(), Z z1 O(), Z z2 O(), Z O( 1)
(4.5)
NL
X
fLi (r)ILi +
NT
X
fT i (r)IT i
(4.6)
i=1
i=1
The loop functions can be obtained from the RWG functions [92], and the
tree functions have the same form as the RWG functions. The loop-tree basis
spans the same linear space as the RWG basis. For a general structure with
a hole, loop basis contains both local loop and global loop, as shown in
Figure 4.1.
Applying the loop-tree function as both expansion function and testing
function, the matrix equation then takes the form of
"
LL Z
LT
Z
TL Z
TT
Z
# "
IL
IT
"
VL
VT
(4.7)
where
LL = Z
ss + Z
zz
Z
LL
LL
(4.8)
LT = Z
ss + Z
zz + Z
z1
Z
LT
LT
LT
(4.9)
40
(4.10)
TT = Z
ss + Z
zz + Z
z1 + Z
z2 + Z
Z
TT
TT
TT
TT
TT
(4.11)
LL (O()) Z
LT (O())
Z
T L (O()) Z
T T (O( 1))
Z
# "
IL
IT
"
VL
VT
(4.12)
=
T L (O())
T T (O(1))
Z
ik0 Z
(ik0 )1 IT
VT
(4.13)
The matrix equation behaves much better after frequency normalization
since contribution from ZLL is elevated while the contribution from ZT T is
suppressed and the diagonal elements are dominant.
41
NX
P 1
i=1
NX
P 1
Ni (r)Qi
(4.14)
i=1
R
where Pi (r) is the normalized pulse basis satisfying s dsPi (r) = 1. Here Ni (r)
is the new expansion function to describe charge density in a charge-neutral
system. It can be written in a more compact form,
(r) = Nt (r) Q
(4.15)
Since the loop current does not produce charges, we can connect the tree
current to charge via the continuity equation
t (r) IT = iNt (r) Q
J
T
(4.16)
Taking the inner product of the above equation with P(r), we get
t (r) IT = K
IT = i Q
P(r), J
T
(4.17)
connects the current coefficient with the charge coefThe sparse matrix K
ficient. Substituting (4.17) into (4.13), a system with good convergence can
42
0
LT
RWG
RCS (dBsm)
50
100
150
200
250
0
10
10
10
10
Frequency (KHz)
10
10
LL
LT K
1
(ik0 )1 Z
Z
t )1 Z
T L ik0 (K
t )1 Z
TT K
1
(K
# "
IL
Q
"
(ik0 )1 VL
t )1 VT
(K
(4.18)
N N = N, g, Nt
Z
(4.19)
T T = f , g, f t
Z
(4.20)
t N = f
K
(4.21)
NN = (K
t )1 Z
TT K
1
Z
(4.22)
while the matrix in terms of the tree function or the RWG function f is
Since
we can get
43
10
LT + Free Space
LT + Layered Medium
6
10
10
10
10
10
10
1
10
10
10
10
Frequency (KHz)
10
10
44
r ) = I +
G(r,
g(r, r)
(4.23)
2
k0
where k0 is the wave number in free space and g(r, r) is the solution to the
scalar Helmholtz wave equation with a point source,
g(r, r) =
eik0 |rr |
4|r r |
45
(4.24)
Again we first use the RWG basis function [73] shown in (4.2) to expand the
induced current. In free space, the mixed potential integral equation (MPIE)
is favorable since the scalar vector potential and scalar potential are well
defined under the Lorentz gauge, and the operator in the Greens function
can be easily moved to the RWG basis function. The surface divergence of
the RWG basis is analytical.
s fi (r) =
1
A+
i
A1
i
0,
r Ti+
r Ti
(4.25)
otherwise
After applying the Galerkin procedure [4], a matrix system shown in (4.4)
can be obtained and the matrix can be expressed as
= ik0 0 A
+ 0 S
Z
ik0
(4.26)
where 0 is the free space wave impedance. The A and S correspond to the
magnetic vector potential and electric scalar potential
ji = hfj (r), g(r, r), fi (r )i
[A]
(4.27)
(4.28)
From (4.26)(4.28), we can see that the vector potential block and the scalar
potential one are imbalanced when the frequency is low, namely k0 0,
since they are in different frequency order. The EFIE operator becomes
singular because any divergence-free current is a solution to the EFIE in the
quasi-static limit,
I0
S
(4.29)
To balance the system, the charge can be separated and added into the
unknown list to make the system stable in an augmented fashion [88], [89].
We define the normalized pulse basis function on each triangular patch as
pi (r) =
1
Ai
0,
46
r Ti
otherwise
(4.30)
(4.31)
we can obtain the relationship between the patch-pair based (in terms of
divergence of RWG basis) scalar potential matrix and the patch-based one
=D
T P
D
(4.32)
[D]ji =
1, Patch j is the negative part of RWG i
0,
otherwise
(4.33)
(4.34)
where c0 is the light speed in vacuum and is the charge unknowns. Substituting the above equations into the EFIE matrix equation, and enforcing the
current continuity equation explicitly, we can arrive at the following A-EFIE
system
"
# "
# "
#
D
T P
A
ik0 J
01 V
=
(4.35)
D
k02I
c0
0
This equation is the generalized saddle point system with the lower right
block nearly equals to zero and various methods can be applied to solve this
problem efficiently [91].
47
(4.36)
(4.37)
zz ]ji = h
[Z
z fj (r), gzz (r, r ), z fi (r )i
(4.38)
z1 ]ji = h
[Z
z fj (r), gz1 (r, r), fi (r )i
(4.39)
(4.40)
(4.41)
where
Frequency Scaling
Since the primary (direct) term can be analyzed in the similar way as in free
space, only secondary (reflected or transmitted) terms are considered in this
section. We first assume that the layered medium is lossless. For general
case, namely i 6= j and i 6= j , when 0, kiz ik , the frequency
scaling of the Fresnel reflection coefficient is
Ri,j =
pj pi
pj kiz pi kjz
=
O( 0)
pj kiz + pi kjz
pj + pi
(4.42)
where p = for TE wave and p = for TM wave. Then we can get the
frequency scaling for other quantities:
i,j O( 0), M
m O( 0), Tmn O( 0)
R
(4.43)
(4.44)
48
We can separate the matrix into two parts according to the frequency scaling
ss
= mr Z
+Z
zz + Z
z1 + Z
z2
A
(4.46)
= knm Z
S
nr
(4.47)
= ik0 0 A
+ 0 S
Z
ik0
(4.48)
O( 0), S
O( 0)
A
(4.49)
where
Equation (4.48) has the same form as Equation (4.26) in free space, which
allows us to augment the EFIE in a similar fashion as in (4.35).
Since most material is non-magnetic, namely i = j , we discuss this
situation separately. The frequency scaling of the Fresnel reflection coefficient
for TE wave is a high order term of frequency,
TE
Ri,j
=
kiz kjz
j kiz i kjz
=
O( 2)
j kiz + i kjz
kiz + kjz
(4.50)
Then we have
T E O( 2)
R
i,j
(4.51)
T E O( 0), T T E O( 0)
M
m
mn
(4.52)
i
0
(4.53)
(4.54)
A
J
(ik0 0 )1 V
D
ik0I
c0
0
(4.55)
(4.56)
Consistency Validation
We discuss two extreme cases to analyze the consistency of the A-EFIE for
layered medium Greens function. One with homogeneous layers (free space)
and the other with perfect electrical conductor (PEC) layer (half space), both
of which have closed form Greens functions.
In free space, the EFIE can be separated into two parts, the vector potential part and the scalar potential part, both of which are scalar problems
with scalar Greens function, because of the homogeneity of the medium,
shown in (4.26)-(4.28). However, in the layered medium, the response of a
dipole is polarization dependent. A vertical electric dipole can only generate a TM wave, while a horizontal electric dipole generates TE as well as
TM waves. The polarization dependence leads to the difficulty in defining
ss
a uniform scalar potential. In the matrix friendly formula, we can see Z
zz manifest the different response of a horizontal and a vertical dipole
and Z
in a layered medium. If we asymptotically make the inhomogeneity disappear, namely, making mr 1 and mr 1 for each layer, and applying the
Sommerfeld identity
i
g(r, r ) =
4
dk
0
J0 (k )eikz |zz |
kz
(4.57)
ss and Z
zz recover the polarization independent vector potential in
the Z
, it goes back to the scalar potential
(4.27). Similar situation holds for the Z
part in (4.28) when removing the inhomogeneity. The physical meanings of
50
z1 and Z
z2 are ambiguous due to the lack of exact
the remaining two terms Z
definition of scalar potential. There is no correspondence in free space, since
the TE and TM waves cancel each other when the layered medium degrades
into a free space. By appearance, we can interpret it as the cross interaction
between charge and the vertical current. According to their same frequency
ss and Z
zz , we can group them together to obtain the layered
scaling with Z
medium A-EFIE, as is done in (4.46).
For a half space with a PEC layer, the image method can be applied and
the dyadic Greens function can be expressed in a closed form [12]
1
r ) = I [g(r, r) gi (r, r )] + 2
G(r,
z zgi (r, r )
k02
(4.58)
where g(r, r) and gi (r, r ) is the free space scalar Greens function with real
and S
defined as
source point r and image source point r , with A
ji = hfj (r), g(r, r) gi (r, r ), fi (r )i + 2h
[A]
z fj (r), gi (r, r), z fi (r )i (4.59)
ji = h fj (r), g(r, r) gi (r, r), fi (r )i
[S]
(4.60)
T P
B
A
D
D
F
k02Ir
# "
ik0 J
c0 r
"
01 V
0
(4.61)
r
=B
52
(4.62)
18
10
With CNE
Without CNE
16
10
14
Condition number
10
12
10
10
10
10
10
10
10
10
10
Frequency (Hz)
10
10
Figure 4.6: The condition number versus frequency for the rectangular loop.
The condition number is unbounded when decreasing the frequency. Charge
neutrality enforcement (CNE) makes the condition number constant.
x 10
With CNE
Without CNE
3
2
1
0
1
2
3
0
3
4
5
Real part of eigenvalues
9
3
x 10
Figure 4.7: The eigenvalue distribution for the rectangular loop at 1 Hz.
The smallest eigenvalue is removed away from the origin after the charge
neutrality enforcement (CNE).
54
Figure 4.8: The geometrical model of the half loop embedded in a five-layer
medium (including the PEC layer), unit: mm. A delta gap excitation is
applied at the center of the top arm.
5
3.04
x 10
Condition number
3.038
3.036
3.034
3.032
Without CNE
3.03
0
10
10
10
Frequency (Hz)
10
10
Figure 4.9: The condition number versus frequency for the half loop. Since
it is connected to the ground plane, charge neutrality cannot be
guaranteed. The condition number is bounded when decreasing the
frequency without any special treatment.
matrix in this situation.
=B
= I
F
(4.63)
One should note that though the number of unknowns in A-EFIE increases
knowns is the same as the number of RWG basis, the memory requirement
F
and B
(4.65)
(r) = 0 r S
(4.66)
(4.67)
where is the surface charge density, Gs (r, r ) is the static layered medium
Greens function [94] and 0 is the potential generated by the source s(r). In
this limit, a typical problem is the parallel plate capacitor, shown in Figure
4.10. Here we connect the two plates with a narrow strip so that we can
apply the electrodynamic analysis.
In this limit, the A-EIFE suffers from an inaccuracy problem, even though
the matrix is nonsingular, because the current is a higher order term in
frequency. To capture the current accurately to arbitrary order, the perturbation method should be applied [95]. We will show that the charge is always
stable and describes the electrostatic physics. In DC, the A-EFIE becomes
"
s D
T P
s B
A
D
F
0
# "
ik0 J
c0 r
"
01 V
0
(4.68)
s and P
s are matrix evaluated at k0 = 0. For this problem, the
where A
current J disappears, while the charge remains constant,
J 0, c
(4.69)
Since the matrix is still full rank, we have unique solution. However, the
56
z (mm)
50
0
50
1000
500
0
500
x (mm)1000
1000
500
500
1000
y (mm)
Figure 4.10: The geometrical model of the circular parallel plate capacitor,
with a dielectric layer (r = 2.65) inserted in between. A delta gap is
applied at the edge. The mesh is refined to capture the fringing effect.
current is no longer correct due to the finite numerical precision. If we
discard the ik0 J term manually, we have
T P
s B
r = 0 V
D
(4.70)
1
4|r r |
(4.71)
In a layered medium, we will show the static form of the general matrix s in the following subsection, which agrees with
friendly Greens function in P
that of [94] applied in the electrostatic analysis except for minor differences
such as the layer index, constant 1/40 and the sign definition in the Fresnel
reflection coefficient.
57
where the Greens function is described by several images with weight i and
distance Zi
A. m = n
There are one primary term and four image terms.
0 = 1, Z0 = |z z |
(4.73)
m,m1 M
m , Z1 = 2dm + z + z
1 = R
(4.74)
m,m+1 M
m , Z2 = 2dm+1 z z
2 = R
(4.75)
m,m+1 R
m,m1 M
m , Z3 = 2hm + z z
3 = R
(4.76)
m,m+1 R
m,m1 M
m , Z4 = 2hm z + z
4 = R
(4.77)
B. m < n
There are four image terms.
+
1 = Tmn
Mm , Z1 = z + z
(4.78)
n,n+1 T + M
2 = R
mn m , Z2 = 2dn+1 z z
(4.79)
m,m1 T + M
3 = R
mn m , Z3 = 2dm + z + z
(4.80)
+
m,m1 R
n,n+1 Tmn
4 = R
Mm , Z4 = 2dm + 2dn+1 + z z
(4.81)
C. m > n
There are also four image terms.
1 = Tmn
Mm , Z1 = z z
(4.82)
n,n1 T M
2 = R
mn m , Z2 = 2dn + z + z
(4.83)
m,m+1 T M
3 = R
mn m , Z3 = 2dm+1 z z
(4.84)
58
m,m+1 R
n,n1 T M
4 = R
mn m , Z4 = 2dm+1 2dn z + z
where,
(4.85)
i+1,i+2 e2hi+1
Ri,i+1 + R
Ri,i+1 =
i+1,i+2 e2hi+1
1 + Ri,i+1 R
(4.86)
2hi1
(4.87)
m =
M
+
Tmn
=
1
m,m1 R
m,m+1 e2hm
1R
n1
Y
Tj,j+1
j+1,j+2 e2hj+1
1 Rj+1,j R
j=m
Tmn
=
m1
Y
j=n
Tj+1,j
j,j1e2hj
1 Rj,j+1R
hi = di+1 di
(4.88)
(4.89)
(4.90)
(4.91)
10
10
10
10
AEFIE
LT
EFIE
10
10
10
10
10
Frequency (Hz)
10
10
Figure 4.11: The input reactance of the rectangular loop. A-EFIE agrees
well with the loop-tree (LT) decomposition, while the traditional EFIE
breaks down quickly when decreasing the frequency.
last section, while the charge is accurate and agrees with the static analysis.
The analytic result from asymptotic expansion [96] is also shown to validate
the numerical results. In this example, when the frequency is below 1 MHz,
the relative error of the A-EFIE with charge information is around 0.1%. If
the frequency is increased, wave physics begins to play a role and the parallel
plate is no longer a pure capacitor.
4.9 Summary
Two remedies for low frequency breakdown of the EFIE operator involving the LMGF are discussed in this chapter. By decomposing the current
into the divergence-free part and the non-divergence-free part according to
the property of the current in the low frequency regime, we can capture
both capacitance and inductance physics, which are of the same importance
in circuit physics. Mathematically, loop-tree decomposition combined with
frequency normalization makes the matrix stable. The connection matrix
further makes the final system well conditioned, as is done in free space. Due
to the complexity of pole searching, the A-EFIE for LMGF is developed in
this chapter. The frequency scaling is analyzed for both lossless and lossy
media. The rank deficiency of the A-EFIE in layered media depends on if the
60
10
10
10
10
10
AEFIE
EFIE
PEC halfspace
10
10
10
10
10
Frequency (Hz)
10
10
Figure 4.12: The input reactance of the half loop. A-EFIE maintains the
scale invariance very well while the traditional EFIE breaks down quickly
when decreasing the frequency. Since the non-magnetic dielectric is
transparent to the inductor, a PEC half space model is applied to validate
the results.
Capacitance (nF)
0.9
0.8
0.7
AEFIE I
AEFIE Q
Static
Analytic
0.6
0.5
0
10
10
10
10
10
Frequency (Hz)
10
10
61
charge neutrality condition is satisfied. For independent structures, the enforcement is necessary in the low frequency regime, while at mid-frequencies,
such implementation is no longer necessary. For structures connected to the
ground, the A-EFIE matrix is full rank, and no special treatment is needed.
The electrostatic limit is analyzed and compared with the static formulation.
One should note that in the electrostatic limit, though the matrix is well conditioned, the A-EFIE has an inaccuracy problem due to the extremely small
current, which has a high-order frequency dependence. Remedies such as the
perturbation method shall be implement to capture the high-order term of
the current in the future.
62
CHAPTER 5
A NEW LAYERED MEDIUM GREENS
FUNCTION (LMGF) FORMULATION FOR
GENERAL OBJECTS
In this chapter, a new Greens function formulation is developed systematically for modeling general homogeneous (dielectric or magnetic) objects in
a layered medium. The dyadic form of the Greens function is first derived
based on the pilot vector potential approach. The matrix representation in
the moment method implementation is then derived by applying integration
by parts and vector identities. The line integral issue in the matrix representation is investigated, based on the continuity property of the propagation
factor and the consistency of the primary term and the secondary term. The
duality principle of the LMGF is also discussed to make the formulation succinct. After that, the extinction theorem is revisited in the inhomogeneous
background and a surface integral equation for general homogeneous objects
is set up. Different from the popular mixed potential integral equation formulation, this method avoids the artificial definition of scalar potential. The
singularity of the matrix representation of the Greens function can be made
as weak as possible. Several numerical results are demonstrated to validate
the formulation developed at the end of this chapter.
5.1 Introduction
During the last several decades, intensive study has been carried out to model
objects embedded in a layered medium by using integral equation methods
and the LMGF. The two-dimensional (2D) analysis was developed in [97],
where scattering from a conducting cylinder partially buried in a half space
is analyzed. For more practical applications, the three-dimensional (3D)
analysis was carried out and applied to many problems on radiation and
scattering from perfect electric conducting (PEC) objects in the background
of layered medium [98], [16], [99], [52], [100], [101]. Among them, the LMGF,
63
(5.1)
(5.2)
where J denotes the electric current and M is the magnetic current, r is the
observation point and r is the source point. These expressions are valid for
any inhomogeneous medium; however, we will focus on the planarly layered
medium here. The operator LE is defined as [1]
Z
LE (r, r ) = i
e (r, r)(r )
dr G
(5.3)
KH (r, r ) = (r)
e (r, r )(r )
dr G
(5.4)
Due to the duality principle of the Greens function shown in Section 5.4, we
can further define the other two operators as,
LH (r, r ) = i
KE (r, r ) = (r)
m (r, r )(r )
dr G
(5.5)
m (r, r)(r )
dr G
(5.6)
66
e and G
e , since G
m and G
m can be easily obtained by the
on G
duality principle of the Greens function shown in Section 5.4.
e
Expression of G
e of the primary term in (5.3) is
The G
Ge (r, r ) = I + 2
g(r, r)
km
(5.7)
g(r, r) =
eikm R
eikm |rr |
=
4|r r |
4R
(5.8)
where the subscript m is the layer index of the source point (and n for
observation layer, here m = n), and km is the wave number in layer m. In
Cartesian coordinates, the dyadic Greens function can be further written as,
ik R 1
m
e (r, r ) = I R
R
+ I 3R
R
G
g(r, r)
2 R2
km
(5.9)
= R = r r = 1 [(x x )
R
x + (y y )
y + (z z )
z]
R
|r r |
R
(5.10)
e
Expression of G
e of the primary term in (5.4), we have
For G
e (r, r ) = Ig(r, r)
G
67
(5.11)
e (r, r) =
G
z
y
z
0
x
zz
(y y )
R1 ikm
1
R
x g(r, r)
0
(z z )
y y
0
x x
g(r, r)
(5.12)
(x x )
0
(5.13)
2
where knm
= 2 n m , and
(5.14)
TM (r, r ) = ( z) ( z) g TM (r, r )
G
e
(5.15)
where
g (r, r) =
=
i
8 2
i
4
RR +
dks
eiks (rs rs ) F (ks , z, z )
kmz ks2
R +
0
(5.16)
dk
J (k )F (k , z, z )
kmz k 0
1
(2)2
RR +
68
(5.17)
1
Sn {f(k )} =
2
(5.18)
we have
i
g (r, r ) = F 1
2
F (ks , z, z )
kmz ks2
i
= S0
2
F (k , z, z )
kmz k2
(5.19)
(5.20)
The expressions have been derived in [16] (Appendix I) and will not be repeated here. Notice that the second-order Sommerfeld integrals are further
simplified to lower-order counterparts by using the recursive property of the
Bessel function [111], [18]
J2 (k ) =
2
J1 (k ) J0 (k ).
k
(5.21)
All the relations will be utilized in deriving the following Greens function
components (the time convention here is eit and is different from that of
[16]).
e
Expression of G
We first consider the TE wave, since
z = y x x y
we have
y y
TE (r, r) =
G
x y
e
0
where
GTE
e,xx
i
cos 2S1
=
2
F TE
kmz k2
(5.22)
y x 0
x x
0
0 g TE (r, r )
0
i
+ (1 cos 2) S0
4
69
F TE
kmz
(5.23)
(5.24)
GTE
e,xy
GTE
e,yy
i
=
sin 2S1
2
F TE
kmz k2
i
sin 2S0
4
F TE
kmz
TE
GTE
e,yx = Ge,xy
TE
TE
i
F
i
F
= cos 2S1
+ (1 + cos 2) S0
2
2
kmz k
4
kmz
(5.25)
(5.26)
(5.27)
(5.28)
then
x x z z x y z z x z k2
TM
TM (r, r) =
G
y x z z y y z z y z k2 g (r, r )
e
x z k2
y z k2
k4
where
i
GTM
e,xx = 2 cos 2S1
+ 4i
z z F TM
kmz k2
z z F TM
kmz
(1 + cos 2) S0
n
o
z z F TM
i
GTM
=
sin
2S
1
e,xy
2
kmz k2
n
o
F TM
+ 4i sin 2S0 z kzmz
i
z F TM
TM
Ge,xz = cos S1
2
kmz
TM
GTM
e,yx = Ge,xy
n
o
z z F TM
i
GTM
=
cos
2S
1
e,yy
2
kmz k2
n
o
F TM
+ 4i (1 cos 2) S0 z kzmz
i
z F TM
TM
Ge,yz = sin S1
2
kmz
i
z F TM
TM
Ge,zx = cos S1
2
kmz
i
z F TM
TM
Ge,zy = sin S1
2
kmz
70
(5.29)
(5.30)
(5.31)
(5.32)
(5.33)
(5.34)
(5.35)
(5.36)
(5.37)
GTM
zz
i
= S0
2
k2 F TM
kmz
(5.38)
e
Expression of G
e , from (5.13)(5.15), we simply have
For G
e (r, r ) = G
TE (r, r) + 1 G
TM (r, r )
G
e
e
2
knm
(5.39)
(5.40)
(5.41)
where
Here (5.41) is derived due to the fact that for the secondary field, we have
2 + kn2 g (r, r) = 0.
(5.42)
Again we first consider the TE wave, from (5.22), (5.28), and (5.40), we have
x z y x z x 0
TE
TE (r, r) =
G
y z y y z x 0 g (r, r )
e
k2 y
k2 x 0
(5.43)
where,
n TE o
i
z F
TE
=
G
sin
2S
1
e
2
kmz k2
xx
n TE o
+ 4i sin 2S0 zkFmz
n TE o
z F
i
TE
G
=
cos
2S
1
e
2
kmz k2
xy
n TE o
4i (1 + cos 2) S0 zkFmz
n TE o
i
z F
TE
cos
2S
G
=
1
e
2
kmz k2
yx
n TE o
i
+ 4 (1 cos 2) S0 zkFmz
TE
TE
G
= G
e
yy
71
xx
(5.44)
(5.45)
(5.46)
(5.47)
F TE
kmz
TE
F
i
= cos S1
.
2
kmz
i
TE
G
= sin S1
e
zx
2
TE
G
e
zy
(5.48)
(5.49)
y x z
y y z
y k2
2 TM
TM (r, r ) =
G
x x z x y z x k2 kn g (r, r )
e
0
0
0
where
n
o
z F TM
i
=
sin
2S
kn2
1
2
kmz k2
xx
n
o
F TM
+ 4i sin 2S0 zk mz
kn2
n
o
z F TM
i
TM
n
o
F TM
+ 4i (1 cos 2) S0 zk mz
kn2
TM
i
F
TM
kn2
Ge xz = sin S1
2
kmz
n
o
z F TM
i
TM
n
o
F TM
4i (1 + cos 2) S0 zk mz
kn2
TM
TM
G
= G
e
e
yy
xx
TM
i
F
TM
G
=
cos
S
kn2
1
e
yz
2
kmz
TM
G
e
(5.50)
(5.51)
(5.52)
(5.53)
(5.54)
(5.55)
(5.56)
To validate the dyadic form of the Greens function derived in this section,
an electric dipole radiating in a seven-layer medium is investigated. The
layered medium is shown in Figure 5.2, and the working frequency is f = 300
MHz. The layered medium is both dielectric and magnetic. The source is
at (x = 0, y = 0, z = 1.4) m and the observation line is at (3 x
3, y = 1, z = 0.3) m. The polarization of the dipole is ( = 20o , = 30o ).
e is involved.
The electric field of an electric dipole is first evaluated, where G
The result is compared with that from the transmission line (TL) method
[17] as shown in Figure 5.3. Good agreement is observed. To further test the
magnetic-type Greens function, the electric field of a magnetic dipole is also
72
300
Ex
Ey
250
Electric field (V/m)
Ez
200
150
100
50
0
3
1
0
1
Observation point x (m)
1.4
Ex
1.2
Ey
Ez
1
0.8
0.6
0.4
0.2
0
3
1
0
1
Observation point x (m)
74
(5.57)
m
hf (r),
n j
e (r, r), fi (r )i
G
(5.58)
I, fi (r )i
R
where the inner product is defined as hf(r), X(r)i = s drf(r) X(r) for real
=
m
hf (r), g(r, r)
n j
function f(r). Also fj (r) is the j-th testing function and fi (r ) is the i-th basis
function, g(r, r) is the scalar homogeneous medium Greens function shown
in (5.8).
75
(5.59)
+ im h
z fj (r), ge,zd(r, r ), fi (r )i
+ im h fj (r), ge,dz (r, r ), z fi (r )i
+ im h fj (r), ge,dd(r, r ), fi (r )i
where fs =
z z f is the horizontal projection of the basis function and
ge,ss
i
= S0
2
F TE
kmz
1
i
TE
2
TM
ge,zz = S0 z z F + kmn F
2
kmz k2
i
n
1
TE
TM
ge,zd = S0
z F
z F
2
m
kmz k2
i
m
1
TE
TM
z F z F
ge,dz = S0
2
n
kmz k2
i
z z TM
1
TE
ge,dd = S0
F + 2 F
.
2
knm
kmz k2
(5.60)
(5.61)
(5.62)
(5.63)
(5.64)
One should note that for RWG basis function straddling the interface of two
adjacent layers, the coefficient m should be modified accordingly. Since in
real implementation the matrix element is accounted for by half RWG-half
RWG (triangle-triangle) interactions, the expression in (5.59) is convenient
and shall not cause confusion. The derivation details can be found in [21]
and will not be repeated here.
We pay more attention to the matrix representation of the KH operator
e here. For the TE wave, from (5.40), (5.42), we
with the kernel of G
have
TE (r, r) = ( z + k 2 z)( z)g TE (r, r )
G
(5.65)
e
n
76
It can be divided into two parts and the matrix representation can be obtained by applying integration by parts,
TE (r, r), fi (r )i
hfj (r), G
e
= h fj (r), z zg TE (r, r), fi (r )i
+
(5.66)
dlfj (r) n
hz zg TE (r, r ), fi (r )i
c
+ h
z fj (r), kn2 zg TE (r, r ), fi (r )i
Similarly from (5.41), we have
TM (r, r) = k 2 ( z) z + k 2 z g TM (r, r )
G
e
n
m
(5.67)
TM (r, r ), fi (r )i
hfj (r), G
e
= hfj (r), kn2 zz g TM (r, r ), fi (r )i
+ hfj (r),
dl kn2
c
zz g
TM
(5.68)
(r, r )fi (r ) n
i
2
+ hfj (r), kn2 km
zg TM (r, r ), z fi (r )i
77
where
m
h
n
m
n
m
h
z fj (r), gce,zs(r, r), fis (r )i
n
m
hf (r), gce,sd(r, r ),
n js
m
hf (r), c
n js
m
hf (r), gce,sz (r, r), z fi (r )i
n js
dlfj (r) n
hgce,ds (r, r), fis (r )i
fi (r )i
dl gce,sd(r, r )fi (r ) n
i
#
sin i
z F TE
S1
gce,ds (r, r ) =
2
kmz k2
cos
"
#
TE
sin
i
F
2
S1
gce,zs (r, r ) = kn
kmz k2
cos 2
"
#
sin
i
z F TM
n
S1
gce,sd(r, r ) =
m cos 2
kmz k2
"
#
TM
sin
i
F
2
gce,sz (r, r ) = kmn
S1
2
kmz k2
cos
(5.69)
"
(5.70)
(5.71)
(5.72)
(5.73)
Z+
Z+
Matrix (real)
9.662
1.080
Matrix (imag)
0.623
5.147
Dyadic (real)
Dyadic (imag)
0.189
4.348
9.133 3.538
0.967
2.568
0.816 3.538
0.315 2.568
Z+
Z+
Matrix (real)
2.870
2.103
3.868
1.436
2.015
Matrix (imag)
1.521
Dyadic (real)
0.911
0.437
0.547
0.119
2.015
Dyadic (imag)
4.274 0.712
0.185 1.303
0.385 1.303
(5.74)
p1
1 (z+ )z F1 (z+ , z ) = p2 (z )z F2 (z , z )
(5.75)
kz (z )
kz (z)
(5.76)
p1 (z+
)
p2 (z
)
F
(z,
z
)
=
F (z, z
)
1
+
k1z (z+ )
k2z (z ) 2
(5.77)
k1z
(z+ )z F1 (z, z+
) = k2z
(z )z F2 (z, z
)
(5.78)
The continuity of the kernel of the LE in (5.59) is shown in [109]. The one of
KH in (5.69) can be verified similarly since the horizontal partial derivatives
(( ) z) in (5.66) and (5.68) do not affect the vertical continuity due to the
phase matching condition.
79
Figure 5.5: Cases where testing line integral exists. The testing function is
straddling the interface, all radiation from the RWG or half-RWG basis
functions (triangles) in color needs invoking testing line integral, while
radiation from others does not need this line integral.
Figure 5.6: Line integral test. The testing function is at the top interface.
Radiation from: basis function 1: no line integral activated; basis function
2: testing line integral activated; basis function 3: both testing and basis
line integrals activated. (unit: m).
80
Table 5.3: Matrix element of KE : testing & basis line integrals (104 )
Matrix (real)
Matrix (imag)
Dyadic (real)
Dyadic (imag)
Z++
Z+
1.406
0.063
0.680
1.406
0.680
2.765
0.063
2.765
Z+
0.131
1.426
0.131
1.426
in (5.59). However, for the KH , we find that the secondary term is not
consistent with the primary term. In the primary term, the operator
acts directly on the Greens function in (5.58), while in the secondary term,
it is transferred to either testing or basis function in (5.69). This operator
transfer reduces the singularity of the Greens function; however, it also
destroys the continuity of the propagation factor and leads to non-canceled
line integral for interactions where the primary term is involved. For the
testing line integral (the first line integral in (5.69)), it needs to be calculated
only when the testing RWG function straddles the interface and the basis
RWG functions or half-RWGs (either positive or negative triangles) are in
the same two layers shown in Figure 5.5. The necessity of basis line integral
(the secondary line integral in (5.69)) can be analyzed similarly. Since we are
filling the matrix elements by triangle-triangle pairs other than RWG-RWG
pairs, the line integral can be easily judged and added if necessary; and also
the number of Sommerfeld integrals can be reduced to minimum.
81
be obtained from the duality principle. One form of the duality principle
reads [5],
E H, H E, ,
(5.79)
M J, J M, m , m
(5.80)
(5.81)
82
Hence
Gm (r, r ) = Ge (r, r ) = I + 2
g(r, r)
km
(5.82)
d 1 d
1 2
+
k (z) F (z, z ) = 0
dz p(z) dz p(z) z
(5.83)
It is obvious that
F TE (z, z ) F TM (z, z ), F TM (z, z ) F TE (z, z )
(5.84)
(5.85)
hence,
Finally, we have
TE (r, r )
m (r, r ) = G
TM (r, r) + 1 G
G
m
2
kmn m
(5.86)
(5.87)
TE (r, r ) = ( z) ( z) g TE (r, r)
G
m
(5.88)
e (r, r) G
m (r, r), G
m (r, r) G
e (r, r )
G
(5.89)
where
Hence,
e and G
m.
and similar duality holds between G
With these properties, we can now easily verify the correctness of definition
of the LH and KE in (5.5) and (5.6).
83
have
o
Eoinc (r) = LoE (r, r ) J(r) + KE
(r, r ) M(r ).
(5.90)
Here we use superscript o to represent that the Greens function is evaluated in the outside region (i will be used for inside region). Similarly, if
r S + , r S ,
i
0 = LiE (r, r) [J(r )] + KE
(r, r ) [M(r )]
(5.91)
where the negative signs of the currents are due to the unique definition of
the unit normal vector in one problem (J = n
H, M = E n
). One should
note that for Li and Ki , the choice of the Greens function has some freedom.
As long as the material and the boundary condition is maintained, the field
can be uniquely determined. The Greens function may even not satisfy the
radiation boundary condition (non-physical). However, for simplicity, we
usually choose the homogeneous medium Greens function.
In practice, we may have internal resonance problems if we only use the
above two E-type equations. To avoid it, the H-type equations are usually
applied too. The H-field equations read
o
Hoinc (r) = LoH (r, r) M(r ) + KH
(r, r) J(r )
(5.92)
i
0 = LiH (r, r) [M(r )] + KH
(r, r) [J(r )]
(5.93)
=
o
i
Hoinc tan
(KH
+ KH
) (LoH + LiH )
M tan
84
(5.94)
polarization fields are evaluated at z = 0.5. The results are shown in Figure
5.12 and good agreement can be observed. Finally, the cuboid is filled with
dielectric material ( = 5 and = 1). The scattered field is again calculated
and shown in Figure 5.13, which is weaker than that of the PEC case.
85
500
Ex
Ey
400
Ez
300
200
100
0.3
0.2
0.1
0
0.1
Observation point x (m)
0.2
0.3
Figure 5.8: The scattered field inside the object with = 2 and = 2.
Since there is no contrast, the scattered field recovers the incident field
(solid line).
86
40
Ex
35
Ey
30
25
20
15
10
5
0
3
1
0
1
Observation point x (m)
Figure 5.9: The scattered field outside the object with = 4 and = 1.
The tangential components of the E field are continuous at the interface.
Symbol: field at d+
5 , solid line: field at d5 .
50
Ez at d+5
Ez at d5
40
30
20
10
0
3
1
0
1
Observation point x (m)
Figure 5.10: The scattered field outside the object with = 4 and = 1.
The normal component of the E field differs by a factor of 3 (4 /5 ) at the
interface.
87
0.25
EFIE
MFIE
0.2
0.15
0.1
0.05
0
3
1
0
1
Observation point x (m)
Figure 5.12: The scattered field of the PEC object, calculated from EFIE
and MFIE.
88
0.1
Dielectric object
0.08
0.06
0.04
0.02
0
3
1
0
1
Observation point x (m)
5.7 Summary
A new Greens function formulation for modeling homogeneous objects in
layered medium is systematically developed in this chapter. The dyadic
form of the Greens function is developed by applying the pilot vector potential approach, and all components are derived in terms of order 0 and
order 1 Sommerfeld integrals. The matrix representation is further derived
by applying integration by parts and vector identities. The line integral issue for interactions between straddling basis functions is discussed in details
for different situations, based on the continuity property of the propagation
factor and the consistency analysis of the primary term and the secondary
term. The duality principle of the LMGF is further discussed to make the
formulation succinct. Finally, the extinction theorem is revisited in inhomogeneous environment and the relevant integral equations are set up. Several
numerical results are presented to validate this formulation.
89
CHAPTER 6
DCIM ACCELERATED SURFACE
INTEGRAL EQUATION (SIE) METHOD
FOR NANO-OPTICAL APPLICATIONS
The SIE involving LMGF developed in Chapter 5 is accelerated by the DCIM
introduced in Chapter 2 and is applied to nano-optical applications. We
only focus on the situation where the nano scatterers are embedded in a
single layer, which is common in various optical applications. Some issues in
incorporating DCIM to SIE in this case will be discussed. The basic theory
of spontaneous emission (SE) of a quantum emitter and the surface plasmon
resonance will also be briefly reviewed. SE is then studied numerically for an
emitter embedded in a layered medium and in the presence of nano scatterers,
where the emitter is coupled to the surface plasmon polaritons (SPPs) and
the localized surface plasmons (LSP).
6.1 Introduction
When atoms are prepared in the excited states, they spontaneously emit light
even in the vacuum. This interesting phenomenon attracted much attention
after the development of quantum mechanics in 1920s [116]. Purcell in 1946
first demonstrated that the SE of a quantum emitter is not an intrinsic
property of the emitter but can be modified when it is located in a cavity,
now known as the Purcell effect [117]. SE can be explained by the atom-field
interaction, after the quantization of the electromagnetic (EM) field. The
vacuum fluctuation of the EM field perturbs the atom system and causes the
spontaneous emission of photons. If the local electromagnetic environment
is changed, which leads to the change of local density of states (LDOS), the
emission can be modified. Purcells work stimulated relevant studies, led to
the development of laser (firstly in microwave spectrum, known as maser),
and finally started the subject of cavity quantum electrodynamics (Cavity
QED) [118]. In the so-called weak coupling regime, the SE enhancement and
90
repeated here. We will also focus our attention on the secondary field since
the primary field has closed-form solution.
(6.1)
mR
m,m1 eikmz z1
F1 (z, z ) = M
(6.2)
mR
m,m+1 eikmz z2
F2 (z, z ) = M
(6.3)
mR
m,m+1 R
m,m1 eikmz z3
F3 (z, z ) = M
(6.4)
mR
m,m1 R
m,m+1 eikmz z4
F4 (z, z ) = M
(6.5)
z1 = 2dm + z + z
(6.6)
z2 = 2dm+1 z z
(6.7)
z3 = 2dm+1 2dm + z z
(6.8)
z4 = 2dm+1 2dm z + z
(6.9)
where
and
The position dependent factor is already in the form of exponentials and can
be extracted during the function fitting in DCIM process, if we separate the
propagation factor into four parts, as are shown in (6.2)(6.5). Once the
DCIM coefficients ai and bi are obtained, the position-dependent factor can
be easily added back to the Greens function; hence, (2.23) becomes
g() =
Mj
4 X
X
j=1 i=1
aji
eikm rji
, rji =
4rji
92
2 + (bi + zj )2
(6.10)
dk
k
J0 (k )eikmz |z|
kmz
(6.11)
eikm r
(1 ikm r) = i
r3
k2
dk
J1 (k )eikmz |z|
kmz
(6.12)
Alternatively, we can also use the order 0 identity in this operator and calculate the horizontal partial derivatives in the spatial domain. For example,
x
eikm r
eikm r
= 3 (ikm r 1)x
r
r
(6.13)
2 F TM
R +
(z z F TE +kmn
)
k
i
= 4 0 dk kmz J0 (k )
k2
ge,zz =
i
S
2 0
(6.14)
(6.15)
"
sin
cos
"
sin
cos
i
S
2 1
i
4
z F TE
kmz k2
R +
0
(6.16)
k2
dk kmz
J1 (k)
F TE
k2
z F TE
k2
(6.17)
z F TE
kmz k2
S1
z F TM
kmz k2
(6.18)
Fortunately, the two always appear in pairs, hence the poles can be canceled in a similar way as in the matrix representation of the LE operator. In
a word, when using DCIM in KH , we choose the dyadic LMGF other than
the matrix representation.
94
2 X
I |ii|2(i f )
|hf |H
2
h
f
(6.19)
H
pE
(6.20)
(6.21)
p = hg|
p|ei = he|
p|gi
(6.22)
and
has the form of
The electric field operator E
=
E
i
Xh
+
Ek a
k (t) + Ek a
k (t)
(6.23)
where
a
k (t) = a
k (0)eik t
(6.24)
a
k (t) = ak (0)eik t
(6.25)
95
0 X
[p (uk uk ) p] (k 0 )
h
0 k
(6.26)
h
k
uk ,
20
E
k
h
k
u
20 k
(6.27)
0 2
|p| p (r0 , 0 )
3h0
(6.28)
with
p (r0 , 0 ) = 3
X
k
where p = pnp .
[np (uk uk ) np ] (k 0 )
(6.29)
k2
uk (r, k ) = 0
c2
(6.30)
The electric-type Greens function defined in (5.1), (5.3), satisfies the same
equation with excitation of delta function.
2
r , ) G(r,
r , ) = I(r r )
G(r,
c2
(6.31)
ck (r , )uk (r, k )
(6.32)
The expansion coefficients can be easily determined by the orthonormal property of the eigen modes and hence we have
r, ) =
G(r,
c2
uk (r , k )uk (r, k )
k2 2
(6.33)
(6.34)
Finally, the partial LDOS in (6.29) can be expressed by the the imaginary
part of the Greens function as
p (r0 , 0) =
60
0 , r0 , ) np }
{n
Im
G(r
p
c2
(6.35)
X
k
|uk |2 (k 0 )
(6.36)
If the emitter has no fixed dipole axis, the emission rate is obtained by
averaging over all possible directions. In this case, the partial LDOS becomes
the same as the LDOS.
(r0 , 0) =
20
0 , r0 , 0 ) }
Im{Tr G(r
2
c
97
(6.37)
where Tr means trace. Alternatively, the SER can also be derived from
the fluctuation-dissipation theorem [124], [125], which will not be discussed
here.
2
(r, )E
(r , )i = h
hE
Im{G (r, r , )}
(6.38)
c2 0
Specifically, in vacuum (free space), the imaginary part of the trace of the
dyadic Greens function shown in (5.7) is
1
0 (r0 , r0 , 0 ) } = k0 = 0 .
Im{Tr G
3
6
6c
(6.39)
0 =
03 |p|2
30h
c3
(6.40)
which is identical with the one obtained by Einstein from Boltzmann statistics and Plancks formula in the black body radiation [116].
To quantify the SE enhancement in an inhomogeneous environment, it is
convenient to define the normalized SER or Purcell factor as
0 , r0 , 0 ) }
Im{Tr G(r
(r0 , 0 )
=
=
(6.41)
0 (r0 , r0 , 0 ) }
0
0 (r0 , 0)
Im{Tr G
In summary, the local environment changes the eigen modes of the EM
field, and thus changes the partial LDOS or LDOS. By calculating the Greens
function, and making the source point and the observation point identical
(r = r = r0 ), one can obtained the partial LDOS and finally the SER, as
are shown in (6.28) and (6.35).
1 2
1 + 2
(6.42)
This leads to the dispersion curve of the SPPs lying to the right of the light
line. When the frequency tends to the surface plasmon frequency,
sp =
p
1 + 2
(6.43)
99
(6.44)
1 () 2
1 () + 22
(6.45)
100
(6.46)
DCIM
Matrix filling
1.5 days
9.2 minutes
Excitation
2.1 minutes
1 second
Scattered field
2.1 minutes
1 second
sphere are both 10 nm. We scan the visible spectrum from 400 nm to 800 nm,
with the step of 10 nm. The dielectric constant of the gold is obtained from
a more accurate Brendel-Bormann model [128], which satisfies the KramersKronig relation. We consider the following three cases: (a) SE of an emitter
in the presence of the slab; (b) SE of an emitter in the presence of the
nano sphere; (c) SE of an emitter in the presence of both structures. The
normalized SER are calculated and shown in Figure 6.5. For case a, the SER
of the emitter is enhanced by the SPPs of the slab, while for case b, the
SER is enhanced by the LSP of the nano sphere. In vacuum, the resonant
frequency of SPPs and LSP are similar and around 510 nm. For case c, the
SER is strongly enhanced by the coupling of SPPs and LSP at the same
resonant frequency.
For the nano sphere here, a reasonably fine surface mesh can be obtained
by using 1620 triangular elements. The number of unknowns (electric and
magnetic current) is 4860, hence it only requires around 180 MB to store the
matrix if single precision is used. The CPU times for matrix filling, RHS
vector (excitation) filling and scattered field calculation at a single frequency
(wavelength) point are also listed in Table 6.1 for further comparison. This
simulation is run on a laptop computer with Intel 2.00 GHz processor. It is
observed that the computation can be much accelerated by using DCIM.
Next, we consider a nano bowl structure embedded in a layered medium
with two gold layers shown in Figure 6.6. The mesh of the nano bowl is
shown in Figure 6.7. The normalized SER is calculated and shown in Figure
6.8. Two sub-figures are also demonstrated to show the respective effects
from the layered medium and the nano bowl.
102
Figure 6.4: A gold nano sphere with radius 20 nm is located above a gold
slab with thickness 30 nm. A z-polarized emitter is located at the middle of
the sphere and slab.
350
Normalized SER
300
Slab
Sphere
Both
250
200
150
100
50
0
400
450
500
550
600
650
Wavelength (nm)
700
750
800
Figure 6.5: Normalized SER for the three cases: (a) only slab (SPPs
enhanced SE); (b) only nano sphere (LSP enhanced SE); (c) both (both
effects).
103
Figure 6.6: A gold nano bowl is located in a layered medium with two gold
layers. A z-polarized emitter is located at the center of the aperture. The
dimensions are shown in the figure.
80
z (nm)
60
40
20
0
100
50
x (nm)0
50
50
50
100
0 y (nm)
104
Figure 6.8: Normalized SER. The first peak corresponds to the peak from
SPPs of the layered medium at 520 nm, the second peak is from the nano
bowl effect around 590 nm.
6.6 Summary
DCIM is incorporated into the SIE formulation and the developed 3D full
wave EM solver is applied to study SE enhancement of a quantum emitter
in plasmonic structures. The special features of combination of DCIM and
SIE are discussed in details for the case when the scatterers are confined in a
single layer. The fundamental principle and computational technique of SE is
reviewed, which is based on the Fermis golden rule and the Greens function
approach. SPPs of a air-metal interface and LSP of a metallic nano particle
are also discussed for completeness. Finally, several numerical results are
demonstrated to show the SE enhancement of a quantum emitter coupled to
SPPs and LSP. It is observed that the SE can further be enhanced from the
coupling of the two plasmonic resonances.
105
CHAPTER 7
CONCLUSION
Surface integral equation (SIE) method involving the layered medium Greens
function (LMGF) is important in analyzing electromagnetic radiation and
scattering in planarly layered structures, such as the microstrip antenna,
integrated circuit, organic light-emitting diode (OLED), and optical forces
in microelectromechanical systems (MEMS) etc.
In this thesis, the SIE with LMGF is investigated systematically. The acceleration technique for evaluation of Greens function is first studied. The
popular discrete complex image method (DCIM) is reviewed and a novel implementation scheme based on Sommerfeld branch cut is proposed. Next,
a broadband fast integral equation solver, the mixed-form thin-stratified
medium fast-multipole algorithm (MF-TSM-FMA), is developed to expedite the matrix-vector product in the iterative solvers. After that, the lowfrequency breakdown of the integral operator is discussed and the loop-tree
decomposition and the augmented electric field integral equation (A-EFIE)
are studied. In order to model general homogeneous objects in layered
medium, a new formulation of the LMGF based on the pilot vector potential
approach is developed and the corresponding SIE is then obtained. Finally,
attempt is made to apply the developed full wave three-dimensional (3D)
electromagnetic solver to nano-optics, and the spontaneous emission (SE) of
a quantum emitter located in a plasmonic system is studied.
Future work can be done to incorporate the generalized impedance boundary condition (GIBC) to model the finite conductance in circuit simulation.
The finite element method (FEM) can also be combined with the developed
SIE to model inhomogeneous objects in under-ground detection. For objects
with arbitrary penetration such as in optical lithography, DCIM can be further combined with the interpolation technique to expedite matrix filling and
fast integral equation solvers such as the kernel-independent algorithm can
be further developed for large-scale problems.
106
REFERENCES
[1] W. C. Chew, M. S. Tong, and B. Hu, Integral Equations for Electromagnetic and Elastic Waves, Morgan & Claypool Publishers, 2009.
[2] A. Taflove and S. C. Hagness, Computational Electrodynamics : the
Finite-Difference Time-Domain Method, 2nd ed. Boston, MA: Artech
House, 2000.
[3] J. Jin, The Finite Element Method in Electromagnetics, 2nd ed. New
York: Wiley, 2002.
[4] R. F. Harrington, Field Computation by Moment Methods, New York:
IEEE Press, 1993.
[5] W. C. Chew, Waves and Fields in Inhomogeneous Media, Van Nostrand
Reinhold, 1990, Reprinted by IEEE Press, 1995.
107
108
[25] G. J. Burke and E. K. Miller, Modeling antennas near to and penetrating a lossy interface, IEEE Trans. Antennas Propagat., vol. 32,
pp. 1040-1049, 1984.
[26] J. Y. Chen, A. A. Kishk and A. W. Glisson, A 3D Interpolation model
for the calculation of the Sommerfeld integrals to analyze Dielectric
resonators in a multilayered medium, Electromagnetics, vol. 20, pp.
1-15, 2002.
[27] H. Derudder, F. Olyslager, and D. D. Zutter, An efficient series expansion for the 2D Greens function of a microstrip substrate using
perfectly matched layers, IEEE Microw. Guided Wave Lett., vol. 9,
no. 12, pp. 505-507, Dec. 1999.
[28] F. Olyslager and H. Derudder, Series representation of Green dyadics
for layered media using PMLs, IEEE Trans. Antennas Propag., vol.
51, no. 9, pp. 2319-2326, Sept. 2003.
[29] E. Simsek, Q. H. Liu, and B. Wei, Singularity subtraction for evaluation of Greens function for multilayer media, IEEE Trans. Microw.
Theory Tech., vol. 54, no. 1, pp. 216-225, Jan. 2006.
[30] L. Tsang, and B. Wu, Electromagnetic fields of Hertzian dipoles in
layered media of moderate thickness including the effects of all modes,
IEEE Antennas Wireless Propagat. Lett., vol. 6, pp. 316-319, 2007.
[31] B. Wu, and L. Tsang, Fast computation of layered medium Greens
functions of multilayers and lossy media using fast all-modes method
and numerical modified steepest descent path method, IEEE Trans.
Microwave Theory Tech., vol. 56, no. 6, pp. 1446-1454, Jun. 2008.
[32] Y. L. Chow, J. J. Yang, D. G. Fang and G. E. Howard,A closedform spatial Greens function for the thick microstripsubstrate, IEEE
Trans. Microwave Theory Tech., vol. 39, no. 3, pp. 588-592, 1991.
[33] V. I. Okhmatovski and A. C. Cangellaris, Evaluation of layered media
Greens functions via rational function fitting, IEEE Microw. Wireless
Components Lett., vol. 14, no. 1, pp. 22-24, Jan. 2004.
[34] A. G. Polimeridis, T. V. Yioultsis, and T. D. Tsiboukis, A robust
method for the computation of Greens functions in stratified media,
IEEE Trans. Antennas Propagat., vol. 55, no. 7, pp. 1963-1969, Jul.
2007.
[35] M. I. Aksun, A robust approach for the derivation of closed-forrn
Greens functions, IEEE Trans. Microw. Theory Tech., vol. 44, no. 5,
pp. 651-658, May 1996.
109
[36] F. Ling, and J.-M. Jin, Discrete complex image method for Greens
functions of general multilayer media, IEEE Microw. Guided Wave
Lett., vol. 10, no. 10, pp. 400-402, Oct. 2000.
[37] M. Yuan, T. K. Sarkar, and M. Salazar-Palma, A direct discrete complex image method from the closed-form Greens functions in multilayered media, IEEE Trans. Microw. Theory Tech., vol. 54, no. 3, pp.
1025-1032, Mar. 2006.
[38] T. K. Sarkar and O. Pereira, Using the matrix pencil method to estimate the parameters of a sum of complex exponentials, lEEE Antennas Propagat. Magazine, vol. 37, no. 1, pp. 48-55, Feb. 1995.
[39] E. P. Karabulut, A. T. Erdogan, and M. I. Aksun, Discrete complex
image method with spatial error criterion, IEEE Trans. Microw. Theory Tech., vol. 59, no. 4, pp. 793-802, Apr. 2011.
[40] A. Alparslan, M. I. Aksun, and K. A. Michalski, Closed-form Greens
functions in planar layered media for all ranges and materials, IEEE
Trans. Microw. Theory Tech., vol. 58, no. 3, pp. 602-613, Mar. 2010.
[41] F. Mesa, R. R. Boix, and F. Medina, Closed-form expressions of multilayered planar Greens functions that account for the continuous spectrum in the far field, IEEE Trans. Microw. Theory Tech., vol. 56, no.
7, pp. 1601-1614, Jul. 2008.
[42] Y. P. Chen, W. C. Chew, and L. Jiang, A novel implementation of
discrete complex image method for layered medium Greens function,
IEEE Antennas Wireless Propagat. Lett., vol. 10, pp. 419-422, 2011.
[43] B. Hu, and W. C. Chew, Fast inhomogeneous plane wave algorithm for electromagnetic solutions in layered medium structures: Twodimensional case, Radio Sci., vol. 35, no. 1, pp. 31-43, Jan.-Feb. 2000.
[44] V. Rokhlin, Rapid solution of integral equations of classic potential
theory, J. Comput. Phys., vol. 60, no. 2, pp. 187-207, Sept. 1985.
[45] R. Coifman, V. Rokhlin, and S. Wandzura, The fast multipole method
for the wave equation: A pedestrian prescription, IEEE Antennas
Propagat. Mag., vol. 35, pp. 7-12, June 1993.
[46] J. M. Song, C. C. Lu, and W. C. Chew, Multilevel fast multipole
algorithm for electromagnetic scattering by large complex objects,
IEEE Trans. Antennas Propagat., vol. 45, no. 10. pp. 1488-1493, Oct.
1997.
110
111
115
117
118
LIST OF PUBLICATIONS
Journal Papers
Y. P. Chen, J. L. Xiong, W. C. Chew, and Z. P. Nie, Numerical
analysis of electrically small structures embedded in a layered medium,
Microw. Opt. Tech. Lett., vol. 51, no. 5, pp. 1304-1308, May 2009.
Y. P. Chen, L. Jiang, Z.-G. Qian, and W. C. Chew, An augmented
Conference Papers
Y. P. Chen, J. L. Xiong, W. C. Chew, and Z. P. Nie, Simulation
of structures situated in a layered medium at low frequencies, Asia
Pacific Microwave Conference, Hong Kong SAR, China, Dec. 16-20,
2008.
Y. P. Chen, W. C. Chew, and J. L. Xiong, A mixed-form thinstratified medium fast-multipole algorithm for broadband simulation,
Workshop on Computational Electromagnetics and Its Applications,
Chengdu, China, Aug. 2009.
Y. P. Chen, J. L. Xiong, and W. C. Chew, Fast and broadband
simulation of large-scale microstrip structures, Progress in Electromagnetics Research Symposium, Xian, China, Mar. 22-26, 2010.
120
Y. H. Lo, L. Jiang, Y. P. Chen and W. C. Chew, Finite-width excitation and impedance models, Progress in Electromagnetics Research
Symposium, Suzhou, China, pp. 273-273, Sept. 12-16, 2011.
121
AUTHORS BIOGRAPHY
Yongpin CHEN was born in Zhejiang, China, in 1981. He received the B.S.
and the M.S. degrees in microwave engineering from the University of Electronic Science and Technology of China (UESTC), Chengdu, in 2003 and
2006, respectively. He was a Ph.D. student at UESTC from 2006 to 2007,
and then a Research Assistant at the University of Hong Kong (HKU) in
2008. He transferred to HKU as a Ph.D. student from Jan. 1, 2009 and is
currently pursuing his Ph.D. degree in electrical and electronic engineering
at HKU.
122