Aul Bur 2000
Aul Bur 2000
54
G. AULBUR.LARSJONSSON.
WILFRIED AND JOHN W . WILKINS
Department of Physics
Ohio State University
Columbus. Ohio
1
ISBN O-LZ-M)7754-I Copyright C)Zoo0 by Academic Press
ISSN 0081-1947/00$30.00 All rights of reproduction in any form reserved
2 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
29. Overestimation of Optical Constants within DFT . ... . . ... . ... .. 175
30. The “Scissors Operator” and its Limitations . . . . ... . . ... . ... .. 179
31. Local-Field Effects in Optical Response . . . . . . . ... . . . . . . . . . .. 184
32. Density-Polarization Functional Theory . . . . . . ... . . ... . ... .. 191
VII. Excited States within Density Functional Theory . . . . . ... . . ... . . . . .. 195
33. Functionals Based on Ground-State Densities . . . ... . . ... . ... .. 196
34. Functionals Based on Excited-State Densities . . . .... . . ... . ... .. 201
35. Time-Dependent Density Functional Theory . . . .... . . ... . ... .. 203
36. Monte-Carlo Calculations . . . . . . . . . . . . . . .... . . ... . ... .. 205
Appendix: Density Functional Theory . . . . . . . . . . . . . . .. . . . . . . . . ... .. 207
1. Universal Density Functionals . . . . . . . . . . . . .... .. ... . ... .. 208
2. The Kohn-Sham System . . . . . . . . . . . . . . . . .... . . . . . . . . . .. 209
3. The Band-Gap Discontinuity . . . . . ; . . . . . . . .... ..... .... .. 21 1
4. The Exchange-Correlation Hole . . . . . . . . . . . .... ..... . . . . .. 213
5. Coupling-Constant Averages . . . . . . . . . . . . . . . . . ..... .... .. 215
6. Local Approximations . . . . . . . . . . . . . . . . . . . . . . .... .... .. 217
1. INTRODUCTIONAND OVERVIEW
During the last decade, quasiparticle calculations have been used success-
fully to describe the electronic excited-state properties of solids such as
single-particle band structures and absorption spectra. Under the assump-
tion that electronic and ionic degrees of freedom can be decoupled, the
problem is that of N electrons in a solid described by the following
Hamiltonian: ’
We use atomic units throughout this article unless otherwise noted. In these units, energy is
measured in Hartree and h = e = me = 4m0 = 1; E~ is the permittivity of vacuum.
QUASIPARTICLE CALCULATIONS IN SOLIDS 3
which explains the name of the approximation. The GWA for the compu-
tation of quasiparticle energies was proposed by in 1965. However,
not until the mid-eighties was the approach applied to large-scale, numerical
electronic structure calculation^.^^^ The resulting ab-initio band structures
compare favorably with experiment.
Several reviews of quasiparticle calculations in the GWA have been
published. An early review of bulk and surface calculations in the GWA was
done by Hybertsen and Louie.’ Bechstedt’ discussed the physics of the
GWA in relation to model approaches for the calculation of the dielectric
response as well as the self-energy. Godby’ reviewed quasiparticle calcula-
tions for jellium, simple metals, and semiconductors. Mahan“ examined
different GW approximations resulting from the inclusion of self-energy and
* The self-energy I:is related to G via Dyson’s equation; see Section 11.14, Eq. (2.5).
L. Hedin, Phys. Rev. 139, A796 (1965).
L. Hedin and S. Lundqvist, in Solid State Physics, vol. 23, eds. F. Seitz, D. Turnbull, and H.
Ehrenreich, Academic, New York (1969), 1 .
’ M. S. Hybertsen and S. G. Louie, Phys. Rev. Lett. 55, 1418 (1985).
R. W. Godby, M. Schliiter, and L. J. Sham, Phys. Rev. Lett. 56, 2415 (1986).
’ M. S. Hybertsen and S. G. Louie, Comm. Cond. Mat. Phys. 13, 223 (1987).
* F. Bechstedt, in ~esrkiipeiprooble~eelAdvances in Solid State Physics, vol. 32, ed. U. Rossler
Vieweg, Braunschweig/Wiesbaden (1992), 161.
R. W. Godby, “Unoccupied Electronic States,” Topics in Applied Physics, vol. 69, eds. J. E.
Inglesfield and J. Fuggle, Springer, New York (1992).
l o G. D. Mahan, Comm. Cond. Matt. Phys. 16,333 (1994).
4 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
2. QUASIPARTICLES
[ - 2 V2 + V, + V,,,
I S
Yi(r) + C(r, r’; Ei)Yi(r’)dr’= EiYi(r) (1.3)
Here V, is the electrostatic or Hartree potential of the electrons, that is, with
n as the electron density,
l6 The self-energy equals the energy of the bare particle interacting with itself via the
polarization cloud that the particle generates in the many-body system.
6 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
V,,, is the external potential from the ions, and Y iand E i are the quasi-
particle wave function and energy, respectively. We will come back to the
quasiparticle equation and define Yiand Eiin Section 11.4.
The Green Function. Quasiparticle properties such as energies, lifetimes,
and expectation values of single-particle operators, such as the density and
the total energy of a many-body system, are determined by the single-
particle Green function (see, for instance, standard textbooks such as Refs.
17, 18, 19, 20, 21, and 22). The Green function G is also called the
single-particle propagator. With IN, 0) as the ground state of the N-electron
Hamiltonian in Eq. (l.l), Y(rt) = exp(ifit)Y(r) exp( - ifit) as the fermion
annihilation operator23 in the Heisenberg representation, Y t(rt) as the
corresponding creation operator, and T as the time-ordering operator, the
single-particle Green function is defined as
For t > t' (t' > t), G describes the propagation of a particle (hole) added to
the many-body system described by fi, that is, G describes the dynamics of
the N + N f 1 excitations in an N-electron system. G is a function of only
six spatial degrees of freedom and hence much more manageable than the
N-electron wave function, which depends on 3N spatial degrees of freedom.
Many of the complexities of the ground-state wave function are eliminated
by taking the expectation values.
The imaginary part of the Green function determines the spectral func-
tion A,
" G. D. Mahan, Many-Particle Physics, 2nd. ed., Plenum Press, New York (1993).
'' R. D. Mattuck, A Guide to Feynman Diagrams in the Many-Body Problem, Dover, New
York (1992).
2 3 %(rt) destroys an electron at point r and time t.
QUASIPARTICLECALCULATIONS IN SOLIDS 7
r
A(E) 2z n - l z (1.8)
( E - Elz + r2'
EE'CL E
FIG. 1. Schematic representation of the spectral function A ( E ) (Eq. (1.8)) for a noninteract-
ing and an interacting many-body system. The differences between the two cases are (1) the
real part E of the quasiparticle energy E = E + iT is shifted with respect to the bare energy E;
(2) the quasiparticle acquires a finite lifetime ljr due to interaction compared to the infinite
lifetime of the noninteracting particle; and (3) the spectral weight Z (shaded area) of the
quasiparticle peak is less than unity due to redistribution of spectral weight into the incoherent
background (Im+(E)), whereas the spectral weight of the bare particle is unity. The chemical
potential is p.
8 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
-P I
%
photon -> electron: photoelectrunspectroscopy
electron o photon: Inverse photocmlrJion
hv
c
\ VB
photoekctron inverse
spectroscopy photoemlulon
N->N-1 N->N+1
PARTICLES
3. KOHN-SHAM
[ 2 1
N
- Vz + V, + V,,, + V,, mi = E ~ @ ~ , no =
i=l
IQil2. (1.9)
(1.10)
a wave vector k.
QUASIPARTICLE CALCULATIONS IN SOLIDS 11
and all potentials are evaluated at the ground-state density no@). The
exchange-correlation energy, which contains all Coulomb correlation effects
beyond the Hartree approximation and a part of the kinetic energy of the
interacting electrons, is not known explicitly for real systems.36 Practical use
of the Kohn-Sham equations requires good approximations to the ex-
change-correlation energy and, via Eq. (l.lO), to the exchange-correlation
potential.
The local density approximation replaces the inhomogeneous exchange-
correlation energy density per particle at a point r by the exchange-
correlation energy density per particle of a homogeneous electron gas,
U,h:m(r), evaluated at the local density. The total exchange-correlation
energy is then obtained as the integral over all local contributions:
(1.11)
36 For a formal expression of the exchange-correlation potential, see, for instance, L. J. Sham,
Phys. Rev. B 32, 3876 (1985), and references therein.
3 7 R. 0.Jones and 0. Gunnarsson, Rev. Mod. Phys. 61, 689 (1989).
38 Theory of the Inhomogeneous Electron Gas, eds. S. Lundqvist and N. H. March, Plenum
Press, New York and London (1983).
39 L. J. Sham and W. Kohn, Phys. Rev. 145, 561 (1966).
12 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
Levy, and Balduz4' proved that E~ equals the negative of the ionization
energy.41
Perhaps the most prominent discrepancy between LDA band structures
and experiment is the fact that LDA underestimates the band gap of
semiconductors and insulators by about 0.5 to 2.0eV. In the case of Ge,
LDA leads to a semimetal rather than to an indirect-gap semiconductor.
Figure 3 demonstrates the underestimation of experimental band gaps in
LDA for all semiconductors and insulators for which ab-initio GWA
calculations have been reported. Figure 3 also shows that GWA calculations
largely correct the LDA band-gap underestimation and are in good agree-
ment with experiment. In spite of the bind-gap underestimation, LDA wave
functions are often good approximations to quasiparticle wave function^.'^
In the absence of quasiparticle calculations, LDA energies are routinely
used to interpret experimental spectra. LDA energy dispersions are often in
fair agreement with experiment, and in some'cases the LDA band gap can
be empirically adjusted to fit the experimental gap. This approach implies
an interpretation of the LDA exchange-correlation potential as an approxi-
mate self-energy that neglects nonlocal, energy-dependent, and lifetime
effects. Although the LDA band structure cannot claim quantitative accu-
racy for the determination of the electronic structure of solids, LDA
generally provides a qualitative understanding.
This section has four purposes. (1) The basic equations that govern the
dynamically screened interaction approximation are introduced (Sections
11.4 and 11.5) but not derived. We refer the reader to standard textbooks, for
example, Ref. 20, and the review articles by Hedin and Lundqvist4 and
Aryasetiawan and G u n n a r ~ s o n in
' ~ particular for a derivation of the Hedin
equations (Section 11.5). (2) Useful separations of the self-energy as well as
basis-set-independent details about the evaluation of the single-particle
propagator G and the screened interaction W are described in Sections 11.6,
11.7, and 11.8. (3) Section 11.9 gives a historical overview of early GWA
calculations and related approaches. (4) The physics of the self-energy
operator is analyzed in Sections 11.10, 11.11, 11.12, 11.13, and 11.14, with
Sections 11.13 and 11.14 focusing on the consistency of GWA calculations
and on extensions to the GWA, respectively. An overview of important
equations for the GWA and symbols used in this article is given in Tables
1 and 2.
40 J. P. Perdew, R. G. Parr, M. Levy, and J. L. Balduz, Phys. Rev. Lett. 49, 1691 (1982).
41 For a more detailed description, see the appendix.
-2 0 2 4 6 a 1
Si; 42
Ge; 43
dlamond; 44
3c Sic; 42
UCI; 44
gcN.45
Bdf46
B P 46
BAS: 46
AIN; 47
AIP 48
A&; 43
AISb; 48
GaN; 42
GaP 48
GaAs; 43
GaSb; 48
InP 48 Expt., indirect gap
InAs: 48
InSb; 48 0 Expt.. direct gap
ZnS; 51
2% 51
ZnTe; 51
CdS; 52
CdSe; 51
CdTe; 51
%
! z
NIO; 55
CaCuO . 5 4
6:
u 56
Z h * ; 57
sno,; 58
FIG. 3. Comparison of characteristic direct and indirect LDA, GWA, and experimental energy gaps for all semiconductors and insulators for which
first-principles GWA calculations have been reported. GWA corrects most of the LDA band gap underestimation over more than one order of
magnitude in the experimental band gap. The values for MnO, ZnO, and CaCuO, are from model-GWA calculations, which are accurate to within
0.4eV. The discrepancy between GWA and experiment for LiO, results from the neglect of excitonic effects. The experimental value for BAS is tentative.
e
The references for the LDA, GWA, and experimental values are listed after the element symbols. W
14 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
TABLE1. DEFINITIONS
OF SYMBOLS
AND QUANTITIES
SYMBOL DEFINITION
47 A. Rubio, J. L. Corkill, M. L. Cohen, E. L. Shirley, and S . G. Louie, Phys. Rev. B48, 11810
(1993).
48 X. Zhu and S . G. Louie, Phys. Rev. B43, 14142 (1991).
5 0 S. Massidda, R. Resta, M. Posternak, and A. Baldereschi, Phys. Rev. B 52, R16977 (1995).
0. Zakharov, A. Rubio, X. Blase, M. L. Cohen, and S . G. Louie, Phys. Rev. B50, 10780
(1994).
.52 M. Rohlfing, P. Kriiger, and J. Pollmann, Phys. Rev. Lett. 75, 3489 (1995).
QUASIPARTICLE CALCULATIONS IN SOLIDS 15
TABLE2. IMPORTANT
EQUATIONS
FOR THE GWA
Go(r, r'; E ) = C
mi(r)@f(r')
Independent-particle Green function
i E-Ei
(Eq. (2.4))
+ Sj ,,
Go(r, r,; E)X(r r,; E)G(r ,, r'; E)dr,dr,
Independent-particle polarizability
0%. (2.21))
+ E-E,,-Ei+i6
Static Coulomb-hole self-energy XCoH(r,r') = 46(r -r')[W(r, r'; E = 0)- dr, r')]
(Eq. (2.32))
OEE
4. THEQUASIPARTICLE
EQUATION
The physical relevance of the Green function6’ (Eq. (1.5)) can be made clear
by expressing G in terms of quasiparticle wave functions and energies via its
spectral function A. Consider a complete set of eigenstates of the many-body
Hamiltonian for a system with N + 1 or N - 1 particles. Let p be the
chemical potential and denote the quantum number of the (N + 1)-particle
or (N - 1)-particle states with i. With EN,i as the energy of the N-electron
system in state i (i = 0 for the ground state), one defines the quasiparticle
amplitude Yi(r) and the quasiparticle energy Ei as4
Yi(r) = (N,Ol’&)lN + l,i), Ei = EN+l,i- E N , O for Ei 2 p,
(2.1)
Yi(r) = ( N - 1, &r)IN, o ) , E~ = E ~- ,E,- ~ for E~ < p.
and the integration contour C runs infinitesimally above the real ,Y-axis for
E‘ < p and infinitesimally below for E’ > p.
5. THEHEDINEQUATIONS
AND THE GWA
641)
P(1,2) = -
SV(2) *
Here 1 is a short notation for a combined space and time coordinate.
Similarly, the vertex function r is given by the variation of the inverse Green
function with respect to a change in the total potential or, alternatively, by
the variation of the self-energy with respect to SV:
+
With the above definitions, 1 = (r ,t1 S), 6 > 0 infinitesimal, and u(l,2) as
+
Z(1,2) = i
s G(1,4)W(1+,3)r(4, 2; 3)d(3,4),
s
J
(2.10)
P(1,2) = - i G(2,3)G(4,2)r(3, 4; l)d(3,4),
QUASIPARTICLE CALCULATIONS IN SOLIDS 19
FIG. 4. (a) Schematic representation of the iterative determination of the self-energy Z using
Hedin’s equations (Eqs. (2.10)) in conjunction with Dyson’s equation (Eq. (2.5)). Entries in
boxes symbolize the mathematical relations that link C, G,r, P, and W Starting with Z = 0
leads to an RPA screened interaction W and subsequently to C = GWRPA.(b) Schematic
representation of the self-consistent determination of the self-energy in the GWA. P, U: and Z
are constructed starting from an LDA or Hartree-Fock independent-particle propagator.
Subsequently, Z updates the quasiparticle wave functions and energies and a new Green
function G is determined. This process is repeated until self-consistency is reached. Most
practical applications either determine only the quasiparticle energies self-consistently or do not
update quasiparticle energies and wave functions at all. Self-consistency of GWA calculations
is discussed in Section 11.13.
20 WILFRIED G. AULBUR, LARS JONSSON, A N D JOHN W. WILKINS
W( 1, 2) = U( 1, 2) +
s
P(1, 2) = -iG(l, 2)G(2, 1).
W( 1,3)P(3,4)~(4,
2)d(3, 4), (2.12)
(2.13)
64 Both names are used interchangeably in the literature. We will use the name “energy-
references therein.
72 L. Hedin, B. I. Lundqvist, and S. Lundqvist, J . Res. Natl. Bur. Stand. Sect. A 74A, 417 (1970).
7 3 L. Hedin, Phys. Scripta 21, 477 (1980).
7 4 D. C. Langreth, Phys. Rev. E l , 471 (1970).
(2.15)
(2.16)
(2.18)
7 8 Screening etTects due to inhomogeneous density distributions are in this article simply called
local-field efects. Modifications of screening due to exchange and correlation beyond the
Hartree approximation are called many-body local-field efects.
79 Ref. 21, Chapter 5.
QUASIPARTICLE CALCULATIONS IN SOLIDS 23
(2.21)
2
P&,(q; o)= - C (il exp(i(q
v ii’ + G ) r)li’)(i’l exp( - i(q + G ) .r’)li)
+ ( E ~ .- E~ -o + i6
and for the RPA dielectric matrix:
4n:
EgL!(q; 0)= 6 ~ -~1qs
~ + G 1 2 p:G(q; 4. (2.23)
6. SEPARATION
OF THE SELF-ENERGY
2x
eiE”G(r,r’; E + E‘)W(r, r‘; E’)dE‘. (2.24)
This equation is used below in conjunction with Eq. (2.4) for the indepen-
dent-particle propagator to derive three possible, approximate (Go is used
instead of G ) ways of separating the full self-energy C into physically
meaningful pieces.
Energy-Dependent Correlation + Bare Exchange. The energy-dependent
correlation contribution Zc(E) describes self-energy effects beyond the
bare-exchange or Hartree-Fock contribution Cx.Subtracting the Hartree-
Fock exchange potential from the self-energy operator and using Eq. (2.19)
for the inverse dielectric matrix leads to an expression of Cc(E) in terms of
fluctuation potentials, V,(r) = 1u(r, r’)n,(r’)dr’, electron-hole energies, E , =
EN,, - EN,,, and single-particle energies, E ~ : ”
2x
exp(iE‘G)[ W ( r , r’; E’) - u(r - r’)]Go(r,r’;E + E’)dE‘
=?so
OCC
V, ( r )V,*(r’)@ (r)@T(r’)
E+E,-EEi-i6
V, ( r )V,* (r’)@ (r)@T(r’)
+YCc i ,+o E-&,-~~+i6 ’
(2.25)
2r4
6(r - r’), (2.27)
where c1 is the dipole polarizability of the ion core around which the
Rydberg electron cycles. The last approximate equality follows from a dipole
expansion of the Coulomb potential inside the expression of the fluctuation
potential in which one keeps only the lowest order, that is, dipole terms. The
above expression equals the classical Coulomb energy of the Rydberg
electron caused by the field of the induced dipole, which is adiabatically
switched on in the ion core. The GWA recovers the relevant classical limit
for this special case. Further examples whose essential physics is contained
in the GWA are (1) the energy loss per unit time of a fast electron in an
electron gas and (2) the self-energy shift of a core electron in a solid. These
limiting cases are detailed in Refs. 4, 26, and 27 and are not discussed here.
+
Coulomb Hole Screened Exchange (COHSEX). The COHSEX approx-
imation is a physically motivated separation of the self-energy into a
Coulomb-hole (COH) part and a screened-exchange (SEX) part whose
static limit (1) has been used extensively to correct Hartree-Fock band
structures; (2) produces direct band gaps to within 20% of experiment but
gives a less reliable account of indirect band gaps (see Section 11.9); and (3)
allows an examination of local-field effects on, and the energy dependence
of, the self-energy. Consider the convolution in energy space of G and W
given by Eq. (2.24). This convolution can be determined formally by
26 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
CCoH(r,r’; E ) = 1Oi(r)@?(r’)P
i
OCC
:j dE
B(r, r‘; E’)
E-q-E’
(2.30)
W = W , + W , P , W , + W , P , W , P , W , + - ~ ~ ~ WW,P,W,.
u+ (2.36)
The three terms in Eq. (2.35) can be identified as the exchange potential
from the core (G,W x G,u due to the large energy denominators involved,
compare Refs. 83,84, and 85), as the screened polarization potential from
the core ( G , W,P, W,), and as the self-energy of the valence electrons ( G , W,).
A local density approximation of the bare core-exchange and screened
core-polarization potential leads in general to small errors except for
deviations on the order of 0.3 to 0.4eV in the band gap of materials with
large, soft cores whose lowest conduction-band state is localized on the ionic
cores. The nonlocal bare core-exchange and the screened, energy-dependent
core-polarization potential are generally small, as discussed, for example, by
Hybertsen and Louie.44Estimates for atomic Na4 or solid AlS6 indicate that
both terms contribute approximately 1 eV, relative to the bottom of the
valence band, to the quasiparticle energies. In GWA calculations based on
the LDA the proper core-valence terms are replaced by an LDA exchange-
correlation potential, which leads to a much smaller error. For states that
are localized on large, soft cores, such as the r2, conduction-band state in
7. DETERMINATION GREEN
OF THE SINGLE-PARTICLE FUNCTION
The determination of Z starting from Hartree theory requires self-consistency -the very
complication that the “best G, best W” philosophy tries to avoid-since Hartree band
structures and wave functions for semiconductors and insulators are qualitatively wrong. See
Refs. 88 and 89.
R. Daling and W. van Haeringen, Phys. Rev. B40, 11659 (1989).
89 R. Del Sole, L. Reining, and R. W. Godby, Phys. Rev. B49,8024 (1994).
90 F. Gygi and A. Baldereschi, Phys. Rev. B34, 4405 (1986).
QUASIPARTICLE CALCULATIONS IN SOLIDS 29
tially cancel.*' Note, however, that overlaps small compared to unity have
been observed, for instance, in transition metals68 and transition-metal
oxides (see Refs. 50, 54, 55, and 91). The disadvantage of using an
independent-particle Hamiltonian in the LDA is that density functional
theory is nonperturbative. Systematic improvements of GWA calculations
based on LDA Hamiltonians are only feasible once self-consistency is
achieved.
In contrast to LDA, Hartree-Fock calculations provide a good single-
particle basis only for weakly polarizable materials and are numerically
expensive due to the nonlocality of the exchange kernel. Hartree-Fock
Hamiltonians were used as independent-particle Hamiltonians in the 1970s
and 1980s to study self-energy corrections in rare earth and ionic solids (see
Section II.9a). Current uses of Hartree-Fock wave functions and energies as
input for GWA calculations are limited to the study of trans-polyacetylene
(see Section IV.25b).
OF THE DYNAMICALLY
8. DETERMINATION SCREENED INTERACTION
91 S. Massidda, A. Continenza, M. Posternak, and A. Baldereschi, Phys. Rev. Lett. 74, 2323
(1995).
30 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
Om2 t
0.0
e
-0.2 -
-
4-0 ..
s
n
---- CPP Model
Walter and &hen
’.
3 -0.4 .
‘
n *
3
v - 0 2 4 8 8 1 0
W M
-
W I
n
0 w 0.6 I 1
Y
3 GPP i
w 0.4
.- 1
f-cw
-2
8-8-cur 0.2
0.0
-0.2
-0.4
---* CFT Model
-0.6
0 4 8 12 16
-0.4
10 M 90
0 40 50
fiw (eV)
fi w’ (ev)
FIG.5. Left panel: Numerical results obtained using the empirical pseudopotential technique
for various elements of the real ( E , ) and imaginary ( E J parts of the dielectric matrix EGG(q;CO)
as a function of frequency for Si (solid line). The real part of the Hybertsen-Louie plasmon-pole
model is plotted for comparison (dashed line). The model replaces the peaked structure in E ~ ( C O )
by a &function, is constructed to describe the limits w -P 0 and w + co correctly, and breaks
down for intermediate frequencies. Right panel: Real part of the inverse dielectric function of
Si derived from the numerical, empirical-pseudopotential-based results by Walter and Cohen9’
in comparison to the Hybertsen-Louie plasmon-pole model. The average behavior of the
inverse dielectric function is captured rather well by the plasmon-pole model for energies below
the plasmon energy of Si (up,= 16.7 eV). (Adapted from Ref. 44.)
QUASIPARTICLE CALCULATIONS IN SOLIDS 31
..........................
t .. ............
$?
wE
I ubz
Excitatiom...
I
4, 2kF
- Reciprocal lattice vector q --+-
FIG. 6. Spectrum of excitation energies versus wave vector transfer q for a homogeneous,
interacting electron gas. The upper (lower) boundary of the single-particle excitations is
O.S[(k, + q)2 - kb] ( O . S [ ( - k , + q)2 - kb]). In simple plasmon-pole models this spectrum is
replaced by a single mode wp,(q)indicated by the dotted line. The real plasmon mode (full line)
is the dominant excitation for small q-vectors (0 ,< q < 4,). Beyond qc, the plasmon is heavily
damped (dashed line). (Adapted from Ref. 93.)
32 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
efective screening potential that depends only on the local electron density; see, for instance,
Refs. 96 and 97.
96 W. Hanke and L. J. Sham, Phys. Rev. B38, 13361 (1988).
97 P. A. Sterne and J. C. Inkson, J. Phys. C Sol. State Phys. 17, 1497 (1984).
for instance, Ref. 99) (1) reduces to a single plasmon in the limit q 0; (2) -+
where wpl = (47cn0(0))"2 is the plasmon frequency of the system and no(0)is
the G = 0 component of the unperturbed density. With uF as Fermi velocity,
the plasmon dispersion is approximated by w2(q)= +
4 q 2 / 3 +(q2/2)2.
The continuous spectrum of density fluctuations is substituted by a single
plasmon pole, and electron-hole excitations in particular are neglected. In
the case of the homogeneous electron gas, L u n d q v i ~ t ~ ~ *shows ' ~ ~ ~that
'~'
electron-hole excitations are of minor importance for the determination of
the self-energy C.
In inhomogeneous semiconductors such as Si, quasiparticle energies
determined using plasmon-pole models (e.g., Ref. 42) differ, by no more than
50meV for the direct gap at r, from those determined by taking the full
frequency dependence of &GG'((I; w) into a c c o ~ n t . ' Table
~ ~ *21 ~ shows
~ ~ that
the valence-band width in Si is about 0.3 eV smaller when the full frequency
dependence is taken into account (compare calculations by Fleszar and
Hankelo2 and Rieger et a1.1°3 with plasmon-pole model calculations). The
effect of particle-hole excitations on quasiparticle energies is not significantly
larger than typical numerical uncertainties of GWA calculations (see Sec-
tions 111.15, 111.16, and 111.18).
A modification of the Lundqvist model for s e m i c o n d u ~ t o r(1)
s~~ reduces
~
to the static dielectric constant for q -+ 0; (2) reproduces free-electron
behavior at large q; (3) describes a modified Thomas-Fermi screening at
intermediate q ; and (4) allows for the analytic evaluation of the static
Coulomb-hole contribution to the self-energy. Capellini et u1.'04 suggested
the following expression for the dielectric function of a semiconductor:
where the Thomas-Fermi wave vector k,, depends on the electron density
n and ~(0) is the static RPA dielectric constant. The parameter a is taken as
a fitting parameter to optimize agreement with first-principles dielectric
constants and turns out to be approximately constant for small- and
medium-gap semiconductors.104As is shown in Fig. 7 for Si and GaAs, the
model by Cappellini et al. gives a better description of the full RPA dielectric
function than the Levine-Louie model, which is introduced below. However,
in contrast to the Levine-Louie model, the model of Cappellini et al. does
not reduce to the RPA dielectric function in the case of metallic screening.
\. ..-
FIG. 7. Static dielectric function E(q. w = 0) as a function of the reciprocal lattice vector q
for Si and GaAs. The model dielectric function of Cappellini et (solid line) compares
better with RPA results of Walter and C ~ h e n "(closed
~ boxes for q along the (111) direction,
open boxes for q along the (100) direction) than the Levine-Louie model dielectric function
(dashed line).lo6 RPA results by Baldereschi and Tosatti (stars)'" for large q seem to favor
the Levine-Louie model over the model by Cappellini et al. The parameters a and ~(0) are the
same as in Eq. (2.38). (Adapted from Ref. 104.)
QUASIPARTICLE CALCULATIONS IN SOLIDS 35
For an application of the model in the context of GWA see, for example,
Ref 108.
COHSEX calculations for insulators in the 1970s and early 1980s (see
Section II.9a) often relied on the Penn dielectric function'09 and its
variations, which describe the dielectric response of an isotropic, three-
dimensional insulating electron gas. Perm'" used the Ehrenreich-Cohen
formula"0 for the dielectric response function. With Egapas an average
optical gap adjusted to fit E ( q = 0) with the experimental dielectric constant,
E F as the Fermi energy, and k, as the Fermi vector, the numerical results
can be fitted with an interpolation formula:"'*''2
(2.39)
(2.40)
function via
finds
of,
E(q -+ 0,o) = 1 + oiap
- o2.
(2.42)
at the local density determined by r,(r’) and used the Levine-Louie gap
parameter to ensure correct long-range screening:
Here ~ ( 0is) the static dielectric constant and an input parameter to the
model. Hybertsen and Louie’ explicitly symmetrized the screening poten-
tial:
[EE&] + ;l
(q ;o = O)u(q G ) = - u(q G ) + J [E~J - '(Iq + GI; rs(r'))e'(G-G).dr'
r'
1.
-
+ u(q+G')
s [ ~ ~ ~ ] - l ( l q + Gr,(r))e'(G'-G)'rdr
'I;
b. Plasmon-Pole Models
The imaginary part of the important elements of the dielectric matrix has a
peaked structure as a function of frequency, which plasmon-pole models
approximate by a &function characterized by two parameters: the effective
strength and the effective frequency of the plasmon excitation. Together with
the independent-particle propagator (Eq. (2.4)), whose frequency depend-
ence is straightforward, plasmon-pole models allow an analytic evaluation
of the convolution of G and W in frequency space (Eq. (2.24)). As a
consequence, the numerically expensive integration in Eq. (2.24) is avoided.
Plasmon-pole models give a good description of both the low-energy
behavior of the dielectric matrix -via reproduction of the static limit -and
the high-energy behavior of the dielectric matrix -via reproduction of the
first frequency moment. As a consequence, energy integrations of the
screened interaction are sufficiently accurate for the determination of quasi-
particle energies and effective masses." The effective strength and fre-
quency of the plasmon excitation are determined by forcing the model to
reproduce the static dielectric matrix in the zero-frequency limit and by
using Johnson's sum rule.' l 7 The latter sum rule connects the first
6+1
l 5 N . H. March, Electron Correlation in Molecules and Condensed Phases, Plenum Press, New
York (1996), 132.
D. L. Johnson, Phys. Rev. B 9 , 4475 (1974); M. Taut, J. Phys. C: Sol. State Phys. 18, 2677
(1985); and Ref. 44.
'I7 The use of Johnson's sum rule in conjunction with nonlocal pseudopotentials is not
justified, as pointed out for instance in Refs. 44 and 118. However, band-gap errors due to this
procedure are only about 15 meV for Si, as reported in Ref. 119.
G . E. Engel and B. Farid, Phys. Rev. B46, 15812 (1992).
R. T. M. Ummels, P. A. Bobbert, W. van Haeringen, Phys. Rev. B57 11962 (1998).
38 WILFRIED G. AULBUR, LARS JONSSON, A N D JOHN W. WILKINS
x
X&!AQ) = - 5 (9 + G ).(4 + G')no(G- G'). (2.48)
Farid.' 2o Both models give identical expressions for the expectation value
of the self-energy operator provided that appropriately scaled plasmon-pole
eigenvalues and eigenvectors are introduced (see below and Sections 111.15
and 111.16).
Von der Linden and Horsch. Von der Linden and Horsch'22 considered
an N-parameter plasmon-pole model derived from the eigenvalue decompo-
sition of the symmetrized dielectric matrix E under the assumption that the
energy dependence of E is contained in its eigenvalues only. Define the
following symmetrized, Hermitian dielectric matrix,
(2.49)
where p numbers the real, positive eigenvalues &'(a) and the correspond-
ing eigenvectors IOw(o)). Assume that the frequency dependence of the
inverse dielectric matrix is solely contained in the eigenvalues (lOw(o)) =
IOw(o = 0))) and is of the
(2.52)
2
LGG'(q) = (q $. G ) .(q + G')no(G- G') = --
71
xg&(q), (2.53)
one can express the plasmon-pole frequencies using Johnson's sum rule' l6
as
(2.54)
Subsequently, the energy integration of Eq. (2.24) can be done and the result
is given in Sections 111.15 and 111.16 for a plane-wave basis.
Engel and Farid. Engel and Farid'a' derived an N-parameter plasmon-
pole model whose eigenvalues and eigenvectors are frequency dependent by
explicitly constructing an approximation X to the full polarizability x, which
approximation is exact in the static and the high-frequency limits. With X,
x, and L-' (the inverse of the first-moment matrix L in Eq. (2.53)) as
matrices in the reciprocal lattice vectors G and G', the approximation X to
the full polarizability x can be expressed as"'
(2.56)
(2.57)
(2.58)
(2.59)
QUASIPARTICLE CALCULATIONS IN SOLIDS 41
(2.61)
(2.62)
%(O) a
This observation in conjunction with Eq. (2.60) led Engel and FaridI2’ to
interpret the eigenenergies of Eq. (2.56) as plasmon energies and their small
q-vector dispersion coefficient as plasmon dispersion coefficient.
9. EARLYQUASIPARTICLE
CALCULATIONS
This section reviews the history of GWA calculations and outlines some
related approaches. As in the section on the model dielectric matrices, we
concentrate on work done on semiconductors and insulators. We focus our
survey on the time span between 1970 and 1986. Before 1970 most of the
GWA work was done on the homogeneous electron gas. The only work on
semiconductors is the paper by Brinkman and Goodman.83 These papers
are thoroughly reviewed in Refs. 4 and 132. In 1985, first-principles
quasiparticle calculations were done by Hybertsen and Louie’ and in 1986
by Godby, Schluter, and Sham.6 First-principles GWA calculations are
discussed at length starting in Section 11.10.
DIRECTGAP (eV)
Ehhp/E;:; (MATERIAL,
EXP.
FEATURE CORE I? BASIS DIELECTRIC CONSTANT)
“Static COHSEX calculation neglecting local fields and based on a model dielectric function.
bGWAcalculation including energy dependence (plasmon-pole model in Ref. 5) and local fields,
and based on first-principles dielectric matrix.
S. Baroni, G. Grosso, and G. Pastori Parravicini, Phys. Rev. B29, 2891 (1984).
14' S. Baroni, G. Pastori Parravicini, and G. Pezzica, Phys. Rev. B32, 4077 (1985).
1 4 2 Handbook of Optical Constants ofsolids, ed. E. D. Palik, Academic Press, Orlando (1985).
144 Hybertsen and Louie used a plasmon-pole model; Godby and collaborators determined
b. Excitonic Efects
An extension of the COHSEX approximation' 54,1 that includes excitonic
correlations in the irreducible polarizability gives a better description of
photoabsorption experiments-in which photons create interacting electron-
hole pairs- than do RPA-based approaches. In contrast to RPA-based GWA
calculations whose fundamental excitations are noninteracting electron-hole
pairs and plasmons, GWA calculations based on a description of screening in
the time-dependent Hartree-Fock approximation include the short-range,
attractive interaction between electrons and holes via ladder diagrams.
The determination of the influence of excitonic correlations on density
fluctuations and hence on the dielectric matrix requires the solution of the
equation of motion of the two-particle Green function, that is, the Bethe
Salpeter equation.' 5 4 * 1 5 5 ~ 15 6 The inverse dielectric matrix is given in terms
of the density-density correlation function,'65 and the latter can be written
as a two-particle Green function:'67
16' To be consistent with Hanke et aL's notation we do not consider density fluctuation
operators A'(rt) = h(rt) - (A(rt)) as in Eqs. (2.17) and (2.18), but density operators h(rt). For
the definition of the physically relevant retarded dielectric matrix, the two approaches are
equivalent, since c-numbers always commute. For the time-ordered dielectric matrix, the
approaches differ only in the limit w + 0. The formulation using the density fluctuation
operator avoids a divergence due to the bosonic nature of the density-density correlation
function (see, for instance, Refs. 18, 558ff., and 166).
QUASIPARTICLE CALCULATIONS IN SOLIDS 47
x
Generally, a two-particle Green function is defined as
G ( r , t , , rzt,, r3t3,r4f4) = ( - i ) * ( N , OlTC~(r,tl)Y(r2t2)Yt(r4r4)~t(r3r3)llN,
0).
16’ W. Hanke and L. J. Sham, Phys. Rev. B21,4656 (1980).
,IITs= 3’
4‘
_ - _ _ - - _Coulomb
- repulsion
‘VVL Coulomb attraction
FIG. 8. The irreducible electron-hole interaction in the time-dependent Hartree-Fock ap-
proximation is given by the sum of a screened Coulomb attraction and an unscreened Coulomb
repulsion. The first term generates an infinite sum of ladder diagrams, whereas the second term
generates an infinite sum of polarization bubbles (compare Eq. (2.66)). (From Ref. 168.)
48 WILFRIED G. AULBUR, LARS JbNSSON, A N D JOHN W. WILKINS
dependence of the dielectric matrix (via Eqs. (2.64) and (2.18)) and on the
quasiparticle spectrum (via Eqs. (1.3) and (2.1 1)).
The inclusion of vertex corrections in the irreducible polarizability but not
in the self-energy is motivated by two observations. (1) An exact Ward-
identity-based relation between the self-energy and the vertex function is
expected to be better observed when the irreducible polarizability is deter-
mined in the time-dependent Hartree-Fock rather than the time-dependenl
Hartree approximation.' 5 6 This Ward-identity-based relation results from
charge conservation. (2) Hanke and collaborators suggested that the evalu-
ation of the GWA self-energy with an interaction screened using the
time-dependent Hartree-Fock approximation is more appropriate than an
evaluation of C using an RPA-screened interaction. The choice of screening
in the time-dependent Hartree-Fock approximation can be motivated a$
follows: Choosing C = 0 in Fig. 4 as the starting point for an iterative
solution leads to l-(1,2; 3) = 6(1,2)6(1,3) and E = E~~~ (see Eqs. (2.10) and
(2.23)). The next iteration yields C(1,2) = iG(1,2)W(1+,2) and subsequent13
x G(4,6)G(7,5)r ( 6 , 7 ; 3)d(4,5,6,7)
= 6(1,2)6(1,3) +i
s W(1+,2)G(1,6)G(7,2)r(6,7; 3)d(6,7),
(2.68
where contributions due to SW/SC are neglected since they are difficult tc
handle and their contributions are estimated to be negligible for Si anc
diamond.'69 Using the above expression for the vertex function in thc
definition of the polarizability as well as the dielectric matrix, one finds tha
the dielectric matrix includes not only RPA bubbles, corresponding tc
6(1,2)6(1,3) in the above equation, but also an infinite series of screenec
ladder diagrams resulting from the second term in Eq. (2.68). Note tha
Mahan" used Ward identities to justify the simultaneous inclusion of iden
tical vertex corrections in C and P in contrast to the approach suggested b!
Hanke et al. Mahan's approach is discussed in more detail in Section 11.14
Inclusion of excitonic effects via the solution of the Bethe-Salpete
equation for several semiconductors leads to (see Refs. 156, 169, 170, 171
Structural Properties of Condensed Matter, eds. J. T. Devreese and P. Van Camp, Plenum, Nev
York (1985), 113.
170 H. J. Mattausch, W. Hanke, and G. Strinati, Phys. Rev. B27,3735 (1983).
"' N . Meskini, H. J. Mattausch, and W. Hanke, Sol. State Comm. 48, 807 (1983).
QUASIPARTICLE CALCULATIONS IN SOLIDS 49
and 172): (1) band gaps and valence band widths in good agreement with
experiment, (2) an increase in oscillator strength at the El peak of the
frequency spectrum of E by 50- loo%, in good agreement with experiment,
and (3) a larger El peak intensity than E , peak intensity for Si, in contrast
with experiment (see Fig. 39 for a definition of the E l and E , peaks in the
imaginary part of the Si dielectric function). Practical calculations use a
matrix representation of the Bethe-Salpeter equation with respect to semi-
empirical tight-binding valence and conduction bands, which are expressed
in a Gaussian-orbital basis
c. Local Approach
Local approaches to the correlation problem in solids (for a review see Ref.
19) are based on the physical picture that adding an electron or hole to an
insulator leads to a polarization of neighboring bonds whose induced
dipoles will act on and change the energy of the original particle. A local
description of correlation effects based on quantum chemistry methods
starts from a Hartree-Fock wave function and includes local correlations in
the form of one- and two-particle1 7 3 excitations in the many-body wave
function I Y ) via a projection operator exp($:
The operator s
depends on parameters that characterize ground- and
excited-state correlations and that can be determined variationally.
Bond polarizations are interatomic correlation effects that dominate
dielectric screening and hence quasiparticle energy shifts of occupied and
unoccupied states in covalent semiconductors. These interatomic correlation
effects can be described by projector methods based on a minimal basis set
of Gaussian orbital^.'^^.'^^,'^^ Other important effects, like the relaxation
of electronic orbitals in the neighborhood of an added electron or the
change of ground-state correlations due to the presence of an extra electron,
require a larger basis set. Each of these two partially canceling effects is
estimated to amount to band-gap corrections on the order of 1 to 2eV
based on molecular calculations.157~158~159
Local projector methods determine ground-state properties of a variety
of semiconductors with better accuracy than LDA,' 74.1 7 5 - 17 6 but applica-
tions to excited states are limited and remain difficult due to the extended
nature of these ~ t a t e s . ' ~ The
~ . ' ~calculations
~ scale as the number of basis
functions to the fifth power,'79 compared to standard LDA and GWA
calculations that scale as the number of basis functions to the third and
fourth power, respectively. Applications of projector methods to excited-
state proper tie^'^^*'^^*'^^ have been reported for Si, Egap= 3.0eV (3.3-
3.4 eV experiment); diamond, Egap= 7.2 eV (7.3 eV experiment); and Ge,
Egap= 0.44eV (0.9 eV experiment).
Here p(r) is a local momentum and the chemical potentials of the in-
homogeneous (p) and the homogeneous @,,) system are introduced to line
up the Fermi levels. Calculations on metals using this approximation are
reported by, among others, Arbman and von BarthS6 for A1 and by
Sacchetti"' for Cu.
For practical calculations [Refs. 160, 161, 162, and 1811 in semiconduc-
tors the homogeneous self-energy is evaluated in the GWA with the further
assumptions that (1) the Levine-Louie dielectric function describes screen-
ing, and (2) the Green function equals the Green function of a fictitious,
insulating free-electron gas. Although this ansatz is guided by physical
intuition, it lacks rigorous justification86*'60 and may elude systematic
improvements.'60 Advantages of the QPLDA are that (1) it is not limited
to minimal basis sets, in contrast to the early work of Hanke et u1.154*155,156
and Fulde et a1.,'57*'58*159 and (2) it leads to only a 30% increase in
computation time compared to LDA and is therefore very efficient com-
pared to the other methods discussed above.
Important physics of the many-body self-energy is not captured in the
QPLDA for two reasons: (1) the energy-dependence of the QPLDA self-
177 J. Grifenstein, H. Stoll, and P. Fulde, Phys. Rev. BS5, 13588 (1997).
'" B. Paulus, private communication.
179 G. Stollhoff and P. Fulde, Z . Phys. B29,231 (1978).
F. Sacchetti, J. Phys. l? Met. Phys. 12, 281 (1982).
"' C. S. Wang and W. E. Pickett, Phys. Rev. Lett. 51, 597 (1983).
QUASIPARTICLE CALCULATIONS IN SOLIDS 51
energy increases the LDA band gaps and achieves satisfactory agreement
with experiment; and (2) local-field effects are not taken into account. In
more accurate GWA calculations static, inhomogeneous screening effects
open the LDA band gap of semiconductors, and dynamic effects reduce the
gaps to within a few tenths of an eV of the experimental values (see next
sections). Nevertheless, QPLDA band gaps generally agree with experiment
to within 0.4 eV. For example, the direct band gap of Si is 2.99 eV compared
to experimental values of 3.3 to 3.4eV, and the indirect gap equals 0.81 eV
compared to an experimental value of 1.17 eV.162
10. LOCAL-FIELD
EFFECTS AND THE NONLOCALITY
OF THE SELF-ENERGY
a. Local Fields
A real-space analysis of the static screening potential, Wscr(r,r’) = W(r, r‘;
E = 0) - v(r, r’), around an extra electron centered at a bond center or at
52 WILFRIED G. AULBUR, LARS JBNSSON, AND JOHN W. WILKINS
2- ....,....,....,.-..-
an interstitial site44 shows that (1) local fields contribute more than a third
of the screening potential in the region near the center of the bond; (2) the
local-field contribution is much smaller (roughly one-tenth) in the low-
density, interstitial regions; (3) local fields are responsible for the anisotropy
of the screening potential; and (4) local fields are short-ranged and their
effect is negligible beyond roughly one bond length. Figure 10 shows the
results for WScr evaluated at E = 0 in analogy to the static COHSEX
approximation to separate the energy dependence of C from local-field
effects.
The screening potential due to an electron at site r evaluated at the same
site determines the Coulomb-hole contribution to the self-energy, which
consequently is constant when local fields are neglected and much deeper in
the bonding region than in the interstitial region when local-field effects are
QUASIPARTICLE CALCULATIONS IN SOLIDS 53
FIG.10. Contour plot in the Si (110) plane of the screening potential W"' (in Rydberg) in
response to a single electron (indicated by a cross) at (a) a bond-centered or (c) an interstitial
site. Panels (b) and (d) show the corresponding local-field (LF) contributions. Local fields (1)
contribute between 1/3 (bond center) and 1/10 (interstitial) to the total screening potential; (2)
are responsible for the asymmetry of the screening potential; and (3) are short-ranged and
become negligible after one bond length. (Adapted from Ref. 44.)
taken into account. Local fields are more important for valence than for
conduction bands since the valence-band density is concentrated in the
bonding region and the conduction-band density is concentrated in the
interstitial region. The Coulomb-hole and screened-exchangecontributions
to the self-energy can be expressed in terms of the screening potential WScr
as follows (see Eqs. (2.32) and (2.33):
1
F o H ( r , r') = -6(r - r')Wscr(r, r), (2.71)
1,
2
SEX Wscr(r, r') +-
Ir - r'( (2.72)
54 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
COHSEX
COHSEX
Si No LF LF GWA
xCOH
XCoH and that local-field effects for unoccupied bands in XSEX are even
smaller. Since local-field effects in X S E X and CCoHare of opposite sign and
amount to at least several tenths of an eV, they must be included in the
determination of both terms to ensure quantitatively accurate results. The
strong effect of local fields on XCoH for the valence band with respect to the
conduction band, and the relative insensitivity of XSEX to local fields, leads
to a band-gap increase of 0.8 to 4.4eV compared to the LDA or the static
COHSEX approximation without local fields.
Table 6 demonstrates that:44 (1) the LDA gives band gaps that are at least
0.5 to 2.0eV smaller than experiment (see also Fig. 3); (2) the COHSEX
approximation without local fields gives band gaps that are in better
agreement with experiment, although significant deviations from experiment
remain; and (3) local fields dramatically open up the band gap and require
inclusion of dynamic screening to achieve quantitative agreement with
experiment.
Local-field effects on the self-energy shifts in simple metals are less than
the numerical uncertainties of GWA calculations (see Section V.26b and Ref.
184) since their density is relatively homogeneous compared to covalently
bonded semiconductors.
FIG. 11. Contour plots of self-energy Z(r, r’; E = midgap) in eV a.u.-3 for r fixed at the bond
center and r’ shown in the (110) plane for (a) Si, (b) GaAs, (c) AIAs, and (d) diamond. For
silicon, the corresponding plots with r fixed at the tetrahedral interstitial site are also shown
(panel (e)). For comparison, the self-energy operator of jellium with rr = 2 (the average density
of silicon) is shown (panel (f)). Godby, Schliiter, and Sham suggested modeling the approxi-
mate form of the nonlocality of the self-energy (not its depth) using the results from jellium
calculations. (From Ref. 185.)
QUASIPARTICLECALCULATIONS IN SOLIDS 57
OF THE SELF-ENERGY
11. ENERGYDEPENDENCE
2 - (2) 51
c
0
f;
=
W
-
w
-2
-5 0 I
OW OUASIPARTICLE
EN€RCr ( r V )
FIG. 12. (a) The real part of the matrix elements of the GWA self-energy operator of silicon
(Z(E))and the LDA exchange-correlation potential (V’D”) plotted against the quasiparticle
energy E. (b) The differences (Z(E)- VkD”)(essentially the quasiparticle energy correction)
and (Z(0) - V k D A )of silicon plotted against the quasiparticle energy. The energy dependence
of the self-energy must be taken into account to obtain the correct dispersion for the
quasiparticle energies. (c) For comparison, (Z(E)- V;””) and (Z(0) - V i D ” ) are plotted
against the quasiparticle energy E for jellium at the average density of silicon (r, = 2). The
Fermi energy E , is aligned with the middle of the band gap. Both are relatively featureless and
QUASIPARTICLE CALCULATIONS IN SOLIDS 59
that the expectation values of the self-energy operator in diamond, Si, Ge,
and LiCl have a negative slope of about -0.2.44 This slope translates into
nearly constant quasiparticle weights Ziof about 0.8, as shown in Table 9
(Eq. (2.7)), and to about a 20% reduction in the quasiparticle shifts via Eq.
(2.6). However, the quasiparticle weights Ziare close enough to unity that
quasiparticles are well-defined excitations for energies close to the band gap,
although dynamical effects are nonnegligible. This statement is no longer
valid in d and f electron systems, as discussed in Section V.26~.The curves
of X ( E ) bend upward for hole and downward for electron states, which
clearly do not share the sharp discontinuities present in (a) and (b). (d) The real parts of the
matrix elements of the bare-exchange (Hartree-Fock) self-energy operator of Si (Z,) and the
statically screened exchange self-energy operator ( X s s x ) . ( VhD”) is subtracted as in (b). The
nonlocality of Z, has too large a range due to the neglect of screening which leads to a jump
in the self-energy corrections at the Fermi level that is too large in comparison with experiment.
(e) The real parts of the matrix elements in silicon and jellium (r, = 2) of the frequency
derivative of the self-energy ( d Z ( o = midgap)/aw). (Taken from Ref. 185.)
60 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
Energy (eV)
FIG. 13. Matrix elements of the electron self-energy operator evaluated in the GWA drawn
as a function of energy for selected states near the band gap are displayed for (a) diamond, (b)
Si, (c) Ge, and (d) LiCI. The quasiparticle energies of these states are indicated on the energy
axis. The self-energy shows an approximately linear dependence on energy close to the
quasiparticle energy for all states and all materials considered. The slope of (Z(E))is roughly
material and state independent and amounts to about -0.2. This indicates that dynamical
effects are important for a quantitative description of quasiparticle energies in solids. (Taken
from Ref. 44.)
QUASIPARTICLE CALCULATIONS IN SOLIDS 61
0.5
r ‘
1 2 3 4 5 6 7 8 9
I
10
Wave Vector
FIG. 14. Self-energy correction A(k) = (Z(E)- V;””) to the direct band gap at the point
and at 10 special k-points in the fcc Brillouin zone for Si, SIC, and GaN. The standard deviation
from the average self-energy correction (the “scissors” shift) amounts to 0.03 eV for Si, 0.13 eV
for SIC, and 0.17eV for GaN. This corresponds to 4.5%/10.4%/11.7% of the average self-
energy correction (0.73 eV/1.20 eV/1.47 eV for Si/SiC/GaN). Although the scissors approxi-
mation works well for covalently bonded Si, it clearly breaks down for more ionic materials
such as Sic and GaN.
and Ge. For more ionic materials such as diamond and LiCI, the deviation
is as large as 1.5 eV.
A simplified model of the self-energy of an insulator approximates its
nonlocality and energy dependence by the corresponding quantities of the
jellium self-energy evaluated at the local density in the i n s u l a t ~ r . Figure
'~~
12, panel (e) shows that the variation of the self-energy with respect to
energy can be modeled by results obtained from the homogeneous electron
gas. Assuming that the energy dependence of the self-energy factors out
completely and considering the observations in the previous section, Godby,
Schliiter, and Sham' 8 5 suggested the following physically appealing ap-
proximation for the self-energy:
where g(lr - r'l) and h ( E ) are functions that describe the nonlocality and
energy dependence of the self-energy of jellium. The factor (f(r) + f(r'))/2
accounts for local-field effects. To the best of our knowledge applications of
this formula have been limited to model calculations.
12. CORE-POLARIZATION
EFFECTS
I S 7 We define semicore orbitals loosely as d and f orbitals that have a significant overlap with
and are close in energy to valence electrons.
IS8 G. E. Engel and R. J. Needs, Phys. Rev. B41, 7876 (1990).
'13' V. Fiorentini, M. Methfessel, and M. Schemer, Phys. Rev. B 47, 13353 (1993).
A. Nazzal and A. Qteish, Phys. Rev. B 53, 8262 (1996).
S.-H. Wei and A. Zunger, Phys. Rev. B 37, 8958 (1988).
G. B. Bachelet and N. E. Christensen, Phys. Rev. 831, 879 (1985).
N. E. Christensen and I. Gorczyca, Phys. Rev. B 50,4397 (1994).
A. Garcia and M. L. Cohen, Phys. Rev. B 47, 6751 (1993).
W. G. Aulbur, Z. H. Levine, J. W. Wilkins, and D. C. Allan, Phys. Rev. B51,10691 (1995).
D. Cociorva, W. G. Aulbur, and J. W. Wilkins, unpublished.
64 WlLFRlED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
Semicore states also significantly affect the electronic and optical proper-
ties of solids, since, for example, (1) band gaps decrease upon inclusion of
semicore states as valence states by about 0.5 eV;'91*'92 (2) the calculated
dielectric constant increases by 10-50% upon inclusion of semicore
state^,'^'*'^^*'^^ and (3) for higher-order optical response functions, such as
the coefficient of second-harmonic generation, inclusion of semicore states
can give an effect sufficiently large to switch their sign.'9s-200Since quasipar-
ticle calculations depend crucially on an accurate description of screening,
a careful treatment of core states is necessary in 11-VI semiconductors, some
111-V materials, alkali metals, alkali earths, and noble metals.
The LDA description of exchange and correlation between valence,
semicore, and same-shell core electrons (e.g., Cd 5s, Cd 4d, and Cd 4p, 4s;
Ref 52) is based on a local, energy-independent functional of the total
density of the system. However, exchange and correlation effects between
core and valence electrons are nonlocal and energy dependent. Dynamic
correlation effects occur when fluctuating core dipoles interact with fluctuat-
ing valence densities. These correlations modify the effective interaction
between valence electrons. Rather than interacting via the bare Coulomb
potential u, valence electrons experience an interaction potential,
W, = EC u, where EC describes the screening by core dipoles.20'
The errors due to the LDA can be eliminated either by a core-polariz-
ation-potential (CPP) approach20'*202*203- which is computationally effi-
cient, since core electrons can still be treated on a frozen-core or
pseudopotential level -or by treatment of core electrons as valence elec-
t r o n ~ . ' ~ * ~The
~ , 'most
~ ' important error introduced by the pseudopotential
approximation is the total neglect of core relaxation, that is, changes in the
core orbitals due to a change in the chemical environment (e.g., crystal-field
effects) and to hybridization of core and valence orbitals. These effects must
be estimated in the C P P approach, whereas they are included on an LDA
level when core electrons are treated explicitly. Also, the semicore d states
of 11-VI materials energetically overlap the valence band and hence cannot
be eliminated via a core-polarization potential. Explicit inclusion of the d
electrons in the valence band amounts to a significant increase in computa-
tional cost for plane-wave-based quasiparticle calculations204 such as the
one used in Ref. 43. Other approaches that describe semicore states via only
nonlinear core corrections205have been shown to give band structures in
good agreement with experiment for 11-VI materials.51 However, this
agreement must be considered coincidental, since it relies upon error
cancellation between core-relaxation and core-valence exchange and polar-
ization effects.
(2.75)
(2.76)
and ri is the location of the ith valence electron and 2, the atomic number
of the Jth core. The modification of the original valence Hamiltonian Vcpp
can now be written aszo1
(2.77)
'05 S. G. Louie. S. Froven. and M. L. Cohen. Phvs. Rev. B26. 1738 (1982).
66 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
effects.z0’~202~z03 The first two effects are easy to incorporate into standard
band structure calculations. The contributions due to the valence electrons
need further modifications, since valence electrons cannot be described as
point charges. For instance, the single-electron potential for one valence
electron far away from a core of polarizability a reduces correctly to the
classical result - 0 1 / ( 2 r ~ ) .Ho
’ ~ ~wever, valence electrons can penetrate the
core where the classical result diverges. Accordingly, a cut-off function has
to be i n t r o d u ~ e d , and
~ ~ ~a ’typical
~ ~ choice is
(2.79)
The sum over 1 in the above equations is a sum over angular momenta, and
is the corresponding projection operator. For each atomic species one has
to determine the parameters 01, r I , and Fe-,. Shirley et al.43used the experi-
mental core polarizabilityZo7 and varied the rl (1 = 0, 1, 2) to obtain the
correct removal energy for one valence electron of angular momentum 1
outside the ion core (rl = rz for 12 3) and Fe-e = 0.5(r0 rl). +
The two-electron potential k-eis an effective interaction between two
valence electrons due to core-polarization effects. This term screens the bare
Coulomb interaction, which screening has to be taken into account in
quasiparticle calculations. The effective interaction between valence elec-
trons can be expressed as4943
(2.80)
208 "Crystal and Solid State Physics", in Landolt-Bornstein, Numerical Data and Functional
Relationships in Science and Technology, vol. 17a, ed. by 0. Madelung, Springer, Berlin (1984).
2 0 9 As cited in Ref. 43.
210 Note that the correct ordering of the lowest conduction bands in Ge can also be achieved
in calculations that have no self-interaction errors (Refs. 211 and 212). The energies of the
lowest-lying conduction bands in these methods deviate significantly from experiment (,!IcB at
L - r - X = 0.10 eV - 0.12 eV - 0.53 eV (Ref. 21 l), 1.01 eV - 1.28 eV - 1.34 eV (Ref. 212) versus
experimental values of 0.74 eV - 0.90 eV - 1.2 eV given in Table 10.
QP
CVI IN CVI IN
QUANTITY LDA LDA CPP ExF-T."
13. SELF-CONSISTENCY
216 P. Horsch, W. von der Linden, and W.-D. Lukas, Sol. State Comm. 62, 359 (1987).
2’7 The quoted value in Ref. 52 is slightly diBerent and equals -7.4 eV.
70 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
11. CALCULATED
TABLE BINDING
ENERGIES
IN eV OF THE SEMICORE
STATESCd 4d IN
Cumc CdS, Ge 3d IN Ge, AND Si 2p IN Si. THEZEROOF THE ENERGYSCALEIS GIVEN
BY THE RESPECTIVEVALENCE-BAND MAXIMUM. THE EXPERIMENTAL DATAARE
WEIGHTED AVERAGESOF SPIN-ORBIT-SPLIT LEVELSAND ARE T m AS QUOTED IN
REF.213. THES ~ LGLDA S AND GQPSCORRESPOND TO GWA CALCULATIONS USING
AN LDA PROPAGATOR OR A PROPAGATOR CALCULATED m A RENORMALIZED
SPECTRALFUNCTION (APPROXIMATELY SELF-CONSISTENTGWA CALCULATION; SEE
REF. 213). ALL VALUESARE TAKEN FROM REF.213 mTHE FOLLOWING EXCEPTIONS:
IN THE CASEOF Ge WE ALSO LISTLDA, GLDAWA, AND SLATER-TRANSITION-STATE
(STS) RESULTS OBTAINED BY AN LMTO FORCdS, WE LISTTHE
LDA A M ) SELF-INTERACTION- AND RELAXATION-CORRECTED PSEUDOPOTENTIAL
(SIRC-PSP) &.SULTs OF REF. 218.
GWA ’
“Ref. 218; bSIRC-PSP ‘Ref. 145; dSTS; ‘Ref. 219; fRefs. 220 and 221; 8Ref. 222;
hRef. 223.
and energies; and (3) in the majority of cases the frequency dependence of
the dielectric response is further approximated by plasmon-pole models.
This approach, which we call the “standard” GWA, or GoWoA in what
follows, works well in practical applications and generally leads to good
agreement with experiment for quasiparticle energies. However, from a
principle point of view it has several shortcomings.
A consistent approach to GWA calculations evaluates the RPA polariz-
ability and the GWA self-energy using Hartree wave functions and energies
and iterates the calculation until self-consistency in G and W is achieved.
’18 D. Vogel, P. Kriiger, and J. Pollmann, Phys. Rev. B54, 5495 (1996).
A. P. J. Stampfl, P. Hofmann, 0.SchalT, and A. M. Bradshaw, Phys. Rev. B 55,9679 (1997).
220 L. Ley, R. A. Pollak, F. R. McFeely, S. P. Kowalczyk, and D. A. Shirley, Phys. Rev. B9,
600 (1974).
’” N. G. Stoffel, Phys. Rev. B28,3306 (1983).
2 2 2 Zahlenwerte und Funktionen aus Naturwissenschajien und Technik, eds. A. Goldmann and
E. E. Koch, Landolt-Bornstein, New Series, Group 111, vol. 23a, Springer-Verlag,Berlin (1989).
223 W. Monch, in Semiconductor Surfaces and Interfaces, eds. by G. Ertl, R. Gomer, and D.
Mills, Springer Series in Surface Sciences, vol. 26, Springeer-Verlag,Berlin (1993).
224 W. von der Linden and P. Horsch, Phys. Scripta 38,617 (1988).
QUASIPARTICLE CALCULATIONS IN SOLIDS 71
u
J
1.6
5 1
Y
4 1.8
41B 1
4
0.8
4.6
-1
- 1 0 0 i o m 5 o a m 6 0 7 0
ELDA(ev)
FIG. 15. Top panel: Silicon quasiparticle energy shifts E,, - EL,, as a function of the LDA
energy E,,,. The overall shape of the energy shifts is similar to the shape of quasiparticle
energy shifts in jellium4~z29(not shown) and Ag (see Fig. 42 and Section V.26a). Bottom panel:
Diagonal matrix elements of the imaginary part of the Si self-energy with respect to LDA
orbitals calculated at the energy of the corresponding quasiparticle peak. The shape of the
imaginary Si self-energy is similar to the shape of the non-self-consistent decay rate of jellium
(see Fig. 18). In both figures, the full circles are the result of a standard (RPA) GWA
calculation, the empty circles have the (time-dependent) LDA vertex function included (GWT
calculations, see Section 11.14). The self-energy is calculated along the L-T-X line. (From Ref.
102.)
QUASIPARTICLE CALCULATIONS IN SOLIDS 73
self-energy is only justified in the immediate vicinity of the band gap. As can
be seen in Fig. 15 (top panel), quasiparticle energy shifts arelo2 linearly
decreasing for occupied states, roughly constant for conduction band states
with 0 IE < 25 eV, and roughly linearly increasing for energies E 2 25 eV.
Equation (2.81) does not capture the behavior of quasiparticle corrections
over an extended energy range. The overall shape of the energy dependence
of quasiparticle corrections in Si is reminiscent of the behavior in j e l l i ~ m ~ ~ ~
and Ag (see Section V.26a and Ref. 216).
A truly self-consistent GWA calculation not only eliminates the depend-
ency of results on the initial starting point of the iteration but is also charge
and energy conserving in a Baym-Kadanoff ~ e n s e . ~ It' . has~ ~ five disadvan-
tages: (1) the self-consistent fundamental band gap of Si ( x 1.9 eV) exceeds
experiment (1.17eV) by as much as LDA (x0.5eV) falls below experi-
ment;230 (2) the band width of simple metals is larger than independent-
particle results and contradicts experiment; 2 3 0 * 2 3 (3) plasmon-pole
19232
229 K. W.-K. Shung, B. E. Sernelius, and G. D. Mahan, Phys. Rev. B36,4499 (1987).
230 W. D. Schone and A. G. Eguiluz, Phys. Rev. Lett. 81, 1662 (1998).
2 3 1 B. Holm and U . von Barth, Phys. Rev. B57, 2108 (1998).
232 E. L. Shirley, Phys. Rev. B54, 7758 (1996).
2 3 3 A description of (multiple) satellites that improves upon the GWA can be obtained by
cumulant expansions (Ref. 77) or by a T-matrix approach (Ref. 71). For a review. see Ref. 14.
234 A. Schindlmayr, T. J. Pollehn, and R. W. Godby, Phys. Rev. B58, 12684 (1998).
2 3 5 A. G. Eguiluz and W.-D. Schone, Mol. Phys. 94, 87 (1998).
74 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
236 H. J. de Groot, R. T. M. Ummels, P. A. Bobbert, and W. van Haeringen, Phys. Rev. 854,
2374 (1996).
237 U. von Barth and B. Holm, Phys. Rev. 854,8411 (1996).
A. G. Eguiluz, private communication.
239 B. Holm and U. von Barth, private communication.
240 G. D. Mahan and B. E.Sernelius, Phys. Rev. Left. 62,2718 (1989).
241 K. W.-K. Shung and G. D. Mahan, Phyx Rev. Left. 57, 1076 (1986).
75
FIG. 17. The quasiparticle weight Z , for jellium at rs = 4 determined by a fully self-
consistent GWA calculation (solid line), a partially self-consistent GW’A calculation (dashed
line), and a standard G’W’A calculation (dotted line). Self-consistency leads to a systematic
increase in quasiparticle weight and to a loss of structure due to the suppression of plasmon
satellites in the fully self-consistent result. (From Ref. 231.)
242 For a discussion of the experimental value of 12.5 eV, see Section III.18a.
243 Y. Levin, C.D. Wu, and Y . Bar-Yam, Comp. Mat. Sci. 3, 505 (1995).
244 B. Farid, R. W. Godby, and R. J. Needs, in 20th International Conference on the Physics
of Semiconductors, eds. E. M . Anastassakis and J. D. Joannopoulos, vol. 3, World Scientific,
Singapore (1990), 1759.
245 V. Galitskii and A. Migdal, Zh. Eksp. n o r . Fiz. 34, 139 (1958) [Sox Phys. JETP 7, 96
(1958)l.
246 M.M. Rieger and R. W. Godby, Phys. Rev. 858, 1343 (1998).
78 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
4.2-
Y -
-0.4
e
ur' 4.6 -
-1.4 - ' I
0.0 02 0.4 0.6 0.8 1.0 12
WkF
FIG. 19. Quasiparticle dispersion Ek for a fully self-consistent GWA calculation (solid line),
a partially self-consistent GW'A calculation (dashed line), and a standard G'W'A calculation
(dotted line), and for noninteracting electrons (dashed-dotted line). Only the standard G'W'A
calculation leads to band-width narrowing in comparison to the free-electron result. However,
this observed band-width narrowing is too small to explain experimentally observed narrow
band widths in simple metals. Self-consistency increases the band width in comparison to free
electrons and hence leads to increased deviation from experiment. Data shown correspond to
r, = 4 and are taken from Ref. 231.
247 All quantities are to be interpreted as matrices with respect to plane-wave coefficients (see
Section III.15b).
QUASIPARTICLE CALCULATIONS IN SOLIDS 79
14. VERTEXCORRECTIONS
As follows from Eq. (2.9), vertex corrections to the self-energy describe the
linear response of the self-energy to a change in the total electric potential
of the system. Vertex corrections in dielectric screening describe exchange
and correlation effects between an electron and other electrons in the
screening density. For example, if screening is provided by a spin-up
electron, other spin-up electrons cannot get too close and are less likely to
help with the screening." As a consequence, the screening is weakened and
the interaction is strengthened. Such short-ranged vertex corrections im-
prove the description of quasiparticles and of low-energy satellites. Other,
long-ranged, vertex corrections improve the description of the plasmon
satellites, as discussed in detail in Refs. 14 and 249.
Vertex corrections increase correlation functions such as the density-
density response function, whereas self-energy insertions in the Green
function, that is, self-consistency corrections, reduce correlation functions.
DuBois250*25'was the first to notice that, within the context of the GWA,
vertex-correction diagrams and self-energy diagrams cancel each other to a
large degree. Because of this cancellation, discrepancies between experiment
and self-consistent GWA calculations, such as those described in the
previous section, are to be expected. In principle, a consistent treatment of
vertex corrections and self-consistency is required for a quantitative descrip-
tion of experiment. M a h a n " ~suggested
~ ~ ~ on the basis of the work of Baym
and Kadanoff and Ward,253 that a consistent procedure to include
62963
248 Special care must be taken to account for core charges that are missing in the pseudopoten-
tial approach of Ref. 246.
249 C. Verdozzi, R. W. Godby, and S. Holloway, Phys. Rev. Lett. 74, 2327 (1995).
2 5 4 This assumes a bare interaction between the electrons. Screened interactions have propa-
gators in internal lines, which lead to additional vertex corrections (Ref. 10).
2 5 5 For a discussion of vertex corrections see also B. Farid, Phil. Mag. B76, 145 (1997).
2 5 6 M. Hindgren and G O . Almbladh, Phys. Rev. B56, 12832 (1997).
’” J. E. Northrup, M. S. Hybertsen, and S. G. Louie, Phys. Rev. Lett. 59, 819 (1987).
”* M. P. Surh, J. E. Northrup, and S. G. Louie, Phys. Rev. B38, 5976 (1988).
’” A. Schindlmayrand R. W. Godby, Phys. Rev. Lett. 80, 1702 (1998).
QUASIPARTICLE CALCULATIONS IN SOLIDS 81
x P+
')(6, 7; 3)d(4, 5, 6, 71, (2.84)
Table 12 shows that these vertex corrections included only in the screening
reduce band gaps by 0.1 to 0.2 eV and valence band widths by about 0.5 eV
in Si. Subsequent inclusion of vertex corrections in the self-energy causes a
band-gap increase that cancels the effect of the vertex corrections in the
screening. Note the qualitative difference between this band-gap increase"
and the band-gap decrease obtained by including dynamical vertex correc-
tions in X,119*262
as discussed in the paragraph Vertex Corrections to Second
261 We take the corrected value of Ref. 119 rather than the original value of 0.12 eV quoted
in Ref. 262.
262 P. A. Bobbert and W. van Haeringen, Phys. Rev. B49, 10326 (1994).
263 Note that because of the use of the LDA, the diagrammatic structure of these vertex
corrections is not well defined.
QUASIPARTICLE CALCULATIONS IN SOLIDS 83
-1.0
0 1 2 5 4 5 6
rs
FIG. 20. Band narrowing relative to the independent-particle band width versus electron
density parameter rr for the homogeneous electron gas and several approaches to the
determination of the self-energy. A positive value corresponds to an actual narrowing of the
bands. The two dashed lines correspond to a standard (RPA) GWA calculation (in which
vertex corrections are neglected in the dielectric function E and the self-energy C) and a GWA
calculation in which vertex corrections are included only in E (I'= 1). The solid line
corresponds to a calculation that includes vertex corrections in both E and Z using a
Vashista-Singwi many-body local-field factor.264This curve is universal in the sense that it is
hardly a5ected by a change of the many-body local-field factor. The r = 1 calculation leads to
band narrowing that is about twice as large as the RPA band narrowing for metallic densities
(rs z 2 - 6) and compares favorably with experiments for simple metals. Adding consistent
vertex corrections to the self-energy cancels the effect of vertex corrections in the dielectric
screening, giving results in close agreement with RPA. (From Ref. 240.)
GWA HIGHERORDER
EXACT
EF z(o),Z(k,) W EF EF
(2.88)
269 The local-field factor f(q) is obtained by fitting the dielectric constant to quantum
Monte-Carlo data, Ref. 270. A similar approximation has been used by Hindgren and
Almbladh, Ref. 256, who showed it to be very reliable.
”* C. Bowen, G. Sugiyama, and B. J. Alder, P h p . Rev. SSO, 14838 (1994).
This approximation affects the self-energy corrections to the lowest conduction and highest
valence band by less than 20 meV, as shown in Table 21. Also, de Groot et al. (Ref. 236) studied
the lowest-order vertex and self-consistency corrections to the RPA polarizability and the
GWA self-energy for a 1D semiconducting wire. We do not discuss that model system but
concentrate on the results for Si.
272 W. Kohn, Phys. Rev. 110,857 (1958).
+ 0' i
+ Qsc4 SCI SC2 SC3 SC4
+o V +% 2
xU
FIG.21. Left panel: RPA potarizability and corrections to first order in the screened interaction W SCl-SC4 denote corrections due r
to selfenergy insertions in the Green function, that is, due to lowest-order self-consistency effects, V corresponds to a ladder diagram
that is the lowest-order vertex correction. The solid directed line denotes the LDA Green function. The cross denotes - V ,' and the
wavy line denotes the RPAdynamically-screened interaction. Right panel GWA self-energy plus lowest-order self-consistency
(SC1-SC4)and vertex (V)corrections. The diagrams SC3 and SC4 are selfenergy corrections due to lowest-order corrections to the
valence density. The wavy line denotes the dynamically screened interaction, which can be evaluated either in RPA or in
+ +
RPA V SC.The notation is otherwise identical to the one used in the left panel. (From Ref. 119.)
88 WILFRIED G. AULBUR, LARS JQNSSON, AND JOHN W. WILKINS
0.03
0.01
4 - 0 4
-0.-
0.0 1.0 2 0 3
.0 4.0 6.0 6.0
4-07
0.0 4.0
a
a.0 e.0 at.0 4.0 6.0
1- [Zrua] 1- [2rr/orl
FIG. 22. Cancellation between vertex (V) and self-consistency (SC) corrections to the
diagonal elements of the RPA static polarizability PGc(k;w = 0) (in Rydberg am) for silicon
and diamond. Lowest-order vertex and self-consistency corrections as considered in Ref. 119
largely cancel, leading to an overall enhancement of the RPA polarizability by < 15%. The
constant contributions of the vertex and self-consistency corrections to the head element of the
RPA polarizability matrix for k + 0 cancel, which ensures the correct screening behavior in
semiconductors. (From Ref. 119.)
Si DIAMOND
METHOD
(SELF-ENERGY/SCREENING) r X L r
LDA 2.5 3.4 2.6 5.5
GWA/RPA 3.3 4.2 3.4 1.6
GWA/RPA+V+SC 3.4 4.3 3.4 1.8
GWA + V + SC/RPA 3.6 4.5 3.1 8.1
GWA+V+ SC/RPA+V+SC 3.1 4.6 3.8 8.4
EXPT.”.~ 3.4 4.3 3.5 1.3
al.’ l 9 suggest that the cancellation between lowest-order vertex and self-
consistency contributions is far from complete if one determines these
corrections to the GWA self-energy using a noninteracting LDA Green
function. In fact, the V contribution cancels the SC contribution to the
self-energy by only 35% (45%) in the case of Si and only - 10% (15%) in
+ +
the case of diamond for RPA V SC (RPA) screening. In the latter case,
even the term “cancellation” is therefore inappropriate. ’’
111. GWA Calculations: Numerical Considerations
15. DIFFWENT
IMPLEMENTATIONS
OF THE GWA
a. Reciprocal-Space Approach
Early calculation^^.^ in the dynamically screened potential approximation
determined the expectation value of the self-energy operator using a plane-
wave basis set in reciprocal and frequency space, since this allows both a
straightforward evaluation of matrix elements that occur in the self-energy
and a systematiccontrol over convergence. In addition, a GWA calculation in
reciprocal and frequency space parallels the experimentalsituation in (inverse)
photoemission, which determines the band structure of a solid as a function of
the reciprocal wave vector k and of the frequency w of the quasiparticle.
a. 1. PLANE WAVES. Pseudopotentialsin conjunction with a plane-wave basis
set are widely used in computationalcondensed matter theory because of their
ease of use and systematic convergence proper tie^."^ Mostly, plane-wave-
based GWA calculations are applied to sp-bonded bulk solids and to their
interfaces, surfaces, defects, and clusters, as discussed in Sections IV and V.
The two main disadvantages of plane-wave basis sets are that (1) the
number of plane waves Npw increases with the system volume V and the
energy cut-off E,,, needed for a converged description as N,, x (1/21)VE:i?
(in Hartree atomic units); and (2) plane waves have no direct physical
interpretation in contrast to local-orbital basis sets discussed below. Recip-
rocal-space GWA calculations scale as N $ , (see below), which makes plane-
wave-based GWA calculations prohibitively expensive for large- V systems
such as complicated defect structures and for large-E,,, systems such as d
and f electron materials.
Detailed Formulas. A Bloch wave function of wave vector k and band n
can be expressed in terms of its Fourier components cnk(G),where G is a
reciprocal lattice vector, as
1
@nk(r) =- 1 c,,(G) exp(i(k + G) * r). (3.1)
J V G
(3.2)
c
1 occ+unocc BZ
(4k I W ) I L k) = -
v n
1 2 ME&,
q GG'
q)CM"d(kk, q>l*
exp(iw6) WGG~(q9 0)
(-27Ci) E +0- - i6Sgn(P - E,k-q)
Enk- q
do.
(3.4)
Here, the matrix elements M;.(k, q) generalize Eq. (3.2) to arbitrary states
n and m.
The remaining frequency integration is usually performed analytically
using the plasmon-pole models presented in Section II.8b rather than
numerically as in Refs. 102, 103, 185, and 276. The plasmon-pole model
(PPM) is accurate to within a few tens of meV for states close to the Fermi
level and to within a few tenths of an eV for low-lying valence states, as is
demonstrated in Table 21. Splitting the self-energy into a bare exchange part
C x and an energy-dependent correlation contribution C c ( E ) , as detailed in
276 H. N. Rojas, R. W. Godby, and R. J. Needs, Phys. Rev. Left. 74, 1827 (1995).
92 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
(3.5)
(3.6)
where the matrix elements are defined as277
P;m(k, 4) = c CM&rn(k,:q)I*@,(G),
G
(3.7)
and ci = 27tzpq0,(q) for the von der Linden-Horsch model and ct = 1 for the
Engel-Farid plasmon-pole model.
The N & scaling of the reciprocal-space GWA algorithm follows from
Eqs. (3.2), (3.6), and (3.7). The construction of the matrix elements in Eq.
(3.2) scales as the number of plane waves used to describe the LDA wave
functions. The matrix elements in Eq. (3.7) are determined in N , operations,
where N , is the number of plane waves used to describe the plasmon-pole
eigenvectors. The self-energy itself (Eq. (3.6)) must be determined via a sum
over LDA bands and a sum over plasmon-pole bands. The number of LDA
and plasmon-pole bands as well as N , and N p , increase linearly with
system size and are all of the same order. Hence, the overall algorithm scales
as the size of the system or the number of plane waves to the fourth power.
An approximate evaluation of the ~elf-energy"~via Taylor expansions
around a given set of energies reduces the GWA scaling to N;w and allows
larger GWA calculations based on plane waves. Other appro ache^^^^.^^^
permit the determination of the density-density response function with an
O ( N ; , ) effort, but have not been extended to the calculation of the
self-energy.
a.2. LOCAL-ORBITAL BASISSETS. The number of basis functions of local-
orbital basis sets,' s which is needed to describe bulk semiconductors, is
smaller by about one order of magnitude than the corresponding number
of plane waves.281 Although the overall scaling of the reciprocal-space
277 The scaled plasmon-pole eigenvectors for the von der Linden-Horsch and the Engel-Farid
plasmon-pole models are defined in Eqs. (2.52) and (2.61), respectively.
2 7 8 L. Reining, G. Onida, and R. W. Godby, Phys. Rev. B56,R4301 (1997).
''' X. Blase, A. Rubio, S. G. Louie, and M. L. Cohen, Phys. Rev. BS2, R2225 (1995).
A. A. Quong and A. G. Eguiluz, Phys. Rev. Lett. 70, 3955 (1993).
The description of bulk Si requires, for example, 40-60 Gaussian orbitals compared to
about 450 plane waves (Ref. 114).
QUASIPARTICLE CALCULATIONS IN SOLIDS 93
GWA algorithm still scales as the number of basis functions to the fourth
power, the strong reduction of the number of basis functions results in
significant savings in computation time and allows the study of materials
that are not accessible in plane-wave-based approaches such as N i 0 5 5 or
Si,H, clusters.’” Unfortunately, the matrix element construction is com-
putationally more intense in a local-orbital than in a plane-wave basis, which
partially reduces gains due to the smaller number of basis function^."^
Linearized Augmented Plane Waves (LAP W ) . LAPW is an all-electron
method that has been applied to GWA calculations in Si’85 and Ni6’ In Si, 45
basis functions per Si atom are needed, which corresponds to a reduction by a
factor of five compared to plane-wave-based calculations. Although LAPW
calculations allow systematic convergence, they do not seem to offer significant
computational savings compared to plane-wave-based calculations, which
may have inhibited a widespread use of LAPW for GWA calculations.’86
Linearized Mufin-Tin Orbitals (LMTO). LMTO is an all-electron
method that has been used, for instance, for GWA calculations of transition-
metal oxide5’ and the Compton profile of alkali It is exten-
sively reviewed in Ref. 14. GWA calculations for transition-metal oxides are
prohibitively expensive for plane-wave-based methods, since an accurate
description of the transition-metal d and oxygen 2 p electrons requires a very
large energy cut-off and several thousand plane waves. In LMTO, the
description of transition-metal oxides uses only 50- 100 basis functions per
atom.55
Aryasetiawan, Gunnarsson, and collaborator^'^ used LMTO in the
atomic sphere approximation and included so-called combined correc-
t i o n ~ ’ ’ ~only in the energies, not in the wave f ~ n c t i o n s . ’This
~ ~ approach
has two disadvantages. First, omission of the combined-correction term
leads to discontinuous wave functions in the interstitial region and in the
overlapping region of the muffin-tin sphere^.*^^*^^' This results in errors in
282 M. Rohlfing and S. G. Louie, Phys. Rev. Lett. 80, 3320 (1998).
283 In the case of bulk Si, Gaussian-orbital-based calculations take as much time as
plane-wave-based calculations (Ref. 284). For the Si(OO1) (2 x 1) reconstructed surface, a
speed-up of a factor of five compared to plane waves is achieved (Ref. 284).
284 M. Rohlfing, P. Kriiger, and J. Pollmann, Phys. Rev. B52, 1905 (1995).
reconstructed ~ u r f a c e ; ” (2)
~ an explicit treatment of the Cd n = 4 core and
semicore electron^;^' and (3) the inclusion of excitonic effects in Si,H,
clusters.’
The main disadvantage of Gaussian orbitals is that while Gaussian basis
sets can systematically converge, constructing them is difficult. Table 15
demonstrates the problems in establishing the energies of low-lying conduc-
tion states of small-gap and medium-gap semiconductors. For bulk
semiconductors, 20 Gaussians per atom are suggested to be adequate for
convergence of quasiparticle energies of low-lying conduction and valence
states to within 0.1eV.”4 While this is true for Si”4 and for large-gap
materials with strongly localized densities such as diamond, Sic, and
GaN,295good convergence for more “metallic,” small- and medium-gap
semiconductors such as Ge and GaAs requires at least 30 Gaussians per
atom. Tables 15 and 22 show that in the case of Ge (1) the lowest
conduction-band energies at r, X , and L for a well-~onverged’~~ plane wave
PW 60 GO 40 GO
at least 0.5 eV and are on the order of the self-energy shifts observed upon
inclusion of 4p and 4s electrons as valence electron^.^'
Good quasiparticle band gaps are obtained by treating the core and
semicore electrons in II-VI materials via nonlinear core correction^.^'
However, the success of this approach is based on a fortuitous cancellation
of errors due to the neglect of wave-function relaxation and hybridization
and to the neglect of dynamic screening by semicore electrons and exchange
interaction with core and semicore electrons.
E,,,(W a d 4
b. Real-Spacellmaginary-Time Approach
The basic idea of the space-time approach is to choose the representation
(e.g., reciprocal space and frequency or real space and time) that minimizes
the computations necessary to evaluate the basic GWA quantities: G , E, W
respectively.
305 The energy cut-off used for the core case is 6.75 Hartree and 9.5 Hartree for the valence
case. Neglect of scalar relativistic effects increases the gap by about 0.3 eV to 1.93 eV (core)
and 0.67 eV (valence) (Ref. 301).
98 WILFRIED G . AULBUR, LARS JONSSON, A N D JOHN W. WILKINS
which eliminates two convolutions in reciprocal space and one in the fre-
quency domain, and leads to an algorithm whose dominant parts scale as
O(N$,) and O ( N , ) . Similarly, the dielectric matrix and the screened
interaction require convolutions in real space (Eqs. (2.20) and (2.14)) but
multiplications in reciprocal space (Eqs. (2.23) and (2.16)). A representation
change from reciprocal to real or frequency to time space using Fast Fourier
Transforms (FFTs) and vice versa scales as N,, log(N,,) or N , log(N,)
and is computationally very efficient. This allows the efficient determination
of C, E, and W as products rather than as convolutions.
The evaluation of operators in real space and the extensive use of FFTs
require operators that are short-ranged in real space and quantities that can
be represented by an equidistant numerical grid. Relevant quantities in the
GWA, namely, G, C, and Po, are proportional to Ir - r’l-’ for Ir - r’I -, co
and indeed short-ranged (compare for example Figs. 9 and 11). The screened
interaction decays proportionally to Ir - r’1-l for Ir - r’I -+ co, but this
long-ranged tail can be taken into account e~plicitly,”~ allowing the use of
a finite cut-off for Ir - r’I in numerical work. Although G, X, and W have
branch cuts on the real-frequency axis, they can be analytically continued
to the imaginary-frequency (and imaginary-time) axis along which they are
much smoother and can be accurately discretized by an equidistant Fourier
grid, as shown in Fig. 23.
The central task of the real-space/imaginary-time algorithm is the same
as that of the reciprocal-space/frequency-space algorithm: determination of
G -, Po -, E W -, X and subsequent calculation of quasiparticle shifts and,
if desired, the solution of Dyson’s equation. Here, we will describe in detail
the construction of G and Po and the Fourier transformation of Po, since
their construction is used to illustrate the parallel real-space/imaginary-time
algorithm in Section 111.17. For further details and convergence studies we
refer the reader to the work by Rieger et
2
h
2
v
L3 o
.-
0
-
9
-2
20
-z
A
3
10
0
-
9 0
-10
307 The index i has been replaced by the band index n and the reciprocal vector k in the first
Brillouin zone. Spin is disregarded. The Fermi energy, which is at zero in this subsection, lies
in the band gap; that is, if' < 0 and E;Y z 0.
100 WILFRIED G . AULBUR, LARS JONSSON, AND JOHN W. WILKINS
unit cell while r’ denotes a point in the “interaction cell” outside of which
Go is set to zero. Denote as unk(r) the periodic part of the Bloch wave
function Onk(r),and define for a given r a vector r” in the real-space unit cell
and a real-space lattice vector L, such that308
r’ = r + r” + L = r + x; x = r” + L. (3.10)
Then
occ(unocc)
Go(r, r + x; ir) = i sgn(.s) C u,,k(r)t&(r + r”) exp( - ik .x) exp(snkc).
nk
(3.1 1)
c c c unk(S-’r)z&(S-’(r + r”))
occ(unocc) IBZ
Go@, r + x; ir) = isgn(T)
n k S
Assuming an interaction cell that is of constant size for a given material, the
determination of the independent-particle propagator scales as N , x Nband
x N,, where N , is the number of frequencies or times considered; Nband, the
number of bands in the summation of the above equation; and N,, the
number of r-points in the real-space unit cell. Since Nband and N , increase
linearly with the size of the system, the overall scaling of the method is
proportional to the system size squared and hence of order O(N2,,).
Convergence considerations. Convergence parameters important for the
real-space approach are (1) the number of bands in Eq. (3.12), (2) the real-
space grid spacing Ar and the size of the interaction cell, and (3) the time
spacing AT and the maximum sampling time rmax.Convergence to within
20meV for Si requires 145 bands, a Ar = 0.32 a.u., an interaction cell
radius309 Rmax= 18 a.u., a time spacing AT = 0.15 a.u., and T~~~ z 20 a.u.
308 The r and r‘ meshes are not offset, which consequently requires special treatment of the
Coulomb divergency; see Ref. 103.
309 Assume a spherical interaction cell for simplicity.
QUASIPARTICLE CALCULATIONS IN SOLIDS 101
Po(k, r, r’) =
R
exp( - ik. (r - r’ - R))Po(r, r’ + R). (3.14)
The sum over lattice vectors R is limited to vectors in the interaction cell of
r (ZC(r)). The quantity Po(k, r, r’) is periodic in r and r’ separately, and its
Fourier transform can be determined by two successive 3-dimensional
F F T s . ~ The Fourier transform of the independent-particle polarizability is
given as (V, is the unit cell volume)314
3 ’ 0 Materials such as GaN with larger density variations than Si will require an even smaller
Ar to obtain similar convergence. The range of the interaction cell should not increase,
although the dielectric constant of GaN is less than half of the dielectric constant of Si since
the short-range Green function, not the long-range screened interaction, determines the range.
311 L. Steinbeck, private communication. Note that Steinbeck, Rieger, and Godby recently
DETAILS
16. PLANEWAVES:NUMERICAL
This section has a threefold purpose: (1) Detailed information on the use of
symmetry and the integration of the Coulomb divergence in actual GWA
implementations (see subsections a and b) is presented. (2) The effects of
different numerical parameters on quasiparticle energies in plane-wave-
based GWA calculations are specified (see subsections c and d). (3) Details
about a particular GWA implementation are provided (see subsections e
and f).
a. Use of Symmetry
The special k-points technique3' reduces by symmetry the summation over
a uniform mesh of k-points needed to integrate the independent-particle
polarizability (Eq. (3.3)) and the self-energy (Eqs. (3.5) and (3.6)) over the
Brillouin zone to the summation over a smaller set of special k-points. With
S as an element of the little group L, of the external wave vector,316 that is,
Sq = q, and T~ as its nonsymmorphic translation, the matrix elements of Eq.
With BZq as the irreducible part of the Brillouin zone defined by the little
group of q, the static independent-particle polarizability (Eq. (3.3)) is given
by8'
(3.19)
where ct = 2nzPqo,(q) for the von der Linden-Horsch and ct = 1 for the
Engel-Farid plasmon-pole model.
diverges as l/lqlz for q -,0 and whose integral over the Brillouin zone is
known analytically. Rewrite the expectation value of C x as9'
(3.20)
(3.21)
Choosing either Eq. (3.21) or (3.22) for f(q) leads to deviations in the
valence and low-lying conduction quasiparticle energies of Si and Ge at
r, X , and L of less than 30meV. Functions f(q) appropriate for sym-
metries other than fcc have been suggested by Wenzien, Cappellini, and
Be~hstedt.~~
c. Convergence
Quasiparticle band structures can vary by a few hundreds to a fev tens of
meV because of differences in (1) the number of q-points used in the
integration of Eqs. (3.5) and (3.6); (2) the energy cut-off used in the
expansion of the LDA wave functions; (3) the number of single-particle and
plasmon-pole bands kept in the summations in Eq. (3.6); (4) the number of
G-vectors kept in the summation in Eq. (3.7); and ( 5 ) the degree of
convergence of the plasmon-pole band structure. To determine reliable
320 Similar conclusions hold for the convergence properties of the plasmon-pole parameters,
point (3,which are not presented here.
' 2 1 C. M. Bender and S. A. Orszag, Advanced Mathematical Methods for Scientists and
-1
Si
-2
K risV=o.o
c
.-0
.-iii -3
5
al
U
>
.-c
al
-m
c
E
0
0 -4
-
-5
-6
20 40 so
number of k-points in IBZ
Si
-5
number of reciprocal lattice vectors in sum
FIG. 25. Convergence of valence and conduction band energies at the r point in Si as a
function of the number of G-vectors in the internal sum over G-vectors in Eq. (3.7).
Convergence is smooth, and about 100 G-vectors are sufficient to converge the energies to less
than 1%. The top of the valence band is set to zero in each case. The converged value for each
energy is approximated by the energy calculated with 283 G-vectors corresponding to a cut-off
in reciprocal space of 3.9 Hartree. All other numerical parameters are identical to those in
Table 18.
322 Note that fitting schemes for convergence plots are not unique, as discussed, for example,
in S. Wei, D. C. Allan, and J. W. Wilkins, Phys. Rev. B46, 12411 (1992) (see Fig. 3 and
discussion thereof).
QUASIPARTICLE CALCULATIONS IN SOLIDS 107
-1
-2
c
.-0
.-z -3
a
-
V
a
.->
m
-
g!
c
0
-4
-
rn
0
-5
-6
40 80 120
number of plasmon pole bands
-1
-2
s
.-
iii -3
.-
U
a,
a,
.-
-m
c
p!
i
c
- -4
-5
Y
R
"25 75 125 1 J
number of bands in sum over states
10 28 28 10 10
8.5 10 10 25 25
10 10 10 10 10
8.5 10 10 25 25
3.4 3.4 3.1 4.5 4.5
146 146 146 146 196
10 28 10 10 10
8.5 10 10 25 25
3.0 3.0 3.0 4.5 4.5
60 60 60 100 80
60 100 100 150 150
110 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
TABLE19. COMPARISON OF LDA AND GWA ENERGIES AND THE QUASIPARTICLE SHIFTAT THE r
POINT IN si FOR A TROULLIER-MARTINS PSEUDOPOTENTIAL325 (TM) AND A GENERALIZED
NORM-CONSERVING326(GNC) PSEUDOPOTENTIAL.DEVIATIONS IN THE GWA BANDSTRUCTURE
ARE LARGEST (0.18eV) FOR THE r2,STATE,WHICH IS THE MOST DIFFICULTLOW-LYING
CONDUCTION-BAND STATETO CONVERGE. THECHOICEOF THE PSEUDOPOTENTIAL AFFECTSTHE
QUASIPARTICLE SHIFTSSIGNIFICANTLY LESSTHAN THE ABSOLUTEENERGIES (60-meV MAXIMUM
DEVIATION).ALL NUMERICAL PARAMETERS ARE GIVENIN TABLE 18 EXCEPTFOR G,,,, WHICH
EQUALS 3.1 a.u. r~ THE PRESENT STUDY.ALL ENERGIES ARE IN eV.
PSP
LH EF LH EF LH EF
e. Object Orientation
The common computing paradigm in most scientific applications is pro-
cedure-oriented programming. The central task of a software engineer using
the procedure-oriented paradigm is to identify the procedures, that is, the
data manipulations, required to solve a problem and the corresponding best
algorithms. The structure element of a procedure-oriented code is the
procedure or subroutine with its input and output data. This way of
programming leads in general to many global and few local data. The global
data are either passed explicitly from subroutine to subroutine or-as in
Fortran -via COMMON blocks. This violation of the software engineering
principle of data can lead to severe problems once a complex
piece of software has to be debugged, modified, or extended beyond the
realm of its initial use.328
3 2 7 B. Stroustrup, The C++ Programming Languuge, 2nd ed., Addison-Wesley, New York
(1991) 14ff.
3 2 8 For example, it took a graduate student at the University of Illinois, Urbana-Champaign,
two years to include d basis functions into an O ( N ) tight-binding code. The 20,000 lines of O ( N )
code were originally implemented for s and p functions only (D. Drabold, private communica-
tion).
112 WILFRIED G. AULBUR, LARS JONSSON, A N D JOHN W. WILKINS
f. Eficiency
Since C + + is less well established in the scientific community than is
Fortran, an interface with existing Fortran codes such as low-level BLAS
subroutines is a must. Efficient code results, for example, from interpreting
wave functions as matrices rather than as linear arrays and by performing
convolutions as in Eqs. (3.2) and (3.7) via BLAS3 matrix multiplications. As
a consequence, major portions of the code run at about 270 MFlops on a
CRAY-YMP8 (peak performance 333 MFlops). The overall code perform-
ance is 170 MFlops and the code uses memory very efficiently according to
the standard CRAY performance tools. About 20% of the program is
documentation to make it easy to maintain and use.
A good test of the reliability of a code is a “standardized” test calculation
and comparison with an independently written code. To eliminate subtle
differences in pseudopotentials, parametrization of exchange-correlation
potentials, and the like, which plague input to self-energy calculations using
different LDA codes, one can generate wave functions and energies using an
empirical Si p ~ e u d o p o t e n t i a instead.
l ~ ~ ~ The corresponding wave functions
and energies are, in addition, easy to generate. Results based on empirical
pseudopotentials between two independent codes from Refs. 42 and 302
agree to within nine significant digits.
GWA CALCULATIONS
17. PARALLEL
a. Reciprocal-Space Approach
The basic idea for a parallel determination of the independent-particle
polarizability (Eq. (3.17)) and the self-energy (Eqs. (3.18) and (3.19)) is to
distribute the conduction bands over n p e s processing elements (PEs)
during the course of the calculation. The principle will be explained for the
case of the independent-particle polarizabiliiy; the case of the self-energy can
be handled analogously. Let PEO denote the PE that does all the 1 / 0
between the file system and the parallel machine; Nvblcbthe number of
valence/conduction bands; N, the number of G-vectors; and N , , the
number of plane waves.
For each k-vector PEO reads the energies and the G-vectors. Then it reads
the first N,,/npes conduction bands (assume mod(Ncb/npes)= 0 for
simplicity) and sends the information about k-point, energies, G-vectors,
and the first block of conduction bands to PE1. Subsequently, PEO reads
the second block of conduction bands and sends it together with the k-point,
energies, and G-vectors to PE2, and so on. Once all conduction bands are
distributed, PEO reads the valence-band information for a point k - q in the
BZ, and broadcasts that information to all PEs. Then all PEs -including
PEO -calculate the following expression:
(3.24)
mp
...over
....q-points
...............
loop
.. ..over
.....k-points
.... .
.. ..
...
'
'
I
I read CBs i*ncb/npes.
' I
' I
' II
calculate
single-node accumulation
of partial sums a t PEO to
calculate Plw(k.q)
3. Load balance. The load balance is excellent. Not counting the 1/0 over-
head, all PEs have exactly the same workload if mod(Ncb,npes)=O.
If mod(N,,,npes) # 0, then the last n p e s - 1 PEs calculate the
116 WILFRIED G . AULBUR, LARS JONSSON, AND JOHN W. WILKINS
The actual sending of the data and the communication primitive used in
this process are hidden from the user. This feature makes a change
from PVM to, for example, MPI easy, since the change is done in only
one function. Switching between PVM and MPI can be conveniently
achieved via preprocessor directives.
332 Fully optimized PVM executables are no longer being created for most new platforms.
333 W. G. Aulbur and J. W. Wilkins, Bull. Am. Phys. Soc. 40,No. 1,251 (1995).
QUASIPARTICLE CALCULATIONS IN SOLIDS 117
334 This parallelization scheme was later adapted for the parallelization of the real-space/
imaginary-time code; see below.
335 This algorithm was developed by W. G. Aulbur in collaboration with L. Steinbeck, M. M.
Rieger, and R. W. Godby.
118 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
0
calculate C (r. r’. tit)
0
wrlte P 11. r’, +ato dlsk
for each PEI
do 3D FFT to obtaln
4 , Polk. r. G’. 4
1
, I,
redlstrlbute using MPILGATHER
do 3D FFT to obtain
P o (k. G. G’. 10
FIG. 29. Schematic diagram of the communication structure of the parallel implementation
of the real-space GWA code. We list all communication and calculational steps needed to
determine the independent-particle polarizability Po in real space and imaginary time, and to
transform it to reciprocal space. The distribution of the unlr(r)and of Po(r,r‘; ir) over all PEs is
described in the text. The redistribution step using MPI -GATHER is detailed in Fig. 30. The
Fourier transformation from imaginary time to imaginary frequency and assembly of the
screened interaction via parallel matrix operations is also described in the text.
interaction cell according to Eq. (3.12). For example, if the number nviuc
of r-points in the irreducible real-space unit cell is 55 and the number of
processors is 4, then PEO calculates Go for the first 13 r-vectors, PE1 for the
following 14, and so on.
To calculate Go at a point ri, each PE needs u,,k(ri) for all bands and all
k-points, as well as U,*k(r’) for all r’, all bands, and all k-points. Let rank
denote the number of a PE. For each i and each node, PEO reads the
corresponding wave function information unk(ri)in real space and sends it
to PE rank.At the end PEO reads the information that it needs for its own
QUASIPARTICLE CALCULATIONS I N SOLIDS 119
of G-vectors
MPI-GATHER
PE3
FIG. 30. Communication structure for the redistribution of the independent-particle polar-
izability over PEs. After the first 3D FFT each PE has the independent-particle polariz-
ability Po(k, r, G’,ir) for one particular k-point, a subset of r-vectors in the irreducible
real-space unit cell as well as their images under symmetry operations of the space group, all
G’-vectors, and one particular time i t As indicated by the arrows in the figure, MPI -GATHER
is used to assemble Po for all r-vectors and a block of G’-vectors on each PE (first block on
PEO, second on PEI, and so on). After the MPI _GATHER step described in the figure, Po is
reordered according to its r argument to have standard FFT ordering, and the second 3D FFT
is performed on each node. The different length of the r-blocks in the picture corresponds to a
different number of r-points on each PE. The total number of G‘-vectors kept after performing
the first FFT is nG and n p e s is the number of PEs used. In the above example
mod(n,.,npes) = 2.
336 For GaAs, we include unpublished data provided by F. Aryasetiawan (Ref. 290). For Ge,
GaAs, and SIC, we include unpublished 60-Gaussian orbital data by Rohlfing, Kriiger, and
Pollmann (Ref. 303).
QUASIPARTICLE CALCULATIONS IN SOLIDS 121
a. Silicon
Silicon is a prototypical semiconductor that has been studied in seven inde-
pendent GWA calculations based on (1) pseudopotentials and plane waves
(Refs. 5,6,42,102, and 103), (2) pseudopotentials and Gaussian orbitals,114and
(3) LAPW.285 In contrast to all other calculations, the pseudopotential
calculations of Rieger et d 1 0 3 and of Fleszar and HankeIo2 avoid the
plasmon-pole approximation by explicitly computing the frequency-dependent
interaction.
Comparison of both the LDA and GWA energy variations between the
different calculations for the lowest conduction-band states of Si listed in
Table 21 reveals
1. All GWA calculations for the lowest conduction band lie within 0.2 eV
of the range of experimental energies observed.
2. Between all calculations, the variations in both LDA and GWA
energies of the lowest conduction band are 0.03 eV and 0.14 eV at r,
0.05 eV and 0.30 eV at X , and 0.1 eV and 0.17 eV at L, respectively.
3. Comparison of plane-wave calculations without an update of the Green
function, disregarding354Ref. 6, gives variations for the lowest conduc-
tion-band energy of 30 meV and 40 meV at r, 50 meV and 40 meV at X ,
and 50 meV and 50 meV at L, for LDA and GWA, respectively.
4. An update of the energies in the Green function (see Section 11.13)
increases the fundamental gap by about 0.1 eV.
Four main conclusions can be drawn from the above observations:
1. LDA and GWA calculations for the lowest direct band gaps in the
prototypical semiconductor Si are accurate to within 0.1 eV for LDA
and 0.1 to 0.3 eV for GWA.
33' This point will be discussed in more detail in Sections VI.29 and 30.
338 E. L. Shirley and S. G. Louie, Pbys. Rev. Lett. 71, 133 (1993).
122 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
TABLE
21. Continued
"Ref. 339; 'Ref 340; 'Ref. 341; dRef. 342; 'Ref. 343; IRef. 344; @Ref.345; *Ref, 346; 'Ref. 347;
'Ref. 348; 'Ref. 349; 'Ref. 311; '"Ref. 350, gap at X ; "estimated from Fig. 1 of Ref. 284; "estimated
from Fig. 1 of Ref. 44; PRef. 351; 4Ref. 352.
124 WILFRIED G. AULBUR, LARS JdNSSON, AND JOHN W. WILKINS
Conclusions 1,2, and 4 are state and material dependent. Take the F2, state
in Si, which is particularly difficult to converge, as an example. Here, the
deviations between all calculations are 0.39 eV in LDA and 0.41 eV in GWA.
The deviations for well-converged plane-wave calculations are 0.1 eV in
both LDA and GWA, well outside chemical accuracy. In addition, materials
that are less “benign” than Si- that is, in which, for example, core electrons
or relativistic effects are relevant -exhibit larger variations in LDA and
GWA energies, as will be shown below. Exceptions to conclusion 4 will be
discussed in Sections IV and V.
The plasmon-pole approximation is expected to break down for quasipar-
ticle energies of the order of the Si plasmon energy, wpl = 16.7 eV (see Table
3). Comparing plane-wave calculations, with the exception of Ref. 6, for the
lowest valence-band energies at r, X,and L shows that the plasmon-pole
339 L. Ley, S. P. Kowalcyzk, R. A. Pollak, and D. A. Shirley, Phys. Rev. Lett. 29, 1088 (1972),
as presented by J. R. Chelikowsky and M. L. Cohen, Phys. Rev. 8 10, 5095 (1974).
340 W. D. Grobman and D. E. Eastman, Phys. Rev. Lett. 29, 1508 (1972).
341 M. Welkowsky and R. Braunstein, Phys. Rev. 8 5 , 497 (1972).
"' D. M. Ceperley and B. I. Alder, Phys. Rev. Lett. 45, 566 (1980).
356 U. von Barth and L. Hedin, J. Phys. C 5, 1629 (1972).
357 The difference between relative energies in GWA and GWT for the lowest direct band gaps
is less than 0.02 eV (Ref. 102), as can be seen in Table 21.
126 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
increasing energy gap and ionicity of the material and exceed, for example,
a negative correction of 1 eV in the case of LiCl even with the use of RPA
~orrelation.~~
Figure 31 compares the results of an LDA and GWA calculation with
photoemission and inverse photoemission experiments. The size of a typical
experimental resolution is 0.27 eV in energy and 0.1 8- in momentum.342
In numerical calculations the momentum is well defined, while the energy
uncertainty is estimated to be about kO.1 to 0.2eV from the above
discussions. The agreement between theory and experiment is good along
15
10
z- 5
zi
>
B
9
s
0
-m
e
>r
F
w
E -5
-10
-1 5
L A r A X
Wave vector
FIG. 31. Comparison of LDA and GWA band structures along L - r - X with photoemis-
sion and inverse photoemission experiments for Si. Agreement between theory and experiment
is within the experimental and theoretical uncertainties. Data are taken from Ref. 342 (full
diamonds), Ref. 347 (open circles), Ref. 348 (full triangle), Ref. 351 (open triangles), and Ref.
358 (open diamonds). The typical experimental resolution is 0.27 eV in energy and 0.1 k'in
momentum (compare Ref. 342). The theoretical error in energy is estimated to be about +0.1
to 0.2 eV (compare Table 21).
QUASIPARTICLE CALCULATIONS IN SOLIDS 127
the T L direction. The lowest conduction band shows the largest discrepancy
between theory and experiment and seems to lie slightly higher than the
calculation. Along the T X direction agreement between experiment and theory is
satisfactory. In part due to the large experimental momentum uncertainty,
theory and experiment still agree to within the respective uncertainties. However,
the lowest valence band as well as the first two conduction bands -the A, and
the A; bands -show less dispersion than the theoretical prediction.
358 D. Straub, L. Ley, and F. J. Himpsel, Phys. Rev. B33, 2607 (1986).
128 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
XI"
Expt.' - 9.3(2)b -3.15(20) 1.3(2)
-3.5(2)'
Aulbur4' - 8.88 - 3.02 0.70 9.50
-887 - 3.08 1.15 10.04
Rohlfing114 -8.91 - 3.02 1.03 9.54
- 9.06 - 3.16 1.74 10.19
Hybert~en~~ -9.13 - 3.22 1.23
Shirley43 0.55
1.28
(1-09)
TABLE
23. Continued
Shirley43 1.21
2.07
(1.95)
AryasetiawanZgo - 10.37 -7.01 -2.72 1.29 1.59 10.34
-10.50 -6.88 -2.45 2.88 3.38 11.50
Shirley43 0.70
1.55
(1.75)
Aryasetiawan'" -11.11 -6.84 -1.12 0.72 4.66 8.26
-11.27 -6.59 -1.01 2.14 6.51 9.51
average deviation in the LDA band structure between all three approaches
is 0.19 eV (maximum 0.41 eV for the L,, state); the average deviation in the
quasiparticle energies is 0.36 eV (maximum 0.82 eV for the X,, state); and
the average deviation in the quasiparticle shifts is 0.29 eV (maximum 0.67 eV
for the X,, state). It is not understood what causes the significant differences
in the LDA and quasiparticle band structures of these three independent
calculations.362 All three, however, give a good account of experimentally
valence band and hybridize with the s bands. In LDA, d electrons are
underbound in comparison to experiment, leading to a strong, unphysical
s-d hybridization. A proper treatment of the Ga 3d electrons requires a
self-consistent determination of the wave functions and the inclusion of
exchange with the 3s and 3p electrons, as explained in Section III.15a. A
computationally cheaper, surprisingly accurate approach to the determina-
tion of the quasiparticle band gap is the treatment of the 3d electrons via
nonlinear core corrections,205 which has been adopted in the plane-wave
calculations of Refs. 42,47, and 367. Variations in the quasiparticle energies
of the lowest-lying conduction band for the three different calculations
amount to about 0.2 to 0.7 eV and are significantly larger than the generally
quoted GWA uncertainty of f O . l eV. In part, these variations may result
from the choice of different pseudopotentials, as specified in Table 26, and
the use of a model dielectric matrix in Ref. 47 compared to RPA dielectric
matrices in Refs. 42 and 367.
19. BULK
365 S. Nakamura and G . Fasol, The Blue Laser Diode: GaN Based Light Emitters and Lasers,
Springer-Verlag,Berlin and New York (1997).
366 A. F. Wright and J. S. Nelson, Phys. Rev. B50, 2159 (1994).
M. Palummo, L. Reining, R. W. Godby, C. M. Bertoni, and N. Bornsen, Europhys. Lett.
26, 607 (1994).
134 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
TABLE26. GaN ENERGIES AT HIGH-SYMMETRY POINTS, SAME NOTATION AS IN TABLE 21. THE
DIFFERENTCALCULATIONS OF THE LDA AND GWA ENERGIES FOR BOTHTHE VALENCE-BAND
MAXIMUM AND THE CONDUCTION-BAND MINIMUM DIFFERBY 0.1 TO 0.4 eV AND BY 0.2 TO 0.7 eV,
RESPECTIVELY. TAKINGALL STATESINTO ACCOUNT LEADSTO A VARIATION OF 0.1 TO 0.8eV
FOR LDA AND 0.1 TO 1.3eV FOR GWA. ALL CALCULATIONS ARE PSEUDOPOTENTIAL-PLANE-
WAVE-BASED AND USEA FIRST-PRINCIPLES RPA DIELECTRIC MATRIX, WITH THE EXCEPTION OF
RUBIOet aL4' WHO USEDA MODELDIELECTRIC MATRIX.THE MODELDIELECTRIC MATRIXIS
USUALLY ACCURATE TO WITHIN0.1 TO 0.4 eV AND CANNOT ACCOUNT FOR ALL THE DIFFERENCES
IN THE DATA.FURTHER DEVIATIONS MAY RESULTFROM THE DIFFERENT CHOICE OF THE
NITRCGENPSEUDOPOTENTIAL. BHS STANDS FOR BACHELET-HAMANN-SCHLmR324 PSEUDO-
POTENTIALS AS IMPLEMENTED BY STUMPF, GONZE AND sCHEFFLER;370 HSC STANDS FOR
HAMANN-SCHL~ER-CHIANG PSEUDOPOTENTIALS;3 AND TM STANDS FOR TROULLIER-MARTINS
PSEUDOPOTENTIALS.32s'372 ALL ENERGIESARE IN ev.
a. Band-Gap Narrowing in Si
Two competing processes influence the band-gap modification of Si upon
n-type doping:373(1) Electrons fill up conduction bands to some new Fermi
level and effectively increase the photoexcitation gap of valence electrons. (2)
The conduction electrons form a low-density, “metallic” gas of carriers and
increase screening of the electron-electron interaction, which reduces the
band gap. A quantitative understanding of these effects on the band gap of
doped semiconductors is important since it affects the performance of
semiconductor devices.
Quasiparticle calculation^^^^*^ 75 have so far considered band-gap nar-
rowing in doped Si and find that (1) LDA is inadequate for the description
of band-gap narrowing, (2) the modification of screening due to excess
carriers dominates band-gap narrowing in the GWA calculation, and (3) the
energy dependence of the intrinsic dielectric matrix -omitted in all model
calculations (for a review of model calculations see Ref. 375) -modifies
band-gap narrowing by up to a factor of two for common dopant concen-
trations in Si. A direct comparison of GWA ~ a l c u l a t i o n s with ~~~*~~~
experiment376*377-378 is inappropriate since the self-energy calculations do
not consider electron-donor scattering.379With AVH and AVxc as changes in
the electrostatic and LDA exchange-correlation potentials upon introduc-
tion of additional electrons in the system, the LDA contribution to band-
gap narrowing can be calculated in first-order perturbation theory as
AEnk =
s @:k(r)[AVH(r) + AVxc(r)]@nk(r)dr~
3 6 8 T. Lei, T. D. Moustakas, R. J. Graham, Y. He, and S. J. Berkowitz, J. Appl. fhys. 11,4933
(4.1)
The first term describes the change in the screening of the electron-electron
interaction and turns out to be the dominant contribution to band-gap
narrowing, equaling about -0.1 eV at a carrier concentration of n =
lo2’ cm-3 and about -0.25 eV at n = lo2’ ~ m - The ~ . effect of screening
becomes stronger with increasing carrier concentrations. The second term
stems from the changed pole structure of the Green function and also
increases with increasing concentration. However, its absolute contribution
to the band-gap narrowing is negligible and amounts to less than 10meV
for concentrations smaller than cm-j.
First-principles GWA calculations identify the neglect of frequency de-
pendence of the intrinsic dielectric matrix as an unphysical assumption
common to many model calculations of carrier-induced band-gap narrow-
ing.37 5 The frequency dependence of the intrinsic dielectric matrix increases
band-gap narrowing by about 30% of the dominant GdoPedAW contribution
at n = lo2’ cmV3and by about a factor of two at n = 1021~ m - and ~ ,
should not be neglected. Other effects, such as local fields in the intrinsic
dielectric matrix or the description of the conduction-band states either as
LDA states or as plane waves, are unimportant since the main contributions
to the modification of screening arise from the long-wavelength component.
b. Transition-Metal Oxides
The large discrepancy between local spin density (LSDA) functional calcu-
lations and experiment indicates that strongly correlated transition-metal
QUASIPARTICLE CALCULATIONS IN SOLIDS 137
TABLE27. ENERGY GAPS(eV) AND MAGNETIC MOMENTS (pB) FOR TRANSITION-METAL OXIDES
OBTAINED IN ab-initio GWA (ARYASETIAWAN AND G U N N A R ~ ~ MODEL
~ N ~ ~ GWA
), (MASSIDDA
AND cOLLABORATORS50'54),A N D LOCALSPIN DENSITY (LSDA) FUNCTIONAL CALCULATIONS IN
COMPARISON TO EXPERIMENT, LDA + U CALCULATIONS, AND SELF-INTERACTION-CORRECTED
(SIC) CALCULATIONS. GWA AGREEMENT WITH EXPERIMENT IS BETTER THAN OR AT LEASTAS
GCOD AS IN LDA+U OR SIC. BECAUSE OF THE LARGEGWA CORRECTIONS TO LSDA r~ THE
TRANSITION METALS, IN THE GWA WAVEFUNCTIONS
SELF-CONSISTENCY IS REQUIREDTO GET
GOODAGREEMENTWITH EXPERIMENT. VALUES IN PARENTHESES FOR THE MAGNETIC MOMENTS
INCLUDE THE ORBITALCONTRIBUTION.(ADAFTED FROM REF. 54.)
ARYASETIAWAN MASSIDDA
ENERGY
GAPS(eV)
MnO 1.o 4.2 3.5 3.98 3.8-4.2"'
NiO 0.2 5.5 0.3 3.7 3.1 2.54 4.3'
4.0r
CaCuO, 0.0 1.4 2.1 1 9
MAGNETICMOMENTS(pB)
MnO 4.29 4.52 4.61 4.49 4.58h
(4.52) (4.49)
NiO 1.o 1.6 1.12 1.56 1.59 1.53 1.77'
(1.83) (1.80) 1.64'
1.90h
CaCuO, 0.42 0.66 0.51'
"Ref. 382; bRefs. 383 and 384; 'Ref. 385; dRef. 386; 'Ref. 387; IRef. 388; @Ref.389; hRef. 390;
'Ref. 391; 'Ref. 392; 'Ref. 393.
QUASIPARTICLE CALCULATIONS IN SOLIDS 139
molecule Hubbard-U parameters of 1.0 and 0.5 eV, respectively, are of the
same order as the measured band width of about 0.5eV of the highest
occupied (Hlu) and two lowest unoccupied (T,, and 7J multiband com-
plexes. In spite of these intermediate to strong electronic correlations, GWA
calculations for face-centered cubic C,, describe the fundamental energy
gaps quantitatively we11.338*394,395 Self-energy corrections double the LDA
fundamental gap of 1.04eV to a quasiparticle gap of 2.15 eV that compares
well with a gap of 1.85 & 0.1 eV obtained in microwave conductivity
experiments396 or of 2.3-2.7 eV deduced from direct and indirect photo-
emission experiment^.^^'^^^^.^^^ The H,-T,, peak-to-peak distance in the
experimental density of states equals 3.5-3.7 eV3979398,399 and is reproduced
reasonably well by a quasiparticle value of 3.0eV, which corrects an LDA
result of 1.6- 1.7 eV. Quasiparticle calculations for perfectly ordered, crystal-
line C , , lead to about a 30% increase in the LDA band width, as shown in
Fig. 32, resulting in GWA band widths of 0.9 eV, 0.7 eV, and 0.8 eV for H , ,
TI,, and Tlg, respectively. Since LDA overestimates the experimental band
width, further physics is needed to explain the experimental band-width
narrowing within a quasiparticle framework.
Four causes for a lack of dispersion in angle-resolved photoemission
spectra of the H , , T,,, and Tlg bands338*394 are (1) the multiband nature of
the system, (2) orientational disorder, (3) integration over reciprocal lattice
4 j -
0
L r x w
FIG.32. Band structure of the face-centered cubic Fm3 structure of solid C,, as obtained in
LDA (a) and GWA (b). GWA leads to a band gap in good agreement with experiment and to
a 30% increase in band width of the T,g,TI, and If, bands compared to LDA. (From Ref. 338.)
BARRIERS
20. SUPERLATTICES-INTERFACES-SCHOTTKY
a. Superlattices
Simple superlattices consist of periodically repeated units of n layers of a
401 X. Blase, A. Rubio, S. G. Louie, and M. L. Cohen, Europhys. Lett. 28, 335 (1994).
402 X. Blase, A. Rubio, S. G. Louie, and M. L. Cohen, Phys. Rev. B51,6868 (1995).
QUASIPARTICLE CALCULATIONS IN SOLIDS 141
0
0 si (CUBIC)
0 Ge(TETR1
Si4Ge4 (001)
0.5
I
Y
4
P
-0.5
-4
I I
-2 0
I
:
I
2 4
I
E''(eV)
FIG. 33. The calculated self-energy correction EQP - E~~~ is plotted against the quasiparticle
energy EQP for a Si,Ge, superlattice, cubic Si, and tetragonally distorted Ge. No significant
quantitative difference between the self-energy corrections in, the superlattice and the bulk
materials is found since self-energy effects in Si and Ge are very close. All values are aligned
with respect to the valence-band edge. (From Ref. 406.)
,09S. B. Zhang, M. S. Hybertsen, M. L. Cohen, S. G. Louie, and D. Tomanek, Phys. Rev. Lett.
63,1495 (1989).
S.B. Zhang, M. L. Cohen, S. G. Louie, D. Tomanek, and M. S. Hybertsen, Phys. Rev. B41,
10058 (1990).
S. B. Zhang, M. L. Cohen, and S. G. Louie, Phys. Rev. B43,9951 (1991).
4'2 X. Zhu, S. B. Zhang, S. G. Louie, and M. L. Cohen, Phys. Rev. Lett. 63,2112 (1989).
For a review, see, for instance, M. S. Hybertsen, Mat. Sci. Eng. B 14,254 (1992).
See, for instance, F. Capasso and G. Margaritondo (eds.), Heterojunction Band Discontinu-
ities: Physics and Applications, North Holland, New York (1987).
QUASIPARTICLE CALCULATIONS IN SOLIDS 143
FIG. 34. Schematic representation of the volume-averaged dipole density n(z) (dashed line),
the Hartree potential V ( z ) (solid line), and the interface conduction-band offset, AEc, and
valence-band offset, AE,,, of two semiconductors A and B. In the semiconductor bulk regions,
V ( z ) equals the average Hartree potential ( which serves as an energy reference for the
bulk valence-band maximum ( E u , A ( Band J bulk conduction-band minimum (Ec.,qnJ of each
semiconductor. The dipole potential Vdipolc equals the difference between the average Hartree
potentials. Egap,AfBl is the energy gap of the bulk semiconductors.
orientation, defects, polarity, and strain on band offsets can be studied using
density functional theory. However, a more accurate prediction of valence-
and conduction-band offsets requires a quasiparticle approach.413 Since the
sum of the valence- and conduction-band offsets equals the known band-gap
discontinuity between the semiconductors A and B, AE,,, = AE” AE,, +
self-energy corrections of only the valence-band offset are discussed in this
subsection.
Within a few monolayers of the sharp interface, charge density rear-
rangement leads to the formation of an interface dipole that causes a
potential step between the bulk materials.41 The dipole contribution can
be combined with the bulk-derived valence-band edge positions to obtain a
band offset for a particular interface. Let the volume-a~eraged,~’electros-
tatic (Hartree) potential ( VH)A(B) define an absolute energy level with
respect to which the quasiparticle valence-band edges EV,A(B) are measured.
The (LDA) dipole potential is given as V&:c) = ( V H ) y D ”- ) (VH)LjD”).
Denote the difference in the quasiparticle corrections to the bulk LDA band
structure of the semiconductors A and B as AX. The valence-band offset is
144 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
then given as
AEv = &,A +
- E ~ . B hipole =
ELDA
U.A
- ELDA
v,B + AX + Vdipole. (4~~)
Under the assumption that LDA describes the interface dipole correctly,
that is, hipole = Vk;ie, and ignoring possible complications related to
absolute energy shifts in GWA,416AEv is determined from an LDA supercell
calculation, which gives Ef,3A- Ef,$A and the interface dipole, and from
bulk GWA calculations, which determine AX. This implies that interface
GWA calculations are computationally less demanding than quasiparticle
calculations of superlattices since they require consideration of only the bulk
rather than the generally much larger superlattice unit cell. Neglect of the
difference between the real and the LDA interface dipole potential is
considered to be a good approximation, since LDA describes semiconductor
densities to within a few percent.
Density distributions at semiconductor interfaces such as GaAs-A1As are
smooth because of the similar average valence electron densities and
screening properties of the constituent materials (average valence electron
density = 0.177 electrons/A3 (GaAs), 0.176 electrons/A3 (AlAs); dielectric
constant = 10.8 (GaAs), 8.2 (AlAs)). More inhomogeneous systems such as
metal-vaccum interfaces require a self-consistent treatment of quasiparticle
wave functions, as discussed in Section V.28.
Quasiparticle corrections to the valence-band offset at a GaAs-A1As
i n t e r f a ~ e ~amount
l ~ . ~ ~to~about 0.1 eV or 30% of the valence-band offset
of 0.41 eV determined in LDA. Self-energy corrections are crucial to obtain
a theoretical valence-band offset to 0.53 f 0.05 eV, in good agreement with
experimental values of 0.53-0.56 eV.418*419
As in the case of GaAs-AlAs, Table 28 shows that quasiparticle effects
415 After performing an in-plane average of, for example, the potential v(z) =
l/sJsV(x,y,z)dxdy (S is the in-plane area of the surface unit cell), an additional “running”
average over the extension of a unit cell in the growth direction (z) is performed. With a as the
lattice constant in the z direction, one defines V(z) = I/a J:?$z T(z’)dz’, which reduces to a
constant in the bulk and shows a smooth transition at the interface. For a discussion of these
kinds of averages, see, for instance, S. Baroni, R. Resta, A. Baldereschi, and M. Peressi, in
Spectroscopy of Semiconductor Microstructures, eds. G. Fasol, A. Fasolino, and P. Lugli,
NATO AS1 Series B, vol. 206, Plenum Press, New York (1989), 251.
416 The absolute energy level in GWA can be shifted by convergence, the choice of the
exchange-correlation potential, and the choice of vertex corrections, as detailed in Section 11.14.
The valence-band offset is determined with the implicit assumption that the difference in
self-energy corrections is physically meaningful and independent of technical details provided
that both semiconductors are treated on the same level of approximation.
417 S. B. Zhang, D. Tomhek, S. G. Louie, M. L. Cohen, and M. S. Hybertsen, Sol. State Comm.
66, 585 (1988).
418 P. Dawson, K. J. Moore, and C. T. Foxon, in Quantum Well and Superlattice Physics,
Proceedings of the SPIE 792, eds. G. H. Dohler and J. N. Schulman, SPIE, Washington
(1987), 208.
QUASIPARTICLE CALCULATIONS IN SOLIDS 145
28. VALENCE-BAND
TABLE OFFSETM eV AT THE NONPOLAR,
ZINCBLENDE
GaN/AlN( 110) INTERFACE, USINGTHE AVERAGELATTICECONSTANT OF
AlN AND GaN AND PSEUDOPOTENTIALS WITH THE Ga 3d ELECTRONS
TREATED EITHERAS CORE OR AS VALENCE ELECTRONS'96OR LMT0.420.421
INCLUSION OF THE Ga 3d ELECTRONS AS VALENCE ELECTRONS LEADSTO
AGREEMENT BETWEEN THE THREE LDA CALCULATIONS AND CHANGES THE
SIGN OF THE INTERFACED1POLE.lg6 THE SELF-ENERGY CORRECTION OF
0.25 eV EQUALS ABOUT 30% OF THE LDA VALENCE-BAND OFFSET AND HAS
BEEN ESTIMATED FROM BULK CALCULATIONS THAT TREATTHE Ga 3d
ELECTRONS VIA NONLINEAR CORE CORRECTIONS'96 (SEE SECTION III.15a
FOR A DISCUSIONOF THE TREATMENT OF d ELECTRONS IN GWA). THE
EXPERIMENTAL VALUESREFeR TO WURTZITE INTERFACES.
LDA
c. Schottky Barriers
At a metal/insulator interface such as Al/GaAs(l lo), alignment of the metal
and semiconductor Fermi levels leads to the creation of a 100-1000-A-thick
space charge layer on the semiconductor side, which, via Poisson’s equation,
bends the valence and conduction bands. Electrons (holes) that flow from
the metal into the n-type (p-type) semiconductor must overcome the
so-called Schottky barrier if the metal Fermi level is pinned inside the
semiconducting gap E, (for a recent review of Schottky barriers see Ref.
424). The n-type and p-type Schottky barriers B, and B, are defined in terms
of the conduction-band minimum E, and the valence-band maximum E, of
the bulk semiconductor and the bulk metal Fermi energy E, as
B, = E, - E ,
B, = E , - E ,
* B, + B, = E,. (4.5)
424 J.-G. Li, Mat. Chem. Phys. 47, 126 (1997),and references therein.
QUASIPARTICLE CALCULATIONS IN SOLIDS 147
425 J. P. A. Charlesworth, R. W. Godby, R. J. Needs, and L. J. Sham, Mat. Sci. Eng. B 14, 262
(1 992).
426 J. P. A. Charlesworth, R. W. Godby, and R. J. Needs, Phys. Rev. Lett. 70, 1685 (1993).
4 2 7 R. J. Needs, J. P. A. Charlesworth, and R. W. Godby, Europhys. Lett. 25, 31 (1994).
4 2 8 M. S. Hybertsen and S. G. Louie, Phys. Rev. B38,4033 (1988).
148 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
433 R. D. Bringans, R. I. G. Uhrberg, R. Z. Bachrach, and J. E. Northrup, Phys. Rev. Left. 55,
533 (1985).
434 R. D. Bringans, R. I. G. Uhrberg, R. Z. Bachrach, and J. E. Northrup, J. Vuc. Sci. Technol.
A 4 , 1380 (1986).
4 3 5 R. D. Bringans, R. I. G. Uhrberg, and R. Z. Bachrach, Phys. Rev. 834,2373 (1986).
5 ,
--
% -I
P: -8
bl
-
P r it
FIG. 35. Left panel: LDA and quasiparticle (QP) Ge( 1 1 1):As (1 x 1) surface-state energies in
comparison to the projected bulk quasiparticle band structure (shaded) along high-symmetry
directions in the surface Brillouin zone. Quasiparticle corrections (1) open up the gap between
r;
the surface states; (2) eliminate the unphysical, flat LDA dispersion at and (3) improve the
position of the occupied surface bands with respect to the valence-band maximum in
comparison to experiment. Right panel: Calculated occupied quasiparticle surface-state ener-
gies compared to angle-resolved photoemission data (Ref. 433). Agreement between theory and
experiment is within the quoted theoretical uncertainty of kO.1 eV. (Adapted from Ref. 432.)
150 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
“Ref. 436.
437 M. Rohlfing, P. Kriiger, and J. Pollmann, Phys. Rev. B54, 13759 (1996).
r J , K J’ r [oto] J;
FIG. 36. GWA (solid line), LDA (dashed line), and experimental photoemission data
(diamonds: Refs. 438, 439, 440, circles: Refs. 441, 442) for occupied and unoccupied states of
the clean Ge(001) (2 x 1) surface. Clear discrepancies with experiment exist for the dispersion
of the unoccupied Ddown state, of the occupied D,, state between JK/2 and J’, and for the
dispersion of the occupied states at I-. The projected bulk GWA band structure is shadowed,
and energies are measured with respect to the valence-band maximum. Open symbols represent
weak features in the experimental spectra. (From Ref. 437.)
QUASIPARTICLE CALCULATIONS IN SOLIDS 151
lations and experiment along the JK/2 to J' direction in the surface Brillouin
zone, and (2) the 0.6-eV experimental band width of the Ddownstate is
strongly overestimated in quasiparticle theory (z1.0eV, Ref. 437). Similar
discrepancies exist for the sulphur-terminated Ge(001):S (2 x 1) surface.437
Here the energies of the occupied quasiparticle D surface band and the
occupied B surface band are underbound by 0.9 and 0.4eV, respectively,
compared to experiment.
Quasiparticle calculations for surface states help explain and analyze
surface reconstructions and can be especially valuable when LDA total
energy minimizations lead to contradicting results with very different
electronic properties. For example, the Pandey n-bonded-chain model is
favored for the explanation of the C( 111) (2 x 1) surface reconstruction, but
the precise position of the atoms in the surface has been a subject of
discussion. Vanderbilt and suggested a surface geometry of slightly
buckled but undimerized chains based on a linear-combination-of-atomic-
orbitals LDA calculation. Iarlori et a1.444found a dimerized surface with no
buckling based on LDA molecular dynamics simulations. Model GWA
calculations by KreB, Fiedler, and B e ~ h s t e d t ,whose
~ ~ ~ results
, ~ ~ ~are shown
in Fig. 37, support the dimerized model. The quasiparticle calculations show
that the Vanderbilt-Louie model leads to a metallic surface and that the
model by Iarlori et al. opens up a surface gap of 1.7 k 0.3 eV, in good
agreement with the experimental value of about 2.0 eV.447,448
438 L. Kipp, R. Manzke, and M. Skibowski, Surf: Sci. 269/270, 854 (1992).
439 L. Kipp, R. Manzke, and M. Skibowski, Sol. State Comm. 93, 603 (1995).
440 M. Skibowski and L. Kipp, J. Electron Spectrosc. Relat. Phenom. 68, 77 (1994).
4 4 1 E. Landemark, R. I. G. Uhrberg, P. Kriiger, and J. Pollmann, Surt Sci. Lett. 236, L359
(1990).
442 E. Landemark, C. J. Karlsson, L. S. 0. Johansson, and R. I. G. Uhrberg, Phys. Rev. B49,
16523 (1994).
443 D. Vanderbilt and S. G. Louie, Phys. Rev. B29, 7099 (1984).
444 S. Iarlori, G. Galli, F. Gygi, M. Parinello, and E. Tosatti, Phys. Rev. Lett. 69, 2947 (1992).
445 C. KreB, M. Fiedler, and F. Bechstedt, Europhys. Lett. 28, 433 (1994).
446 C. KreB, M. Fiedler, W. G. Schmidt, and F. Bechstedt, Surf. Sci. 331-333, 1152 (1995).
calculations, such as the Ge(ll1) (2 x 1) surface (Ref. 449); the Ge(001):H (2 x 1) surface (Ref.
437); the Si(l1l):As (1 x 1) surface (Refs. 428 and 450); the Si(ll1) (2 x 1) surface (Ref. 451); the
Si(OO1) (2 x 1) surface (Refs. 284 and 452); the Si(001) c(4 x 2) surface (Ref. 453); and the
p-SiC(001) (2 x 1 ) surface (Ref. 454).
449 X. Zhu and S. G. Louie, Phys. Rev. B43, 12146 (1991).
r J K J’ r J K J’
FIG. 37. Quasiparticle band structure of the C(111) (2 x 1) surface for slightly buckled,
undimerized rr-bonded chains (left and dimerized but unbuckled r-bonded chains
(right Only the latter case (b) leads to semiconducting rather than metallic behavior
and a surface gap of 1.7 eV445in good agreement with the experimental gap of about 2.0 eV.447
The projected bulk quasiparticle band structure is shown in black. (Adapted from Ref. 445.)
22. DEFECTS
454 M.Sabisch, P. Kriiger, A. Mazur, M.Rohlfing, and J. Pollmann, Phys. Reo. B53, 13121
( 1996).
QUASIPARTICLE CALCULATIONS IN SOLIDS 153
(within one to two lattice constants) 1s state and shows structural relaxations
relative to the ideal crystal structure that are mostly confined to nearest-neighbor
alkali-metal at0ms.4~~ In this case, a theoretical description of the F-center using
a supercell approach with relatively small unit cells seems appropriate.
The fundamental electronic F-center transitions are (1) transitions from
the localized 1s electron to the conduction band critical points at L, A, and
X, and (2) an intra-defect transition from the localized 1s to a localized 2 p
level. For the first transition, the experimental values of 4.5 eV, 5.0 eV, and
5.8eV for the L, A, and X transitions are well reproduced by the quasi-
particle values 4.5 eV, 5.0 eV, and 5.7 eV, which correct LDA results of
1.8 eV, 2.2 eV, and 2.8 eV by an almost constant shift of 2.8 eV.455For the
second, because of the localized nature of the 1s and 2 p orbitals involved in
the 1s -,2p intra-defect excitation, excitonic effects must be included in the
determination of the quasiparticle transition energy. These effects lower the
transition energy by about 1eV, leading to a quasiparticle value of 3.4 eV-
a 1.0-eV correction to the LDA value of 2.4eV-in good agreement with
experiments ranging from 3.1 to 3.3 eV.455
Besides excitonic effects, quasiparticle defect calculations are challenging
because of problems related to finite-size effects, the treatment of the
localization and spin of the defect electron, the possible need for a self-
consistent treatment of quasiparticle defect states, and so forth. In the case of
the 2p defect state even the determination of the corresponding LDA state is
difficult since the band gap underestimation in LDA causes this state to be
resonant and hybridize with bulk bands. For a detailed discussion of technical
problems occurring in GWA defect calculations, we refer the reader to Ref. 455.
23. PRESSURE
455 M. P. Surh, H. Cacham, and S. G. Louie, Phys. Rev. 851, 7464 (1995).
456 H. Chacham, X. Zhu, and S. G. Louie, Europhys. Lerr. 14,65 (1991).
457 H. Chacham, X. Zhu, and S. G. Louie, Phys. Rev. 846,6688 (1992).
4s8 H. Chacham and S. G. Louie, Phys. Rev. Leu. 66,64 (1991).
4s9 M. P. Surh, S. G. Louie, and M. L. Cohen, Phys. Rev. 845, 8239 (1992).
460 For a review, see F. Siringo, R. Pucci, and N. H. March, High Press. Res. 2, 109 (1989).
154 WILFRIED G. AULBUR, LARS JBNSSON, AND JOHN W. WILKINS
z H U4 6 2 GODBY463
465 L. Brey and C. Tejedor, Sol. State Comm. 55, 1093 (1985).
QUASIPARTICLE CALCULATIONS IN SOLIDS 155
466 K. A. Goettel, J. H. Eggert, I. F. Silvera, and W. C. Moss,Phys. Rev. Lett. 62,665 (1989).
467 R. Reichlin, K. E. Brister, A. K. McMahan, M. Ross, S. Martin, Y. K. Vohra, and A. L.
Ruoff, Phys. Rev. Lett. 62,669 (1989).
468 A. L. Ruoff, H. Luo, and Y. K. Vohra, J. Appl. Php. 69,6413 (1991).
156 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
a
E
6
rn
Density (mol/cma)
FIG. 38. Minimum band gap of onentationally ordered solid molecular hydrogen in the hcp
structure as a function of density and hence pressure for Hartree-Fock (HF), GWA (GW), and
LDA. GWA is close to Hartree-Fock for low densities and close to LDA for high densities.
Lines are drawn as a guide to the eye. (From Ref. 457.)
the minimum band gap and is (1) negligible at zero pressure,"69 and (2)
crucial at high densities exemplified by a metal-insulator transition pres-
sure of 151 GPa for hcp H,, whose molecules are perfectly aligned along
the c-axis and of 300 GPa for orientationally disordered H2.457Experimen-
tally, metallization of solid hydrogen-defined by a finite T + 0 DC con-
ductivity-remains to be observed (Refs. 470, 471, 472, and 473). Optical
experiments have failed to detect evidence for gap closure below about
200 GPa,"70.471 and a metal-insulator transition pressure of more than
300 GPa seems likely.473
469 The LDA and GWA gaps for hcp H, with all molecules aligned along the c-axis are 8.4
and 15.4 eV. When disorder is included via a virtual crystal approximation, the gaps change
to 8.3 and 15.3 eV compared to an experimental value of 14.5+ 1 eV (Ref. 457).
470 N. H. Chen, E. Sterer, and I. F. Silvera, Phys. Rev. Left. 76, 1663 (1996).
471 R. J. Hemley, H.-K. Mao, A. F. Goncharov, M. Hanfland, and V. Struzhkin, Phys. Rev.
Left 76, 1667 (1996).
472 R. J. Hemley and N. W. Ashcroft, Physics Today 51.26 (1998).
473 M. Ross, Phys. Rev. 854, R9589 (1996).
QUASIPARTICLE CALCULATIONS IN SOLIDS 157
24. EXCITONS
474 S.Albrecht, L. Reining, R. Del Sole, and G. Onida, Phys. Rev. Lett. 80, 4510 (1998).
475 L. X. Benedict, E. L. Shirley, and R. B. Bohm, Phys. Rev. B57, R9385 (1998).
476 L. X. Benedict, E. L. Shirley, and R. B. Bohm, Phys. Rev. Lett. 80,4514 (1998).
477 M. Rohlfing and S. G. Louie, Phys. Rev. Lett. 81, 2312 (1998).
478 Handbook of Optical Constants of Solidr, ed. E. D. Palik, Academic Press, Boston (1991).
479 E. L. Shirley, Phys Rev. Lett. 80, 794 (1998).
480 G. Onida, L. Reining, R. W. Godby, R. Del Sole, and W. Andreoni, Phys. Rev. Lett. 75,
818 (1995).
158 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
2 S , ] , ,
;‘ ‘ a
E2 E,
6 8 10 12 14 16 1% 20 6 8 10 12 14 16 18 20
hiD (eV) (eV)
FIG.39. Imaginary part of the dielectric function E ~ ( w )= Im(&(o)) as a function of ho for
Si (upper panel, adapted from Ref. 474) and diamond (lower panel, adapted from Ref. 475).
For Si, the dots correspond to experiment274 and the solid line includes local fields and
electron-hole attraction in the Hamiltonian; the long-dashed line includes only local fields, the
short-dashed line only RPA screening with quasiparticle shifts. The low-energy experimental
peak is commonly referred to as the E , peak, and the high-energy experimental peak is referred
to as the E , peak. For diamond, the left panel contains theoretical results, neglecting the
electron-hole interaction (dashed lines) and experimental results from Refs. 142 and 478 (solid
lines). The right panel contains theoretical results including the electron-hole interaction
(dashed line) and the same experimental results (solid line). Excitonic effects improve agreement
between theory and experiment significantly for peak position and height and are larger in
large-gap materials such as diamond than in medium-gap semiconductors such as Si.
a. Atoms
Assessment of the usefulness of GWA for atoms requires the comparison of
theoretical and experimental results for some key quantities, such as the first
and second ionization potentials and transfer energies.67The GWA ioniz-
ation potentials of major group elements such as B, Al, and Ga deviate on
average only 3% from experimental values, compared to a 12% deviation in
H a r t r e e - F ~ c k No
. ~ ~ systematic improvement compared to the local spin
density approximation (LSDA)481*482 is obtained. For s + p promotion
energies the error is 20% in GWA, 40% in Hartree-Fock, and 5% in LSDA,
as shown in Table 31. In absolute terms, deviations between GWA and
experiment vary between 0 and 0.8eV. Hence, taking the dynamically
screened interaction into account leads to a significantly improved agree-
ment with experiment compared to Hartree-Fock but not compared to
LSDA. Absolute errors exceed chemical accuracy by more than one order
of magnit~de.~’
Figure 40 shows the s + d promotion energy in the iron series and
demonstrate^^^.^^^ that (1) GWA agrees with experiment quantitatively for
the first half of the iron series, in contrast to Hartree-Fock and the LSDA;
(2) GWA reproduces experimental trends qualitatively for the second half
of the iron series, in contrast to Hartree-Fock, but is inferior by about a
factor of three to the LSDA; and (3) GWA cannot compete in accuracy with
PROMOTION
IONIZATION
POTENTLUS ENERGY
B
HF 7.5 23.8 10.4 2.0
GWA 8.5 24.9 : 12.0 2.8
LSDA" 8.6, 8.6b 24.6 12.7 3.2
Expt.' 8.3 25.2 12.9 3.6
A1
HF 5.3 17.8 8.5 2.1
GWA 6.0 18.9 10.0 3.0
LSDA" 6.0, 6.p 18.9 10.7 3.6
Expt.' 6.0 18.8 10.6 3.6
Ga
HF 5.3 19.0 9.5 2.9
GWA 6.0 20.0 11.2 4.1
LSDA" 6.1 20.8 12.3 5.0
Expt.' 6.0 20.5 12.0 4.8
"Ref. 481 unless otherwise noted; bRef. 482; 'as cited in Ref. 67; for further reference, see Refs.
483 and 484.
b. Molecules
Besides "zero-dimensional" atoms, quasiparticle calculations have been
done for quasi-one-dimensional trans-polyacetylene (trans-(CH)J4" -the
simplest prototype of a conducting polymer -and a quasi-two-dimensional
"' This approach modifies the LSDA by considering a nonlocal, screened Hartree-Fock
operator as part of the exchange-correlation potential (Ref. 486). This technique includes
nonlocality effects of the quasiparticle self-energy but omits its energy dependence, in contrast
to GWA.
488 E. C. Ethridge, J. L. Fry, and M. Zaider, Phys. Rev. B53, 3662 (1996).
QUASIPARTICLE CALCULATIONS IN SOLIDS 161
0.._.__.
0HF
D-OGWA
4-4 Exp.
B - - V LSD R ..a,
......
A- - 4 LSDSX Q\
f '\ '0
Ce Sc Ti V Cr Mn Fe Co Ni Cu
Element of iron series
FIG.40. The neutral s - + d promotion energies (eV) for the iron series elements from
experiment, Hartree-Fock (HF), and GWA6' and from the local spin-density approximation
(LSDA) and the screened-exchange local spin-density approximation (LSDSX)486from Ref.
485. GWA improves upon Hartree-Fock for all elements and upon the local spin-density
approximation for the first half of the series, but cannot compete in accuracy with the
screened-exchange local spin-density results. The figure combines data from Refs. 67 and 485.
489 C. G. Van de Walle and J. E. Northrup, P h p . Rev. Lett. 70, 1116 (1993).
490 M. S. Brandt, H. D. Fuchs, M. Stutzmann, J. Weber, and M. Cardona, Sol. State Comm.
81, 307 (1992).
162 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
493 H. Fritzsche, N. Niicker, B. Scheerer, and J. Fink, Synth. Met. 28,D237 (1989).
494 D. Moses, A. Feldblum, E. Ehrenfreund, A. J. Heeger, T.-C. Chung, and A. G. MacDiar-
mid, Phys. Rev. B26,3361 (1982).
495 J. J. Ritsko, E. J. Mele, A. J. Heeger, A. G. MacDiarmid, and M. Ozaki, Phys. Rev. Letr.
44, 1351 (1980).
496 H. Zscheile, R. Griindler, U. Dahms, J. Frohner, and G. Lehmann, Phys. Stat. Solidi. B 121,
K161 (1984).
497 The same quasiparticle correction is assumed to be valid for the polysilane and the siloxene
compound based on the similarity of the highest occupied and lowest unoccupied state in the
two materials. This approximation is crude, since quasiparticle shifts are determined by a sum
over a range of occupied and unoccupied states.
498 For recent reviews on porous silicon, see, for instance, P. D. J. Calcott, Mat. Sci. Eng. B
51, 132 (1998); A. G. Cullis, J. Appl. Phys. 82,909 (1997); P.M. Fauchet, J. Lumin.70, 294
(1996); B. Hamilton, Semicond. Sci. Technol. 10,1187 (1995); G. C. John and V. A. Singh, Phys.
Rep. 263, 93 (1995); and M. H. Ludwig, Crit. Rev. Sol. State Mat. Sci. 21, 265 (1996).
QUASIPARTICLE CALCULATIONS IN SOLIDS 163
V. Metals
Quasiparticle calculations for metals have been reported for bulk (Section
V.26), clusters (Section V.27), and surfaces (Section V.28), and are reviewed
in detail in Ref. 14. This section concentrates on a few important aspects
such as the importance of core-valence exchange for the energy-dependence
of the self-energy, the band width of alkali metals, and quasiparticle
corrections in Ni. The reader is referred to Ref. 14 for a more detailed
discussion of some of the quasiparticle applications. Regarding surfaces in
particular, we discuss only the recent work of White et ~ . A1~ (111)~
1 on an ~
metal surface since the extensive work on jellium surfaces was reviewed in
Ref.’ 14.
26. BULK
499 I. D. White, R. W. Godby, M. M. Rieger, and R. J. Needs, Phys. Rev. Lett. 80,4265(1998).
500 W. Speier, R. Zeller, and J. C. Fuggle, Phys. Rev. B32, 3597 (1985).
501 G. Materlik, J. E. Miiller, and J. W. Wilkins, Phys. Rev. Left. 50, 267 (1983).
164 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
c -
0
-2
0 20 40 00 80 100
Energy (eV) '
FIG.41. Self-energy contribution to the energy of the Ag 5s photoelectron arising from the
exchange interaction with electrons in the filled valence shell (n = 4) for different values of the
angular momentum 1 (dashed lines). Comparison of the sum of the s, p, and d contributions
(dashed-dotted line) with the solution that takes the effect of dynamic screening into account
(solid line) shows that exchange is dominant. Energies are measured from the Fermi level.
(From Ref. 216.)
b. Alkali Metals
The weakness of the effective crystal potential in alkali metals seems to
permit a nearly-free-electr~n-model~~~ description of the conduction elec-
trons and seems to offer an experimentally accessible system whose many-
body corrections are weak, allowing the use of perturbation theory such as
GWA, and not complicated by the effects of a complex band structure.
'02 E. Clementi and C. Roetti, Atomic Data and Nuclear Data Tables 14, 177 (1974).
Figure 42 shows that screening of the nonlocal exchange part leads to moderate quanti-
tative changes in the results, as does the inclusion of lifetime effects, that is, the imaginary part
of the selfenergy.
N. W. Ashcroft and N. D. Mermin, Solid Srare Physics, Holt, Rinehart, and Winston, New
York (1976) 2858.
QUASIPARTICLE CALCULATIONS IN SOLIDS 165
0 50 100 150
Energy (eV)
FIG.42. Self-energy of a 5s photoelectron in Ag relative to the Fermi energy as a func-
tion of energy. The different curves are for (1) the bare exchange due to the 5s conduction
electrons (dotted line), (2) the screened exchange due to the 5s conduction electrons (dashed
line), (3) the screened exchange due to the 5s conduction electrons and including the effects of
a finite imaginary part of the self-energy (dash-dotted line), (4) same as in (3) but adding the
bare exchange with the core electrons (solid line a), and ( 5 ) same as in (3) but adding the
screened exchange with the core electrons (solid line b). Experimental data from Ref. 500 are
given as squares. Inclusion of exchange with the core electrons changes the energy dependence
of the self-energy qualitatively. Screening and a nonzero imaginary part of the self-energy affect
the results quantitatively. (From Ref. 216.)
I I I I I
FIG. 43. Photoemission peak positions as a function of photon energy for the Na (110)
surface. The quasiparticle (QP) bands include the real and imaginary part of the self-energy and
are represented by solid curves. The QP bands are narrower than the nearly-free-electron bands
(NFE, dashed curves) by 0.37 eV. The full theory (solid dots) also includes surface effects and
reproduces the sharp peaks in the “gap” at photon energies of about 35 eV. Experimental data
are denoted by crosses and taken from Ref. 505. (From Ref. 229.)
QUASIPARTICLE CALCULATIONS IN SOLIDS 167
NORTHRUP
et Ul.184’258 SHIRLEYz3’ SHUNG et
’I2 The appropriate choice of vertex corrections is a topic of current discussions (Refs. 232 and
267).
’13 Note that both Shirley and Northrup er al. use plasmon-pole models, which may affect the
accuracy of their calculated valence band widths.
’14 See also Ref. 11 for a critical discussion of the theoretical determination of the alkali metal
bandwidth.
’” Further Applications: Recently, Kubo reported Compton profile studies of Li and Na using
full-potential LMTO (Refs. 287 and 288).
’I6 F. Aryasetiawan and K. Karlsson, Phys. Rev. 854, 5353 (1996).
QUASIPARTICLE CALCULATIONS IN SOLIDS 169
short description of the successes and failures of GWA in the case of Ni6'
Quasiparticle calculations for Ni lead to a significant improvement of the
LDA band structure and in particular reduce the 3d band width by about
1 eV, in agreement with experiment. Experimental quasiparticle lifetimes are
reproduced as well. Self-energy shifts in Ni are strongly state dependent
because the LDA exchange-correlation potential is a better approximation
for the self-energy of, for example, free-electron-like s states than for the
self-energy of strongly correlated d states. The different character of the
s-like and d-like states is also reflected in the quasiparticle weight, which is
about 0.7 for s-like but only about 0.5 for d-like states.
The 6-eV satellite is not reproduced in GWA, and the exchange splitting
is only marginally improved since both features result from strong hole-hole
interactions that require the inclusion of vertex corrections. A common
explanation of the photoemission process starts with the creation of a 3d
hole, which introduces a strong perturbation because of its localized nature
and excites another 3d electron to an empty state just above the Fermi level.
The two holes scatter repeatedly and form a bound state at 6eV. A
first-principles T-matrix calculation7' includes the neglected hole-hole in-
teraction via ladder diagrams and leads to a 6-eV satellite and an improved
exchange splitting.
27. CLUSTERS
The total energy per atom of, for example, Na clusters as a function of the
number of Na atoms N is a smooth function except for small kinks at
N = 8, 18, 20, 34, and so on (for a review on metal clusters, see, for instance,
Ref. 517). This shell structure is reminiscent of the behavior observed in
nuclei and atoms and results from the fact that the electrons in the alkali
cluster can be reasonably well described by an effective one-particle, spheri-
cal potential. The properties of the metal clusters-such as their total
energy and hence their stability, their ionization potential, and their electron
affinity-change in an abrupt way whenever one shell of electrons is filled
up, leading to an abrupt change as a function of cluster size. The spherical,
one-particle potential of choice is the so-called jellium-sphere-background
model in which the positive ion cores in the cluster are replaced with a
constant-density sphere. The sphere radius is given by T , N ' / ~with , r, being
the Wigner-Seitz radius corresponding to the average metal density.
As shown in Fig. 44, LDA calculations of the ionization potential and
electron affinity do not reflect the experimental shell structure, that is, the
dependence on cluster size of these q u a n t i t i e ~ . ~This
~ ~ *discrepancy
'~~ arises
517 W. A. De Heer, W. D. Knight, M. Y. Chou, and M. L. Cohen, Sol. State Phys. 40,93(1987).
170 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
21
0 10 20 50 40
Number of atoms per cluster, n
4
J'/>! ............x......)<
X....''
-/
0 1
0 10 20 50 40
Number of atoms per cluster, n
FIG. 44. The absolute value of the quasiparticle energies of the highest-occupied (top) and
lowest-unoccupied (bottom) states in potassium as a function of cluster size in LDA (open
circles), GWA (filled circles), and experiment (triangles, data from Refs. 517, 518, 519, 520, and
521). The crosses represent LDA total-energy differences (ASCF) between positive (KJ) and
neutral (top) clusters or between neutral (K,,) and negative (Kn-, bottom) clusters. The LDA
calculation is not possible for small (n = 2 and 8) clusters. GWA tracks experiment and the
ASCF calculations closely for the highest-occupied states in contrast to LDA. The same is true
for the lowest-unoccupied state although the improvement over LDA is less dramatic. (Adapted
from Ref. 523.)
QUASIPARTICLE CALCULATIONS IN SOLIDS 171
28. SURFACES
In classical electrostatics, an electron at a location z outside a metal surface
induces a surface charge and in turn experiences an attractive image poten-
tial ym(z) whose asymptotic form for large z is given as - 1/(4(z - zo)),
where zo is the effective edge of the metal. On a microscopic level, the
rearrangement of charges at the metal surface is due to long-range exchange
and correlation effects, which are absent in LDA because of the exponential,
ather than inverse power-law decay of the LDA exchange correlation
potential outside the metal surface.525This severe LDA failure leads to a
poor description of surface states and to an absence of image states and
resonances in LDA. Discrepancies between LDA results and the results of
surface-sensitive experimental techniques such as low-energy electron dif-
518 P. Fayet, J. P. Wolf, and L. Woste, Phys. Rev. B33, 6792 (1986).
’I9 A. Herrmann, E. Schumacher, and L. Woste, J . Chem. Phys. 68, 2327 (1978).
5 2 0 M. M. Kappes, M. Schar, P. Radi, and E. Schumacher, J . Chem. Phys. 84, 1863 (1986).
5 2 1 K. M. McHugh, J. G. Eaton, G. H. Lee, H. W. Sarkas, L. H. Kidder, J. T. Snodgrass,
M. R. Manaa, and K. H. Bowen, J. Chem. Phys. 91,3792 (1989).
5 2 2 S. Saito, S. B. Zhang, S. G. Louie, and M. L. Cohen, Phys. Rev. B40, 3643 (1989).
5 2 3 S. Saito, S. B. Zhang, S. G. Louie, and M. L. Cohen, J. Phys. Cond. Mar. 2, 9041 (1990).
5 2 4 What self-interaction errors there are in GWA arise from the input wave functions and
energies used for the construction of the self-energy operator. Errors in the wave functions are
small since the overlap between quasiparticle and LDA wave functions is larger than about
95% (Refs. 522 and 523). Errors in the energies can be eliminated by a self-consistent
determination of quasiparticle energies.
5 2 5 N. D. Lang and W. Kohn, Phys. Rev. B7, 3541 (1973).
172 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
Koc(r) =
s X(r, r’; E)Y(r’)dr’/Y(r).
For an Al(111) surface, Fig. 45 shows that (1) outside the metal surface
(5.1)
&(r) has the correct asymptotic, image-like behavior, and (2) inside the
metal surface self-energy corrections to LDA for states close to the Fermi
energy are small, resulting in nearly identical values of Koc(r) and V:FA(r).
Essential for the above successful applications of the dynamically screened
interaction approximation is a self-consistent evaluation of quasiparticle
states.499The difference between the exponential LDA decay and the GWA
inverse power-law decay leads to an increased weight of the quasiparticle
states in the near-surface region, demonstrated in Fig. 46. Note that results
similar to the ones discussed here for the Al( 111)surface were obtained earlier
than those of Ref. 499 by Deisz, Eguiluz, Hanke, and c ~ l l a b o r a t o r s ~ ~ ~ ~ ’ ~ ~ ~ ’
for the case of a jellium surface. This work was reviewed in Ref. 14.
For semiconductors and insulators, the lessons learned from the GWA
regarding the band structure have led to a new level of accuracy in
calculations of optical response. The reason is the crucial role played by the
0.00
-0.10
-0.20
-0.30
I
-0.401 . ’ ” . ’ . ’ ’
-10.0 -5.0 0.05.0 10.0 15.0
Distance outside surface (a.u.)
FIG.45. Surface averaged effective local potential (Eq. (5.1)) at the Al(111) surface compared
with the exponentially decaying LDA exchange correlation potential Vkfl”. The local potential
calculated from the GWA self-energy is virtually identical to Vkfl” in the A1 region, and crosses
over to the classical image form (best fit shown) in the vacuum, in contrast to KtflA.(From
Ref. 499.)
4.00 I 1
I .
FIG. 46. Surface-state quasiparticle wave function (full line) of the Al(111) surface at r
1.66 eV below the vacuum level in comparison to its LDA counterpart (dashed line). Since the
local potential resulting from the self-energy decays as l/z, z being the distance from the surface,
rather than exponentially as does the LDA exchange-correlation potential, the quasiparticle
state has weight transferred into the vacuum relative to the LDA state. The quasiparticle state
is obtained by an iterative solution of the energy-dependent quasiparticle equation. (From Ref.
499.)
174 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
29. OVERESTIMATION
OF OPTICAL WITHIN DFT
CONSTANTS
where E is the total internal electric field. The scalar product symbolizes the
fact that in general the susceptibilities are tensors. However, in many
common materials only a few tensor components are independent -for
crystals with cubic symmetry in particular, x = x,, = xyy = x,, are the only
nonzero components. We use a simple scalar notation for all tensor
properties below, and only when needed do we add explicit functional
dependencies on time, position, frequency, or momentum.
In an insulator with cubic symmetry, the linear susceptibility, in the
independent-particle approximation, can be written as (I/ is the total
volume)538
x=v4 c I(ckIxl~k)l’
c,u,k &ck - &uk
9 (6.2)
Ge
30 -
Se
00
GaAS
J
x
0It20
-
I
0
a AlAs
9"
F
si
GaP 0
0
0
10 - AIP
0 warn
sic 0
0 OC Bh
GaN
0 0
OO 5 10 1
Direct band gap (eV)
FIG. 47. Percent deviation of the LDA dielectric constants from experiment for 13 semicon-
ductors and insulators. The numerical and experimentaldata are from Refs. 536, 540, 541, 542,
and 543). The calculations use scalar relativistic pseudopotentials.
TABLE33. DIRECT LDA BANDGAP (IN eV) AT r WITH DIFFERENT LEVELSOF APPROXIMATION:
NR -NONRELATIVISTIC; SR-PP -SCALARRELATIVISTIC PSEUDOPOTENTIAL(WITH ATOMIC 3d
CORE);FR-PP- FULLY RELATIVISTICPSEUDOPOTENTIAL;
FR-CR -FULLY RELATIVISTICWITH
CORE RELAXATION. THE TAFIULATEDVALUES REFLECT MEDIANSOF A MULTITUDE OF PUBLISHED
VALUES.TYPICALLY, A PARTICULAR REFERENCEWILL HAVEVALUESWITHIN0.15eV OF THE
TABULATED ONES.HOWEVER,DESPITEVARIATIONS IN ABSOLUTEVALUESTHE RELATIVETRENDS
AGREEBETWEEN DEWRENT REFERENCES.
E,,(eV)
"Ref. 544; bRef, 545; 'Ref. 114; dRef. 185; 'Ref. 43; 'Ref. 546; ORef. 192; "Ref. 121; 'Ref. 541; 'Ref.
547; 'Ref. 548; 'Ref. 549; "Ref. 550; "Ref. 551.
show for Si, Ge, and GaAs how different approximations affect the LDA gap.
The table shows, from left to right, the importance of scalar relativistic effects
with frozen core states, spin-orbit interactions (fully relativistic),and relaxation
and hybridization of core states. In GaAs, all these effects are important if we
aim for an accuracy of 0.1 eV for band-gap predictions. With a 1.5-eV band
gap, a 0.1-eV error corresponds to a 5-10% error in the dielectric constant.
The commonly used scalar relativistic pseudopotentials typically give a gap of
0.4-0.5 eV in GaAs (including a 0.1-eV spin-orbit correction), which is about
0.3 eV larger than a fully relativistic all-electron calculation. In contrast to
GaAs, the band gap of silicon is not sensitive to the level of approximation,
which shows that silicon is not a good test case for the evaluation of accurate
methods. Note also the large effects in Table 33 for Ge and GaAs from the
neglect of relativistic effects even for these relatively light atoms.
'40 J. Chen, Z. H. Levine, and J. W. Wilkins, Phys. Rev. B50, 11514 (1994).
541 Z. H. Levine and D. C. Allan, Phys. Rev. 844, 12781 (1991).
542 J. Chen, Z. H. Levine, and J. W. Wilkins, Appl. Phys. Lett. 66, 1129 (1995).
'43 H. Zhong, Z. H.Levine, D. C. Allan, and J. W. Wilkins, Phys. Rev. 848, 1384 (1993).
'44 C. S. Wang and B. M. Klein, Phys. Rev. B24,3393 (1981).
"' M.-Z. Huang and W. Y.Ching, Phys. Rev. B 47, 9449 (1993).
546 M. Alouani and J. Wills, Phys. Rev. B 54, 2480 (1996).
547 S. Bei der Kellen and A. J. Freeman, Phys. Rev. B54, 11187 (1996).
'48 S.-H. Wei and A. Zunger, Phys. Rev. B39, 3279 (1989).
549 H. Krakauer, S.-H. Wei, B. M. Klein, and C. S . Wang, Bull. Am. Phys. SOC.29,391 (1984).
''O B. I. Min, S. M d d a , and A. J. Freeman, Phys. Rev. 838, 1970 (1988).
'" Crystd and Wid State Physics, Landholt-Bornstein, Numerical Data and Functional
Relation&@ in Sciawv and Technology, vol. 17% ed. 0. Madelung, Springer, Berlin (1984).
178 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
’” A. Dal Corso, F. Mauri, and A. Rubio, Phys. Rev. B53, 15638 (1996).
’” V. Fiorentini, Phys. Rev. B46, 2086 (1992).
LDA PT UE CP
Si -0.8 0.7 0.7 0.7
Ge - 1.2 0.6 0.8 1.1
GaAs - 1.3 0.7 0.9 1.3
QUASIPARTICLE CALCULATIONS IN SOLIDS 179
30. THE“SCISSORS
OPERATOR”
AND ITS LIMITATIONS
overcorrection of the too high LDA dielectric constant. This overcorrection can
be understood by considering the band and k-space summations in Eq. (6.2).
The most important regions in k-space when calculating E are those where
valence and conduction bands are approximately parallel, leading to a large
joint density of states. The effect of the joint density of states on the dielectric
constant is best seen when E is written in terms of the imaginary part ~ ~ ( 0 )
via the Kramers-Kronig relation:
554 A. Dal Corso, S. Baroni, R. Resta, Phys. Rev. B49, 5323 (1994).
555 A. Dal Corso, A. Pasquarello, A. Baldereschi, and R. Car, Phys. Rev. B 53, 1180 (1996).
180 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
where
The position matrix elements are the same in LDA and in the scissors-
operator approach, since the wave functions are the same by construction.
The only effect of the scissors shift A in the expression for E ~ ( w )is that A is
added inside the 6 function. As shown in Fig. 39 and in Figs. 48 and 49, for
most semiconductors the dominant structure in &'(a) is not the region
around the band gap but the one that consists of a couple of peaks a few
eV higher up in energy.
To shift the main peaks in ~ ' ( 0to) their experimental positions requires
a A smaller than the gap mismatch. Because of both band-structure and
50
40
n 30
3
YII
w- 20
10
-0
n
2 4 8 8 1 0
Energy [eV]
0 2 4 6 8 1 0
Energy [eV]
FIG. 49. Effects of quasiparticle weights and vertex corrections on the imaginary part of the
dielectric constants for diamond and silicon (from Ref. 564): short dashed line-experiment
(diamond, Ref. 197; Si, Ref. 565); dotted line- LDA, long-dashed line-self-energy corrected
without quasiparticle-weight correction; dash-dotted line -self-energy corrected with weight
factor Z , Z , for each matrix element; solid line- self-energy corrected including vertex
corrections. The close agreement between the solid and long-dashed curve shows that the
strong effect of the weight factor Z,Z, is almost canceled by vertex corrections. Since the dotted
LDA curves look almost like solid lines, note that the LDA curves are farthest to the left and
have the highest peak in both materials.
many-electron effects, the mismatch between the LDA and the experimental
regions of strong optical absorption is not related to the band-gap mismatch
in an obvious way. Figure 48 shows the results by Hughes and SipeSs6for
GaAs and Gap. They used a scissors shift to fit the LDA band gaps to the
experimental gaps, which puts the dominant absorption peaks about 0.5 eV
too high. As discussed in Sections II.9b and IV.24, inclusion of excitonic
effects will strongly improve the absorption spectrum in regard to both peak
position and the peak heights (see Refs. 154, 155, 156,474, 475, and 476). A
systematic first-principles argument for what A to choose is therefore hard
to find and the scissors-shift approach must be considered largely empirical.
Early calculations found good agreement with experiment by using LDA
gaps plus an averaged GWA correction for A, which suggests a close to
ab-initio method for optical calculations.535~536 Table 35 shows the results
of Levine and Allan for the dielectric constant of Si, Ge, GaAs, Gap, AlAs,
and Alp. Leaving the discussion of local-field corrections to the next section,
we see that the scissors approximation brings the too large LDA values to
within 5% of experiment. These good results are obtained in large part
because of the particular approximation made to the band structure. For
LDA N o LF LF EXPT.
GaAs, Levine and Allan found good agreement with experiment by using
the average GWA correction of 0.8 eV from the semirelativistic
pseudopotential calculation by Godby, Schluter, and Sham."' This correc-
tion is added to a semirelativistic pseudopotential LDA band structure
(neglecting Ga 3d core relaxation and spin-orbit corrections). Their LDA
gap is 0.5 eV at the experimental lattice constant, which yields a 1.3-eV gap
after the scissors correction. This gap is 0.2eV too small compared to
experiment, but this underestimation of the gap is what is needed to get the
dominant contribution from the strong peaks in the LDA absorption
spectrum to end up close to the experimental peak structures.
For large band-gap materials (E,,, > 4 eV), GWA-based scissors-oper-
ator corrections frequently lead to an underestimation of E that is as large
as the overestimation by LDA. This effect was first seen by Chen et al. for
diamond and Table 36 shows the results of Chen et al. for diamond,
Sic, and GaN. Here the LDA values again overestimate the dielectric
constant but by only 4-6%. The average GWA corrections to the direct
gaps are of the order of 1-2 eV, which when used as scissors corrections give
dielectric constants well below experiment. Similar results were obtained by
Adolph et aLS6l and by Gavrilenko and B e ~ h s t e d t , ~who
~ ' concluded that
to obtain an E in agreement with experiment a shift of OSA,,, is needed in
Si; 0.45A,,, in Sic; and 0.2A,,, in diamond.
Another important consideration is the effect on the optical response of
the quasiparticle weight Z that multiplies the single-particle Green function
(see Eqs. (1.7), (1.8), (2.7), and Section 11.11). Within the independent-
9797 (1996).
5 6 2 V. I. Gavrilenko and F. Bechstedt, Phys. Rev. B55,4343 (1997).
563 R. Del Sole and R. Girlanda, Phys. Rev. B54, 14376 (1996).
564 F. Bechstedt, K. Tenelsen, B. Adolph, and R. Del Sole, Phys. Rev. Lett. 78, 1528 (1997).
184 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
EFFECTSIN OPTICAL
31. LOCAL-FIELD RELSPONSE
2
(ilexp(i(q + G)*r)Ii')(i'(exp(-i(q + G ) - r ) l i )
Pgd.(q; o)= -
v1
ii'
E~ - E ~ , E~-
, E~ - o + i6
which is an extension to include local-field effects of the Ehrenreich and
Cohen formula for the dielectric constant in a solid."'
In the scissors-operator approach, the conduction-band energies in the
denominator of the Adler-Wiser formula for P&(q; o) are shifted by A.
Without local fields, the static susceptibility x, defined by E = 1 + 47q, is
obtained by taking the limit 4 + 0 in Eq. (6.8): x = - limqd0P&(q,O)/q2.
In numerical implementations, two alternative forms of the Adler-Wiser
formalism have been used: (1) the length-gauge formulation of Sipe and
C O W O ~ ~ based ~ ~ onS matrix
~ ~ ~elements
* ~ ~of ~the position operator, which
was used in Eq. (6.2); and (2) the velocity-gauge formulation developed by
Levine and Allan (see Refs. 535, 536, 541, 567, and 568). These two
formulations differ by the way the matrix elements of type
(cklexp(iq.r)luk) are evaluated.
The length-gauge formulation can be obtained by an expansion in q of
the exponential exp(iq-r):
1 1
- (ck(exp(iq.r)luk) = - (ckluk
V R
+ q)&+,,k* (6.11)
(6.13)
b. Numerical Results
The numerical results below for local-field effects in semiconductors and
insulators are all based on an extension to the Adler-Wiser expression in Eq.
569 0.Pulci, G. Onida, A. I. Shkrebtii, R. Del Sole, and B. Adolph, Phys. Rev B55,6685 (1997).
QUASIPARTICLE CALCULATIONS IN SOLIDS 187
(6.9), which extension treats the local fields in the time-dependent local-
density approximation570 rather than in RPA (time-dependent Hartree
approximation). In this extension, exchange and correlation effects in the
induced electron density are calculated self-consistently within LDA. For
further details, see the references cited in the following discussion.
Columns two and three in Table 35 show typical local-field corrections to
the dielectric constant. Invariably a reduction of E by 5-10% is obtained.
These results are consistent with the effect of local-field corrections on the
absorption spectrum shown in Fig. 39. The weight in the dominant peaks
in E ~ ( W ) shifts somewhat to higher frequencies, causing a reduction in E.
In second-order response, the local fields give a larger correction -gen-
erally reducing x(') but occasionally increasing it. Table 37 shows the effect
of both the scissors correction and the local fields on x(') for those of the
previously discussed materials that lack inversion symmetry, which is a
prerequisite for a nonzero second-order response. The LDA values are too
large by as much as a factor of two, with the severest errors for the smaller-
gap materials. The local-field corrections are of the order of 10-20% and
are negative except for the zzz component in Sic, which shows a 13%
increase. Despite the limited experimental data, we can see a trend similar
to that found for &-the scissors-operator approach works well for the
smaller-gap materials, but an overcorrection is obtained for the larger-gap
materials, here exemplified by Sic and GaN.
''O A. Zangwill and P. Soven, Phys. Rev. Lett. 45, 204 (1980).
"Ref.571; bRef.572.
188 WILFRIED G . AULBUR, LARS JONSSON, AND JOHN W. WILKINS
5 7 1 B. F. Levine and C. G. Bethea, Appl. Phys. Lett 20,272 (1972), as revised by D. A. Roberts,
potential method, used by Chen et al., was developed by Levine and Allan
(Refs. 536, 541, 567, and 568), who showed how to include a scissors
shift in the velocity gauge. Their method is mostly applied in the long-
wavelength limit,540*542 although frequency dependence below the gap can
be obtained.575
Sipe and coworker^^^^,^^^ made a detailed comparison between the
velocity-gauge and length-gauge approaches and found that a simpler
formulation than the Levine-Allan approach can be obtained when working
in the length gauge. The length-gauge formulation was used by Hughes and
Sipe for GaAs and and by Hughes, Wang, and Sipe for GaN and
They calculated both the real and imaginary part of x ( ~ for )
577 J. P. L. Hughes, Y. Wang, and J. E. Sipe, Phys. Rev. B55, 13630 (1997).
578 B. Adolph and F. Bechstedt, Phys. Rev. B57, 6519 (1998).
579 X. Gonze and J.-P. Vigneron, Phys. Rev. B39, 13120 (1989).
190 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
to the optical rotatory power of a-quartz and selenium and found that the
local fields dominate the response. The optical rotatory power is the ability
of crystals with a chiral structure to rotate the plane of polarization of light
passing through. Table 38 shows the effect of local fields on the dielectric
constants, the second-order susceptibilities, and the rotatory powers of
selenium and a-quartz. For E in both materials and for x") in quartz, the
corrections are modest reductions. For x(2) in Se there is a 33% positive
correction. Finally, in the rotatory power the local fields dominate the
response -in a-quartz the local-field correction increases the rotatory
power by a factor of eight, while in Se the local fields change the sign and
more than double the absolute value.
The calculations for the rotatory power cannot be made with the scalar
theory of optical response described above, since the physical effect is a
rotation of polarization and requires a vector description. Therefore, a
generalization to vector response in terms of induced currents and vector
F. F. Martens, Ann. Phys. (Leipzig) 6, 603 (1901), cited in D. E. Gray, American Institute
of Physics Handbook, 2nd ed., McGraw-Hill, New York (1963).
G. W. Day, Appl. Phys. Lett. 18, 347 (1971).
5 8 2 B. F. Levine and C. G. Bethea, Appl. Phys. Lett. 20, 272 (1972).
583 L. Jonsson, Z. H. Levine, and J. W. Wilkins, Phys. Rev. Lett. 76, 1372 (1996).
584 W. Henrion and F. Eckart, Z. Naturforsch. 19A, 1024 (1964).
5 8 5 J. E. Adams and W. Haas, in The Physics of Selenium and Tellurium, ed. W. C. Cooper,
Pergamon, New York (1969), 293.
5 8 6 A. Carvallo, C. R Acad. Sci. 126, 728 (1898), cited in D. E. Gray, American Institute of
Physics Handbook, 2nd ed., McGraw-Hill, New York (1963).
QUASIPARTICLE CALCULATIONS IN SOLIDS 191
32. DENSITY-POLARIZATION
FUNCTIONAL
THEORY
In the static limit, the induced response to a weak perturbing potential can
be obtained from a ground-state calculation and should therefore be
obtained exactly by Kohn-Sham perturbation theory. This fact has always
been the strong, and just, argument made by opponents to the scissors-
operator approach, which artificially adds a nonlocal potential to a theory
that should be able to give the correct low-frequency optical response by
using an appropriate Kohn-Sham potential. However, there is no doubt that
LDA does not suffice, because of the band-gap problem, and there are
indications that even in exact Kohn-Sham theory a large gap mismatch
exists (see Appendix, Section 3c).
a. Divergence of K,,
A solution to this apparent paradox was recently presented by Gonze,
Ghosez, and G ~ d b y . ’ ~They
’ pointed out that in an infinite sample the
exchange-correlation potential in Kohn-Sham theory can be divergent in the
long-wavelength limit. In a system with a gap, within which a finite electric
field can exist and a finite polarization can be induced, great care has to be
applied when the limit of infinite sample size is taken within density
functional theory. The discovery of Gonze, Ghosez, and Godby has impor-
tant consequences for many situations where bulk polarization occurs (see
Refs. 591, 592, 593, 594, 595, 596, and 597), not least for ferroelectrics, but
here we discuss only the necessary revision of linear response theory.
58’ H. Zhong, Z. H. Levine, D. C. Allan, and J. W. Wilkins, Phys. Rev. Lett. 69, 379 (1992);
Let us first state unequivocally that there are no flaws in the original work
by Hohenberg and Kohn3’ and Kohn and Sham.33Their work addressed
finite systems. The only question is how the limit of infinite extent should
be taken. This limit can be approached in two ways: (1) The infinite-volume
limit can be taken for the finite-size theory in a weak constant field, with
careful consideration of surface effects; or (2) the long-wavelength limit can
be taken in the infinite-sample results. Here we adopt the latter approach
and focus on the long-wavelength limit of optical response in the type of
bulk formulations we use in the previous sections.
In the appendix, we define the exchange-correlation potential Vxc(r)=
&Ex,[n] /6n(r), which is the additional potential besides the physical poten-
tial felt by the Kohn-Sham electrons. By “physical potential” we mean the
electrical potential that would act on a weak test charge inside the sample.
The physical potential consists of the external potential and the Hartree
potential. The fictitious Kohn-Sham electrons are, in addition, acted upon
by the exchange-correlation potential, which by construction is necessary to
yield the same density in the Kohn-Sham system as for the electrons in the
real, interacting system.
In linear response, the exchange-correlation potential can be expanded to
first order in the density change. The exchange-correlation kernel K,,, which
is the response function giving the induced exchange-correlation potential
in terms of the induced density, is given by the second functional derivative
of the exchange-correlation energy:
6V,,(r) =
s Kxc(r,r’)&n(r’)dr’; r’) =
KXE(r,
induced potential must be added to the induced Hartree potential. The total
induced potential acting on the Kohn-Sham electrons is therefore
(6.15)
(6.16)
The key point of the work by Gonze, Ghosez, and Godby is that the
exchange-correlation kernel in an insulator should be expected to have a
QUASIPARTICLECALCULATIONS IN SOLIDS 193
l/q2 divergence just like the Coulomb potential in the Hartree term. This
assertion has been shown to be true in model calculations.595 In a different
language, the l/q2 divergence of K,, means that the Kohn-Sham electrons
feel a macroscopic potential different from the physical electrical potential.
In LDA, K,, goes to a constant as q + 0 and is unimportant compared
to the l/q2 divergence of the Coulomb potential. This finiteness of K,, in
LDA is due to the metallic character of a homogeneous electron gas, which
is used to compute the LDA kernel. Hence, despite its success for many
material properties, LDA completely fails to describe the divergence in K,,
expected in insulators.
The discussion above concerning potentials and density can be recast in
terms of electric fields and polarization. Instead of the long-wavelength
component of the density, we can consider the induced polarization P and
a corresponding exchange-correlation electric field felt by the Kohn-Sham
electrons besides the physical field. In general, for longitudinal, scalar
potentials we can define E = - V V = - iqV and 6n = - V . P = -iq.P.
Then an induced exchange-correlation potential, SV,, = K,,6n a 6n/q2, is
equivalent to an exchange-correlation electric field directly proportional to
the induced polarization, Ex,a P. The latter formulation has given this
version of Kohn-Sham theory for infinite insulators the name “density-
polarization functional theory.”
The problem with the band-gap mismatch between the quasiparticle and
the Kohn-Sham spectrum has now been put in a different light than in our
discussion in the previous sections, where the focus was on how to adjust
the gap. Concerning optical response at long wavelengths, the Kohn-Sham
picture is very different from the quasiparticle picture. The quasiparticles are
driven by the physical electrical field and need a correct band structure to
give results close to experiments. On the other hand, the fictitious Kohn-
Sham electrons inherently will not have the correct band structure and must
therefore be driven by a fictitious additional electric field that exactly
compensates for the spectral differences and ensures that the correct density,
or polarization, is induced as guaranteed by density functional theory in the
static limit.
b. Real Materials
The above discussion demonstrates only a qualitative way in which density-
functional theory can give correct response, but there is no recipe yet that
yields a numerical expression for the exchange-correlation field. However, as
discussed by Aulbur et the fact that this field is proportional to the
induced polarization allows us to gain some important insight about its
strength and qualitative behavior.
194 WILFRIED G . AULBUR, LARS JONSSON, AND JOHN W. WILKINS
0.1 1
\
\ \
\ AIN
\
0.0 \
n
k
\
-0.1
I
<
>2 -0.2 \ \ \
Sic w. >
II
&
BASEDON GROUND-STATE
33. FUNCTIONALS DENSITIES
601 J. B. Krieger, Y. Li, and G. J. Iafrate, Phys. Rev. A 45, 101 (1992); 46,5453 (1992); Y. Li,
METHOD Si Ge C GaAs
"Ref. 609; 'Ref. 607; 'Ref. 604, 'Ref. 605; 'Ref. 610; 'Ref. 611 with
Gaussian exchange-correlation hole; #Ref. 61 1 with Lorentzian ex-
change-correlation hole; hRef. 612- Diffusion MC; 'Ref. 613 -Varia-
tional MC.
the EXX values of Kotani and Akai, who used a nonrelativistic KKR band
structure within the atomic-sphere approximation, deviate by up to 0.7 eV
from experiment. The ASCF gaps, which include the band-gap discontinuity
of the exchange potential, are close to the Hartree-Fock values and
overestimate the experimental gaps by 5-8 eV. The good agreement between
experiment and the EXX Kohn-Sham gaps, together with the large values
of Ax (5- 10eV), imply that Ac is equally large but negative.
QUASIPARTICLE CALCULATIONS IN SOLIDS 199
@: (r)@y (r‘)Qj(r)mi(r’)
drdr’. (7.3)
i<j
‘lo A. Seidl, A. Gorling, P. Vogl, J. A. Majewski, and M. Levy, Phys. Rev. B 53,3764 (1996).
‘” L. Fritsche and Y. M. Gu, Phys. Rev. B48,4250 (1993).
‘12 A. J. Williamson, R. Q. Hood, R. J. Needs, and G. Rajagopal, Phys. Rev. B57, 12140
(1998).
‘13 P. R. C. Kent, R. Q. Hood, M. D. Towler, R. J. Needs, and G. Rajagopal, Phys. Rev. B57,
15293 (1998).
‘14 0. Madelung (ed.), Semiconductors-Busic Data, Springer, New York (1996).
‘15 A. Svane, Phys. Rev. B35,5496 (1987).
616 R. Padjen, D. Paquet, and F. Bonnouvrier, Int. J. Quunt. Chem. Symp. 21,45 (1987).
200 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
34. FUNCTIONALS
BASED ON EXCITED-STATE
DENSITIES
That is, that functional is defined for a very large set of densities and, in
particular, for any well-behaved excited-state density. The total energy E [ n ]
is obtained by adding the energy of the interaction with the external
potential: E [ n ] = F [ n ] + 1T/ext(r)n(r)dr.
What is the relationship between the exact energy Ei of the’ ith excited
state and the energy E [ n i ] obtained from the exact density,of the ith excited
state? From the minimization in the constrained search in Eq. (7.4), it
follows that the functional value of E[ni] must be lower or’equal to the exact
excited-state energy: E [ n i ] < E i . E r n , ] is by construction the lowest energy
any wave function can have for the given density ni; however, the wave
function that minimizes E [ n i ] does not have to be equal to the excited-state
wave function, even though they both yield the same density.
Perdew and Levy6” showed that the equal sign in the relation E [ n i ] < Ei
holds if and only if the density ni yields an extremum for E [ n i ] . However,
all excited states do not yield an extremum for ECn,]. Perdew and Levy took
35. TIME-DEPENDENT
DENSITY THEORY
FUNCTIONAL
630 E. Runge and E. K. U. Gross, Phys. Rev. Leu. 52, 997 (1984).
631 E. K. U. Gross and W. Kohn, Adv. Quantum Chem. 21, 255 (1990).
632 G. Vignale and W. Kohn, Phys. Rev. Lett. 77, 2037 (1996).
6 3 3 J. F. Dobson, Phys. Rev. Lett 73, 2244 (1994).
6 3 4 G. Vignale, Phys. Rev. Lett. 74, 3233 (1995).
functional theory with a static LDA kernel to describe the effect of exchange
and correlation on the local fields. The need for a scissors-operator correc-
tion to obtain a reasonable absorption spectrum demonstrates that this
simple kernel is inadequate for the description of semiconductor gaps.
Vignale and K ~ h n developed
~ ~ ’ a time-dependent formalism based on
functionals of both density and current. They showed that in this formalism
local approximations are more promising. Further, the theory can be cast
in a form where the electron gas is described by a continuum fluid model in
which the viscosity and elasticity coefficients can be obtained from par-
ameters of the homogeneous electron gas.639,640On e of the successes of this
method is a derivation of the damping of collective modes in an in-
homogeneous electron gas. Applications to quantum wells give dampings of
the correct order of magnitude, although these applications also reveal some
fundamental difficulties.641
36. MONTE-CARLO
CALCULATIONS
G. Vignale, C. A. Ullrich, and S. Conti, Phys. Rev. Lett. 79, 4878 (1997).
640 S. Conti, R. Nifosi, and M. P. Tosi, J. Phys. Cond. Mat. 9, L475 (1997).
641 C. A. Ullrich and G. Vignale, Phys. Rev. E58, 15756 (1998).
642 W. L. McMillan, Phys. Rev. 138, A442 (1965).
643 D. Ceperley, G. Chester, and M. Kalos, Phys. Rev. E16, 3081 (1977).
644 M. H. Kalos, Phys. Rev. A 128, 1791 (1962).
206 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS
645 L. Mitas and R. M. Martin, Phys. Rev. Lett. 72, 2439 (1994).
646 L. Mitas, Comp. Phys. Comm. %, 107 (1996).
647 L. Mitas, Physicu B 237-238, 318 (1997).
Monte Carlo to calculate the exact form of the exchange correlation hole in
silicon, including the dependence on coupling constant. Other examples are
the work by Knorr and Godby 649 and Engel et a1.,650who analyzed the
Kohn-Sham potential for model semiconductors with the help of exact
densities obtained from diffusion Monte-Carlo simulations. Also, Kent et
uL613 used variational Monte Carlo to calculate both density matrices and
the full band structure of silicon (see Table 39 for their value of the gap).
ACKNOWLEDGMENTS
1. UNIVERSAL FUNCTIONALS
DENSITY
where ri is the coordinate for the ith electron. To simplify the notation, we
denote the first term in Eq. (A.l), the kinetic-energy operator, by 2 and the
second term, the Coulomb-interaction operator, by pcOul.These two terms
are the same in all N-electron systems considered; only the external
potential differs between systems. This invariance of the kinetic and
Coulomb operators in the Hamiltonian is the basis for the idea of universal
functionals.
Following the formulation by Levy61* p 6 1 9 and Lieb,620 we define the
density functional,
where the minimization is over all N-electron wave functions Y that yield
the given electron density n(r). That is, F [ n ] is the lowest expectation value,
or energy, of the Hamiltonian ? + pcoulthat can be obtained by any
N-electron wave function, given that the density is equal to n. This
functional is universal in the sense that it does not include any dependence
on the external potential. The total energy functional E [ n ] for a specific
system is obtained by addition of the interaction with the external potential:
2. THEKOHN-SHAM
SYSTEM
where Y is any N-electron wave function that yields the density n. That is,
To is the lowest kinetic energy any wave function can give under the
constraint that it must yield the density n. The total-energy functional of the
Kohn-Sham system is
When the minimization in Eq. (A.7) is extended to include mixed states and
fractional occupation, T,,[n] has a well-defined functional derivative, and we
obtain, in analogy with Eq. (AS),
(A. 11)
QUASIPARTICLE CALCULATIONS IN SOLIDS 21 1
(A.14)
(A.15)
3. THEBAND-GAP
DISCONTINUITY
Let us first define the ionization energy I and the electron affinity A for the
N-electron system:
I = E N - 1 - EN; A = EN - EN+l, (A.17)
where we have defined Axc = cN+l , N + - E N , N + 1, that is, the energy differ-
ence between the highest occupied Kohn-Sham orbital in the (N + 1)-
electron system and the lowest unoccupied Kohfi-Sham orbital in the N-
electron system. This difference is in general finite659*660 and believed to be
of the order of eV in semiconductors and insulators.6 The last two terms in
Eq. (A.19), E N , N + I - E N , N , give the band gap in the N-electron Kohn-Sham
system.
As the notation suggests, Axc is due to the exchange-correlation potential
V,, in Eq. (A.15) and is caused by a discontinuous jump in Vxcon addition
of an electron:659*660v661
where the explicit r dependence indicates that the difference between the
exchange-correlation potentials in the (N + 1)-electron system and the
N-electron system is a position-independentconstant Axe. Note that since the
potential only changes by a constant, there is no difference between the
+
Kohn-Sham wave functions in the (N 1)-electron system and the N-
electron system.
The discontinuity Axc originates solely from the exchange-correlation
potential. The external potential is by construction unchanged; the Hartree
potential has a simple, direct dependence on the density, which only changes
infinitesimally, while the discontinuity in the kinetic energy To is exactly
equal to the Kohn-Sham gap E N , N + - E N , N . This discontinuity in the kinetic
~~~~
659 J. P. Perdew and M. Levy, Phys. Rev. Lett. 51, 1884 (1983).
660 L. J. Sham and M. Schliiter, Phys. Rev. Lett. 51, 1888 (1983).
6 6 1 Strictly, similarly to the chemical potential, the exchange-correlation potentials at integer
N must also be defined by their limits as the electron number approaches N from below in a
system with fractional occupation of the Nth Kohn-Sham orbital.
QUASIPARTICLE CALCULATIONS IN SOLIDS 213
4. THEEXCHANGE-CORRELATION
HOLE
n(r) =N
I IY(r, rz,. . . ,rN)12dr2... dr,.
Note that the total integral over both coordinates in n(r, r’) is equal to twice
the number of unique electron pairs.
The exchange-correlation hole is described by the function g(r, r’), which
is defined through
n(r, r’) = n(r)n(r’)(l + g(r, r’)). (A.23)
Formally, by applying the theory for fractionally occupied orbitals to the noninteracting
Kohn-Sham system, one can prove that the discontinuity in the kinetic energy To as the
electron number passes through N is exactly equal to the Kohn-Sham band gap. The intuitive
argument in the text relying on the infinitesimal difference between the Kohn-Sham orbitals in
the N- and (N + 1)-electron systems is not necessary.
214 WILFRIED G. AULBUR, LARS JONSSON, A N D JOHN W. WILKINS
(A.25)
(A.26)
and the sum rules for the other two g components are obtained by
interchange of 7 and 1 in Eqs. (A.25) and (A.26).
These two relations can be understood by extending the motivation for
the sum rule for spinless particles to fermions with N f and N1 particles of
each spin, respectively. If the particle in r is assumed to have spin up, there
are N T - 1 more particles with spin up in the distribution described by
nf(r’)(l + gTT(r,r’)), which gives Eq. (A.25). However, there are still N1
particles of opposite spin left in the distribution nT(r‘)(l + gT1(r,r‘)), which
gives Eq. (A.26).
QUASIPARTICLE CALCULATIONS IN SOLIDS 215
From this equation we see directly that g = 0 gives the Hartree energy,
while,the exchange and correlation effects are described by g. For a system
of noninteracting electrons described by a single Slater determinant, there
are no correlation effects, only exchange. The sum rule for g describes the .
important role of exchange to subtract out the self-interaction piece from
the Hartree term, which piece is unphysically included in the Hartree term
when defined in terms of an integral over the total densities. For spin
one-half electrons, gfT describes both exchange and Coulomb correlation
among the equal-spin electrons, and the sum rule for gTTensures that the
self-interaction is subtracted out, while gT1describes Coulomb correlation
among electrons with opposite spin.
5. COUPLING-CONSTANT
AVERAGES
(A.28)
Denoting the ground-state wave function for a given h by Y,, the total
ground-state energy E , is
E, = (Ynl~IY,) +~ ~ , l ~ ~ c o u l K +
( r ,b o~ ( W .
l ~ (A.30)
(A.31)
which after integration over h from 0 to 1, together with the relations in Eq.
(A.29), yields
FCnol = Torn01 + ~ ~ ~ l ~ c o u l l ~ * ~ ~ ~
(A.34)
In the interacting system the functional F[nO] is equal to the sum of the
QUASIPARTICLE CALCULATIONS IN SOLIDS 217
kinetic energy, the Hartree energy, and the Coulomb energy of the ex-
change-correlation hole g, while the Kohn-Sham system is described by the
kinetic energy To[nO],the Hartree energy, and the Coulomb energy of the
averaged exchange-correlation hole @.Hence, the effect of the coupling-
constant average is to incorporate the difference between To and the total
physical kinetic energy into the Kohn-Sham exchange-correlation functional
E,. The Kohn-Sham construction is made specifically to obtain a set of
single-particle equations by which the exact density can be found. However,
the partitioning of F[n] into a sum of To[n],E H [ n ] , and a remainder E,,[n]
is not a physical partitioning, so the exchange-correlation energy contains
both ,kinetic and exchange-correlation energy.
6. LOCALAPPROXIMATIONS
(A.35)
664 S. H. Vosko, L. Wilk, and M. Nusair, Can. J. Phys. 58, 1200 (1980).
665 J. P. Perdew and A. Zunger, Phys. Rev. B23, 5048 (1981).
666 J. C. Slater, Phys. Rev. 81, 385 (1951); ibid. 82, 538 (1951).
667 E. Fermi, Z. Phys. 48, 73 (1928).
668 L. H. Thomas, Proc. Curnb. Phil. SOC.23, 542 (1927).
218 WILFRIED G. AULBUR, LARS JONSSON, AND JOHN W. WILKINS