Performance of Music Algorithm
Performance of Music Algorithm
Abstract-In this paper, we present an asymptotic performance the variance of estimated DOA’s for the ESPRIT algorithm
analysis of three subspace-based methods for direction of arrival using the data matrix.
(DOA) estimation-the ESPRIT algorithm using second-order The performance of ESPRIT has also been analyzed by Rao
statistics, the higher order ESPRIT algorithm using fourth-order
cutrmlants, and the virtual ESPRIT (VESPA) algorithm using and Hari [ 1I], who derived the mean squared error (MSE)
fourth-order cumulants. We examine the least-squares version for estimated DOA’s of a second-order ESPRIT algorithm.
of these algorithms, derive the expressions for the asymptotic They assumed a linear equispaced array and jointly complex
variance of the estimated DOA’s, and use specific examples to Gaussian sources in additive white Gaussian noise. Their
compare the relative performance of the algorithms. Finally, analysis is done for the least-squares version of ESPRIT.
we present Monte Carlo simulations to validate the theoretical
anallysis. However, they show that the asymptotic MSE’s of the LS-
ESPRIT and TLS-ESPRIT are the same. Our work in this
paper follows that of Rao and Hari.
I. INTRODUCTION While the analysis in [ 111 is valid for only the second-order
ESPRIT and for only Gaussian sources, the development in
I N THE last 15 years, subspaced-based methods for es-
timating the directions of arrival (DOA’s) of far-field
sources impinging on an array of sensors have become very
this paper is valid for Gaussian and non-Gaussian sources in
the second-order case. Furthermore, the analysis here encom-
passes the fourth-order ESPRIT [2] and the recently developed
popular. In particular, the MUSIC algorithm and its deriva-
fourth-order VESPA [3] algorithms.
tives have received much attention. Extensive simulations
This paper is organized as follows. Section I1 summa-
and performance analyses have been presented by several
rizes the least-squares version of the second-order ESPRIT,
authors showing the benefits of MUSIC in array processing
fourth-order ESPRIT, and VESPA algorithms. The asymptotic
estimation problems. One disadvantage of these subspaced-
variance of the DOA estimates is then derived in Section 111.
bawd methods, including MUSIC, is that information (e.g.,
The analysis common to all three ESPRIT algorithms is first
gain, phase, relative location) about the array of sensors must
presented and then the analysis for each specific algorithm is
be known a priori. The ESPRIT algorithm, developed by Roy
given. In Section IV, Monte Carlo simulations and numerical
et al. 191, [12], lessens the a priori information requirement
evaluations of the equations derived in the previous section are
by using two identical arrays of sensors.
provided to show the match between analysis and simulation.
The performance of second-order ESPRIT and its derivative
This paper concludes in Section V. Some relevant equations
algorithms has been analyzed by several authors. Ottersten
are derived in the Appendix.
et al. [SI derived the asymptotic distribution of the error in
estimated DOA’s for the total least-squares ESPRIT (TLS-
ESFRIT) and compared the variance to the Cramer-Rao 11. SUMMARY
OF ESPRIT ALGORITHMS
Bound. Mathews and Zoltowski [7] derived the asymptotic In this section, we summarize the least-squares (LS) ES-
variance for the uniform circular array ESPRIT, which is a 2-D PRIT algorithms using second-order statistics and fourth-
angle estimation algorithm. Gavish and Weiss [5] analyzed the order cumulants. To avoid ambiguity, we refer to the second-
virtual interpolated array ESPRIT algorithm, which is based order ESPRIT as ESPRIT2, the higher order ESPRIT as
on interpolated arrays [4]. Li et al. [6] derived expressions for ESPRIT4, and the virtual ESPRIT using fourth-order cu-
mulants as VESPA. The ESPRIT2 and ESPRIT4 algorithms
Manuscript received July 8, 1995; revised April 9, 1996. This work was require two subarrays of sensors that are identical but separated
supported by the National Science Foundation under Grant NSF MIP-90- by a known displacement vector A with magnitude A. The
17221, and in part by the University of California MICRO program and
Applied Signal Technology, Inc., and thc Office of Naval Research under outputs of the subarrays are modeled as
Contracts N00014-9 1-J- 1602 and N00014-95- 1-09 12. The associate editor
coordinating the review of this manuscript and approving it for publication x(t) = A s ( t ) + n,(t) (1)
was 11. A. Lee Swindlehurst.
The authors are with the Department of Electrical and Computer En- Y(L) = A W t ) + n,(t) (2)
gineering, University of Califomia, Davis, CA 95616 USA (e-mail: fried-
Ian @ece.ucdavis.edu). where A is the m x d array response matrix associated with
Publisher Item Identifier S 1053-587X(96)07127-9. each of the (identical) subarrays, s ( t ) is the d x 1 vector of
1053-587X/96$05.00 0 1996 IEEE
2538 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 44, NO. IO, OCTOBER 1996
narrowband zero-mean signals, n,(t) and n,(t) are m x 1 Decomposing E, into two m x d matrices E, and E,, we get
Gaussian noise vectors of the X and Y subarrays, respectively,
and @ is a d x d diagonal matrix of phase delays between the
doublet sensors for the d signals. The matrix $ is given by
(1 1)
$42 ...
(3) with
... e34d
R{E,) =R{E,)
where =R{A}. (12)
WOA sin ek. Since the range of E,Tand the range of E, are equal, the rank
4% = C
(4) of the matrix [E, E,] is d, which implies that there exists
a 2d x d matrix
tlk is the direction of arrival of the kth source, W O is the carrier
frequency, and c is the speed of propagation. (13)
Now we combine the subarray outputs to get
= E,P,+EY P Y
-
=A R , , A ~ + g2xn (8)
where E: denotes the pseudoinverse of E,. Combining (15)
and (16), we get
where R,, is the source covariance matrix. The 2m - d ATF = A+T. (19)
smallest generalized eigenvalues of (Rzz,E,) are equal to
0'. The d generalized eigenvectors corresponding to the d Since T is invertible and A has full rank, it follows from
largest generalized eigenvalues are used to form ( 19) that
has an extra requirement that the signals in s ( t ) are non- In a manner similar to the second-order ESPRIT, we can
Gaussian. Because the signals are assumed to have zero mean, formulate a combined output as in (5)-(7). Doing this, we can
the fourth-order cumulant is given by get the fourth-order cumulant matrix
&4(kl,
A
-
k2;l1, l 2 )
c,, = -
ArP
- Cum rq, zlc, , .T2, Zk21
- E{z,*l%1}E{~,*2Zn,}
= E{z,*lzZIilzf2ZkL}
- E { z , * l ~ k 2 } ~ { 4 2 ~ k-l lE { Z k , Z k 2 } E { Z , * l ~ ; 2 ) . (21)
From this point, the fourth-order ESPRIT algorithm proceeds
We shall use the notation ~ ( k . 1 k.2; , 11, I z ) , where as in the second-order case, with the covariance matrix RZz
k l , k2. 11. 12 are the subscripts of the array sensor elements, replaced by the cumulant matrix C,, in (8).
to denote the fourth-order cumulant whenever there is no
conl’usion about which sensor element (of which subarray) C. Virtual ESPRIT
is being referred.
A fourth-order ESPRIT algorithm has been proposed by Recently, Dogan and Mendel [3] have given a novel inter-
Chizing and Nikias [2] for the case of equispaced sensors and pretation to cumulants for narrowband array processing. Basi-
statistically independent sources. The steering matrix is given cally, they have shown that, for independent sources, a fourth-
1
order cross-cumulant can be interpreted as a (second-order)
by
cross-covariance. The interesting insight of this interpretation
1 1 ...
is that for some cumulants, the interpreted cross-covariance is
A= “’w’ eJW2 ... plWd
. (22)
a “virtual” cross-covariance. That is, some cross-covariances
are between data received at real sensors and “virtual” sensors
pl(m-1) p 2 ( 7 7 - 1 ) . .i . pd(m-1)
1 1 (i.e., sensors that don’t physically exist). Dogan and Mendel
have applied the idea of virtual sensors to several array
In the second-order ESPRIT algorithm, it is not necessary to
processing problems, such as implementing a virtual ESPRIT
know the form of the steering matrix, only that there are two
algorithm (VESPA) and suppressing non-Gaussian additive
identical arrays and that the displacement vector A is known.
noise. Here, we give a brief summary of the interpretation.
However, for the fourth-order ESPRIT algorithm, when the
See [3] for more details.
cteeiing matrix is of the form (22), then the cumulant matrices
Single Source: First assume that we have ,a single (;on-
(23) and (24) (shown at the bottom of the page) have the forms
Gaussian) source with a propagation vector k, where k =
C,, =AI’AH (25) A-&, + k&, (a’, and SY are unit vectors along the z and y
C,, =AraHAH (26) axis, respectively), with variance af and kurtosis ys.
Referring to Fig. 1, the shaded circles denote real sensors
and the clear circles denote a virtual sensor. Let the reference
doublet be and 211 (both real sensors), then the signals
received at the real sensors are, for example
/ and v(t) is
where a;j is the response of the ith sensor to the jth source.
Now consider the fourth-order cross-cumulant
//
Fig. I. Virtual sensor interpretation of higher order cumulants. The shaded .;l.T(t) exp [ j ( G ' &k)]
circles represent real sensors and the clear circles represent virtual sensors.
The source approaches the anay with propagation vector 1;. + ak;(t) exp [ j ( G 2 . ,I)&
UllSl(t) exp [-j(21 . I),&
where a; is the response (i.e., gain and phase) of the ith sensor. + a l 2 s z ( t ) exp [-&2 . &,)I
The noiseless cross-covariance between T , ( t )and vl ( t ) is = lall12aElallr.sl exp [-.j(G1 . A)]
exp [-&
. CXP [j(h L)] . &,)I
E{r;(t)w/(t)}= E{nZs*(t)cxp [.j(G & k ) ] '
. exp [ - j ( G . dll)]. (31) Comparing (34) and ( 3 3 , we see that the cumulant and
covariance are no longer proportional as is the case in (33).
Now consider the fourth-order cross-cumulant However, we can still intemret (35) as a covariance with
I ~,
This means that for several sources, the cumulant of the sum
'This equation shows that a fourth-order cumulant of the output of sources is simply the sum of the cumulant of each source.
from real physical sensors is proportional to a cross-covariance Given an array of sensors and one additional sensor, we
of output from a real and a virtual sensor. In other words, we can have, using this interpretation, a virtual copy of the first
have an interpreted array that has a larger aperture than the array, allowing us to implement the VESPA algorithm. The
physical array. additional sensor has identical characteristics to the reference
Multiple Sources: Assume now that we have two statisti- sensor and the pair forms a doublet. For example, if we have an
cally independent (non-Gaussian) sources impinging on an array of four sensors and one additional sensor (five total), the
array. In this case, the noiseless cross-covariance between ~ ( t )cumulant matrix used to implement the least-squares VESPA
Y U t N AND FRIEDLANDER ESPRIT ALGORITHMS 2541
in this case would be Furthermore, the left and right eigenvectors are orthogonal
[13] and we can normalize the left eigenvectors so that
(37)
where we obtain (38) and (39) (shown at the bottom of the Equations (40) and (41) imply that
page), where c ( Z , xj,xi, 21) represents the cumulant of
the arguments z:, x j , xi, 21. It is shown in [3] that, with A j = q;Fvi. (42)
five physical sensors, VESPA can be implemented with two
Under most circumstances, the matrix F has to be estimated
subiirrays of five sensors in each subarray instead of only four
using finite data, so that, from (18)
sensors in each subarray. In this case, V,, and V,, would
be 5 x 5 matrices instead of the 4 x 4 matrices shown in (38)
F = E$Ey. (43)
and (39). This larger virtual subarray is used for the examples
in Section IV. The VESPA algorithm now proceeds as in the An error SF = F - F in estimating F will cause an error
second-order case, with R,, replaced by V,, in (8). SA; = & - A,. As a first-order approximation, the error SA
From a different standpoint, we can view VESPA as using can be written as
the available cumulants in a more efficient way. For instance,
in an m-element array of sensors, there are m4 possible SA; N q;SFV;. (44)
cumlulants. ESPRIT4 requires two rn-element arrays, which
means there are 16m4possible cumulants. ESPRIT4 uses only It follows from (18) that we have the first-order approxima-
4m" out of 1 6 7 ~ 1possible
~ cumulants. VESPA, on the other tions
hand, uses 4m2 out of m4 possible cumulants.
(E, + SE,)(F + SF) = E, + SE ?I> (45)
E,SF z bE, - SE,F (46)
I [I. ASYMPTOTIC
PERFORMANCE
ANALYSIS
OF ESPRIT
and
In this section, we derive the asymptotic variance of the 6F zE$SE, - EFGE,F. (47)
estimated DOA's. The derivation in this paper is along the
line:; of the first-order analysis done by Rao and Hari [ 111. In Using (47) in (44) and noting that I A , L / =
~ 1, we get
that paper, Rao and Hari derived the asymptotic variance of the
estimated DOA's for the second-order LS-ESPRIT algorithm SA, RZ q;E,#(SE,v;- SE,FV;)
under the assumption of Gaussian signals. Furthermore, they = q;E,#(SE,v; - A;SE,v;)
show that the asymptotic variance of LS-ESPRIT is equal to
that of the total LS-ESPRIT algorithm, whether the signals = qzE$(W2 - ALW1)~ESVZ
are Gaussian or not. The assumption of this paper is that = - Aiq;E,#(Wl - ATW~)SE,?V; (48)
the signals may be non-Gaussian and, hence, the variance
expressions of [ l l ] in general do not apply. However, the where
methodology of [ 111 is followed here.
In LS-ESPRIT, we compute a matrix F using (18) and then w1 = [I, On,]; (49)
perform an eigendecomposition to get its associated eigenval- wz = [ O m In,] (50)
ues. Let A, be an eigenvalue of F, v, be the corresponding
eigenvector, and q, be the corresponding left eigenvector, such and
that
= Es- E,s. (51)
ainalysis, we use the "=" sign even though many of the ex-
pressions are simply approximations. From (48), the variance
of A, 1s
This expression is common to all three algorithms consid-
E{/6X,/2}=q7E,#(W1- X:W~)E{SE,V,V,NSE$}
ered in this paper. The difference among the three ESPRIT
. (W, - ~:W2)H(q7E$)H algorithms is in the statistic used for estimation. ESPRIT2
=q,E$(W1 - X,*W,) uses a covariance matrix Rz2;ESPRIT4 uses a fourth-order
cumulant matrix C z z ; VESPA uses a different fourth-order
cumulant matrix Vzz.Each of these matrices produces a set
of eigenvectors {SI,. . . , sd} that has different statistical prop-
erties from the eigenvectors produced by the other matrices. In
other words, the covariances of the eigenvectors, E{SsgSsf}
and E{bs,SsZ}, which appear in (52) and (53), are unique
to each algorithm. Therefore, in the following subsections, we
Lg=1 h = l
derive these covariances for each algorithm.
. ( W i - A:W2)H(qtE$)H (52)
A. Covariance of Eigenvectors-ESPRIT2
where 6s, is the gth column of the matrix bE, and v, is
The first-order Taylor series expansion of the eigenvectors
t'he gth element of the vector v,. The quantities sil and v, h
(corresponding to distinct eigenvalues) of a 2m x 2m matrix
are similarly defined.
Rzz [14] is
Following the above steps in an analogous manner, we can
ad50 show that the unconjugated variance is
Lg=1 R=1
- real{E{(SA,)2}(A5)2}]. (58)
If B, is given in degrees instead of radians, then
r The second-order and fourth-order moments, denoted by p2 ( )
and pug( ), respectively, of the received data z ( t ) are given by
YUI:N AND FRIEDLANDER ESPRIT ALGORITHMS 2543
S ? ~ a I S g , n 2 S ~ . b l S hbz
Ifs n f h ni=l n a = l b l = l by=l
1=1 n=l
' q ( C z , ?- c , z ) n l n 2 ( c z z- C z z ) b 1 b 2 }
Because of the inherent symmetry in the covariance matrix szsnT
R,,, (63) can be used to compute E{(R,, - R,Z)alu2(Rzz - (69)
Rzz)ljlb2}by swapping the indices bl and b2.
(ay - a l ) ( a h- "1 '
E { 6s, ssf }
(68)
and its asymptotic covariance is given in (81) of the Appendix.
That is, (81) is used to compute E{(C,, - C,z)nlc,2(Czt-
C-IL);),b2}in (67). Note that the indices (al.a ~ )which
, refer
2544 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 44, NO. IO, OCTOBER 1996
\%SPA5
Subarray 1
r2 3 0 v2
WSPA5
Subarray 2
VEsPA8
rT
Subarray 1
r2
r3 a
"\ v5
VESPAB
Subarray 2
r3 0 0 V3
r4 0 0 v4
0
vs 1
Fig. 3 . Subarrays of sensors used in Examples 1 to 2 for VESPA8. Subarray
1 consists of rl -rs while subatray 2 consists of VI-VK. The shaded circles
represent real sensors while the clear circles represent virtual sensors. There
is an overlap of four real sensors in subarray 2. A source approaches the
array at an angle 8 .
i
VESPAB: dash-dot IO00
ESPRITP: dot ixxx
\
Subarr Subarray 2
0
'
IY x
20 40 60 80 100 120 140 160
I
180
DOA of Source I (Degrees)
Fig. S . Results Tor Example I. The standard deviation of estimated DOA for Subarr
source 2, plotted as a function of the DOA of source 1 for two linear subarrays
with four sensors each. The DOA of source 2 is fixed at 40'. The dash and
solid lines are the theoretically predicted results and the discrete points are
simulation results. Fivc thousand snapshots are used.
+
4r
1
(b)
Fig. 7. Array configuration used i n [ 3 ] . Configuration in (a) is used for
VESPAS and (b) i s used for ESPRIT2. The shaded triangles and X's represent
real sensors, and the clear triangles represent virtual sensors.
Subarray 1
/ I . / . \
Fig. 6. Results for Example 2. The standard deviation o f estimated DOA for
source I , plotted as a function of the input SNR. Source 1 and 2 impinge on
the array at 20" and 40'. respectively. Two linear arrays with four sensors
each. SO0 snapshots.
Fig. 8. Array configuration used for VESPAS in [ 3 ] .Five sensor elements
plane. Thus, a steering vector has the form' arc common to both subarrays. The clear triangles represent virtual sensors.
Std Deviationfor Estimated DOA of Source 1, T-Array Std Deviation for Estimated DOA of Source 1, T-Array
-
g14
e!
a
Q12
c
5 1
g
n
$08
V
go6
04
02
0 5 10 15 20 25 30
Input SNR (dB)
Fig. 9. Results for Example 3. The theoretical standard deviation of esti- Fig. 11. Results for Example 4. The standard deviation of estimated DOA
maled DOA for source 1, plotted as a function of the DOA of source 1. The for source 1, plotted as a function of the input SNR for the inverted T-array
DCIA of source 2 is fixed at 40’. ESPRIT2 uses two inverted T-arrays with configuration. The DOA of source 1 and 2 are fixed at 20’ and 40°,
four sensors each. VESPAS uses five real sensors and VESPAI uses eight rcal respectively. The dash and solid lines are the theorctically predicted rcsults
sensors. The dash and solid lines are the theoretically predicted results and and the discrete points are simulation results. Five hundred snapshots are used.
the discrete points are simulation results. Five thousand snapshots are used.
performance of an algorithm. However, it is unclear which
Std Deviation for Estimated DOA of Source 2, T-Array set of cumulants would yield the best performance.
2.5
I It is noteworthy that the simulation results in [3] show that
1 the performance of VESPA exceeds that of ESPRIT2 for the
/ I
case of BPSK signals in white Gaussian noise. It is possible
that BPSK signals, which takes on fewer possible values than
a 4-QAM signal, can be viewed as being “less Gaussian” than
VESPAB: solid I )(yx 4-QAM signals. Hence, BPSK signals may yield better results
VESPA8: dot I ooo for the higher order techniques, which require non-Gaussian
ESPRIT2 dash / *’* signals.
V. CONCLUSION
In this paper, we derived expressions for the asymptotic
variance of DOA estimates for three LS-ESPRIT algorithms
under a non-Gaussian signal assumption. The formulas derived
are used to evaluate the performance of the second-order
ESPRIT algorithm, the fourth-order ESPRIT algorithm, and
the VESPA algorithm. Monte Carlo simulations are presented
Fig. IO. Results for Example 3. The theoretical standard deviation of esti-
to verify the theoretically predicted performance. From the
mated DOA for source 2, plotted as a function of the DOA of source I . The examples we used, where the additive noise is white Gauss-
DOA of source 2 is fixed at 40’. ESPRIT2 uses two inverted T-arrays with ian, it appears that the second-order ESPRIT has the best
four sensors each. VESPAS uses five real sensors and VESPAR uses eight real
sen:,ors. The dash and solid lines are the theoretically predicted results and
performance.
the discrete points are simulation results. Five thousand snapshots are used.
APPENDIXA
COVARIANCE OF FOURTH-ORDER
SAMPLE CUMULANTS
algorithms. We emphasize that when the noise is colored with In this appendix, we derive approximate expressions for the
unknown covariance matrix, it is better to use the higher covariance of fourth-order sample cumulants. The expressions
order methods. It is interesting that the VESPAS algorithm for zero-mean and symmetrically distributed signals are de-
performs just as well as ESPRIT2 and better than ESPRIT4 rived in [IO]. We extend that derivation to zero-mean and
in 1he region of small DOA separation, even though the latter possibly nonsymmetrically distributed signals.
algorithms require more sensors. While the performance of The fourth-order cumulant and sample cumulant are given
VESP.48 in most cases is better than VESPAS, VESPA8 still by
trails ESPRIT2. Because ESPRIT4, VESPAS, and VESPA8
use different cumulants to form their respective cumulant 64(kl? k2; 11, l 2 )
matrices, the difference in performance suggests that using = I*.4(k1? k 2 ; 11, l 2 ) - VZ(k1; ll)!-LZ(k2; 12)
a different set of cumulants will improve or degrade the - P 2 ( h ; h ) / L z ( k z ; 11) - P2(h, b ) P 2 ( h , l2) (75)
2548 IEEE TKANSACTIONS ON SIGNAL PROCESSING, VOL. 44, NO. in, OCTOBER 1996
2 2
i=l 7=1
n n
Hence
n n
.m(m3-,; n 3 - q ) ~ 4 ( kmp;
, I,, n4)
- [P4(kl, k 2 ; 11, 1 2 ) - 2P2(kl; / 1 ) p 2 ( h ; l 2 )
- 2Pz(h; / 2 ) P z ( h ; 1111
. [1*.4(ml,m2; n l , n ~-)2 ~ 2 ( m 1n; 1 ) ~ 2 ( m 2.z);
- 2 P 2 ( m ; nz)CLz(m2; n1)] (83)
which is the equation derived in [lo]. Given the true model
of the measurements, all the moments can be computed. A
method for computing the moments for statistically indepen-
dent QAM signals in spatially white Gaussian noise is given
in [lo]. Alternatively, these moments can be estimated when
only measurements are given.
APPENDIXB
p=l q=l
COVARIANCE OF EVEN-ORDER
MOMENTS
' [P4(kl, k2, m p ; n q ) We assume that the received signals have zero first-order
moments. We also assume that the signals emitted at dif-
- P2(kl> k2)P2(mp; %)I ferent times by the same source or by different sources are
f p2(llr /2)pZ(ml, m2)[p4(k1, k2; 121, n2)
statistically independent. The even-order moments are given
- P2(h, kz)P2(nlr m)] by
+ P2(11, l2)p2(nl, n2)[P4(kl, k 2 , ml, m2)