DA-MUSIC Data-Driven DoA Estimation Via Deep Augmented MUSIC Algorithm
DA-MUSIC Data-Driven DoA Estimation Via Deep Augmented MUSIC Algorithm
Document license:
TAVERNE
DOI:
10.1109/TVT.2023.3320360
Document Version:
Publisher’s PDF, also known as Version of Record (includes final page, issue and volume numbers)
General rights
Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners
and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights.
• Users may download and print one copy of any publication from the public portal for the purpose of private study or research.
• You may not further distribute the material or use it for any profit-making activity or commercial gain
• You may freely distribute the URL identifying the publication in the public portal.
If the publication is distributed under the terms of Article 25fa of the Dutch Copyright Act, indicated by the “Taverne” license above, please
follow below link for the End User Agreement:
www.tue.nl/taverne
Abstract—Direction of arrival (DoA) estimation of multiple arrival (DoA) estimation of multiple, broadband, and possibly
signals is pivotal in sensor array signal processing. A popular coherent signal sources plays a key role in a wide range of
multi-signal DoA estimation method is the multiple signal clas- applications, including radar, communications, image analysis,
sification (MUSIC) algorithm, which enables high-performance
super-resolution DoA recovery while being highly applicable in and speech enhancement [2], [3], [4]. Over the last decades,
practice. MUSIC is a model-based algorithm, relying on an ac- a multitude of different Direction of arrival (DoA) estimation
curate mathematical description of the relationship between the algorithms have been proposed, and the problem is an active
signals and the measurements and assumptions on the signals area of research [5], [6].
themselves (non-coherent, narrowband sources). As such, it is A leading scheme employed in many DoA applications
sensitive to model imperfections. In this work, we propose to
overcome these limitations of MUSIC by augmenting the algo- is the popular multiple signal classification (MUSIC) algo-
rithm with specifically designed neural architectures. Our pro- rithm [7], which can provide asymptotically unbiased estimates
posed deep augmented MUSIC (DA-MUSIC) algorithm is thus a of the number of incident wavefronts present, their approxi-
hybrid model-based/data-driven DoA estimator, which leverages mate frequencies, and their DoAs. multiple signal classification
data to improve performance and robustness while preserving the (MUSIC) and other classic approaches, e.g., conventional beam-
interpretable flow of the classic method. DA-MUSIC is shown to
learn to overcome limitations of the purely model-based method, forming [8] and MVDR beamforming [9], are based on knowl-
such as its inability to successfully localize coherent sources as edge of the underlying statistical model; this dependency in-
well as estimate the number of coherent signal sources present. duces several key limitations. Among the drawbacks of the
We further demonstrate the superior resolution of the DA-MUSIC model-based (MB) approaches is the inherent limitation of the
algorithm in synthetic narrowband and broadband scenarios as signal model from which they are derived, which considers
well as with real-world data of DoA estimation from seismic signals.
narrowband signals. This results, for example, in the inability to
Index Terms—DoA estimation, MUSIC, model-based deep consistently estimate the DoA of correlated (coherent) signals,
learning. as well as a failure to resolve closely spaced signals with an
insufficient number of samples or a low signal-to-noise ratio
I. INTRODUCTION (SNR) [5], [7].
OURCE separation, localization, and tracking are crucial To extend narrowband models to broadband DoA estima-
S tasks in sensor array processing. In particular, Direction of tion, various extensions and alternative approaches have been
explored [10], because when the impinging signals are broad-
band, multiple frequency ranges can carry different information
Manuscript received 11 January 2023; revised 14 August 2023; accepted 7
September 2023. Date of publication 28 September 2023; date of current version regarding the DoA angles at hand. Generally, these extensions
13 February 2024. This work was supported by the Spectralligence, EUREKA IA from narrowband to broadband can be subdivided into coher-
Call, ITEA4 under Project 20209. The work of Tirza Routtenberg was supported ent and incoherent processing methods. In coherent processing
by the PAZY Foundation, Israel. The review of this article was coordinated by
Prof. Zhiguo Shi. An earlier version of this work was presented at the 2022 methods, the covariance matrices of the observations in different
IEEE International Conference on Acoustics, Speech, and Signal Processing frequency bins are coherently combined using certain transfor-
(ICASSP) as the paper [DOI: 10.1109/ICASSP43922.2022.9746637]. (Corre- mation matrices. Most coherent techniques are based on the
sponding author: Julian P. Merkofer.)
Julian P. Merkofer was with the ETH Zürich, 8092 Zürich, Switzerland. He is coherent signal subspace (CSS) concept [11], which focuses
now with the Electrical Engineering Department, Eindhoven University of Tech- the transformations of the covariances at different frequencies
nology, 5612 AZ Eindhoven, The Netherlands (e-mail: [email protected]). into a surrogate narrowband model, yet utilizes different trans-
Guy Revach is with the D-ITET, ETH Zürich, 8092 Zürich, Switzerland
(e-mail: [email protected]). formation matrices. Incoherent broadband processing methods
Nir Shlezinger is with the School of ECE, Ben-Gurion University of the combine DoA estimates obtained separately for each frequency
Negev, Beer Sheva 8499000, Israel (e-mail: [email protected]). bin [12].
Tirza Routtenberg is with the School of ECE, Ben-Gurion University of
the Negev, Beer Sheva 8499000, Israel, and also with the ECE Department, The recent success of data-driven (DD) deep learning (DL)
Princeton University, Princeton, NJ 08540 USA (e-mail: [email protected]). across a wide range of disciplines gave rise to neural net-
Ruud J. G. van Sloun is with the Electrical Engineering Department, Eind- work (NN)-aided DoA estimators [13]. The works [14], [15],
hoven University of Technology, 5612 AZ Eindhoven, The Netherlands (e-mail:
[email protected]). [16], [17] implemented model-agnostic DoA estimation us-
Digital Object Identifier 10.1109/TVT.2023.3320360 ing a dense NN, a convolutional neural network (CNN), a
0018-9545 © 2023 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See https://www.ieee.org/publications/rights/index.html for more information.
Authorized licensed use limited to: Eindhoven University of Technology. Downloaded on February 14,2024 at 14:56:09 UTC from IEEE Xplore. Restrictions apply.
2772 IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 73, NO. 2, FEBRUARY 2024
sparse-connected CNN, and a U-Net architecture, respectively. data to enable MUSIC to successfully operate in such scenarios,
Other methods [18], [19] utilize the knowledge of the spatial which is the focus of this work.
covariance matrices, where [18] trained a CNN based on a We introduced deep augmented multiple signal classifica-
classification task (allowing it to also operate with unknown tion (DA-MUSIC) in [1], a hybrid MB/DD implementation of
number of sources) and [19] analyzed purely data-driven (DD) MUSIC, which exploits the structure of the classic subspace
estimators and DD methods in combination with maximum like- algorithm while augmenting it with NNs to learn to enhance its
lihood estimation (MLE). The works [20], [21] also combined performance. The proposed architecture overcomes the funda-
DL with MLE by using multiple dense NNs and a ResNet, mental limitations of MUSIC, enabling it to accurately detect
respectively, to estimate a subset of candidate angles. [22] ex- the locations of coherent sources. Our design builds upon the
tends the architecture of [21] to estimate the number of sources insight that the sensitivity to model mismatch and inability to
with the ResNet. While such black-box NNs enable handling handle coherent and broadband sources of MUSIC lie in its
array imperfections due to their model-agnostic nature, they estimation of the noise and signal subspaces through an EVD
involve highly parameterized models that may be computation- of the empirical covariance matrix. Accordingly, the proposed
ally intensive and lack the interpretability of model-based (MB) DA-MUSIC improves this crucial step by obtaining a surrogate,
methods. pseudo covariance matrix through a recurrent neural network
Alternatively, NNs were used to robustify the MB MUSIC as a (RNN) from the measurements, which is learned along with a
form of hybrid MB/DD system [23], [24], [25]. Specifically, the NN that acts as a peak finder. This work presents crucial exten-
work [26] and [27] proposed to estimate the discretized MUSIC sions to DA-MUSIC allowing it to operate with an unknown and
spectrum from the (spatially smoothed) covariance matrix of the varying number of signal sources by successfully estimating the
measurements through the utilization of multiple convolutional number of sources for non-coherent and coherent signals. An
NNs. While these methods are more robust to model inaccura- additional neural augmentation allows the categorization into
cies compared with the original MUSIC algorithm, utilizing the noise and signal eigenvectors to be obtained in a learned fashion
MUSIC spectrum as a label for training causes them to experi- by training a classifier. Numerical results show an estimation
ence the same drawbacks as their MB counterpart. Another DD accuracy of 98% in comparison to MUSIC with 89% accuracy
approach proposed in [28] considered systems with subarray in the same non-coherent scenario. We further demonstrate DA-
sampling and uses NNs to obtain a single estimated covariance MUSIC’s capabilities in various SNR domains, in three different
matrix from incoherent subarray measurements. This NN-aided broadband scenarios, as well as with real-world seismic signals.
estimate is utilized for DoA recovery via the subspace-based DA-MUSIC not only manages to focus frequency components
MUSIC algorithm. The method addresses the fundamental de- but is also able to concurrently focus an interdependent elevation
pendency of MUSIC on the estimated covariance matrix, thereby angle to receive a stable azimuth estimation.
robustifying the MUSIC algorithm, yet it does not fully exploit The rest of the article is organized as follows: Section II
the NNs’ ability to improve MUSIC, as the NN is trained using describes the assumed system model and surveys some of the
the true covariance matrix as a label, without considering its related work of DoA estimation, as well as the technical de-
downstream task. Furthermore, the work [29] proposed a hybrid tails of the MB MUSIC algorithm; Section III introduces the
MB/DD approach that includes an eigenvalue decomposition DA-MUSIC algorithm; Section IV presents the results of the
(EVD) of full-row Toeplitz matrices reconstruction (FTMR) simulations; and Section V provides concluding remarks.
matrices [30] and classification deep NNs for estimation of Throughout the article, we use boldface lower-case letters
a MUSIC-like spectrum and detection of number of sources for vectors; e.g., x, and boldface uppercase letters for matrices,
present via eigenvectors and eigenvalues e.g., for X. The ith entry of a vector x is denoted by xi and
The works [31], [32] applied CNNs for broadband DoA entries separated by commas represent column vectors. We use
estimation of a single source. The two approaches, however, calligraphic letters to denote the discrete-time Fourier transform,
utilize very different preprocessing methods and are trained e.g., X (ω) is the frequency representation of a signal x(t). We
as classification and regression tasks, respectively. While the use (·) and (·)H to denote the transpose and Hermitian operators
classification approach limits itself to a fixed resolution, the respectively, and · and · F to denote the 2 and Forbenius
regressor of [32] outputs the DoA angles directly. Therefore, it norms respectively. Further, U and CN represent the uniform
can achieve an arbitrary resolution, yet it is limited to the spatial and the complex normal distributions. The symbol I denotes the
covariance matrix as input. Another approach [33] first decom- identity matrix.
posed the broadband signals into non-overlapping narrowband
components and used support vector regression to obtain the
DoA. II. SYSTEM MODEL AND PRELIMINARIES
Accordingly, there is a concrete need for DoA estimation In this section, we detail the system model for which we
algorithms that can resolve multiple signals that are possibly develop the proposed DA-MUSIC algorithm in Section III. To
broadband and coherent in a low complexity and interpretable that aim, we first discuss the signal model in Section II-A. Then,
manner. The aforementioned individual limitations of both ex- we formulate the DoA estimation problem in Section II-B, and
isting MB and DD DoA estimation algorithms thus motivate the survey some of the related literature in Section II-C. Since our
derivation of a NN-aided DoA estimator capable of leveraging proposed solution builds upon the MUSIC algorithm, we recall
Authorized licensed use limited to: Eindhoven University of Technology. Downloaded on February 14,2024 at 14:56:09 UTC from IEEE Xplore. Restrictions apply.
MERKOFER et al.: DA-MUSIC: DATA-DRIVEN DOA ESTIMATION VIA DEEP AUGMENTED MUSIC ALGORITHM 2773
MUSIC in Section II-D and briefly discuss the CSS method for
broadband extension in Section II-E.
A. Signal Model
We distinguish the considered signal model into two cases: the
conventional narrowband setting and its more general broadband
setup. For more details regarding the two models, we refer the
reader to the textbooks [10], [34], and [35].
1) Narrowband: The most common setup in the array signal
processing literature concerning DoA estimation is the narrow- Fig. 1. DoA estimation illustration.
band setting. Here, the time it takes for the waves to propagate
the array is assumed to be negligible such that the occurring which can, in turn, be written in vector-matrix form analogous
delays are sufficiently small. Therefore, the signal measurement to the narrowband system model (1) as follows:
model for an arbitrary array structure consisting of M sensor
elements measuring D impinging narrowband signals takes the X (ω) = A(ω, θ) S(ω) + V(ω). (7)
following form:
For a ULA, the broadband steering vectors are defined for
x(t) = A(θ) s(t) + v(t). (1) frequency ω and angle of interest ψ as
In (1), the measurements x(t) ∈ CM at time instance t depend on Δm Δm
a(ω, ψ) = 1, e−jω c sin ψ , . . . , e−jω(M −1) c sin ψ , (8)
the signals s(t) ∈ CD , which originate from the unknown angles
θ = [θ1 , . . . , θD ], while v(t) ∈ CM is additive white Gaussian where c is the propagation velocity.
noise. The matrix A(θ) ∈ CM ×D contains the steering vectors
{a(θd )}, i.e., B. Problem Formulation
A(θ) = a(θ1 ) . . . a(θD ) , (2) DoA estimation is concerned with localizing signal sources
by determining their angles of incidence utilizing the measure-
where {θd } are the source directions. For example, the steering ments of an array aperture [34]. Formally, this corresponds to
vector a(ψ) for a uniform linear array (ULA) with an element estimating the angles θ = [θ1 , . . . , θD ] from the measurements
spacing of Δm = /2, where is the wavelength of the signals, x(t) = [x1 (t), . . . , xM (t)] measured at multiple time instances
is defined for direction ψ as t ∈ {1, . . ., T }. An illustration of such a system is depicted
in Fig. 1. The following additional assumptions are imposed
a(ψ) = 1, e−jπ sin ψ , . . . , e−jπ(M −1) sin ψ . (3) upon the observation model as well as the derived synthetic
data. The signals are uncorrelated to the noise (though signals
Consequently, the steering vectors (and, in turn, the matrix may possibly be correlated with each other) and generated in
A(θ)) specify the underlying array geometry. The collection the far-field region. And there is uniform propagation in all
of the measurements at the array elements over multiple time directions in an isotropic and non-dispersive medium. We further
instances is defined as assume that we have knowledge (though possibly mismatched)
X = x(1) . . . x(T ) , (4) of the underlying array geometry, implying that we can compute
an approximation of a(ψ) or a(ω, ψ). We consider a data-driven
where T is referred to as the number of snapshots. setting where we have access to training data. This data is a set
2) Broadband: In practice, many signals are broadband, and of U pairs, {(Xu , θ u )}U u=1 , each comprising the observations
the delay caused by propagating the array aperture needs to be and DoA angles from where the signals originated. In many
incorporated into the signal model. The following notation mod- scenarios; e.g., in wireless communications, a specific training
els the measurements received at array element m ∈ {1, . . ., M } set can be developed before deployment. If the ground-truth
DoAs are not obtainable at all, the data-driven DoA estimator
must be trained by utilizing synthetic data that closely describes
D
xm (t) = sd (t − τmd ) + vm (t), (5) the real signals. Our goal is thus to leverage the available domain
d=1 knowledge and data to design a system for recovering the DoAs
θ from a corresponding observation matrix X whose columns
where τmd represents the time delay of the mth array element are T snapshots of the measured waveforms at the M sensors.
measuring source d ∈ {1, . . .., D} compared to the measure-
ment at the reference element m = 1. Transformed to the fre-
C. Related Literature
quency domain, the relationship in (5) at frequency ω becomes
We next provide an overview of the relevant DoA esti-
mation methods for the presented numerical evaluations of
D
Xm (ω) = e−jωτmd Sd (ω) + Vm (ω), (6) Section IV, followed by reviewing relevant extensions. An ex-
d=1 tensive overview of various MB DoA estimators can be found
Authorized licensed use limited to: Eindhoven University of Technology. Downloaded on February 14,2024 at 14:56:09 UTC from IEEE Xplore. Restrictions apply.
2774 IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 73, NO. 2, FEBRUARY 2024
in [5], [6], and a recent literature review of DD DoA estimation These methods, however, experience significant bias with larger
approaches can be found in [36]. angular sectors of interest and can impose additional model
1) MB Narrowband Estimators: The conventional beam- assumptions such as noise statistics.
former (i.e. maximizing the steered response) is a basic ap- 3) DD Estimators: Inspired by the dramatic success of DL in
proach to DoA estimation [37]. An alternative family of DoA computer vision and natural language processing, recent years
estimators is based on subspace methods, which aim at recov- have witnessed a growing interest in the application of NNs for
ering the DoAs by identifying the noise and signal subspaces. DD DoA estimation. A recent review of DL DoA estimation
The MUSIC algorithm [7] is a highly popular subspace-based approaches can be found in [36] and essential methods were
method and has been researched extensively. Being the focus of covered in Section I. DL architectures presented in the numer-
this article, more information and a detailed description of the ical evaluations or otherwise closely related to this work are
algorithm can be found in Section II-D. Extensions of MUSIC summarized and discussed below.
include Root-MUSIC [38], a polynomial-rooting version, as The work [18] implemented a CNN architecture for DD DoA
well as spatially smoothed MUSIC, which removes the cor- estimation based on a multi-label classification task. Particularly,
relation between the incident signals by dividing the receiver the method takes real, imaginary, and phase information of
array into overlapping subarrays [39]. In practice, however, the the sample covariance as a three-channel input and predicts a
number of coherent sources is mostly unknown, and therefore probability grid of directions. Given the binary cross-entropy
the decorrelation effect of spatial smoothing is not obvious [40]. loss for training, the method is capable of operating with a
Another popular subspace-based method for DoA estimation is potentially unknown and varying number of sources. However,
estimation of signal parameters via rotational invariance tech- with a growing number of sources, the number of classes grows
niques (ESPRIT) [41] and its variations [42]. These methods rely exponentially.
heavily on the accuracy of the underlying model assumptions, Similarly to the above, [26] proposed CNNs for DoA esti-
are generally sensitive to array aperture perturbations, and are mation by taking the same channels of the sample covariance
inherently derived for narrowband signals. matrix as input. However, the CNNs are trained as a regression
2) MB Broadband Estimators: Broadband DoA estimation task estimating a discretized segment of the MUSIC spectrum.
algorithms can be categorized into two groups: incoherent meth- Thereby, they not only inherit the drawbacks and imitations of
ods and coherent methods. Generally, incoherent methods use MUSIC but are, besides increased robustness, dependant on the
independent frequency bins (IFBs) to process the DoA informa- underlying model assumptions.
tion at every frequency separately and then combine the results
of these narrowband DoA estimations. There are many different
variations in the implementation of this approach [43], [44], [45],
which typically vary in the computation of the covariance matri- D. MUSIC Algorithm
ces. Unfortunately, the computational complexity increases with As our proposed DA-MUSIC algorithm originates from the
each frequency bin. MUSIC algorithm, we next present this method in detail. MU-
Conversely, coherent methods combine the covariance matri- SIC, originally proposed by Schmidt in [7], considers the nar-
ces estimated for IFBs in order to apply narrowband techniques rowband signal model with incoherent sources (1), where the
directly over a single, so-called focused covariance matrix. An signals in s(t) are mutually independent. Fig. 2 visualizes a
overview of coherent techniques can be found in [10], [46]. simple outline of the MUSIC structure as a block diagram. The
A leading approach for coherent broadband DoA estimation approach takes the empirical covariance matrix of the received
is based on the CSS method [11], with example estimators measurements X, then conducts an EVD, followed by catego-
given in [47], [48], [49]. Many variations of the CSS method rizing the eigenvectors into signal and noise subspaces. The
are concerned with the crucial aspect of the focusing strategy, orthogonality between the two subspaces allows the formulation
proposing different techniques to combine the covariances at of a spatial spectrum, which contains peaks at DoA angles.
each IFB to obtain a faithful narrowband formulation. See, 1) Formal Derivation: With the assumption of the signal and
for example, [50], [51] and the more recent summary [52]. the noise being uncorrelated, the covariance matrix of x(t) is
Authorized licensed use limited to: Eindhoven University of Technology. Downloaded on February 14,2024 at 14:56:09 UTC from IEEE Xplore. Restrictions apply.
MERKOFER et al.: DA-MUSIC: DATA-DRIVEN DOA ESTIMATION VIA DEEP AUGMENTED MUSIC ALGORITHM 2775
Authorized licensed use limited to: Eindhoven University of Technology. Downloaded on February 14,2024 at 14:56:09 UTC from IEEE Xplore. Restrictions apply.
2776 IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 73, NO. 2, FEBRUARY 2024
Authorized licensed use limited to: Eindhoven University of Technology. Downloaded on February 14,2024 at 14:56:09 UTC from IEEE Xplore. Restrictions apply.
MERKOFER et al.: DA-MUSIC: DATA-DRIVEN DOA ESTIMATION VIA DEEP AUGMENTED MUSIC ALGORITHM 2777
process of the noise subspace from the eigenvectors; and the B. Training Procedure
adaptation of the output strategy to overcome the varying number
DA-MUSIC is trained end-to-end in a supervised setting as a
of DoA angles. multiple regression problem. As detailed in Section II-B, the
Estimating number of sources: Our design allows DA-MUSIC
training set is comprised of U tuples of sequences of mea-
to learn abstract mappings as pseudo covariance features K̃,
surements and their corresponding DoAs; i.e., the uth tuples
which is geared towards end-to-end training and is not restricted includes the Tu measurements Xu and their corresponding Du
to a natural ordering of the model-based covariances. Conse-
DoA angles θ u . We first describe how this data is used for
quently, instead of estimating the number of sources by inspect-
training when D is known and fixed; i.e., Du ≡ D. This acts
ing the magnitude of its eigenvalues, we opt for a data-driven as a preliminary step for discussing how the training procedure
approach. We augment the process of estimating the number
is carried out in the general case where the model does not have
of sources with a classifier, implemented through a MLP, as knowledge of D.
a classification task. Since subspace methods with M inputs 1) Known Number of Sources: Given a sequence of measure-
can resolve at most M − 1 signals, the classifier has M − 1
ments X as input, the model predicts the estimated DoA angles θ̂
classes. Fig. 5 depicts the DA-MUSIC architecture with an added that are compared to the true DoA angles θ. Gradient-based opti-
classifier taking the eigenvalues as an input and outputting a
mization is possible because every element of the architecture is
probability vector for the estimated number of sources with a
differentiable, allowing backpropagation through the complete
softmax activation. structure. Derived from the root mean squared periodic error
Computing noise subspace: The noise subspace selector
(RMSPE) [56], [57] the following loss function additionally
needs to know the number of sources D ∈ {1, . . ., M − 1}
compares all permutations of the predicted angles with the true
to classify the eigenvectors into signal and noise subspaces; angles to capture all possible assignments of the estimated DoA
i.e., by choosing the eigenvectors corresponding to the N =
to the true DoA. Thereby the minimal permutation RMSPE in-
M − D smallest eigenvalues. When the number of sources
cludes the permutation invariance of the DoAs and is obtained as:
is not known, we propose to weight each eigenvector by an
estimate of it belonging to the noise subspace. We do this 2 21
1
by introducing an additional neural augmentation, depicted in RMSPE(θ, θ̂) = min modβ (θ − Pθ̂) , (14)
P∈PD D
Fig. 6, which uses a MLP to cluster the eigenvalues in a learned
fashion. where PD is the set of all D × D permutations and modβ
In particular, the MLP maps the estimated M eigenvalues denotes the element-wise modulus operation regarding the
into a vector q = [q1 , . . ., qM ] whose entries hold the individ- angle range of interest, e.g. β = π for ψ ∈ [−π/2, π/2) or
ual probabilities of choosing the corresponding eigenvector as β = 2π for ψ ∈ [0, 2π).
a noise eigenvector, implemented using a sigmoid activation. 2) Unknown Number of Sources: As discussed in the previ-
The selection is performed by computing Ẽ = Vdiag(q) = ous subsection, DA-MUSIC is designed to resolve a varying and
[q1 v1 . . . qM vM ], allowing to learn a suitable noise subspace. unknown number of sources by introducing an additional NN
Note that this setting specializes in the conventional approach classifier for the number of sources. To formulate the training
of assigning based on the multiplicity of the minimal eigen- procedure of the overall system, we use wc to denote the
values; in the conventional approach, the entries of q are trainable parameters of the classifier, while wd represents the
either ones or zeros and Ẽ coincides with the correspond- parameters of the remaining trainable modules of DA-MUSIC
ing subspace. Consequently, the proposed approach provides (covariance estimator, peak finder, and subspace selector). The
additional flexibility in selecting the noise subspace and fa- loss used to train wd is the RMSPE loss of (14), which is altered
cilitates coping with settings where distinguishing between during training to account for varying sources by computing,
the eigenvectors is challenging due, for example, to low
SNRs. LRMSPE θ u , θ̂(Xu ) = RMSPE θ1:Du , θ̂1:Du (Xu ) .
Outputting varying number of DoAs: The most common (15)
strategy to overcome the dynamic output dimensions of NNs is In (15), θ̂(Xu ) denotes the M − 1 outputs of DA-MUSIC ap-
to scale the output dimension to the maximum occurring value. plied to Xu , while θ1:Du = (θ1 , . . ., θDu ). This means that only
In our case, since MUSIC cannot resolve more than M − 1 the first Du angles of the estimated DoA θ̂ are compared with
sources, the dimension of the last dense layer of DA-MUSIC the true DoA θ while the remaining angles of θ̂ are completely
is set to M − 1. The only additional modification required with ignored.
this strategy compared with known D is the slight alteration to The separate classifier is trained using the categorical cross-
the loss function discussed in Section III-B below. An advantage entropy of the classes as a loss function ensuring optimal train-
of this strategy is that the approach allows to extract up to M − 1 ing. In particular, letting λu be the input to the MLP classifier
DoA, where θ1 is most likely a true DoA angle, θ2 has a slightly when applying DA-MUSIC to Xu and letting D̂(λu ) be the
lower probability to be a DoA angle, etc. The final estimation is softmax output of the MLP applied to λu (with D̂i (λu ) being
thus carried out by first taking D̂ from the module that estimates its ith entry), the loss used for training wc is given by
the number of sources, and then using the first D̂ outputs as the
recovered DoAs. LCE Du , D̂(λu ) = − log D̂Du (λu ). (16)
Authorized licensed use limited to: Eindhoven University of Technology. Downloaded on February 14,2024 at 14:56:09 UTC from IEEE Xplore. Restrictions apply.
2778 IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 73, NO. 2, FEBRUARY 2024
TABLE I
Algorithm 1: Training DA-MUSIC. DEFAULT SIMULATION PARAMETERS
Authorized licensed use limited to: Eindhoven University of Technology. Downloaded on February 14,2024 at 14:56:09 UTC from IEEE Xplore. Restrictions apply.
MERKOFER et al.: DA-MUSIC: DATA-DRIVEN DOA ESTIMATION VIA DEEP AUGMENTED MUSIC ALGORITHM 2779
TABLE II
RMSPE OF DIFFERENT DOA ESTIMATION ALGORITHMS WITH CONSTANT AND
KNOWN D FOR T = 200 SNAPSHOTS
A. Synthetic Narrowband
The numerical evaluations of synthetic data presented be-
low are obtained by simulating the measurements x(t) ac-
cording to the narrowband system model (1). In particular,
we simulate a ULA with M = 8 array elements that measure
impinging waveforms originating from the DoA angles θ =
[θ1 , . . . , θD ], which are separately drawn from the uniform dis-
tribution U (−π/2, π/2). The signals s(t) = [s1 (t) . . . sD (t)]
are each drawn randomly from the complex Gaussian distri-
bution CN (0, 1) for all t modeling random amplitudes and
phases. The noises measured at the M array elements v(t) =
[v1 (t) . . . vM (t)] are also drawn from CN (0, 1) for all t, fol- Fig. 8. RMSPE of DoA estimation with various levels of mismatch in the
array geometry for D = 5 sources.
lowed by appropriate scaling to meet the constant SNR. In the
coherent cases, all signals have identical amplitudes and phases,
and when not stated otherwise, the simulation parameters are set
according to Table I. located close together at a Δθ distance from each other. The
1) Known Number of Sources: We first evaluate the RMSPE MB MUSIC algorithm is shown to collapse when the angular
in [rad] achieved when the number of sources D is known in difference approaches Δθ ≈ 0.1 radians, while DA-MUSIC
the synthetic narrowband scenario described above. The results, demonstrates a constant low error for all Δθ, indicating its
reported in Table II, compare the performance of the following improved resolution.
DoA estimators for a different number of sources D: Next, we evaluate DA-MUSIC in the presence of a mismatch
r The DA-MUSIC architecture is implemented according to in the array geometry. Fig. 8 depicts the RMSPE achieved
Fig. 3 and trained separately for each case D. when each element of the steering vector a(·) is corrupted
r The classic MB MUSIC algorithm, implemented as de- with zero-mean Gaussian noise, leading to a mismatch from
scribed in Section II-D, utilizes the external knowledge of the values used to compute the spatial spectrum. Indicating
D. improved robustness, DA-MUSIC is shown to overcome such
r The CNN architecture of [18], with a reduced stride in the mismatches in the array geometry from the data. The CNN,
first layer to account for M = 8, and an increased grid-size DeepMUSIC, and the Random algorithm remain unaffected as
for the last layer to achieve R = 360. they are independent of a(·).
r The DD DeepMUSIC proposed in [26], while incorporat- Fig. 9 depicts the performances differences when localizing
ing minor alterations that were necessary to accommodate D = 5 non-coherent signals with T = 200 snapshots available
for the difference in the setup, includes tuning of individual for different SNR levels in the range of [−20, 20] dB. DA-
hyperparameters to assure successful training of the CNNs. MUSIC shows a constant low SNR for positive dB settings and
The above algorithms are compared with random guessing of without any fluctuations which slowly decreases with increasing
the DoA angles. The results in Table II show that the proposed SNR.
DA-MUSIC notably outperforms all considered benchmarks, Fig. 10 depicts the performance degradation of the estimators
notably surpassing the MB MUSIC algorithm not only for with fewer snapshots available. The DD estimators are only
coherent sources but also for non-coherent ones, which is the trained for the case T = 200 as indicated by the circle around the
scenario for which the MB algorithm is designed. T = 200 marker, yet manage to operate with shorter sequences
To compare the resolution of the algorithms, Fig. 7 shows the during inference due to the recurrent unit or by taking the
RMSPE for localizing D = 2 non-coherent signals, which are covariance matrix as input.
Authorized licensed use limited to: Eindhoven University of Technology. Downloaded on February 14,2024 at 14:56:09 UTC from IEEE Xplore. Restrictions apply.
2780 IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 73, NO. 2, FEBRUARY 2024
TABLE III
RMSPE OF DIFFERENT DOA ESTIMATION ALGORITHMS WITH VARYING AND
UNKNOWN D FOR T = 200 SNAPSHOTS
Authorized licensed use limited to: Eindhoven University of Technology. Downloaded on February 14,2024 at 14:56:09 UTC from IEEE Xplore. Restrictions apply.
MERKOFER et al.: DA-MUSIC: DATA-DRIVEN DOA ESTIMATION VIA DEEP AUGMENTED MUSIC ALGORITHM 2781
Fig. 12. Accuracy of estimating D̂ for various T with D ∈ {2, . . ., 5} non- Fig. 14. RMSPE for various T with varying and unknown number of
coherent narrowband signals. D ∈ {2, . . ., 5} non-coherent narrowband signals.
Fig. 15. RMSPE for various T with varying and unknown number of
D ∈ {2, . . ., 5} coherent narrowband signals.
Fig. 13. Accuracy of estimating D̂ for various T with D ∈ {2, . . ., 5} coherent
narrowband signals.
Authorized licensed use limited to: Eindhoven University of Technology. Downloaded on February 14,2024 at 14:56:09 UTC from IEEE Xplore. Restrictions apply.
2782 IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 73, NO. 2, FEBRUARY 2024
TABLE IV
SIMULATION PARAMETERS OF THE BROADBAND SCENARIOS
1
K−1
Fig. 16. RMSPE for various T with known number of D = 2 signals from
sd,OFDM = s̄k exp (2πjkΔfd t/K), (19) Broadband Scenario 1.
K
k=0
Authorized licensed use limited to: Eindhoven University of Technology. Downloaded on February 14,2024 at 14:56:09 UTC from IEEE Xplore. Restrictions apply.
MERKOFER et al.: DA-MUSIC: DATA-DRIVEN DOA ESTIMATION VIA DEEP AUGMENTED MUSIC ALGORITHM 2783
TABLE V
RMSPE IN [RAD] OF DIFFERENT DOA ESTIMATION ALGORITHMS
FOR SEISMIC DATA
for some frequency f and with the wave vector for certain
elevation α and azimuth ψ of interest,
Authorized licensed use limited to: Eindhoven University of Technology. Downloaded on February 14,2024 at 14:56:09 UTC from IEEE Xplore. Restrictions apply.
2784 IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, VOL. 73, NO. 2, FEBRUARY 2024
Authorized licensed use limited to: Eindhoven University of Technology. Downloaded on February 14,2024 at 14:56:09 UTC from IEEE Xplore. Restrictions apply.
MERKOFER et al.: DA-MUSIC: DATA-DRIVEN DOA ESTIMATION VIA DEEP AUGMENTED MUSIC ALGORITHM 2785
[50] E. D. di Claudio and R. Parisi, “WAVES: Weighted average of signal Nir Shlezinger (Senior Member, IEEE) received the
subspaces for robust wideband direction finding,” IEEE Trans. Signal B.Sc., M.Sc., and Ph.D. degrees in electrical and com-
Process., vol. 49, no. 10, pp. 2179–2191, Oct. 2001. puter engineering from Ben-Gurion University, Beer-
[51] Y.-S. Yoon, L. M. Kaplan, and J. H. McClellan, “TOPS: New DOA sheba, Israel, in 2011, 2013, and 2017, respectively.
estimator for wideband signals,” IEEE Trans. Signal Process., vol. 54, He is currently an Assistant Professor wtih the School
no. 6, pp. 1977–1989, Jun. 2006. of Electrical and Computer Engineering, Ben-Gurion
[52] F. Ma and X. Zhang, “Wideband DOA estimation based on focusing signal University. From 2017 to 2019, he was a Postdoctoral
subspace,” Signal, Image Video Process., vol. 13, PP. 675–682, 2019. Researcher with the Technion, and from 2019 to 2020,
[53] S. M. Kay, Fundamentals of Statistical Signal Processing: Estimation he was a Postdoctoral Researcher with the Weizmann
Theory. Englewood Cliffs, NJ, USA: Prentice-Hall International Editions, Institute of Science, where he was awarded the FGS
1993, pp. 27–37. prize for outstanding research achievements. His re-
[54] H. Krim and J. Proakis, “Smoothed eigenspace-based parameter estima- search interests include communications, information theory, signal processing,
tion,” Automatica, vol. 30, no. 1, pp. 27–38, 1994. and machine learning.
[55] O. Solomon et al., “Deep unfolded robust PCA with application to clutter
suppression in ultrasound,” IEEE Trans. Med. Imag., vol. 39, no. 4,
pp. 1051–1063, Apr. 2020.
[56] T. Routtenberg and J. Tabrikian, “Bayesian parameter estimation using
periodic cost functions,” IEEE Trans. Signal Process., vol. 60, no. 3, Tirza Routtenberg (Senior Member, IEEE) received
pp. 1229–1240, Mar. 2012. the B.Sc. degree from the Technion Israel Institute of
[57] T. Routtenberg and J. Tabrikian, “Non-Bayesian periodic Cramér-Rao Technology, Haifa, Israel, in 2005, and the M.Sc. and
bound,” IEEE Trans. Signal Process., vol. 61, no. 4, pp. 1019–1032, Ph.D. degrees in electrical and computer engineer-
Apr. 2013. ing from Ben-Gurion University, Beersheba, Israel,
[58] D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” in 2007 and 2012, respectively. She is currently an
2014, arXiv:1412.6980. Associate Professor with the School of Electrical
[59] Z. Zhang, Z. Shi, and Y. Gu, “Ziv-Zakai bound for DOAs estimation,” and Computer Engineering, Ben-Gurion University,
IEEE Trans. Signal Process., vol. 71, pp. 136–149, 2023. Beersheba, Israel. She was a Postdoctoral Fellow with
[60] T. Routtenberg and J. Tabrikian, “A general class of outage error probabil- the School of Electrical and Computer Engineering,
ity lower bounds in Bayesian parameter estimation,” IEEE Trans. Signal Cornell University, Ithaca, NY, USA, from 2012 to
Process., vol. 60, no. 5, pp. 2152–2166, May 2012. 2014. Since 2014, she has been a faculty Member with the School of Elec-
[61] S. B. Weinstein, “The history of orthogonal frequency-division multiplex- trical and Computer Engineering, and Ben-Gurion University of the Negev,
ing [history of communications],” IEEE Commun. Mag., vol. 47, no. 11, Beer-Sheva. During 2022–2023, she was a William R. Kenan, Jr. Visiting
pp. 26–35, Nov. 2009. Professor for Distinguished Teaching with Princeton University, Princeton, NJ,
[62] H.-P. Harjes, “Design and siting of a new regional array in central Europe,” USA. Her research interests include signal processing in smart grids, statistical
Bull. Seismological Soc. America, vol. 80, pp. 1801–1817, 1990. signal processing, estimation and detection theory, and signal processing on
graphs. She is an Associate Editor for IEEE TRANSACTIONS ON SIGNAL AND
INFORMATION PROCESSING OVER NETWORKSand IEEE SIGNAL PROCESSING
LETTERS, and a Member of the IEEE Signal Processing Theory and Methods
Julian P. Merkofer (Student Member, IEEE) re- Technical Committee. She was the recipient of the Best Student Paper Award
at the International Conference on Acoustics, Speech and Signal Processing
ceived the B.Sc. and M.Sc. degrees in electrical engi-
(ICASSP) 2011, IEEE International Workshop on Computational Advances
neering and information technology from the Federal
in Multi-Sensor Adaptive Processing (CAMSAP) 2013 (co-author), ICASSP
Institute of Technology in Zurich (ETHZ), Zurich,
2017 (co-author), and IEEE Workshop on Statistical Signal Processing (SSP)
Switzerland, in 2019, and 2021, respectively. He is
2018 (co-author). She was awarded the Negev scholarship in 2008, Lev-Zion
currently working toward the Ph.D. degree with the
scholarship in 2010, Marc Rich Foundation Prize in 2011, and Toronto Prize for
Biomedical Diagnostics Laboratory, Eindhoven Uni-
versity of Technology, Eindhoven, The Netherlands. Excellence in Research in 2021.
His research interests include machine learning for
signal processing, model-based deep learning, and
magnetic resonance spectroscopy. Ruud J. G. van Sloun (Member, IEEE) received
the M.Sc. and Ph.D. degree (cum laude) in electrical
engineering from the Eindhoven University of Tech-
Guy Revach (Graduate Student Member, IEEE) re- nology, Eindhoven, The Netherlands, in 2014, and
2018, respectively. He is currently an Associate Pro-
ceived the B.Sc. (cum laude) and M.Sc. degrees from
fessor with the Department of Electrical Engineering,
the Andrew and Erna Viterbi Department of Elec-
Eindhoven University of Technology, Eindhoven, the
trical and Computer Engineering, Technion – Israel
Institute of Technology, Haifa, Israel, in 2008 and Netherlands. From 2019 to 2020, he was a Visiting
Professor with the Department of Mathematics and
2017, respectively. Since 2019, he has been working
Computer Science, Weizmann Institute of Science,
toward the Ph.D. degree with the Institute for Sig-
Rehovot, Israel, and from 2020 to 2023, he was a
nal and Information Processing (ISI), ETH Zürich,
Zürich, Switzerland, supervised by Dr. Hans-Andrea kickstart AI Fellow with Philips Research. His current research interests include
closed-loop image formation, deep learning for signal processing and imaging,
Loeliger. He did his master’s thesis under the su-
active signal acquisition, model-based deep learning, compressed sensing, ul-
pervision of Prof. Nahum Shimkin on planning for
trasound imaging, and probabilistic signal and image reconstruction. He was the
cooperative multiagents. He is currently a Researcher with a proven industry
recipient of an ERC starting grant, NWO VIDI grant, NWO Rubicon grant, and
track record as an innovator and system engineer. His research interests include
Google Faculty Research Award.
the intersection of machine learning with signal processing, and more specifi-
cally combining sound theoretical principles from classical signal processing
with state-of-the-art machine learning algorithms for tracking and detection
problems. Before coming to ETH Zürich, he was with the Israeli wireless
communication industry for more than ten years, first as a real-time embedded
Software Engineer and then a Software Manager. He was the main innovator be-
hind state-of-the-art, software-defined radio (SDR) for wireless communication,
game-changing and groundbreaking in terms of size, weight, and power. As a
System Engineer, he defined major aspects of SDR requirements and architecture
for hardware, software, network, cyber defense, signal processing, data analysis,
and control algorithms.
Authorized licensed use limited to: Eindhoven University of Technology. Downloaded on February 14,2024 at 14:56:09 UTC from IEEE Xplore. Restrictions apply.