A Source Number Estimation Algorithm Based on Data Local Density and Fuzzy C-Means Clustering

Wu, Na; Wang, Ke; Wan, Liangtian; Liu, Ning

doi:https://doi.org/10.1155/2021/6658785

Wireless Communications and Mobile Computing

On this page

Abstract Introduction Conclusion Data Availability Conflicts of Interest Acknowledgments References Copyright Related Articles

Special Issue

Source Direct Position Determination in Distributed Sensing Systems

View this Special Issue

Research Article | Open Access

Volume 2021 | Article ID 6658785 | https://doi.org/10.1155/2021/6658785

A Source Number Estimation Algorithm Based on Data Local Density and Fuzzy C-Means Clustering

Na Wu,^1,2Ke Wang,³Liangtian Wan,⁴and Ning Liu²

Academic Editor: Jian Feng Li

Received10 Dec 2020

Revised03 Jan 2021

Accepted31 Jan 2021

Published22 Feb 2021

Abstract

An advanced source number estimation (SNE) algorithm based on both fuzzy C-means clustering (FCM) and data local density (DLD) is proposed in this paper. The DLD of an eigenvalue refers to the number of eigenvalues within a specific neighborhood of this eigenvalue belonging to the data covariance matrix. This local density essentially as the one-dimensional sample feature of the FCM is extracted into the SNE algorithm based on FCM and can enable to improve the probability of correct detection (PCD) of the SNE algorithm based on the FCM especially for low signal-to-noise ratio (SNR) environment. Comparison experiment results demonstrate that compared to the SNE algorithm based on the FCM and other similar algorithms, our proposed algorithm can achieve highest PCD of the incident source number in both cases of spatial white noise and spatial correlation noise.

1. Introduction

Superresolution spatial spectrum estimation algorithm for the direction of arrival (DOA) estimation has been applied in various fields including passive detection and location, communication, sonar detection, multiple input multiple output (MIMO) radar, and communication [1–11]. Almost all spatial spectrum estimation algorithms are constructed on the basis of the known incident source numbers to further enabling the accurate DOA estimation. Unknown or erroneous source numbers will cause the very bad accuracy or even ineffective of the spatial spectrum estimation algorithms, which is contributed to that the noise subspace and signal subspace are staggered and overlapped in this case [12]. The source number estimation (SNE) is thus the first and key process of the spatial spectrum estimation algorithm, and the accurate SNE is the primary prerequisite for the following DOA estimation. Therefore, how to rapidly and accurately retrieve the unknown incident source number from the array observation data containing the complex space noise and receiver noise thus becomes an issue of practical significance.

In the last few decades, SNE algorithms have been extensively studied by worldwide scholars. To be more specific, SNE algorithms can be divided into the following two stages, namely, the classical SNE algorithms. The classical algorithms proposed during this period mainly include the SNE algorithms based on hypothesis testing [13] and information theory criteria [14]. Due to the limitations of these classical algorithms in terms of estimation performance and stability, a series of improved SNE algorithms based on mAIC criteria [15] and mMDL criteria [16] for source number estimation under spatial correlation noise environment and a robust improved MDL criterion [17] for algorithm stability have been proposed. These improved algorithms achieve the better estimation performance of the classical SNE algorithms. However, almost none of them can accurately estimate the incident source number when encountering the complex spatial environment, such as the poor signal-to-ratio (SNR) cases and intense unknown correlated noise scenarios.

Under the powerful impetus of array signal processing technology, more and more advanced theories and strategies have been used to the SNE and then strongly pushed the SNE algorithm into the next stage. Abundant SNE algorithms have been constantly emerging, such as canonical correlation technology [18, 19] and MDL criterion based on the random matrix theory [20], although these SNE algorithms have their respective superiorities in terms of the accuracy, robustness, operation cost, and antijamming capability. However, they are limited in practical application more or less. SNE algorithms based on the regularization correlation technique have a severe requirement on the array design for obtaining enough accuracy. SNE algorithms based on the MDL criterion by using random theory have to require the receiver array featuring many enough elements to enable better estimation performance. SNE algorithms based on the information theory criterion by introducing diagonal loading have a higher estimation accuracy under the color noise background, but a poor accuracy in the cases of the white noise.

In recent years, the artificial intelligence technology has been explosively researched and has achieved exciting progresses. Also, artificial intelligence algorithms have been further applied in passive direction finding systems for accurately estimating the number of incident sources [21–23]. One of the unsupervised algorithms named fuzzy logic theory can effectively deal with the unknown and uncertain factors in the actual system and shown good performance in SNE, such as the representative SNE algorithm based on fuzzy C-means clustering (FCM) [24]. This algorithm has superior estimation capability in retrieving the number of the incident sources under both the spatially white noise scenarios and the spatially correlated color noise ones. However, the sample features of existing SNE based on FCM algorithms are obtained according to the value of eigenvalues rather than the overall distribution characteristics of eigenvalues, which thus limits its further application in the real passive FCM algorithm systems, especially for low SNR, arbitrary, and complex electromagnetic environment.

To tackle this issue and further improve the accuracy and applicability of the SNE algorithms in the actual system, we extract the distribution density characteristics of the eigenvalues from the received data covariance matrix and take it as one-dimensional (1D) characteristic information of the fuzzy C-means clustering and further construct an advanced SNE algorithm based on both the FCM and the data local density (DLD). The DLD potentially increases the relationship between by adding 1D information and may have a good expectation that our proposed SNE algorithm can obtain a high probability of correct detection (PCD) even for low SNR and few snapshots cases.

The following contents of this paper are organized as follows. The array signal model for a uniform linear array used in source number estimation algorithm is rigorously formulated in Section 2. Section 3 introduces the basic theory of the FCM algorithm in brief. Then, the SNE algorithm based on the FCMC and the DLD is proposed in the Section 4. Section 5 demonstrates the accuracy and applicability of our proposed SNE algorithm by numerical simulation. Finally, Section 6 is the conclusion of this paper.

2. Array Signal Model

Assume that there are stationary random far-field narrow-band independent signals in space. The receiving array is a uniform linear array composed of scalar arrays; assume that the wavelength of the incident source is , and the array spacing of the receiving array is . Let the incident angle of the first incident signal be . The first element in the array is taken as the reference, when the th signal impinging on the array, the delay between the element and the reference element is

The corresponding phase difference is

Then, the array guide vector is

Therefore, the array receiving data vector is where is the array receiving data vector, is array popularity matrix, is the array receiving signal vector, and is the array receiving noise vector.

Assuming that the spatial noise received by the array is zero-mean stationary random Gaussian white noise of power is and the noise is independent of the incident signal, the covariance matrix of the data vector which received by array can be obtained as: where is the incident signal covariance matrix and is the covariance matrix of noise received by the array.

By eigenvalue decomposition of Equation (5), the following formula can be obtained as: where is the th eigenvalue of the array receiving data covariance matrix, is the eigenvector corresponding to . In an ideal situation, arranged the eigenvalues in descending order, the result is:

However, array receiving data for signal processing are actually obtained through limited sampling of system in actual direction finding system. Therefore, the array receiving data covariance matrix is usually obtained by the maximum likelihood algorithm; when the number of snapshots is , the ML estimate is:

By eigenvalue decomposition of Equation (8) and arranging the eigenvalues in descending order, the following formula can be obtained as:

The reason why the noise eigenvalues in formula (9) no longer meet the equal relationship is that there will be errors which cannot be avoided in the actual system and the estimation process. Obviously, it is impossible to estimate incident source number from the identical eigenvalues number. So it is necessary to carry out the study of the estimation algorithm of source number, i.e., the SNE algorithm.

3. Fuzzy C-Means Clustering

Clustering is a kind of unsupervised learning. The key of clustering is the selection of sample characteristics to divide the samples. FCM [24] is a soft clustering method, which takes into account the uncertainty of data to be clustered. Compared with the hard clustering algorithm represented by K-mean clustering, fuzzy C-mean clustering can better deal with the uncertainty in the system.

Let express a given set of samples. For the incident SNE problem, it is the value of the relevant feature of the eigenvalue of array received data covariance matrix, such as the value and the local density of eigenvalues. Where represents the sample space dimension, represents the sample number, that is the eigenvalue number or the elements number in receiving antenna array. For the problem of source number estimation, the purpose of clustering is to divide the eigenvalues into two categories, which belong to signal subspace and noise subspace, respectively, so the number of cluster centers is fixed, and the value is 2. A general FCM model is shown as: where with dimension is the fuzzy membership matrix, represents the membership value of the th sample belonging to the th class; here, is the number of cluster centers, and its value is 2 in source number estimation. is the matrix composed of c cluster center vectors. is the weighted index of fuzzy clustering; is the distance from the sample to the center .

According to formula (10), the updating formulas of fuzzy membership and cluster center are as follows:

4. Proposed Algorithm

Suppose there are two signal incidents on a uniform array composed with 10 elements, the distribution of eigenvalues of the array received data covariance matrix versus different SNR is shown in Figure 1; red and blue points in the figure represent signal eigenvalues and noise eigenvalues, respectively.

The result of Figure 1 shows that compared to the eigenvalues of signal, the value difference of the eigenvalues of noise is small; in another word, the density is larger in a certain neighborhood. The concept of local density and its calculation formula are proposed in [25]. The local density of a sample is the number of samples within a field of that sample. According to the result shown in Figure 1 and the concept of local density, we can divide each eigenvalue of array receiving data into noise eigenvalue and signal eigenvalue according to its local density and then get the incident source number. So, in this paper, we construct 1D input samples of FCM according to the local density of eigenvalues, especially in low SNR environments.

Firstly, the local density of each eigenvalue is calculated by the following formula: where is a piecewise function:

indicates the distance between the th and the th eigenvalue; indicates a truncated distance, which specifies the neighborhood range of each eigenvalue when calculating the local density. In this paper, the calculation formula of is:

It can be seen from formula (13) that the local density of the th eigenvalue is equal to the eigenvalue number distributed in the neighborhood with radius of of the th eigenvalue. Figure 1 shows that with the increase of SNR, the difference between signal eigenvalue and noise eigenvalue also increases. The value of has a great influence on the algorithm performance. The value of cannot always satisfy with the change of SNR. Therefore, we use fuzzy C-means clustering to cluster the sample features and then get the number of the incident source.

It can be seen from formulas (13)–(15) that the calculation of local density in the proposed algorithm can be completed by simple addition calculation; therefore, compared with the eigen decomposition operation, the influence of the calculation of local density on the algorithm complexity can be ignored.

The main steps of the source number estimation algorithm based on data local density and fuzzy C-means clustering (LDFCM) are summed up as follows:

Step 1. Calculate the array receiving data covariance matrix and extract the eigenvalue from eigenvalue decomposition.

Step 2. Calculate the local density of all eigenvalues according to formulas (13) and (14).

Step 3. The eigenvalues and their data local density are used as input samples of fuzzy C-means clustering.

Step 4. The maximum and minimum values of eigenvalues are taken as the initial values of the two clustering centers, respectively, and the termination conditions are set.

Step 5. Fuzzy membership and clustering center are updated by using formulas (11) and (12) until the termination condition is satisfied.

Step 6. Judge the number of information sources according to the final result.

5. Simulation Verification

Firstly, we will show the validity of LDFCM algorithm through the first experiment; assuming that the number of array elements is 10, the DOAs of 2 incident signals with equal power are , SNR is 0, and the number of snapshots is 100. It can be seen from Figure 2 that the LDFCM algorithm can correctly classify the eigenvalues belonging to the noise subspace and the signal subspace, where the number of eigenvalues belonging to the signal subspace is the number of sources.

Then, we will compare the PCD of incident source number use LDFCM with MDL, AIC, mMDL, mAIC, and FCM in white noise and spatial correlation noise, respectively. 5000 Monte-Carlo runs are taken in each experiment for performance comparison. We consider two narrowband and equal-amplitude stationary Gaussian signals illuminating on a uniform linear array at intervals of half-wavelength between elements.

Figure 3 shows the relationship between the PCD and the SNR. The numbers of antennas and snapshots are 10 and 50, respectively, in this experiment. DOAs of the two incident sources with the same power are . It can be seen that the LDFCM, FCM, MDL, mAIC, and mMDL methods converge to 1 in PCD when the SNR tends to infinity. Since AIC is not a consistent estimate, it does not converge to 1. The LDFCM Algorithm has the highest PCD under low SNR; the reason is that in the case of low SNR, the difference between noise eigenvalue and minimum signal eigenvalue is small, but the local density difference between them is large. Therefore, we make the local density of eigenvalue as 1D sample feature used in FCM, which is equivalent to 1D additional information to the FCM algorithm, so the PCD of improved algorithm proposed in this paper is higher than FCM algorithm, especially for the low SNR cases. For example, when SNR is -10 dB, the PCD of LDFCM algorithm is higher than 70%, and the PCD of FCM algorithm is less than 10%.

In the third experiment, assume that the number of antennas is 10. DOAs of the two incident sources with equal power are ; SNR is 0. The PCD versus the number of snapshots are provided in Figure 4. It is observed that the LDFCM, FCM, MDL, mAIC, and mMDL methods converge to 1 in PCD as the number of snapshots goes to infinity. AIC is still not converge to 1. The PCD of LDFCM and FCM methods are close and are better than other comparison algorithms. The comparison results show that SNE algorithm based on LDFCM and FCM have a low requirement for the number of snapshots.

Figures 5 and 6 show the PCD of algorithms versus the number of snapshots and the SNR in the condition of the spatial correlation noise.

Numerical simulation results demonstrate that the MDL and AIC methods fail in the case of the spatial correlation noise, and LDFCM has the highest PCD compared with the other three algorithms. It also can be seen that when the SNR reaches saturation which means that the PCD of LDFCM and FCM algorithms both reaches 100%. The PCD of algorithms is less affected by the number of snapshots.

We can compare the PCD of LDFCM algorithm in white noise with that in spatial correlation noise by seeing Figures 3 and 5. LDFCM algorithm has the better performance while in noise environment than that in spatial correlation noise when the SNR is below -5 dB, but when the SNR is better than -5 dB, the PCD of proposed algorithm both can get 1 in both two different noise environments. In addition, the LDFCM algorithm has the highest PCD among all contrast algorithm in both two kinds of noise environments.

6. Conclusion

In this paper, we propose an advanced source number estimation algorithm named LDFCM. This algorithm adds 1D sample which named data local density of eigenvalues, which increase the amount of information available for FCM algorithm from the perspective of information theory. Data local density of eigenvalues expresses the relative distribution relation among multiple eigenvalues; therefore, when the SNR is low, the difference between the maximum noise eigenvalue and the minimum signal eigenvalues are small; the proposed algorithm can distinguish these two eigenvalues into two different kinds by using the local density. The numerical simulation results demonstrate that the LDFCM algorithm can provide higher accuracy of the SNE than the FCM algorithm in both cases, i.e., the white noise and the spatial correlation noise.

Data Availability

No data were used to support this study.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work was supported by the Nanjing University of Posts and Telecommunications Start Foundation (NUPTSF) under grant XK0160919133 and the National Natural Science Foundation of China under grant no. 61802207.

References

J. Shi, F. Wen, and T. Liu, “Nested MIMO radar: coarrays, tensor modeling and angle estimation,” IEEE Transactions on Aerospace and Electronic Systems, vol. 57, no. 1, pp. 573–585, 2021.
View at: Publisher Site | Google Scholar
G. Qin, M. G. Amin, and Y. D. Zhang, “DOA estimation exploiting sparse array motions,” IEEE Trans. on Signal Processing, vol. 67, no. 11, pp. 3013–3027, 2019.
View at: Publisher Site | Google Scholar
F. Wen, J. Wang, J. Shi, and G. Gui, “Auxiliary vehicle positioning based on robust DOA estimation with unknown mutual coupling,” IEEE Internet of Things Journal, vol. 7, no. 6, pp. 5521–5532, 2020.
View at: Publisher Site | Google Scholar
B. Xu and Y. Zhao, “Transmit beamspace-based DOD and DOA estimation method for bistatic MIMO radar,” Signal Processing, vol. 157, pp. 88–96, 2019.
View at: Publisher Site | Google Scholar
F. Wen, C. Mao, and G. Zhang, “Direction finding in MIMO radar with large antenna arrays and nonorthogonal waveforms,” Digital Signal Processing, vol. 94, pp. 75–83, 2019.
View at: Publisher Site | Google Scholar
J. Li, P. Ma, X. Zhang, and G. Zhao, “Improved DFT algorithm for 2D DOA estimation based on 1D nested array motion,” IEEE Communications Letters, vol. 24, no. 9, pp. 1953–1956, 2020.
View at: Publisher Site | Google Scholar
F. Wen, Z. Zhang, K. Wang, G. Sheng, and G. Zhang, “Angle estimation and mutual coupling self-calibration for ULA-based bistatic MIMO radar,” Signal Processing, vol. 144, pp. 61–67, 2018.
View at: Publisher Site | Google Scholar
J. Li, Y. He, P. Ma, X. Zhang, and Q. Wu, “Direction of arrival estimation using sparse nested arrays with coprime displacement,” IEEE Sensors Journal, vol. 21, no. 4, pp. 5282–5291, 2021.
View at: Publisher Site | Google Scholar
F. Wen and J. Shi, “Fast direction finding for bistatic EMVS-MIMO radar without pairing,” Signal Processing, vol. 173, article 107512, 2020.
View at: Publisher Site | Google Scholar
F. Wen, J. Shi, and Z. Zhang, “Joint 2D-DOD, 2D-DOA and polarization angles estimation for bistatic EMVS-MIMO radar via PARAFAC analysis,” IEEE Transactions on Vehicular Technology, vol. 29, no. 2, pp. 1626–1638, 2020.
View at: Publisher Site | Google Scholar
F. Wen, J. Shi, and Z. Zhang, “Direction finding for bistatic mimo radar with unknown spatially colored noise,” circuits, systems, and signal processing, vol. 39, no. 5, pp. 2412–2424, 2020.
View at: Publisher Site | Google Scholar
l. huang and h. c. so, “Source enumeration via mdl criterion based on linear shrinkage estimation of noise subspace covariance matrix,” IEEE Transactions on Signal Processing, vol. 61, no. 19, pp. 4806–4821, 2013.
View at: Publisher Site | Google Scholar
D. B. Williams and D. H. Johnson, “Using the sphericity test for source detection with narrow-band passive arrays,” IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 38, no. 11, pp. 2008–2014, 1990.
View at: Publisher Site | Google Scholar
M. Wax and T. Kailath, “Detection of signals by information theoretic criteria,” IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 33, no. 2, pp. 387–392, 1985.
View at: Publisher Site | Google Scholar
Q. T. Zhang and K. M. Wong, “Information theoretic criteria for the determination of the number of signals in spatially correlated noise,” IEEE Transactions on Signal Processing, vol. 41, no. 4, pp. 1652–1663, 1993.
View at: Publisher Site | Google Scholar
J. L. Xie and X. C. Si, “Determining the number of sources based on diagonal loading to the covariance matrix,” Systems Engineering and Electronics, vol. 29, no. 5, pp. 596–600, 2008.
View at: Google Scholar
B. Nadler, “Nonparametric detection of signals by information theoretic criteria: performance analysis and an improved estimator,” IEEE Transactions on Signal Processing, vol. 58, no. 5, pp. 2746–2756, 2010.
View at: Publisher Site | Google Scholar
S. P. Cedervall, “Detection tests for array processing in unknown correlated noise fields,” IEEE Transactions on Signal Processing, vol. 49, no. 5, pp. 2351–2362, 1997.
View at: Publisher Site | Google Scholar
H. Y. Guo, G. Wei, and L. X. Shan, “Estimation of the source parameters based on combining technique under strong interference,” Application of Electronic Technique, vol. 35, no. 10, pp. 116–118, 2009.
View at: Google Scholar
N. Farsi, B. Escrig, and A. Hamza, “Estimation of the number of signals based on a sequence of hypothesis test and random matrix theory,” in 2015 Third World Conference on Complex Systems (WCCS), pp. 1–5, IEEE, Marrakech, 2015.
View at: Google Scholar
J. G. Mackinnon, “Bootstrap hypothesis testing,” in Handbook of Computational Econometrics, pp. 183–213, John Wiley & Sons, Ltd, 2009.
View at: Google Scholar
Z. Lu, Y. Chakhchoukh, and A. M. Zoubir, “Source number estimation in impulsive noise environments using bootstrap techniques and robust statistics,” in 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, Czech Republic, May 2011.
View at: Publisher Site | Google Scholar
W. Hu, R. Liu, X. Lin, Y. Li, X. Zhou, and X. He, “A deep learning method to estimate independent source number,” in 2017 4th International Conference on Systems and Informatics (ICSAI), pp. 1055–1059, Hangzhou, China, November 2017.
View at: Publisher Site | Google Scholar
L. Zan, C. Xihong, L. Jin, and L. Qiang, “Source number estimation method based on fuzzy C-class mean clustering,” Systems Engineering and Electronics Technology, vol. 2, pp. 244–248, 2019.
View at: Google Scholar
A. Rodriguez and A. Laio, “Clustering by fast search and find of density peaks,” Science, vol. 344, no. 6191, pp. 1492–1496, 2014.
View at: Publisher Site | Google Scholar

Copyright

Copyright © 2021 Na Wu et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

350

Downloads

676

Citations