1 Introduction

ADS-B (automatic dependent surveillance and broadcast) is an important technology for monitoring aircraft operations. However, the effects of signal interference on this method can introduce hazards into aircraft operations. One potential way of addressing this problem is to use an array antenna to estimate the DOA (direction of signal arrival) based on the spatial sparsity of ADS-B signals. Implementing this method requires an accurate signal recovery algorithm that is effective in the presence of a high compression ratio. Compression sensing (CS) was first proposed for breaking the limitation of the Nyquist sampling theorem, by permitting the original sparse signal to be reconstructed with high accuracy from a small amount of sampling data using an optimization algorithm.

CS involves constructing a measurement matrix and applying a recovery algorithm to the sparse signal. There has been a considerable amount of researches about the construction of the measurement matrix, because the information in the original signal preserved during compression sampling plays a vital role in determining the signal recovery performance. Several studies have demonstrated the applicability effectiveness of a Gaussian random matrix [1], Bernoulli random matrix [2], and random local Hadamard matrix as the measurement matrix. However, the application of a random matrix has some drawbacks associated with the presence of random elements, such as the difficulty of implementation in hardware, high storage complexity, and transmission difficulties [3]. These drawbacks mean that using a random matrix cannot ensure the stability and efficiency of compression sampling, which led to the proposal of using a deterministic measurement matrix [4]. The most popular algorithm for constructing deterministic matrices can be divided into four categories based on different generation methods for the matrix elements: (1) a finite framework, such as the expander graph [5]; (2) encoding, such as chirp codes [6] or the Toeplitz deterministic matrix [7]; (3) the maximum Welch boundary, such as that constructed using a difference set algorithm [8] and numerical searching algorithm [9]; and (4) a training mode by optimizing the restricted isometry property of the initial matrix [10] and implemented by a projection algorithm and gradient-based training algorithm.

This paper focuses on the fourth category, namely, on an algorithm for constructing the measurement matrix based on training. The reason is that it can exhibit better signal-recovery performance while not being constrained by the matrix dimensions compared with the other methods. By constructing a sensing dictionary that has the same dimensions and a low mutual coherence with the measurement matrix, the sensing dictionary included in the reconstruction algorithm can achieve a better sparse approximation to the original signal. Consequently it improves the accuracy of signal recovery. The previous study demonstrated that a measurement matrix with a smaller mutual coherence can significantly improve the signal recovery performance [11]. This study proposes a measurement matrix and sensing dictionary cooperative construction algorithm (MSCA) to improve the accuracy of the original sparse signal by utilizing a sensing dictionary.

The remainder of this paper is organized as follows. In Section 2, the conventional sensing dictionary method is introduced. In Section 3, a mathematical model for co-constructing a measurement matrix and sensing dictionary is firstly proposed. Then, the measurement matrix and sensing dictionary cooperative construction (MSCA) algorithm is proposed to solve the model. In Section 4, a series of simulations are conducted. In Section 5, conclusions and future research directions are presented.

2 Sensing dictionary

For a one-dimensional sparse signal, x ∈ RN × 1, with a measurement matrix Φ ∈ RM × N, M ≪ N. The compressed signal can be described as y = Φx, y ∈ RM × 1. The viewpoint of energy loss also means the information loss in the signal during compression sampling. Suppose that Φ = I and it holds that \( {\left\Vert x\right\Vert}_2^2={\left\Vert \varPhi x\right\Vert}_2^2 \). This means that the energy of the original signal can be preserved 100%, although this is hardly ever the case for most Φ ∈ RM × N when M ≪ N. It is obvious that a smaller difference between measurement matrix Φ and the orthogonal matrix will maintain the energy in the original signal and so improve the ability to perform the reconstruction. On the other hand, from the viewpoint of signal recovery, the state-of-the-art OMP reconstruction algorithm can be described as follows: calculate b = ΦTy and determine the maximum in vector b, whose position corresponds to the position of the atom in Φ that matches best up to y. Suppose that Φ satisfies ΦTΦ = I, then b = ΦTy = ΦTΦx = x. A precise signal reconstruction can be obtained in this situation, namely, the identified elements in the supporting set are 100% accurate. The above assumptions indicate that constructing internal atoms of measurement matrix Φ requires a low mutual coherence; however, this situation is almost never achieved since Φ is not a square matrix.

In order to resolve the above difficulties, a sensing dictionary ∑ that has the same dimensions and a low mutual coherence with Φ needs to be constructed. ∑TΦ has greater orthogonality than does ΦTΦ. And then, we use the sensing dictionary to take place of the measurement matrix to reconstruct the signal, since of the coherence between internal atoms of sensing dictionary is lower than measurement matrix, and the sensing dictionary could better save the information of the original signal. As a result, we could achieve a better sparse approximation to the original signal in signal recovery. The elements of the support set are then identified using a greedy algorithm such as the OMP algorithm. This algorithm can be described as follows:

$$ i=\underset{1\le i\le N}{\mathrm{argmax}}\mid \left\langle {\varepsilon}_i,y\right\rangle \mid $$
(1)

where εi is the ith atom of sensing dictionary ∑.

The analysis above shows that the goal of constructing the measurement matrix is to reduce the coherence between the internal atoms. The t-average coherence coefficient is defined as [12, 13]

$$ {\mu}_t\left(\varPhi \right)\triangleq \frac{\sum_{1\le i,j\le N,i\ne j}\left(|G\left(i,j\right)|\ge t\right)\mid G\left(i,j\right)\mid }{\sum_{1\le i,j\le N,i\ne j}\left(|G\left(i,j\right)|\ge t\right)} $$
(2)

where G(i, j) is the element at the ith row and jth column of the Gram matrix G = ΦTΦ. The t-average coherence coefficient is the average value of all non-diagonal elements with an absolute value greater than a specific threshold t in matrix G. The sensing dictionary is constructed with the aim of ∑TΦ being a sufficiently accurate approximation to I. In this situation, the inner product of each corresponding atom in ∑ and Φ obey

$$ \Big\{{\displaystyle \begin{array}{c}\mid \left\langle {\varepsilon}_i,{\phi}_i\right\rangle \mid =1,\mathrm{if}\;i=j\\ {}\mid \left\langle {\varepsilon}_i,{\phi}_i\right\rangle \mid =0,\mathrm{if}\;i\ne j\end{array}} $$
(3)

The sensing dictionary and measurement matrix were constructed cooperatively in this study based on a revised t-average index. The basic idea is to first set up the inner product between ϕi and εi equal 1, according to Eq. (3), and then calculate G' = ∑TΦ and tighten the non-diagonal elements so that they gradually approach 0 using a greedy algorithm until μt(∑, Φ) satisfies threshold t. After these steps are finished, a pair of Φ and ∑ can be constructed. This process can be described as follows:

$$ \underset{\varPhi, \sum }{\mathrm{argmin}}\;{\left\Vert {\sum}^T\varPhi -I\right\Vert}_F^2 $$
(4)

The value of threshold t can be set to > 0 in order to reduce the number of iterations. Tropp et al. [14] demonstrated that the minimum achievable value of the non-diagonal elements was \( {t}_E=\sqrt[\pm ]{\frac{n-m}{m\left(n-1\right)}} \) for a matrix with the ETF (equidimensional tight framework) property. Because G' is similar to the ETF matrix, we set t = tE. Equation (4) can be modified to

$$ \underset{\varPhi, \sum }{\mathrm{argmin}}\;{\left\Vert {\sum}^T\varPhi -H\right\Vert}_F^2 $$
(5)

where matrix H ∈ RN × N and has unit diagonal elements and non-diagonal elements equal to tE sign(G'(i, j)).

3 Co-construction algorithm

In order to evaluate the performance of the sensing dictionary, the t-average mutual coherence coefficient is proposed:

$$ {\mu}_t\left(\varSigma, \varPhi \right)\triangleq \frac{\sum_{1\le i,j\le N,i\ne j}\left(|{G}^{\hbox{'}}\left(i,j\right)|\ge t\right)\mid {G}^{\hbox{'}}\left(i,j\right)\mid }{\sum_{1\le i,j\le N,i\ne j}\left(|{G}^{\hbox{'}}\left(i,j\right)|\ge t\right)} $$
(6)

where G'(i, j)is the ijth element of Gram matrix G' = ∑TΦ. μt(∑, Φ) calculates the average value of all non-diagonal elements in G' with absolute values greater than the specific threshold t. A smaller μt(∑, Φ) will make ∑TΦ closer to I and result in the sensing dictionary exhibiting better performance.

Measurement matrix Φ and sensing dictionary ∑ can be obtained by solving Eq. (8), which can be decomposed into the following two subproblems:

figure a

These subproblems can be alternately solved using an iterative process, with the performance of Φ and ∑ then evaluated using Eqs. (2) and (6) in each iteration. If the variation between two results of consecutive iterations is less than threshold ξ, or the number of iterations exceeds a defined maximum, the algorithm is terminated and Φ and ∑ are outputted.

3.1 Gradient-based approach for constructing the measurement matrix

A descending gradient-based method was adopted for solving problem ① in this study that can reduce the value of nondiagonal elements of G = ΦTΦ [15], namely, the mutual coherence of the internal atoms in Φ can be reduced gradually. The optimization routine can be described as follows:

  1. 1)

    Define the cost function as \( C={\left\Vert {\varPhi}^T\varPhi -H\right\Vert}_F^2 \);

  2. 2)

    Calculate the gradient of the cost function:

$$ \frac{\partial C}{\partial \varPhi }=\frac{\partial }{\partial \varPhi } Tr\left\{{\left({\varPhi}^T\varPhi -H\right)}^T\left({\varPhi}^T\varPhi -H\right)\right\} $$
(7)

Simplify (7) with the matrix derivative rule [6] to

$$ \frac{\partial C}{\partial \varPhi }=4\varPhi \left({\varPhi}^T\varPhi -H\right) $$
(8)
  1. 3)

    The full update equation will then be

$$ {\varPhi}_{\left(k+1\right)}={\varPhi}_{(k)}-\beta \frac{\partial C}{\partial {\varPhi}_k} $$
(9)

where k is the number of iterations and β is the step length.

  1. 4)

    Calculate the t-average coherence coefficient of Φ using Eq. (2), and estimate whether the variation between two results of consecutive iterations is less than threshold ξ; if so, terminate the algorithm and output Φ.

The above method is used to update Φ so as to gradually reduce cost function C and finally converge to a local optimal value. The convergence of the above method is proved in [16, 17]. The paper above proves that the algorithm performs to the maximal iterations and gets the convergency value when β takes a minimum of 0.001. The rate of convergence and the convergency value be expressed by μt(Φ) are equivalent when 0.005 ≤ β ≤ 0.01, namely, the value of β in this range has little effect on the convergence of gradient algorithm.

3.2 Tightening and SVD-based approach for the sensing dictionary

Measurement matrix Φ will be obtained after solving problem ①. Problem ② can be solved based on the previous results. There are two key points associated with solving problem ②: (1) take full advantage of the correlation between matrices ∑ and Φ and (2) ensure that the mutual coherence between Σ and Φ, namely μt(∑, Φ), is as low as possible. In order to implement these two points in the process of solving the problem, this study proposed the following methods: (1) construct matrix G' = ∑TΦ, (2) shrink non-diagonal elements of G' using the tightening operator to gradually make G'close to H, and (3) obtain a pair of sensing dictionary and measurement matrix by SVD decomposition [18].

The numerical range of non-diagonal elements of matrix G' is [– 1, 1] since Φ and ∑ were normalized in the initial stage, and this range could be further reduced to [−γ, γ] (γ < 1) using a tightening operator. A simple operator for mapping [– 1, 1] to [−γ, γ] is [21]

$$ \rho =\frac{4}{\pi}\gamma \arctan \left({G}^{\hbox{'}}\left(i,j\right)\right) $$
(10)

The tightening operator only needs one parameter to adjust the scope of non-diagonal elements of G in the iteration process; we set γ = 0.4 in this study.

Finally, matrix G' is decomposed by SVD to achieve a pair of Φ and ∑ [19]:

$$ {G}^{\hbox{'}}={U}^T VW $$
(11)

In order to make G' closer to H, we arranged all diagonal elements in V from the top left to bottom right and retained the maximum M elements in a new matrix VM [20]. Φ and ∑ could then be constructed as follows:

$$ \sum ={V}_M^{\frac{1}{2}}U,\kern0.36em \varPhi ={V}_M^{\frac{1}{2}}W $$
(12)

In order to ensure the inner-product corresponding atoms between Φ and ∑ equal 1, the following equations should be used:

$$ {\phi}_i=\frac{\phi_i}{{\left\Vert {\phi}_i\right\Vert}_2},\kern0.48em {\varepsilon}_i=\frac{\varepsilon_i}{\mid \left\langle {\varepsilon}_i,{\phi}_i\right\rangle \mid } $$
(13)

This approach constructs a pair of sensing dictionary ∑ and measurement matrix Φ with low mutual coherence. The MSCA is listed in Algorithm 1.

figure b

Algorithm 1.

The MSCA algorithm

4 Simulations and discussion

We used simulations to verify the validity of the MSCA. Suppose that the length of the original sparse signal is N = 256, the measurement matrix is Φ, and the sensing dictionary is ∑ with Φ, ∑  ∈ RM × N, where M is the number of measurements. In order to access the performance of matrix Φ or ∑ for different compression ratios, we varied M from 100 to 150 with a step size of 5. We adopted the matrix coherence coefficient and t-average mutual coherence coefficient as the evaluation indices, and the OMP algorithm was used for signal reconstruction. In addition, a Gaussian random matrix and gradient algorithm [21] were selected for comparison. The corresponding Welch boundary was used to compare the performances of the various algorithms. The simulation results are shown in Fig. 1.

Fig. 1
figure 1

Performances of the measurement matrices constructed using various algorithms. a Matrix coherence coefficient. bt-average coherence coefficient. ct-average mutual coherence coefficient

Figure 1 indicates that the MSCA can reduce the coherence of the matrix compared with the Gaussian random matrix and the gradient algorithm. Figure 1a shows that both the MSCA and the gradient algorithm are more effective than the Gaussian random matrix in reducing the coherence of Φ, and the performance of the MSCA is slightly better than that of the gradient algorithm: based on the gradient algorithm, the coherence coefficient of the matrix decreased gradually from 0.22 to 0.17 as the measured signal length changed, while the index decreased gradually from 0.11 to 0.08 based on the MSCA, which is closer to the Welch boundary. Figure 1b presents the results for the t-average coherence coefficient as an index to evaluate the performance of Φ between the various algorithms. The MSCA was still closer to the Welch boundary than for the Gaussian random matrix and the gradient algorithm. Figure 1c shows the performance of the sensing dictionary when comparing the t-average mutual coherence coefficient of ∑ and the t-average coherence coefficient of Φ. The figure indicates that for the same signal compression ratio, the t-average mutual coherence coefficient of ∑ was smaller than the t-average coherence coefficient of Φ. Namely, the average of the off-diagonal elements of G' = ∑TΦ is less than the G = ΦTΦ, which means that ∑TΦ approaches I more closely on the premise of diagonal elements equal 1 [22, 23]. Collectively, the t-average mutual coherence coefficient of the sensing dictionary could be closer to the Welch boundary, so that the original signal support set can be identified more accurately through ∑ [24].

We compared the reconstruction performances of the abovementioned algorithms. It is clear from Fig. 2 that the reconstruction performances of the gradient algorithm and the MSCA were better than that of the Gaussian random matrix. In Fig 2a, some signal components are not recovered because of the internal atoms of the Gaussian random matrix have bigger mutual coherence than other two matrix [25].

Fig. 2
figure 2

Performance in reconstructing sparse sampled signals for various algorithms. a Gaussian random matrix. b Gradient algorithm. c MSCA

In order to accurately estimate the precision of the recovery result, we calculated the reconstruction error as

$$ \mathrm{error}\_\mathrm{reconstruction}=\frac{{\left\Vert x-\overset{\frown }{x}\right\Vert}_2}{{\left\Vert x\right\Vert}_2} $$
(14)

We remain the size of the original signal unchanged in the simulation and set the signal sparsity as a variable. Signal sparsity indicates the number of non-zero bits in the signal. The results show the signal reconstruction under different signal sparsity and with competing measurement dictionary methods, namely Gaussian random matrix and the gradient matrix [26]. Since the original constructed sparse signal is random, the simulation was repeated 100 times with different signal sparsity, and the average error was calculated. The results are presented in Fig. 3.

Fig. 3
figure 3

Average reconstruction error

We construct a pair of measurement matrix and sensing by the use MSCA algorithm in this simulation; the MSCA measurement matrix in Fig. 3 indicates the average reconstruction error based on MSCA measurement matrix only. Correspondingly, the MSCA cooperative construction in Fig. 3 indicates the average reconstruction error based on measurement matrix and MSCA sensing dictionary cooperatively. While, we indicate the MSCA sensing dictionary to show the result obtained reconstructing the signal with the measurement matrix lonely. Figure 3 shows that the reconstruction error was smallest for the MSCA-cooperative construction in the same signal sparsity, followed by the MSCA-measurement matrix and gradient algorithm, and highest for the Gaussian random matrix. Moreover, the MSCA was better than the Gaussian random matrix in terms of reconstruction stability. Result of simulation proves that the sensing dictionary plays a better performance in reconstruction.

In order to verify the processing ability of the MSCA algorithm to the image, we use the MSCA matrix to compress an image like Fig. 4a. By the way, this paper only simulates the binary image for the purpose of simplifying the process of compression and reconstruction [27, 28]. The stimulation experimental flow is as follows: (1) converting the original image to a 0–1 matrix ϒ with ϒ ∈ RM × N by binarization; (2) N pair of measurement matrix Φi and sensing dictionary ∑i with Φi, ∑i ∈ Rm × M, m < M be constructed by using MSCA algorithm; (3) compressive sampling: Y = Φϒ, yi = Φiϒi, i = 1, 2, …, N; (4) the signal ϒ'is reconstructed by using the sensing dictionary-based OMP algorithm [29]; and (5) image reconstruction and the result are shown as the Fig. 4c.

Fig. 4
figure 4

Compression sensing for image. a Original image. b Reconstructed image by Gaussian matrix. c Reconstructed image by gradient matrix. d Reconstructed image by MSCA sensing dictionary

It is observed that the results of image reconstruction by MSCA sensing dictionary and gradient matrix are better than the Gaussian random matrix by comparing Fig. 4 a, b, c, and d. In order to compare the effects of compression sensing of the three methods on the image signal more clearly, we calculate the reconstruction error as formula (14). The results are shown in the Table 1. Specific results are as follows:

Table 1 The accuracy of reconstruction by the three methods

This illustrates that the MSCA matrix could retain the information of the original signal better than the Gaussian random matrix and gradient matrix, and the coupled sensing dictionary also could be used in the reconstruction for signal [30, 31].

5 Conclusion

This study addressed the optimization of compression measurements. In this work, a mathematical model for co-constructing a measurement matrix and sensing dictionary was proposed. Then, the measurement matrix and sensing dictionary cooperative construction (MSCA) algorithm is proposed to solve the model at a faster rate. The proposed MSCA makes full use of the inherent correlation between the measurement matrix and sensing dictionary. The advantage of our approach is that it involves constructing the measurement matrix and sensing dictionary cooperatively rather than dividing this into two separate processes, which should also improve the performance in sparse recovery. The t-average mutual coherence coefficient was defined for evaluating the performance of the sensing dictionary. The results obtained from simulations have shown that the sensing dictionary has a lower mutual coherence and can successfully reconstruct the sparse signal using the proposed algorithm. Compared with the state-of-art algorithms, the simulation results showed that MSCA is better than the other algorithms in ensuring the low coherence of the measurement matrix and that ∑ has better coherence than Φ.

Based on the results of the proposed cooperative construction method for the measurement matrix and sensing dictionary in this work, we will continue the following work: (1) sparse reconstructed algorithm based on sensing dictionary [32] and (2) adaptive sparse representation compatible with sensing dictionary [33].