Next Article in Journal
Photothermal Sensing of Nano-Devices Made of Graphene Materials
Previous Article in Journal
Energy-Efficient Cluster Management Using a Mobile Charger for Solar-Powered Wireless Sensor Networks
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Cost-Reference Particle Filter for Cognitive Radar Tracking Systems with Unknown Statistics

School of Electronics and Information, Northwestern Polytechnical University, Xi’an 710072, China
*
Author to whom correspondence should be addressed.
Sensors 2020, 20(13), 3669; https://doi.org/10.3390/s20133669
Submission received: 23 May 2020 / Revised: 26 June 2020 / Accepted: 28 June 2020 / Published: 30 June 2020
(This article belongs to the Section Intelligent Sensors)

Abstract

:
A novel robust particle filtering algorithm is proposed for updating both the waveform and noise parameter for tracking accuracy simultaneously and adaptively. The approach is a significant step for cognitive radar towards more robust tracking in random dynamic systems with unknown statistics. Meanwhile, as an intelligent sensor, it would be most desirable for cognitive radar to develop the application of a traditional filter to be adaptive and to expand the adaptation to a wider scope. In this paper, after analysis of the Bayesian bounds and the corresponding cost function design, we propose the cognitive radar tracking method based on a particle filter by completely reconstructing the propagation and the update process with a cognitive structure. Moreover, we develop the cost-reference particle filter based on optimizing the cost function design according to the complicated system or environment with unknown statistics. With this method, the update of the estimation cost and variance arrives at the approximate optimization, and the estimation error can be more adjacent to corresponding low bounds. Simulations about the tracking implementation in unknown noise are utilized to demonstrate the superiority of the proposed algorithm to the existing methods in traditional radar.

1. Introduction

1.1. Problem Statement

Cognitive radar can basically be defined as: an intelligent radar system of hardware and software in which the transmit and receive parameters (i.e., pulse length, pulse repetition frequency (PRF), modulation, power, frequency, and polarization) are selected, in real-time, and use adaptation between the information extracted from the sensor/processor and the design and transmission of subsequent waveforms, in response to the observed scene to optimize the performance of a given application. The problem of target tracking in cognitive radar system has received considerable attention. It is well known that most of the state-space dynamical systems are nonlinear or non-Gaussian. For example, the multi-state transition of a drone from hovering to maneuvering is often nonlinear, and the measurement noise is mainly flicker noise or heavy-tailed noise [1].
Despite the performance decline of extended Kalman filter (EKF) and unscented Kalman filter (UKF) in highly nonlinear problems, and that general analytical solutions are intractable in nonlinear or non-Gaussian systems, solutions continue to emerge from different viewpoints. However, most of the methods rely on the assumption that the noise has known statistics, or they require accurate mathematical representation of the dynamics of the system evolution; otherwise, it is almost impossible to directly approximate the true distribution. In practice, sometimes the assumption is in accordance with the actual situation, but sometimes not. The same situation occurs in cognitive radar systems. Without explicit mathematical models or a priori information, how cognitive radar can still exert its advantages is a concern of this article.
We briefly review various existing methods to cognitive radar tracking problem that involve particle filter (PF) in some relevant manner.

1.2. Related Works

For handling arbitrary nonlinear models and arbitrary noise distributions, methods based on Monte Carlo (MC) methodology have emerged in Bayesian estimation. MC is a simulation-based method aimed at estimating the posteriori pdf of the state given the observations. Markov Chain Monte Carlo (MCMC) and Sequential MC (SMC) are two main tools in it, and they can sample from high dimensional probability distributions. The performance of MCMC would be unreliable when the proposal distributions that are used to explore the space are poorly chosen and/or if highly correlated variables are updated independently. While, PF became popular for it is particularly suitable for real-time estimation.
The Sequential Importance Sampling (SIS) algorithm was used for the first time to solve the problem of nonlinear filtering [2]. The formal establishment of PF was attributed to the proposal of resampling technology [3,4]. Meanwhile, the idea of Sampling Importance Resampling (SIR) was discovered and developed [5]. PF is allowed to express a complete and precise state posterior distribution, so any statistical data such as mean, variance, and modulus can be easily calculated, and theoretically, the accuracy is higher than that of EKF and UKF in nonlinear systems [6]. Hence, despite its computational pressure, it is still quite attractive to us. During the development of PF, there are several inevitable drawbacks need to be addressed: (1) How to approximate the optimal proposal distribution further; (2) How to overcome the problems of weight degeneracy, sample impoverishment, etc., to make the resampling more effective; (3) How to make the algorithm efficient and online.
Therefore, more variants of PF were produced [7]. Efficient importance sampling techniques were studied in [8,9] for the first problem, but these algorithms require that the posterior distribution of the states are assumed as a priori known and can be approximated by a Gaussian distribution. An auxiliary variable particle filter was used to deal with the second problem [10], but the filtering performance degrades when the state noise is strong, and because the likelihood function and weight value need to be calculated twice for each particle, the calculation amount increases. Adaptive PF (APF) can release the computation burden by adjusting the number of particles dynamically [11,12,13,14,15]. This method chooses a small particle number if the density is focused on a small part of the state-space, and chooses a large number if the state uncertainty is high [16]. KLD-APF is based on Kullback–Leibler (KL) information or KL distance (KLD) sampling, but it ignores any mismatch between the true and the proposal distribution [17]. When the mismatch happens, the addition of particle number will only increase the computational load. On the other hand, reducing the particle number may aggravate the sample impoverishment and further weaken the effect of resampling.
In addition to these drawbacks, PF would still be invalid when some issues are encountered. PF requires that the conditional pdf of the observed variable can be estimated, otherwise the weight of the particles cannot be calculated. Thus, particle MCMC (PMCMC) was proposed by combing SMC with MCMC as an efficient approach [18]. Based on PMCMC, the SMC2 algorithm is motivated to tackle the intractable problem of probable increments in state-space models [19]. Similar to the SMC2 scheme, nested particle filters for online parameter estimation were proposed but in a purely recursive manner, in order to address the problem of approximating the posterior probability distribution of the fixed parameters of a state-space dynamical system [20]. On robustness, cooperative parallel particle filters were designed for the dual purpose of Bayesian inference and on-Line Model Selection, with the online adaptation for the particle number [21]. Particle learning was proved in [22] to be outperforming existing PF alternatives and a competitor to MCMC. In the presence of model uncertainty where discrete data are encountered, a new SMC method was proposed for the filtering and prediction of time-varying signals [23]. A similar method with better predictive powers was proposed in [24], wherein the resampling step could be dynamically adjusted and the predictive powers could be updated sequentially as more data were observed. Moreover, what deserves attention is that Bayes is not the only choice, as a neural filter based on GRNN is also outperforming numerical filters in state vector estimation during dynamic changes of target movement parameters [25].
Inspired by the previous research, we might find a solution independent of the prior information through updating the proposal distribution and its stepwise approximation to the true distribution by iterative methods. The authors of [26] designed a cost function in PF to iteratively update the state and variance when the prior information is unknown. Naturally, it has been applied to many areas such as target tracking, autonomous vehicle positioning, sensor network, orthogonal frequency division multiplexing (OFDM) systems, wireless local area network (WLAN), and tilt estimation [27,28,29]. There are some improvements to the algorithm. In [30], a particle selection algorithm was proposed and analyzed for implementation with parallel computing devices and to circumvent the main drawback of the conventional resampling techniques. Authors of [31] melded random measures of two or more cost-reference particle filters to obtain a fused random measure that combines the information from the individual cost-reference particle filters. However, there is no further study on the optimization of the cost function, especially when the tracking structure can be adaptive. Cognitive radar designs the optimal estimator for Bayesian framework in [32], but the majority of the existing Bayesian tracking methods for cognitive radar applications are based on the Kalman filter for linear systems [33,34], and Kalman-like solutions, e.g., cubature KF (CKF) and continuous-discrete (CD)-CKF for nonlinear problems [35]. Few researches use PF. In [36], a particle filter combined with probabilistic data association is used as a tracker. In [37], a cognitive structure was designed as only a part of PF, namely, a parallel structure of PF, while EKF was used by waveform selection to adapt the particle number and reduce the computation cost.

1.3. Contributions and Organization of the Paper

When we specify PF as the nonlinear tracking method in a cognitive radar, we consider using the cognitive structure to expand the state-space of PF or its variants to another dimension where the waveform parameters can be changed from fixed to dynamic, and we can optimize the cost function design and corresponding lower bound of the estimation error. In turn, using the particle filter tracking method based on the optimal cost function can expand the scope of the application of cognitive radar. The main contributions of this paper will be presented by the following items:
(1) The cognitive radar tracking method based on PF is proposed and the mathematical model of which it is derived for the first time. Push the cognitive radar tracking framework from existing the Kalman-like to the developing SMC-like. Not only the data process but also the waveform design is in the PF iteration, that is, a fully cognitive PF.
(2) Refine the idea of cost-reference PF with cognitive framework. When the estimation of the parameters (i.e., signal-to-noise ratio (SNR)) in cognitive radar mismatches the real situation, a novel cognitive cost-reference particle filter algorithm is proposed to bring about robustness to the existing cognitive radar tracking methods and intelligence for the current adaptive method. The convergence is proofed mathematically.
(3) The Cramér–Rao Lower Bound (CRLB) of the proposed cognitive PF is derived, and the corresponding cost function is designed.
The rest of this paper is organized as follows: the standard PF, the mathematical model of cognitive radar, and the interface of PF to cognitive algorithm are presented, and the CRLB of the cognitive PF is derived in Section 2. The principle of the cost-reference particle filter is restated, the proposed cognitive scheme is presented along with the implementation steps, and the convergence of the solution is proofed in Section 3. Section 4 shows the dynamic model, the maneuvering target tracking in unknown non-Gaussian noise, and the simulation results. Section 5 presents the conclusions.

2. PF for Cognitive Radar Tracking

In this section, the cognitive tracking method based on PF is formed to address the non-linear and non-Gaussian state estimation problem in cognitive radar.

2.1. PF and Cognitive Radar Model

2.1.1. Standard PF

The basic idea is to generate a set of random samples in the state-space according to the empirical conditioning distribution of the system state vector. They are called particles, the weight and position of which are continuously adjusted according to the measurement, and the initial empirical conditioning distribution is modified. The MC estimate of the integral can be written as:
E [ f ( x ) ] = f ( x ) p ( x ) d x = f ( x ) [ p ( x ) / q ( x ) ] q ( x ) d x .
Let ω ( x ( i ) ) = p ( x ( i ) ) / q ( x ( i ) ) denote the importance weights. Equation (1) is calculated by generating N 1 independent samples { x ( i ) : i = 1 , , N } distributed according to q ( x ) . The weighted sum is formed and normalized as:
E N [ f ( x ) ] = 1 N i = 1 N f ( x ( i ) ) ω ( x ( i ) ) ,
E [ f ( x ) ] = 1 N i = 1 N f ( x ( i ) ) ω ( x ( i ) ) 1 N j = 1 N ω ( x ( j ) ) = i = 1 N f ( x ( i ) ) ω ˜ ( x ( i ) ) .
The weight update equation can be shown to be:
ω ( x k ( i ) ) = ω ( x k 1 ( i ) ) p ( z k | x k ( i ) ) p ( x k ( i ) | x k 1 ( i ) ) q ( x k ( i ) | x k 1 ( i ) ) .
Substitution of q ( x k ( i ) | x k 1 ( i ) ) = p ( x k ( i ) | x k 1 ( i ) ) into Equation (4) yields:
ω ( x k ( i ) ) = ω ( x k 1 ( i ) ) p ( z k | x k ( i ) ) .
When N , the posterior probability density can be approximated by the following:
p ( x k | z 1 : k ) i = 1 N ω ( x k ( i ) ) δ ( x k x k ( i ) ) ,
where δ ( ) denotes the Dirac delta measure, with defining properties: (1) δ ( x k x k ( i ) ) = 0 for x k x k ( i ) , (2) δ ( x k x k ( i ) ) = + for x k = x k ( i ) ; and (3) δ ( x k x k ( i ) ) d x k = 1 . More details about the SIR-PF can refer to [38]. It has been termed standard PF (SPF) in the simulation below. The accuracy approximates the optimal estimation.

2.1.2. Cognitive Radar System Model

The radar transmitter through the emission exploiting waveform selection stimulates the background with the goal to obtain a response from it such as a radar echo. The mentioned response is perceived by radar receiver, which plays the equivalent role of the human senses. The detector performs a low-level processing of received sensor data. The scene analyzer is conceived by exploiting previously acquired information to estimate the statistical characteristics of the operating environment. The tracker acts as a high-level processor.
The functions manager receives requests for radar activities (e.g., search, tracking) from the keyboard according to a specific mode of working or operator requirements. The functional interaction is with the radar hardware, the signal processor, the data processor, and the external peripherals. Starting from the state estimate x ^ k 1 and state error covariance at time step k − 1, the transmitter uses waveform parameter θ k 1 to illuminate the environment. The receiver performs a prediction and obtains x ^ k ( i ) (i.e., Equation (49)) with the previous value and current measurement. Then, x ^ k ( i ) is utilized to evaluate the CRLB, and thus, the measurement error covariance matrix R k ( θ ) . For each waveform of the library, the predicted error covariance matrix P k is computed for evaluation (i.e., Equation (5) in Algorithm 1). At this point, a cost function (i.e., Equation (18)) is to be formulated, and the waveform at the next transmission is chosen as the one that minimizes the weighted mean square error (MSE) on the target state, namely minimizing the cost function by the optimal waveform design or parameters adjustment. The transmitted signal then provides a new optimized measurement z k to feed the PF processor and track the target. The obtained target state estimate x ^ k is finally exploited to predict the next target state x ^ k + 1 ( i ) , and optimizes the new transmitted waveform according to the closed-loop paradigm. This constructs the PAC in cognitive radar [39].
Summarizing, the block scheme of a cognitive tracking system architecture and the information-flow in the mathematical model is displayed in Figure 1. Notice that, if the blocks in red are cancelled, the rest blocks in blue degrade to the traditional radar system.

2.2. PF-Based Cognitive Tracking Algorithm

2.2.1. Bayesian Bounds for Cognitive Radar

The recursive Bayesian estimators, such as the EKF or PF, work in the non-linear system employing the feed-forward processing chain, and do not consider the feedback information from the receiver to the transmission of subsequent waveforms. So far, the corresponding recursive BCRB on the MSE matrix of the state vector was derived and presented in [40]. It can be obtained by calculating the Fisher information matrix (FIM) [41], the inverse of which is the unbiased estimator of the CRLB of the state estimation error covariance [42]. In this paper, the PF based on the cognitive structure characterized by feedback is analyzed, and the estimation is biased. Bayesian FIM (BIM) for this case can be developed and computed iteratively using this form [43]:
B k ( θ k | z 1 : k 1 ; Θ k 1 ) = B k ( θ k | z 1 : k 1 ; Θ k 1 ) + J k ( θ k | z 1 : k 1 ; Θ k 1 ) ,
where B k ( θ k | z 1 : k 1 ; Θ k 1 ) represents the contribution of the prior information is prior, called the prior term, and is denoted as J P ; the J k ( θ k | z 1 : k 1 ; Θ k 1 ) represents the contribution of the data, called data term and denoted as J D .
J D = E k { J F ( x k ) } = E k { E z k | x k ; θ k { 2 ln f ( z k | x k ; θ k ) x k 2 } } ,
J P = E x k { 2 ln f ( x k ) x k 2 } .
The lower bound is defined as the inverse of the BIM, as the form:
B C R B = [ J D + J P ] 1 .
The lower bound of CR would be smaller than the general BCRB, which can be proved as:
[ J D ( θ k ) + J P ( θ k ) ] 1 [ J D + J P ] 1 = B C R B .

2.2.2. Cost Function Design

As we know, CRLB is the possible MMSE of an unbiased estimation. It can be proved that minimizing the trace of the covariance is equal to the MMSE of the expectation of tracking state [38]; thus, we choose the minimum trace with respect to the waveform parameters as the processor cost function modeled by:
C Θ ( θ k | z 1 : k 1 ; Θ k 1 ) = E x k   z k | I k   θ k [ ( x k x ^ k ( z k ) ) T ( x k x ^ k ( z k ) ) ] tr { B k ( θ k | z 1 : k 1 ; Θ k 1 ) 1 } ,
where tr ( ) is an operator that extracts the trace of B k ( θ k | z 1 : k 1 ; Θ k 1 ) 1 , and the waveform parameters are denoted as Θ { θ 1 , θ 2 , , θ k } . x ^ k + 1 ( I k , x k + 1 , z k + 1 ; θ k ) is the posterior predictive value of the state estimation given waveform parameter. Since x ^ k + 1 is related to z k + 1 , which is a nonlinear mapping of x k + 1 , we use approximation rather than integration to solve this expectation. C Θ ( θ k | z 1 : k 1 ; Θ k 1 ) is approximated by tr { B k ( θ k | z 1 : k 1 ; Θ k 1 ) 1 } [33,35]. The model of P k + 1 | k + 1 in KF can refer to [44], while in PF, P k + 1 | k + 1 is modeled by a different form [38]. More details of the relation between P k + 1 | k + 1 and R k ( θ k 1 ) are referred to in [35]. We derive the model of measurement noise covariance based on waveform selection as [45]:
R k ( θ k 1 ) = Γ B k ( θ k | z 1 : k 1 ; Θ k 1 ) 1 Γ = Γ U 1 ( θ k 1 ) Γ / η ,
Γ diag [ c 2 , c 4 π f c ] ,
where η = 2 E R / N 0 is the SNR, and N 0 denotes the spectral density of the complex noise envelope n ˜ ( k ) . U ( θ k 1 ) is a scaled version of FIM. Γ denotes a symmetric matrix defined as this form. c is the speed of waveform propagation, and f c is the carrier frequency.
When the LFM signal with Gaussian amplitude modulation is selected as the transmit waveform, with the duration of Gaussian pulse λ and the chirp rate b chosen to form the parameter vector θ k 1 = [ λ , b ] , the measurement noise covariance matrix can be modeled by R k , r r ˙ ( θ k 1 ) [41]:
R k ( θ k 1 ) = R k , r r ˙ ( θ k 1 ) = [ c 2 λ 2 2 η c 2 b λ 2 2 π f c η c 2 b λ 2 2 π f c η c 2 ( 2 π f c ) 2 η ( 1 2 λ 2 + 2 b 2 λ 2 ) ] .
The sensor cost function is defined as:
C Θ ( θ k ) = { 0 θ k Θ otherwise .
The loss function is defined with respective to the cost function of the processor and sensor, modeled by:
L C , Θ ( θ k | z 1 : k 1 ; Θ k 1 ) C Θ ( θ k | z 1 : k 1 ; Θ k 1 ) + C Θ ( θ k ) .
The choice of the optimal parameter is equivalent to the optimization problem given by:
θ k = arg min θ k P C Θ ( θ | z 1 : k 1 ; Θ k 1 )   s . t .   θ Θ .

2.2.3. Algorithm and Numerical Simulation

We have restated the principle of the cognitive radar system, and have given the interface from PF to it to induce the human cognition mechanism into the PF algorithm. As with the statements before, SPF is available to be the tracking method of the cognitive radar for a tracking target with a nonlinear dynamic and non-Gaussian distribution. The basic algorithm steps of SPF along with the interface to cognitive tracking framework is given as follows Algorithm 1:
Algorithm 1. Cognitive Radar Tracking Recursion Based on PF
Initialization
 1.     x ^ 0 ( i ) p ( x 0 ) , ω k ( i ) = 1 / N s , i = 1 , , N s
Controller Optimization
 2.     x ^ k ( i ) q ( x k ( i ) | x ^ k 1 ( i ) , z k ) , the mean E p k ( x k | x ^ k 1 ( i ) ) [ x ^ k ] = f x ( x ^ k 1 ( i ) ) , i = 1 , , N s
 3.     ω k ( i ) ( θ ) p ( z k | x ^ k ( i ) ) p ( x ^ k ( i ) | x k 1 ( i ) ) / q ( x k ( i ) | x 0 : k ( i ) , z 1 : k ) ,   i = 1 , , N s ,   normalize   ω ˜ k ( i ) ( θ ) = ω k ( x 0 : k ( i ) ) / i = 1 N s ω k ( x 0 : k ( i ) )
 4.     If   N e f f = 1 / i = 1 N s ( ω ˜ k ( i ) ) 2 < N t h ( empirical ) ,   then   [ { x ^ k ( i ) , ω ^ k ( i ) ( θ ) } i = 1 N s ] = RESAMPLE [ { x k ( i ) , ω ˜ k ( i ) ( θ ) } i = 1 N s ]
 5.     P k ( θ ) = i = 1 N s ω k ( i ) ( θ ) ( x ^ k ( i ) x ^ k ) ( x ^ k ( i ) x ^ k ) T
 6.     θ k * = arg min θ k P [ Tr ( P ¯ k + 1 | k + 1 ( θ k ) ) ] .   Select   the   optimal   θ k = θ k *
Motion Update and Measurement
 7.     f ( x ^ k 1 ( i ) ) = q ( x k ( i ) | x k 1 ( i ) ; θ k ) f ( x ^ k 1 ( i ) ) d x k 1 ,   x ^ k 1 ( i ) = μ k = E k [ x ^ k ]
 8.     y ^ k = arg min y ln f ( z k | y ; θ k )
Information Update and State Estimation
 9.     { x ^ k ( i ) , ω k ( i ) } = { x k ( i ) , ω ˜ k ( i ) ; θ k } ,   i = 1 , , N s ,   x ^ k i = 1 N s ω k ( i ) x ^ k ( i )
As an illustration of the CRLB, consider the scenario that the target is assumed to move with a constant velocity along a straight line in the Cartesian plane. The target kinematic state (position, velocity) [ x 1 , k , x 2 , k , x 3 , k , x 4 , k ] T is estimated from the noise-corrupted measurements z k = [ r k ,   β k ] T + w k with r k = ( x 1 , k 2 + x 2 , k 2 ) 1 / 2 , β k = arctan ( x 2 , k / x 1 , k ) . Thus, the measurement equation is nonlinear. The measurement noise w k is assumed to be white, zero-mean Gaussian, with covariance R = d i a g [ σ r 2 , σ β 2 ] , and with the sampling interval T = 0.1 s. The covariance of the process noise is Q = σ x 2 d i a g [ T 0 , T 0 ] , wherein T 0 = [ 0.5 T 4   0.5 T 3 ;   0.5 T 3   T 2 ] . The initial true state of the target is [ 4000 ,   80 ,   12 , 000 ,   20 ] T , and the given initial state is set as [ 5000 ,   90 ,   15 , 000 ,   20 ] T , with an estimation error covariance of P 0 = 1000 I .
Figure 2 shows the results in terms of posterior Cramér–Rao bound (PCRB) and root mean square errors (RMSEs) in the range estimation. As observed in Figure 2a, when we perform the proposed CPF for target tracking in cognitive radar systems, the MSE is initially larger than PCRB, due to the initialization that is not exactly matched to PCRB. Very soon though, CPF demonstrates a fast convergence. Then, MSE agrees with the theoretical curve for PCRB, presenting a considerably well overall performance. Figure 2b compare the PCRB curves obtained using a fixed waveform in a traditional radar and dynamic waveform in cognitive radar. Obviously, the PCRB in the cognitive radar provides a valid lower bound for all time steps, and the convergence is faster.
As we know, the CRLB does not care about the estimation method, but only reflects the best effect of using the existing available information to estimate the parameters. Thus, it can be seen from the figures that after the cognitive system utilizes the extra environmental information through the closed-loop structure, it is possible to lower the previous bounds further.

3. Cognitive Tracking Problem Model in Unknown Environment

In this section, we develop the cost-reference PF (CRPF) for a cognitive robust filter. This means that we extend the classic cognitive radar scenarios to an unknown environment. Specifically, the general description of the CRPF approach is listed first, then the details on fusing the cost with cognitive method are specified.

3.1. Cost-Reference Particle Filtering Approach

As we all know, SPF works well when the mathematical forms of the probability distributions of the noise are assumed explicit, but when the mathematical representation of the dynamics of the system evolution is unknown a priori, or the assumptions of probabilistic models cannot be achieved, SPF may fail and the estimation results may be inaccurate even if they have the cognitive structure.
Unlike basic PF, which only gives the state vectors in sequential algorithm, CRPF gives the state samples and associated costs, namely the WPS as the form:
Ξ k = { x k ( i ) , C k ( i ) } i = 1 M ,
where C k ( i ) = C ( x 0 : k ( i ) | y 1 : k , λ ) is the cost of the particle x k ( i ) . The cost function can be denoted by a recursive additive structure:
C ( x 0 : k | z 1 : k , λ ) = λ C ( x 0 : k 1 | z 1 : k 1 , λ ) + Δ C ( x k | z k ) ,
where λ is a forgetting factor, which is used to avoid attributing an excessive weight to old observations. Δ C ( x k | z k ) is the incremental cost function, the prediction of which can be obtained as:
Δ C ( f x ( x k 1 ) | z k ) = R ( x k 1 | z k ) ,
where the one-step risk function is introduced, and the risk of particle i can be computed as:
R : R L x × R L y R ,   x k 1 , z k R ( x k 1 | z k ) ,
R k + 1 ( i ) = λ C k ( i ) + R ( x k ( i ) | z k + 1 ) .
We assign the particles in Ξ k + 1 the PMF, which is, according to the cost, defined as:
π k + 1 ( i ) μ ( C k + 1 ( i ) ) ,
where a monotonically decreasing function can be used as μ ( ) , which is selected to guarantee an adequate discrimination of low-cost particles from higher ones. Then, we would obtain the minimum cost by maximum the function below:
x ˜ 0 : k + 1 min = x k + 1 ( i 0 ) , i 0 = arg max i { π k + 1 ( i ) } .
Notice that the mean value here is equal to the minimum cost estimate, and it even has slight advantages over the latter one. More details can be referred to in [26], and the presentation can be found in Algorithm 2.
The overall procedure of the CRPF algorithm is summarized in Algorithm 2.
Algorithm 2. CRPF algorithm for target tracking problem
Initialization
 1.     x 0 ( i ) p 0 ( x 0 ) ,   C 0 ( i ) = 0 ,   σ 0 2 , ( i ) ,   i = 1 , , N s ,   the   weighted - particle   set   Ξ 0 = { x 0 ( i ) , C 0 ( i ) } i = 1 N s
PMF Update
 2.     R k ( i ) = λ C k 1 ( i ) + z k f y ( f x ( x k 1 ( i ) ) ) q ,   q = 1 , 2 ,   π ^ k ( i ) μ ( R k ( i ) ) = 1 ( R k ( i ) min { R k ( i ) } i = 1 N s + δ ) β ,   i = 1 , , N s .
 3.     { x ^ k 1 ( i ) , C ^ k 1 ( i ) } i = 1 N s = RESAMPLE [ { π ^ k ( i ) } i = 1 N s ]
Particle Propagation and Variance Update
 4.     E p k ( x k | x ^ k 1 ( i ) ) [ x k ] = f x ( x ^ k 1 ( i ) ) ,   C o v p k ( x k | x ^ k 1 ( i ) ) [ x k ] = σ k 2 , ( i ) I [ x ] ,   x k ( i ) p k ( x k | x ^ k 1 ( i ) ) .
 5.     σ k 2 , ( i ) = { σ k 1 2 , ( i ) t 10 k 1 k σ k 1 2 , ( i ) + x k ( i ) g ( x ^ k 1 ( i ) ) 2 k × dim [ x ] t > 10 ,   i = 1 , , N s
State Estimation
 6.     C k ( i ) = λ C k 1 ( i ) + z k f y ( x k ( i ) ) q ,   π ^ k ( i ) μ 2 ( C k ( i ) ) = 1 ( C k ( i ) min { C k ( i ) } i = 1 N s + δ ) β ,   δ , β > 0 ,   normalized   to   π k ( i ) ,
 7.     x ^ k = x k m e a n = i = 1 N s π k ( i ) x k ( i ) ,   i = 1 , , N s

3.2. Cognitive Cost-Reference Particle Filtering

Although the CRPF approach can be used in the dynamic system to deal with the uncertainty problem, by controlling the risk and cost and adapting the variance of the noise, it is also limited by the system performance, e.g., the waveform parameter, process noise, SNR, etc. If the iteration can be performed in another dimension, namely in the varying information entropy caused by the varying parameters, through perceiving the exterior environment, the estimation error may be decreased further. To address this challenge, we propose a cognitive cost-reference particle filter algorithm to perform the robust adaptive filter for target tracking.

3.2.1. Cost Function Design

Considering the more complicated feedback in cognitive CRPF (CCRPF), it seems not easy to achieve the closed-form solution of the CRLB even iteratively [46]. Thus, we use the MC approach to approximate the BIM.
Breaking through the limitation of the traditional CRPF method in Equation (25) that only minimizes the cost of particles in a one-time step, we are dedicated to designing a new cost function based on parameter adaption in cognitive radar framework. The processor cost function is defined with a recursive additive structure as follows:
C ( x 0 : k | z 1 : k , λ ; θ k ) = λ C ^ k + Δ C k + 1 ( θ k ) = i = 0 k λ ( k i ) Δ C ( x k | z k ; θ k ) .
Lemma 1.
Let μ k , M be the unbiased estimation of mean value μ k , let σ k ( i ) ( θ k ) be the variance, and let Δ C ( x k | y k ; θ k ) be a cost function.
If the two following conditions are met:
(1) The σ ^ k ( i ) ( θ k ) is asymptotically to the σ k ( i ) ( θ k ) , that is
lim k | σ k ( i ) ( θ k ) σ ^ k ( i ) ( θ k ) | = 0 ( i . p . )
where i.p. stands for “in probability”.
(2) The mean incremental cost Δ C k ¯ = i = 1 M ω k ( i ) Δ C ( x k ( i ) | y k ; θ k ) converges to the minimal incremental cost
lim M | Δ C ( x k o p t | y k ; θ k ) Δ C k ¯ | = 0 ( i . p . ) ,
and then μ k , M is asymptotically to μ k , so namely the error satisfies
lim k | μ k , M μ k | = 0 ( i . p . ) .
See Appendix A for a proof.
It can be seen from Equation (26) that the cost function changes with the waveform parameters. According to Lemma 1, we can achieve the asymptotically optimal sequence of state vectors by using the MC approach. The loss function is defined with respect to the cost function of the processor and sensor, modeled by:
L C , Θ ( θ k | z 1 : k 1 ; Θ k 1 ) L { C k + 1 ( θ k | z 1 : k 1 ; Θ k 1 ) , T Θ ( θ k ) } ,
where P k is the waveform library at time k. The loss function should be modeled to balance the predicted conditional Bayes risk/cost and measurement cost. Thus, we define it to be the sum of them, that is,
L C , Θ ( θ k | z 1 : k 1 ; Θ k 1 ) C k + 1 ( θ k | z 1 : k 1 ; Θ k 1 ) + T Θ ( θ k ) .
The next θ k * is chosen to minimize the loss function, and the choice of the optimal parameter is equivalent to the optimization problem given by:
θ k * = arg min θ k P C k + 1 ( θ k | z 1 : k 1 ; Θ k 1 )   s.t.   θ k Θ .
Standard PF uses the statistical reference, which gives the particles the equal weighting after resampling. CRPF uses the cost reference, which preserves the particle cost after resampling to shift the random grid representation of the cost function toward its local minima. The proposed method is to shift the cost function toward another dimension, namely the vector space effected by the waveform library, to further minimize the cost signal estimates.

3.2.2. Sequential Algorithm and Design Issue

According to Equation (21) and the definition of the risk function, the risk function R ( x k | z k + 1 ; θ k ) is a prediction of the cost increment Δ C ( x k + 1 | z k + 1 ; θ k ) , so R ( ) and Δ C ( ) are related closely, which can be chosen simply as:
Δ C ( x k | z k ; θ k ) = z k f y ( x k ; θ k ) q ,
R ( x k | z k + 1 ; θ k ) = z k + 1 f y ( f x ( x k ; θ k ) ) q .
After resampling, particles are drawn according to the propagation density, in which we add the waveform parameter θ k because of the cognitive framework, denoted as:
x k + 1 ( i ) p k + 1 ( x k + 1 | x ^ k ( i ) ; θ k ) .
The expected E [ ] with respect to the PDF p ( ) is used as the constraint of Equation (35), denoted as:
E p k + 1 ( x k + 1 | x k ; θ k ) [ x k + 1 ] = f x ( x k ; θ k ) .
Therefore, the particles propagate randomly according to the dynamic model, which changes with θ k , that is, the conditional pdf p k + 1 ( x k + 1 | x k ) still can be arbitrary, but the range of the constraint will be changed by the parameters in another dimension. A zero-mean Gaussian pdf with adaptive variance is used for the particle propagation, denoted as:
x k + 1 ( i ) N ( f x ( x ^ k 1 ( i ) ; θ k ) , σ k 2 , ( i ) I L x ) ,
where σ k 2 , ( i ) is the variance, and I L x is the L x × L x identity function. One choice of adaptive selection for σ k 2 , ( i ) is to compute it by the recursive model.
σ k 2 , ( i ) ( θ k ) = t 1 t σ k 1 2 , ( i ) + x k ( i ) f x ( x ^ k 1 ( i ) ; θ k ) 2 k L x .
In terms of the cost function:
C k + 1 ( i ) ( θ k ) = λ C ^ k ( i ) + Δ C k + 1 ( i ) ( θ k ) ,
where Δ C k + 1 ( i ) is also affected by the newly updated θ k , shown as:
Δ C k + 1 ( i ) ( θ k ) = Δ C ( x k + 1 ( i ) | z k + 1 ; θ k ) .
The next transmitted waveform is selected. The function μ in Equation (41) is selected to assign large probability masses to lower-cost particles. When we add θ k to the cost function, the probability masses will be re-assigned as follows:
π ^ k ( i ) μ 2 { C k ( i ) ( θ k ) } = 1 ( C k ( i ) ( θ k ) min p { C k ( p ) ( θ k ) } + δ ) β ,
where 0 < δ < 1 , β > 1 . The current state estimation and its prediction covariance matrix is updated by:
i 0 = arg max π ^ k ( i ) , x ^ k min = x ^ k ( i 0 ) .
We can obtain the optimal sequence of state vectors and the pointwise solution, denoted as:
x ^ k m e a n = i = 1 N s π ^ k ( i ) x ^ k ( i ) .
The cognitive structure is combined with the cost-reference function couple, considering the internal and external mechanisms to form the complete CCRPF algorithm. The implementation details are given in Algorithm 3. This can be reduced to the CRPF if using a fixed waveform.
Algorithm 3. CCRPF algorithm for maneuvering target tracking problem.
Initialization, ( for   i = 1 , , N s ) ,  
 1.     generate   x 0 ( i ) p 0 ( x 0 ) ,   assign   the   cos t   C 0 ( i ) = 0 ,   and   initialize   σ 0 2 , ( i ) .
PMF Update
 2.    Start with the initial waveform parameter θ . For each θ compute:
 3.       { x ^ k 1 ( i ) , ω k 1 ( i ) } = { x k 1 ( i ) , ω k 1 ( i ) } ,   i = 1 , , N s ,   z k = f y ( x k , w k ; θ )
 4.       R k ( i ) ( θ ) = λ C k 1 ( i ) + z k f y ( f x ( x k 1 ( i ) ) ) q ,   q = 1 , 2 ;   i = 1 , , N s ,   π k ( i ) μ ( R k ( i ) ) = 1 ( R k ( i ) min { R k ( i ) } i = 1 N s + δ ) β
 5.       Resampling   x ^ k 1 = { x ^ k 1 ( i ) , C ^ k 1 ( i ) } i = 1 N s   according   to   π k ( i )
Particle Propagation and Waveform Selection
 6.       x k ( i ) p k ( x k | x ^ k 1 ( i ) ) ,   compute   the   cost   C k ( i ) ( θ ) = λ C k 1 ( i ) + z k f y ( x k ( i ) ) q
 7.       Compute   the   cos t   function   θ k * = arg min θ C Θ ( i ) ( θ k | z 1 : k 1 ; Θ k 1 ) + C Θ ( θ k )
 8.     Select   the   optimal   waveform   parameter   θ k = θ k *
Particle and Measurement Recursive Update
 9.       x ^ k ( i ) = x ^ k ( i ) ,   z k = f y ( x k , w k ; θ k ) ,   C k ( i ) = C k ( i ) ( θ k )
 10.       σ k 2 , ( i ) ( θ k ) = { σ k 1 2 , ( i ) t 10 k 1 k σ k 1 2 , ( i ) + x k ( i ) f x ( x ^ k 1 ( i ) ) 2 k × dim [ x ] t > 10 , i = 1, …, Ns
Information Update and State Estimation
 11.     π ^ k ( i ) μ 2 ( C k ( i ) ; θ k ) = 1 ( C k ( i ) min { C k ( i ) } i = 1 N s + δ ) β ,   where   α , β > 0 .   Normalize   the   PMF .
 12.       x ^ k = x ^ k m e a n = i = 1 N s π ^ k ( i ) x ^ k ( i ) .   Save   the   x ^ k   and   P ^ k ( θ k ) .

3.3. Convergence of CCRPF Algorithm

Due to the change of waveform parameter, resulting in the change of the measurement noise, convergence results regarding CRPFs may be not valid for the proposed algorithm. We assess the convergence. More details about the preliminary definitions and the derivation of the conditions are referred to in [26].
Lemma 2.
Let Δ C ( x k | y k ) be a cost function. With a fixed waveform parameter θ , if the three following conditions are met:
(1) The set function μ k ( A { x k ( i ) } i = 1 M ) = x A μ ( Δ C ( x k | y k ) ) satisfies
lim M Pr [ 1 μ k ( S M ( x k o p t ( θ ) , ε ) ) μ k ( { x k ( i ) ( θ ) } i = 1 M ) δ ] = 0 δ > 0 ,
where Pr [ ] denotes probability, { x k ( i ) } i = 1 M is a set of particles drawn at time step k , and S M { x k o p t ( θ ) , ε } = { x { x k ( i ) } i = 1 M : x x k o p t < ε ; θ } .
(2) The mean incremental cost Δ C k ¯ = i = 1 M ω k ( i ) Δ C ( x k ( i ) | y k ) converges to the minimal incremental cost
lim M | Δ C ( x k o p t | y k ; θ ) Δ C k ¯ | = 0 ( i . p . ) .
(3) The mean cost estimate is asymptotically optimal,
lim M | Δ C ( x ˜ k m e a n | y k ; θ ) Δ C ( x k o p t | y k ; θ ) | = 0 ( i . p . ) ,
where x ˜ k m e a n = i = 1 M π k ( i ) x k ( i ) , then with the adaptive parameter θ k , the set function satisfies:
lim M [ μ k ( S M ( x k o p t ( θ k ) , ε ) ) / μ k ( { x k ( i ) ( θ k ) } i = 1 M ) ] = 1 ( i . p . ) ,
and the mean cost estimate is asymptotically optimal, that is:
lim M | Δ C ( x ˜ k m e a n | y k ; θ k ) Δ C ( x k o p t | y k ; θ k ) | = 0 ( i . p . ) .
See Appendix B for a proof.

4. Numerical Results and Discussion

4.1. Dynamic Model

The evolution of the target state and the corresponding measurements are described by a known discrete-time stochastic model separately [47]. Consider the target moving in the x y plane according to the model:
x k + 1 = F k x k + v k ,
where [ x 1 , k , x 2 , k , x 3 , k , x 4 , k ] T  denotes the state vector Xk of the target, ( x 1 , k x 2 , k ) , and ( x 3 , k , x 4 , k ) denotes the position and the velocity, respectively. v k = [ 0   0   v x   v y ] T is the system noise. We assume that the sampling period is short enough for the velocity to be a constant during the period, so the state transition matrix is defined as:
F k = [ F 11 F 12 0 F 22 ] , F 11 = F 22 = [ 1   0 0   1 ] , F 12 = [ T   0 0   T ] .
The measurement equation is considered to be highly nonlinear, and as described as follows, z k = [ r k , r ˙ k , β k ] T is selected as the observable vector, and the observation matrix is denoted as:
z k = [ r k ,   r ˙ k ,   β k ] T + w k ,
where the range r k , range-rate r ˙ k , and bearing β k compose the observable vector, that is:
r k = ( x 1 , k 2 + x 2 , k 2 ) 1 / 2 , r ˙ k = ( x 3 , k 2 + x 4 , k 2 ) 1 / 2 , β k = arctan ( x 2 , k / x 1 , k ) .
Select Equation (15) to be the model of R k . The fixed-waveform parameter is set as λ 0 = 50 × 10 6 s , b 0 = 60 × 10 9 yad / s 2 , and the initial SNR is set as η 0 = 16 , so R 0 = R ( λ 0 , b 0 , η 0 ) . The initial true state of the target is [ 10 ,   5 ,   0.2 ,   0.2 ] T , and the given initial state is set as [ 6   2   0   0 ] T , σ 0 2 , ( i ) = [ 1.5   10   0.1   1.5 ] T . The sampling period is T = 1   s . The generally used heavy-tailed distributions include Laplace distribution, t distribution, uniform distribution, and Gaussian distribution with large variance. Considering that the mixture Gaussian is applied, the sum of the different weighted Gaussian noises with different parameters can be used as the modeling of flicker noise w k ( θ k 1 ) . The pdf of flicker noise can be denoted as:
v x , y 0.1 N ( 0 , 1 ) + 4 × 10 4 N ( 0 , 1 ) + 10 6 N ( 0 , 1 ) ,
w r 0.1 N ( 0 , σ r ) + 0.04 × 10 4 N ( 0 , σ r ) + 2.5 × 10 5 N ( 0 , σ r ) ,
w β 0.1 N ( 0 , σ β ) + 4 × 10 4 N ( 0 , σ β ) + 1.6 × 10 7 N ( 0 , σ β ) ,
where N ( μ i , Σ i ) denotes the Gaussian distribution with the mean of μ i and the variance of Σ i . The particle number is 400.
For a particular scenario and parameters, the overall performance of a filter is evaluated using the metric that is the ARMSE. We use it to estimate the tracking accuracy of the algorithms, and it is defined as:
ARMSE = 1 N m K i = 1 N m k = 1 K [ ( x ^ 1 , k i x 1 , k ) 2 + ( x ^ 3 , k i x 3 , k ) 2 ] ,
where K is the total number of time steps and N m is the total number of independent MC runs. Let ( x 1 , k , x 3 , k ) and ( x ^ 1 , k i , x ^ 3 , k i ) denote the true and the estimated target position at time k at the i - t h MC run, respectively. Then, the RMSE of the estimated states at k can be computed as [48]:
RMSE ( k ) = 1 N m i = 1 N m [ ( x ^ 1 , k i x 1 , k ) 2 + ( x ^ 2 , k i x 2 , k ) 2 ] .
The upper and lower limits of the waveform parameters are determined according to the transmitter specifications, and the waveform library can be obtained as:
P = { λ [ λ min : Δ λ : λ max ] , b [ b min : Δ b : b max ] } ,
where Δλ denotes the step-size of the envelope duration, and Δb denotes the step-size of the chirp rate.

4.2. Selection of Function μ and Parameters δ

Before implementing the simulation experiments, we check the behavior of the proposed method with different function μ and different values of tuning parameter δ in Equations (33), (34) and (41), to select suitable μ and δ .
The selection of the tuning parameters is investigated in Figure 3. Figure 3a shows four cases of mean absolute deviation after the tracking, with varying values set of μ and q . The performance in the case ( μ 1 ,   q = 1 ) seems well at beginning but shows a rapid degradation soon. The performances of the other cases are similar, wherein case ( μ 2 ,   q = 2 ) shows a slightly better overall performance. Three cases with varying values of δ are considered in Figure 3b. It is clear that a small δ may result in a worse improvement. Therefore, it is reasonable to select μ 2 ,   q = 2 ,   δ = 1 for the rest of the simulations according to the analysis and conclusion.

4.3. Simulation Results

4.3.1. Scenario 1: Unknown Statistics in General Environment

We assume that the process noise and measurement noise is independent and temporally white in this case, with the mixture Gaussian pdf as Equations (53)–(55) show, but the given methods are mismatched with the dynamic systems and models.
We apply SPF, CRPF, and CCRPF for target tracking in a two-dimensional space for performance assessment. Figure 4 displays the true path against the tracks of these filters. The other two tracks the target with an error smaller than the SPF, wherein the CCRPF outperforms CRPF slightly. Note how the trajectory of CCRPF deviates from the true path slightly due to the cost function and cognitive structure. EKF does not show the performance of robustness, and we do not show the result here.
All error curves corresponding to the above three filters were obtained by simulation runs. The results for the estimation error of the range and range-rate are shown in Figure 5a,b respectively. Observe from Figure 5 that the SPF introduces large bias in the estimation. CRPF and CCRPF are unbiased throughout the observation period. As for the comparison of CRPF and CCRPF, the overall performance of CCRPF is considered better than that of CRPF. Particularly, the CCRPF estimation is approaching zero bias except for the very beginning stage, indicating a fairly robust performance.
In term of the convergence speed, the convergence of CRPF and CCRPF shows more rapidly than SPF and keeps stable; thus, they are more robust than SPF in tracking performance. The CRPF converges the error asymptotically, while the proposed method achieves convergence in less than 10 time steps, due to the real-time perception of environment in the cognitive algorithm.
CRPF does not have a bad overall performance, but it provides higher peak errors than CCRPF since the cost reference mechanism can only use the fixed waveform, that is to say, it can only be adaptive in the constant environment conceived by the fixed waveform. However, cognitive radar based on CCRPF can not only perceive the dynamic environment by the dynamic waveform, but can also stay adaptive during the recursion after the waveform is chosen. Thus, CCRPF can suppresses the peak error on the fly and limits the overall magnitude of the error, showing the superiority.
A summary of these results and the detailed comparison are also tabulated in Table 1. The parameter of fixed waveform 1 is λ = 10 × 10 6 , b = 10 × 10 9 , and the parameter of fixed waveform 2 is λ = 50 × 10 6 , b = 60 × 10 9 . The first row of the table lists the metrics corresponding to the ranging performance by MC simulations with 40 independent runs, which is computed using Equation (56). The second row gives the performance on estimation of range-rate.
From this table, more concretely, we see that the SPF shows a degradation in performance, it has the lowest tracking accuracy of the compared methods. From the comparison of PFs using two different fixed waveforms, we find that PF with waveform 2 is better than that with waveform 1, due to the effects of the measurement noise obtained by different waveforms, and the same conclusion can be found in CRPFs. It is clear that the best filters for this case was CCRPF, which achieves an excellent performance of a 15% improvement over the SPF-2 and a 3% improvement over the CRPF-2.
The unknown environment has a huge influence on SPF for tracking. The mismatch of the noise is the main cause result in the performance degradation. CRPF is less affected since it takes advantage of the cost reference mechanism, so it has better tracking accuracy than SPF. The mastery of environmental information in time has an extra contribution in the proposed algorithm, which has more obvious performance improvement compared with others because of the robust module working cooperatively with the cognitive structure. This also indicates that it is a good supplement to the cognitive radar tracking approach.
To study how the cognition process evolves across time, we have plotted the waveform selection for both the chirp rate and the duration of the pulse envelope in Figure 6a,b. We observe that the transition of the chirp rate is switched from maximum up-sweep to maximum down-sweep.

4.3.2. Scenario 2: Unknown Statistics in Dynamic Noise Environment

Unknown and abrupt change of noise would be considered in this case. The main parameters are the same with Scenario 1. Measurement noise covariance R is known a priori, and the actual process noise covariance Q is set as:
{ Q 0 = Q , k = 1 , , 60 Q 0 = 80 × Q , k = 61 , , 80 .
Figure 7 shows the RMSE curves corresponding to the four filters. When k = 1 , , 60 with a priori known noise, all algorithms can track the target successfully. In the subsequent phase when Q is increasing from the time step 60 abruptly, although SPF could not be seen as a failure to track the target, its convergence effect and speed are all the worst. CRPF shows a similar convergence speed to CCRPF, because of the similar robust module, but its overall performance during the dynamic noise stage is obviously inferior to the latter. The proposed approach shows the superiority again, with the lowest error and fastest convergence speed.
A summary of the results and the detailed comparison are also tabulated in Table 2. We do not use a different waveform in SPF and CRPF anymore because similar work has already been done in Scenario 1.

5. Conclusions

We have concentrated on the tracking method of the cognitive radar based on particle filter. The CCRPF for cognitive radar is proposed as a significant step toward random dynamic systems with unknown statistics. In this work, the mathematical model of cognitive radar has been derived, CRLB and the corresponding cost function have been designed, the cognitive PF algorithm has been presented by completely reconstructing the propagation and update process, and the proposed algorithm has been developed by updating the cost function and noise variance. Moreover, the convergence of the approach has been proofed. The simulation results illustrate that the state error prediction is more adjacent to the CRLB, and the proposed method showed a good outperforming result over the existing methods in accuracy and robustness on highly nonlinear dynamic systems with unknown statistics or a complicated environment. The application of cognitive radar and the adaptation of a traditional filter have been expanded to a wider scope.
Future work: (1) PF is approximately globally optimal, while cognitive estimation is biased, so the closed-form of the optimal solution is not easy to obtain in the PF-based cognitive tracking method, and we can only use MC to get the solution or proof the convergence. However, the cost function in CRPF might have the optimal solution; thus, the cost function design is deserves to have further study. (2) Using novel SMC methods as the cognitive framework may have better prospects in dealing with dynamic system problems with unknown or uncertain statistics. (3) The lower limit and dynamic change of particle number could also be determined.

Author Contributions

The work presented here was carried out in collaboration between all authors. conceptualization, Y.L. and L.Z.; methodology, L.Z.; validation, L.Z.; formal analysis, L.Z.; funding acquisition, W.C.; investigation, W.C. and L.Z.; data curation, L.Z.; software, Y.Z.; writing—original draft, L.Z.; Writing—review and editing, W.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Fundamental Research Funds for the Central Universities under Grant 3102019ZX015.

Acknowledgments

The authors wish to thank Haobin Li in 54th Research Institute of China Electronics Technology Group Corporation and Zeyu Wang in Huawei Technologies Co., Ltd. for their support and helpful advice.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this paper:
PFParticle Filtering
MCMonte Carlo
PMCMCParticle Markov Chain Monte Carlo
SMCSequential Monte Carlo
SMC2Sequential Monte Carlo Square
SISSequential Importance Sampling
SIRSampling Importance Resampling
APFAdaptive PF
CRLBCramér–Rao Lower Bound
CPFCognitive PF
SPFStandard PF
MSEMean Square Error
PACPerception-action Cycle
BCRBBayesian Cramér–Rao lower bound
BIMBayesian Fisher Information Matrix
MMSEMinimum MSE
SNRSignal-to-noise Ratio
LFMLinear Frequency Modulation
RMSERoot Mean Square Error
PCRBPosterior Cramér–Rao Bound
CRPFCost-reference Particle Filter
WPSWeighted-particle Set
PMFProbability Mass Function
CCRPFCognitive Cost-reference Particle Filter
PDFProbability Density Function
ARMSEAverage Root Mean Square Error
FIMFisher Information Matrix
KLDKullback–Leibler Distance
GRNNGeneral Regression Neural Network
CKFCubature Kalman Filter
CD-CKFContinuous-Discrete Cubature Kalman Filter

Appendix A

In the MC integration, Equation (1) is as the expected value of f k ( x i ) , so it can be estimated as the sample mean:
μ k , M = 1 M i = 1 M f k ( x i ) .
Supposing μ k = E [ f k ( x ) ] , it can be proved that μ k , M is the unbiased estimation of μ k , and the non-zero bounded variance can be denoted as:
σ k 2 ( θ k ) = D ( f k ( x ) μ k ) 2 p ( x ) d x < + .
Given the confidence level 1 α , the error can be denoted as:
| μ k , M μ k | λ α σ ^ k ( θ k ) / M .
Owing to Condition (1), we can obtain the expression:
lim M | σ k ( θ k ) σ ^ k ( θ k ) | = 0 ( i . p . ) ,
lim M k | μ k , M μ k | = 0 ( i . p . ) .

Appendix B

When we choose the optimal parameter θ k , and from the preliminary definitions we can exploit that:
lim M E Pr [ n M ] [ n M ] = ( 1 γ ) lim M M ( i . p . ) ,
then the following relationship holds:
lim M E Pr [ n M ] [ n M ] μ k ( S M ( x k o p t , ε ; θ k ) )   = ( 1 γ ) lim M M μ k ( S M ( x k o p t , ε ; θ k ) )   ( 1 γ ) lim M M S i n = 0
Thus, we can write:
lim M Pr [ 1 μ k ( S M ( x k o p t , ε ; θ k ) ) μ k ( { x k ( i ) } i = 1 M ) δ ] 1 δ δ S o u t lim M E Pr [ n M ] [ n M ] μ k ( S M ( x k o p t , ε ; θ k ) ) = 0 ,
lim M [ μ k ( S M ( x k o p t ( θ k ) , ε ) ) / μ k ( { x k ( i ) ( θ k ) } i = 1 M ) ] = 1 ( i . p . ) .
The proof of asymptotically optimization of the mean cost estimate is carried out by following steps:
lim M | Δ C ( x ˜ k m e a n | y k ; θ k ) Δ C ( x k o p t | y k ; θ k ) |   lim M | sup x k S M ( x k o p t , ε )   Δ C ( x k | y k ; θ ) Δ C ( x k o p t | y k ; θ ) |   = B ( ε = 1 / M ) = 0

References

  1. Avila, D.; Alvarez, E.; Abusleme, A. Noise analysis in pulse-processing discrete-time filters. IEEE Trans. Nucl. Sci. 2013, 60, 4634–4640. [Google Scholar] [CrossRef]
  2. Doucet, A.; Godsill, S.; Andrieu, C. On sequential Monte Carlo sampling methods for Bayesian filtering. Stat. Comput. 2000, 10, 197–208. [Google Scholar] [CrossRef]
  3. Gordon, N.J.; Salmond, D.J.; Smith, A.F.M. Novel approach to nonlinear/non-Gaussian Bayesian state estimation. In IEE Proceedings F on Radar and Signal Processing; Institution of Engineering and Technology (IET): London, UK, 1993; pp. 107–113. [Google Scholar]
  4. Míguez, J. Analysis of parallelizable resampling algorithms for particle filtering. Signal Process. 2007, 87, 3155–3174. [Google Scholar] [CrossRef]
  5. Rubin, D.B. The calculation of posterior distributions by data augmentation: Comment: A noniterative sampling/importance resampling alternative to the data augmentation algorithm for creating a few imputations when fractions of missing information are modest: The SIR algorithm. J. Am. Stat. Assoc. 1987, 82, 543–546. [Google Scholar]
  6. Julier, S.J.; Uhlmann, J.K. Unscented filtering and nonlinear estimation. Proc. IEEE 2004, 92, 401–422. [Google Scholar] [CrossRef] [Green Version]
  7. Arulampalam, M.S.; Maskell, S.; Gordon, N.; Clapp, T. A Tutorial on Particle Filters for Online Nonlinear/Non-Gaussian Bayesian Tracking. IEEE Trans. Signal Process. 2002, 50, 174–188. [Google Scholar] [CrossRef] [Green Version]
  8. Vaswani, N. Particle Filtering for Large-Dimensional State Spaces with Multimodal Observation Likelihoods. IEEE Trans. Signal Process. 2008, 56, 4583–4597. [Google Scholar] [CrossRef] [Green Version]
  9. Havangi, R. Target tracking based on improved unscented particle filter with Markov chain Monte Carlo. IETE J. Res. 2018, 64, 873–885. [Google Scholar] [CrossRef]
  10. Whiteley, N.; Singh, S.; Godsill, S. Auxiliary Particle Implementation of Probability Hypothesis Density Filter. IEEE Trans. Aerosp. Electron. Syst. 2010, 46, 1437–1454. [Google Scholar] [CrossRef] [Green Version]
  11. Fox, D. Adapting the Sample Size in Particle Filters Through KLD-Sampling. Int. J. Robot. Res. 2003, 22, 985–1003. [Google Scholar] [CrossRef]
  12. Scharcanski, J.; de Oliveira, A.B.; Cavalcanti, P.G.; Yari, Y. A particle-filtering approach for vehicular tracking adaptive to occlusions. IEEE Trans. Veh. Technol. 2010, 60, 381–389. [Google Scholar] [CrossRef]
  13. Sanguino, T.D.J.M.; Gómez, F.P. Toward simple strategy for optimal tracking and localization of robots with adaptive particle filtering. IEEE/ASME Trans. Mechatron. 2016, 21, 2793–2804. [Google Scholar] [CrossRef]
  14. Zhou, T.; Peng, D.; Xu, C.; Zhang, W.; Shen, J. An adaptive particle filter based on kullback-leibler distance for underwater terrain aided navigation with multi-beam sonar. IET Radar Sonar Navig. 2018, 12, 433–441. [Google Scholar] [CrossRef]
  15. Wei, W.; Gao, S.; Zhong, Y.; Gu, C.; Hu, G. Adaptive Square-Root Unscented Particle Filtering Algorithm for Dynamic Navigation. Sensors 2018, 18, 2337. [Google Scholar] [CrossRef] [Green Version]
  16. Fox, D. KLD-sampling: Adaptive particle filters. In Proceedings of the 14th Conference on Advances in Neural Information Processing Systems (NIPS), Vancouver, BC, Canada, 3–8 December 2002; pp. 713–720. [Google Scholar]
  17. Soto, A. Self adaptive particle filter. In Proceedings of the 19th International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, 30 July–5 August 2005; pp. 1398–1403. [Google Scholar]
  18. Andrieu, C.; Doucet, A.; Holenstein, R. Particle markov chain monte carlo methods. J. R. Stat. Soc. Ser. B (Stat. Methodol.) 2010, 72, 269–342. [Google Scholar] [CrossRef] [Green Version]
  19. Chopin, N.; Jacob, P.E.; Papaspiliopoulos, O. SMC2: An efficient algorithm for sequential analysis of state space models. J. R. Stat. Soc. Ser. B (Stat. Methodol.) 2013, 75, 397–426. [Google Scholar] [CrossRef] [Green Version]
  20. Crisan, D.; Miguez, J. Nested particle filters for online parameter estimation in discrete-time state-space Markov models. Bernoulli 2018, 24, 3039–3086. [Google Scholar] [CrossRef]
  21. Martino, L.; Read, J.; Elvira, V.; Louzada, F. Cooperative parallel particle filters for online model selection and applications to urban mobility. Digit. Signal Process. 2017, 60, 172–185. [Google Scholar] [CrossRef] [Green Version]
  22. Carvalho, C.M.; Johannes, M.S.; Lopes, H.F.; Polson, N.G. Particle learning and smoothing. Stat. Sci. 2010, 25, 88–106. [Google Scholar] [CrossRef] [Green Version]
  23. Drovandi, C.C.; McGree, J.M.; Pettitt, A.N. A sequential Monte Carlo algorithm to incorporate model uncertainty in Bayesian sequential design. J. Comput. Graph. Stat. 2014, 23, 3–24. [Google Scholar] [CrossRef] [Green Version]
  24. Urteaga, I.; Bugallo, M.F.; Djurić, P.M. Sequential Monte Carlo methods under model uncertainty. In Proceedings of the 2016 IEEE Statistical Signal Processing Workshop (SSP), Palma de Mallorca, Spain, 26–29 June 2016; pp. 1–5. [Google Scholar]
  25. Stateczny, A.; Kazimierski, W. A comparison of the target tracking in marine navigational radars by means of GRNN filter and numerical filter. In Proceedings of the IEEE Radar Conference, Rome, Italy, 26–30 May 2008; pp. 1–4. [Google Scholar]
  26. Míguez, J.; Bugallo, M.F.; Djuric, P.M. A New Class of Particle Filters for Random Dynamic Systems with Unknown Statistics. EURASIP J. Adv. Signal Process. 2004, 2004, 303619. [Google Scholar] [CrossRef] [Green Version]
  27. Djuric, P.M.; Bugallo, M.F. Cost-reference particle filtering for dynamic with nonlinear and conditionally linear states. In Proceedings of the Nonlinear Statistical Signal Processing Workshop, Cambridge, UK, 13–15 September 2006; pp. 183–188. [Google Scholar]
  28. Yu, Y.H. Combining H∞ filter and cost-reference particle filter for conditionally linear dynamic systems in unknown non-Gaussian noises. Signal Process. 2013, 93, 1871–1878. [Google Scholar] [CrossRef]
  29. Lim, J. Particle filtering for nonlinear dynamic state systems with unknown noise statistics. Nonlinear Dyn. 2014, 78, 1369–1388. [Google Scholar] [CrossRef]
  30. Míguez, J. Analysis of selection methods for cost-reference particle filtering with applications to maneuvering target tracking and dynamic optimization. Digit. Signal Process. 2007, 17, 787–807. [Google Scholar] [CrossRef]
  31. Bugallo, M.F.; Maiz, C.S.; Miguez, J.; Djuric, P.M. Cost-Reference Particle Filters and Fusion of Information. In Proceedings of the IEEE 13th Digital Signal Processing Workshop & 5th IEEE Signal Processing Education Workshop, Marco Island, FL, USA, 4–7 January 2009; pp. 286–291. [Google Scholar]
  32. Haykin, S. Cognitive radar: A way of the future. IEEE Signal Process. Mag. 2006, 23, 30–40. [Google Scholar] [CrossRef]
  33. Bell, K.L.; Johnson, J.T.; Smith, G.E.; Baker, C.J.; Rangaswamy, M. Cognitive radar for target tracking using a software defined radar system. In Proceedings of the IEEE Radar Conference (RadarCon), Arlington, VA, USA, 10–15 May 2015; pp. 1394–1399. [Google Scholar]
  34. Smith, G.E.; Cammenga, Z.; Mitchell, A.; Bell, K.L.; Johnson, J.; Rangaswamy, M.; Baker, C. Experiments with cognitive radar. IEEE Aerosp. Electron. Syst. Mag. 2016, 31, 34–46. [Google Scholar] [CrossRef]
  35. Xue, Y. Cognitive Radar: Theory and Simulations. Ph.D. Thesis, McMaster University, Hamilton, ON, Canada, 2010. [Google Scholar]
  36. Wang, J.; Qin, Y.; Wang, H.; Li, X. Dynamic waveform selection for manoeuvering target tracking in clutter. IET Radar Sonar Navig. 2013, 7, 815–825. [Google Scholar] [CrossRef]
  37. Wang, S.; Bi, D.; Ruan, H.; Chen, S. Cognitive structure adaptive particle filter for radar manoeuvring target tracking. IET Radar Sonar Navig. 2018, 13, 23–30. [Google Scholar] [CrossRef]
  38. Ristic, B.; Arulampalam, S.; Gordon, N. A Tutorial on Particle Filters. In Beyond the Kalman Filter: Particle Filters for Tracking Applications; Artech House: Boston, MA, USA; London, UK, 2004; Chapter 3, Section 2; pp. 37–41. [Google Scholar]
  39. Guerci, J.R. Introduction. In Cognitive Radar: The Knowledge-Aided Fully Adaptive Approach; Artech House: Norwood, MA, USA, 2010; Chapter 1, Section 2; pp. 14–16. [Google Scholar]
  40. Bell, K.L.; Baker, C.J.; Smith, G.E.; Johnson, J.T.; Rangaswamy, M. Cognitive radar framework for target detection and tracking. IEEE J. Sel. Top. Signal Process. 2015, 9, 1427–1439. [Google Scholar] [CrossRef]
  41. Kershaw, D.J.; Evans, R.J. Optimal waveform selection for tracking systems. IEEE Trans. Inf. Theory 1994, 40, 1536–1550. [Google Scholar] [CrossRef]
  42. Van Trees, H.L. Estimation of the Parameters of a Random Process. In Detection, Estimation, and Modulation Theory, Part III: Radar-Sonar Signal Processing and Gaussian Signals in Noise; Wiley-Interscience: New York, NY, USA, 2001; Chapter 6, Section 3; pp. 177–184. [Google Scholar]
  43. Tichavsky, P.; Muravchik, C.H.; Nehorai, A. Posterior Cramer-Rao bounds for discrete-time nonlinear filtering. IEEE Trans. Signal Process. 1998, 46, 1386–1396. [Google Scholar] [CrossRef] [Green Version]
  44. Haykin, S. Kalman Filters. In Adaptive Filter Theory, 5th ed.; Pearson Education: Essex, UK, 2014; Chapter 14, Section 5; pp. 573–575. [Google Scholar]
  45. Kershaw, D.; Evans, R. Waveform selective probabilistic data association. IEEE Trans. Aerosp. Electron. Syst. 1997, 33, 1180–1188. [Google Scholar] [CrossRef]
  46. Andrieu, C.; Freitas, J.F.G.; Doucet, A. Sequential Bayesian Estimation and Model Selection Applied to Neural Networks; Technical Report CUED/F-INFENG/TR 341; Cambridge University Engineering Department: Cambridge, UK, 1999. [Google Scholar]
  47. Challa, S.; Morelande, M.; Musicki, D.; Evans, R. Maneuvering object tracking. In Fundamentals of Object Tracking; Cambridge University Press: Cambridge, UK, 2011; Chapter 3, Section 2; pp. 66–72. [Google Scholar]
  48. Li, X.R.; Jilkov, V.P. Survey of maneuvering target tracking. Part I. Dynamic models. IEEE Trans. Aerosp. Electron. Syst. 2004, 39, 1333–1364. [Google Scholar]
Figure 1. Cognitive radar system for target tracking.
Figure 1. Cognitive radar system for target tracking.
Sensors 20 03669 g001
Figure 2. Mean square error (MSE) and posterior Cramér–Rao bound (PCRB): (a) Comparison of PCRB and MSE; (b) Comparison of PCRB with fixed waveform and dynamic waveform, position x k .
Figure 2. Mean square error (MSE) and posterior Cramér–Rao bound (PCRB): (a) Comparison of PCRB and MSE; (b) Comparison of PCRB with fixed waveform and dynamic waveform, position x k .
Sensors 20 03669 g002
Figure 3. Selection of functions and parameters: (a) Mean absolute deviation for different μ functions; (b) Mean absolute deviation for different δ values.
Figure 3. Selection of functions and parameters: (a) Mean absolute deviation for different μ functions; (b) Mean absolute deviation for different δ values.
Sensors 20 03669 g003
Figure 4. Tracking example.
Figure 4. Tracking example.
Sensors 20 03669 g004
Figure 5. Comparison of root mean square error (RMSE) between cognitive CRPF (CCRPF) with PF and CRPF: (a) estimated position; (b) Estimated velocity.
Figure 5. Comparison of root mean square error (RMSE) between cognitive CRPF (CCRPF) with PF and CRPF: (a) estimated position; (b) Estimated velocity.
Sensors 20 03669 g005
Figure 6. Waveform selection across time: (a) On chip rate; (b) On length of pulse envelope.
Figure 6. Waveform selection across time: (a) On chip rate; (b) On length of pulse envelope.
Sensors 20 03669 g006
Figure 7. RMSE of the estimated range.
Figure 7. RMSE of the estimated range.
Sensors 20 03669 g007
Table 1. Performance comparison of algorithms.
Table 1. Performance comparison of algorithms.
ARMSESPF-1 1SPF-2 2CRPF-1 3CRPF-2 4CCRPF
Range1.541.091.260.960.93
Range-rate0.370.240.170.160.14
1 SPF-1: SPF with fixed waveform 1, 2 SPF-2: SPF with fixed waveform 2, 3 CRPF-1: CRPF with fixed waveform 1, 4 CRPF-2: CRPF with fixed waveform 2.
Table 2. Performance comparison of algorithms.
Table 2. Performance comparison of algorithms.
ARMSESPFCRPFCCRPF
k 30 0.510.580.23
k > 30 1.490.200.19

Share and Cite

MDPI and ACS Style

Zhong, L.; Li, Y.; Cheng, W.; Zheng, Y. Cost-Reference Particle Filter for Cognitive Radar Tracking Systems with Unknown Statistics. Sensors 2020, 20, 3669. https://doi.org/10.3390/s20133669

AMA Style

Zhong L, Li Y, Cheng W, Zheng Y. Cost-Reference Particle Filter for Cognitive Radar Tracking Systems with Unknown Statistics. Sensors. 2020; 20(13):3669. https://doi.org/10.3390/s20133669

Chicago/Turabian Style

Zhong, Lei, Yong Li, Wei Cheng, and Yi Zheng. 2020. "Cost-Reference Particle Filter for Cognitive Radar Tracking Systems with Unknown Statistics" Sensors 20, no. 13: 3669. https://doi.org/10.3390/s20133669

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop