Cost-Reference Particle Filter for Cognitive Radar Tracking Systems with Unknown Statistics

Zhong, Lei; Li, Yong; Cheng, Wei; Zheng, Yi

doi:10.3390/s20133669

Open AccessArticle

Cost-Reference Particle Filter for Cognitive Radar Tracking Systems with Unknown Statistics

School of Electronics and Information, Northwestern Polytechnical University, Xi’an 710072, China

^*

Author to whom correspondence should be addressed.

Sensors 2020, 20(13), 3669; https://doi.org/10.3390/s20133669

Submission received: 23 May 2020 / Revised: 26 June 2020 / Accepted: 28 June 2020 / Published: 30 June 2020

(This article belongs to the Section Intelligent Sensors)

Download

Browse Figures

Versions Notes

Abstract

:

A novel robust particle filtering algorithm is proposed for updating both the waveform and noise parameter for tracking accuracy simultaneously and adaptively. The approach is a significant step for cognitive radar towards more robust tracking in random dynamic systems with unknown statistics. Meanwhile, as an intelligent sensor, it would be most desirable for cognitive radar to develop the application of a traditional filter to be adaptive and to expand the adaptation to a wider scope. In this paper, after analysis of the Bayesian bounds and the corresponding cost function design, we propose the cognitive radar tracking method based on a particle filter by completely reconstructing the propagation and the update process with a cognitive structure. Moreover, we develop the cost-reference particle filter based on optimizing the cost function design according to the complicated system or environment with unknown statistics. With this method, the update of the estimation cost and variance arrives at the approximate optimization, and the estimation error can be more adjacent to corresponding low bounds. Simulations about the tracking implementation in unknown noise are utilized to demonstrate the superiority of the proposed algorithm to the existing methods in traditional radar.

Keywords:

cognitive radar; particle filter; target tracking; bayesian bounds; nonlinear model

1. Introduction

1.1. Problem Statement

Cognitive radar can basically be defined as: an intelligent radar system of hardware and software in which the transmit and receive parameters (i.e., pulse length, pulse repetition frequency (PRF), modulation, power, frequency, and polarization) are selected, in real-time, and use adaptation between the information extracted from the sensor/processor and the design and transmission of subsequent waveforms, in response to the observed scene to optimize the performance of a given application. The problem of target tracking in cognitive radar system has received considerable attention. It is well known that most of the state-space dynamical systems are nonlinear or non-Gaussian. For example, the multi-state transition of a drone from hovering to maneuvering is often nonlinear, and the measurement noise is mainly flicker noise or heavy-tailed noise [1].

Despite the performance decline of extended Kalman filter (EKF) and unscented Kalman filter (UKF) in highly nonlinear problems, and that general analytical solutions are intractable in nonlinear or non-Gaussian systems, solutions continue to emerge from different viewpoints. However, most of the methods rely on the assumption that the noise has known statistics, or they require accurate mathematical representation of the dynamics of the system evolution; otherwise, it is almost impossible to directly approximate the true distribution. In practice, sometimes the assumption is in accordance with the actual situation, but sometimes not. The same situation occurs in cognitive radar systems. Without explicit mathematical models or a priori information, how cognitive radar can still exert its advantages is a concern of this article.

We briefly review various existing methods to cognitive radar tracking problem that involve particle filter (PF) in some relevant manner.

1.2. Related Works

For handling arbitrary nonlinear models and arbitrary noise distributions, methods based on Monte Carlo (MC) methodology have emerged in Bayesian estimation. MC is a simulation-based method aimed at estimating the posteriori pdf of the state given the observations. Markov Chain Monte Carlo (MCMC) and Sequential MC (SMC) are two main tools in it, and they can sample from high dimensional probability distributions. The performance of MCMC would be unreliable when the proposal distributions that are used to explore the space are poorly chosen and/or if highly correlated variables are updated independently. While, PF became popular for it is particularly suitable for real-time estimation.

The Sequential Importance Sampling (SIS) algorithm was used for the first time to solve the problem of nonlinear filtering [2]. The formal establishment of PF was attributed to the proposal of resampling technology [3,4]. Meanwhile, the idea of Sampling Importance Resampling (SIR) was discovered and developed [5]. PF is allowed to express a complete and precise state posterior distribution, so any statistical data such as mean, variance, and modulus can be easily calculated, and theoretically, the accuracy is higher than that of EKF and UKF in nonlinear systems [6]. Hence, despite its computational pressure, it is still quite attractive to us. During the development of PF, there are several inevitable drawbacks need to be addressed: (1) How to approximate the optimal proposal distribution further; (2) How to overcome the problems of weight degeneracy, sample impoverishment, etc., to make the resampling more effective; (3) How to make the algorithm efficient and online.

Therefore, more variants of PF were produced [7]. Efficient importance sampling techniques were studied in [8,9] for the first problem, but these algorithms require that the posterior distribution of the states are assumed as a priori known and can be approximated by a Gaussian distribution. An auxiliary variable particle filter was used to deal with the second problem [10], but the filtering performance degrades when the state noise is strong, and because the likelihood function and weight value need to be calculated twice for each particle, the calculation amount increases. Adaptive PF (APF) can release the computation burden by adjusting the number of particles dynamically [11,12,13,14,15]. This method chooses a small particle number if the density is focused on a small part of the state-space, and chooses a large number if the state uncertainty is high [16]. KLD-APF is based on Kullback–Leibler (KL) information or KL distance (KLD) sampling, but it ignores any mismatch between the true and the proposal distribution [17]. When the mismatch happens, the addition of particle number will only increase the computational load. On the other hand, reducing the particle number may aggravate the sample impoverishment and further weaken the effect of resampling.

In addition to these drawbacks, PF would still be invalid when some issues are encountered. PF requires that the conditional pdf of the observed variable can be estimated, otherwise the weight of the particles cannot be calculated. Thus, particle MCMC (PMCMC) was proposed by combing SMC with MCMC as an efficient approach [18]. Based on PMCMC, the SMC² algorithm is motivated to tackle the intractable problem of probable increments in state-space models [19]. Similar to the SMC² scheme, nested particle filters for online parameter estimation were proposed but in a purely recursive manner, in order to address the problem of approximating the posterior probability distribution of the fixed parameters of a state-space dynamical system [20]. On robustness, cooperative parallel particle filters were designed for the dual purpose of Bayesian inference and on-Line Model Selection, with the online adaptation for the particle number [21]. Particle learning was proved in [22] to be outperforming existing PF alternatives and a competitor to MCMC. In the presence of model uncertainty where discrete data are encountered, a new SMC method was proposed for the filtering and prediction of time-varying signals [23]. A similar method with better predictive powers was proposed in [24], wherein the resampling step could be dynamically adjusted and the predictive powers could be updated sequentially as more data were observed. Moreover, what deserves attention is that Bayes is not the only choice, as a neural filter based on GRNN is also outperforming numerical filters in state vector estimation during dynamic changes of target movement parameters [25].

Inspired by the previous research, we might find a solution independent of the prior information through updating the proposal distribution and its stepwise approximation to the true distribution by iterative methods. The authors of [26] designed a cost function in PF to iteratively update the state and variance when the prior information is unknown. Naturally, it has been applied to many areas such as target tracking, autonomous vehicle positioning, sensor network, orthogonal frequency division multiplexing (OFDM) systems, wireless local area network (WLAN), and tilt estimation [27,28,29]. There are some improvements to the algorithm. In [30], a particle selection algorithm was proposed and analyzed for implementation with parallel computing devices and to circumvent the main drawback of the conventional resampling techniques. Authors of [31] melded random measures of two or more cost-reference particle filters to obtain a fused random measure that combines the information from the individual cost-reference particle filters. However, there is no further study on the optimization of the cost function, especially when the tracking structure can be adaptive. Cognitive radar designs the optimal estimator for Bayesian framework in [32], but the majority of the existing Bayesian tracking methods for cognitive radar applications are based on the Kalman filter for linear systems [33,34], and Kalman-like solutions, e.g., cubature KF (CKF) and continuous-discrete (CD)-CKF for nonlinear problems [35]. Few researches use PF. In [36], a particle filter combined with probabilistic data association is used as a tracker. In [37], a cognitive structure was designed as only a part of PF, namely, a parallel structure of PF, while EKF was used by waveform selection to adapt the particle number and reduce the computation cost.

1.3. Contributions and Organization of the Paper

When we specify PF as the nonlinear tracking method in a cognitive radar, we consider using the cognitive structure to expand the state-space of PF or its variants to another dimension where the waveform parameters can be changed from fixed to dynamic, and we can optimize the cost function design and corresponding lower bound of the estimation error. In turn, using the particle filter tracking method based on the optimal cost function can expand the scope of the application of cognitive radar. The main contributions of this paper will be presented by the following items:

(1) The cognitive radar tracking method based on PF is proposed and the mathematical model of which it is derived for the first time. Push the cognitive radar tracking framework from existing the Kalman-like to the developing SMC-like. Not only the data process but also the waveform design is in the PF iteration, that is, a fully cognitive PF.

(2) Refine the idea of cost-reference PF with cognitive framework. When the estimation of the parameters (i.e., signal-to-noise ratio (SNR)) in cognitive radar mismatches the real situation, a novel cognitive cost-reference particle filter algorithm is proposed to bring about robustness to the existing cognitive radar tracking methods and intelligence for the current adaptive method. The convergence is proofed mathematically.

(3) The Cramér–Rao Lower Bound (CRLB) of the proposed cognitive PF is derived, and the corresponding cost function is designed.

The rest of this paper is organized as follows: the standard PF, the mathematical model of cognitive radar, and the interface of PF to cognitive algorithm are presented, and the CRLB of the cognitive PF is derived in Section 2. The principle of the cost-reference particle filter is restated, the proposed cognitive scheme is presented along with the implementation steps, and the convergence of the solution is proofed in Section 3. Section 4 shows the dynamic model, the maneuvering target tracking in unknown non-Gaussian noise, and the simulation results. Section 5 presents the conclusions.

2. PF for Cognitive Radar Tracking

In this section, the cognitive tracking method based on PF is formed to address the non-linear and non-Gaussian state estimation problem in cognitive radar.

2.1. PF and Cognitive Radar Model

2.1.1. Standard PF

The basic idea is to generate a set of random samples in the state-space according to the empirical conditioning distribution of the system state vector. They are called particles, the weight and position of which are continuously adjusted according to the measurement, and the initial empirical conditioning distribution is modified. The MC estimate of the integral can be written as:

E [f (x)] = \int f (x) p (x) d x = \int f (x) [p (x) / q (x)] q (x) d x .

(1)

Let

ω (x_{}^{(i)}) = p (x_{}^{(i)}) / q (x_{}^{(i)})

denote the importance weights. Equation (1) is calculated by generating

N ≫ 1

independent samples

{x_{}^{(i)} : i = 1, \dots, N}

distributed according to

q (x)

. The weighted sum is formed and normalized as:

E_{N} [f (x)] = \frac{1}{N} \sum_{i = 1}^{N} f (x_{}^{(i)}) ω (x_{}^{(i)}),

(2)

E [f (x)] = \frac{\frac{1}{N} \sum_{i = 1}^{N} f (x_{}^{(i)}) ω (x_{}^{(i)})}{\frac{1}{N} \sum_{j = 1}^{N} ω (x_{}^{(j)})} = \sum_{i = 1}^{N} f (x_{}^{(i)}) \tilde{ω} (x_{}^{(i)}) .

(3)

The weight update equation can be shown to be:

ω (x_{k}^{(i)}) = ω (x_{k - 1}^{(i)}) \frac{p (z_{k}^{} | x_{k}^{(i)}) p (x_{k}^{(i)} | x_{k - 1}^{(i)})}{q (x_{k}^{(i)} | x_{k - 1}^{(i)})} .

(4)

Substitution of

q (x_{k}^{(i)} | x_{k - 1}^{(i)}) = p (x_{k}^{(i)} | x_{k - 1}^{(i)})

into Equation (4) yields:

ω (x_{k}^{(i)}) = ω (x_{k - 1}^{(i)}) p (z_{k}^{} | x_{k}^{(i)}) .

(5)

When

N \to \infty

, the posterior probability density can be approximated by the following:

p (x_{k}^{} | z_{1 : k}^{}) \approx \sum_{i = 1}^{N} ω (x_{k}^{(i)}) δ (x_{k}^{} - x_{k}^{(i)}),

(6)

where

δ (\cdot)

denotes the Dirac delta measure, with defining properties: (1)

δ (x_{k}^{} - x_{k}^{(i)}) = 0

for

x_{k}^{} \neq x_{k}^{(i)}

, (2)

δ (x_{k}^{} - x_{k}^{(i)}) = + \infty

for

x_{k}^{} = x_{k}^{(i)}

; and (3)

\int_{- \infty}^{\infty} δ (x_{k}^{} - x_{k}^{(i)}) d x_{k}^{} = 1

. More details about the SIR-PF can refer to [38]. It has been termed standard PF (SPF) in the simulation below. The accuracy approximates the optimal estimation.

2.1.2. Cognitive Radar System Model

The radar transmitter through the emission exploiting waveform selection stimulates the background with the goal to obtain a response from it such as a radar echo. The mentioned response is perceived by radar receiver, which plays the equivalent role of the human senses. The detector performs a low-level processing of received sensor data. The scene analyzer is conceived by exploiting previously acquired information to estimate the statistical characteristics of the operating environment. The tracker acts as a high-level processor.

The functions manager receives requests for radar activities (e.g., search, tracking) from the keyboard according to a specific mode of working or operator requirements. The functional interaction is with the radar hardware, the signal processor, the data processor, and the external peripherals. Starting from the state estimate

{\hat{x}}_{k - 1}^{}

and state error covariance at time step k − 1, the transmitter uses waveform parameter

θ_{k - 1}

to illuminate the environment. The receiver performs a prediction and obtains

{\hat{x}}_{k}^{(i)}

(i.e., Equation (49)) with the previous value and current measurement. Then,

{\hat{x}}_{k}^{(i)}

is utilized to evaluate the CRLB, and thus, the measurement error covariance matrix

R_{k} (θ)

. For each waveform of the library, the predicted error covariance matrix

P_{k}

is computed for evaluation (i.e., Equation (5) in Algorithm 1). At this point, a cost function (i.e., Equation (18)) is to be formulated, and the waveform at the next transmission is chosen as the one that minimizes the weighted mean square error (MSE) on the target state, namely minimizing the cost function by the optimal waveform design or parameters adjustment. The transmitted signal then provides a new optimized measurement

z_{k}

to feed the PF processor and track the target. The obtained target state estimate

{\hat{x}}_{k}^{}

is finally exploited to predict the next target state

{\hat{x}}_{k + 1}^{(i)}

, and optimizes the new transmitted waveform according to the closed-loop paradigm. This constructs the PAC in cognitive radar [39].

Summarizing, the block scheme of a cognitive tracking system architecture and the information-flow in the mathematical model is displayed in Figure 1. Notice that, if the blocks in red are cancelled, the rest blocks in blue degrade to the traditional radar system.

2.2. PF-Based Cognitive Tracking Algorithm

2.2.1. Bayesian Bounds for Cognitive Radar

The recursive Bayesian estimators, such as the EKF or PF, work in the non-linear system employing the feed-forward processing chain, and do not consider the feedback information from the receiver to the transmission of subsequent waveforms. So far, the corresponding recursive BCRB on the MSE matrix of the state vector was derived and presented in [40]. It can be obtained by calculating the Fisher information matrix (FIM) [41], the inverse of which is the unbiased estimator of the CRLB of the state estimation error covariance [42]. In this paper, the PF based on the cognitive structure characterized by feedback is analyzed, and the estimation is biased. Bayesian FIM (BIM) for this case can be developed and computed iteratively using this form [43]:

B_{k}^{↑} (θ_{k} | z_{1 : k - 1}; Θ_{k - 1}) = B_{k}^{-} (θ_{k} | z_{1 : k - 1}; Θ_{k - 1}) + J_{k}^{-} (θ_{k} | z_{1 : k - 1}; Θ_{k - 1}),

(7)

where

B_{k}^{-} (θ_{k} | z_{1 : k - 1}; Θ_{k - 1})

represents the contribution of the prior information is prior, called the prior term, and is denoted as

J_{P}

; the

J_{k}^{-} (θ_{k} | z_{1 : k - 1}^{}; Θ_{k - 1})

represents the contribution of the data, called data term and denoted as

J_{D}

.

J_{D} = E_{k} {J_{F} (x_{k}^{})} = E_{k} {- E_{z_{k}^{} | x_{k}^{}; θ_{k}} {\frac{\partial^{2} \ln f (z_{k}^{} | x_{k}^{}; θ_{k})}{\partial x_{k}^{2}}}},

(8)

J_{P} = - E_{x_{k}^{}} {\frac{\partial^{2} \ln f (x_{k}^{})}{\partial x_{k}^{2}}} .

(9)

The lower bound is defined as the inverse of the BIM, as the form:

B C R B = {[J_{D} + J_{P}]}^{- 1} .

(10)

The lower bound of CR would be smaller than the general BCRB, which can be proved as:

{[J_{D} (θ_{k}) + J_{P} (θ_{k})]}^{- 1} \leq {[J_{D} + J_{P}]}^{- 1} = B C R B .

(11)

2.2.2. Cost Function Design

As we know, CRLB is the possible MMSE of an unbiased estimation. It can be proved that minimizing the trace of the covariance is equal to the MMSE of the expectation of tracking state [38]; thus, we choose the minimum trace with respect to the waveform parameters as the processor cost function modeled by:

C_{Θ} (θ_{k} | z_{1 : k - 1}^{}; Θ_{k - 1}) = E_{x_{k} z_{k} | I_{k} θ_{k}} [{(x_{k} - {\hat{x}}_{k} (z_{k}^{}))}^{T} (x_{k} - {\hat{x}}_{k} (z_{k}^{}))] \approx tr {B_{k}^{↑} {(θ_{k} | z_{1 : k - 1}^{}; Θ_{k - 1})}^{- 1}},

(12)

where

tr (\cdot)

is an operator that extracts the trace of

B_{k}^{↑} {(θ_{k} | z_{1 : k - 1}^{}; Θ_{k - 1})}^{- 1}

, and the waveform parameters are denoted as

Θ \equiv {θ_{1}, θ_{2}, \dots, θ_{k}}

.

{\hat{x}}_{k + 1} (I_{k}, x_{k + 1}, z_{k + 1}; θ_{k})

is the posterior predictive value of the state estimation given waveform parameter. Since

{\hat{x}}_{k + 1}

is related to

z_{k + 1}

, which is a nonlinear mapping of

x_{k + 1}

, we use approximation rather than integration to solve this expectation.

C_{Θ} (θ_{k} | z_{1 : k - 1}^{}; Θ_{k - 1})

is approximated by

tr {B_{k}^{↑} {(θ_{k} | z_{1 : k - 1}^{}; Θ_{k - 1})}^{- 1}}

[33,35]. The model of

P_{k + 1 | k + 1}

in KF can refer to [44], while in PF,

P_{k + 1 | k + 1}

is modeled by a different form [38]. More details of the relation between

P_{k + 1 | k + 1}

and

R_{k} (θ_{k - 1})

are referred to in [35]. We derive the model of measurement noise covariance based on waveform selection as [45]:

R_{k} (θ_{k - 1}) = Γ B_{k}^{↑} {(θ_{k} | z_{1 : k - 1}^{}; Θ_{k - 1})}^{- 1} Γ = Γ U^{- 1} (θ_{k - 1}) Γ / η,

(13)

Γ ≜ diag [\frac{c}{2}, \frac{c}{4 π f_{c}}],

(14)

where

η = 2 E_{R} / N_{0}

is the SNR, and

N_{0}

denotes the spectral density of the complex noise envelope

\tilde{n} (k)

.

U (θ_{k - 1})

is a scaled version of FIM.

Γ

denotes a symmetric matrix defined as this form.

c

is the speed of waveform propagation, and

f_{c}

is the carrier frequency.

When the LFM signal with Gaussian amplitude modulation is selected as the transmit waveform, with the duration of Gaussian pulse λ and the chirp rate b chosen to form the parameter vector

θ_{k - 1} = [λ, b]

, the measurement noise covariance matrix can be modeled by

R_{k, r \dot{r}} (θ_{k - 1})

[41]:

R_{k} (θ_{k - 1}) = R_{k, r \dot{r}} (θ_{k - 1}) = [\begin{matrix} \frac{c^{2} λ^{2}}{2 η} & - \frac{c^{2} b λ^{2}}{2 π f_{c} η} \\ - \frac{c^{2} b λ^{2}}{2 π f_{c} η} & \frac{c^{2}}{{(2 π f_{c})}^{2} η} (\frac{1}{2 λ^{2}} + 2 b^{2} λ^{2}) \end{matrix}] .

(15)

The sensor cost function is defined as:

C_{Θ} (θ_{k}) = {\begin{cases} 0 & θ_{k} \in Θ \\ \infty & otherwise \end{cases} .

(16)

The loss function is defined with respective to the cost function of the processor and sensor, modeled by:

L_{C, Θ} (θ_{k} | z_{1 : k - 1}^{}; Θ_{k - 1}) \equiv C_{Θ} (θ_{k} | z_{1 : k - 1}^{}; Θ_{k - 1}) + C_{Θ} (θ_{k}) .

(17)

The choice of the optimal parameter is equivalent to the optimization problem given by:

θ_{k} = \underset{θ_{k}^{} \in P}{\arg \min} C_{Θ} (θ | z_{1 : k - 1}^{}; Θ_{k - 1}) s . t . θ \in Θ .

(18)

2.2.3. Algorithm and Numerical Simulation

We have restated the principle of the cognitive radar system, and have given the interface from PF to it to induce the human cognition mechanism into the PF algorithm. As with the statements before, SPF is available to be the tracking method of the cognitive radar for a tracking target with a nonlinear dynamic and non-Gaussian distribution. The basic algorithm steps of SPF along with the interface to cognitive tracking framework is given as follows Algorithm 1:

Algorithm 1. Cognitive Radar Tracking Recursion Based on PF

Initialization

1.

{\hat{x}}_{0}^{(i)} \sim p (x_{0})

,

ω_{k}^{(i)} = 1 / N_{s}

,

i = 1, \dots, N_{s}

Controller Optimization

2.

{\hat{x}}_{k}^{(i)} \sim q (x_{k}^{(i)} | {\hat{x}}_{k - 1}^{(i)}, z_{k})

, the mean

E_{p_{k} (x_{k}^{} | {\hat{x}}_{k - 1}^{(i)})} [{\hat{x}}_{k}^{}] = f_{x} ({\hat{x}}_{k - 1}^{(i)})

,

i = 1, \dots, N_{s}

3.

ω_{k}^{(i)} (θ) \propto p (z_{k} | {\hat{x}}_{k}^{(i)}) p ({\hat{x}}_{k}^{(i)} | x_{k - 1}^{(i)}) / q (x_{k}^{(i)} | x_{0 : k}^{(i)}, z_{1 : k})

, i = 1, \dots, N_{s}

, normalize {\tilde{ω}}_{k}^{(i)} (θ) = ω_{k} (x_{0 : k}^{(i)}) / \sum_{i = 1}^{N_{s}} ω_{k} (x_{0 : k}^{(i)})

4.

If N_{e f f} = 1 / \sum_{i = 1}^{N_{s}} {({\tilde{ω}}_{k}^{(i)})}^{2} < N_{t h} (empirical)

, then [{{\hat{x}}_{k}^{(i)}, {\hat{ω}}_{k}^{(i)} (θ)}_{i = 1}^{N_{s}}] = RESAMPLE [{x_{k}^{(i)}, {\tilde{ω}}_{k}^{(i)} (θ)}_{i = 1}^{N_{s}}]

5.

P_{k} (θ) = \sum_{i = 1}^{N_{s}} ω_{k}^{(i)} (θ) ({\hat{x}}_{k}^{(i)} - {\hat{x}}_{k}^{}) {({\hat{x}}_{k}^{(i)} - {\hat{x}}_{k}^{})}^{T}

6.

θ_{k}^{*} = \arg \min_{θ_{k}^{} \in P} [Tr ({\bar{P}}_{k + 1 | k + 1}^{} (θ_{k}))]

. Select the optimal θ_{k} = θ_{k}^{*}

Motion Update and Measurement

7.

f^{-} ({\hat{x}}_{k - 1}^{(i)}) = \int q (x_{k}^{(i)} | x_{k - 1}^{(i)}; θ_{k}) f ({\hat{x}}_{k - 1}^{(i)}) d x_{k - 1}^{}

, {\hat{x}}_{k - 1}^{(i)} = μ_{k}^{-} = E_{k}^{-} [{\hat{x}}_{k}^{}]

8.

{\hat{y}}_{k} = \arg \min_{y} \ln f (z_{k}^{} | y; θ_{k})

Information Update and State Estimation

9.

{{\hat{x}}_{k}^{(i)}, ω_{k}^{(i)}} = {x_{k}^{(i)}, {\tilde{ω}}_{k}^{(i)}; θ_{k}}

, i = 1, \dots, N_{s}

, {\hat{x}}_{k} \approx \sum_{i = 1}^{N_{s}} ω_{k}^{(i)} {\hat{x}}_{k}^{(i)}

As an illustration of the CRLB, consider the scenario that the target is assumed to move with a constant velocity along a straight line in the Cartesian plane. The target kinematic state (position, velocity)

{[x_{1, k}^{}, x_{2, k}^{}, x_{3, k}^{}, x_{4, k}^{}]}^{T}

is estimated from the noise-corrupted measurements

z_{k} = {[r_{k}, β_{k}]}^{T} + w_{k}

with

r_{k} = {(x_{1, k}^{2} + x_{2, k}^{2})}^{1 / 2}

,

β_{k} = \arctan (x_{2, k}^{} / x_{1, k}^{})

. Thus, the measurement equation is nonlinear. The measurement noise

w_{k}

is assumed to be white, zero-mean Gaussian, with covariance

R = d i a g [σ_{r}^{2}, σ_{β}^{2}]

, and with the sampling interval T = 0.1 s. The covariance of the process noise is

Q = σ_{x}^{2} \cdot d i a g [T_{0}, T_{0}]

, wherein

T_{0} = [0.5 T^{4} 0.5 T^{3}; 0.5 T^{3} T^{2}]

. The initial true state of the target is

{[4000, 80, 12, 000, - 20]}^{T}

, and the given initial state is set as

{[5000, 90, 15, 000, - 20]}^{T}

, with an estimation error covariance of

P_{0} = 1000 I

.

Figure 2 shows the results in terms of posterior Cramér–Rao bound (PCRB) and root mean square errors (RMSEs) in the range estimation. As observed in Figure 2a, when we perform the proposed CPF for target tracking in cognitive radar systems, the MSE is initially larger than PCRB, due to the initialization that is not exactly matched to PCRB. Very soon though, CPF demonstrates a fast convergence. Then, MSE agrees with the theoretical curve for PCRB, presenting a considerably well overall performance. Figure 2b compare the PCRB curves obtained using a fixed waveform in a traditional radar and dynamic waveform in cognitive radar. Obviously, the PCRB in the cognitive radar provides a valid lower bound for all time steps, and the convergence is faster.

As we know, the CRLB does not care about the estimation method, but only reflects the best effect of using the existing available information to estimate the parameters. Thus, it can be seen from the figures that after the cognitive system utilizes the extra environmental information through the closed-loop structure, it is possible to lower the previous bounds further.

3. Cognitive Tracking Problem Model in Unknown Environment

In this section, we develop the cost-reference PF (CRPF) for a cognitive robust filter. This means that we extend the classic cognitive radar scenarios to an unknown environment. Specifically, the general description of the CRPF approach is listed first, then the details on fusing the cost with cognitive method are specified.

3.1. Cost-Reference Particle Filtering Approach

As we all know, SPF works well when the mathematical forms of the probability distributions of the noise are assumed explicit, but when the mathematical representation of the dynamics of the system evolution is unknown a priori, or the assumptions of probabilistic models cannot be achieved, SPF may fail and the estimation results may be inaccurate even if they have the cognitive structure.

Unlike basic PF, which only gives the state vectors in sequential algorithm, CRPF gives the state samples and associated costs, namely the WPS as the form:

Ξ_{k} = {x_{k}^{(i)}, C_{k}^{(i)}}_{i = 1}^{M},

(19)

where

C_{k}^{(i)} = C (x_{0 : k}^{(i)} | y_{1 : k}^{}, λ)

is the cost of the particle

x_{k}^{(i)}

. The cost function can be denoted by a recursive additive structure:

C (x_{0 : k}^{} | z_{1 : k}^{}, λ) = λ C (x_{0 : k - 1}^{} | z_{1 : k - 1}^{}, λ) + Δ C (x_{k}^{} | z_{k}^{}),

(20)

where

λ

is a forgetting factor, which is used to avoid attributing an excessive weight to old observations.

Δ C (x_{k}^{} | z_{k}^{})

is the incremental cost function, the prediction of which can be obtained as:

Δ C (f_{x} (x_{k - 1}^{}) | z_{k}^{}) = R (x_{k - 1}^{} | z_{k}^{}),

(21)

where the one-step risk function is introduced, and the risk of particle

i

can be computed as:

R : R^{L_{x}} \times R^{L_{y}} \to R, x_{k - 1}^{}, z_{k}^{} \to R (x_{k - 1}^{} | z_{k}^{}),

(22)

R_{k + 1}^{(i)} = λ C_{k}^{(i)} + R (x_{k}^{(i)} | z_{k + 1}^{}) .

(23)

We assign the particles in

Ξ_{k + 1}

the PMF, which is, according to the cost, defined as:

π_{k + 1}^{(i)} \propto μ (C_{k + 1}^{(i)}),

(24)

where a monotonically decreasing function can be used as

μ (\cdot)

, which is selected to guarantee an adequate discrimination of low-cost particles from higher ones. Then, we would obtain the minimum cost by maximum the function below:

{\tilde{x}}_{0 : k + 1}^{\min} = x_{k + 1}^{(i_{0})}, i_{0} = \arg \max_{i} {π_{k + 1}^{(i)}} .

(25)

Notice that the mean value here is equal to the minimum cost estimate, and it even has slight advantages over the latter one. More details can be referred to in [26], and the presentation can be found in Algorithm 2.

The overall procedure of the CRPF algorithm is summarized in Algorithm 2.

Algorithm 2. CRPF algorithm for target tracking problem

Initialization

1.

x_{0}^{(i)} \sim p_{0} (x_{0})

, C_{0}^{(i)} = 0

, σ_{0}^{2, (i)}

, i = 1, \dots, N_{s}

, the weighted - particle set Ξ_{0} = {x_{0}^{(i)}, C_{0}^{(i)}}_{i = 1}^{N_{s}}

PMF Update

2.

R_{k}^{(i)} = λ C_{k - 1}^{(i)} + {‖ z_{k} - f_{y} (f_{x} (x_{k - 1}^{(i)})) ‖}^{q}

, q = 1, 2

, {\hat{π}}_{k}^{(i)} \propto μ (R_{k}^{(i)}) = \frac{1}{{(R_{k}^{(i)} - \min {R_{k}^{(i)}}_{i = 1}^{N_{s}} + δ)}^{β}}

, i = 1, \dots, N_{s} .

3.

{{\hat{x}}_{k - 1}^{(i)}, {\hat{C}}_{k - 1}^{(i)}}_{i = 1}^{N_{s}} = RESAMPLE [{{\hat{π}}_{k}^{(i)}}_{i = 1}^{N_{s}}]

Particle Propagation and Variance Update

4.

E_{p_{k} (x_{k}^{} | {\hat{x}}_{k - 1}^{(i)})} [x_{k}^{}] = f_{x} ({\hat{x}}_{k - 1}^{(i)})

, C o v_{p_{k} (x_{k}^{} | {\hat{x}}_{k - 1}^{(i)})} [x_{k}^{}] = σ_{k}^{2, (i)} I_{[x]}

, x_{k}^{(i)} \sim p_{k} (x_{k}^{} | {\hat{x}}_{k - 1}^{(i)}) .

5.

σ_{k}^{2, (i)} = {\begin{cases} σ_{k - 1}^{2, (i)} & t \leq 10 \\ \frac{k - 1}{k} σ_{k - 1}^{2, (i)} + \frac{{‖ x_{k}^{(i)} - g ({\hat{x}}_{k - 1}^{(i)}) ‖}^{2}}{k \times \dim [x]} & t > 10 \end{cases}

, i = 1, \dots, N_{s}

State Estimation

6.

C_{k}^{(i)} = λ C_{k - 1}^{(i)} + {‖ z_{k} - f_{y} (x_{k}^{(i)}) ‖}^{q}

, {\hat{π}}_{k}^{(i)} \propto μ_{2} (C_{k}^{(i)}) = \frac{1}{{(C_{k}^{(i)} - \min {C_{k}^{(i)}}_{i = 1}^{N_{s}} + δ)}^{β}}

, δ, β > 0

, normalized to π_{k}^{(i)},

7.

{\hat{x}}_{k} = x_{k}^{m e a n} = \sum_{i = 1}^{N_{s}} π_{k}^{(i)} x_{k}^{(i)}

, i = 1, \dots, N_{s}

3.2. Cognitive Cost-Reference Particle Filtering

Although the CRPF approach can be used in the dynamic system to deal with the uncertainty problem, by controlling the risk and cost and adapting the variance of the noise, it is also limited by the system performance, e.g., the waveform parameter, process noise, SNR, etc. If the iteration can be performed in another dimension, namely in the varying information entropy caused by the varying parameters, through perceiving the exterior environment, the estimation error may be decreased further. To address this challenge, we propose a cognitive cost-reference particle filter algorithm to perform the robust adaptive filter for target tracking.

3.2.1. Cost Function Design

Considering the more complicated feedback in cognitive CRPF (CCRPF), it seems not easy to achieve the closed-form solution of the CRLB even iteratively [46]. Thus, we use the MC approach to approximate the BIM.

Breaking through the limitation of the traditional CRPF method in Equation (25) that only minimizes the cost of particles in a one-time step, we are dedicated to designing a new cost function based on parameter adaption in cognitive radar framework. The processor cost function is defined with a recursive additive structure as follows:

C (x_{0 : k}^{} | z_{1 : k}^{}, λ; θ_{k}) = λ {\hat{C}}_{k}^{} + Δ C_{k + 1}^{} (θ_{k}) = \sum_{i = 0}^{k} λ^{(k - i)} Δ C (x_{k}^{} | z_{k}^{}; θ_{k}) .

(26)

Lemma 1.

Let

μ_{k, M}

be the unbiased estimation of mean value

μ_{k}

, let

σ_{k}^{(i)} (θ_{k})

be the variance, and let

Δ C (x_{k}^{} | y_{k}^{}; θ_{k})

be a cost function.

If the two following conditions are met:

(1) The

{\hat{σ}}_{k}^{(i)} (θ_{k})

is asymptotically to the

σ_{k}^{(i)} (θ_{k})

, that is

\lim_{k \to \infty} | σ_{k}^{(i)} (θ_{k}) - {\hat{σ}}_{k}^{(i)} (θ_{k}) | = 0 (i . p .)

(27)

where i.p. stands for “in probability”.

(2) The mean incremental cost

\bar{Δ C_{k}} = \sum_{i = 1}^{M} ω_{k}^{(i)} Δ C (x_{k}^{(i)} | y_{k}^{}; θ_{k})

converges to the minimal incremental cost

\lim_{M \to \infty} | Δ C (x_{k}^{o p t} | y_{k}^{}; θ_{k}) - \bar{Δ C_{k}} | = 0 (i . p .),

(28)

and then

μ_{k, M}

is asymptotically to

μ_{k}

, so namely the error satisfies

\lim_{k \to \infty} | μ_{k, M} - μ_{k} | = 0 (i . p .) .

(29)

See Appendix A for a proof.

It can be seen from Equation (26) that the cost function changes with the waveform parameters. According to Lemma 1, we can achieve the asymptotically optimal sequence of state vectors by using the MC approach. The loss function is defined with respect to the cost function of the processor and sensor, modeled by:

L_{C, Θ} (θ_{k} | z_{1 : k - 1}^{}; Θ_{k - 1}) \equiv L {C_{k + 1}^{} (θ_{k} | z_{1 : k - 1}^{}; Θ_{k - 1}), T_{Θ} (θ_{k})},

(30)

where

P^{k}

is the waveform library at time k. The loss function should be modeled to balance the predicted conditional Bayes risk/cost and measurement cost. Thus, we define it to be the sum of them, that is,

L_{C, Θ} (θ_{k} | z_{1 : k - 1}^{}; Θ_{k - 1}) \equiv C_{k + 1}^{} (θ_{k} | z_{1 : k - 1}^{}; Θ_{k - 1}) + T_{Θ} (θ_{k}) .

(31)

The next

θ_{k}^{*}

is chosen to minimize the loss function, and the choice of the optimal parameter is equivalent to the optimization problem given by:

θ_{k}^{*} = \underset{θ_{k}^{} \in P}{\arg \min} C_{k + 1}^{} (θ_{k} | z_{1 : k - 1}^{}; Θ_{k - 1}) s.t. θ_{k} \in Θ .

(32)

Standard PF uses the statistical reference, which gives the particles the equal weighting after resampling. CRPF uses the cost reference, which preserves the particle cost after resampling to shift the random grid representation of the cost function toward its local minima. The proposed method is to shift the cost function toward another dimension, namely the vector space effected by the waveform library, to further minimize the cost signal estimates.

3.2.2. Sequential Algorithm and Design Issue

According to Equation (21) and the definition of the risk function, the risk function

R (x_{k}^{} | z_{k + 1}^{}; θ_{k})

is a prediction of the cost increment

Δ C (x_{k + 1}^{} | z_{k + 1}; θ_{k})

, so

R (\cdot)

and

Δ C (\cdot)

are related closely, which can be chosen simply as:

Δ C (x_{k}^{} | z_{k}; θ_{k}) = {‖ z_{k} - f_{y} (x_{k}^{}; θ_{k}) ‖}^{q},

(33)

R (x_{k}^{} | z_{k + 1}^{}; θ_{k}) = {‖ z_{k + 1} - f_{y} (f_{x} (x_{k}^{}; θ_{k})) ‖}^{q} .

(34)

After resampling, particles are drawn according to the propagation density, in which we add the waveform parameter

θ_{k}

because of the cognitive framework, denoted as:

x_{k + 1}^{(i)} \sim p_{k + 1} (x_{k + 1}^{} | {\hat{x}}_{k}^{(i)}; θ_{k}) .

(35)

The expected

E [\cdot]

with respect to the PDF

p (\cdot)

is used as the constraint of Equation (35), denoted as:

E_{p_{k + 1} (x_{k + 1}^{} | x_{k}^{}; θ_{k})} [x_{k + 1}^{}] = f_{x} (x_{k}^{}; θ_{k}) .

(36)

Therefore, the particles propagate randomly according to the dynamic model, which changes with

θ_{k}

, that is, the conditional pdf

p_{k + 1} (x_{k + 1}^{} | x_{k}^{})

still can be arbitrary, but the range of the constraint will be changed by the parameters in another dimension. A zero-mean Gaussian pdf with adaptive variance is used for the particle propagation, denoted as:

x_{k + 1}^{(i)} \sim N (f_{x} ({\hat{x}}_{k - 1}^{(i)}; θ_{k}), σ_{k}^{2, (i)} I_{L_{x}}),

(37)

where

σ_{k}^{2, (i)}

is the variance, and

I_{L_{x}}

is the

L_{x} \times L_{x}

identity function. One choice of adaptive selection for

σ_{k}^{2, (i)}

is to compute it by the recursive model.

σ_{k}^{2, (i)} (θ_{k}) = \frac{t - 1}{t} σ_{k - 1}^{2, (i)} + \frac{{‖ x_{k}^{(i)} - f_{x} ({\hat{x}}_{k - 1}^{(i)}; θ_{k}) ‖}^{2}}{k L_{x}} .

(38)

In terms of the cost function:

C_{k + 1}^{(i)} (θ_{k}) = λ {\hat{C}}_{k}^{(i)} + Δ C_{k + 1}^{(i)} (θ_{k}),

(39)

where

Δ C_{k + 1}^{(i)}

is also affected by the newly updated

θ_{k}

, shown as:

Δ C_{k + 1}^{(i)} (θ_{k}) = Δ C (x_{k + 1}^{(i)} | z_{k + 1}^{}; θ_{k}) .

(40)

The next transmitted waveform is selected. The function

μ

in Equation (41) is selected to assign large probability masses to lower-cost particles. When we add

θ_{k}

to the cost function, the probability masses will be re-assigned as follows:

{\hat{π}}_{k}^{(i)} \propto μ_{2} {C_{k}^{(i)} (θ_{k})} = \frac{1}{{(C_{k}^{(i)} (θ_{k}) - \min_{p} {C_{k}^{(p)} (θ_{k})} + δ)}^{β}},

(41)

where

0 < δ < 1

,

β > 1

. The current state estimation and its prediction covariance matrix is updated by:

i_{0} = \arg \max {\hat{π}}_{k}^{(i)}, {\hat{x}}_{k}^{\min} = {\hat{x}}_{k}^{(i_{0})} .

(42)

We can obtain the optimal sequence of state vectors and the pointwise solution, denoted as:

{\hat{x}}_{k}^{m e a n} = \sum_{i = 1}^{N_{s}} {\hat{π}}_{k}^{(i)} {\hat{x}}_{k}^{(i)} .

(43)

The cognitive structure is combined with the cost-reference function couple, considering the internal and external mechanisms to form the complete CCRPF algorithm. The implementation details are given in Algorithm 3. This can be reduced to the CRPF if using a fixed waveform.

Algorithm 3. CCRPF algorithm for maneuvering target tracking problem.

Initialization,

(for i = 1, \dots, N_{s}),

1.

generate x_{0}^{(i)} \sim p_{0} (x_{0})

, assign the \cos t C_{0}^{(i)} = 0

, and initialize σ_{0}^{2, (i)} .

PMF Update

2. Start with the initial waveform parameter

θ

. For each

θ

compute:

3.

{{\hat{x}}_{k - 1}^{(i)}, ω_{k - 1}^{(i)}} = {x_{k - 1}^{(i)}, ω_{k - 1}^{(i)}}

, i = 1, \dots, N_{s}

, z_{k} = f_{y} (x_{k}^{}, w_{k}; θ)

4.

R_{k}^{(i)} (θ) = λ C_{k - 1}^{(i)} + {‖ z_{k} - f_{y} (f_{x} (x_{k - 1}^{(i)})) ‖}^{q}

, q = 1, 2

; i = 1, \dots, N_{s}

, π_{k}^{(i)} \propto μ (R_{k}^{(i)}) = \frac{1}{{(R_{k}^{(i)} - \min {R_{k}^{(i)}}_{i = 1}^{N_{s}} + δ)}^{β}}

5.

Resampling {\hat{x}}_{k - 1}^{} = {{\hat{x}}_{k - 1}^{(i)}, {\hat{C}}_{k - 1}^{(i)}}_{i = 1}^{N_{s}}

according to π_{k}^{(i)}

Particle Propagation and Waveform Selection

6.

x_{k}^{(i)} \sim p_{k} (x_{k}^{} | {\hat{x}}_{k - 1}^{(i)})

, compute the cost C_{k}^{(i)} (θ) = λ C_{k - 1}^{(i)} + {‖ z_{k} - f_{y} (x_{k}^{(i)}) ‖}^{q}

7.

Compute the \cos t function θ_{k}^{*} = \underset{θ}{\arg \min} C_{Θ}^{(i)} (θ_{k} | z_{1 : k - 1}^{}; Θ_{k - 1}) + C_{Θ} (θ_{k})

8.

Select the optimal waveform parameter θ_{k} = θ_{k}^{*}

Particle and Measurement Recursive Update

9.

{\hat{x}}_{k}^{(i)} = {\hat{x}}_{k}^{(i)}

, z_{k} = f_{y} (x_{k}^{}, w_{k}; θ_{k})

, C_{k}^{(i)} = C_{k}^{(i)} (θ_{k})

10.

σ_{k}^{2, (i)} (θ_{k}) = {\begin{cases} σ_{k - 1}^{2, (i)} & t \leq 10 \\ \frac{k - 1}{k} σ_{k - 1}^{2, (i)} + \frac{{‖ x_{k}^{(i)} - f_{x} ({\hat{x}}_{k - 1}^{(i)}) ‖}^{2}}{k \times \dim [x]} & t > 10 \end{cases}

, i = 1, …, N_s

Information Update and State Estimation

11.

{\hat{π}}_{k}^{(i)} \propto μ_{2} (C_{k}^{(i)}; θ_{k}) = \frac{1}{{(C_{k}^{(i)} - \min {C_{k}^{(i)}}_{i = 1}^{N_{s}} + δ)}^{β}},

where α, β > 0 . Normalize the PMF .

12.

{\hat{x}}_{k} = {\hat{x}}_{k}^{m e a n} = \sum_{i = 1}^{N_{s}} {\hat{π}}_{k}^{(i)} {\hat{x}}_{k}^{(i)}

. Save the {\hat{x}}_{k}

and {\hat{P}}_{k}^{} (θ_{k}) .

3.3. Convergence of CCRPF Algorithm

Due to the change of waveform parameter, resulting in the change of the measurement noise, convergence results regarding CRPFs may be not valid for the proposed algorithm. We assess the convergence. More details about the preliminary definitions and the derivation of the conditions are referred to in [26].

Lemma 2.

Let

Δ C (x_{k}^{} | y_{k}^{})

be a cost function. With a fixed waveform parameter

θ

, if the three following conditions are met:

(1) The set function

μ_{k} (A \subseteq {x_{k}^{(i)}}_{i = 1}^{M}) = \sum_{x \in A}^{} μ (Δ C (x_{k} | y_{k}^{}))

satisfies

\lim_{M \to \infty} \Pr [1 - \frac{μ_{k} (S^{M} (x_{k}^{o p t} (θ), ε))}{μ_{k} ({x_{k}^{(i)} (θ)}_{i = 1}^{M})} \geq δ] = 0 \forall δ > 0,

(44)

where

\Pr [\cdot]

denotes probability,

{x_{k}^{(i)}}_{i = 1}^{M}

is a set of particles drawn at time step

k

, and

S^{M} {x_{k}^{o p t} (θ), ε} = {x \in {x_{k}^{(i)}}_{i = 1}^{M} : ‖ x - x_{k}^{o p t} ‖ < ε; θ}

.

(2) The mean incremental cost

\bar{Δ C_{k}} = \sum_{i = 1}^{M} ω_{k}^{(i)} Δ C (x_{k}^{(i)} | y_{k}^{})

converges to the minimal incremental cost

\lim_{M \to \infty} | Δ C (x_{k}^{o p t} | y_{k}^{}; θ) - \bar{Δ C_{k}} | = 0 (i . p .) .

(45)

(3) The mean cost estimate is asymptotically optimal,

\lim_{M \to \infty} | Δ C ({\tilde{x}}_{k}^{m e a n} | y_{k}^{}; θ) - Δ C (x_{k}^{o p t} | y_{k}^{}; θ) | = 0 (i . p .),

(46)

where

{\tilde{x}}_{k}^{m e a n} = \sum_{i = 1}^{M} π_{k}^{(i)} x_{k}^{(i)}

, then with the adaptive parameter

θ_{k}

, the set function satisfies:

\lim_{M \to \infty} [μ_{k} (S^{M} (x_{k}^{o p t} (θ_{k}), ε)) / μ_{k} ({x_{k}^{(i)} (θ_{k})}_{i = 1}^{M})] = 1 (i . p .),

(47)

and the mean cost estimate is asymptotically optimal, that is:

\lim_{M \to \infty} | Δ C ({\tilde{x}}_{k}^{m e a n} | y_{k}^{}; θ_{k}) - Δ C (x_{k}^{o p t} | y_{k}^{}; θ_{k}) | = 0 (i . p .) .

(48)

See Appendix B for a proof.

4. Numerical Results and Discussion

4.1. Dynamic Model

The evolution of the target state and the corresponding measurements are described by a known discrete-time stochastic model separately [47]. Consider the target moving in the

x - y

plane according to the model:

x_{k + 1} = F_{k} x_{k} + v_{k},

(49)

where

{[x_{1, k}^{}, x_{2, k}^{}, x_{3, k}^{}, x_{4, k}^{}]}^{T}

denotes the state vector X_k of the target,

(x_{1, k}^{} x_{2, k}^{})

, and

(x_{3, k}^{}, x_{4, k}^{})

denotes the position and the velocity, respectively.

v_{k} = {[00 v_{x} v_{y}]}^{T}

is the system noise. We assume that the sampling period is short enough for the velocity to be a constant during the period, so the state transition matrix is defined as:

F_{k} = [\begin{array}{l} F_{11} & F_{12} \\ 0 & F_{22} \end{array}], F_{11} = F_{22} = [\begin{array}{l} 10 \\ 01 \end{array}], F_{12} = [\begin{array}{l} T 0 \\ 0 T \end{array}] .

(50)

The measurement equation is considered to be highly nonlinear, and as described as follows,

z_{k} = {[r_{k}, {\dot{r}}_{k}, β_{k}]}^{T}

is selected as the observable vector, and the observation matrix is denoted as:

z_{k} = {[r_{k}, {\dot{r}}_{k}, β_{k}]}^{T} + w_{k},

(51)

where the range

r_{k}

, range-rate

{\dot{r}}_{k}

, and bearing

β_{k}

compose the observable vector, that is:

r_{k} = {(x_{1, k}^{2} + x_{2, k}^{2})}^{1 / 2}, {\dot{r}}_{k} = {(x_{3, k}^{2} + x_{4, k}^{2})}^{1 / 2}, β_{k} = \arctan (x_{2, k}^{} / x_{1, k}^{}) .

(52)

Select Equation (15) to be the model of

R_{k}

. The fixed-waveform parameter is set as

λ_{0} = 50 \times 10^{- 6} s

,

b_{0} = 60 \times 10^{9} yad / s^{2}

, and the initial SNR is set as

η_{0} = 16

, so

R_{0} = R (λ_{0}, b_{0}, η_{0})

. The initial true state of the target is

{[10, - 5, - 0.2, 0.2]}^{T}

, and the given initial state is set as

{[6 - 2 0 0]}^{T}

,

σ_{0}^{2, (i)} = {[1.5 10 0.1 1.5]}^{T}

. The sampling period is

T = 1 s

. The generally used heavy-tailed distributions include Laplace distribution, t distribution, uniform distribution, and Gaussian distribution with large variance. Considering that the mixture Gaussian is applied, the sum of the different weighted Gaussian noises with different parameters can be used as the modeling of flicker noise

w_{k} (θ_{k - 1})

. The pdf of flicker noise can be denoted as:

v_{x, y} \sim 0.1 N (0, 1) + 4 \times 10^{- 4} N (0, 1) + 10^{- 6} N (0, 1),

(53)

w_{r} \sim \sqrt{0.1} N (0, σ_{r}) + 0.04 \times 10^{- 4} N (0, σ_{r}) + 2.5 \times 10^{- 5} N (0, σ_{r}),

(54)

w_{β} \sim 0.1 N (0, σ_{β}) + 4 \times 10^{- 4} N (0, σ_{β}) + 1.6 \times 10^{- 7} N (0, σ_{β}),

(55)

where

N (μ_{i}, Σ_{i})

denotes the Gaussian distribution with the mean of

μ_{i}

and the variance of

Σ_{i}

. The particle number is 400.

For a particular scenario and parameters, the overall performance of a filter is evaluated using the metric that is the ARMSE. We use it to estimate the tracking accuracy of the algorithms, and it is defined as:

ARMSE = \sqrt{\frac{1}{N_{m} K} \sum_{i = 1}^{N_{m}} \sum_{k = 1}^{K} [{({\hat{x}}_{1, k}^{i} - x_{1, k})}^{2} + {({\hat{x}}_{3, k}^{i} - x_{3, k})}^{2}]},

(56)

where

K

is the total number of time steps and

N_{m}

is the total number of independent MC runs. Let

(x_{1, k}, x_{3, k})

and

({\hat{x}}_{1, k}^{i}, {\hat{x}}_{3, k}^{i})

denote the true and the estimated target position at time

k

at the

i - t h

MC run, respectively. Then, the RMSE of the estimated states at

k

can be computed as [48]:

RMSE (k) = \sqrt{\frac{1}{N_{m}} \sum_{i = 1}^{N_{m}} [{({\hat{x}}_{1, k}^{i} - x_{1, k})}^{2} + {({\hat{x}}_{2, k}^{i} - x_{2, k})}^{2}]} .

(57)

The upper and lower limits of the waveform parameters are determined according to the transmitter specifications, and the waveform library can be obtained as:

P = {λ \in [λ_{\min} : Δ λ : λ_{\max}], b \in [b_{\min} : Δ b : b_{\max}]},

(58)

where Δλ denotes the step-size of the envelope duration, and Δb denotes the step-size of the chirp rate.

4.2. Selection of Function μ and Parameters δ

Before implementing the simulation experiments, we check the behavior of the proposed method with different function

μ

and different values of tuning parameter

δ

in Equations (33), (34) and (41), to select suitable

μ

and

δ

.

The selection of the tuning parameters is investigated in Figure 3. Figure 3a shows four cases of mean absolute deviation after the tracking, with varying values set of

μ

and

q

. The performance in the case (

μ 1, q = 1

) seems well at beginning but shows a rapid degradation soon. The performances of the other cases are similar, wherein case (

μ 2, q = 2

) shows a slightly better overall performance. Three cases with varying values of

δ

are considered in Figure 3b. It is clear that a small

δ

may result in a worse improvement. Therefore, it is reasonable to select

μ 2, q = 2, δ = 1

for the rest of the simulations according to the analysis and conclusion.

4.3. Simulation Results

4.3.1. Scenario 1: Unknown Statistics in General Environment

We assume that the process noise and measurement noise is independent and temporally white in this case, with the mixture Gaussian pdf as Equations (53)–(55) show, but the given methods are mismatched with the dynamic systems and models.

We apply SPF, CRPF, and CCRPF for target tracking in a two-dimensional space for performance assessment. Figure 4 displays the true path against the tracks of these filters. The other two tracks the target with an error smaller than the SPF, wherein the CCRPF outperforms CRPF slightly. Note how the trajectory of CCRPF deviates from the true path slightly due to the cost function and cognitive structure. EKF does not show the performance of robustness, and we do not show the result here.

All error curves corresponding to the above three filters were obtained by simulation runs. The results for the estimation error of the range and range-rate are shown in Figure 5a,b respectively. Observe from Figure 5 that the SPF introduces large bias in the estimation. CRPF and CCRPF are unbiased throughout the observation period. As for the comparison of CRPF and CCRPF, the overall performance of CCRPF is considered better than that of CRPF. Particularly, the CCRPF estimation is approaching zero bias except for the very beginning stage, indicating a fairly robust performance.

In term of the convergence speed, the convergence of CRPF and CCRPF shows more rapidly than SPF and keeps stable; thus, they are more robust than SPF in tracking performance. The CRPF converges the error asymptotically, while the proposed method achieves convergence in less than 10 time steps, due to the real-time perception of environment in the cognitive algorithm.

CRPF does not have a bad overall performance, but it provides higher peak errors than CCRPF since the cost reference mechanism can only use the fixed waveform, that is to say, it can only be adaptive in the constant environment conceived by the fixed waveform. However, cognitive radar based on CCRPF can not only perceive the dynamic environment by the dynamic waveform, but can also stay adaptive during the recursion after the waveform is chosen. Thus, CCRPF can suppresses the peak error on the fly and limits the overall magnitude of the error, showing the superiority.

A summary of these results and the detailed comparison are also tabulated in Table 1. The parameter of fixed waveform 1 is

λ = 10 \times 10^{- 6}

,

b = 10 \times 10^{9}

, and the parameter of fixed waveform 2 is

λ = 50 \times 10^{- 6}

,

b = 60 \times 10^{9}

. The first row of the table lists the metrics corresponding to the ranging performance by MC simulations with 40 independent runs, which is computed using Equation (56). The second row gives the performance on estimation of range-rate.

From this table, more concretely, we see that the SPF shows a degradation in performance, it has the lowest tracking accuracy of the compared methods. From the comparison of PFs using two different fixed waveforms, we find that PF with waveform 2 is better than that with waveform 1, due to the effects of the measurement noise obtained by different waveforms, and the same conclusion can be found in CRPFs. It is clear that the best filters for this case was CCRPF, which achieves an excellent performance of a 15% improvement over the SPF-2 and a 3% improvement over the CRPF-2.

The unknown environment has a huge influence on SPF for tracking. The mismatch of the noise is the main cause result in the performance degradation. CRPF is less affected since it takes advantage of the cost reference mechanism, so it has better tracking accuracy than SPF. The mastery of environmental information in time has an extra contribution in the proposed algorithm, which has more obvious performance improvement compared with others because of the robust module working cooperatively with the cognitive structure. This also indicates that it is a good supplement to the cognitive radar tracking approach.

To study how the cognition process evolves across time, we have plotted the waveform selection for both the chirp rate and the duration of the pulse envelope in Figure 6a,b. We observe that the transition of the chirp rate is switched from maximum up-sweep to maximum down-sweep.

4.3.2. Scenario 2: Unknown Statistics in Dynamic Noise Environment

Unknown and abrupt change of noise would be considered in this case. The main parameters are the same with Scenario 1. Measurement noise covariance

R

is known a priori, and the actual process noise covariance

Q

is set as:

{\begin{cases} Q_{0} = Q^{\circ}, & k = 1, \dots, 60 \\ Q_{0} = 80 \times Q^{\circ}, & k = 61, \dots, 80 \end{cases} .

(59)

Figure 7 shows the RMSE curves corresponding to the four filters. When

k = 1, \dots, 60

with a priori known noise, all algorithms can track the target successfully. In the subsequent phase when

Q

is increasing from the time step 60 abruptly, although SPF could not be seen as a failure to track the target, its convergence effect and speed are all the worst. CRPF shows a similar convergence speed to CCRPF, because of the similar robust module, but its overall performance during the dynamic noise stage is obviously inferior to the latter. The proposed approach shows the superiority again, with the lowest error and fastest convergence speed.

A summary of the results and the detailed comparison are also tabulated in Table 2. We do not use a different waveform in SPF and CRPF anymore because similar work has already been done in Scenario 1.

5. Conclusions

We have concentrated on the tracking method of the cognitive radar based on particle filter. The CCRPF for cognitive radar is proposed as a significant step toward random dynamic systems with unknown statistics. In this work, the mathematical model of cognitive radar has been derived, CRLB and the corresponding cost function have been designed, the cognitive PF algorithm has been presented by completely reconstructing the propagation and update process, and the proposed algorithm has been developed by updating the cost function and noise variance. Moreover, the convergence of the approach has been proofed. The simulation results illustrate that the state error prediction is more adjacent to the CRLB, and the proposed method showed a good outperforming result over the existing methods in accuracy and robustness on highly nonlinear dynamic systems with unknown statistics or a complicated environment. The application of cognitive radar and the adaptation of a traditional filter have been expanded to a wider scope.

Future work: (1) PF is approximately globally optimal, while cognitive estimation is biased, so the closed-form of the optimal solution is not easy to obtain in the PF-based cognitive tracking method, and we can only use MC to get the solution or proof the convergence. However, the cost function in CRPF might have the optimal solution; thus, the cost function design is deserves to have further study. (2) Using novel SMC methods as the cognitive framework may have better prospects in dealing with dynamic system problems with unknown or uncertain statistics. (3) The lower limit and dynamic change of particle number could also be determined.

Author Contributions

The work presented here was carried out in collaboration between all authors. conceptualization, Y.L. and L.Z.; methodology, L.Z.; validation, L.Z.; formal analysis, L.Z.; funding acquisition, W.C.; investigation, W.C. and L.Z.; data curation, L.Z.; software, Y.Z.; writing—original draft, L.Z.; Writing—review and editing, W.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Fundamental Research Funds for the Central Universities under Grant 3102019ZX015.

Acknowledgments

The authors wish to thank Haobin Li in 54th Research Institute of China Electronics Technology Group Corporation and Zeyu Wang in Huawei Technologies Co., Ltd. for their support and helpful advice.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this paper:

PF	Particle Filtering
MC	Monte Carlo
PMCMC	Particle Markov Chain Monte Carlo
SMC	Sequential Monte Carlo
SMC²	Sequential Monte Carlo Square
SIS	Sequential Importance Sampling
SIR	Sampling Importance Resampling
APF	Adaptive PF
CRLB	Cramér–Rao Lower Bound
CPF	Cognitive PF
SPF	Standard PF
MSE	Mean Square Error
PAC	Perception-action Cycle
BCRB	Bayesian Cramér–Rao lower bound
BIM	Bayesian Fisher Information Matrix
MMSE	Minimum MSE
SNR	Signal-to-noise Ratio
LFM	Linear Frequency Modulation
RMSE	Root Mean Square Error
PCRB	Posterior Cramér–Rao Bound
CRPF	Cost-reference Particle Filter
WPS	Weighted-particle Set
PMF	Probability Mass Function
CCRPF	Cognitive Cost-reference Particle Filter
PDF	Probability Density Function
ARMSE	Average Root Mean Square Error
FIM	Fisher Information Matrix
KLD	Kullback–Leibler Distance
GRNN	General Regression Neural Network
CKF	Cubature Kalman Filter
CD-CKF	Continuous-Discrete Cubature Kalman Filter

Appendix A

In the MC integration, Equation (1) is as the expected value of

f_{k} (x_{i}^{})

, so it can be estimated as the sample mean:

μ_{k, M} = \frac{1}{M} \sum_{i = 1}^{M} f_{k} (x_{i}^{}) .

(A1)

Supposing

μ_{k} = E [f_{k} (x)]

, it can be proved that

μ_{k, M}

is the unbiased estimation of

μ_{k}

, and the non-zero bounded variance can be denoted as:

σ_{k}^{2} (θ_{k}) = \int_{D} {(f_{k} (x) - μ_{k})}^{2} p (x) d x < + \infty .

(A2)

Given the confidence level

1 - α

, the error can be denoted as:

| μ_{k, M} - μ_{k} | \leq λ_{α} {\hat{σ}}_{k}^{} (θ_{k}) / \sqrt{M} .

(A3)

Owing to Condition (1), we can obtain the expression:

\lim_{M \to \infty} | σ_{k} (θ_{k}) - {\hat{σ}}_{k}^{} (θ_{k}) | = 0 (i . p .),

(A4)

\lim_{\begin{array}{l} M \to \infty \\ k \to \infty \end{array}} | μ_{k, M} - μ_{k} | = 0 (i . p .) .

(A5)

Appendix B

When we choose the optimal parameter

θ_{k}

, and from the preliminary definitions we can exploit that:

\lim_{M \to \infty} E_{\Pr [n_{M}]} [n_{M}] = (1 - γ) \lim_{M \to \infty} M (i . p .),

(A6)

then the following relationship holds:

\begin{array}{l} \lim_{M \to \infty} & \frac{E_{\Pr [n_{M}]} [n_{M}]}{μ_{k} (S^{M} (x_{k}^{o p t}, ε; θ_{k}))} \\ = (1 - γ) \lim_{M \to \infty} \frac{M}{μ_{k} (S^{M} (x_{k}^{o p t}, ε; θ_{k}))} \\ \leq (1 - γ) \lim_{M \to \infty} \frac{M}{S_{i n}} = 0 \end{array}

(A7)

Thus, we can write:

\lim_{M \to \infty} \Pr [1 - \frac{μ_{k} (S^{M} (x_{k}^{o p t}, ε; θ_{k}))}{μ_{k} ({x_{k}^{(i)}}_{i = 1}^{M})} \geq δ] ≅ \frac{1 - δ}{δ} S_{o u t} \lim_{M \to \infty} \frac{E_{\Pr [n_{M}]} [n_{M}]}{μ_{k} (S^{M} (x_{k}^{o p t}, ε; θ_{k}))} = 0,

(A8)

\lim_{M \to \infty} [μ_{k} (S^{M} (x_{k}^{o p t} (θ_{k}), ε)) / μ_{k} ({x_{k}^{(i)} (θ_{k})}_{i = 1}^{M})] = 1 (i . p .) .

(A9)

The proof of asymptotically optimization of the mean cost estimate is carried out by following steps:

\begin{array}{l} \lim_{M \to \infty} & | Δ C ({\tilde{x}}_{k}^{m e a n} | y_{k}^{}; θ_{k}) - Δ C (x_{k}^{o p t} | y_{k}^{}; θ_{k}) | \\ \leq \lim_{M \to \infty} | \sup_{x_{k}^{} \in S^{M} (x_{k}^{o p t}, ε)} Δ C (x_{k}^{} | y_{k}^{}; θ) - Δ C (x_{k}^{o p t} | y_{k}^{}; θ) | \\ = B (ε = 1 / \sqrt{M}) = 0 \end{array}

(A10)

References

Avila, D.; Alvarez, E.; Abusleme, A. Noise analysis in pulse-processing discrete-time filters. IEEE Trans. Nucl. Sci. 2013, 60, 4634–4640. [Google Scholar] [CrossRef]
Doucet, A.; Godsill, S.; Andrieu, C. On sequential Monte Carlo sampling methods for Bayesian filtering. Stat. Comput. 2000, 10, 197–208. [Google Scholar] [CrossRef]
Gordon, N.J.; Salmond, D.J.; Smith, A.F.M. Novel approach to nonlinear/non-Gaussian Bayesian state estimation. In IEE Proceedings F on Radar and Signal Processing; Institution of Engineering and Technology (IET): London, UK, 1993; pp. 107–113. [Google Scholar]
Míguez, J. Analysis of parallelizable resampling algorithms for particle filtering. Signal Process. 2007, 87, 3155–3174. [Google Scholar] [CrossRef]
Rubin, D.B. The calculation of posterior distributions by data augmentation: Comment: A noniterative sampling/importance resampling alternative to the data augmentation algorithm for creating a few imputations when fractions of missing information are modest: The SIR algorithm. J. Am. Stat. Assoc. 1987, 82, 543–546. [Google Scholar]
Julier, S.J.; Uhlmann, J.K. Unscented filtering and nonlinear estimation. Proc. IEEE 2004, 92, 401–422. [Google Scholar] [CrossRef] [Green Version]
Arulampalam, M.S.; Maskell, S.; Gordon, N.; Clapp, T. A Tutorial on Particle Filters for Online Nonlinear/Non-Gaussian Bayesian Tracking. IEEE Trans. Signal Process. 2002, 50, 174–188. [Google Scholar] [CrossRef] [Green Version]
Vaswani, N. Particle Filtering for Large-Dimensional State Spaces with Multimodal Observation Likelihoods. IEEE Trans. Signal Process. 2008, 56, 4583–4597. [Google Scholar] [CrossRef] [Green Version]
Havangi, R. Target tracking based on improved unscented particle filter with Markov chain Monte Carlo. IETE J. Res. 2018, 64, 873–885. [Google Scholar] [CrossRef]
Whiteley, N.; Singh, S.; Godsill, S. Auxiliary Particle Implementation of Probability Hypothesis Density Filter. IEEE Trans. Aerosp. Electron. Syst. 2010, 46, 1437–1454. [Google Scholar] [CrossRef] [Green Version]
Fox, D. Adapting the Sample Size in Particle Filters Through KLD-Sampling. Int. J. Robot. Res. 2003, 22, 985–1003. [Google Scholar] [CrossRef]
Scharcanski, J.; de Oliveira, A.B.; Cavalcanti, P.G.; Yari, Y. A particle-filtering approach for vehicular tracking adaptive to occlusions. IEEE Trans. Veh. Technol. 2010, 60, 381–389. [Google Scholar] [CrossRef]
Sanguino, T.D.J.M.; Gómez, F.P. Toward simple strategy for optimal tracking and localization of robots with adaptive particle filtering. IEEE/ASME Trans. Mechatron. 2016, 21, 2793–2804. [Google Scholar] [CrossRef]
Zhou, T.; Peng, D.; Xu, C.; Zhang, W.; Shen, J. An adaptive particle filter based on kullback-leibler distance for underwater terrain aided navigation with multi-beam sonar. IET Radar Sonar Navig. 2018, 12, 433–441. [Google Scholar] [CrossRef]
Wei, W.; Gao, S.; Zhong, Y.; Gu, C.; Hu, G. Adaptive Square-Root Unscented Particle Filtering Algorithm for Dynamic Navigation. Sensors 2018, 18, 2337. [Google Scholar] [CrossRef] [Green Version]
Fox, D. KLD-sampling: Adaptive particle filters. In Proceedings of the 14th Conference on Advances in Neural Information Processing Systems (NIPS), Vancouver, BC, Canada, 3–8 December 2002; pp. 713–720. [Google Scholar]
Soto, A. Self adaptive particle filter. In Proceedings of the 19th International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, 30 July–5 August 2005; pp. 1398–1403. [Google Scholar]
Andrieu, C.; Doucet, A.; Holenstein, R. Particle markov chain monte carlo methods. J. R. Stat. Soc. Ser. B (Stat. Methodol.) 2010, 72, 269–342. [Google Scholar] [CrossRef] [Green Version]
Chopin, N.; Jacob, P.E.; Papaspiliopoulos, O. SMC2: An efficient algorithm for sequential analysis of state space models. J. R. Stat. Soc. Ser. B (Stat. Methodol.) 2013, 75, 397–426. [Google Scholar] [CrossRef] [Green Version]
Crisan, D.; Miguez, J. Nested particle filters for online parameter estimation in discrete-time state-space Markov models. Bernoulli 2018, 24, 3039–3086. [Google Scholar] [CrossRef]
Martino, L.; Read, J.; Elvira, V.; Louzada, F. Cooperative parallel particle filters for online model selection and applications to urban mobility. Digit. Signal Process. 2017, 60, 172–185. [Google Scholar] [CrossRef] [Green Version]
Carvalho, C.M.; Johannes, M.S.; Lopes, H.F.; Polson, N.G. Particle learning and smoothing. Stat. Sci. 2010, 25, 88–106. [Google Scholar] [CrossRef] [Green Version]
Drovandi, C.C.; McGree, J.M.; Pettitt, A.N. A sequential Monte Carlo algorithm to incorporate model uncertainty in Bayesian sequential design. J. Comput. Graph. Stat. 2014, 23, 3–24. [Google Scholar] [CrossRef] [Green Version]
Urteaga, I.; Bugallo, M.F.; Djurić, P.M. Sequential Monte Carlo methods under model uncertainty. In Proceedings of the 2016 IEEE Statistical Signal Processing Workshop (SSP), Palma de Mallorca, Spain, 26–29 June 2016; pp. 1–5. [Google Scholar]
Stateczny, A.; Kazimierski, W. A comparison of the target tracking in marine navigational radars by means of GRNN filter and numerical filter. In Proceedings of the IEEE Radar Conference, Rome, Italy, 26–30 May 2008; pp. 1–4. [Google Scholar]
Míguez, J.; Bugallo, M.F.; Djuric, P.M. A New Class of Particle Filters for Random Dynamic Systems with Unknown Statistics. EURASIP J. Adv. Signal Process. 2004, 2004, 303619. [Google Scholar] [CrossRef] [Green Version]
Djuric, P.M.; Bugallo, M.F. Cost-reference particle filtering for dynamic with nonlinear and conditionally linear states. In Proceedings of the Nonlinear Statistical Signal Processing Workshop, Cambridge, UK, 13–15 September 2006; pp. 183–188. [Google Scholar]
Yu, Y.H. Combining H∞ filter and cost-reference particle filter for conditionally linear dynamic systems in unknown non-Gaussian noises. Signal Process. 2013, 93, 1871–1878. [Google Scholar] [CrossRef]
Lim, J. Particle filtering for nonlinear dynamic state systems with unknown noise statistics. Nonlinear Dyn. 2014, 78, 1369–1388. [Google Scholar] [CrossRef]
Míguez, J. Analysis of selection methods for cost-reference particle filtering with applications to maneuvering target tracking and dynamic optimization. Digit. Signal Process. 2007, 17, 787–807. [Google Scholar] [CrossRef]
Bugallo, M.F.; Maiz, C.S.; Miguez, J.; Djuric, P.M. Cost-Reference Particle Filters and Fusion of Information. In Proceedings of the IEEE 13th Digital Signal Processing Workshop & 5th IEEE Signal Processing Education Workshop, Marco Island, FL, USA, 4–7 January 2009; pp. 286–291. [Google Scholar]
Haykin, S. Cognitive radar: A way of the future. IEEE Signal Process. Mag. 2006, 23, 30–40. [Google Scholar] [CrossRef]
Bell, K.L.; Johnson, J.T.; Smith, G.E.; Baker, C.J.; Rangaswamy, M. Cognitive radar for target tracking using a software defined radar system. In Proceedings of the IEEE Radar Conference (RadarCon), Arlington, VA, USA, 10–15 May 2015; pp. 1394–1399. [Google Scholar]
Smith, G.E.; Cammenga, Z.; Mitchell, A.; Bell, K.L.; Johnson, J.; Rangaswamy, M.; Baker, C. Experiments with cognitive radar. IEEE Aerosp. Electron. Syst. Mag. 2016, 31, 34–46. [Google Scholar] [CrossRef]
Xue, Y. Cognitive Radar: Theory and Simulations. Ph.D. Thesis, McMaster University, Hamilton, ON, Canada, 2010. [Google Scholar]
Wang, J.; Qin, Y.; Wang, H.; Li, X. Dynamic waveform selection for manoeuvering target tracking in clutter. IET Radar Sonar Navig. 2013, 7, 815–825. [Google Scholar] [CrossRef]
Wang, S.; Bi, D.; Ruan, H.; Chen, S. Cognitive structure adaptive particle filter for radar manoeuvring target tracking. IET Radar Sonar Navig. 2018, 13, 23–30. [Google Scholar] [CrossRef]
Ristic, B.; Arulampalam, S.; Gordon, N. A Tutorial on Particle Filters. In Beyond the Kalman Filter: Particle Filters for Tracking Applications; Artech House: Boston, MA, USA; London, UK, 2004; Chapter 3, Section 2; pp. 37–41. [Google Scholar]
Guerci, J.R. Introduction. In Cognitive Radar: The Knowledge-Aided Fully Adaptive Approach; Artech House: Norwood, MA, USA, 2010; Chapter 1, Section 2; pp. 14–16. [Google Scholar]
Bell, K.L.; Baker, C.J.; Smith, G.E.; Johnson, J.T.; Rangaswamy, M. Cognitive radar framework for target detection and tracking. IEEE J. Sel. Top. Signal Process. 2015, 9, 1427–1439. [Google Scholar] [CrossRef]
Kershaw, D.J.; Evans, R.J. Optimal waveform selection for tracking systems. IEEE Trans. Inf. Theory 1994, 40, 1536–1550. [Google Scholar] [CrossRef]
Van Trees, H.L. Estimation of the Parameters of a Random Process. In Detection, Estimation, and Modulation Theory, Part III: Radar-Sonar Signal Processing and Gaussian Signals in Noise; Wiley-Interscience: New York, NY, USA, 2001; Chapter 6, Section 3; pp. 177–184. [Google Scholar]
Tichavsky, P.; Muravchik, C.H.; Nehorai, A. Posterior Cramer-Rao bounds for discrete-time nonlinear filtering. IEEE Trans. Signal Process. 1998, 46, 1386–1396. [Google Scholar] [CrossRef] [Green Version]
Haykin, S. Kalman Filters. In Adaptive Filter Theory, 5th ed.; Pearson Education: Essex, UK, 2014; Chapter 14, Section 5; pp. 573–575. [Google Scholar]
Kershaw, D.; Evans, R. Waveform selective probabilistic data association. IEEE Trans. Aerosp. Electron. Syst. 1997, 33, 1180–1188. [Google Scholar] [CrossRef]
Andrieu, C.; Freitas, J.F.G.; Doucet, A. Sequential Bayesian Estimation and Model Selection Applied to Neural Networks; Technical Report CUED/F-INFENG/TR 341; Cambridge University Engineering Department: Cambridge, UK, 1999. [Google Scholar]
Challa, S.; Morelande, M.; Musicki, D.; Evans, R. Maneuvering object tracking. In Fundamentals of Object Tracking; Cambridge University Press: Cambridge, UK, 2011; Chapter 3, Section 2; pp. 66–72. [Google Scholar]
Li, X.R.; Jilkov, V.P. Survey of maneuvering target tracking. Part I. Dynamic models. IEEE Trans. Aerosp. Electron. Syst. 2004, 39, 1333–1364. [Google Scholar]

Figure 1. Cognitive radar system for target tracking.

Figure 2. Mean square error (MSE) and posterior Cramér–Rao bound (PCRB): (a) Comparison of PCRB and MSE; (b) Comparison of PCRB with fixed waveform and dynamic waveform, position

x_{k}

.

Figure 2. Mean square error (MSE) and posterior Cramér–Rao bound (PCRB): (a) Comparison of PCRB and MSE; (b) Comparison of PCRB with fixed waveform and dynamic waveform, position

x_{k}

.

Figure 3. Selection of functions and parameters: (a) Mean absolute deviation for different

μ

functions; (b) Mean absolute deviation for different

δ

values.

Figure 3. Selection of functions and parameters: (a) Mean absolute deviation for different

μ

functions; (b) Mean absolute deviation for different

δ

values.

Figure 4. Tracking example.

Figure 5. Comparison of root mean square error (RMSE) between cognitive CRPF (CCRPF) with PF and CRPF: (a) estimated position; (b) Estimated velocity.

Figure 6. Waveform selection across time: (a) On chip rate; (b) On length of pulse envelope.

Figure 7. RMSE of the estimated range.

Table 1. Performance comparison of algorithms.

ARMSE	SPF-1 ¹	SPF-2 ²	CRPF-1 ³	CRPF-2 ⁴	CCRPF
Range	1.54	1.09	1.26	0.96	0.93
Range-rate	0.37	0.24	0.17	0.16	0.14

¹ SPF-1: SPF with fixed waveform 1, ² SPF-2: SPF with fixed waveform 2, ³ CRPF-1: CRPF with fixed waveform 1, ⁴ CRPF-2: CRPF with fixed waveform 2.

Table 2. Performance comparison of algorithms.

ARMSE	SPF	CRPF	CCRPF
$k \leq 30$	0.51	0.58	0.23
$k > 30$	1.49	0.20	0.19

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhong, L.; Li, Y.; Cheng, W.; Zheng, Y. Cost-Reference Particle Filter for Cognitive Radar Tracking Systems with Unknown Statistics. Sensors 2020, 20, 3669. https://doi.org/10.3390/s20133669

AMA Style

Zhong L, Li Y, Cheng W, Zheng Y. Cost-Reference Particle Filter for Cognitive Radar Tracking Systems with Unknown Statistics. Sensors. 2020; 20(13):3669. https://doi.org/10.3390/s20133669

Chicago/Turabian Style

Zhong, Lei, Yong Li, Wei Cheng, and Yi Zheng. 2020. "Cost-Reference Particle Filter for Cognitive Radar Tracking Systems with Unknown Statistics" Sensors 20, no. 13: 3669. https://doi.org/10.3390/s20133669

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Cost-Reference Particle Filter for Cognitive Radar Tracking Systems with Unknown Statistics

Abstract

1. Introduction

1.1. Problem Statement

1.2. Related Works

1.3. Contributions and Organization of the Paper

2. PF for Cognitive Radar Tracking

2.1. PF and Cognitive Radar Model

2.1.1. Standard PF

2.1.2. Cognitive Radar System Model

2.2. PF-Based Cognitive Tracking Algorithm

2.2.1. Bayesian Bounds for Cognitive Radar

2.2.2. Cost Function Design

2.2.3. Algorithm and Numerical Simulation

3. Cognitive Tracking Problem Model in Unknown Environment

3.1. Cost-Reference Particle Filtering Approach

3.2. Cognitive Cost-Reference Particle Filtering

3.2.1. Cost Function Design

3.2.2. Sequential Algorithm and Design Issue

3.3. Convergence of CCRPF Algorithm

4. Numerical Results and Discussion

4.1. Dynamic Model

4.2. Selection of Function μ and Parameters δ

4.3. Simulation Results

4.3.1. Scenario 1: Unknown Statistics in General Environment

4.3.2. Scenario 2: Unknown Statistics in Dynamic Noise Environment

5. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

Abbreviations

Appendix A

Appendix B

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI