Abstract

The weapon target assignment (WTA) is a classical problem of defense-related applications which is proved to be a NP-complete problem. In this paper, a practical and available dynamic weapon target assignment (DWTA) formulation is given which incorporates two meaningful and conflicting objectives, that is, minimizing weapon costs and maximizing combat benefits. As we know, heuristic methods have some shortcomings such as slow convergence speed and local optimum in solving the nonlinear integer optimization problem. To this end, a novel DWTA algorithm based on cross-entropy (CE) method is introduced, where the resources requirement condition for targets is taken into consideration. The CE method associates an estimation problem with the DWTA optimization problem, and then, the estimation problem is transformed into a convex optimization problem. The Karush–Kuhn–Tucker conditions are applied to solve the convex optimization problem, and the iteration formulas to find the optimal solution are deducted. Furthermore, in order to verify the performance of CE method in dealing with the DWTA problem, several simulations in different combat scenarios are implemented. The results reveal that, compared with the benchmark heuristic and Monte-Carlo (MC) methods, there are some notable advantages in solving the DWTA problem based on CE method with regard to the solution quality and time consumption.

1. Introduction

Weapon target assignment (WTA) problem is the core content in the research of combat command aided decision-making, which can be categorized as a combinatorial optimization problem and nondeterministic polynomial complete problem [1], whose solution space expands exponentially with the increasing of the number of weapons and targets. Generally, WTA problem can be divided into static weapon target assignment (SWTA) problem [2, 3] and dynamic weapon target assignment (DWTA) problem [46]. In SWTA, all weapons are assigned to targets simultaneously and all information is known; however, in DWTA, many dynamic changes such as time window and weapon consumption should be considered; in that case, the solving algorithm for DWTA problem must have the good real-time performance. Furthermore, some researchers pay much attention to sensor weapon target assignment problem [79], where small and sensitive sensors loaded by weapons play a critical role to enhance the combat effectiveness, and in order to fit the combat scene pertinently, multiobjective optimization problem [10, 11] is researched to depict the combat purpose in different perspectives.

Concretely, Kline et al. [2] proposed a nonlinear branch and bound algorithm to solve the SWTA problem which sought to minimize the residual value of each target. Zhou et al. [3] proposed a discrete particle swarm optimization (PSO) algorithm to maximize the ratio of global operation effectiveness to the costs of consumed weapons. Lai and Wu [4] proposed an improved simplified swarm optimization method with two novel schemes to minimize the threat value in the multistage WTA. Chang et al. [5] proposed an artificial bee colony algorithm with heuristic factor initialization to get the minimal expected value of surviving targets for all stages. Mu et al. [7] built a sensor WTA model with the probability of detection and killing, and then multiscale quantum harmonic oscillator algorithm was applied to solve the optimization problem, and Chen et al. [8] used the particle swarm optimization algorithm to solve the sensor WTA problem while Xin et al. [9] proposed a marginal-return-based constructive heuristic method to deal with and solve sensor WTA problem. Juan et al. [12] built a multiobjective WTA formulation and NSGAII algorithm was applied to solve the problem, and Li et al. [10] compared the adaptive NSGAII algorithm and adaptive MOEA/D algorithm and gave a comparison study. Gao et al. [11] proposed the D-NSGA-III-A algorithm with the adaptive operator selection mechanism. Both NSGAII and D-NSGA-III-A are extensions of the GA algorithm.

With the development of computer performance, numerous methods are developed to solve the WTA problems, such as branch and bound [2], PSO [3, 8, 13], artificial bee colony algorithm [5], genetic algorithm [1012], large-scale neighborhood algorithm [14, 15], and ant colony algorithm [16, 17]. Some new methods are also applied to deal with the WTA problem such as simplified swarm optimization [4], multiscale quantum harmonic oscillator algorithm [7], marginal-return-based constructive heuristic [9], and Markov decision model [18]. Some scholars are also interested in hybrid algorithms [1921]. Although the application of these methods provides a variety of ideas for solving WTA problems, it is necessary to further expand the optimization method for the realistic and dynamic combat scenarios. Rubinstein [22] indicated that it is feasible and effective to utilize CE method to estimate the probability of rare event and solve complicated nonlinear combination optimization problems. As a simple but powerful tool, CE method is available for numerous practical applications, such as combination optimization problems [23], deep learning [24], and multi-UAVs task allocation [25, 26].

As an efficient deterministic optimization method, branch and bound method is involved to solve nonlinear optimization problem [2, 25]. Le Thi et al. [25] compared the performance of CE method and branch and bound method in terms of solving the nonlinear UAV task assignment problem. To evaluate the effectiveness of CE method, Huang et al. [26] applied it in various test problems and compared it with the exhaust search method. The comparable research, among CE method and other benchmark heuristic methods, is insufficient; in this paper, in order to verify the performance of solving the DWTA problem based on CE method, two benchmark heuristic algorithms (PSO and GA) and Monte-Carlo algorithm are compared with the CE method.

The aim of this paper is to reveal the detail process of applying cross-entropy to solve the discrete nonlinear DWTA problem under the multiple resources requirement condition. The contribution of this paper is as follows:(i)A novel DWTA formulation with two significant and conflicting objectives is constructed, which minimize the weapon costs and maximize the combat benefits. Multiple resources requirement conditions changeable are taken into consideration which limit the feasibility of weapon in multistage combat. The kill probability is randomly selected between zero and one which is distinct from those models with an assumption that all firepower units have the same kill probability for different types of targets [15]. With these considerations, the DWTA formulation given in this paper is more practical and realistic.(ii)The idea of estimating rare event probability with CE is extended to the solution of discrete nonlinear DWTA problem. To be specific, the DWTA problem depicted in this paper is transformed into an estimation problem, and then it is subsequently transformed into a convex optimization problem, detailed iteration formulas are deducted in the following text, and then the algorithm of solving DWTA problem based on CE method is acquired.(iii)24 DWTA problems of different scales are simulated to cover the different combat scenarios; furthermore, the benchmark heuristic and Monte-Carlo methods are implemented to verify the performance of CE method.

The rest of the paper is structured as follows: in Section 2, a DWTA formulation based on weapon costs and combat benefits under multiple resources requirement condition is presented. Furthermore, in Section 3 we apply CE method to solve the DWTA problem and give the elaborate deduction procedure based on Lagrange multiplier method and KKT conditions. Section 4 employs several combat scenarios to verify the performance of CE method compared with benchmark heuristic and MC methods. Section 5 summarizes the research contents of this paper and prospects the follow-up research work.

2. DWTA Formulation

2.1. Combat Scenario

The purpose of WTA problem is to get maximal combat benefits by utilizing weapons to intercept targets under certain or uncertain combat environment. In this paper, we present a DWTA formulation based on consumed weapon costs and combat benefits incorporating multiple resources requirement condition, and a typical scenario of the weapons against the targets is shown in Figure 1. We assume that there are n targets and m types of weapon, and each target has different combat ability value and resources requirement in the combat scenario; at the same time, the resources contained by each weapon and the inventory quantity of each weapon are taken into consideration. The solution of the DWTA problem is based on the following assumptions.Assumption 1. The kill probability of using the same weapon to intercept different target is different and the kill probability of using different weapon to intercept the same target is different. The value of kill probability is randomly selected.Assumption 2. The weapon satisfied with the resources requirement condition of target can be used to interception task. As depicted in Figure 1, weapon 1 can intercept target 1; however, weapon 2, which does not satisfy the resources condition of target 1, is limited to intercept target 1.Assumption 3. Every type of weapon can be launched once simultaneously, so the objective of WTA is to assign m weapons to n targets in every stage, and then the evaluation of the combat result is implemented, which allows us to decide whether to go on the next weapon target assignment. Actually, this DWTA model belongs to shoot-look-shoot variant.Assumption 4. The evaluation process is deterministic as long as the solution of WTA is acquired, which allows us to decide the next procedure according to the cessation conditions. The detail of cessation conditions will be described later.

2.2. Design of DWTA Formulation

After the description of the combat scenario in Figure 1, the detailed DWTA formulation will be introduced in this subsection, cost function constituted by consumed weapons and benefit function described by the combat ability value in each combat stage are defined at first, and the DWTA formulation is built based on a tradeoff between cost function and benefit function. Some notations used in this paper are described in Table 1.

In order to describe the cost of consumed weapons clearly, is the auxiliary function of decision variable we defined, which depicts whether to assignweapon i to target j at stage t, the specific definition is as follows:

Furthermore, we define the auxiliary function of combat feasibility which is utilized to decide whether the weapon i could intercept the target j:where is the number of resource k contained by weapon i at stage t, is the number of resource k required by target j at stage t, and both and are nonnegative integers. Weapon i can be used to intercept target j if and only if the available resource number of the weapon i meets the requirement condition of target j.

Consequently, we get the cost function at stage t.

After the t stage, the benefit function is equal to the sum of the combat ability value of all targets.

Hence, we have built the cost function and the benefit function, weight strategy is adopted to balance the two conflicting objectives, and the DWTA objective function at stage t is acquired as follows:where , and are the weight parameters. Let , which means destroying targets regardless of cost, and let , which means destroying targets at the lowest cost.

The constraint conditions of the WTA problem are depicted in the following formulas:

Constraint (6) depicts that each weapon must be assigned to one of the targets at stage t. Constraint (7) depicts that it is necessary to stockpile enough weapons to complete weapon target assignment. Constraint (8) limits the range of ; that is, is a positive integer and belongs to the set J. There are two noticeable situations as follows:

Situation 1. A certain type of weapon is exhausted.

Situation 2. A certain type of weapon does not satisfy the resource requirement condition of any target.
In these two conditions which mean that this weapon cannot intercept any target, in order to deal with the two situations, we set up a new hypothetical target named target n + 1 and get a new target set , we can assign this type of weapon to target n + 1, and we set . It is an excellent technique to deal with these two situations. Consequently, we can summarize that the DWTA formulation at stage t

3. Algorithm Design Based on Cross-Entropy

3.1. The Cross-Entropy Method

Originally, CE method is based on importance sampling and used to estimate the probability of rare event, and Rubinstein [22] put forward an optimization method based on the idea of CE, which could solve the discrete multiextreme value problems. Consider the following general minimization problem: let S be a real-valued function on , where is a finite set of states. Our goal is to find the minimum of S over , which is denoted as , and is the corresponding state.

The basic idea of CE method applied to formula (10) is to associate an estimation problem with the optimization problem. In that case, the optimization problem is converted into an estimation problem with probability density function (pdf) .where is a value close to , is the probability measure, denotes the corresponding expectation operator, and is the indicator function.

A better way to estimate l is to use another pdf , and we can represent l as in the following formula:

An unbiased estimator of l is depicted in the following formula:where is generated from pdf .

It is well known that the best way to estimate l is to use the change of measure with the pdf :

In order to choose the pdf from the family of pdf , the idea is to choose the reference parameter such that the distance between the densities and is minimal. Minimizing the Kullback–Leibler distance between the densities and is equivalent to the following formula:

It is obvious that minimizing problem (16) is equivalent to solving the minimization problem.

Finally, formula (17) is converted into the following formula:where is the random sample from . In typical applications, the function in formula (16) is convex and differentiable with respect to [27]. Thus, we can get the solution of (18) from the following equation:where the gradient is with respect to .

The idea of CE method is to construct simultaneously two sequences of levels and parameters , where is close to the optimal and is the corresponding parameter of pdf . In other words, each iteration of the CE method has two main phases. In the first phase, is updated, and in the second is updated. To be specific, starting , we can obtain the two sequences as follows:Phase 1: adaptive updating of For a fixed , let be a -quantile of under ; that is, satisfieswhere is generated from .A simple estimator of can be obtained by drawing a random sample from , calculating the performances for all I, ordering them from the smallest to the biggest, , and evaluating the -quantile asPhase 2: adaptive updating of For fixed and , derive from the solution of (19), to be specific

3.2. Solving DWTA Problem Based on CE Method

In this paper, our purpose is to apply CE method to solve the DWTA problem modeled above, and interested readers can get detailed theory of CE method in [22, 28]. In order to solve formulation (9) at stage t, we construct the following discrete Probability Distribution Matrix (PDM) :where the element in PDM denotes the probability of allocating weapon i to target j at stage t, and if every element belongs to , we can get the final solution according to the PDM. Furthermore, the PDM satisfies the following constraint condition:where the set because we use target n + 1 to deal with Situation 1 and Situation 2, and it is obvious that constraint condition (6) is equivalent to constraint condition (23). It is worth noting that initializing the PDM at stage t according to the following formula can assure that sample generated from PDM meets the resources requirement:

According to the PDM at stage t, we can get the probability distribution function :where is the element of column in line i of PDM .

Furthermore, we can get the following probability distribution function:where formulas (25) and (26) are equivalent, and the reader can refer to Appendix for details.

In each iteration process of CE method, are N samples generated from the PDM , and we can compute the objective function values , and then we sort the N objective function values in ascending order, and we can get the sequence of objective function values and the sequence of samples ; let , and is the symbol of rounding down, which is the -quantile. We select H samples corresponding to the H minimum objective function values; that is, , to update the PDM . We can get the following optimization problem according to formula (18):

According to formula (26), we can simplify formula (27):

In order to simplify the description, let , , and then optimization problem (27) can be rewritten as follows:

It is obvious that problem (29) belongs to a convex optimization problem, and we can construct the Lagrange function as follows:

According to Karush–Kuhn–Tucker conditions, we can get the following formula:

and the following formula is derived according to the upper formula (31):

Then, the element in PDM can be updated according to the following formula:

Before the cessation conditions of the DWTA are analyzed, we give the updated formula of the remaining weapon inventory and the remaining combat ability value after the t stage.where .

The cessation conditions of the DWTA include two points:Cessation condition 1: all available weapons are exhaustedCessation condition 2: the combat purpose is implemented, which means the percentage of the remaining combat ability value is less than the preselected percentage

Consequently, we can get the DWTA algorithm based on CE method as follows (Algorithm 1).

Input:
Output:
(1)Let t = 1, begin the dynamic weapon target assignment.
(2)Compute the initial PDM according to formula (24) and let tCE = 0.
(3)Generate N samples according to PDM .
(4)Compute the objective function values .
(5)Sort the N objective function values in ascending order and select H samples corresponding to the H minimum objective function values.
(6)Update the PDM using the H samples according to formula (33).
(7)If all elements of PDM belong to {0, 1}, set , output the optimum X and S, else let , goto step 3 until the CE method achieves the convergence conditions.
(8)Update the remaining combat ability value of every target and the remaining weapon inventory .
(9)If any of the two cessation conditions is reached, let , output . Otherwise, let , go to step 2.
3.3. Complexity Analysis

Let N denote the sample size, denote the iteration number of CE method, m denote the number of weapon types, and T denote the final combat stage. The computational cost of CE method includes four parts: the initialization (), sample (), sort (), and update (); the computational complexity of CE method can be denoted as , to be specific, ; therefore, the time complexity of the CE method can be computed as , and the time complexity of the DWTA algorithm based on CE method can be denoted as . Let Num denote the population size of PSO and GA. Let D denote the iteration number of PSO and GA, and we can get the complexity of DWTA algorithm based on PSO ang GA denoted as .

4. DWTA Simulations

4.1. Parameter Setting for DWTA

The following contents in this subsection include three parts: parameters setting rules of combat scenario, parameters selection of algorithms, performance index design and parameters setting of combat scenario. The first part is the parameters setting rules of combat scenario which is depicted in Table 2. Both the number of weapon types (m) and the number of targets (n) are integers between 1 and 80, that is, . The combat ability value of every target at the first stage is a random number between 70 and 200, that is, ; it is worth noting that the combat ability value of the hypothetical target is equal to zero, that is, . The cost of every single weapon is a random number between 20 and 80, that is, . The kill probability of weapon i to target j is a random number between 0.5 and 1, that is, . is integer between 0 and 3. is integer between 0 and 5. The expected combat purpose is a random number between 0 and 1.

Subsequently, we give the scheme of parameter selection of CE, PSO, GA, and MC methods in Table 3. The CE method includes two parameters, sample size and quantile, and the main parameters of PSO and GA methods are population size Num and iteration number D, and we set sample size of MC method equal to 5e4 in the simulation.

The third part is performance index design depicted in Table 4 which measures the solution quality of different algorithms. The percentage between the sum of the cost of weapon consumed in all stages and the total value of weapons, denoted as PSCWC, is to depict combat costs in whole combat.

The index PRCAVT denotes the percentage between the remaining combat ability value of targets at T stage and the initial combat value of targets, which measures whether to achieve the expected combat purpose.

The index CI is the product of the index PRCAVT and the index PSCWC. The index avg-PSCWC is the average of PSCWC corresponding to the six DWTA problems. The index avg-PRCAVT is the average of PRCAVT corresponding to the six DWTA problems. The computation time (CT) is used to depict the solving speed of different algorithms.

Finally, we categorize the number of weapon stockpiles and weight parameters of DWTA model into the following four situations so as to cover different combat scenarios.

Scenario 1. Weapon stockpiles are sufficient, and the weapon costs are important the same as with the combat benefits.

Scenario 2. Weapon stockpiles are sufficient, and the combat benefits are more important than the weapon costs.

Scenario 3. Weapon stockpiles are not sufficient, and the weapon costs are important the same as with the combat benefits.

Scenario 4. Weapon stockpiles are not sufficient, and the combat benefits are more important than the weapon costs.It is worth noting that the same scale DWTA problems in Scenario 1 and Scenario 2 have the same parameters such as the inventory quantity of weapons and kill probability, and the same is true for Scenario 3 and Scenario 4. Besides, in order to describe the effectiveness of the DWTA model and the advantages of CE method in solving different scales of DWTA problems, we set up the following six scales DWTA problems. The choices of parameter m and parameter n are shown in Table 5.
We set the expected combat purpose in advance, which depicts that we stop the combat task when the index PRCAVT is less than ; note that another stop condition is that all available weapons are exhausted. We set the relevant parameters of CE, PSO, and GA methods as follows, and , , . In this paper, ten simulations of the same scale DWTA problem are carried out to calculate the statistical average of PSCWC, PRCAVT, and CT. All algorithms are implemented in an Intel Core I5 @ 3.06 GHz processor with 8 GB RAM.

4.2. Results and Analysis

The results of simulation are presented in Figures 24 and Tables 68. What needs to be pointed out is that the smaller the indexes PSCWC and avg-PSCWC are, the smaller the cost of weapons consumed is. The smaller the indexes PRCAVT and avg-PRCAVT are, the greater the combat benefits are. The smaller the index CI is, the better the solution quality is. The smaller the CT index is, the faster the solving speed is.

Figures 2(a), 3(a), and 4(a) depict the combat results in Scenario 1; the index PSCWC has the ascending sequence, that is, CE < GA < PSO < MC; furthermore, Figure 4(a) indicates that the value of index CI for CE method is the smallest among others, and from Tables 6 and 7, the indexes avg-PSCWC of CE, GA, PSO, and MC methods are 21.98%, 29.98%, 37.16%, and 44.96%, and the indexes avg-PRCAVT of CE, GA, PSO, and MC methods are 7.97%, 8.28%, 7.18%, and 6.85%, which indicates that the solution of CE method is more excellent than these solutions of GA, PSO, and MC methods; that is, the expected combat purpose is achieved and multitudes of weapon costs are saved. The same conclusion can be reached in Scenario 2.

As for Scenario 3, the CE, GA, PSO, and MC methods have nearly the same PSCWC index depicted in Figure 2(c), and from Figure 3(c), the PRCAVT index has the ascending sequence, that is, CE < GA < PSO < MC, and Figure 4(c) indicates that the value of index CI for CE method is the smallest among others; furthermore, from Tables 6 and 7, the indexes avg-PSCWC of CE, GA, PSO, and MC methods are 61.31%, 61.62%, 63.61%, and 64.46%, and the indexes avg-PRCAVT of CE, GA, PSO, and MC methods are 27.19%, 33.30%, 37.61%, and 42.82%. Consequently, compared with the GA, PSO, and MC methods, the assignment results of CE method are more effective in the condition that the number of weapons consumed is the same. According to the combat result of Scenario 3, we can get the conclusion that the defense can get better combat benefits when using CE method to complete the DWTA task under the condition that the weapons are inadequate. The same conclusion can be reached in Scenario 4.

Comparing Scenario 1 with Scenario 2 and comparing Scenario 3 with Scenario 4, the simulation results are shown in Figures 2 and 3. From the indexes PSCWC and PRCAVT, the defense gets more combat benefits in Scenario 2 and Scenario 4 but spends more weapon costs. We can get the conclusion that the defense can save weapon costs to some extent by adjusting weight parameters and in different scenarios.

The computation time of solving the different scales DWTA problem is recorded in the four combat scenarios, and the relevant results are shown in Table 8; it is obvious that the index CT presents ascending sequence, that is, , and the computation cost of CE method is one-order-of-magnitude lower than that of GA and PSO method; consequently, we could get that the CE method has great advantage in solving speed over GA, PSO, and MC methods.

5. Conclusion

In this paper, a novel DWTA model under the resource requirement condition which is shoot-look-shoot variant is constructed, where weight parameters are introduced to balance the weapon costs and combat benefits. Subsequently, a hypothetical target is constructed to deal with the resource requirement condition, a new dynamic weapon target assignment method based on CE is introduced, and the detailed derivation process of using CE method to solve DWTA problem is given. Finally, to evaluate the solution quality of CE method, the dynamic weapon target assignment task based on CE, GA, PSO, and MC methods is simulated in several combat scenarios. Three conclusions are summarized based on the simulation results. Firstly, the expected combat purpose is achieved and a good supply of weapon costs are saved by dealing with the DWTA problem based on CE method under the condition that weapon inventory is adequate. Secondly, more combat benefits are acquired by dealing with the DWTA problem based on CE method under the condition that weapon inventory is inadequate. Thirdly, the solving speed of CE method is obviously faster than GA, PSO, and MC methods which more fit the real-time combat scenario. Besides, another strong point of the DWTA formulation is that adjusting weight parameters in different scenarios can save weapon costs to some extent.

For future research, we will focus on the promotion of CE method in its efficiency and time consumption by sample selection and appropriate parameter settings.

Appendix

Proof: Formulas (25) and (26) are equivalent.

Let , so .

Because of , we can get and .

Then, we can get the following formula: .

Finally, we can get

Consequently, formulas (25) and (26) are equivalent.

Data Availability

The simulation data used to support the findings of this study are included within the article.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This paper was funded by Science and Technology on Space Intelligent Control Laboratory for National Defense.