Abstract
Based on the idea of data-driven control, a novel iterative adaptive dynamic programming (ADP) algorithm based on the globalized dual heuristic programming (GDHP) technique is used to solve the optimal control problem of discrete-time nonlinear switched systems. In order to solve the Hamilton–Jacobi–Bellman (HJB) equation of switched systems, the iterative ADP method is proposed and the strict convergence analysis is also provided. Three neural networks are constructed to implement the iterative ADP algorithm, where a novel model network is designed to identify the system dynamics, a critic network is used to approximate the cost function and its partial derivatives, and an action network is provided to obtain the approximate optimal control law. Two simulation examples are described to illustrate the effectiveness of the proposed method by comparing with the heuristic dynamic programming (HDP) and dual heuristic programming (DHP) methods.
Similar content being viewed by others
References
Tanwani A, Shim H, Liberzon D (2013) Observability for switched linear systems: characterization and observer design. IEEE Trans Autom Control 58(4):891–904
Sun K, Mou S, Qiu J (2018) Adaptive fuzzy control for nontriangular structural stochastic switched nonlinear systems with full state constraints. IEEE Trans Fuzzy Syst 27(8):1587–1601
Wang Y, Zhao J, Jiang B (2013) Stabilization of a class of switched linear neutral systems under asynchronous switching. IEEE Trans Autom Control 58(8):2114–2119
Krishnasamy R, Balasubramaniam P (2015) A descriptor system approach to the delay-dependent exponential stability analysis for switched neutral systems with nonlinear perturbations. Nonlinear Anal Hybrid Syst 15:23–36
Liu J, Zhang K, Sun C (2016) Robust optimal control of switched autonomous systems. IMA J Math Control Inf 2016:1379–1385
Leonessa A, Haddad WM, Chellaboina V (1998) Nonlinear system stabilization via stability-based switching. IEEE Conf Decis Control 3:2983–2996
Brooks ML (1992) Lyapunov stability designs for switching regulator/low-pass filter system. IEEE Trans Aerosp Electron Syst 28(3):881–884
Li X, Xue L, Sun C (2018) Linear quadratic tracking control of unknown discrete-time systems using value iteration algorithm. Neurocomputing 314:86–93
Szab Z, Bokor J, Balas G (2009) Controllability and stabilizability of linear switched systems. In: European control conference, pp 3245–3250
Carla S, Daniele C, Alessandro G, Alberto B (2006) Optimal control of continuous-time switched affine systems. IEEE Trans Autom Control 51(5):726–741
Xu X, Antsaklis PJ (2004) Optimal control of switched systems based on parameterization of the switching instants. IEEE Trans Autom Control 49(1):2–16
Liu C, Gong Z (2014) Optimal control of switched autonomous systems. In: Optimal control of switched systems arising in fermentation processes, pp 77–87
Sebastian S (2006) Hybrid methods in multi-criteria dynamic programming. Appl Math Comput 180:38–45
Effati S, Roohparvar H (2006) Iterative dynamic programming for solving linear and nonlinear differential equations. Appl Math Comput 175:247–257
Lincoln B, Rantzer A (2006) Relaxing dynamic programming. IEEE Trans Autom Control 51(8):1249–1260
Perves DC (2007) Approximate dynamic programming: solving the curses of dimensionality. Optim Methods Softw 24:155–155
Murray JJ, Cox CJ, Lendaris GG, Saeks R (2002) Adaptive dynamic programming. IEEE Trans Syst 32(2):140–153
Zhang H, Luo Y, Liu D (2009) Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints. IEEE Trans Neural Netw 20(9):1490–1503
Al-Tamimi A, Lewis FL, Abu-Khalaf M (2007) Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control. Automatica 43:473–481
Werbos PJ (1992) Approximate dynamic programming for real-time control and neural modeling. Handbook of Intelligent Control
Song R, Yang D, Zhang H (2010) Near-optimal control laws based on Heuristic Dynamic Programming iteration algorithm. In: International conference on networking, pp 261–266
Zhong X, He H (2018) GrHDP solution for optimal consensus control of multiagent discrete-time systems. IEEE Trans Syst Man Cybern Syst 2018:1–13
Li H, Wen Y, Sun W (2017) Optimal control of molecular distillation based on dual heuristic dynamic programming. In: IEEE advanced information technology, electronic and automation control conference, pp 1011–1016
Mu C, Wang D, He H (2017) Novel iterative neural dynamic programming for data-based approximate optimal control design. Automatica 81:240–252
Luo Y, Liang M (2011) Approximate optimal tracking control for a class of discrete-time non-affine systems based on GDHP algorithm. In: The fourth international workshop on advanced computational intelligence, pp 143–149
Zhong X, Zhen N, He H (2017) Gr-GDHP: a new architecture for globalized dual heuristic dynamic programming. IEEE Trans Cybern 47(10):3318–3330
Fan QY, Yang GH (2017) Adaptive nearly optimal control for a class of continuous-time nonaffine nonlinear systems with inequality constraints. ISA Trans 66:122–133
Mu C, Zhang Y (2020) Learning-based robust tracking control of quadrotor with time-varying and coupling uncertainties. IEEE Trans Neural Netw Learn Syst 31(1):259–273
Sun K, Qiu J, Karimi H (2019) A novel finite-time control for nonstrict feedback saturated nonlinear systems with tracking error constraint. IEEE Trans Syst Man Cybern Syst. https://doi.org/10.1109/TSMC.2019.2958072
Lewis F, Vrabie D, Syrmos V (2012) Optimal control. Wiley, Hoboken
Xu X, Antsaklis PJ (2003) Results and perspectives on computational methods for optimal control of switched systems. Hybrid Syst Comput Control 2632:540–555
Lewis FL, Syrmos V (1995) Optimal control. Wiley, New York
Al-Tamimi A, Lewis FL, Abu-Khalaf M (2008) Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof. IEEE Trans Syst 38(4):943–949
Zhang H, Qin C, Luo Y (2014) Neural-network-based constrained optimal control scheme for discrete-time switched nonlinear system using dual heuristic programming. IEEE Trans Autom Sci Eng 11(3):839–849
Fan G, Liu Z, Chen H (2012) Controller and observer design for a class of discrete-time nonlinear switching systems. Int J Control Autom Syst 10:1193–1203
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This work was supported in part by the National Natural Science Foundation of China under Grants 61773284 and 61873181.
Rights and permissions
About this article
Cite this article
Mu, C., Liao, K., Ren, L. et al. Approximately Optimal Control of Discrete-Time Nonlinear Switched Systems Using Globalized Dual Heuristic Programming. Neural Process Lett 52, 1089–1108 (2020). https://doi.org/10.1007/s11063-020-10278-9
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11063-020-10278-9