Residential College | false |
Status | 已發表Published |
Hamiltonian-Driven Adaptive Dynamic Programming With Approximation Errors | |
Yang, Yongliang1; Modares, Hamidreza2; Vamvoudakis, Kyriakos G.3; He, Wei1; Xu, Cheng Zhong4; Wunsch, Donald C.5 | |
2021-09-08 | |
Source Publication | IEEE Transactions on Cybernetics |
ABS Journal Level | 3 |
ISSN | 2168-2267 |
Volume | 52Issue:12Pages:13762-13773 |
Abstract | In this article, we consider an iterative adaptive dynamic programming (ADP) algorithm within the Hamiltonian-driven framework to solve the Hamilton-Jacobi-Bellman (HJB) equation for the infinite-horizon optimal control problem in continuous time for nonlinear systems. First, a novel function, ``min-Hamiltonian,'' is defined to capture the fundamental properties of the classical Hamiltonian. It is shown that both the HJB equation and the policy iteration (PI) algorithm can be formulated in terms of the min-Hamiltonian within the Hamiltonian-driven framework. Moreover, we develop an iterative ADP algorithm that takes into consideration the approximation errors during the policy evaluation step. We then derive a sufficient condition on the iterative value gradient to guarantee closed-loop stability of the equilibrium point as well as convergence to the optimal value. A model-free extension based on an off-policy reinforcement learning (RL) technique is also provided. Finally, numerical results illustrate the efficacy of the proposed framework. |
Keyword | Hamilton-jacobi-bellman (Hjb) Equation Hamiltonian-driven Framework Inexact Adaptive Dynamic Programming (Adp) Optimal Control |
DOI | 10.1109/TCYB.2021.3108034 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
Funding Project | Research on Key Technologies and Platforms for Collaborative Intelligence Driven Auto-driving Cars ; Efficient Integration and Dynamic Cognitive Technology and Platform for Urban Public Services |
WOS Research Area | Automation & Control Systems ; Computer Science |
WOS Subject | Automation & Control Systems ; Computer Science, Artificial Intelligence ; Computer Science, Cybernetics |
WOS ID | WOS:000732870400001 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Scopus ID | 2-s2.0-85114716935 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE Faculty of Science and Technology THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) INSTITUTE OF COLLABORATIVE INNOVATION |
Corresponding Author | Xu, Cheng Zhong |
Affiliation | 1.School of Automation and Electrical Engineering and the Institute of Artificial Intelligence, University of Science and Technology Beijing, Beijing 100083, China. 2.Mechanical Engineering Department, Michigan State University, East Lansing, MI 48824 USA. 3.Daniel Guggenheim School of Aerospace Engineering, Georgia Tech, Atlanta, GA 30332 USA. 4.State Key Laboratory of Internet of Things for Smart City, Faculty of Science and Technology, University of Macau, Macau, China (e-mail: [email protected]) 5.Department of Electrical and Computer Engineering, Missouri University of Science and Technology, Rolla, MO 65401 USA. |
Corresponding Author Affilication | Faculty of Science and Technology |
Recommended Citation GB/T 7714 | Yang, Yongliang,Modares, Hamidreza,Vamvoudakis, Kyriakos G.,et al. Hamiltonian-Driven Adaptive Dynamic Programming With Approximation Errors[J]. IEEE Transactions on Cybernetics, 2021, 52(12), 13762-13773. |
APA | Yang, Yongliang., Modares, Hamidreza., Vamvoudakis, Kyriakos G.., He, Wei., Xu, Cheng Zhong., & Wunsch, Donald C. (2021). Hamiltonian-Driven Adaptive Dynamic Programming With Approximation Errors. IEEE Transactions on Cybernetics, 52(12), 13762-13773. |
MLA | Yang, Yongliang,et al."Hamiltonian-Driven Adaptive Dynamic Programming With Approximation Errors".IEEE Transactions on Cybernetics 52.12(2021):13762-13773. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment