Hamiltonian-Driven Adaptive Dynamic Programming With Approximation Errors

doi:10.1109/TCYB.2021.3108034

UM > Faculty of Science and Technology > DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE

Residential College	false
Status	已發表Published
	Hamiltonian-Driven Adaptive Dynamic Programming With Approximation Errors
	Yang, Yongliang 1; Modares, Hamidreza 2; Vamvoudakis, Kyriakos G.3; He, Wei 1; Xu, Cheng Zhong4 ; Wunsch, Donald C.5
	2021-09-08
Source Publication	IEEE Transactions on Cybernetics
ABS Journal Level	3
ISSN	2168-2267
Volume	52 Issue:12 Pages:13762-13773
Abstract	In this article, we consider an iterative adaptive dynamic programming (ADP) algorithm within the Hamiltonian-driven framework to solve the Hamilton-Jacobi-Bellman (HJB) equation for the infinite-horizon optimal control problem in continuous time for nonlinear systems. First, a novel function, ``min-Hamiltonian,'' is defined to capture the fundamental properties of the classical Hamiltonian. It is shown that both the HJB equation and the policy iteration (PI) algorithm can be formulated in terms of the min-Hamiltonian within the Hamiltonian-driven framework. Moreover, we develop an iterative ADP algorithm that takes into consideration the approximation errors during the policy evaluation step. We then derive a sufficient condition on the iterative value gradient to guarantee closed-loop stability of the equilibrium point as well as convergence to the optimal value. A model-free extension based on an off-policy reinforcement learning (RL) technique is also provided. Finally, numerical results illustrate the efficacy of the proposed framework.
Keyword	Hamilton-jacobi-bellman (Hjb) Equation Hamiltonian-driven Framework Inexact Adaptive Dynamic Programming (Adp) Optimal Control
DOI	10.1109/TCYB.2021.3108034
URL	View the original
Indexed By	SCIE
Language	英語English
Funding Project	Research on Key Technologies and Platforms for Collaborative Intelligence Driven Auto-driving Cars ; Efficient Integration and Dynamic Cognitive Technology and Platform for Urban Public Services
WOS Research Area	Automation & Control Systems ; Computer Science
WOS Subject	Automation & Control Systems ; Computer Science, Artificial Intelligence ; Computer Science, Cybernetics
WOS ID	WOS:000732870400001
Publisher	Institute of Electrical and Electronics Engineers Inc.
Scopus ID	2-s2.0-85114716935
Fulltext Access	View Full-Text via DOI View Full-Text via Web of Science View Full-Text via Scopus
Citation statistics
Document Type	Journal article
Collection	DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE Faculty of Science and Technology THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) INSTITUTE OF COLLABORATIVE INNOVATION
Corresponding Author	Xu, Cheng Zhong
Affiliation	1.School of Automation and Electrical Engineering and the Institute of Artificial Intelligence, University of Science and Technology Beijing, Beijing 100083, China. 2.Mechanical Engineering Department, Michigan State University, East Lansing, MI 48824 USA. 3.Daniel Guggenheim School of Aerospace Engineering, Georgia Tech, Atlanta, GA 30332 USA. 4.State Key Laboratory of Internet of Things for Smart City, Faculty of Science and Technology, University of Macau, Macau, China (e-mail: [email protected]) 5.Department of Electrical and Computer Engineering, Missouri University of Science and Technology, Rolla, MO 65401 USA.
Corresponding Author Affilication	Faculty of Science and Technology
Recommended Citation GB/T 7714	Yang, Yongliang,Modares, Hamidreza,Vamvoudakis, Kyriakos G.,et al. Hamiltonian-Driven Adaptive Dynamic Programming With Approximation Errors[J]. IEEE Transactions on Cybernetics, 2021, 52(12), 13762-13773.
APA	Yang, Yongliang., Modares, Hamidreza., Vamvoudakis, Kyriakos G.., He, Wei., Xu, Cheng Zhong., & Wunsch, Donald C. (2021). Hamiltonian-Driven Adaptive Dynamic Programming With Approximation Errors. IEEE Transactions on Cybernetics, 52(12), 13762-13773.
MLA	Yang, Yongliang,et al."Hamiltonian-Driven Adaptive Dynamic Programming With Approximation Errors".IEEE Transactions on Cybernetics 52.12(2021):13762-13773.

Files in This Item:
There are no files associated with this item.

If you have any objections to this item, please fill out the form below and the administrator will contact you as soon as possible.
Content:
Email：	*
Affiliation No.
Verification Code:	Refresh

Any comments and suggestions are welcomed.
Title:	*
Content:
Email：	*
Verification Code:	Refresh