Residential Collegefalse
Status已發表Published
Sparse online kernelized actor-critic Learning in reproducing kernel Hilbert space
Yang, Yongliang1; Zhu, Hufei1; Zhang, Qichao2,3; Zhao, Bo4; Li, Zhenning5; Wunsch, Donald C.6
2021-08-07
Source PublicationArtificial Intelligence Review
ISSN0269-2821
Volume55Pages:23-58
Abstract

In this paper, we develop a novel non-parametric online actor-critic reinforcement learning (RL) algorithm to solve optimal regulation problems for a class of continuous-time affine nonlinear dynamical systems. To deal with the value function approximation (VFA) with inherent nonlinear and unknown structure, a reproducing kernel Hilbert space (RKHS)-based kernelized method is designed through online sparsification, where the dictionary size is fixed and consists of updated elements. In addition, the linear independence check condition, i.e., an online criteria, is designed to determine whether the online data should be inserted into the dictionary. The RHKS-based kernelized VFA has a variable structure in accordance with the online data collection, which is different from classical parametric VFA methods with a fixed structure. Furthermore, we develop a sparse online kernelized actor-critic learning RL method to learn the unknown optimal value function and the optimal control policy in an adaptive fashion. The convergence of the presented kernelized actor-critic learning method to the optimum is provided. The boundedness of the closed-loop signals during the online learning phase can be guaranteed. Finally, a simulation example is conducted to demonstrate the effectiveness of the presented kernelized actor-critic learning algorithm.

KeywordActor-critic Learning Non-parametric Learning Online Sparsification Reproducing Kernel Hilbert Space Value Function Approximation
DOI10.1007/s10462-021-10045-9
URLView the original
Indexed BySCIE
Language英語English
WOS Research AreaComputer Science
WOS SubjectComputer Science, Artificial Intelligence
WOS IDWOS:000682662600001
PublisherSPRINGER, VAN GODEWIJCKSTRAAT 30, 3311 GZ DORDRECHT, NETHERLANDS
Scopus ID2-s2.0-85112674869
Fulltext Access
Citation statistics
Document TypeJournal article
CollectionTHE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU)
Corresponding AuthorZhao, Bo
Affiliation1.School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing, 100083, China
2.State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing, 100190, China
3.University of Chinese Academy of Sciences, Beijing, China
4.School of Systems Science, Beijing Normal University, Beijing, 100875, China
5.State Key Laboratory of Internet of Things for Smart City, University of Macau, Taipa, 59193, Macao
6.Department of Electrical and Computer Engineering, Missouri University of Science and Technology, Rolla, 65401, United States
Recommended Citation
GB/T 7714
Yang, Yongliang,Zhu, Hufei,Zhang, Qichao,et al. Sparse online kernelized actor-critic Learning in reproducing kernel Hilbert space[J]. Artificial Intelligence Review, 2021, 55, 23-58.
APA Yang, Yongliang., Zhu, Hufei., Zhang, Qichao., Zhao, Bo., Li, Zhenning., & Wunsch, Donald C. (2021). Sparse online kernelized actor-critic Learning in reproducing kernel Hilbert space. Artificial Intelligence Review, 55, 23-58.
MLA Yang, Yongliang,et al."Sparse online kernelized actor-critic Learning in reproducing kernel Hilbert space".Artificial Intelligence Review 55(2021):23-58.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Yang, Yongliang]'s Articles
[Zhu, Hufei]'s Articles
[Zhang, Qichao]'s Articles
Baidu academic
Similar articles in Baidu academic
[Yang, Yongliang]'s Articles
[Zhu, Hufei]'s Articles
[Zhang, Qichao]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Yang, Yongliang]'s Articles
[Zhu, Hufei]'s Articles
[Zhang, Qichao]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.