Sparse online kernelized actor-critic Learning in reproducing kernel Hilbert space

doi:10.1007/s10462-021-10045-9

UM > THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU)

Residential College	false
Status	已發表Published
	Sparse online kernelized actor-critic Learning in reproducing kernel Hilbert space
	Yang, Yongliang 1; Zhu, Hufei 1; Zhang, Qichao 2,3; Zhao, Bo 4; Li, Zhenning5 ; Wunsch, Donald C.6
	2021-08-07
Source Publication	Artificial Intelligence Review
ISSN	0269-2821
Volume	55 Pages:23-58
Abstract	In this paper, we develop a novel non-parametric online actor-critic reinforcement learning (RL) algorithm to solve optimal regulation problems for a class of continuous-time affine nonlinear dynamical systems. To deal with the value function approximation (VFA) with inherent nonlinear and unknown structure, a reproducing kernel Hilbert space (RKHS)-based kernelized method is designed through online sparsification, where the dictionary size is fixed and consists of updated elements. In addition, the linear independence check condition, i.e., an online criteria, is designed to determine whether the online data should be inserted into the dictionary. The RHKS-based kernelized VFA has a variable structure in accordance with the online data collection, which is different from classical parametric VFA methods with a fixed structure. Furthermore, we develop a sparse online kernelized actor-critic learning RL method to learn the unknown optimal value function and the optimal control policy in an adaptive fashion. The convergence of the presented kernelized actor-critic learning method to the optimum is provided. The boundedness of the closed-loop signals during the online learning phase can be guaranteed. Finally, a simulation example is conducted to demonstrate the effectiveness of the presented kernelized actor-critic learning algorithm.
Keyword	Actor-critic Learning Non-parametric Learning Online Sparsification Reproducing Kernel Hilbert Space Value Function Approximation
DOI	10.1007/s10462-021-10045-9
URL	View the original
Indexed By	SCIE
Language	英語English
WOS Research Area	Computer Science
WOS Subject	Computer Science, Artificial Intelligence
WOS ID	WOS:000682662600001
Publisher	SPRINGER, VAN GODEWIJCKSTRAAT 30, 3311 GZ DORDRECHT, NETHERLANDS
Scopus ID	2-s2.0-85112674869
Fulltext Access	View Full-Text via DOI View Full-Text via Web of Science View Full-Text via Scopus
Citation statistics
Document Type	Journal article
Collection	THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU)
Corresponding Author	Zhao, Bo
Affiliation	1.School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing, 100083, China 2.State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing, 100190, China 3.University of Chinese Academy of Sciences, Beijing, China 4.School of Systems Science, Beijing Normal University, Beijing, 100875, China 5.State Key Laboratory of Internet of Things for Smart City, University of Macau, Taipa, 59193, Macao 6.Department of Electrical and Computer Engineering, Missouri University of Science and Technology, Rolla, 65401, United States
Recommended Citation GB/T 7714	Yang, Yongliang,Zhu, Hufei,Zhang, Qichao,et al. Sparse online kernelized actor-critic Learning in reproducing kernel Hilbert space[J]. Artificial Intelligence Review, 2021, 55, 23-58.
APA	Yang, Yongliang., Zhu, Hufei., Zhang, Qichao., Zhao, Bo., Li, Zhenning., & Wunsch, Donald C. (2021). Sparse online kernelized actor-critic Learning in reproducing kernel Hilbert space. Artificial Intelligence Review, 55, 23-58.
MLA	Yang, Yongliang,et al."Sparse online kernelized actor-critic Learning in reproducing kernel Hilbert space".Artificial Intelligence Review 55(2021):23-58.

Files in This Item:
There are no files associated with this item.

If you have any objections to this item, please fill out the form below and the administrator will contact you as soon as possible.
Content:
Email：	*
Affiliation No.
Verification Code:	Refresh

Any comments and suggestions are welcomed.
Title:	*
Content:
Email：	*
Verification Code:	Refresh