UM

Browse/Search Results:  1-4 of 4 Help

Selected(0)Clear Items/Page:    Sort:
Model-Free λ-Policy Iteration for Discrete-Time Linear Quadratic Regulation Journal article
Yang, Yongliang, Kiumarsi, Bahare, Modares, Hamidreza, Xu, Chengzhong. Model-Free λ-Policy Iteration for Discrete-Time Linear Quadratic Regulation[J]. IEEE Transactions on Neural Networks and Learning Systems, 2023, 34(2), 635-649.
Authors:  Yang, Yongliang;  Kiumarsi, Bahare;  Modares, Hamidreza;  Xu, Chengzhong
Favorite | TC[WOS]:117 TC[Scopus]:113  IF:10.2/10.4 | Submit date:2023/03/06
Algebraic Riccati Equation (Are)  Fixed-point Theory  Off-policy Reinforcement Learning (Rl)  Optimal Control  
Effective Multiplatform Advertising Policy Journal article
Huang, Kaifan, Yang, Lu Xing, Yang, Xiaofan, Tang, Yuan Yan. Effective Multiplatform Advertising Policy[J]. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2022, 52(7), 4483-4493.
Authors:  Huang, Kaifan;  Yang, Lu Xing;  Yang, Xiaofan;  Tang, Yuan Yan
Favorite | TC[WOS]:5 TC[Scopus]:5  IF:8.6/8.7 | Submit date:2022/05/13
Adaptation Models  Advertising  Expected Benefit  Marketing  Mathematical Model  Media  Multiplatform Advertising (Mpa) Policy  Numerical Models  Optimal Control  Optimal Control  Optimality System  Upper Bound  Word-of-mouth (Wom) Propagation.  
Risk-sensitive finite-horizon piecewise deterministic Markov decision processes Journal article
Yonghui Huang, Zhaotong Lian, Xianping Guo. Risk-sensitive finite-horizon piecewise deterministic Markov decision processes[J]. OPERATIONS RESEARCH LETTERS, 2020, 48(1), 96-103.
Authors:  Yonghui Huang;  Zhaotong Lian;  Xianping Guo
Favorite | TC[WOS]:7 TC[Scopus]:8  IF:0.8/1.1 | Submit date:2019/11/05
Piecewise Deterministic Markov Decision Processes  Unbounded Transition Rates  Hjb Equation  Optimal Policy  Risk Sensitive  Finite Horizon  
Learning-based web service composition in uncertain environments Journal article
Yu L., Wang Z., Meng L., Qiu X., Zhou J.. Learning-based web service composition in uncertain environments[J]. Journal of Web Engineering, 2014, 13(5-6), 450-468.
Authors:  Yu L.;  Wang Z.;  Meng L.;  Qiu X.;  Zhou J.
Favorite |  | Submit date:2018/12/22
Optimal policy  Partially observable markov decision process  Reinforcement learning algorithm  Success rate of service composition  Web service composition