Browse/Search Results: 1-2 of 2
A Deep Reinforcement Learning Recommender System With Multiple Policies for Recommendations
Journal article
Mingsheng Fu, Liwei Huang, Ananya Rao, Athirai A. Irissappane, Jie Zhang, Hong Qu. A Deep Reinforcement Learning Recommender System With Multiple Policies for Recommendations[J]. IEEE Transactions on Industrial Informatics, 2022, 19(2), 2049-2061.
Authors: Mingsheng Fu; Liwei Huang; Ananya Rao; Athirai A. Irissappane; Jie Zhang; et al.
TC[WOS]: 6 | TC[Scopus]: 7 | IF: 11.7 / 11.4 | Submit date: 2023/02/22
Keywords: Deep Reinforcement Learning (DRL); Multitask Markov Decision Process (MDP); Recommender System
Noise-Regularized Advantage Value for Multi-Agent Reinforcement Learning
Journal article
Wang, Siying, Chen, Wenyu, Hu, Jian, Hu, Siyue, Huang, Liwei. Noise-Regularized Advantage Value for Multi-Agent Reinforcement Learning[J]. Mathematics, 2022, 10(15), 2728.
Authors: Wang, Siying; Chen, Wenyu; Hu, Jian; Hu, Siyue; Huang, Liwei
TC[WOS]: 1 | TC[Scopus]: 1 | IF: 2.3 / 2.2 | Submit date: 2023/01/30
Keywords: Advantage Function; Exploration; Multi-agent Reinforcement Learning; Noise Injection; Proximal Policy Optimization