PP-PG: Combining Parameter Perturbation with Policy Gradient Methods for Effective and Efficient Explorations in Deep Reinforcement Learning
Shilei Li, Meng Li, Jiongming Su, Shaofei Chen, Zhimin Yuan, Qing Ye
- Anthology ID:
- DBLP:journals/tist/LiLSCYY21
- Volume:
- 2021 Volume 12 Issue 3
- Year:
- 2021
- Venue:
- tist_journal
- Pages:
- 35:1–35:21
- URL:
- https://doi.org/10.1145/3452008
- DOI:
- 10.1145/3452008
- DBLP:
- journals/tist/LiLSCYY21