Pseudo Dyna-Q: A Reinforcement Learning Framework for Interactive Recommendation
Lixin Zou, Long Xia, Pan Du, Zhuo Zhang, Ting Bai, Weidong Liu, Jian-Yun Nie, Dawei Yin
- Anthology ID:
- DBLP:conf/wsdm/ZouXDZB0NY20
- Volume:
- WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, Houston, TX, USA, February 3-7, 2020
- Year:
- 2020
- Venue:
- wsdm_conference
- Publisher:
- ACM
- Pages:
- 816–824
- URL:
- https://doi.org/10.1145/3336191.3371801
- DOI:
- 10.1145/3336191.3371801
- DBLP:
- conf/wsdm/ZouXDZB0NY20