Sequential Search with Off-Policy Reinforcement Learning
Dadong Miao, Yanan Wang, Guoyu Tang, Lin Liu, Sulong Xu, Bo Long, Yun Xiao, Lingfei Wu, Yunjiang Jiang
- Anthology ID: DBLP:conf/cikm/MiaoWTLXLXWJ21
- Volume: CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1 - 5, 2021
- Year: 2021
- Venue: cikm_conference
- Publisher: ACM
- Pages: 4006–4015
- URL: https://doi.org/10.1145/3459637.3481954
- DOI: 10.1145/3459637.3481954
- DBLP: conf/cikm/MiaoWTLXLXWJ21