Reinforcement Learning to Rank with Pairwise Policy Gradient
Jun Xu, Zeng Wei, Long Xia, Yanyan Lan, Dawei Yin, Xueqi Cheng, Ji-Rong Wen
- Anthology ID:
- DBLP:conf/sigir/XuWXLYCW20
- Volume:
- Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, SIGIR 2020, Virtual Event, China, July 25-30, 2020
- Year:
- 2020
- Venue:
- sigirconf_conference
- Publisher:
- ACM
- Pages:
- 509–518
- URL:
- https://doi.org/10.1145/3397271.3401148
- DOI:
- 10.1145/3397271.3401148
- DBLP:
- conf/sigir/XuWXLYCW20