Zeng Wei
2020
Reinforcement Learning to Rank with Pairwise Policy Gradient
Jun Xu
|
Zeng Wei
|
Long Xia
|
Yanyan Lan
|
Dawei Yin
|
Xueqi Cheng
|
Ji-Rong Wen
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, SIGIR 2020, Virtual Event, China, July 25-30, 2020
Search
Co-authors
- Ji-Rong Wen 1
- Jun Xu 1
- Long Xia 1
- Zeng Wei 1
- Yanyan Lan 1
- show all...