Reinforcement Learning to Rank with Pairwise Policy Gradient

Jun Xu, Zeng Wei, Long Xia, Yanyan Lan, Dawei Yin, Xueqi Cheng, Ji-Rong Wen

Anthology ID: DBLP:conf/sigir/XuWXLYCW20

Volume: Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, SIGIR 2020, Virtual Event, China, July 25-30, 2020

Year: 2020

Venue: sigirconf_conference

Publisher: ACM

Pages: 509–518

URL: https://doi.org/10.1145/3397271.3401148

DOI: 10.1145/3397271.3401148

DBLP: conf/sigir/XuWXLYCW20
