Reinforcement online learning to rank with unbiased reward shaping
Shengyao Zhuang, Zhihao Qiao, Guido Zuccon
- Anthology ID:
- DBLP:journals/ir/ZhuangQZ22
- Volume:
- Volume 25, Issue 4 (2022)
- Year:
- 2022
- Venue:
- Information Retrieval Journal
- Pages:
- 386–413
- URL:
- https://doi.org/10.1007/s10791-022-09413-y
- DOI:
- 10.1007/s10791-022-09413-y
- DBLP:
- journals/ir/ZhuangQZ22