QoS-Aware Scheduling of Heterogeneous Servers for Inference in Deep Neural Networks
Zhou Fang, Tong Yu, Ole J. Mengshoel, Rajesh K. Gupta
- Anthology ID: DBLP:conf/cikm/FangYMG17
- Volume: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, CIKM 2017, Singapore, November 06 - 10, 2017
- Year: 2017
- Venue: cikm_conference
- Publisher: ACM
- Pages: 2067–2070
- URL: https://doi.org/10.1145/3132847.3133045
- DOI: 10.1145/3132847.3133045
- DBLP: conf/cikm/FangYMG17