doi dblpQoS-Aware Scheduling of Heterogeneous Servers for Inference in Deep Neural NetworksZhou Fang | Tong Yu | Ole J. Mengshoel | Rajesh K. GuptaProceedings of the 2017 ACM on Conference on Information and Knowledge Management, CIKM 2017, Singapore, November 06 - 10, 2017