Scheduling Large-scale Distributed Training via Reinforcement Learning

Zhanglin Peng, Jiamin Ren, Ruimao Zhang, Lingyun Wu, Xinjiang Wang, Ping Luo. Scheduling Large-scale Distributed Training via Reinforcement Learning. In Naoki Abe, Huan Liu 0001, Calton Pu, Xiaohua Hu, Nesreen Ahmed, Mu Qiao, Yang Song, Donald Kossmann, Bing Liu 0001, Kisung Lee, Jiliang Tang, Jingrui He, Jeffrey Saltz, editors, IEEE International Conference on Big Data, Big Data 2018, Seattle, WA, USA, December 10-13, 2018. pages 1797-1806, IEEE, 2018. [doi]

Authors

Zhanglin Peng

This author has not been identified. Look up 'Zhanglin Peng' in Google

Jiamin Ren

This author has not been identified. Look up 'Jiamin Ren' in Google

Ruimao Zhang

This author has not been identified. Look up 'Ruimao Zhang' in Google

Lingyun Wu

This author has not been identified. Look up 'Lingyun Wu' in Google

Xinjiang Wang

This author has not been identified. Look up 'Xinjiang Wang' in Google

Ping Luo

This author has not been identified. Look up 'Ping Luo' in Google