Scalable Distributed Training of Recommendation Models: An ASTRA-SIM + NS3 case-study with TCP/IP transport

Saeed Rashidi, Pallavi Shurpali, Srinivas Sridharan 0002, Naader Hassani, Dheevatsa Mudigere, Krishnakumar Nair, Misha Smelyanski, Tushar Krishna. Scalable Distributed Training of Recommendation Models: An ASTRA-SIM + NS3 case-study with TCP/IP transport. In IEEE Symposium on High-Performance Interconnects, HOTI 2020, Piscataway, NJ, USA, August 19-21, 2020, pages 33-42. IEEE, 2020.

Authors

Saeed Rashidi

Pallavi Shurpali

Srinivas Sridharan 0002

Naader Hassani

Dheevatsa Mudigere

Krishnakumar Nair

Misha Smelyanski

Tushar Krishna
