Rethinking gradient sparsification as total error minimization

Atal Narayan Sahu, Aritra Dutta, Ahmed M. Abdelmoniem, Trambak Banerjee, Marco Canini, Panos Kalnis. Rethinking gradient sparsification as total error minimization. In Marc'Aurelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, Jennifer Wortman Vaughan, editors, Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual. pages 8133-8146, 2021. [doi]

Authors

Atal Narayan Sahu

This author has not been identified. Look up 'Atal Narayan Sahu' in Google

Aritra Dutta

This author has not been identified. Look up 'Aritra Dutta' in Google

Ahmed M. Abdelmoniem

This author has not been identified. Look up 'Ahmed M. Abdelmoniem' in Google

Trambak Banerjee

This author has not been identified. Look up 'Trambak Banerjee' in Google

Marco Canini

This author has not been identified. Look up 'Marco Canini' in Google

Panos Kalnis

This author has not been identified. Look up 'Panos Kalnis' in Google