Densifying Assumed-Sparse Tensors - Improving Memory Efficiency and MPI Collective Performance During Tensor Accumulation for Parallelized Training of Neural Machine Translation Models

Derya Cavdar, Valeriu Codreanu, Can Karakus, John A. Lockman III, Damian Podareanu, Vikram A. Saletore, Alexander Sergeev, Don D. Smith II, Victor Suthichai, Quy Ta, Srinivas Varadharajan, Lucas A. Wilson, Rengan Xu, Pei Yang. Densifying Assumed-Sparse Tensors - Improving Memory Efficiency and MPI Collective Performance During Tensor Accumulation for Parallelized Training of Neural Machine Translation Models. In Michèle Weiland, Guido Juckeland, Carsten Trinitis, Ponnuswamy Sadayappan, editors, High Performance Computing - 34th International Conference, ISC High Performance 2019, Frankfurt/Main, Germany, June 16-20, 2019, Proceedings. Volume 11501 of Lecture Notes in Computer Science, pages 23-39, Springer, 2019. doi:10.1007/978-3-030-20656-7_2

@inproceedings{CavdarCKLPSSSST19,
  title = {Densifying Assumed-Sparse Tensors - Improving Memory Efficiency and MPI Collective Performance During Tensor Accumulation for Parallelized Training of Neural Machine Translation Models},
  author = {Derya Cavdar and Valeriu Codreanu and Can Karakus and John A. Lockman III and Damian Podareanu and Vikram A. Saletore and Alexander Sergeev and Don D. Smith II and Victor Suthichai and Quy Ta and Srinivas Varadharajan and Lucas A. Wilson and Rengan Xu and Pei Yang},
  year = {2019},
  doi = {10.1007/978-3-030-20656-7_2},
  url = {https://doi.org/10.1007/978-3-030-20656-7_2},
  researchr = {https://researchr.org/publication/CavdarCKLPSSSST19},
  cites = {0},
  citedby = {0},
  pages = {23--39},
  booktitle = {High Performance Computing - 34th International Conference, ISC High Performance 2019, Frankfurt/Main, Germany, June 16-20, 2019, Proceedings},
  editor = {Michèle Weiland and Guido Juckeland and Carsten Trinitis and Ponnuswamy Sadayappan},
  volume = {11501},
  series = {Lecture Notes in Computer Science},
  publisher = {Springer},
  isbn = {978-3-030-20656-7},
}