Novel HPC techniques to batch execution of many variable size BLAS computations on GPUs

Ahmad Abdelfattah, Azzam Haidar, Stanimire Tomov, Jack Dongarra. Novel HPC techniques to batch execution of many variable size BLAS computations on GPUs. In William D. Gropp, Pete Beckman, Zhiyuan Li, Francisco J. Cazorla, editors, Proceedings of the International Conference on Supercomputing, ICS 2017, Chicago, IL, USA, June 14-16, 2017. ACM, 2017. [doi]

@inproceedings{AbdelfattahHTD17-0,
  title = {Novel HPC techniques to batch execution of many variable size BLAS computations on GPUs},
  author = {Ahmad Abdelfattah and Azzam Haidar and Stanimire Tomov and Jack Dongarra},
  year = {2017},
  doi = {10.1145/3079079.3079103},
  url = {http://doi.acm.org/10.1145/3079079.3079103},
  researchr = {https://researchr.org/publication/AbdelfattahHTD17-0},
  cites = {0},
  citedby = {0},
  booktitle = {Proceedings of the International Conference on Supercomputing, ICS 2017, Chicago, IL, USA, June 14-16, 2017},
  editor = {William D. Gropp and Pete Beckman and Zhiyuan Li and Francisco J. Cazorla},
  publisher = {ACM},
  isbn = {978-1-4503-5020-4},
}