Fast implementation of DGEMM on Fermi GPU

Guangming Tan, Linchuan Li, Sean Triechle, Everett Phillips, Yungang Bao, Ninghui Sun. Fast implementation of DGEMM on Fermi GPU. In Scott Lathrop, Jim Costa, William Kramer, editors, Conference on High Performance Computing Networking, Storage and Analysis, SC 2011, Seattle, WA, USA, November 12-18, 2011. pages 35, ACM, 2011. [doi]

@inproceedings{TanLTPBS11,
  title = {Fast implementation of DGEMM on Fermi GPU},
  author = {Guangming Tan and Linchuan Li and Sean Triechle and Everett Phillips and Yungang Bao and Ninghui Sun},
  year = {2011},
  doi = {10.1145/2063384.2063431},
  url = {http://doi.acm.org/10.1145/2063384.2063431},
  researchr = {https://researchr.org/publication/TanLTPBS11},
  cites = {0},
  citedby = {0},
  pages = {35},
  booktitle = {Conference on High Performance Computing Networking, Storage and Analysis, SC 2011, Seattle, WA, USA, November 12-18, 2011},
  editor = {Scott Lathrop and Jim Costa and William Kramer},
  publisher = {ACM},
  isbn = {978-1-4503-0771-0},
}