Jack J. Dongarra, Mark Gates, Jakub Kurzak, Piotr Luszczek, Yaohung M. Tsai. Autotuning Numerical Dense Linear Algebra for Batched Computation With GPU Hardware Accelerators. Proceedings of the IEEE, 106(11):2040-2055, 2018. [doi]
Abstract is missing.