Performance-Portable Autotuning of OpenCL Kernels for Convolutional Layers of Deep Neural Networks

Yaohung M. Tsai, Piotr Luszczek, Jakub Kurzak, Jack J. Dongarra. Performance-Portable Autotuning of OpenCL Kernels for Convolutional Layers of Deep Neural Networks. In 2nd Workshop on Machine Learning in HPC Environments, MLHPC@SC, Salt Lake City, UT, USA, November 14, 2016. pages 9-18, IEEE Computer Society, 2016.
