Optimizing GPU Code for CPU Execution Using OpenCL and Vectorization: A Case Study on Image Coding

Pedro M. M. Pereira, Patrício Domingues, Nuno M. M. Rodrigues, Gabriel Falcão, Sérgio M. M. de Faria. Optimizing GPU Code for CPU Execution Using OpenCL and Vectorization: A Case Study on Image Coding. In Jesús Carretero, Javier García Blas, Ryan K. L. Ko, Peter Mueller, Koji Nakano, editors, Algorithms and Architectures for Parallel Processing - 16th International Conference, ICA3PP 2016, Granada, Spain, December 14-16, 2016, Proceedings. Volume 10048 of Lecture Notes in Computer Science, pages 537-545, Springer, 2016. doi:10.1007/978-3-319-49583-5_42

@inproceedings{PereiraDRFF16,
  title = {Optimizing GPU Code for CPU Execution Using OpenCL and Vectorization: A Case Study on Image Coding},
  author = {Pedro M. M. Pereira and Patrício Domingues and Nuno M. M. Rodrigues and Gabriel Falcão and Sérgio M. M. de Faria},
  year = {2016},
  doi = {10.1007/978-3-319-49583-5_42},
  url = {http://dx.doi.org/10.1007/978-3-319-49583-5_42},
  researchr = {https://researchr.org/publication/PereiraDRFF16},
  pages = {537--545},
  booktitle = {Algorithms and Architectures for Parallel Processing - 16th International Conference, ICA3PP 2016, Granada, Spain, December 14-16, 2016, Proceedings},
  editor = {Jesús Carretero and Javier García Blas and Ryan K. L. Ko and Peter Mueller and Koji Nakano},
  volume = {10048},
  series = {Lecture Notes in Computer Science},
  publisher = {Springer},
  isbn = {978-3-319-49582-8},
}