A W-cycle algorithm for efficient batched SVD on GPUs

Junmin Xiao, Qing Xue, Hui Ma, Xiaoyang Zhang, Guangming Tan. A W-cycle algorithm for efficient batched SVD on GPUs. In Jaejin Lee, Kunal Agrawal, Michael F. Spear, editors, PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2 - 6, 2022. pages 465-466, ACM, 2022. [doi]

@inproceedings{XiaoXMZT22,
  title = {A W-cycle algorithm for efficient batched SVD on GPUs},
  author = {Junmin Xiao and Qing Xue and Hui Ma and Xiaoyang Zhang and Guangming Tan},
  year = {2022},
  doi = {10.1145/3503221.3508443},
  url = {https://doi.org/10.1145/3503221.3508443},
  researchr = {https://researchr.org/publication/XiaoXMZT22},
  cites = {0},
  citedby = {0},
  pages = {465-466},
  booktitle = {PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2 - 6, 2022},
  editor = {Jaejin Lee and Kunal Agrawal and Michael F. Spear},
  publisher = {ACM},
  isbn = {978-1-4503-9204-4},
}