Flover: A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference

Jinghan Yao, Nawras Alnaasan, Tian Chen, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda. Flover: A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference. In 30th IEEE International Conference on High Performance Computing, Data, and Analytics, HiPC 2023, Goa, India, December 18-21, 2023. Pages 107-116, IEEE, 2023.

@inproceedings{YaoACSSP23,
  title = {Flover: A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference},
  author = {Jinghan Yao and Nawras Alnaasan and Tian Chen and Aamir Shafi and Hari Subramoni and Dhabaleswar K. Panda},
  year = {2023},
  doi = {10.1109/HiPC58850.2023.00026},
  url = {https://doi.org/10.1109/HiPC58850.2023.00026},
  researchr = {https://researchr.org/publication/YaoACSSP23},
  cites = {0},
  citedby = {0},
  pages = {107--116},
  booktitle = {30th IEEE International Conference on High Performance Computing, Data, and Analytics, HiPC 2023, Goa, India, December 18-21, 2023},
  publisher = {IEEE},
  isbn = {979-8-3503-8322-5},
}