Md. Tahmid Rahman Laskar, Elena Khasanova, Xue-Yong Fu, Cheng Chen, Shashi Bhushan TN. Query-OPT: Optimizing Inference of Large Language Models via Multi-Query Instructions in Meeting Summarization. In Franck Dernoncourt, Daniel Preotiuc-Pietro, Anastasia Shimorina, editors, Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024 - Industry Track, Miami, Florida, USA, November 12-16, 2024. pages 1140-1151, Association for Computational Linguistics, 2024. [doi]
Abstract is missing.