Efficient Training Corpus Retrieval for Large Language Model Fine Tuning: A Case Study in Cancer

Avisha Das, Chiamaka S. Diala, Guocai Chen, Zhao Li, Rongbin Li, Omer Anjum, W. Jim Zheng. Efficient Training Corpus Retrieval for Large Language Model Fine Tuning: A Case Study in Cancer. In Mowafa S. Househ, Zain Ul Abideen Tariq, Mahmood Al-Zubaidi, Uzair Shah, Elaine Huesing, editors, MEDINFO 2025 - Healthcare Smart × Medicine Deep - Proceedings of the 20th World Congress on Medical and Health Informatics, Taipei, Taiwan, 9-13 August 2025. Volume 329 of Studies in Health Technology and Informatics, pages 1251-1255, IOS Press, 2025. [doi]

Abstract

Abstract is missing.