Enhancing Large Language Model Performance with Reinforcement Learning from Human Feedback: A Comprehensive Study on Q&A, Summarization, and Classification

Nirdosh Rawal, Prudhvith Tavva, Prakash Selvakumar. Enhancing Large Language Model Performance with Reinforcement Learning from Human Feedback: A Comprehensive Study on Q&A, Summarization, and Classification. In International Conference on Electrical, Computer and Energy Technologies, ICECET 2024, Sydney, Australia, July 25-27, 2024. pages 1-6, IEEE, 2024. [doi]

Abstract

Abstract is missing.