Enhancing Large Language Model Performance with Reinforcement Learning from Human Feedback: A Comprehensive Study on Q&A, Summarization, and Classification - researchr publication

researchr

You are not signed in
Sign in
Sign up

Nirdosh Rawal, Prudhvith Tavva, Prakash Selvakumar. Enhancing Large Language Model Performance with Reinforcement Learning from Human Feedback: A Comprehensive Study on Q&A, Summarization, and Classification. In International Conference on Electrical, Computer and Energy Technologies, ICECET 2024, Sydney, Australia, July 25-27, 2024. pages 1-6, IEEE, 2024. [doi]

Abstract is missing.

runs on WebDSL