kGym: A Platform and Dataset to Benchmark Large Language Models on Linux Kernel Crash Resolution

Alex Mathai, Chenxi Huang, Petros Maniatis, Aleksandr Nogikh, Franjo Ivancic, Junfeng Yang, Baishakhi Ray. kGym: A Platform and Dataset to Benchmark Large Language Models on Linux Kernel Crash Resolution. In Amir Globersons, Lester Mackey, Danielle Belgrave, Angela Fan, Ulrich Paquet, Jakub M. Tomczak, Cheng Zhang, editors, Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, NeurIPS 2024, Vancouver, BC, Canada, December 10 - 15, 2024. 2024.

Authors

Alex Mathai

Chenxi Huang

Petros Maniatis

Aleksandr Nogikh

Franjo Ivancic

Junfeng Yang

Baishakhi Ray
