Position: AI Competitions Provide the Gold Standard for Empirical Rigor in GenAI Evaluation - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

D. Sculley, William Cukierski, Phil Culliton, Sohier Dane, Maggie Demkin, Ryan Holbrook, Addison Howard, Paul Mooney, Walter Reade, Meg Risdal, Nate Keating. Position: AI Competitions Provide the Gold Standard for Empirical Rigor in GenAI Evaluation. In Forty-second International Conference on Machine Learning, ICML 2025, Vancouver, BC, Canada, July 13-19, 2025 - Position Paper Track. OpenReview.net, 2025. [doi]

This author has not been identified. Look up 'D. Sculley' in GoogleThis author has not been identified. Look up 'William Cukierski' in GoogleThis author has not been identified. Look up 'Phil Culliton' in GoogleThis author has not been identified. Look up 'Sohier Dane' in GoogleThis author has not been identified. Look up 'Maggie Demkin' in GoogleThis author has not been identified. Look up 'Ryan Holbrook' in GoogleThis author has not been identified. Look up 'Addison Howard' in GoogleThis author has not been identified. Look up 'Paul Mooney' in GoogleThis author has not been identified. Look up 'Walter Reade' in GoogleThis author has not been identified. Look up 'Meg Risdal' in GoogleThis author has not been identified. Look up 'Nate Keating' in Google

runs on WebDSL