Predicting the Performance of Black-box Language Models with Follow-up Queries

Dylan Sam, Marc Finzi, Zico Kolter. Predicting the Performance of Black-box Language Models with Follow-up Queries. In Danielle Belgrave, Cheng Zhang 0005, Laura N. Montoya, Hsuan-Tien Lin, Razvan Pascanu, Piotr Koniusz, Marzyeh Ghassemi, Nancy Chen, Iván Vladimir Meza Ruíz, Arturo Loaiza-Bonilla, editors, Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, NeurIPS 2025, San Diago, CA, USA, December 2-7, 2025 / Mexico City, Mexico, November 30 - December 5, 2025. 2025. [doi]

Authors

Dylan Sam

This author has not been identified. Look up 'Dylan Sam' in Google

Marc Finzi

This author has not been identified. Look up 'Marc Finzi' in Google

Zico Kolter

This author has not been identified. Look up 'Zico Kolter' in Google