- Vasu Sharma, Karthik Padthe, Newsha Ardalani, Kushal Tirumala, Russell Howes, Hu Xu, Po-Yao Huang, Daniel Li Chen, Armen Aghajanyan, Gargi Ghosh, Luke Zettlemoyer. Text Quality-Based Pruning for Efficient Training of Language Models. J. Data-centric Mach. Learn. Res., 2, 2025.
- Ronak Tali, Ali Rabeh, Cheng-Hau Yang, Mehdi Shadkhah, Samundra Karki, Abhisek Upadhyaya, Suriya Dhakshinamoorthy, Marjan Saadati, Soumik Sarkar, Adarsh Krishnamurthy, Chinmay Hegde, Aditya Balu, Baskar Ganapathysubramanian. FlowBench: A Large Scale Benchmark for Flow Simulation over Complex Geometries. J. Data-centric Mach. Learn. Res., 2, 2025.
- Ziwei Yang, Xuxi Chen, Biqing Zhu, Tianlong Chen, Zhangyang Wang. Deep Learning for Accurate Diagnosis of Viral Infections through scRNA-seq Analysis: A Comprehensive Benchmark Study. J. Data-centric Mach. Learn. Res., 2, 2025.
- Helen Jin, Shreya Havaldar, Chaehyeon Kim, Anton Xue, Weiqiu You, Helen Qu, Marco Gatti, Daniel A. Hashimoto, Bhuvnesh Jain, Amin Madani, Masao Sako, Lyle H. Ungar, Eric Wong. The FIX Benchmark: Extracting Features Interpretable to eXperts. J. Data-centric Mach. Learn. Res., 2, 2025.
- Angus Dempster, Navid Mohammadi Foumani, Chang Wei Tan, Lynn Miller, Amish Mishra, Mahsa Salehi, Charlotte Pelletier, Daniel F. Schmidt, Geoffrey I. Webb. MONSTER: Monash Scalable Time Series Evaluation Repository. J. Data-centric Mach. Learn. Res., 2, 2025.
- Hugo Jair Escalante, Isabelle Guyon, Addison Howard, Walter Reade, Sébastien Treguer. Challenge design roadmap. J. Data-centric Mach. Learn. Res., 2, 2025.
- David Rousseau, Antoine Marot, Zhen Xu. Towards impactful challenges: post-challenge paper, benchmarks and other dissemination actions. J. Data-centric Mach. Learn. Res., 2, 2025.
- Pu Ren, N. Benjamin Erichson, Junyi Guo, Shashank Subramanian, Omer San, Zarija Lukic, Michael W. Mahoney. SuperBench: A Super-Resolution Benchmark Dataset for Scientific Machine Learning. J. Data-centric Mach. Learn. Res., 2, 2025.
- Lingjiao Chen, Bilge Acun, Newsha Ardalani, Yifan Sun, Feiyang Kang, Hanrui Lyu, Yongchan Kwon, Ruoxi Jia, Carole-Jean Wu, Matei Zaharia, James Zou. Data Acquisition: A New Frontier in Data-centric AI. J. Data-centric Mach. Learn. Res., 2, 2025.
- Lukas Helff, Wolfgang Stammer, Hikaru Shindo, Devendra Singh Dhami, Kristian Kersting. V-LoL: A Diagnostic Dataset for Visual Logical Learning. J. Data-centric Mach. Learn. Res., 2, 2025.