H. S. V. N. S. Kowndinya Renduchintala, KrishnaTeja Killamsetty, Sumit Bhatia, Milan Aggarwal, Ganesh Ramakrishnan, Rishabh K. Iyer, Balaji Krishnamurthy. INGENIOUS: Using Informative Data Subsets for Efficient Pre-Training of Language Models. In Houda Bouamor, Juan Pino, Kalika Bali, editors, Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023. pages 6690-6705, Association for Computational Linguistics, 2023.