INGENIOUS: Using Informative Data Subsets for Efficient Pre-Training of Language Models

H. S. V. N. S. Kowndinya Renduchintala, KrishnaTeja Killamsetty, Sumit Bhatia, Milan Aggarwal, Ganesh Ramakrishnan, Rishabh K. Iyer, Balaji Krishnamurthy. INGENIOUS: Using Informative Data Subsets for Efficient Pre-Training of Language Models. In Houda Bouamor, Juan Pino 0001, Kalika Bali, editors, Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023. pages 6690-6705, Association for Computational Linguistics, 2023. [doi]

Authors

H. S. V. N. S. Kowndinya Renduchintala

This author has not been identified. Look up 'H. S. V. N. S. Kowndinya Renduchintala' in Google

KrishnaTeja Killamsetty

This author has not been identified. Look up 'KrishnaTeja Killamsetty' in Google

Sumit Bhatia

This author has not been identified. Look up 'Sumit Bhatia' in Google

Milan Aggarwal

This author has not been identified. Look up 'Milan Aggarwal' in Google

Ganesh Ramakrishnan

This author has not been identified. Look up 'Ganesh Ramakrishnan' in Google

Rishabh K. Iyer

This author has not been identified. Look up 'Rishabh K. Iyer' in Google

Balaji Krishnamurthy

This author has not been identified. Look up 'Balaji Krishnamurthy' in Google