Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?

Yi Tay, Mostafa Dehghani 0001, Samira Abnar, Hyung Won Chung, William Fedus, Jinfeng Rao, Sharan Narang, Vinh Q. Tran 0002, Dani Yogatama, Donald Metzler. Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?. In Houda Bouamor, Juan Pino 0001, Kalika Bali, editors, Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023. pages 12342-12364, Association for Computational Linguistics, 2023. [doi]

@inproceedings{Tay0ACFRN0YM23,
  title = {Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?},
  author = {Yi Tay and Mostafa Dehghani 0001 and Samira Abnar and Hyung Won Chung and William Fedus and Jinfeng Rao and Sharan Narang and Vinh Q. Tran 0002 and Dani Yogatama and Donald Metzler},
  year = {2023},
  url = {https://aclanthology.org/2023.findings-emnlp.825},
  researchr = {https://researchr.org/publication/Tay0ACFRN0YM23},
  cites = {0},
  citedby = {0},
  pages = {12342-12364},
  booktitle = {Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023},
  editor = {Houda Bouamor and Juan Pino 0001 and Kalika Bali},
  publisher = {Association for Computational Linguistics},
  isbn = {979-8-89176-061-5},
}