Ido Amos, Jonathan Berant, Ankit Gupta 0001. Never Train from Scratch: Fair Comparison of Long-Sequence Models Requires Data-Driven Priors (Extended Abstract). In Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, IJCAI 2025, Montreal, Canada, August 16-22, 2025. pages 10846-10851, ijcai.org, 2025. [doi]
Abstract is missing.