Bowen Sun, Riccardo Pinciroli, Giuliano Casale, Evgenia Smirni. DeepBAT: Performance and Cost Optimization of Serverless Inference Using Transformers. In IEEE International Parallel and Distributed Processing Symposium, IPDPS 2025, Milano, Italy, June 3-7, 2025. pages 335-346, IEEE, 2025. [doi]
Abstract is missing.