DeepBAT: Performance and Cost Optimization of Serverless Inference Using Transformers

Bowen Sun, Riccardo Pinciroli, Giuliano Casale, Evgenia Smirni. DeepBAT: Performance and Cost Optimization of Serverless Inference Using Transformers. In IEEE International Parallel and Distributed Processing Symposium, IPDPS 2025, Milano, Italy, June 3-7, 2025. pages 335-346, IEEE, 2025. [doi]

Authors

Bowen Sun

This author has not been identified. Look up 'Bowen Sun' in Google

Riccardo Pinciroli

This author has not been identified. Look up 'Riccardo Pinciroli' in Google

Giuliano Casale

This author has not been identified. It may be one of the following persons: Look up 'Giuliano Casale' in Google

Evgenia Smirni

This author has not been identified. Look up 'Evgenia Smirni' in Google