Adhitya Mohan, Richard Thompson, Eric Keller, Mark Zhao. SHARD: A Compatibility Framework for Deploying Transformer Models on Edge NPUs. In Proceedings of the Sixth European Workshop on Machine Learning and Systems, EuroMLSys 2026, Edinburgh, Scotland, UK, April 27-30, 2026. pages 200-207, ACM, 2026. [doi]
Abstract is missing.