Confidential LLM Inference: Performance and Cost Across CPU and GPU TEEs

Marcin Chrapek, Marcin Copik, Etienne Mettaz, Torsten Hoefler. Confidential LLM Inference: Performance and Cost Across CPU and GPU TEEs. In IEEE International Symposium on Workload Characterization, IISWC 2025, Irvine, CA, USA, October 12-14, 2025. pages 84-98, IEEE, 2025. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.