Accelerating Triton convolutions by exposing Shared Memory to the Programmer

Shourya Goel, Ricky Dev, Pradeep Ramachandran. Accelerating Triton convolutions by exposing Shared Memory to the Programmer. In 32nd IEEE International Conference on High Performance Computing, Data and Analytics, HiPC 2025 - Workshop, Hyderabad, India, December 17-20, 2025. pages 235-236, IEEE, 2025. [doi]

Authors

Shourya Goel

This author has not been identified. Look up 'Shourya Goel' in Google

Ricky Dev

This author has not been identified. Look up 'Ricky Dev' in Google

Pradeep Ramachandran

This author has not been identified. Look up 'Pradeep Ramachandran' in Google