The following publications are possibly variants of this publication:
- Automatic Generation of Micro-kernels for Performance Portability of Matrix Multiplication on RISC-V Vector Processors. Francisco D. Igual, Luis Piñuel, Sandra Catalán, Héctor Martínez, Adrián Castelló, Enrique S. Quintana-Ortí. SC 2023: 1521-1532
- High Performance and Energy Efficient Integer Matrix Multiplication for Deep Learning. Pau San Juan, Pedro Alonso-Jordá, Enrique S. Quintana-Ortí. PDP 2021: 122-125
- Low precision matrix multiplication for efficient deep learning in NVIDIA Carmel processors. Pablo San Juan, Rafael Rodríguez-Sánchez, Francisco D. Igual, Pedro Alonso-Jordá, Enrique S. Quintana-Ortí. tjs, 77(10):11257-11269, 2021.
- Tackling the Matrix Multiplication Micro-Kernel Generation with Exo. Adrián Castelló, Julian Bellavita, Grace Dinh, Yuka Ikarashi, Héctor Martínez. CGO 2024: 182-193