VEXP: A Low-Cost RISC-V ISA Extension for Accelerated Softmax Computation in Transformers

Run Wang, Gamze Islamoglu, Andrea Belano, Viviane Potocnik, Francesco Conti 0001, Angelo Garofalo, Luca Benini. VEXP: A Low-Cost RISC-V ISA Extension for Accelerated Softmax Computation in Transformers. In IEEE 32nd Symposium on Computer Arithmetic, ARITH 2025, El Paso, TX, USA, May 4-7, 2025. pages 37-44, IEEE, 2025. [doi]

@inproceedings{WangIBP0GB25,
  title = {VEXP: A Low-Cost RISC-V ISA Extension for Accelerated Softmax Computation in Transformers},
  author = {Run Wang and Gamze Islamoglu and Andrea Belano and Viviane Potocnik and Francesco Conti 0001 and Angelo Garofalo and Luca Benini},
  year = {2025},
  doi = {10.1109/ARITH64983.2025.00016},
  url = {https://doi.org/10.1109/ARITH64983.2025.00016},
  researchr = {https://researchr.org/publication/WangIBP0GB25},
  cites = {0},
  citedby = {0},
  pages = {37-44},
  booktitle = {IEEE 32nd Symposium on Computer Arithmetic, ARITH 2025, El Paso, TX, USA, May 4-7, 2025},
  publisher = {IEEE},
  isbn = {979-8-3315-2159-2},
}