Inference performance of large language models on a 64-core RISC-V CPU with silicon-enabled vectors

Adriano Marques Garcia, Giulio Malenza, Robert Birke, Marco Aldinucci. Inference performance of large language models on a 64-core RISC-V CPU with silicon-enabled vectors. Future Generation Comp. Syst., 177:108242, 2026. [doi]

Abstract

Abstract is missing.