The following publications are possibly variants of this publication:
- Optimizing half precision Winograd convolution on ARM many-core processorsDedong Xie, Zhen Jia, Zili Zhang, Xin Jin 0008. apsys 2022: 53-60 [doi]
- Optimizing Pointwise Convolutions on Multi-core DSPsYang Wang, Qinglin Wang, Xiangdong Pei, Songzhu Mei, Jie Liu 0002. ica3pp 2024: 209-223 [doi]
- Optimizing Massively Parallel Winograd Convolution on ARM ProcessorDongsheng Li, Dan Huang, Zhiguang Chen, Yutong Lu. icpp 2021: [doi]
- Reformulating the direct convolution for high-performance deep learning inference on ARM processorsSergio Barrachina 0001, Adrián Castelló 0001, Manuel F. Dolz, Tze Meng Low, Héctor Martínez, Enrique S. Quintana-Ortí, Upasana Sridhar, Andrés E. Tomás. jsa, 135:102806, February 2023. [doi]