241 | -- | 242 | Jianbin Fang, Jidong Zhai, Zheng Wang. Editorial for the special issue on programming models and system software for High-Performance Computing (HPC) environments |
243 | -- | 262 | Junsheng Chang, Kai Lu, Yang Guo, Yongwen Wang, Zhenyu Zhao, Libo Huang, Hongwei Zhou, Yao Wang, Fei Lei, Biwei Zhang. A survey of compute nodes with 100 TFLOPS and beyond for supercomputers |
263 | -- | 273 | Jianfeng Liu, Wangrong Gao, Hanzheng Liang, Lin Peng, Ting Wang. Towards a universal and portable assembly code size reduction: a case study of RISC-V ISA |
274 | -- | 286 | Haoran Lin, Lifeng Yan, Qixin Chang, Haitian Lu, Chenlin Li, Quanjie He, Zeyu Song, Xiaohui Duan, Zekun Yin, Yuxuan Li, Zhao Liu, Wei Xue, Haohuan Fu, Lin Gan, Guangwen Yang, Weiguo Liu. O2ath: an OpenMP offloading toolkit for the Sunway heterogeneous manycore platform |
287 | -- | 300 | Yicheng Sui, Yufei Sun, Changqing Shi, Haotian Wang, Zhiqiang Zhang, Jiahao Wang, Yuzhi Zhang. Opencl-pytorch: an OpenCL-based extension of PyTorch |
301 | -- | 318 | Juncheng Hu, Xilong Che, Bowen Kan, Yuhan Shao. LS-HTC: an HTC system for large-scale jobs |
319 | -- | 329 | Changqing Shi, Yufei Sun, Yicheng Sui, Yuqiao Chen, Haotian Wang, Yuzhi Zhang. oclCUB: an OpenCL parallel computing library for deep learning operators |
330 | -- | 342 | Zongjing Chen, Kangjin Huang, Yonggang Che, Chuanfu Xu, Jian Zhang, Zhe Dai, Ming Li. Extending OP2 framework to support portable parallel programming of complex applications |
343 | -- | 364 | Shaojie Tan, Qingcai Jiang, Zhenwei Cao, Xiaoyu Hao, Junshi Chen, Hong An. Uncovering the performance bottleneck of modern HPC processor with static code analyzer: a case study on Kunpeng 920 |