Journal: CCF Trans. High Perform. Comput.

Volume 7, Issue 3

179 -- 193 Zhao Mao, Xingjun Zhang, Longxiang Wang. KANETAS: an elastic scheduler for heterogeneous many-core systems
194 -- 210 Dongting Chen, Jie Shen 0003, Chun Huang, Xin Yi. An empirical study of error-free transformations for enhancing mathematical function precision
211 -- 225 Hengzhong Liang, Han Huang, Xianwei Zhang 0001. SuCL: supply unified communication layer to improve SYCL-based heterogeneous computing
226 -- 244 Zhangjie Tan, Jinfang Jia, Zhengsheng Ning, Jianqiang Huang, Xiaoying Wang 0002. Research on GPU transplantation optimization of PRM scalar advection scheme in GRAPES global forecast system
245 -- 259 Yalin Zhu, Youquan Chang, Jiapeng Zhang, Yingjie Song, Zhuo Tang. An optimized hierarchical MapReduce framework in supercomputing Internet environment
260 -- 274 Da Huo, Xin You, Zhibo Xuan, Hailong Yang, Zhongzhi Luan, Depei Qian. Hotspy: identifying performance hotspot with graph neural network based static analysis
275 -- 290 Yunkun Liao, Jingya Wu, Wenyan Lu, Huawei Li 0001, Xiaowei Li 0001, Guihai Yan. FUS: FPGA-based Universal Sketch with homogeneous and heterogeneous memory architectures
291 -- 304 Ronghui Cao, Peng Zhang, Yiming Wu, Jun Liu, Haibin Su. Adaptive container scheduling based on reinforcement learning in kubernetes

Volume 7, Issue 2

85 -- 99 Pin Chen, Qing Mo, Zexin Xu, XianWei Zhang, Yutong Lu. Star-gen: an HPC-AI framework for constructing large-scale computational materials database
100 -- 113 Wentao Feng, Shizhe Shang, Pengfei Li, Hailong Yang, Zhongzhi Luan, Depei Qian. SyncNOVA: an end-to-end fine-grained profiling tool oN lOck behaVior detection and critical section diAgnosis
114 -- 128 Ningxi Tian, Silu Huang, Xiaowen Xu. Mixed precision block-Jacobi preconditioner: algorithms, performance evaluation and feature analysis
129 -- 141 Jianfei Xu, Lianhua He, Zhong Jin. Mixed precision SpMV on GPUs for irregular data with hierarchical precision selection
142 -- 154 Wenlong Fan, Haobo Hua, Jiandong Shang, Zhuxin Wen, Hengliang Guo, Litao Zhang. Optimizing 2D convolution for DCUs
155 -- 168 Xiangyu Meng, Xun Wang, Mingzhen Li, Guangming Tan, Weile Jia. An interpretable DeePMD-kit performance model for emerging supercomputers
169 -- 177 Heming Zhong, Xiaojian Pan, Zengquang He, Haoling Wang, Dan Huang, Zhiguang Chen. GPU acceleration for DNA sequence alignment algorithm and its application

Volume 7, Issue 1

1 -- 16 Hanzheng Liang, Chencheng Deng, Peng Zhang 0061, Jianbin Fang, Tao Tang 0001, Chun Huang. An empirical performance evaluation of SYCL on ARM multi-core processors
17 -- 28 Youxuan Xu, Tong Wu, Shigang Li 0002, Xueying Wang, Jingjing Wang. SparkAttention: high-performance multi-head attention for large models on Volta GPU architecture
29 -- 42 Tao Huang, Yonggui Liang, Shubao Yu, Kexin Chen. TxCocket: an innovative solution for efficient cross-node data transmission enabled by CXL-based shared memory
43 -- 57 Wenhao Dai, Ziyi Jia, Yuesi Bai, Qingxiao Sun. Convergence-aware operator-wise mixed-precision training
58 -- 71 Jin Zhang 0018, Jincheng Zhou, Xiang Zhang 0008, Di Ma, Chunye Gong. Fine-grained vectorized merge sorting on RISC-V: from register to cache
72 -- 84 Muchun Peng, Qinglin Wang, Yuechao Liang, Weihao Guo, Shun Yang, Yaling Liang, Yongzhen Shi, Ligang Cao, Jie Liu 0002. GreenB+Tree: an energy-efficient B+tree for MIMD architectures