Journal: IEEE Transactions on Computers

Volume 74, Issue 9

2856 -- 2869Hao Lv, Lei Zhang 0008, Ying Wang 0001. In-Situ NAS: A Plug-and-Search Neural Architecture Search Framework Across Hardware Platforms
2870 -- 2881Zhuo Huang, Hao Fan 0006, Bin Tang, Song Wu 0001, Chen Yu 0003, Hai Jin 0001. CBuild: Cluster-Oriented Collaborative Image Building for Containers
2882 -- 2895Scott Sirri, Zhe Wang 0056, Netanel Raviv, Jeremy T. Fineman, Kunal Agrawal 0001. Efficient Static Schedules for Fault-Tolerant Transmissions on Shared Media
2896 -- 2908Yijun Cui, Junjie Zhong, Bei Wang, Tianyu Xu, Chenghua Wang, Weiqiang Liu 0001. High-Performance Hardware Implementation of Crystals-Dilithium Based on Improved MDC-NTT
2909 -- 2922Wontak Han, Hyunjun Cho, Donghyuk Kim, Joo-Young Kim 0001. SAL-PIM: A Subarray-Level Processing-in-Memory Architecture With LUT-Based Linear Interpolation for Transformer-Based Text Generation
2923 -- 2935Chao Li 0065, Xuchu Huang, Zhicheng Xu, Bo Wen, Ruibin Mao, Min Zhou, Thomas Kämpfe, Kai Ni 0004, Can Li, Xunzhao Yin, Cheng Zhuo. High-Performance In-Memory Bayesian Inference With Multi-Bit Ferroelectric FET
2936 -- 2949Xiaolu Cheng, Xiaoshuang Xing, Wei Li 0059, Hong Xue, Tong Can. An Energy-Efficient and Privacy-Aware MEC-Enabled IoMT Health Monitoring System
2950 -- 2961Qianhui Liu, Jiadong Wang, Yang Wang 0106, Xin Yang 0011, Gang Pan 0001, Haizhou Li 0001. Human-Inspired Computing for Robust and Efficient Audio-Visual Speech Recognition
2962 -- 2976Surong Dai, Jinni Yang, Wenyang Cui, Yaozheng Fang, Ye Lu 0004. CROSC: Compilation-Runtime Joint Optimization for Fast Smart Contract Execution
2977 -- 2990Ping Luo, Xiaoge Deng, Ziqing Wen, Tao Sun 0005, Dongsheng Li 0001. BHerd: Accelerating Federated Learning by Selecting Beneficial Herd of Local Gradients
2991 -- 3002Sherif Eissa, Sander Stuijk, Floran de Putter, Andrea Nardi-Dei, Federico Corradi, Henk Corporaal. STEMS: Spatial-Temporal Mapping for Spiking Neural Networks
3003 -- 3017Yunping Zhao, Sheng Ma, Tiejun Li, Jianmin Zhang, Yuhua Tang. Intra- and Inter-Layer Scheduling Exploration and Optimization for ReRAM-Based DNN Accelerators
3018 -- 3031Wei Li 0121, Zicheng Shen, Xiulong Liu, Chuntao Ding, Jiaxing Shen. Fed-OGD: Mitigating Straggler Effects in Federated Learning via Orthogonal Gradient Descent
3032 -- 3045Fangqi Bi, Guoqi Xie, Yuan Wang, Hao Wen, Zhenli He, Shaowen Yao 0001, Sirong Zhao, Chenglai Xiong, Xingyu Hu, Bo Wan, Yiwen Jiang. Hypercall-Oriented Abnormal VM Status Detection System: A Non-Intrusive Solution for Both Hypervisor and Guests
3046 -- 3058Zhaoyang Huang, Yanjie Tan, Yifu Zhu, Huailiang Tan, Keqin Li 0001. Dynamic DPU Offloading and Computational Resource Management in Heterogeneous Systems
3059 -- 3071Cheolgi Min, Jiwoong Park, Heon Young Yeom, Hyungsoo Jung 0001. MDC+: A Cooperative Approach to Memory-Efficient Fork-Based Checkpointing for In-Memory Database Systems
3072 -- 3086Thomas Benz, Alessandro Ottaviano, Chaoqun Liang, Robert Balas, Angelo Garofalo, Francesco Restuccia 0002, Alessandro Biondi 0001, Davide Rossi, Luca Benini. AXI-REALM: Safe, Modular and Lightweight Traffic Monitoring and Regulation for Heterogeneous Mixed-Criticality Systems
3087 -- 3098Kai-Feng, Guodong Xie, Zhangjian Ji, Dajin Wang. Evaluating Robustness of Subnetworks for the Split-Star Network
3099 -- 3113Yuan Dai, Xuchen Gao, Yunhui Qiu, Jingyuan Li 0003, Yuhang Cao, Yiqing Mao, Sichao Chen, Wenbo Yin, Wai-Shing Luk, Lingli Wang. COFFA: A Co-Design Framework for Fused-Grained Reconfigurable Architecture Towards Efficient Irregular Loop Handling
3114 -- 3128Xu Yang, Qiuhao Wang, Saiyu Qi, Ke Li 0041, Yong Qi. RO(SE)$^{2}$: Search-Efficient Robust Searchable Encryption With Forward and Backward Security
3129 -- 3142Jiahui Huang, Zhan Li, YuXian Jiang, Zhihan Zhang, Hao Wang 0046, Sheng Chang. A High-Intensity Solution of Hardware Accelerator for Sparse and Redundant Computations in Semantic Segmentation Models
3143 -- 3155Qianqian Wu, Qiang Liu 0014, Ying He 0011, Zefan Wu. Reconfigurable Intelligent Surface Assisted UAV-MCS Based on Transformer Enhanced Deep Reinforcement Learning
3156 -- 3167Qiang He 0002, Quanwei Li, Chuangchuang Zhang, Xingwei Wang 0001, Yuanguo Bi, Liang Zhao 0004, Ammar Hawbani, Keping Yu. Task Optimization Allocation in Vehicle Based Edge Computing Systems With Deep Reinforcement Learning
3168 -- 3180Gabriele Magnani, Daniele Cattaneo 0002, Lev Denisov, Giuseppe Tagliavini, Giovanni Agosta, Stefano Cherubin. Synergistic Memory Optimisations: Precision Tuning in Heterogeneous Memory Hierarchies
3181 -- 3194Marta Andronic, Jiawen Li, George A. Constantinides. PolyLUT: Ultra-Low Latency Polynomial Inference With Hardware-Aware Structured Pruning
3195 -- 3209Francesco Angione, Paolo Bernardi, Giusy Iaria, Claudia Bertani, Vincenzo Tancorre. Automatic Generation of System-Level Test for Un-Core Logic of Large Automotive SoC
3210 -- 3222Lian Liu, Jinxin Yu, Mengdi Wang, Xiaowei Li 0001, Yinhe Han 0001, Ying Wang 0001. DNA: A General Dynamic Neural Network Accelerator
3223 -- 3237Yujun Xie, Yuan Liu 0022. An Efficient LUT6-Based Montgomery Modular Multiplication Using Radix-16 Booth Method
3238 -- 3250Hao Guo, Lei Yang 0024, Qingfeng Zhang, Jiannong Cao 0001. Hybrid Redundancy for Reliable Task Offloading in Collaborative Edge Computing

Volume 74, Issue 8

2529 -- 2541Yunpeng Song, Yujiong Liang, Jialin Liu, Liang Shi 0001. Prophet: SSD Failure Analysis and Prediction Guided by Flash Reliability Characteristics in Data Centers
2542 -- 2551Jin-tao Wang, Tian-Yu Ye. A Novel Two-Round Two-Party Quantum Private Comparison Protocol Based on Quantum Walks
2552 -- 2566Zhe Pan, Shuibing He, Xu Li, Xuechen Zhang 0001, Rui Wang 0076, Yanlong Yin, Gang Chen 0001. Advanced Maximal Biclique Enumeration on GPUs Using Bitmaps
2567 -- 2580Divya Praneetha Ravipati, Victor M. van Santen, Shivendra Singh Parihar, Yogesh Singh Chauhan, Preeti Ranjan Panda, Hussam Amrouch. Cryo-CACTI: Cryogenic-Aware CACTI for Cache Modeling Down to 10K in Advanced 7nm FinFETs
2581 -- 2592Onyeka Josephine Nwobodo, Godlove Suila Kuaban, Valery Nkemeni, Kamil Wereszczynski, Krzysztof A. Cyran. A Hybrid Adaptive Filter for Head Tracking in Augmented Reality (AR)-Based Flight Simulators
2593 -- 2607Xiaopeng Fan, Xiaoshuang Peng, Kaixin Huang, Chuliang Weng. Hyte: A Hotness-Aware Hybrid DRAM-PM Native Table Storage Engine
2608 -- 2620Daniel Bristot de Oliveira, Daniel Casini, Juri Lelli, Tommaso Cucinotta. Timerlat: Real-Time Linux Scheduling Latency Measurements, Tracing, and Analysis
2621 -- 2634Zijun Li 0001, Chenyang Wu, Chuhao Xu, Quan Chen 0002, Shuo Quan, Bin Zha, Qiang Wang, Weidong Han 0003, Jie Wu 0001, Minyi Guo. Lightweight and Holistic-Scalable Serverless Secure Container Runtime for High-Density Deployment and High-Concurrency Startup
2635 -- 2648Ji Li, Qiang He 0002, Xingwei Wang 0001, Ammar Hawbani, Keping Yu, Yuanguo Bi, Liang Zhao 0004. UAV-Assisted Microservice Mobile Edge Computing Architecture: Addressing Post-Disaster Emergency Medical Rescue
2649 -- 2662Weijian Chen 0002, Shuibing He, Ruidong Zhang, Xuechen Zhang 0001, Ping Chen, Siling Yang, Haoyang Qu, Xuan Zhan. ImPACT: Importance-Informed Prefetching and Caching for I/O-Bound DNN Training
2663 -- 2674Weiwei Lin 0001, Jinhui Lin, Haotong Zhang 0003, Wentai Wu, Weizheng Wu, Zhetao Li, Keqin Li 0001. Cacomp: A Cloud-Assisted Collaborative Deep Learning Compiler Framework for DNN Tasks on Edge
2675 -- 2686Weijie Fang, Yanggeng Fu, Jiaquan Gao, Longkun Guo, Gregory Z. Gutin, Xiaoyan Zhang 0001. Acceleration of Timing-Aware Gate-Level Logic Simulation Through One-Pass GPU Parallelism
2687 -- 2701Yilan Zhu, Honghui You, Wei Zhang 0173, Jiming Xu, Qian Lou, Shoumeng Yan, Lei Ju 0001. DAHE: Parameter-Adaptive and Memory Efficient FPGA Acceleration of Homomorphic Encryption
2702 -- 2716Jingya Wu, Wenyan Lu, Haishuang Fan, Hao Kong, Xiaowei Li 0001, Guihai Yan. KPU: Kernel Processing Unit for in-Memory Analytical Query Processing
2717 -- 2730Pingyi Huo, Theodore Michailidis, Yi Zheng, Prapti Panigrahi, Kiwan Maeng, Jishen Zhao, Vijaykrishnan Narayanan. SVDE: Serverless Framework for Low-Latency Video Analytic Queries With Hardware Disaggregation
2731 -- 2744MingYang Song, Zhongyun Hua, Yifeng Zheng, Qing Liao 0001, Xiaohua Jia. Enabling Verifiable Search and Integrity Auditing in Encrypted Decentralized Storage Using One Proof
2745 -- 2756Qinglun Li, Miao Zhang, Nan Yin, Quanjun Yin, Li Shen 0008, Xiaochun Cao. Asymmetrically Decentralized Federated Learning
2757 -- 2771Tharindu B. Hewage, Shashikant Ilager, Maria Alejandra Rodriguez, Rajkumar Buyya. A Framework for Carbon-Aware Real-Time Workload Management in Clouds Using Renewables-Driven Cores
2772 -- 2784Xiao Sui, Qichang Liu, Sisi Duan, Haibin Zhang. Pike: Two-Phase BFT With Linearity and Flexible View Change
2785 -- 2798Yuankai Xu, Yinchen Ni, Tiancheng He, Ruiqi Sun, Yier Jin, An Zou. Real-Time Scheduling and Analysis of Fixed-Priority Tasks on a Basic Heterogeneous Architecture With Multiple CPUs and Many PEs
2799 -- 2811Rui Kong, Yuanchun Li, Weijun Wang 0001, Linghe Kong, Yunxin Liu 0001. Serving MoE Models on Resource-Constrained Edge Devices via Dynamic Expert Swapping
2812 -- 2826Alejandro Valero, Vicente Lorente, Salvador Petit, Julio Sahuquillo. Dual Fast-Track Cache: Organizing Ring-Shaped Racetracks to Work as L1 Caches
2827 -- 2840Hai Zhou 0002, Dan Feng 0001, Yuchong Hu, Wei Wang, Huadong Huang. Fast Garbage Collection in Erasure-Coded Storage Clusters
2841 -- 2855Minghui Wu 0003, Dawei Sun 0001, Shang Gao 0003, Keqin Li 0001, Rajkumar Buyya. Ls-Stream: Lightening Stragglers in Join Operators for Skewed Data Stream Processing

Volume 74, Issue 7

2168 -- 2182Lingxiao Yang, Xuewen Dong, Zhiguo Wan, Di Lu 0001, Yushu Zhang 0001, Yulong Shen. HiCoCS: High Concurrency Cross-Sharding on Permissioned Blockchains
2183 -- 2194Yu Zhang, Renhai Chen, Hangyu Yan, Hongyue Wu, Zhiyong Feng 0002. DCAS-BMT: Dynamic Construction and Adjustment of Skewed Bonsai Merkle Tree for Performance Enhancement in Secure Non-Volatile Memory
2195 -- 2209Yitao Hu, Xiulong Liu, Guotao Yang, Linxuan Li, Kai Zeng, Zhixin Zhao, Sheng Chen 0008, Laiping Zhao, Wenxin Li 0001, Keqiu Li. TightLLM: Maximizing Throughput for LLM Inference via Adaptive Offloading Policy
2210 -- 2222Wenhao Sun, Wendi Sun, Song Chen 0001, Yi Kang. IOPS: A Unified SpMM Accelerator Based on Inner-Outer-Hybrid Product
2223 -- 2237Tong Li 0014, Wei Liu, Xinyu Ma, Shuaipeng Zhu, Jingkun Cao, Duling Xu, Zhaoqi Yang, Senzhen Liu, Taotao Zhang, Yinfeng Zhu 0002, Bo Wu 0002, Kezhi Wang, Ke Xu 0002. Accelerating Loss Recovery for Content Delivery Network
2238 -- 2252Sajjad Tamimi, Arthur Bernhardt, Florian Stock, Ilia Petrov 0001, Andreas Koch 0001. CINDA: Using Cache-Coherent Interconnects for Accelerating Databases by Enabling Near-Data Processing of Update Transactions
2253 -- 2266Tu Dinh Ngoc, Boris Teabe, Georges Da Costa, Daniel Hagimont. Virtual NVMe-Based Storage Function Framework With Fast I/O Request State Management
2267 -- 2277Weizhe Wang, Deng Tang. Differential Fault Attack on HE-Friendly Stream Ciphers: Masta, Pasta, and Elisabeth
2278 -- 2292Zhihong Deng, Chunming Tang 0003, Taotao Li, Zhikang Zeng, Parhat Abla, Debiao He. SFPoW: Constructing Secure and Flexible Proof-of-Work Sidechains for Cross-Chain Interoperability With Wrapped Assets
2293 -- 2305Guoqing Xiao 0001, Li Xia, Yuedan Chen, Hongyang Chen, Wangdong Yang. DCGG: A Dynamically Adaptive and Hardware-Software Coordinated Runtime System for GNN Acceleration on GPUs
2306 -- 2320Francesco Antognazza, Alessandro Barenghi, Gerardo Pelosi. An Efficient and Unified RTL Accelerator Design for HQC-128, HQC-192, and HQC-256
2321 -- 2333Lixia Han, Yiyang Chen, Siyuan Chen, Haozhang Yang, Ao Shi, Guihai Yu, Jiaqi Li, Zheng Zhou, Yijiao Wang, Yanzhi Wang, Xiaoyan Liu, JinFeng Kang, Peng Huang 0004. CIMUS: 3D-Stacked Computing-in-Memory Under Image Sensor Architecture for Efficient Machine Vision
2334 -- 2347Liang-Chi Chen, Chien-Chung Ho, Yuan-Hao Chang 0001. Accelerating RNA-Seq Quantification on a Real Processing-in-Memory System
2348 -- 2363Xiaoqian Wu, Peng Wang, Shaoquan Li, Huaxiao Liu, Lei Liu. An Area Optimization Approach for Large-Scale RM-TB Dual Logic Circuits Based on a Multitasking Optimization Algorithm
2364 -- 2375Yan Wang, Bo Lv, Quan Zhou 0003, Junfei Li, Tan Tan. Schedulability Analysis for Self-Suspending Tasks Under EDF-Like Scheduling
2376 -- 2388Yichuan Bai, Xiaopeng Zhang, Qian Wang, Yaqing Li, Yuan Du, Li Du. BE-NPU: A Bandwidth-Efficient Neural Processing Unit With Adaptive Processing Schemes for Reduced Off-Chip Bandwidth Demand
2389 -- 2401Junchao Li, Runsheng Hou, Guangyong Shang, Huanle Zhang, Xiuzhen Cheng 0001, Runyu Pan. FVM: Practical Feather-Weight Virtualization on Commodity Microcontrollers
2402 -- 2416Yunzhen Luo, Yan Ding 0004, Zhuo Tang, Keqin Li 0001, Kenli Li 0001, Chubo Liu. BEAST-GNN: A United Bit Sparsity-Aware Accelerator for Graph Neural Networks
2417 -- 2430Kaiwen Cao, Hanchen Ye, Yihan Pang, Deming Chen. MLCD: Machine Learning-Based Code Version and Device Selection for Heterogeneous Systems
2431 -- 2444Annachiara Ruospo, Matteo Sonza Reorda, Riccardo Mariani, Ernesto Sánchez 0001. An Effective Iterative Statistical Fault Injection Methodology for Deep Neural Networks
2445 -- 2460ShiJing Yuan, Beiyu Dong, Jie Li 0002, Song Guo 0001, Hongyang Chen, Chentao Wu, Jie Wu 0001, Wei Zhao 0001. Adaptive Incentivize for Federated Learning With Cloud-Edge Collaboration Under Multi-Level Information Sharing
2461 -- 2472Na Lin 0001, Zhijiang Wang, Liang Zhao 0004, Ammar Hawbani, Zhi Liu 0002, Mohsen Guizani. Optimizing Multi-AAV Cooperative Tracking for Real-Time Applications in Network-Challenged Environments
2473 -- 2486Jie Cui 0004, Wenting Zhuang, Hong Zhong 0001, Qingyang Zhang, Fengqun Wang, Debiao He. Conditional Privacy-Preserving Transaction for the Unspent Transaction Output-Based Multi-Chain Blockchain System
2487 -- 2500Bin Liu 0023, Yongyao Ma, Zijian Hu, Zeyu Ji, Zhenli He, Keqin Li 0001. GroPipe: A Grouped Pipeline Hybrid Parallel Method for Accelerating DCNNs Training
2501 -- 2514Feixue Han, Yike Wang, Yunbo Zhang, Qing Li 0006, Dayi Zhao, Yong Jiang 0001. Anole: A Pragmatic Blend of Classic and Learning-Based Algorithms in Congestion Control
2515 -- 2528Emre Karabulut, Arsalan Ali Malik, Amro Awad, Aydin Aysu. THEMIS: Time, Heterogeneity, and Energy Minded Scheduling for Fair Multi-Tenant Use in FPGAs

Volume 74, Issue 6

1814 -- 1828Taotao Li, Huawei Huang, Parhat Abla, Zhihong Deng, Qinglin Yang, Anke Xie, Debiao He, Zibin Zheng. DataFly: A Confidentiality-Preserving Data Migration Across Heterogeneous Blockchains
1829 -- 1843Weiling Yang, Pengyu Wang, Jianbin Fang, Dezun Dong, Zhengbin Pang, Runxi He, Peng Zhang 0061, Tao Tang 0001, Chun Huang, Yonggang Che, Jie Ren 0007. nDirect2: A High-Performance Library for Direct Convolutions on Multicore CPUs
1844 -- 1856Nang Hung Nguyen, Truong Thao Nguyen, Trong Nghia Hoang, Hieu H. Pham 0001, Thanh-Hung Nguyen, Phi-Le Nguyen. SAFA: Handling Sparse and Scarce Data in Federated Learning With Accumulative Learning
1857 -- 1871Yingkun Zhou, Zhengshuyuan Tian, Wenhao Yang, Tingting Zhang, Jinpeng Ye, Chenji Han, Tianyi Liu, Fuxin Zhang. ETBench: Characterizing Hybrid Vision Transformer Workloads Across Edge Devices
1872 -- 1885Won Hur, Jiwon Lee, Jaewon Kwon, Minjae Kim, Won Woo Ro. HashScape: Leveraging Virtual Address Dynamics for Efficient Hashed Page Tables
1886 -- 1896Lin Chen, Hao Feng 0010, Jiong Wu. A Path-Based Topology-Agnostic Fault Diagnosis Strategy for Multiprocessor Systems
1897 -- 1910Chuan Luo 0002, Shenghua Cao, Shanyu Guo, Chunming Hu. Towards Effective Local Search for Qubit Mapping
1911 -- 1920Cong Li, Qingni Shen, Zhonghai Wu. Redactable Blockchain From Decentralized Chameleon Hash Functions, Revisited
1921 -- 1935Minghao Tian, Yue Liang 0004, Bowen Liu, Dajiang Liu. CoSpMV: Towards Agile Software and Hardware Co-Design for SpMV Computation
1936 -- 1948Yuchen Liu, Ligang He, Zhigao Zhang, Shenyuan Ren. PFed-NS: An Adaptive Personalized Federated Learning Scheme Through Neural Network Segmentation
1949 -- 1962Xuhang Wang, Zhuoran Song, Chunyu Qi, Fangxin Liu, Naifeng Jing, Li Jiang 0002, Xiaoyao Liang. RTSA: A Run-Through Sparse Attention Framework for Video Transformer
1963 -- 1977Sizhe Zhang, Kyle Juretus, Xun Jiao. Exploring Hyperdimensional Computing Robustness Against Hardware Errors
1978 -- 1989Shahnawaz Ahmad, Mohd Arif, Shabana Mehfuz, Javed Ahmad, Mohd. Nazim. Deep Learning-Based Cloud Security: Innovative Attack Detection and Privacy Focused Key Management
1990 -- 2002Pengwei Wang 0001, Yi Li, Chao Fang, Yichen Zhong, Zhijun Ding. Optimizing Serverless Performance Through Game Theory and Efficient Resource Scheduling
2003 -- 2016Hui Dong, Huaqun Wang, Mengjie Lv, Weibei Fan. Reliable Communication Scheme Based on Completely Independent Spanning Trees in Data Center Networks
2017 -- 2030Dunbo Zhang, Li Shen 0007, Kai Lu. Eliminate Data Divergence in SpMV via Processor and Memory Co-Computing Framework
2031 -- 2044Hao Xu, Jiaqi Zhang, Xiulong Liu, Zhimin Yu, Tingyu Fan, Baochao Chen, Keqiu Li. Tangram: Enabling Efficient and Balanced Dynamic Storage Extension on Sharding Blockchain Systems
2045 -- 2057Xiulong Liu, Zhiyuan Zheng, Hao Xu, Zhelin Liang, Gaowei Shi, Chenyu Zhang, Keqiu Li. Enabling Consistent Sensing Data Sharing Among IoT Edge Servers via Lightweight Consensus
2058 -- 2072Zhican Wang, Hongxiang Fan, Guanghui He. DESA: Dataflow Efficient Systolic Array for Acceleration of Transformers
2073 -- 2086Yu Wen 0003, Aamir Bader Shah, Ruizhi Cao, Chen Zhang, Jiefu Chen, Xuqing Wu 0001, Chenhao Xie 0001, Xin Fu. AR-Light: Enabling Fast and Lightweight Multi-User Augmented Reality via Semantic Segmentation and Collaborative View Synchronization
2087 -- 2100Davide Zoni, Andrea Galimberti, Davide Galli. An FPGA-Based Open-Source Hardware-Software Framework for Side-Channel Security Research
2101 -- 2113Pengyu Mu, Yi Liu 0013, Rui Wang 0014, Guoxiang Liu, Hangcheng An, Qianhe Zhao, Hailong Yang, Chenhao Xie 0001, Zhongzhi Luan, Chunye Gong, Depei Qian. Deep Learning Operators Performance Tuning for Changeable Sized Input Data on Tensor Accelerate Hardware
2114 -- 2128Jianwen Luo 0004, Yuhao Shu, Yajun Ha. RSQC: Recursive Sparse QUBO Construction for Quantum Annealing Machines
2129 -- 2143Wenjie Liu 0001, Bingmei Su, Feiyang Sun. Efficient Quantum Secure Vector Dominance and Its Applications in Computational Geometry
2144 -- 2158Zhaolong Jian, Xu Liu, Qiankun Dong, Longkai Cheng, Xueshuo Xie, Tao Li 0022. SmartZone: Runtime Support for Secure and Efficient On-Device Inference on ARM TrustZone
2159 -- 2167Peng Chen, Jinnuo Li, Wei Cheng 0003, Chi Cheng. Uncover Secrets Through the Cover: A Deep Learning-Based Side-Channel Attack Against Kyber Implementations With Anti-Tampering Covers

Volume 74, Issue 5

1470 -- 1481Peijun Ma, Jie Li, Hongjin Liu, Jiangyi Shi, Shaolin Zhang, Weitao Pan, Yue Hao 0001. Hardware Trojan Detection Methods for Gate-Level Netlists Based on Graph Neural Networks
1482 -- 1494Agustín Navarro-Torres, Biswabandan Panda, Jesús Alastruey-Benedé, Pablo Ibáñez 0001, Víctor Viñals Yúfera, Alberto Ros 0001. A Complexity-Effective Local Delta Prefetcher
1495 -- 1509Chuang Li 0004, Changyao Tan, Gang Liu, Yanhua Wen, Yan Wang 0022, Kenli Li 0001. DC-ORAM: An ORAM Scheme Based on Dynamic Compression of Data Blocks and Position Map
1510 -- 1523Liang Zhao 0004, Zijia Zhao, Ammar Hawbani, Zhi Liu 0002, Zhiyuan Tan 0001, Keping Yu. Dynamic Caching Dependency-Aware Task Offloading in Mobile Edge Computing
1524 -- 1537Chengqing Li, Kai Tan 0006. The Graph Structure of Baker's Maps Implemented on a Computer
1538 -- 1548Dongxiao Yu, Yuan Yuan, Yifei Zou, Xiao Zhang 0015, Yu Liu, LiZhen Cui, Xiuzhen Cheng 0001. Pruning-Based Adaptive Federated Learning at the Edge
1549 -- 1564Johannes Bund, Christoph Lenzen 0001, Moti Medina. Small Hazard-Free Transducers
1565 -- 1576Xin Niu 0001, Xianwei Lv 0001, Wang Chen, Chen Yu 0003, Hai Jin 0001. Computing Tasks Saving Schemes Through Early Exit in Edge Intelligence-Assisted Systems
1577 -- 1588Yiwen Zhang 0002, Hui Zheng. Slack Time Management for Imprecise Mixed-Criticality Systems With Reliability Constraints
1589 -- 1604Liangyuan Wang, Xudong Liu, Haonan Ding, Yi Hu, Kai Peng 0001, Menglan Hu. Energy-Delay-Aware Joint Microservice Deployment and Request Routing With DVFS in Edge: A Reinforcement Learning Approach
1605 -- 1619Hui Sun 0002, Xiangxiang Jiang, Yinliang Yue, Xiao Qin 0001. RGKV: A GPGPU-Empowered Compaction Framework for LSM-Tree-Based KV Stores With Optimized Data Transfer and Parallel Processing
1620 -- 1633Jaekang Shin, Myeonggu Kang, Yunki Han, Junyoung Park, Lee-Sup Kim. AToM: Adaptive Token Merging for Efficient Acceleration of Vision Transformer
1634 -- 1648Xun Wang, Xiangyu Meng 0005, Zhuoqiang Guo, Mingzhen Li, Lijun Liu, Mingfan Li, Qian Xiao, Tong Zhao, Ninghui Sun, Guangming Tan, Weile Jia. 29-Billion Atoms Molecular Dynamics Simulation With Ab Initio Accuracy on 35 Million Cores of New Sunway Supercomputer
1649 -- 1662Zewen Ye, Junhao Huang 0001, Tianshun Huang, Yudan Bai, Jinze Li, Hao Zhang, Guangyan Li, Donglong Chen, Ray C. C. Cheung, Kejie Huang. PQNTRU: Acceleration of NTRU-Based Schemes via Customized Post-Quantum Processor
1663 -- 1677Shiyuan Xu, Xue Chen, Yu Guo 0003, Yuer Yang, Shengling Wang 0001, Siu-Ming Yiu, Xiuzhen Cheng 0001. Lattice-Based Forward Secure Multi-User Authenticated Searchable Encryption for Cloud Storage Systems
1678 -- 1689Lianbo Ma, Yuee Zhou, Ye Ma, Guo Yu 0001, Qing Li 0006, Qiang He 0002, Yan Pei 0001. Defying Multi-Model Forgetting in One-Shot Neural Architecture Search Using Orthogonal Gradient Learning
1690 -- 1701Hao Lu, Jian Liu 0012, Kui Ren 0001. Aurora: Leaderless State-Machine Replication With High Throughput
1702 -- 1716Haiyang Yu 0001, Hui Zhang, Zhen Yang 0004, Yuwen Chen, Huan Liu 0001. Efficient and Secure Storage Verification in Cloud-Assisted Industrial IoT Networks
1717 -- 1729Qingyang Zhang, Shuai Qian, Jie Cui 0004, Hong Zhong 0001, Fengqun Wang, Debiao He. Blockchain-Based Privacy-Preserving Deduplication and Integrity Auditing in Cloud Storage
1730 -- 1742Jaeyoung Kang 0001, Minxuan Zhou, Weihong Xu, Tajana Rosing. RelHDx: Hyperdimensional Computing for Learning on Graphs With FeFET Acceleration
1743 -- 1757Kaiqi Yang 0002, Qiang He 0002, Xingwei Wang 0001, Zhi Liu 0002, Yufei Liu, Min Huang 0001, Liang Zhao 0004. KDN-Based Adaptive Computation Offloading and Resource Allocation Strategy Optimization: Maximizing User Satisfaction
1758 -- 1770Dan Tang, Rui Dai, Chenguang Zuo, Jingwen Chen, Keqin Li 0001, Zheng Qin 0001. A Low-Rate DoS Attack Mitigation Scheme Based on Port and Traffic State in SDN
1771 -- 1785Zhetao Li, Yong Xiao, Haolin Liu, Xiaofei Liao, Ye Yuan 0001, Junzhao Du. Dynamic Graph Publication With Differential Privacy Guarantees for Decentralized Applications
1786 -- 1799Mahsa Heidari, Bijan Alizadeh. Localizing Multiple Bugs in RTL Designs by Classifying Hit-Statements Using Neural Networks
1800 -- 1813Fei Lyu 0002, Yuanyong Luo, Weiqiang Liu 0001. An Efficient Methodology for Binary Logarithmic Computations of Floating-Point Numbers With Normalized Output Within One ulp of Accuracy

Volume 74, Issue 4

1109 -- 1122Zerun Li, Xiaoming Chen 0003, Yuxin Yang 0002, Feng Min, Xiaoyu Zhang 0009, Yinhe Han 0001. A Data-Centric Software-Hardware Co-Designed Architecture for Large-Scale Graph Processing
1123 -- 1137Xingyan Chen, Tian Du, Mu Wang, Tiancheng Gu, Yu Zhao 0019, Gang Kou, Changqiao Xu, Dapeng Oliver Wu. Towards Optimal Customized Architecture for Heterogeneous Federated Learning With Contrastive Cloud-Edge Model Decoupling
1138 -- 1151Jinquan Wang, Zhisheng Huo, Limin Xiao, Jinqian Yang, Jiantong Huo, Minyi Guo. Hierarchical Hashing: A Dynamic Hashing Method With Low Write Amplification and High Performance for Non-Volatile Memory
1152 -- 1167Jinkai Zhang, Yinghao Yang, Zhe Zhou, Zhicheng Hu, Xin Zhao, Liang Chang 0002, Hang Lu, Xiaowei Li 0001. Trident: The Acceleration Architecture for High-Performance Private Set Intersection
1168 -- 1181Mingyuan Zhao, Hao Sheng 0001, Rongshan Chen, Ruixuan Cong, Tun Wang, Zhenglong Cui, Da Yang 0001, Shuai Wang 0027, Wei Ke 0001. A GPU-Enabled Framework for Light Field Efficient Compression and Real-Time Rendering
1182 -- 1195Haotian Wang 0006, Yan Ding 0004, Yumeng Liu, Weichen Liu, Chubo Liu, Wangdong Yang, Kenli Li 0001. A Context-Awareness and Hardware-Friendly Sparse Matrix Multiplication Kernel for CNN Inference Acceleration
1196 -- 1209Weijie Liu, Kai Lu, Zhiquan Lai, Shengwei Li, Keshi Ge, Dongsheng Li 0001, Xicheng Lu. AutoPipe-H: A Heterogeneity-Aware Data-Paralleled Pipeline Approach on Commodity GPU Servers
1210 -- 1223Kejun Guo, Fuliang Li, Jiaxing Shen, Xing-Wei Wang 0001, Jiannong Cao 0001. Distributed Sketch Deployment for Software Switches
1224 -- 1238Yingxue Gao, Teng Wang, Lei Gong, Chao Wang, Dong Dai 0001, Yang Yang 0080, Xianglan Chen, Xi Li, Xuehai Zhou. Hardware Accelerated Vision Transformer via Heterogeneous Architecture Design and Adaptive Dataflow Mapping
1239 -- 1252Yunho Jang, Dongsu Kim, Yeseul Kim, Jongsun Park 0001. Big-Computing and Little-Storing STT-MRAM PIM Architecture With Charge Domain Based MAC Operation
1253 -- 1266Ke Xu, Ming Tang 0002, Quancheng Wang, Han Wang. Microarchitectural Attacks and Mitigations on Retire Resources in Modern Processors
1267 -- 1277Chun Huang, Jiaying Shao, Baolei Peng, Qingshuang Guo, Panlong Li, Junwei Sun, Yanfeng Wang. Design of a Universal Decoder Model Based on DNA Winner-Takes-All Neural Networks
1278 -- 1292Giusy Iaria, Paolo Bernardi, Claudia Bertani, Lorenzo Cardone, Giuseppe Garozzo, Vincenzo Tancorre. A Comprehensive Scan Test Cost Model to Optimize the Production of Very Large SoCs
1293 -- 1305Jaeil Lim, Jaewon Chung, Donghun Jeong, Daegeun Jee, Euicheol Lim. A New ECC Configuration Method for DRAM System Considering Metadata
1306 -- 1321Kaijie Wei, Hideharu Amano, Ryohei Niwase, Yoshiki Yamaguchi, Takefumi Miyoshi. Qu-Trefoil: Large-Scale Quantum Circuit Simulator Working on FPGA With SATA Storages
1322 -- 1333Yulong Li, Wenxin Li 0001, Yuxuan Du, Yinan Yao, Song Zhang, Linxuan Zhong, Keqiu Li. Flexible Job Scheduling With Spatial-Temporal Compatibility for In-Network Aggregation
1334 -- 1347Sirong Zhao, Guoqi Xie, Chenglai Xiong, Kenli Li 0001, Xuejun Yu, Bo Wan, Yiwen Jiang. AVL Function Table for LeafHooks Insertion With Obfuscated Control Flow Integrity
1348 -- 1361Peyman Dehghanzadeh, Ovishake Sen, Baibhab Chatterjee, Swarup Bhunia. LUNA-CiM: A Programmable Compute-in-Memory Fabric for Neural Network Acceleration
1362 -- 1376Jin Ye, Yajun Peng, Yijun Li 0002, Zhaoyi Li, Jiawei Huang 0001. Asynchronous Control Based Aggregation Transport Protocol for Distributed Deep Learning
1377 -- 1391Trevor E. Pogue, Nicola Nicolici. Karatsuba Matrix Multiplication and Its Efficient Custom Hardware Implementations
1392 -- 1404Guangkuo Yang, Meng Zhang 0014, Peng Guo, Xuepeng Zhan, Shaoqi Yang, Xiaohuan Zhao, Xinyi Guo, Pengpeng Sang, Jixuan Wu, Fei Wu 0005, Jiezhi Chen. High-Precision Error Bit Prediction for 3D QLC NAND Flash Memory: Observations, Analysis, and Modeling
1405 -- 1417Li Yang, Wei Zhang, Yinbin Miao, Yanrong Liang, Xinghua Li 0001, Kim-Kwang Raymond Choo, Robert H. Deng. Secure and Efficient Cross-Modal Retrieval Over Encrypted Multimodal Data
1418 -- 1430Abdulbary Naji, Xingfu Wang, Ping Liu 0008, Ammar Hawbani, Liang Zhao 0004, Xiaohua Xu 0002, Fuyou Miao. NetCRC-NR: In-Network 5G NR CRC Accelerator
1431 -- 1445Zhixin Zhao, Yitao Hu, Guotao Yang, Ziqi Gong, Chen Shen, Laiping Zhao, Wenxin Li 0001, Xiulong Liu, Wenyu Qu. SLOpt: Serving Real-Time Inference Pipeline With Strict Latency Constraint
1446 -- 1460Vasileios Titopoulos, Kosmas Alexandridis, Christodoulos Peltekis, Chrysostomos Nicopoulos, Giorgos Dimitrakopoulos. Optimizing Structured-Sparse Matrix Multiplication in RISC-V Vector Processors
1461 -- 1469Argyris Kokkinis, Georgios Zervakis 0001, Kostas Siozios, Mehdi Baradaran Tahoori, Jörg Henkel. Enabling Printed Multilayer Perceptrons Realization via Area-Aware Neural Minimization

Volume 74, Issue 3

749 -- 761Kento Hasegawa, Kazuki Yamashita, Seira Hidano, Kazuhide Fukushima, Kazuo Hashimoto, Nozomu Togawa. Node-Wise Hardware Trojan Detection Based on Graph Learning
762 -- 775Pietro Nannipieri, Luca Crocetti, Stefano Di Matteo, Luca Fanucci, Sergio Saponara. Hardware Design of an Advanced-Feature Cryptographic Tile Within the European Processor Initiative
776 -- 789Qin Jiang, Saiyu Qi, Xu Yang, Yong Qi, Jianfeng Wang, Youshui Lu, Bochao An, Ee-Chien Chang. Reducing Paging and Exit Overheads in Intel SGX for Oblivious Conjunctive Keyword Search
790 -- 804Florian Bache, Jonas Wloka, Pascal Sasdrich, Tim Güneysu. Multivariate TVLA - Efficient Side-Channel Evaluation Using Confidence Intervals
805 -- 819Wei He, Zhi Zhang 0001, Yueqiang Cheng, Wenhao Wang 0001, Wei Song 0002, Yansong Gao 0001, Qifei Zhang 0001, Kang Li, Dongxi Liu, Surya Nepal. WhistleBlower: A System-Level Empirical Study on RowHammer
820 -- 834Durba Chatterjee, Aritra Hazra, Debdeep Mukhopadhyay. PARLE-G: Provable Automated Representation and Analysis Framework for Learnability Evaluation of Generic PUF Compositions
835 -- 847Jiangshan Long, Changhai Ou, Yajun Ma, Yifan Fan, Hua Chen 0011, Shihui Zheng. How to Launch a Powerful Side-Channel Collision Attack?
848 -- 859Deniz Gurevin, Chenglu Jin, Phuong Ha Nguyen, Omer Khan, Marten van Dijk. Secure Remote Attestation With Strong Key Insulation Guarantees
860 -- 874Zhixin Pan, Prabhat Mishra 0001. AI Trojan Attack for Evading Machine Learning-Based Detection of Hardware Trojans
875 -- 886Servio Paguada, Lejla Batina, Ileana Buhan, Igor Armendariz. Being Patient and Persistent: Optimizing An Early Stopping Strategy for Deep Learning in Profiled Attacks
887 -- 900Xiaohai Dai, Zhengxuan Guo, Jiang Xiao, Guanxiong Wang, Yifei Liang, Chen Yu 0003, Hai Jin 0001. Pako: Multi-Valued Byzantine Agreement Comparable to Partially-Synchronous BFT
901 -- 914Liang Zhang, Zhanrong Ou, Changhui Hu, Haibin Kan, Jiheng Zhang. Data Sharing in the Metaverse With Key Abuse Resistance Based on Decentralized CP-ABE
915 -- 928Guanlin Jing, Yifei Zou, Minghui Xu 0001, Yanqiang Zhang, Dongxiao Yu, Zhiguang Shan, Xiuzhen Cheng 0001, Rajiv Ranjan 0001. Nicaea: A Byzantine Fault Tolerant Consensus Under Unpredictable Message Delivery Failures for Parallel and Distributed Computing
929 -- 943Zhonghua Wang, Kai Lu, Jiguang Wan, Hong Jiang 0001, Zeyang Zhao, Peng Xu, Biliang Lai, Guokuan Li, Changsheng Xie. NStore: A High-Performance NUMA-Aware Key-Value Store for Hybrid Memory
944 -- 954Daniel Báscones, Francisco Garcia-Herrero, Oscar Ruano, Carlos González 0002, Daniel Mozos, Juan Antonio Maestro. Protecting the CCSDS 123.0-B-2 Compression Algorithm Against Single-Event Upsets for Space Applications
955 -- 967Yusheng Hua, Xuanhua Shi, Ligang He, Kang He, Teng Zhang 0001, Hai Jin 0001, Yong Chen 0001. RuYi: Optimizing Burst Buffer Through Automated, Fine-Grained Process-to-BB Mapping
968 -- 982Qin Hua, Dingyu Yang, Shiyou Qian, Jian Cao 0001, Guangtao Xue, Minglu Li 0001. Humas: A Heterogeneity- and Upgrade-Aware Microservice Auto-Scaling Framework in Large-Scale Data Centers
983 -- 995Hui Chen 0015, Lianghua Quan, Ke Chen 0018, Weiqiang Liu 0001. High-Radix Generalized Hyperbolic CORDIC and Its Hardware Implementation
996 -- 1010Qinghui Hong, Haoyou Jiang, Pingdan Xiao, Sichun Du, Tao Li. A Parallel Computing Scheme Utilizing Memristor Crossbars for Fast Corner Detection and Rotation Invariance in the ORB Algorithm
1011 -- 1024Hafiz Adnan Niaz, Ravi Reddy Manumachu, Alexey L. Lastovetsky. Accurate and Reliable Energy Measurement and Modelling of Data Transfer Between CPU and GPU in Parallel Applications on Heterogeneous Hybrid Platforms
1025 -- 1039Seungyong Lee 0003, Sanghyun Lee, Minseok Seo, Chunmyung Park, Woojae Shin, Hyuk-Jae Lee, Hyun Kim 0001. NPC: A Non-Conflicting Processing-in-Memory Controller in DDR Memory Systems
1040 -- 1052Zhigao Zheng 0001, Guojia Wan, Jiawei Jiang 0001, Chuang Hu, Hao Liu, Shahid Mumtaz, Bo Du 0001. Lock-Free Triangle Counting on GPU
1053 -- 1065Holger Boche, Yannik N. Böck, Zoe Garcia del Toro, Frank H. P. Fitzek. Feynman Meets Turing: The Uncomputability of Quantum Gate-Circuit Emulation and Concatenation
1066 -- 1078Hongcheng Xie, Yu Guo 0003, Yinbin Miao, Xiaohua Jia. Access-Pattern Hiding Search Over Encrypted Databases by Using Distributed Point Functions
1079 -- 1093Xiaofeng Zou, Cen Chen 0001, Hongen Shao, Qinyu Wang, Xiaobin Zhuang, Yangfan Li, Keqin Li 0001. ReViT: Vision Transformer Accelerator With Reconfigurable Semantic-Aware Differential Attention
1094 -- 1108Borui Li 0001, Hongchang Fan, Yi Gao 0001, Wei Dong 0001. WaWoT: Towards Flexible and Efficient Web of Things Services via WebAssembly on Resource-Constrained IoT Devices

Volume 74, Issue 2

341 -- 355Chen Zhang 0001, Yang Wang 0053, Zhiqiang Xie, Cong Guo 0003, Yunxin Liu, Jingwen Leng, Zhigang Ji, Yuan Xie 0001, Ru Huang 0001. DSTC: Dual-Side Sparse Tensor Core for DNNs Acceleration on Modern GPU Architectures
356 -- 370Yi Bian, Fangyu Zheng, Yuewu Wang, Lingguang Lei, Yuan Ma, Tian Zhou, Jiankuo Dong, Guang Fan, Jiwu Jing. AsyncGBP+: Bridging SSL/TLS and Heterogeneous Computing Power With GPU-Based Providers
371 -- 385Hongmin Li, Si Wu 0003, Zhipeng Li 0005, Qianli Wang, Yongkun Li 0001, Yinlong Xu. Enabling High Performance and Resource Utilization in Clustered Cache via Hotness Identification, Data Copying, and Instance Merging
386 -- 400Han Zhao 0005, Junxiao Deng, Weihao Cui, Quan Chen 0002, Youtao Zhang, Deze Zeng, Minyi Guo. Adaptive Kernel Fusion for Improving the GPU Utilization While Ensuring QoS
401 -- 413Xinyi Ji, Jiankuo Dong, Junhao Huang 0001, Zhijian Yuan, Wangchen Dai, Fu Xiao 0001, Jingqiang Lin 0001. ECO-CRYSTALS: Efficient Cryptography CRYSTALS on Standard RISC-V ISA
414 -- 426Zixuan Zhu 0001, Xiaolong Zhou, Chundong Wang 0001, Li Tian, Zunkai Huang, Yongxin Zhu 0001. Bit-Sparsity Aware Acceleration With Compact CSD Code on Generic Matrix Multiplication
427 -- 441Mengke Ge, Junpeng Wang 0002, Binhan Chen, Yingjian Zhong, Haitao Du, Song Chen 0001, Yi Kang. Allspark: Workload Orchestration for Visual Transformers on Processing In-Memory Systems
442 -- 454Liang Zhao 0004, Shuo Li, Zhiyuan Tan 0001, Ammar Hawbani, Stelios Timotheou, Keping Yu. A Multi-UAV Cooperative Task Scheduling in Dynamic Environments: Throughput Maximization
455 -- 467Jiguo Li 0001, Licheng Ji, Yichen Zhang 0003, Yang Lu 0001, Jianting Ning. Response-Hiding and Volume-Hiding Verifiable Searchable Encryption With Conjunctive Keyword Search
468 -- 482Xinglei Chen, Zinuo Cai, Hanwen Zhang, Ruhui Ma, Rajkumar Buyya. FasDL: An Efficient Serverless-Based Training Architecture With Communication Optimization and Resource Configuration
483 -- 494Yixun Wei, Zhichao Cao 0002, David H. C. Du. CPI: A Collaborative Partial Indexing Design for Large-Scale Deduplication Systems
495 -- 509Miao Cai, Junru Shen, Yifan Yuan, Zhihao Qu, Baoliu Ye. Scaling Persistent In-Memory Key-Value Stores Over Modern Tiered, Heterogeneous Memory Hierarchies
510 -- 525Xiaohang Wang 0001, Yifan Wang, Yingtao Jiang, Amit Kumar Singh 0002, Mei Yang 0001. On Task Mapping in Multi-chiplet Based Many-Core Systems to Optimize Inter- and Intra-chiplet Communications
526 -- 541Victor Jean-Baptiste Jung, Alessio Burrello, Moritz Scherer 0001, Francesco Conti 0001, Luca Benini. Optimizing the Deployment of Tiny Transformers on Low-Power MCUs
542 -- 554Huize Li, Dan Chen 0006, Tulika Mitra. SADIMM: Accelerating Sparse Attention Using DIMM-Based Near-Memory Processing
555 -- 568Jingweijia Tan, Jiashuo Wang, Kaige Yan, Xiaohui Wei 0002, Xin Fu. Evaluating GPU's Instruction-Level Error Characteristics Under Low Supply Voltages
569 -- 581Mehdi Ghasemi 0003, Soroush Heidari, Young-geun Kim, Carole-Jean Wu, Sarma B. K. Vrudhula. Energy-Efficient, Delay-Constrained Edge Computing of a Network of DNNs
582 -- 596Jordi Fornt, Enrico Reggiani, Pau Fontova-Musté, Narcís Rodas, Alessandro Pappalardo, Osman Sabri Unsal, Adrián Cristal Kestelman, Josep Altet, Francesc Moll, Jaume Abella 0001. Mix-GEMM: Extending RISC-V CPUs for Energy-Efficient Mixed-Precision DNN Inference Using Binary Segmentation
597 -- 608Tayebeh Karimi, Arezoo Kamran. Energy-Delay Efficient Segmented Approximate Adder With Smart Chaining
609 -- 622Fuliang Li, Qin Chen, Jiaxing Shen, Xing-Wei Wang 0001, Jiannong Cao 0001. Performance Characteristics and Guidelines of Offloading Middleboxes Onto BlueField-2 DPU
623 -- 636Sneha Agarwal, Keshav Goel, Mitali Sinha, Sujay Deb. Mitigation of Phase Transitions in Self-Organizing NoC for Stable Queueing Dynamics
637 -- 651Abdoulaye Gamatié, Yuyang Wang, Diego Valdez Duran. Uncovering the Intricacies and Synergies of Processor Microarchitecture Mechanisms Using Explainable AI
652 -- 664Nicolò Bellarmino, Riccardo Cantoro, Sophie M. Fosson, Martin Huch, Tobias Kilian, Ulf Schlichtmann, Giovanni Squillero. COSMO: COmpressed Sensing for Models and Logging Optimization in MCU Performance Screening
665 -- 677Abdulbary Naji, Xingfu Wang, Ammar Hawbani, Aiman Ghannami, Liang Zhao 0004, Xiaohua Xu 0002, Wei Zhao 0023. NetMod: Toward Accelerating Cloud RAN Distributed Unit Modulation Within Programmable Switches
678 -- 690Song Liu 0007, Jie Ma, Zengyuan Zhang, Xinhe Wan, Bo Zhao 0019, Weiguo Wu. Scalpel: High Performance Contention-Aware Task Co-Scheduling for Shared Cache Hierarchy
691 -- 704Xiaofeng Hou, Cheng Xu, Chao Li 0009, Jiacheng Liu 0001, Xuehan Tang, Kwang-Ting Cheng, Minyi Guo. Improving Efficiency in Multi-Modal Autonomous Embedded Systems Through Adaptive Gating
705 -- 716Chenhong Luo, Yong Wang 0009, Yanjun Zhang, Leo Yu Zhang. Distributed Differentially Private Matrix Factorization for Implicit Data via Secure Aggregation
717 -- 730Zhihao Qu, Ninghui Jia, Baoliu Ye, Shihong Hu, Song Guo 0001. FedQClip: Accelerating Federated Learning via Quantized Clipped SGD
731 -- 739Francesco Angione, Paolo Bernardi, Nicola Di Gruttola Giardino, Gabriele Filipponi, Claudia Bertani, Vincenzo Tancorre. A System-Level Test Methodology for Communication Peripherals in System-on-Chips
740 -- 748Kevin Kim, Katherine Parry, David M. Harris, Cedar Turek, Alessandro Maiuolo, Rose Thompson, James E. Stine. Shared Recurrence Floating-Point Divide/Sqrt and Integer Divide/Remainder With Early Termination

Volume 74, Issue 1

1 -- 14Chao Chen, Chengyu Liu 0001, Jianqing Li 0002, Bruno da Silva 0001. Acceleration of Fast Sample Entropy for FPGAs
15 -- 28Ziheng Wang 0002, Xiaoshe Dong, Heng Chen 0002, Yan Kang 0005, Qiang Wang. +
29 -- 42Yujun Xie, Yuan Liu 0022, Xin Zheng 0001, Bohan Lan, Dengyun Lei, Dehao Xiang, Shuting Cai, Xiaoming Xiong. FLALM: A Flexible Low Area-Latency Montgomery Modular Multiplication on FPGA
43 -- 56Kaniz Mishty, Mehdi Sadi. Chiplet-Gym: Optimizing Chiplet-Based AI Accelerator Design With Reinforcement Learning
57 -- 70Xiaohai Dai, Wei Li 0058, Guanxiong Wang, Jiang Xiao, Haoyang Chen, Shufei Li, Albert Y. Zomaya, Hai Jin 0001. Remora: A Low-Latency DAG-Based BFT Through Optimistic Paths
71 -- 85Jingjin Li, Weixiong Jiang, Yuting He, Qingyu Yang, Anqi Gao, Yajun Ha, Ender Özcan, Ruibin Bai, Tianxiang Cui, Heng Yu 0001. FiDRL: Flexible Invocation-Based Deep Reinforcement Learning for DVFS Scheduling in Embedded Systems
86 -- 100Tie Qiu 0001, Jingchen Sun, Ning Chen 0008, Songwei Zhang, Weisheng Si, Xingwei Wang 0001. Olive-Like Networking: A Uniformity Driven Robust Topology Generation Scheme for IoT System
101 -- 115Yannis Steve Nsuloun Fotse, Vianney Kengne Tchendji, Mthulisi Velempini. Federated Learning Based DDoS Attacks Detection in Large Scale Software-Defined Network
116 -- 130Zezhong Ding, Deyu Kong, Zhuoxu Zhang, Xike Xie, Jianliang Xu. ClusPar: A Game-Theoretic Approach for Efficient and Scalable Streaming Edge Partitioning
131 -- 142Sreenitha Kasarapu, Sai Manoj Pudukotai Dinakarrao. Performance and Environment-Aware Advanced Driving Assistance Systems
143 -- 154Shahab Mirzaei-Teshnizi, Parviz Keshavarzi. Parallel Modular Multiplication Using Variable Length Algorithms
155 -- 169Jun Bi, Yuanbo Wen, Xiaqing Li, Yongwei Zhao, Yuxuan Guo, Enshuai Zhou, Xing Hu 0001, Zidong Du, Ling Li 0001, Huaping Chen 0001, Tianshi Chen 0002, Qi Guo 0001. Efficient and Fast High-Performance Library Generation for Deep Learning Accelerators
170 -- 184Pan Zhang, Lei Xu 0019, Lin Mei, Chungen Xu. Sketch-Based Adaptive Communication Optimization in Federated Learning
185 -- 199Jinyi Deng, Xinru Tang, Jiahao Zhang, Yuxuan Li, Linyun Zhang, Fengbin Tu, Shaojun Wei, Yang Hu 0001, Shouyi Yin. Rethinking Control Flow in Spatial Architectures: Insights Into Control Flow Plane Design
200 -- 209Yohan Chatelain, Loïc Tetrel, Christopher J. Markiewicz, Mathias Goncalves, Gregory Kiar, Oscar Esteban, Pierre Bellec, Tristan Glatard. A Numerical Variability Approach to Results Stability Tests and Its Application to Neuroimaging
210 -- 221Jinhua Zhu, Zhen Gao 0005, Pedro Reviriego, Shanshan Liu 0001, Fabrizio Lombardi. Dependability of the K Minimum Values Sketch: Protection and Comparative Analysis
222 -- 236Yaodong Huang, Tingting Yao, Zelin Lin, Xiaojun Shang, Yukun Yuan 0001, Laizhong Cui, Yuanyuan Yang 0001. Efficient Service Function Chain Placement Over Heterogeneous Devices in Deviceless Edge Computing Environments
237 -- 249Arne Symons, Linyan Mei, Steven Colleman, Pouya Houshmand, Sebastian Karl, Marian Verhelst. Stream: Design Space Exploration of Layer-Fused DNNs on Heterogeneous Dataflow Accelerators
250 -- 262Yi Liu 0057, Song Guo 0001, Jie Zhang 0076, Zicong Hong, Yufeng Zhan, Qihua Zhou. Collaborative Neural Architecture Search for Personalized Federated Learning
263 -- 277Yao Xin, Chengjun Jia, Wenjun Li, Ori Rottenstreich, Yang Xu 0010, Gaogang Xie, Zhihong Tian, Jun Li 0002. A Heterogeneous and Adaptive Architecture for Decision-Tree-Based ACL Engine on FPGA
278 -- 292Benteng Zhang, Yingchi Mao, Xiaoming He, Huawei Huang, Jie Wu 0001. Balancing Privacy and Accuracy Using Significant Gradient Protection in Federated Learning
293 -- 306Davide Galli, Francesco Lattari, Matteo Matteucci, Davide Zoni. A Deep Learning-Assisted Template Attack Against Dynamic Frequency Scaling Countermeasures
307 -- 315Dina A. Moussa, Michael Hefenbrock, Mehdi B. Tahoori. Compressed Test Pattern Generation for Deep Neural Networks
316 -- 324Ghassem Jaberipur, Elham Rahman, Jeong-A Lee. Balanced Modular Addition for the Moduli Set $\{2^{q},2^{q}\mp 1,2^{2q}+1\}$ via Moduli-($2^{q}\mp \sqrt{-1}$) Adders
325 -- 333Aodong Chen, Fei Xu, Li Han 0001, Yuan Dong, Li Chen 0019, Zhi Zhou 0006, Fangming Liu. Opara: Exploiting Operator Parallelism for Expediting DNN Inference on GPUs