researchr
explore
Tags
Journals
Conferences
Authors
Profiles
Groups
calendar
New Conferences
Events
Deadlines
search
search
You are not signed in
Sign in
Sign up
Links
Filter by Year
[-]
OR
AND
NOT
1
1988
1990
1991
1993
1995
1997
1999
2001
2003
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
2026
Filter by Tag
[+]
OR
AND
NOT
1
C++
Meta-Environment
analysis
architecture
caching
compiler
data-flow
data-flow programming
e-science
meta-model
modeling
open-source
optimization
parallel programming
principles
program analysis
programming
rule-based
source-to-source
synchronization
Filter by Author
[+]
OR
AND
NOT
1
Andreia Correia
Dan Alistarh
Dingwen Tao
Erez Petrank
Gagan Agrawal
Guangming Tan
Guy E. Blelloch
Haibo Chen
Jidong Zhai
John M. Mellor-Crummey
Keshav Pingali
Kunal Agrawal
Martin Burtscher
Michael L. Scott
P. Sadayappan
Pedro Ramalhete
Rudolf Eigenmann
Torsten Hoefler
Vivek Sarkar
Xipeng Shen
Filter by Top terms
[+]
OR
AND
NOT
1
applications
concurrent
data
distributed
efficient
gpu
gpus
high
memory
model
multi
parallel
parallelism
performance
programming
programs
scalable
systems
transactional
using
PPOPP (ppopp)
Editions
Publications
Viewing Publication 1 - 100 from 1504
2026
A Distributed Matrix-Block-Vector Multiplication in Presence of System Performance Variability
Yuchen Ma 0001
,
Bin Ren 0002
,
Andreas Stathopoulos
.
ppopp 2026
:
674-686
[doi]
SPIDER: Unleashing Sparse Tensor Cores for Stencil Computation via Strided Swapping
Qiqi Gu 0002
,
Chenpeng Wu
,
Heng Shi 0005
,
Jianguo Yao 0002
.
ppopp 2026
:
218-231
[doi]
Sharded Elimination and Combining for Highly-Efficient Concurrent Stacks
Ajay Singh 0002
,
Nikos Metaxakis
,
Panagiota Fatourou
.
ppopp 2026
:
123-135
[doi]
Trojan Horse: Aggregate-and-Batch for Scaling Up Sparse Direct Solvers on GPU Clusters
Yida Li 0005
,
Siwei Zhang
,
Yiduo Niu
,
Yang Du 0015
,
Qingxiao Sun
,
Zhou Jin 0001
,
Weifeng Liu 0002
.
ppopp 2026
:
369-383
[doi]
Pipelonk: Accelerating End-to-End Zero-Knowledge Proof Generation on GPUs for PLONK-Based Protocols
Zhiyuan Zhang 0008
,
Yanxin Cai
,
Wenhao Yin
,
Xueyu Wu
,
Yi Wang 0003
,
Lei Ju 0001
,
Zhuoran Ji
.
ppopp 2026
:
439-451
[doi]
Binary Compatible Critical Section Delegation
Junyao Zhang 0008
,
Zhuo Wang
,
Zhe Zhou 0001
.
ppopp 2026
:
1-12
[doi]
Fixing Non-blocking Data Structures for Better Compatibility with Memory Reclamation Schemes
Md Amit Hasan Arovi
,
Ruslan Nikolaev 0001
.
ppopp 2026
:
26-39
[doi]
PANA: A Fine-Grained Runtime-Adaptive Load Balancing for Parallel SpMV on Multicore CPUs
Haodong Bian
,
Youhui Zhang
,
Xiang Fei
,
Jianqiang Huang 0002
,
Xiaoying Wang 0002
.
ppopp 2026
:
95-108
[doi]
MetaAttention: A Unified and Performant Attention Framework across Hardware Backends
Feiyang Chen
,
Yu Cheng
,
Lei Wang 0222
,
Yuqing Xia
,
Ziming Miao
,
Lingxiao Ma
,
Fan Yang 0024
,
Jilong Xue
,
Zhi Yang 0001
,
Mao Yang 0004
,
Xingda Wei
,
Haibo Chen 0001
.
ppopp 2026
:
635-647
[doi]
High-Throughput Non-uniformly Quantized 3-bit LLM Inference
Yuang Chen
,
Wenqi Zeng
,
Jeffrey Xu Yu
.
ppopp 2026
:
288-300
[doi]
Multiverse: Transactional Memory with Dynamic Multiversioning
Gaetano Coccimiglio
,
Trevor Brown 0001
,
Srivatsan Ravi
.
ppopp 2026
:
40-52
[doi]
Hapax Locks: Scalable Value-Based Mutual Exclusion
Dave Dice
,
Alex Kogan
.
ppopp 2026
:
13-25
[doi]
CCL-D: A High-Precision Diagnostic System for Slow and Hang Anomalies in Large-Scale Model Training
Yida Gu
,
Fakang Wang
,
Jianhao Fu
,
Zhenhang Sun
,
Qianyu Zhang
,
Hairui Zhao
,
Xingchen Liu
,
Yang Tian
,
Wenjing Huang 0002
,
Zedong Liu
,
Yifan Chen
,
Jinwu Yang
,
Yueyuan Zhou
,
Qian Zhao 0021
,
Haoxu Li
,
Tao Wang
,
Feng Yu
,
Zhan Wang 0003
,
Guangming Tan
,
Dingwen Tao
.
ppopp 2026
:
425-438
[doi]
Scaling GPU-to-CPU Migration for Efficient Distributed Execution on CPU Clusters
Ruobing Han
,
Hyesoon Kim
.
ppopp 2026
:
355-368
[doi]
Proceedings of the 31st ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, PPoPP 2026, Sydney, NSW, Australia, 31 January 2026 - 4 February 2026
Tony Hosking
,
Madan Musuvathi
,
Kenjiro Taura
, editors,
ACM,
2026.
[doi]
DTMiner: A Data-Centric System for Efficient Temporal Motif Mining
Yinbo Hou
,
Hao Qi 0004
,
Ligang He
,
Jin Zhao 0003
,
Yu Zhang 0027
,
Hui Yu
,
Longlong Lin
,
Lin Gu 0002
,
Wenbin Jiang 0001
,
Xiaofei Liao
,
Hai Jin 0001
.
ppopp 2026
:
591-604
[doi]
ASM-SpMM: Unleashing the Potential of Arm SME for Sparse Matrix Multiplication Acceleration
Jiazhi Jiang
,
Xijia Yao
,
Jiayu Chen
,
Jinhui Wei
,
Dan Huang 0001
,
Yutong Lu
.
ppopp 2026
:
232-244
[doi]
VDHA: Vector-Driven Hash Aggregation for Sparse Matrix-Sparse Vector Multiplication on GPUs
Yuchen Li
,
Zhe Pan
,
Peng Qu
,
Youhui Zhang
.
ppopp 2026
:
259-272
[doi]
TAC: Cache-Based System for Accelerating Billion-Scale GNN Training on Multi-GPU Platform
Zhiqiang Liang
,
Hongyu Gao
,
Jue Wang 0013
,
Fang Liu
,
Xingguo Shi
,
Junyu Gu
,
Peng Di
,
Sian Li
,
Lei Tang
,
Chunbao Zhou
,
Lian Zhao
,
Yangang Wang 0002
,
Xuebin Chi
.
ppopp 2026
:
577-590
[doi]
Laser: Unlocking Layer-Level Scheduling for Efficient Multi-SLO LLM Serving
Jianxiong Liao
,
Quanxing Dong
,
Yunkai Liang
,
Zhi Zhou 0006
,
Xu Chen 0004
.
ppopp 2026
:
509-521
[doi]
zBuffer: Zero-Copy and Metadata-Free Serialization for Fast RPC with Scatter-Gather Reflection
Xiangyu Liu
,
Huiba Li
,
Shun Gai
,
Youmin Chen
,
Yiming Zhang 0003
.
ppopp 2026
:
342-354
[doi]
Characterizing Matrix Multiplication Units across General Parallel Patterns in Scientific Computing
Yuechen Lu
,
Hongwei Zeng
,
Marc Casas
,
Weifeng Liu 0002
.
ppopp 2026
:
687-701
[doi]
ROME: Maximizing GPU Efficiency for All-Pairs Shortest Path via Taming Fine-Grained Irregularities
Weile Luo
,
Yuhan Chen
,
Xiangrui Yu
,
Qiang Wang 0022
,
Ruibo Fan
,
Hongyuan Liu 0002
,
Xiaowen Chu 0001
.
ppopp 2026
:
204-217
[doi]
UFO Trees: Practical and Provably-Efficient Parallel Batch-Dynamic Trees
Quinten De Man
,
Atharva Sharma
,
Kishen N. Gowda
,
Laxman Dhulipala
.
ppopp 2026
:
109-122
[doi]
Dynamic Detection of Inefficient Data Mapping Patterns in Heterogeneous OpenMP Applications
Luke Marzen
,
Junhyung Shim
,
Ali Jannesari
.
ppopp 2026
:
177-189
[doi]
Parallel Dynamic Spatial Indexes
Ziyang Men
,
Bo Huang
,
Yan Gu 0001
,
Yihan Sun 0001
.
ppopp 2026
:
150-163
[doi]
DiggerBees: Depth First Search Leveraging Hierarchical Block-Level Stealing on GPUs
Yuyao Niu
,
Yuechen Lu
,
Weifeng Liu 0002
,
Marc Casas
.
ppopp 2026
:
81-94
[doi]
Root-Down Exposure for Maximal Clique Enumeration on GPUs
Zhe Pan
,
Peng Qu
,
Youhui Zhang
.
ppopp 2026
:
190-203
[doi]
Towards Singular Value Decomposition for Rank-Deficient Matrices: An Efficient and Accurate Algorithm on GPU Architectures
Lu Shi
,
Weiwei Xu
,
Shaoshuai Zhang
.
ppopp 2026
:
648-659
[doi]
Waste-Efficient Work Stealing
Kyle Singer
,
Kunal Agrawal 0001
,
Tao B. Schardl
.
ppopp 2026
:
68-80
[doi]
HierCut: Enabling 16-bit Format Mixed Precision for Molecular Dynamics through Hierarchical Cutoff
Zeyu Song
,
Lin Gan
,
Xiaohui Duan
,
Zhengrui Li
,
Jiayu Fu
,
Yinuo Wang
,
Guangzhao Li
,
Guangwen Yang
.
ppopp 2026
:
315-328
[doi]
MixFusion: A Patch-Level Parallel Serving System for Mixed-Resolution Diffusion Models
Desen Sun
,
Zepeng Zhao
,
Yuke Wang
.
ppopp 2026
:
522-536
[doi]
JanusQuant: Accurate and Efficient 2-bit KV Cache Quantization for Long-Context Inference
Chengyu Sun
,
Yaqi Xia
,
Hulin Wang
,
Donglin Yang
,
Xiaobo Zhou 0002
,
Dazhao Cheng
.
ppopp 2026
:
301-314
[doi]
Elastor: Elastic and Efficient Model Partitioning and Checkpointing for Fault-Tolerant Distributed Training
Xuanyu Wang
,
Fangcheng Fu
,
Haoyang Li 0017
,
Hao Ge
,
Sheng Lin
,
Jiawen Niu
,
Bin Cui 0001
.
ppopp 2026
:
398-412
[doi]
APERTURE: Algorithm-System Co-optimization for Temporal Graph Network Inference
Yiqing Wang
,
Hailong Yang 0002
,
Enze Yu
,
Qingxiao Sun
,
Kejie Ma
,
Kaige Zhang
,
Chenhao Xie 0001
,
Depei Qian
.
ppopp 2026
:
564-576
[doi]
ElasGNN: An Elastic Training Framework for Distributed GNN Training
Siqi Wang
,
Hailong Yang 0002
,
Pengbo Wang
,
Hongliang Cao
,
Yufan Xu 0001
,
Xuezhu Wang
,
Zhongzhi Luan
,
Yi Liu 0013
,
Depei Qian
.
ppopp 2026
:
551-563
[doi]
Concurrent Balanced Augmented Trees
Evan Wrench
,
Ajay Singh 0002
,
Younghun Roh
,
Panagiota Fatourou
,
Siddhartha Jayanti
,
Eric Ruppert
,
Yuanhao Wei
.
ppopp 2026
:
136-149
[doi]
ChituDiffusion: A Data-Characteristic-Aware Serving System for Diffusion Models
Chengzhang Wu
,
Liyan Zheng 0001
,
Haojie Wang 0004
,
Kezhao Huang
,
Zixuan Ma
,
Dong Dong 0001
,
Jidong Zhai
.
ppopp 2026
:
537-550
[doi]
FlashAttention-T: Towards Fully Tensorized Attention by Exploiting Tensor-Vector Parallelism
Jianxing Xu
,
Yuanbo Wen 0001
,
Jun Bi
,
Ruibai Xu
,
Guanglin Xu
,
Rui Zhang 0040
,
Wei Li 0008
,
Ling Li 0001
,
Tianshi Chen 0002
,
Qi Guo 0001
,
Yunji Chen
.
ppopp 2026
:
605-619
[doi]
HelixPipe: Efficient Distributed Training of Long Sequence Transformers with Attention Parallel Pipeline Parallelism
Geng Zhang
,
Shenggan Cheng
,
Xuanlei Zhao
,
Ziming Liu
,
Yang You 0001
.
ppopp 2026
:
413-424
[doi]
Exploiting Efficient Mapping and Pipelined Execution for Accelerating SpMV on Tensor Cores
Kaige Zhang
,
Hailong Yang 0002
,
Xin You
,
Tianyu Feng
,
Yufan Xu 0001
,
Zhongzhi Luan
,
Yi Liu 0013
,
Depei Qian
.
ppopp 2026
:
245-258
[doi]
RoMeo: Mitigating Dual-dimensional Outliers with Rotated Mixed Precision Quantization
Qihao Zhang
,
Mingliang Tang
,
Mingshu Zhai
,
Kinman Lei
,
Jidong Zhai
.
ppopp 2026
:
273-287
[doi]
Faster and Cheaper: Pushing the Sequence Alignment Throughput with Commercial CPUs
Zhonghai Zhang
,
Yewen Li
,
Ke Meng
,
Chunming Zhang
,
Guangming Tan
.
ppopp 2026
:
466-479
[doi]
PIM-zd-tree: A Fast Space-Partitioning Index Leveraging Processing-in-Memory
Yiwei Zhao
,
Hongbo Kang
,
Ziyang Men
,
Yan Gu 0001
,
Guy E. Blelloch
,
Laxman Dhulipala
,
Charles McGuffey
,
Phillip B. Gibbons
.
ppopp 2026
:
480-495
[doi]
2025
WaterWise: Co-optimizing Carbon- and Water-Footprint Toward Environmentally Sustainable Cloud Computing
Yankai Jiang 0002
,
Rohan Basu Roy
,
Raghavendra Kanakagiri
,
Devesh Tiwari
.
ppopp 2025
:
297-311
[doi]
Fairer and More Scalable Reader-Writer Locks by Optimizing Queue Management
Takashi Hoshino 0002
,
Kenjiro Taura
.
ppopp 2025
:
115-127
[doi]
High-performance Visual Semantics Compression for AI-Driven Science
Boyuan Zhang 0002
,
Luanzheng Guo
,
Jiannan Tian
,
Jinyang Liu
,
Daoce Wang
,
Fanjiang Ye
,
Chengming Zhang 0006
,
Jan Strube 0001
,
Nathan R. Tallent
,
Dingwen Tao
.
ppopp 2025
:
557-559
[doi]
SGDRC: Software-Defined Dynamic Resource Control for Concurrent DNN Inference on NVIDIA GPUs
Yongkang Zhang 0003
,
Haoxuan Yu
,
Chenxia Han
,
Cheng Wang
,
Baotong Lu
,
Yunzhe Li
,
Zhifeng Jiang
,
Yang Li
,
Xiaowen Chu 0001
,
Huaicheng Li
.
ppopp 2025
:
267-281
[doi]
Frontier-guided Graph Reordering
Xinmiao Zhang 0004
,
Cheng Liu 0008
,
Shengwen Liang
,
Chenwei Xiong
,
Yu Zhang
,
Lei Zhang 0008
,
Huawei Li 0001
,
Xiaowei Li 0001
.
ppopp 2025
:
542-544
[doi]
Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, PPoPP 2025, Las Vegas, NV, USA, March 1-5, 2025
ACM,
2025.
[doi]
Big Atomics and Fast Hash Tables
Daniel Anderson
,
Guy E. Blelloch
,
Siddhartha V. Jayanti
.
ppopp 2025
:
539-541
[doi]
Popcorn: Accelerating Kernel K-means on GPUs through Sparse Linear Algebra
Julian Bellavita
,
Thomas Pasquali
,
Laura Del Rio Martin
,
Flavio Vella
,
Giulia Guidi
.
ppopp 2025
:
426-440
[doi]
Minimizing speculation overhead in a parallel recognizer for regular texts
Angelo Borsotti
,
Luca Breveglieri
,
Angelo Morzenti
,
Stefano Crespi-Reghizzi
.
ppopp 2025
:
569-572
[doi]
GLumin: Fast Connectivity Check Based on LUTs For Efficient Graph Pattern Mining
Weichen Cao
,
Ke Meng
,
Zhiheng Lin
,
Guangming Tan
.
ppopp 2025
:
455-468
[doi]
SBMGT: Scaling Bayesian Multinomial Group Testing
Weicong Chen
,
Hao Qi
,
Curtis Tatsuoka
,
Xiaoyi Lu
.
ppopp 2025
:
512-523
[doi]
Accelerating GNNs on GPU Sparse Tensor Cores through N: M Sparsity-Oriented Graph Reordering
Jou-An Chen
,
Hsin-Hsuan Sung
,
Ruifeng Zhang
,
Ang Li 0006
,
Xipeng Shen
.
ppopp 2025
:
16-28
[doi]
Triangle Counting on Tensor Cores
Yuang Chen
,
Jeffrey Xu Yu
.
ppopp 2025
:
560-562
[doi]
Magneto: Accelerating Parallel Structures in DNNs via Co-Optimization of Operators
Zhanyuan Di
,
Leping Wang
,
ZiYi Ren
,
En Shao
,
Jie Zhao
,
Siyuan Feng
,
Dingwen Tao
,
Guangming Tan
,
Ninghui Sun
.
ppopp 2025
:
563-565
[doi]
Reciprocating Locks
Dave Dice
,
Alex Kogan
.
ppopp 2025
:
85-98
[doi]
An AI-Enhanced 1km-Resolution Seamless Global Weather and Climate Model to Achieve Year-Scale Simulation Speed using 34 Million Cores
Xiaohui Duan
,
Yi Zhang
,
Kai Xu
,
Haohuan Fu
,
Bin Yang
,
Yiming Wang
,
Yilun Han
,
Siyuan Chen
,
Zhuangzhuang Zhou
,
Chenyu Wang
,
Dongqiang Huang
,
Huihai An
,
Xiting Ju
,
Haopeng Huang
,
Zhuang Liu
,
Wei Xue
,
Weiguo Liu
,
Bowen Yan
,
Jianye Hou
,
Maoxue Yu
,
Wenguang Chen
,
Jian Li
,
Zhao Jing
,
Hailong Liu
,
Lixin Wu
.
ppopp 2025
:
524-538
[doi]
MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models
Elias Frantar
,
Roberto L. Castro
,
Jiale Chen
,
Torsten Hoefler
,
Dan Alistarh
.
ppopp 2025
:
239-251
[doi]
LibRTS: A Spatial Indexing Library by Ray Tracing
Liang Geng
,
Rubao Lee
,
Xiaodong Zhang
.
ppopp 2025
:
396-411
[doi]
FlashFFTStencil: Bridging Fast Fourier Transforms to Memory-Efficient Stencil Computations on Tensor Core Units
Haozhi Han
,
Kun Li
,
Wei Cui
,
Donglin Bai
,
YiWei Zhang
,
Liang Yuan
,
Yifeng Chen
,
Yunquan Zhang
,
Ting Cao
,
Mao Yang
.
ppopp 2025
:
355-368
[doi]
Setting a Course for Post-Moore Software Performance
Charles E. Leiserson
.
ppopp 2025
:
1
[doi]
Boost Lock-free Queue and Stack with Batching
Ao Li
,
Wenhai Li
,
Yuan Chen
,
Lingfeng Deng
.
ppopp 2025
:
548-550
[doi]
ATTNChecker: Highly-Optimized Fault Tolerant Attention for Large Language Model Training
Yuhang Liang
,
Xinyi Li
,
Jie Ren
,
Ang Li
,
Bo Fang 0002
,
Jieyang Chen
.
ppopp 2025
:
252-266
[doi]
WeiPipe: Weight Pipeline Parallelism for Communication-Effective Long-Context Large Model Training
Junfeng Lin
,
Ziming Liu
,
Yang You 0001
,
Jun Wang
,
Weihao Zhang
,
Rong Zhao
.
ppopp 2025
:
225-238
[doi]
DORADD: Deterministic Parallel Execution in the Era of Microsecond-Scale Computing
Zhengqing Liu
,
Musa Unal
,
Matthew J. Parkinson
,
Marios Kogias
.
ppopp 2025
:
282-296
[doi]
Mario: Near Zero-cost Activation Checkpointing in Pipeline Parallelism
Weijian Liu
,
Mingzhen Li
,
Guangming Tan
,
Weile Jia
.
ppopp 2025
:
197-211
[doi]
Adaptive Parallel Training for Graph Neural Networks
Kaihao Ma
,
Renjie Liu
,
Xiao Yan 0002
,
Zhenkun Cai
,
Xiang Song 0003
,
Minjie Wang
,
Yichao Li
,
James Cheng
.
ppopp 2025
:
29-42
[doi]
RT-BarnesHut: Accelerating Barnes-Hut Using Ray-Tracing Hardware
Vani Nagarajan
,
Rohan Gangaraju
,
Kirshanthan Sundararajah
,
Artem Pelenitsyn
,
Milind Kulkarni 0001
.
ppopp 2025
:
43-56
[doi]
AC-Cache: A Memory-Efficient Caching System for Small Objects via Exploiting Access Correlations
Fulin Nan
,
Ronglong Wu
,
Zhirong Shen
,
Jiahui Yang
,
Li Cheng
,
Zheng Chen
,
Yiming Zhang
,
Jiwu Shu
.
ppopp 2025
:
142-155
[doi]
BerryBees: Breadth First Search by Bit-Tensor-Cores
Yuyao Niu
,
Marc Casas
.
ppopp 2025
:
339-354
[doi]
TensorMD: Molecular Dynamics Simulation with Ab Initio Accuracy of 50 Billion Atoms
Yucheng Ouyang
,
Ying Liu
,
Honghui Shang
,
Zhenchuan Chen
,
Jiahao Shan
,
Huimin Cui
,
Xiaobing Feng
,
Xin Chen
,
Xingyu Gao 0003
,
Lifang Wang
,
Haifeng Song 0003
,
Xin Chen
,
Rongfen Lin
,
Fang Li
.
ppopp 2025
:
551-553
[doi]
Aggregating Funnels for Faster Fetch&Add and Queues
Younghun Roh
,
Yuanhao Wei
,
Eric Ruppert
,
Panagiota Fatourou
,
Siddhartha Jayanti
,
Julian Shun
.
ppopp 2025
:
99-114
[doi]
Transactional Data Structures with Orthogonal Metadata
Yaodong Sheng
,
Ahmed Hassan
,
Michael F. Spear
.
ppopp 2025
:
545-547
[doi]
FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix Multiplications on Tensor Cores
Jinliang Shi
,
Shigang Li
,
Youxuan Xu
,
Rongtian Fu
,
Xueying Wang
,
Tong Wu
.
ppopp 2025
:
312-325
[doi]
Publish on Ping: A Better Way to Publish Reservations in Memory Reclamation for Concurrent Data Structures
Ajay Singh
,
Trevor Brown
.
ppopp 2025
:
128-141
[doi]
COMPSO: Optimizing Gradient Compression for Distributed Training with Second-Order Optimizers
Baixi Sun
,
Weijin Liu
,
J. Gregory Pauloski
,
Jiannan Tian
,
Jinda Jia
,
Daoce Wang
,
Boyuan Zhang
,
Mingkai Zheng
,
Sheng Di
,
Sian Jin
,
Zhao Zhang
,
Xiaodong Yu 0001
,
Kamil A. Iskra
,
Pete Beckman
,
Guangming Tan
,
Dingwen Tao
.
ppopp 2025
:
212-224
[doi]
Helios: Efficient Distributed Dynamic Graph Sampling for Online GNN Inference
Jie Sun
,
Zuocheng Shi
,
Li Su
,
Wenting Shen
,
Zeke Wang
,
Yong Li 0020
,
Wenyuan Yu
,
Wei Lin
,
Fei Wu 0001
,
Bingsheng He
,
Jingren Zhou
.
ppopp 2025
:
2-15
[doi]
Balanced Allocations over Efficient Queues: A Fast Relaxed FIFO Queue
Kåre von Geijer
,
Philippas Tsigas
,
Elias Johansson
,
Sebastian Hermansson
.
ppopp 2025
:
382-395
[doi]
Swift Unfolding of Communities: GPU-Accelerated Louvain Algorithm
Zhibin Wang
,
Xi Lin
,
Xue Li
,
Pinhuan Wang
,
Ziheng Meng
,
Hang Liu 0001
,
Chen Tian 0001
,
Sheng Zhong
.
ppopp 2025
:
441-454
[doi]
Crystality: A Programming Model for Smart Contracts on Parallel EVMs
Hao Wang
,
Minghao Pan
,
Jiaping Wang
.
ppopp 2025
:
412-425
[doi]
Effectively Virtual Page Prefetching via Spatial-Temporal Patterns for Memory-intensive Cloud Applications
Yun Wang
,
Liang Chen
,
Tianmai Deng
,
Ben Luo
,
Yibin Shen
,
Zhixiang Wei
,
Yixiao Xu
,
Minglang Huang
,
Zhengwei Qi
.
ppopp 2025
:
156-169
[doi]
Harnessing Inter-GPU Shared Memory for Seamless MoE Communication-Computation Fusion
Hulin Wang
,
Yaqi Xia
,
Donglin Yang
,
Xiaobo Zhou 0002
,
Dazhao Cheng
.
ppopp 2025
:
170-182
[doi]
Improving Tridiagonalization Performance on GPU Architectures
Hansheng Wang
,
Zhekai Duan
,
Zitian Zhao
,
Siqi Wu
,
Saiqi Zheng
,
Qiao Li
,
Xu Jiang
,
Shaoshuai Zhang
.
ppopp 2025
:
469-480
[doi]
TurboFFT: Co-Designed High-Performance and Fault-Tolerant Fast Fourier Transform on GPUs
Shixun Wu
,
Yujia Zhai
,
Jinyang Liu 0003
,
Jiajun Huang
,
Zizhe Jian
,
Huangliang Dai
,
Sheng Di
,
Franck Cappello
,
Zizhong Chen
.
ppopp 2025
:
70-84
[doi]
PANNS: Enhancing Graph-based Approximate Nearest Neighbor Search through Recency-aware Construction and Parameterized Search
Xizhe Yin
,
Chao Gao
,
Zhijia Zhao 0001
,
Rajiv Gupta 0001
.
ppopp 2025
:
369-381
[doi]
EVeREST: An Effective and Versatile Runtime Energy Saving Tool for GPUs
Anna Yue
,
Pen-Chung Yew
,
Sanyam Mehta
.
ppopp 2025
:
57-69
[doi]
Jigsaw: Toward Conflict-free Vectorized Stencil Computation by Tessellating Swizzled Registers
YiWei Zhang
,
Kun Li
,
Liang Yuan
,
Haozhi Han
,
Yunquan Zhang
,
Ting Cao
,
Mao Yang
.
ppopp 2025
:
481-495
[doi]
FastBWA: Practical and Cost-Efficient Genome Sequence Alignment Pipeline
Zhonghai Zhang
,
Yewen Li
,
Ke Meng
,
Chunming Zhang
,
Guangming Tan
.
ppopp 2025
:
554-556
[doi]
Acc-SpMM: Accelerating General-purpose Sparse Matrix-Matrix Multiplication with GPU Tensor Cores
Haisha Zhao
,
San-li
,
Jiaheng Wang
,
Chunbao Zhou
,
Jue Wang
,
Zhikuang Xin
,
Shunde Li
,
Zhiqiang Liang
,
Zhijie Pan
,
Fang Liu
,
Yan Zeng
,
Yangang Wang
,
Xuebin Chi
.
ppopp 2025
:
326-338
[doi]
FlashTensor: Optimizing Tensor Programs by Leveraging Fine-grained Tensor Property
Runxin Zhong
,
Yuyang Jin
,
Chen Zhang 0001
,
Kinman Lei
,
Shuangyu Li
,
Jidong Zhai
.
ppopp 2025
:
183-196
[doi]
A General and Scalable GCN Training Framework on CPU Supercomputers
Chen Zhuang
,
Peng Chen
,
Xin Liu
,
Rio Yokota
,
Nikoli Dryden
,
Lingqi Zhang 0001
,
Toshio Endo
,
Satoshi Matsuoka
,
Mohamed Wahib
.
ppopp 2025
:
566-568
[doi]
Semi-StructMG: A Fast and Scalable Semi-Structured Algebraic Multigrid
Yi Zong
,
Chensong Zhang
,
Longjiang Mu
,
Jianchun Wang
,
Jian Sun
,
Xiaowen Xu
,
Xinliang Wang
,
Peinan Yu
,
Wei Xue
.
ppopp 2025
:
496-511
[doi]
2024
Proceedings of the 15th International Workshop on Programming Models and Applications for Multicores and Manycores, PMAM 2024, Edinburgh, United Kingdom, 3 March 2024
ACM,
2024.
[doi]
Automatic Static Analysis-Guided Optimization of CUDA Kernels
Mark Lou
,
Stefan K. Muller
.
ppopp 2024
:
11-21
[doi]
Language-Agnostic Static Deadlock Detection for Futures
Stefan K. Muller
.
ppopp 2024
:
68-79
[doi]
Locks as a Resource: Fairly Scheduling Lock Occupation with CFL
Jonggyu Park
,
Young Ik Eom
.
ppopp 2024
:
17-29
[doi]
Scaling Up Transactions with Slower Clocks
Pedro Ramalhete
,
Andreia Correia
.
ppopp 2024
:
2-16
[doi]
Sign in
or
sign up
to see more results.