Abstract is missing.
- Keynote: Multicore ProgrammingBarbara Liskov. 1 [doi]
- Keynote: Developing our Quantum FutureKrysta M. Svore. 2 [doi]
- An Open-Source Benchmark Suite for Microservices and Their Hardware-Software Implications for Cloud & Edge SystemsYu Gan, Yanqi Zhang, Dailun Cheng, Ankitha Shetty, Priyal Rathi, Nayan Katarki, Ariana Bruno, Justin Hu, Brian Ritchken, Brendon Jackson, Kelvin Hu, Meghna Pancholi, Yuan He, Brett Clancy, Chris Colen, Fukang Wen, Catherine Leung, Siyuan Wang, Leon Zaruvinsky, Mateo Espinosa, Rick Lin, Zhongling Liu, Jake Padilla, Christina Delimitrou. 3-18 [doi]
- Seer: Leveraging Big Data to Navigate the Complexity of Performance Debugging in Cloud MicroservicesYu Gan, Yanqi Zhang, Kelvin Hu, Dailun Cheng, Yuan He, Meghna Pancholi, Christina Delimitrou. 19-33 [doi]
- RPCValet: NI-Driven Tail-Aware Balancing of µs-Scale RPCsAlexandros Daglis, Mark Sutherland, Babak Falsafi. 35-48 [doi]
- A Framework for Memory Oversubscription Management in Graphics Processing UnitsChen Li 0015, Rachata Ausavarungnirun, Christopher J. Rossbach, Youtao Zhang, Onur Mutlu, Yang Guo, Jun Yang. 49-63 [doi]
- Swizzle Inventor: Data Movement Synthesis for GPU KernelsPhitchaya Mangpo Phothilimthana, Archibald Samuel Elliott, an Wang, Abhinav Jangda, Bastian Hagedorn, Henrik Barthels, Samuel J. Kaufman, Vinod Grover, Emina Torlak, Rastislav Bodík. 65-78 [doi]
- Scalable Processing of Contemporary Semi-Structured Data on Commodity Parallel Processors - A Compilation-based ApproachLin Jiang, Xiaofan Sun, Umar Farooq, Zhijia Zhao 0001. 79-92 [doi]
- Fast and Scalable VMM Live Upgrade in Large Cloud InfrastructureXiantao Zhang, Xiao Zheng, Zhi Wang, Qi Li, Junkang Fu, Yang Zhang, Yibin Shen. 93-105 [doi]
- PARTIES: QoS-Aware Resource Partitioning for Multiple Interactive ServicesShuang Chen, Christina Delimitrou, José F. Martínez. 107-120 [doi]
- X-Containers: Breaking Down Barriers to Improve Performance and Isolation of Cloud-Native ContainersZhiming Shen, Zhen Sun, Gur-Eyal Sela, Eugene Bagdasaryan, Christina Delimitrou, Robbert van Renesse, Hakim Weatherspoon. 121-135 [doi]
- Buffets: An Efficient and Composable Storage Idiom for Explicit Decoupled Data OrchestrationMichael Pellauer, Yakun Sophia Shao, Jason Clemons, Neal Clayton Crago, Kartik Hegde, Rangharajan Venkatesan, Stephen W. Keckler, Christopher W. Fletcher, Joel S. Emer. 137-151 [doi]
- HiWayLib: A Software Framework for Enabling High Performance Communications for Heterogeneous Pipeline ComputationsZhen Zheng, Chanyoung Oh, Jidong Zhai, Xipeng Shen, Youngmin Yi, Wenguang Chen. 153-166 [doi]
- StreamBox-HBM: Stream Analytics on High Bandwidth Hybrid MemoryHongyu Miao, Myeongjae Jeon, Gennady Pekhimenko, Kathryn S. McKinley, Felix Xiaozhu Lin. 167-181 [doi]
- Puddle: A Dynamic, Error-Correcting, Full-Stack Microfluidics PlatformMax Willsey, Ashley P. Stephenson, Chris Takahashi, Pranav Vaid, Bichlien H. Nguyen, Michal Piszczek, Christine Betts, Sharon Newman, Sarang Joshi, Karin Strauss, Luis Ceze. 183-197 [doi]
- Intelligence Beyond the Edge: Inference on Intermittent Embedded SystemsGraham Gobieski, Brandon Lucia, Nathan Beckmann. 199-213 [doi]
- Boosted Race Trees for Low Energy ClassificationGeorgios Tzimpragos, Advait Madhavan, Dilip Vasudevan, Dmitri B. Strukov, Timothy Sherwood. 215-228 [doi]
- Compress Objects, Not Cache Lines: An Object-Based Compressed Memory HierarchyPo-An Tsai, Daniel Sánchez 0003. 229-242 [doi]
- Beating OPT with Statistical Clairvoyance and Variable Size CachingPengcheng Li, Colin Pronovost, William Wilson, Benjamin Tait, Jie Zhou, Chen Ding, John Criswell. 243-256 [doi]
- A Formal Analysis of the NVIDIA PTX Memory Consistency ModelDaniel Lustig, Sameer Sahasrabuddhe, Olivier Giroux. 257-270 [doi]
- Just-In-Time Compilation for Verilog: A New Technique for Improving the FPGA Programming ExperienceEric Schkufza, Michael Wei, Christopher J. Rossbach. 271-286 [doi]
- DCNS: Automated Detection Of Conservative Non-Sleep Defects in the Linux KernelJia-Ju Bai, Julia Lawall, Wende Tan, Shi-Min Hu. 287-299 [doi]
- A Case for Lease-Based, Utilitarian Resource Management on Mobile DevicesYigong Hu, Suyi Liu, Peng Huang. 301-315 [doi]
- Software-Defined Far Memory in Warehouse-Scale ComputersH. Andrés Lagar-Cavilla, Junwhan Ahn, Suleiman Souhlal, Neha Agarwal, Radoslaw Burny, Shakeel Butt, Jichuan Chang, Ashwin Chaugule, Nan Deng, Junaid Shahid, Greg Thelen, Kamil Adam Yurtsever, Yu Zhao, Parthasarathy Ranganathan. 317-330 [doi]
- Nimble Page Management for Tiered Memory SystemsZi Yan, Daniel Lustig, David Nellans, Abhishek Bhattacharjee. 331-345 [doi]
- HawkEye: Efficient Fine-grained OS Support for Huge PagesAshish Panwar, Sorav Bansal, K. Gopinath. 347-360 [doi]
- Architectural Support for Containment-based SecurityHansen Zhang, Soumyadeep Ghosh, Jordan Fix, Sotiris Apostolakis, Stephen R. Beard, Nayana P. Nagendra, Taewook Oh, David I. August. 361-377 [doi]
- CheriABI: Enforcing Valid Pointer Provenance and Minimizing Pointer Privilege in the POSIX C Run-time EnvironmentBrooks Davis, Robert N. M. Watson, Alexander Richardson, Peter G. Neumann, Simon W. Moore, John Baldwin, David Chisnall, James Clarke, Nathaniel Wesley Filardo, Khilan Gudka, Alexandre Joannou, Ben Laurie, A. Theodore Markettos, J. Edward Maste, Alfredo Mazzinghi, Edward Tomasz Napierala, Robert M. Norton, Michael Roe, Peter Sewell, Stacey D. Son, Jonathan Woodruff. 379-393 [doi]
- Context-Sensitive Fencing: Securing Speculative Execution via Microcode CustomizationMohammadkazem Taram, Ashish Venkat, Dean M. Tullsen. 395-410 [doi]
- PMTest: A Fast and Flexible Testing Framework for Persistent Memory ProgramsSihang Liu 0001, Yizhou Wei, Jishen Zhao, Aasheesh Kolli, Samira Manabi Khan. 411-425 [doi]
- Finding and Fixing Performance Pathologies in Persistent Memory Software StacksJian Xu, Juno Kim, Amirsaman Memaripour, Steven Swanson. 427-439 [doi]
- Fine-Grain Checkpointing with In-Cache-Line LoggingNachshon Cohen, David T. Aksun, Hillel Avni, James R. Larus. 441-454 [doi]
- Heterogeneous Isolated Execution for Commodity GPUsInsu Jang, Adrian Tang, Taehoon Kim, Simha Sethumadhavan, Jaehyuk Huh. 455-468 [doi]
- Morpheus: A Vulnerability-Tolerant Secure Architecture Based on Ensembles of Moving Target Defenses with ChurnMark Gallagher, Lauren Biernacki, Shibo Chen, Zelalem Birhanu Aweke, Salessawi Ferede Yitbarek, Misiker Tadesse Aga, Austin Harris, Zhixing Xu, Baris Kasikci, Valeria Bertacco, Sharad Malik, Mohit Tiwari, Todd M. Austin. 469-484 [doi]
- DeepSigns: An End-to-End Watermarking Framework for Ownership Protection of Deep Neural NetworksBita Darvish Rouhani, Huili Chen, Farinaz Koushanfar. 485-497 [doi]
- FA3C: FPGA-Accelerated Deep Reinforcement LearningHyungmin Cho, Pyeongseok Oh, Jiyoung Park, Wookeun Jung, Jaejin Lee. 499-513 [doi]
- 2 : Accelerating Markov Chain Monte Carlo Algorithms for Probabilistic ModelsSubho S. Banerjee, Zbigniew T. Kalbarczyk, Ravishankar K. Iyer. 515-528 [doi]
- Targeting Classical Code to a Quantum AnnealerScott Pakin. 529-543 [doi]
- ProbeGuard: Mitigating Probing Attacks Through Reactive Program TransformationsKoustubha Bhat, Erik van der Kouwe, Herbert Bos, Cristiano Giuffrida. 545-558 [doi]
- kMVX: Detecting Kernel Information Leaks with Multi-variant ExecutionSebastian Österlund, Koen Koning, Pierre Olivier, Antonio Barbalace, Herbert Bos, Cristiano Giuffrida. 559-572 [doi]
- MVEDSUA: Higher Availability Dynamic Software Updates via Multi-Version ExecutionLuís Pina, Anastasios Andronidis, Michael Hicks 0001, Cristian Cadar. 573-585 [doi]
- PnP: Pruning and Prediction for Point-To-Point Iterative Graph AnalyticsChengshuo Xu, Keval Vora, Rajiv Gupta 0001. 587-600 [doi]
- DiGraph: An Efficient Path-based Iterative Directed Graph Processing System on Multiple GPUsYu Zhang, Xiaofei Liao, Hai Jin 0001, Bingsheng He, Haikun Liu, Lin Gu. 601-614 [doi]
- Phoenix: A Substrate for Resilient Distributed Graph AnalyticsRoshan Dathathri, Gurbinder Gill, Loc Hoang, Keshav Pingali. 615-630 [doi]
- BOGO: Buy Spatial Memory Safety, Get Temporal Memory Safety (Almost) FreeTong Zhang, Dongyoon Lee, Changhee Jung. 631-644 [doi]
- Protecting Page Tables from RowHammer Attacks using Monotonic Pointers in DRAM True-CellsXin-Chuan Wu, Timothy Sherwood, Frederic T. Chong, Yanjing Li. 645-657 [doi]
- ρ: Relaxed Hierarchical ORAMChandrasekhar Nagarajan, Ali Shafiee, Rajeev Balasubramonian, Mohit Tiwari. 659-671 [doi]
- uops.info: Characterizing Latency, Throughput, and Port Usage of Instructions on Intel MicroarchitecturesAndreas Abel 0002, Jan Reineke. 673-686 [doi]
- Bootstrapping: Using SMT Hardware to Improve Single-Thread PerformanceSushant Kondguli, Michael Huang. 687-700 [doi]
- CORF: Coalescing Operand Register File for GPUsHodjat Asghari Esfeden, Farzad Khorasani, Hyeran Jeon, Daniel Wong, Nael B. Abu-Ghazaleh. 701-714 [doi]
- PUMA: A Programmable Ultra-efficient Memristor-based Accelerator for Machine Learning InferenceAayush Ankit, Izzat El Hajj, Sai Rahul Chalamalasetti, Geoffrey Ndu, Martin Foltin, R. Stanley Williams, Paolo Faraboschi, Wen-mei W. Hwu, John Paul Strachan, Kaushik Roy 0001, Dejan S. Milojicic. 715-731 [doi]
- FPSA: A Full System Stack Solution for Reconfigurable ReRAM-based NN Accelerator ArchitectureYu Ji, Youyang Zhang, Xinfeng Xie, Shuangchen Li, Peiqi Wang, Xing Hu, Youhui Zhang, Yuan Xie. 733-747 [doi]
- Bit-Tactical: A Software/Hardware Approach to Exploiting Value and Bit Sparsity in Neural NetworksAlberto Delmas Lascorz, Patrick Judd, Dylan Malone Stuart, Zissis Poulos, Mostafa Mahmoud, Sayeh Sharify, Milos Nikolic, Kevin Siu, Andreas Moshovos. 749-763 [doi]
- pLock: A Fast Lock for Architectures with Explicit Inter-core Message PassingXiongchao Tang, Jidong Zhai, Xuehai Qian, Wenguang Chen. 765-778 [doi]
- MV-RLU: Scaling Read-Log-Update with Multi-VersioningJaeho Kim, Ajit Mathew, Sanidhya Kashyap, Madhava Krishnan Ramanathan, Changwoo Min. 779-792 [doi]
- Fast Fine-Grained Global Synchronization on GPUsKai Wang, Don Fussell, Calvin Lin. 793-806 [doi]
- TANGRAM: Optimized Coarse-Grained Dataflow for Scalable NN AcceleratorsMingyu Gao, Xuan Yang, Jing Pu, Mark Horowitz, Christos Kozyrakis. 807-820 [doi]
- Packing Sparse Convolutional Neural Networks for Efficient Systolic Array Implementations: Column Combining Under Joint OptimizationH. T. Kung, Bradley McDanel, Sai Qian Zhang. 821-834 [doi]
- Split-CNN: Splitting Window-based Operations in Convolutional Neural Networks for Memory System OptimizationTian Jin, Seokin Hong. 835-847 [doi]
- Replica: A Wireless Manycore for Communication-Intensive and Approximate DataVimuth Fernando, Antonio Franques, Sergi Abadal, Sasa Misailovic, Josep Torrellas. 849-863 [doi]
- Understanding Real-World Concurrency Bugs in GoTengfei Tu, Xiaoyu Liu, Linhai Song, Yiying Zhang. 865-878 [doi]
- Lazy Determinism for Faster Deterministic MultithreadingTimothy Merrifield, Sepideh Roghanchi, Joseph Devietti, Jakob Eriksson. 879-891 [doi]
- Hop: Heterogeneity-aware Decentralized TrainingQinyi Luo, Jinkun Lin, Youwei Zhuo, Xuehai Qian. 893-907 [doi]
- Astra: Exploiting Predictability to Optimize Deep LearningMuthian Sivathanu, Tapan Chugh, Sanjay S. Singapuram, Lidong Zhou. 909-923 [doi]
- ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Methods of MultipliersAo Ren, Tianyun Zhang, Shaokai Ye, Jiayu Li, Wenyao Xu, Xuehai Qian, Xue Lin, Yanzhi Wang. 925-938 [doi]
- LightStore: Software-defined Network-attached Key-value DrivesChanwoo Chung, Jinhyung Koo, Junsu Im, Arvind, Sungjin Lee. 939-953 [doi]
- SOML Read: Rethinking the Read Operation Granularity of 3D NAND SSDsChun-Yi Liu, Jagadish B. Kotra, Myoungsoo Jung, Mahmut T. Kandemir, Chita R. Das. 955-969 [doi]
- FlatFlash: Exploiting the Byte-Accessibility of SSDs within a Unified Memory-Storage HierarchyAhmed H. M. O. Abulila, Vikram Sharma Mailthody, Zaid Qureshi, Jian Huang, Nam Sung Kim, Jinjun Xiong, Wen-mei W. Hwu. 971-985 [doi]
- Not All Qubits Are Created Equal: A Case for Variability-Aware Policies for NISQ-Era Quantum ComputersSwamit S. Tannu, Moinuddin K. Qureshi. 987-999 [doi]
- Tackling the Qubit Mapping Problem for NISQ-Era Quantum DevicesGushu Li, Yufei Ding, Yuan Xie. 1001-1014 [doi]
- Noise-Adaptive Compiler Mappings for Noisy Intermediate-Scale Quantum ComputersPrakash Murali, Jonathan M. Baker, Ali Javadi-Abhari, Frederic T. Chong, Margaret Martonosi. 1015-1029 [doi]
- Optimized Compilation of Aggregated Instructions for Realistic Quantum ComputersYunong Shi, Nelson Leung, Pranav Gokhale, Zane Rossi, David I. Schuster, Henry Hoffmann, Frederic T. Chong. 1031-1044 [doi]
- Wasabi: A Framework for Dynamically Analyzing WebAssemblyDaniel Lehmann, Michael Pradel. 1045-1058 [doi]
- Safer Program Behavior Sharing Through Trace WringingDeeksha Dangwal, Weilong Cui, Joseph McMahan, Timothy Sherwood. 1059-1072 [doi]
- Debugging Support for Pattern-Matching Languages and AcceleratorsMatthew Casias, Kevin Angstadt, Tommy Tracy II, Kevin Skadron, Westley Weimer. 1073-1086 [doi]
- Minotaur: Adapting Software Testing Techniques for Hardware ErrorsAbdulrahman Mahmoud, Radha Venkatagiri, Khalique Ahmed, Sasa Misailovic, Darko Marinov, Christopher W. Fletcher, Sarita V. Adve. 1087-1103 [doi]