Abstract is missing.
- Graphene: Strong yet Lightweight Row Hammer ProtectionYeonhong Park, Woosuk Kwon, Eojin Lee, Tae Jun Ham, Jung Ho Ahn, Jae W. Lee. 1-13 [doi]
- Persist Level Parallelism: Streamlining Integrity Tree Updates for Secure Persistent MemoryAlexander Freij, Shougang Yuan, Huiyang Zhou, Yan Solihin. 14-27 [doi]
- PThammer: Cross-User-Kernel-Boundary Rowhammer through Implicit AccessesZhi Zhang 0001, Yueqiang Cheng, Dongxi Liu, Surya Nepal, Zhi Wang 0004, Yuval Yarom. 28-41 [doi]
- Draco: Architectural and Operating System Support for System Call SecurityDimitrios Skarlatos, Qingrong Chen, Jianyan Chen, Tianyin Xu, Josep Torrellas. 42-57 [doi]
- SuperNPU: An Extremely Fast Neural Processing Unit Using Superconducting Logic DevicesKoki Ishida, Ilkwon Byun, Ikki Nagaoka, Kosuke Fukumitsu, Masamitsu Tanaka, Satoshi Kawakami, Teruo Tanimoto, Takatsugu Ono, Jangwoo Kim, Koji Inoue. 58-72 [doi]
- Printed Machine Learning ClassifiersMuhammad Husnain Mubarik, Dennis D. Weller, Nathaniel Bleier, Matthew Tomei, Jasmin Aghassi-Hagmann, Mehdi B. Tahoori, Rakesh Kumar 0002. 73-87 [doi]
- Look-Up Table based Energy Efficient Processing in Cache Support for Neural Network AccelerationAkshay Krishna Ramanathan, Gurpreet S. Kalsi, Srivatsa Srinivasa, Tarun Makesh Chandran, Kamlesh R. Pillai, Om Ji Omer, Vijaykrishnan Narayanan, Sreenivas Subramoney. 88-101 [doi]
- FReaC Cache: Folded-logic Reconfigurable Computing in the Last Level CacheAshutosh Dhar, Xiaohao Wang, Hubertus Franke, Jinjun Xiong, Jian Huang, Wen-mei W. Hwu, Nam Sung Kim, Deming Chen. 102-117 [doi]
- BranchNet: A Convolutional Neural Network to Predict Hard-To-Predict BranchesSiavash Zangeneh, Stephen Pruett, Sangkug Lym, Yale N. Patt. 118-130 [doi]
- CHiRP: Control-Flow History Reuse PredictionSamira Mirbagher Ajorpaz, Elba Garza, Gilles Pokam, Daniel A. Jiménez. 131-145 [doi]
- I-SPY: Context-Driven Conditional Instruction Prefetching with CoalescingTanvir Ahmed Khan, Akshitha Sriraman, Joseph Devietti, Gilles Pokam, Heiner Litz, Baris Kasikci. 146-159 [doi]
- Improving the Utilization of Micro-operation Caches in x86 ProcessorsJagadish B. Kotra, John Kalamatianos. 160-172 [doi]
- Virtualized Logical Qubits: A 2.5D Architecture for Error-Corrected Quantum ComputingCasey Duckering, Jonathan M. Baker, David I. Schuster, Frederic T. Chong. 173-185 [doi]
- Optimized Quantum Compilation for Near-Term Algorithms with OpenPulsePranav Gokhale, Ali Javadi-Abhari, Nathan Earnest, Yunong Shi, Frederic T. Chong. 186-200 [doi]
- Systematic Crosstalk Mitigation for Superconducting Qubits via Frequency-Aware CompilationYongshan Ding, Pranav Gokhale, Sophia Fuhui Lin, Richard Rines, Thomas Propson, Frederic T. Chong. 201-214 [doi]
- Circuit Compilation Methodologies for Quantum Approximate Optimization AlgorithmMahabubul Alam, Abdullah Ash-Saki, Swaroop Ghosh. 215-228 [doi]
- Fast-BCNN: Massive Neuron Skipping in Bayesian Convolutional Neural NetworksQiyu Wan, Xin Fu. 229-240 [doi]
- Ptolemy: Architecture Support for Robust Deep LearningYiming Gan, Yuxian Qiu, Jingwen Leng, Minyi Guo, Yuhao Zhu 0001. 241-255 [doi]
- Non-Blocking Simultaneous Multithreading: Embracing the Resiliency of Deep Neural NetworksGil Shomron, Uri C. Weiser. 256-269 [doi]
- FIdelity: Efficient Resilience Analysis Framework for Deep Learning AcceleratorsYi He, Prasanna Balaprakash, Yanjing Li. 270-281 [doi]
- Bit-Exact ECC Recovery (BEER): Determining DRAM On-Die ECC Functions by Exploiting DRAM Data Retention CharacteristicsMinesh Patel, Jeremie S. Kim, Taha Shahroodi, Hasan Hassan, Onur Mutlu. 282-297 [doi]
- DStress: Automatic Synthesis of DRAM Reliability Stress Viruses using Genetic AlgorithmsLev Mukhanov, Dimitrios S. Nikolopoulos, Georgios Karakonstantis. 298-312 [doi]
- FIGARO: Improving System Performance via Fine-Grained In-DRAM Data Relocation and CachingYaohua Wang, Lois Orosa 0001, Xiangjun Peng, Yang Guo 0003, Saugata Ghose, Minesh Patel, Jeremie S. Kim, Juan Gómez-Luna, Mohammad Sadrosadati, Nika Mansouri-Ghiasi, Onur Mutlu. 313-328 [doi]
- PerpLE: Improving the Speed and Effectiveness of Memory Consistency TestingThemis Melissaris, Markos Markakis, Kelly A. Shaw 0001, Margaret Martonosi. 329-341 [doi]
- CATCAM: Constant-time Alteration Ternary CAM with Scalable In-Memory ArchitectureDibei Chen, Zhaoshi Li, Tianzhu Xiong, Zhiwei Liu, Jun Yang, Shouyi Yin, Shaojun Wei, Leibo Liu. 342-355 [doi]
- DUAL: Acceleration of Clustering Algorithms using Digital-based Processing In-MemoryMohsen Imani, Saikishan Pampana, Saransh Gupta, Minxuan Zhou, Yeseong Kim, Tajana Rosing. 356-371 [doi]
- Newton: A DRAM-maker's Accelerator-in-Memory (AiM) Architecture for Machine LearningMingxuan He, Choungki Song, Ilkon Kim, Chunseok Jeong, Seho Kim, Il Park 0001, Mithuna Thottethodi, T. N. Vijaykumar. 372-385 [doi]
- AQUOMAN: An Analytic-Query Offloading MachineShuotao Xu, Thomas Bourgeat, Tianhao Huang, Hojun Kim, Sungjin Lee, Arvind. 386-399 [doi]
- MOUSE: Inference In Non-volatile Memory for Energy Harvesting ApplicationsSalonik Resch, S. Karen Khatamifard, Zamshed I. Chowdhury, Masoud Zabihi, Zhengyang Zhao, Hüsrev Cilasun, Jian-Ping Wang, Sachin S. Sapatnekar, Ulya R. Karpuzcu. 400-414 [doi]
- More with Less - Deriving More Translation Rules with Less Training Data for DBTs Using ParameterizationJinhu Jiang, Rongchao Dong, Zhongjun Zhou, Changheng Song, Wenwen Wang, Pen-Chung Yew, Weihua Zhang. 415-426 [doi]
- Optimizing the Memory Hierarchy by Compositing Automatic Transformations on Computations and DataJie Zhao, Peng Di. 427-441 [doi]
- DiffTune: Optimizing CPU Simulator Parameters with Learned Differentiable SurrogatesAlex Renda, Yishen Chen, Charith Mendis, Michael Carbin. 442-455 [doi]
- Predicting Execution Times With Partial Simulations in Virtual Memory Research: Why and HowMohammad Agbarya, Idan Yaniv, Jayneel Gandhi, Dan Tsafrir. 456-470 [doi]
- gem5-SALAM: A System Architecture for LLVM-based Accelerator ModelingSamuel Rogers, Joshua Slycord, Mohammadreza Baharani, Hamed Tabkhi. 471-482 [doi]
- Shaving Retries with Sentinels for Fast Read over High-Density 3D FlashQiao Li 0001, Min Ye, Yufei Cui, Liang Shi, Xiaoqiang Li, Tei-Wei Kuo, Chun Jason Xue. 483-495 [doi]
- Characterizing and Modeling Non-Volatile Memory SystemsZixuan Wang, Xiao Liu, Jian Yang, Theodore Michailidis, Steven Swanson, Jishen Zhao. 496-508 [doi]
- P-INSPECT: Architectural Support for Programmable Non-Volatile Memory FrameworksApostolos Kokolis, Thomas Shull, Jian Huang 0006, Josep Torrellas. 509-524 [doi]
- Unbounded Hardware Transactional Memory for a Hybrid DRAM/NVM Memory SystemJungi Jeong, Jaewan Hong, Seungryoul Maeng, Changhee Jung, Youngjin Kwon. 525-538 [doi]
- (Almost) Fence-less Persist OrderingSara Mahdizadeh-Shahri, Seyed Armin Vakil-Ghahani, Aasheesh Kolli. 539-554 [doi]
- Speculative Enforcement of Store AtomicityAlberto Ros, Stefanos Kaxiras. 555-567 [doi]
- Boosting Store Buffer Efficiency with Store-Prefetch BurstsJuan M. Cebrian, Stefanos Kaxiras, Alberto Ros. 568-580 [doi]
- D-SOAP: Dynamic Spatial Orientation Affinity Prediction for Caching in Multi-Orientation Memory SystemsMinli Julie Liao, Jack Sampson. 581-595 [doi]
- Pipette: Improving Core Utilization on Irregular Applications through Intra-Core Pipeline ParallelismQuan M. Nguyen, Daniel Sanchez. 596-608 [doi]
- RnR: A Software-Assisted Record-and-Replay Hardware PrefetcherChao Zhang 0039, Yuan Zeng, John Shalf, Xiaochen Guo. 609-621 [doi]
- ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators using Reinforcement LearningSheng-Chun Kao, Geonhwa Jeong, Tushar Krishna. 622-636 [doi]
- Gemini: Learning to Manage CPU Power for Latency-Critical Search EnginesLiang Zhou, Laxmi N. Bhuyan, K. K. Ramakrishnan. 637-349 [doi]
- CuttleSys: Data-Driven Resource Management for Interactive Services on Reconfigurable MulticoresNeeraj Kulkarni, Gonzalo Gonzalez-Pumariega, Amulya Khurana, Christine A. Shoemaker, Christina Delimitrou, David H. Albonesi. 650-664 [doi]
- Jumanji: The Case for Dynamic NUCA in the DatacenterBrian C. Schwedock, Nathan Beckmann. 665-680 [doi]
- Planaria: Dynamic Architecture Fission for Spatial Multi-Tenant Acceleration of Deep Neural NetworksSoroush Ghodrati, Byung Hoon Ahn, Joon Kyung Kim, Sean Kinzer, Brahmendra Reddy Yatham, Navateja Alla, Hardik Sharma, Mohammad Alian, Eiman Ebrahimi, Nam Sung Kim, Cliff Young, Hadi Esmaeilzadeh. 681-697 [doi]
- VR-DANN: Real-Time Video Recognition via Decoder-Assisted Neural Network AccelerationZhuoran Song, Feiyang Wu, Xueyuan Liu, Jing Ke, Naifeng Jing, Xiaoyao Liang. 698-710 [doi]
- Procrustes: a Dataflow and Accelerator for Sparse Deep Neural Network TrainingDingqing Yang, Amin Ghasemazar, Xiaowei Ren, Maximilian Golub, Guy Lemieux, Mieszko Lis. 711-724 [doi]
- Duplo: Lifting Redundant Memory Accesses of Deep Neural Networks for GPU Tensor CoresHyeonjin Kim, Sungwoo Ahn, Yunho Oh, Bogil Kim, Won Woo Ro, William J. Song. 725-737 [doi]
- DUET: Boosting Deep Neural Network Efficiency on Dual-Module ArchitectureLiu Liu, Zheng Qu, Lei Deng, Fengbin Tu, Shuangchen Li, Xing Hu, Zhenyu Gu, Yufei Ding, Yuan Xie 0001. 738-750 [doi]
- TFE: Energy-efficient Transferred Filter-based Engine to Compress and Accelerate Convolutional Neural NetworksHuiyu Mo, Leibo Liu, Wenjing Hu, Wenping Zhu, Qiang Li, Ang Li, Shouyi Yin, Jian Chen, Xiaowei Jiang, Shaojun Wei. 751-765 [doi]
- MatRaptor: A Sparse-Sparse Matrix Multiplication Accelerator Based on Row-Wise ProductNitish Kumar Srivastava, Hanchen Jin, Jie Liu, David H. Albonesi, Zhiru Zhang. 766-780 [doi]
- TensorDash: Exploiting Sparsity to Accelerate Deep Neural Network TrainingMostafa Mahmoud, Isak Edo, Ali Hadi Zadeh, Omar Mohamed Awad, Gennady Pekhimenko, Jorge Albericio, Andreas Moshovos. 781-795 [doi]
- SAVE: Sparsity-Aware Vector Engine for Accelerating DNN Training and Inference on CPUsZhangxiaowen Gong, Houxiang Ji, Christopher W. Fletcher, Christopher J. Hughes, Sara Baghsorkhi, Josep Torrellas. 796-810 [doi]
- GOBO: Quantizing Attention-Based NLP Models for Low Latency and Energy Efficient InferenceAli Hadi Zadeh, Isak Edo, Omar Mohamed Awad, Andreas Moshovos. 811-824 [doi]
- TrainBox: An Extreme-Scale Neural Network Training Server Architecture by Systematically Balancing OperationsPyeongsu Park, Heetaek Jeong, Jangwoo Kim. 825-838 [doi]
- Coordinated Priority-aware Charging of Distributed Batteries in Oversubscribed Data CentersSulav Malla, Qingyuan Deng, Zoh Ebrahimzadeh, Joe Gasperetti, Sajal Jain, Parimala Kondety, Thiara Ortiz, Debra Vieira. 839-851 [doi]
- HyperPlane: A Scalable Low-Latency Notification Accelerator for Software Data PlanesAmirhossein Mirhosseini, Hossein Golestani, Thomas F. Wenisch. 852-867 [doi]
- ThymesisFlow: A Software-Defined, HW/SW co-Designed Interconnect Stack for Rack-Scale Memory DisaggregationChristian Pinto, Dimitris Syrivelis, Michele Gazzetti, Panos K. Koutsovasilis, Andrea Reale, Kostas Katrinis, H. Peter Hofstee. 868-880 [doi]
- A Benchmarking Framework for Interactive 3D Applications in the CloudTianyi Liu, Sen He, Sunzhou Huang, Danny Tsang, Lingjia Tang, Jason Mars, Wei Wang 0054. 881-894 [doi]
- A Locality-Aware Energy-Efficient Accelerator for Graph Mining ApplicationsPengcheng Yao, Long Zheng 0003, Zhen Zeng, Yu Huang 0013, Chuangyi Gui, Xiaofei Liao, Hai Jin 0001, Jingling Xue. 895-907 [doi]
- GraphPulse: An Event-Driven Hardware Accelerator for Asynchronous Graph ProcessingShafiur Rahman, Nael B. Abu-Ghazaleh, Rajiv Gupta. 908-921 [doi]
- AWB-GCN: A Graph Convolutional Network Accelerator with Runtime Workload RebalancingTong Geng, Ang Li, Runbin Shi, Chunshu Wu, Tianqi Wang, Yanfei Li, Pouya Haghi, Antonino Tumeo, Shuai Che, Steven K. Reinhardt, Martin C. Herbordt. 922-936 [doi]
- SeedEx: A Genome Sequencing Accelerator for Optimal Alignments in Subminimal SpaceDaichi Fujiki, Shunhao Wu, Nathan Ozog, Kush Goliya, David T. Blaauw, Satish Narayanasamy, Reetuparna Das. 937-950 [doi]
- GenASM: A High-Performance, Low-Power Approximate String Matching Acceleration Framework for Genome Sequence AnalysisDamla Senol Cali, Gurpreet S. Kalsi, Zülal Bingöl, Can Firtina, Lavanya Subramanian, Jeremie S. Kim, Rachata Ausavarungnirun, Mohammed Alser, Juan Gómez-Luna, Amirali Boroumand, Anant Nori, Allison Scibisz, Sreenivas Subramoney, Can Alkan, Saugata Ghose, Onur Mutlu. 951-966 [doi]
- Selective Replication in Memory-Side GPU CachesXia Zhao 0004, Magnus Jahre, Lieven Eeckhout. 967-980 [doi]
- Deterministic Atomic BufferingYuan-Hsi Chou, Christopher Ng, Shaylin Cattell, Jeremy Intan, Matthew D. Sinclair, Joseph Devietti, Timothy G. Rogers, Tor M. Aamodt. 981-995 [doi]
- BOW: Breathing Operand Windows to Exploit Bypassing in GPUsHodjat Asghari Esfeden, AmirAli Abdolrashidi, Shafiur Rahman, Daniel Wong 0001, Nael B. Abu-Ghazaleh. 996-1008 [doi]
- MDM: The GPU Memory Divergence ModelLu Wang 0019, Magnus Jahre, Almutaz Adileh, Lieven Eeckhout. 1009-1021 [doi]
- Locality-Centric Data and Threadblock Management for Massive GPUsMahmoud Khairy, Vadim Nikiforov, David Nellans, Timothy G. Rogers. 1022-1036 [doi]
- Mesorasi: Architecture Support for Point Cloud Analytics via Delayed-AggregationYu Feng, Boyuan Tian, Tiancheng Xu, Paul N. Whatmough, Yuhao Zhu 0001. 1037-1050 [doi]
- FlexWatts: A Power- and Workload-Aware Hybrid Power Delivery Network for Energy-Efficient MicroprocessorsJawad Haj-Yahya, Mohammed Alser, Jeremie S. Kim, Lois Orosa 0001, Efraim Rotem, Avi Mendelson, Anupam Chattopadhyay, Onur Mutlu. 1051-1066 [doi]
- Building the Computing System for Autonomous Micromobility Vehicles: Design Constraints and Architectural OptimizationsBo Yu, Wei Hu, Leimeng Xu, Jie Tang 0003, Shaoshan Liu, Yuhao Zhu 0001. 1067-1081 [doi]
- AutoScale: Energy Efficiency Optimization for Stochastic Edge Inference Using Reinforcement LearningYoung-geun Kim, Carole-Jean Wu. 1082-1096 [doi]
- NCPU: An Embedded Neural CPU Architecture on Resource-Constrained Low Power Devices for Real-time End-to-End PerformanceTianyu Jia, Yuhao Ju, Russ Joseph, Jie Gu. 1097-1109 [doi]
- CaSA: End-to-end Quantitative Security Analysis of Randomly Mapped CachesThomas Bourgeat, Jules Drean, Yuheng Yang, Lillian Tsai, Joel Emer, Mengjia Yan. 1110-1123 [doi]
- PerSpectron: Detecting Invariant Footprints of Microarchitectural Attacks with PerceptronSamira Mirbagher Ajorpaz, Gilles Pokam, Esmaeil Mohammadian Koruyeh, Elba Garza, Nael B. Abu-Ghazaleh, Daniel A. Jiménez. 1124-1137 [doi]
- Speculation Invariance (InvarSpec): Faster Safe Execution Through Program AnalysisZirui Neil Zhao, Houxiang Ji, Mengjia Yan, Jiyong Yu, Christopher W. Fletcher, Adam Morrison 0001, Darko Marinov, Josep Torrellas. 1138-1152 [doi]
- Hardware-based Always-On Heap Memory SafetyYonghae Kim, Jaekyu Lee, Hyesoon Kim. 1153-1166 [doi]