Abstract is missing.
- GeST: An Automatic Framework For Generating CPU Stress-TestsZacharias Hadjilambrou, Shidhartha Das, Paul N. Whatmough, David M. Bull, Yiannakis Sazeides. 1-10 [doi]
- Characterization of Unnecessary Computations in Web ApplicationsHossein Golestani, Scott A. Mahlke, Satish Narayanasamy. 11-21 [doi]
- Demystifying Crypto-Mining: Analysis and Optimizations of Memory-Hard PoW AlgorithmsRunchao Han, Nikos Foutris, Christos Kotselidis. 22-33 [doi]
- One Size Does Not Fit All: Quantifying and Exposing the Accuracy-Latency Trade-Off in Machine Learning Cloud Service APIs via Tolerance TiersMatthew Halpern, Behzad Boroujerdian, Todd Mummert, Evelyn Duesterwald, Vijay Janapa Reddi. 34-47 [doi]
- The POP Detector: A Lightweight Online Program Phase Detection FrameworkKarl Taht, James Greensky, Rajeev Balasubramonian. 48-57 [doi]
- Racing to Hardware-Validated SimulationAlmutaz Adileh, Cecilia González-Alvarez, Juan Miguel De Haro Ruiz, Lieven Eeckhout. 58-67 [doi]
- Full-System Simulation of Mobile CPU/GPU PlatformsKuba Kaszyk, Harry Wagstaff, Tom Spink, Björn Franke, Michael F. P. O'Boyle, Bruno Bodin, Henrik Uhrenholt. 68-78 [doi]
- Modeling Deep Learning Accelerator Enabled GPUsMd Aamir Raihan, Negar Goli, Tor M. Aamodt. 79-92 [doi]
- Emulating and Evaluating Hybrid Memory for Managed Languages on NUMA HardwareShoaib Akram, Jennifer B. Sartor, Kathryn S. McKinley, Lieven Eeckhout. 93-105 [doi]
- On the Impact of Instruction Address Translation OverheadYufeng Zhou, Xiaowan Dong, Alan L. Cox, Sandhya Dwarkadas. 106-116 [doi]
- Quantifying Process Variations and Its Impacts on SmartphonesGuru Prasad Srinivasa, Scott Haseley, Geoffrey Challen, Mark Hempstead. 117-126 [doi]
- Assessing the Effects of Low Voltage in Branch Prediction UnitsAthanasios Chatzidimitriou, George Papadimitriou, Dimitris Gizopoulos, Shrikanth Ganapathy, John Kalamatianos. 127-136 [doi]
- Tango: A Deep Neural Network Benchmark Suite for Various AcceleratorsAajna Karki, Chethan Palangotu Keshava, Spoorthi Mysore Shivakumar, Joshua Skow, Goutam Madhukeshwar Hegde, Hyeran Jeon. 137-138 [doi]
- PARADISE - Post-Moore Architecture and Accelerator Design Space Exploration Using Device Level Simulation and ExperimentsDilip P. Vasudevan, George Michelogiannakis, David Donofrio, John Shalf. 139-140 [doi]
- A Detailed Model for Contemporary GPU Memory SystemsMahmoud Khairy, Akshay Jain, Tor M. Aamodt, Timothy G. Rogers. 141-142 [doi]
- DSMM: A Dynamic Setting for Memory Management in Apache SparkSuk-Joo Chae, Tae-Sun Chung. 143-144 [doi]
- Fast Modeling of the L2 Cache Reuse Distance Histograms from Software TracesJiancong Ge, Ming Ling. 145-146 [doi]
- FlexCPU: A Configurable Out-of-Order CPU AbstractionBradley Wang, Ayaz Akram, Jason Lowe-Power. 147-148 [doi]
- Hierarchical Page Eviction Policy for Unified Memory in GPUsQi Yu 0003, Bruce R. Childers, Libo Huang, Cheng Qian, Zhiying Wang. 149-150 [doi]
- Analyzing Machine Learning Workloads Using a Detailed GPU SimulatorJonathan Lew, Deval A. Shah, Suchita Pati, Shaylin Cattell, Mengchi Zhang, Amruth Sandhupatla, Christopher Ng, Negar Goli, Matthew D. Sinclair, Timothy G. Rogers, Tor M. Aamodt. 151-152 [doi]
- Empirical Investigation of Stale Value Tolerance on Parallel RNN TrainingJoo Hwan Lee, Hyesoon Kim. 153-164 [doi]
- Characterizing Sources of Ineffectual Computations in Deep Learning NetworksMilos Nikolic, Mostafa Mahmoud, Andreas Moshovos, Yiren Zhao, Robert Mullins. 165-176 [doi]
- Demystifying Bayesian Inference WorkloadsYu Emma Wang, Yuhao Zhu, Glenn G. Ko, Brandon Reagen, Gu-Yeon Wei, David Brooks 0001. 177-189 [doi]
- Workload Characterization of Nondeterministic Programs Parallelized by STATSEnrico Armenio Deiana, Simone Campanoni. 190-201 [doi]
- Parallelism Analysis of Prominent Desktop Applications: An 18- Year PerspectiveSiying Feng, Subhankar Pal, Yichen Yang, Ronald G. Dreslinski. 202-211 [doi]
- µqSim: Enabling Accurate and Scalable Simulation for Interactive MicroservicesYanqi Zhang, Yu Gan, Christina Delimitrou. 212-222 [doi]
- Distributed Software Defined Networking Controller Failure Mode and Availability AnalysisPaul Reeser, Guilhem Tesseyre, Marcus Callaway. 223-232 [doi]
- A Model Driven Approach Towards Improving the Performance of Apache Spark ApplicationsKewen Wang, Mohammad Maifi Hasan Khan, Nhan Nguyen, Swapna S. Gokhale. 233-242 [doi]
- An Improved Dynamic Vertical Partitioning Technique for Semi-Structured DataSahel Sharify, Alan W. Lu, Jin Chen, Arnamoy Bhattacharyya, Ali B. Hashemi, Nick Koudas, Cristiana Amza. 243-256 [doi]
- RPPM: Rapid Performance Prediction of Multithreaded Workloads on Multicore ProcessorsSander De Pestel, Sam Van den Steen, Shoaib Akram, Lieven Eeckhout. 257-267 [doi]
- HeteroMap: A Runtime Performance Predictor for Efficient Processing of Graph Analytics on Heterogeneous Multi-AcceleratorsMasab Ahmad, Halit Dogan, Christopher J. Michael, Omer Khan. 268-281 [doi]
- mRNA: Enabling Efficient Mapping Space Exploration for a Reconfiguration Neural AcceleratorZhongyuan Zhao, Hyoukjun Kwon, Sachit Kuhar, Weiguang Sheng, Zhigang Mao, Tushar Krishna. 282-292 [doi]
- DeLTA: GPU Performance Model for Deep Learning Applications with In-Depth Memory System Traffic AnalysisSangkug Lym, Donghyuk Lee, Mike O'Connor, Niladrish Chatterjee, Mattan Erez. 293-303 [doi]
- Timeloop: A Systematic Approach to DNN Accelerator EvaluationAngshuman Parashar, Priyanka Raina, Yakun Sophia Shao, Yu-Hsin Chen, Victor A. Ying, Anurag Mukkara, Rangharajan Venkatesan, Brucek Khailany, Stephen W. Keckler, Joel S. Emer. 304-315 [doi]