Abstract is missing.
- Machine learning for performance and power modeling/predictionLizy Kurian John. 1-2 [doi]
- Sharing the instruction cache among lean cores on an asymmetric CMP for HPC applicationsUgljesa Milic, Alejandro Rico, Paul M. Carpenter, Alex Ramírez. 3-12 [doi]
- Performance competitiveness of a statically compiled language for server-side Web applicationsYohei Ueda, Moriyoshi Ohara. 13-22 [doi]
- Analyzing the scalability of managed language applications with speedup stacksJennifer B. Sartor, Kristof Du Bois, Stijn Eyerman, Lieven Eeckhout. 23-32 [doi]
- PMAL: Enabling lightweight adaptation of legacy file systems on persistent memory systemsHyunsub Song, Young Je Moon, Se Kwon Lee, Sam H. Noh. 33-42 [doi]
- Chai: Collaborative heterogeneous applications for integrated-architecturesJuan Gómez-Luna, Izzat El Hajj, Li-Wen Chang, Victor Garcia-Flores, Simon Garcia De Gonzalo, Thomas B. Jablin, Antonio J. Peña, Wen-mei W. Hwu. 43-54 [doi]
- Performance analysis of CNN frameworks for GPUsHeehoon Kim, Hyoungwook Nam, Wookeun Jung, Jaejin Lee. 55-64 [doi]
- GaaS workload characterization under NUMA architecture for virtualized GPUHuixiang Chen, Meng Wang, Yang Hu, Mingcong Song, Tao Li. 65-76 [doi]
- Fast IPC estimation for performance projections using proxy suites and decision treesKanishka Lahiri, Subhash Kunnoth. 77-86 [doi]
- Accurate address streams for LLC and beyond (SLAB): A methodology to enable system explorationReena Panda, Xinnian Zheng, Lizy Kurian John. 87-96 [doi]
- Clone morphing: Creating new workload behavior from existing applicationsYipeng Wang, Amro Awad, Yan Solihin. 97-108 [doi]
- Crossing the architectural barrier: Evaluating representative regions of parallel HPC applicationsAlexandra Ferreron, Radhika Jagtap, Sascha Bischoff, Roxana Rusitoru. 109-120 [doi]
- Characterization of GPGPU workloads on a multidimensional heterogeneous processorMatthew A. Watkins, Philip Bedoukian. 121-122 [doi]
- Service capacity measurement by redlining with live production trafficSusie Xia, Zhenyun Zhuang, Anant Rao, Haricharan Ramachandra, Yi Feng, Ramya Pasumarti. 123-124 [doi]
- Predicting memory page stability and its application to memory deduplication and live migrationKarim Elghamrawy, Diana Franklin, Frederic T. Chong. 125-126 [doi]
- Analyzing OpenCL 2.0 workloads using a heterogeneous CPU-GPU simulatorLi Wang, Ren-Wei Tsai, Shao-Chung Wang, Kun-Chih Chen, Po-Han Wang, Hsiang-Yun Cheng, Yi-Chung Lee, Sheng-Jie Shu, Chun-Chieh Yang, Min-Yih Hsu, Li-Chen Kan, Chao-Lin Lee, Tzu-Chieh Yu, Rih-Ding Peng, Chia-Lin Yang, Yuan-Shin Hwang, Jenq Kuen Lee, Shiao Li Tsao, Ming Ouhyoung. 127-128 [doi]
- Microarchitecture level reliability comparison of modern GPU designs: First findingsAlessandro Vallero, Stefano Di Carlo, Sotiris Tselonis, Dimitris Gizopoulos. 129-130 [doi]
- DARTS: Performance-counter driven sampling using binary translatorsRajesh Kumar, Suchita Pati, Kanishka Lahiri. 131-132 [doi]
- Docker characterization on high performance SSDsQiumin Xu, Manu Awasthi, Krishna T. Malladi, Janki Bhimani, Jingpei Yang, Murali Annavaram. 133-134 [doi]
- A taxonomy of out-of-order instruction commitMehdi Alipour, Trevor E. Carlson, Stefanos Kaxiras. 135-136 [doi]
- PTAT: An efficient and precise tool for collecting detailed TLB miss tracesJiutian Zhang, Yuhang Liu, Xiaojing Zhu, Yuan Ruan, Mingyu Chen. 137-138 [doi]
- Proxy benchmarks for emerging big-data workloadsReena Panda, Lizy Kurian John. 139-140 [doi]
- MaxSim: A simulation platform for managed applicationsAndrey Rodchenko, Christos Kotselidis, Andy Nisbet, Antoniu Pop, Mikel Luján. 141-152 [doi]
- dist-gem5: Distributed simulation of computer clustersMohammad Alian, Umur Darbaz, Gábor Dózsa, Stephan Diestelhorst, Daehoon Kim, Nam Sung Kim. 153-162 [doi]
- Prefetching for cloud workloads: An analysis based on address patternsJiajun Wang, Reena Panda, Lizy Kurian John. 163-172 [doi]
- Toolbox for exploration of energy-efficient event processors for human-computer interactionTayyar Rzayev, David H. Albonesi, François Guimbretière, Rajit Manohar, Jaeyeon Kihm. 173-184 [doi]
- HW/SW co-designed processors: Challenges, design choices and a simulation infrastructure for evaluationRakesh Kumar 0003, José Cano, Aleksandar Brankovic, Demos Pavlou, Kyriakos Stavrouz, Enric Gibert, Alejandro Martínez, Antonio Gonzalez. 185-194 [doi]
- OpenSMART: Single-cycle multi-hop NoC generator in BSV and ChiselHyoukjun Kwon, Tushar Krishna. 195-204 [doi]
- StressRight: Finding the right stress for accurate in-development system evaluationJaewon Lee, Hanhwi Jang, Jae-Eon Jo, Gyu-hyeon Lee, Jangwoo Kim. 205-216 [doi]
- SimBench: A portable benchmarking methodology for full-system simulatorsHarry Wagstaff, Bruno Bodin, Tom Spink, Björn Franke. 217-226 [doi]
- Treelogy: A benchmark suite for tree traversalsNikhil Hegde, Jianqiao Liu, Kirshanthan Sundararajah, Milind Kulkarni 0001. 227-238 [doi]
- Evaluating and mitigating bandwidth bottlenecks across the memory hierarchy in GPUsSaumay Dublish, Vijay Nagarajan, Nigel Topham. 239-248 [doi]
- SASSIFI: An architecture-level fault injection tool for GPU application resilience evaluationSiva Kumar Sastry Hari, Timothy Tsai, Mark Stephenson, Stephen W. Keckler, Joel S. Emer. 249-258 [doi]
- Exploring GPU performance, power and energy-efficiency bounds with Cache-aware Roofline ModelingAndre Lopes, Frederico Pratas, Leonel Sousa, Aleksandar Ilic. 259-268 [doi]
- Multi2Sim Kepler: A detailed architectural GPU simulatorXun Gong, Rafael Ubal, David R. Kaeli. 269-278 [doi]