Abstract is missing.
- Exascale Computing - A Fact or a Fiction?Shekhar Borkar. 3 [doi]
- Adaptive Incremental Checkpointing via Delta Compression for Networked Multicore SystemsItthichok Jangjaimon, Nian-Feng Tzeng. 7-18 [doi]
- Towards Scalable Checkpoint Restart: A Collective Inline Memory Contents Deduplication ProposalBogdan Nicolae. 19-28 [doi]
- Optimizing Checkpoints Using NVM as Virtual MemorySudarsun Kannan, Ada Gavrilovska, Karsten Schwan, Dejan S. Milojicic. 29-40 [doi]
- On Closed Nesting and Checkpointing in Fault-Tolerant Distributed Transactional MemoryAditya Dhoke, Binoy Ravindran, Bo Zhang. 41-52 [doi]
- Reliable Service Allocation in CloudsOlivier Beaumont, Lionel Eyraud-Dubois, Hubert Larchevêque. 55-66 [doi]
- Scaling and Scheduling to Maximize Application Performance within Budget Constraints in Cloud WorkflowsMing Mao, Marty Humphrey. 67-78 [doi]
- Optimizing Resource allocation while handling SLA violations in Cloud Computing platformsLionel Eyraud-Dubois, Hubert Larchevêque. 79-87 [doi]
- V-Cache: Towards Flexible Resource Provisioning for Multi-tier Applications in IaaS CloudsYanfei Guo, Palden Lama, Jia Rao, Xiaobo Zhou. 88-99 [doi]
- High-throughput Analysis of Large Microscopy Image Datasets on CPU-GPU Cluster PlatformsGeorge Teodoro, Tony Pan, Tahsin M. Kurç, Jun Kong, Lee A. D. Cooper, Norbert Podhorszki, Scott Klasky, Joel H. Saltz. 103-114 [doi]
- High Performance FFT Based Poisson Solver on a CPU-GPU Heterogeneous PlatformJing Wu, Joseph JáJá. 115-125 [doi]
- Design and Implementation of the Linpack Benchmark for Single and Multi-node Systems Based on Intel® Xeon Phi CoprocessorAlexander Heinecke, Karthikeyan Vaidyanathan, Mikhail Smelyanskiy, Alexander Kobotov, Roman Dubtsov, Greg Henry, Aniruddha G. Shet, George Chrysos, Pradeep Dubey. 126-137 [doi]
- Self-Adaptive OmpSs Tasks in Heterogeneous EnvironmentsJudit Planas, Rosa M. Badia, Eduard Ayguadé, Jesús Labarta. 138-149 [doi]
- RAIR: Interference Reduction in Regionalized Networks-on-ChipLizhong Chen, Kai Hwang, Timothy Mark Pinkston. 153-164 [doi]
- An Analytical Performance Model for Partitioning Off-Chip Memory BandwidthRuisheng Wang, Lizhong Chen, Timothy Mark Pinkston. 165-176 [doi]
- A Case for Handshake in Nanophotonic InterconnectsLei Wang, Jagadish Jayabalan, Minseon Ahn, Haiyin Gu, Ki Hwan Yum, Eun Jung Kim 0001. 177-188 [doi]
- P-sync: A Photonically Enabled Architecture for Efficient Non-local Data AccessDavid Whelihan, Jeffrey J. Hughes, Scott M. Sawyer, Eric Robinson, Michael Wolf, Sanjeev Mohindra, Julie Mullen, Anna Klein, Michelle S. Beard, Nadya T. Bliss, Johnnie Chan, Robert Hendry, Keren Bergman, Luca P. Carloni. 189-200 [doi]
- Optimizations and Analysis of BSP Graph Processing Models on Public CloudsMark Redekopp, Yogesh Simmhan, Viktor K. Prasanna. 203-214 [doi]
- Parallel Label-Setting Multi-objective Shortest Path SearchPeter Sanders, Lawrence Mandow. 215-224 [doi]
- Multi-threaded Graph PartitioningDominique Lasalle, George Karypis. 225-236 [doi]
- High-Productivity and High-Performance Analysis of Filtered Semantic GraphsAydin Buluç, Erika Duriakova, Armando Fox, John R. Gilbert, Shoaib Kamil, Adam Lugowski, Leonid Oliker, Samuel Williams. 237-248 [doi]
- Virtual Systolic Array for QR DecompositionJakub Kurzak, Piotr Luszczek, Mark Gates, Ichitaro Yamazaki, Jack Dongarra. 251-260 [doi]
- Communication-Optimal Parallel Recursive Rectangular Matrix MultiplicationJames Demmel, David Eliahu, Armando Fox, Shoaib Kamil, Benjamin Lipshitz, Oded Schwartz, Omer Spillinger. 261-272 [doi]
- Improving the Performance of the Symmetric Sparse Matrix-Vector Multiplication in MulticoreTheodoros Gkountouvas, Vasileios Karakasis, Kornilios Kourtis, Georgios I. Goumas, Nectarios Koziris. 273-283 [doi]
- Automated Rapid Prototyping of Regular Grid-Based Numerical Applications Using Generalized Elemental SubroutinesYingchong Situ, Ye Wang, Zhiyuan Li. 284-294 [doi]
- A Transparent Collective I/O ImplementationYongen Yu, Jingjin Wu, Zhiling Lan, Douglas H. Rudd, Nickolay Y. Gnedin, Andrey V. Kravtsov. 297-307 [doi]
- A Visual Network Analysis Method for Large-Scale Parallel I/O SystemsCarmen Sigovan, Chris Muelder, Kwan-Liu Ma, Jason Cope, Kamil Iskra, Robert B. Ross. 308-319 [doi]
- FlexIO: I/O Middleware for Location-Flexible Scientific Data AnalyticsFang Zheng, Hongbo Zou, Greg Eisenhauer, Karsten Schwan, Matthew Wolf, Jai Dayal, Tuan-Anh Nguyen, Jianting Cao, Hasan Abbasi, Scott Klasky, Norbert Podhorszki, Hongfeng Yu. 320-331 [doi]
- Burstiness-aware Server Consolidation via Queuing Theory Approach in a Computing CloudZhaoyi Luo, Zhuzhong Qian. 332-341 [doi]
- Pattern-Direct and Layout-Aware Replication Scheme for Parallel I/O SystemsYanlong Yin, Jibing Li, Jun He, Xian-He Sun, Rajeev Thakur. 345-356 [doi]
- Disk-Cache and Parallelism Aware I/O Scheduling to Improve Storage System PerformanceRamya Prabhakar, Mahmut T. Kandemir, Myoungsoo Jung. 357-368 [doi]
- Efficient and Scalable Retrieval Techniques for Global File PropertiesDong H. Ahn, Michael J. Brim, Bronis R. de Supinski, Todd Gamblin, Gregory L. Lee, Matthew P. LeGendre, Barton P. Miller, Adam Moody, Martin Schulz. 369-380 [doi]
- iBridge: Improving Unaligned Parallel File Access with Solid-State DrivesXuechen Zhang, Ke Liu, Kei Davis, Song Jiang. 381-392 [doi]
- Locally Self-Adjusting Tree NetworksChen Avin, Bernhard Haeupler, Zvi Lotker, Christian Scheideler, Stefan Schmid. 395-406 [doi]
- A Network Configuration Algorithm Based on Optimization of Kirchhoff IndexAdam Hackett, Deepak Ajwani, Shoukat Ali, Steve Kirkland, John P. Morrison. 407-417 [doi]
- Malleable SortingPatrick Flick, Peter Sanders, Jochen Speck. 418-426 [doi]
- Adapting Particle Filter Algorithms to Many-Core ArchitecturesMehdi Chitchian, Alexander S. van Amesfoort, Andrea Simonetto, Tamás Keviczky, Henk J. Sips. 427-438 [doi]
- Guided Region-Based GPU Scheduling: Utilizing Multi-thread Parallelism to Hide Memory LatencyJianmin Chen, Xi Tao, Zhen Yang, Jih-Kwon Peir, Xiaoyuan Li, Shih-Lien Lu. 441-451 [doi]
- Optimizing and Auto-Tuning Iterative Stencil Loops for GPUs with the In-Plane MethodWai Teng Tang, Wen Jun Tan, Ratna Krishnamoorthy, Yi Wen Wong, Shyh-hao Kuo, Rick Siow Mong Goh, Stephen John Turner, Weng-Fai Wong. 452-462 [doi]
- Data-Driven Versus Topology-driven Irregular Computations on GPUsRupesh Nasre, Martin Burtscher, Keshav Pingali. 463-474 [doi]
- HQL: A Scalable Synchronization Mechanism for GPUsAyse Yilmazer, David R. Kaeli. 475-486 [doi]
- Pluggable Watchdog: Transparent Failure Detection for MPI ProgramsKeun Soo Yim, Zbigniew Kalbarczyk, Ravishankar K. Iyer. 489-500 [doi]
- Improving the Computing Efficiency of HPC Systems Using a Combination of Proactive and Preventive CheckpointingMohamed-Slim Bouguerra, Ana Gainaru, Leonardo Arturo Bautista Gomez, Franck Cappello, Satoshi Matsuoka, Naoya Maruyama. 501-512 [doi]
- CASTED: Core-Adaptive Software Transient Error Detection for Tightly Coupled CoresKonstantina Mitropoulou, Vasileios Porpodas, Marcelo Cintra. 513-524 [doi]
- Contention Resolution in a Non-synchronized Multiple Access ChannelGianluca De Marco, Dariusz R. Kowalski. 525-533 [doi]
- Generalized Hierarchical All-to-All Exchange PatternsBogdan Prisacari, Germán Rodríguez, Cyriel Minkenberg. 537-547 [doi]
- Minimizing Communication in All-Pairs Shortest PathsEdgar Solomonik, Aydin Buluç, James Demmel. 548-559 [doi]
- Programmable and Scalable Reductions on ClustersJan Ciesko, Javier Bueno, Nikola Puzovic, Alex Ramírez, Rosa M. Badia, Jesús Labarta. 560-568 [doi]
- JVM-Bypass for Efficient Hadoop ShufflingYandong Wang, Cong Xu, Xiaobing Li, Weikuan Yu. 569-578 [doi]
- Resource Management in VMware Powered Cloud: Concepts and TechniquesPradeep Padala. 581 [doi]
- Communication-Avoiding Algorithms for Linear Algebra and BeyondJames Demmel. 585 [doi]
- Oversubscription Bounded Multicast Scheduling in Fat-Tree Data Center NetworksZhiyang Guo, Jun Duan, Yuanyuan Yang. 589-600 [doi]
- Replicate and Bundle (RnB) - A Mechanism for Relieving Bottlenecks in Data CentersShachar Raindel, Yitzhak Birk. 601-610 [doi]
- Profit Aware Load Balancing for Distributed Cloud Data CentersShuo Liu, Shaolei Ren, Gang Quan, Ming Zhao, Shangping Ren. 611-622 [doi]
- Joint Host-Network Optimization for Energy-Efficient Data Center NetworkingHao Jin, Tosmate Cheocherngngarn, Dmita Levy, Alex Smith, Deng Pan, Jason Liu, Niki Pissinou. 623-634 [doi]
- Energy-Efficient Scheduling for Best-Effort Interactive Services to Achieve High Response QualityZhihui Du, Hongyang Sun, Yuxiong He, Yu He, David A. Bader, Huazhe Zhang. 637-648 [doi]
- Perfect Strong Scaling Using No Additional EnergyJames Demmel, Andrew Gearhart, Benjamin Lipshitz, Oded Schwartz. 649-660 [doi]
- A Roofline Model of EnergyJee Whan Choi, Daniel Bedard, Robert J. Fowler, Richard Vuduc. 661-672 [doi]
- A Simplified and Accurate Model of Power-Performance Efficiency on Emergent GPU ArchitecturesShuaiwen Song, Chun-Yi Su, Barry Rountree, Kirk W. Cameron. 673-686 [doi]
- Acceleration of an Asynchronous Message Driven Programming Paradigm on IBM Blue Gene/QSameer Kumar, Yanhua Sun, Laximant V. Kalé. 689-699 [doi]
- Communication-Based Mapping Using Shared PagesMatthias Diener, Eduardo Henrique Molina da Cruz, Philippe Olivier Alexandre Navaux. 700-711 [doi]
- Integrating Asynchronous Task Parallelism with MPISanjay Chatterjee, Sagnak Tasirlar, Zoran Budimlic, Vincent Cavé, Milind Chabbi, Max Grossman, Vivek Sarkar, YongHong Yan. 712-725 [doi]
- DTN-FLOW: Inter-Landmark Data Flow for High-Throughput Routing in DTNsKang Chen, Haiying Shen. 726-737 [doi]
- WHATSUP: A Decentralized Instant News RecommenderAntoine Boutet, Davide Frey, Rachid Guerraoui, Arnaud Jégou, Anne-Marie Kermarrec. 741-752 [doi]
- Crowdsourcing under Real-Time ConstraintsIoannis Boutsis, Vana Kalogeraki. 753-764 [doi]
- Replication-Based Load Balancing in Distributed Content-Based Publish/SubscribeWeixiong Rao, Chao Chen, Pan Hui, Sasu Tarkoma. 765-774 [doi]
- ZHT: A Light-Weight Reliable Persistent Dynamic Scalable Zero-Hop Distributed Hash TableTonglin Li, Xiaobing Zhou, Kevin Brandstatter, Dongfang Zhao, Ke Wang, Anupam Rajendran, Zhao Zhang, Ioan Raicu. 775-787 [doi]
- A Theoretical Framework for Algorithm-Architecture Co-designKenneth Czechowski, Richard W. Vuduc. 791-802 [doi]
- Wait-free Hyperobjects for Task-Parallel Programming SystemsMartin Wimmer. 803-812 [doi]
- Cyclops Tensor Framework: Reducing Communication and Eliminating Load Imbalance in Massively Parallel ContractionsEdgar Solomonik, Devin Matthews, Jeff Hammond, James Demmel. 813-824 [doi]
- Scaling Techniques for Massive Scale-Free Graphs in Distributed (External) MemoryRoger A. Pearce, Maya Gokhale, Nancy M. Amato. 825-836 [doi]
- Scheduling Tree-Shaped Task Graphs to Minimize Memory and MakespanLoris Marchal, Oliver Sinnen, Frédéric Vivien. 839-850 [doi]
- On Graphs, GPUs, and Blind Dating: A Workload to Processor Matchmaking QuestAbdullah Gharaibeh, Lauro Beltrão Costa, Elizeu Santos-Neto, Matei Ripeanu. 851-862 [doi]
- Non Linear Divisible Loads: There is No Free LunchOlivier Beaumont, Hubert Larchevêque, Loris Marchal. 863-873 [doi]
- SIPMaP: A Tool for Modeling Irregular Parallel Computations in the Super Instruction ArchitectureNakul Jindal, Victor Lotrich, Erik Deumens, Beverly A. Sanders. 874-884 [doi]
- Big Data in 10 YearsRaghu Ramakrishnan. 887 [doi]
- HPC Cloud Bad; HPC in the Cloud GoodJosh Simons. 891 [doi]
- Implementing a Blocked Aasen's Algorithm with a Dynamic Scheduler on Multicore ArchitecturesGrey Ballard, Dulceneia Becker, James Demmel, Jack Dongarra, Alex Druinsky, Inon Peled, Oded Schwartz, Sivan Toledo, Ichitaro Yamazaki. 895-907 [doi]
- DLOOP: A Flash Translation Layer Exploiting Plane-Level ParallelismAbdul Rahman Abdurrab, Tao Xie, Wei Wang. 908-918 [doi]
- Exploring Traditional and Emerging Parallel Programming Models Using a Proxy ApplicationIan Karlin, Abhinav Bhatele, Jeff Keasler, Bradford L. Chamberlain, Jonathan Cohen, Zachary Devito, Riyaz Haque, Dan Laney, Edward Luke, Felix Wang, David Richards, Martin Schulz, Charles H. Still. 919-932 [doi]
- Extending the Generality of Molecular Dynamics Simulations on a Special-Purpose MachineDaniele Paolo Scarpazza, Douglas J. Ierardi, Adam K. Lerer, Kenneth M. Mackenzie, Albert C. Pan, Joseph A. Bank, Edmond Chow, Ron O. Dror, J. P. Grossman, Daniel Killebrew, Mark A. Moraes, Cristian Predescu, John K. Salmon, David E. Shaw. 933-945 [doi]
- Algorithms for the Thermal Scheduling ProblemKoyel Mukherjee, Samir Khuller, Amol Deshpande. 949-960 [doi]
- Lock-Free and Wait-Free Slot Scheduling AlgorithmsPooja Aggarwal, Smruti R. Sarangi. 961-972 [doi]
- Distributed Algorithms for Scheduling on Line and Tree Networks with Non-uniform BandwidthsVenkatesan T. Chakaravarthy, Anamitra R. Choudhury, Sambuddha Roy, Yogish Sabharwal. 973-984 [doi]
- Analysis of Randomized Work Stealing with False SharingRichard Cole, Vijaya Ramachandran. 985-998 [doi]
- Extending OpenSHMEM for GPU ComputingSreeram Potluri, Devendar Bureddy, Hao Wang, Hari Subramoni, Dhabaleswar K. Panda. 1001-1012 [doi]
- Deploying Graph Algorithms on GPUs: An Adaptive SolutionDa Li, Michela Becchi. 1013-1024 [doi]
- GPU-based Runtime VerificationShay Berkovich, Borzoo Bonakdarpour, Sebastian Fischmeister. 1025-1036 [doi]
- Kernel Specialization for Improved Adaptability and Performance on Graphics Processing Units (GPUs)Nicholas Moore, Miriam Leeser, Laurie A. Smith King. 1037-1048 [doi]
- The Bounded Data Reuse Problem in Scientific WorkflowsMohsen Zohrevandi, Rida A. Bazzi. 1051-1062 [doi]
- Performance Analysis of the Lattice Boltzmann Model Beyond Navier-StokesAmanda Peters Randles, Vivek Kale, Jeff Hammond, William Gropp, Efthimios Kaxiras. 1063-1074 [doi]
- A Communication-Optimal N-Body Algorithm for Direct InteractionsMichael B. Driscoll, Evangelos Georganas, Penporn Koanantakool, Edgar Solomonik, Katherine A. Yelick. 1075-1084 [doi]
- Exploring SIMD for Molecular Dynamics, Using Intel® Xeon® Processors and Intel® Xeon Phi CoprocessorsSimon J. Pennycook, Chris J. Hughes, M. Smelyanskiy, Stephen A. Jarvis. 1085-1097 [doi]
- Multi-vehicle Coordination for Wireless Energy Replenishment in Sensor NetworksCong Wang, Ji Li, Fan Ye, Yuanyuan Yang. 1101-1111 [doi]
- On Feasibility of Fingerprinting Wireless Sensor Nodes Using Physical PropertiesXiaowei Mei, Donggang Liu, Kun Sun, Dingbang Xu. 1112-1121 [doi]
- Distributed Algorithms for Joint Routing and Frame Aggregation in 802.11n Wireless Mesh NetworksDawei Gong, Yuanyuan Yang. 1122-1132 [doi]
- Distributed Low-Latency Out-of-Order Event Processing for High Data Rate Sensor StreamsChristopher Mutschler, Michael Philippsen. 1133-1144 [doi]
- Agreement via Symmetry Breaking: On the Structure of Weak Subconsensus TasksArmando Castañeda, Sergio Rajsbaum, Michel Raynal. 1147-1158 [doi]
- A Multi-partitioning Approach to Building Fast and Accurate Counting Bloom FiltersKun Huang, Jie Zhang, Dafang Zhang, Gaogang Xie, Kavé Salamatian, Alex X. Liu, Wei Li. 1159-1170 [doi]
- Composing Relaxed TransactionsVincent Gramoli, Rachid Guerraoui, Mihai Letia. 1171-1182 [doi]
- Throughput Enhancement through Selective Time Sharing and Dynamic GroupingJunliang Chen, Bing Bing Zhou, Chen Wang, Peng Lu, PengHao Wang, Albert Y. Zomaya. 1183-1192 [doi]
- Novel Parallelization Schemes for Large-Scale Likelihood-based Phylogenetic InferenceAlexandros Stamatakis, Andre J. Aberer. 1195-1204 [doi]
- Integrating Online Compression to Accelerate Large-Scale Data Analytics ApplicationsTekin Bicer, Jian Yin, David Chiu, Gagan Agrawal, Karen Schuchardt. 1205-1216 [doi]
- Massively Parallel Model of Extended Memory Use in Evolutionary Game DynamicsAmanda Peters Randles, David G. Rand, Christopher Lee, Greg Morrisett, Jayanta Sircar, Martin A. Nowak, Hanspeter Pfister. 1217-1228 [doi]
- Early Experience on the Blue Gene/Q Supercomputing SystemVitali A. Morozov, Kalyan Kumaran, Venkatram Vishwanath, Jiayuan Meng, Michael E. Papka. 1229-1240 [doi]
- Adaptive Cache Bypassing for Inclusive Last Level CachesSaurabh Gupta, Hongliang Gao, Huiyang Zhou. 1243-1253 [doi]
- Hardware-Accelerated Regular Expression Matching with Overlap Handling on IBM PowerEN ProcessorKubilay Atasu, Florian Doerfler, Jan van Lunteren, Christoph Hagleitner. 1254-1265 [doi]
- TM-dietlibc: A TM-aware Real-World System LibraryVesna Smiljkovic, Martin Nowack, Neboja Miletic, Tim Harris, Osman S. Ünsal, Adrián Cristal, Mateo Valero. 1266-1274 [doi]
- Cura: A Cost-Optimized Model for MapReduce in a CloudBalaji Palanisamy, Aameek Singh, Ling Liu, Bryan Langston. 1275-1286 [doi]
- A Scalable Heterogeneous Parallelization Framework for Iterative Local SearchesMartin Burtscher, Hassan Rabeti. 1289-1298 [doi]
- XKaapi: A Runtime System for Data-Flow Task Programming on Heterogeneous ArchitecturesThierry Gautier, João V. F. Lima, Nicolas Maillard, Bruno Raffin. 1299-1308 [doi]
- A Study of the Behavior of Synchronization Methods in Commonly Used Languages and SystemsDaniel Cederman, Bapi Chatterjee, Nhan Nguyen Dang, Yiannis Nikolakopoulos, Marina Papatriantafilou, Philippas Tsigas. 1309-1320 [doi]
- Managing Asynchronous Operations in Coarray Fortran 2.0Chaoran Yang, Karthik Murthy, John M. Mellor-Crummey. 1321-1332 [doi]