Abstract is missing.
- HCW IntroductionBehrooz Shirazi, Uwe Schwiegelshohn. 1-2 [doi]
- Message from the HCW Steering Committee ChairBehrooz Shirazi. 3 [doi]
- Message from the HCW General ChairUwe Schwiegelshohn. 4 [doi]
- Message from the HCW Program ChairShoukat Ali. 5 [doi]
- HCW 2014 Keynote TalkDavid Abramson. 6 [doi]
- Hybrid Multi-elimination ILU Preconditioners on GPUsDimitar Lukarski, Hartwig Anzt, Stanimire Tomov, Jack Dongarra. 7-16 [doi]
- Searching for the Optimal Data Partitioning Shape for Parallel Matrix Matrix Multiplication on 3 Heterogeneous ProcessorsAshley M. DeFlumere, Alexey L. Lastovetsky. 17-28 [doi]
- Taking Advantage of Hybrid Systems for Sparse Direct Solvers via Task-Based RuntimesXavier Lacoste, Mathieu Faverge, George Bosilca, Pierre Ramet, Samuel Thibault. 29-38 [doi]
- Topology-Aware Optimization of Communications for Parallel Matrix Multiplication on Hierarchical Heterogeneous HPC PlatformTania Malik, Vladimir Rychkov, Alexey L. Lastovetsky, Jean-Noël Quintin. 39-47 [doi]
- Scheduling Methods for Accelerating Applications on Architectures with Heterogeneous CoresLinchuan Chen, Xin Huo, Gagan Agrawal. 48-57 [doi]
- Utility Driven Dynamic Resource Management in an Oversubscribed Energy-Constrained Heterogeneous SystemBhavesh Khemka, Ryan Friese, Sudeep Pasricha, Anthony A. Maciejewski, Howard Jay Siegel, Gregory A. Koenig, Sarah Powers, Marcia Hilton, Rajendra Rambharos, Steve Poole. 58-67 [doi]
- An Efficient Algorithm for Scheduling Jobs in Volunteer Computing PlatformsAdel Essafi, Denis Trystram, Zied Zaidi. 68-76 [doi]
- Resource Centered Computing Delivering High Parallel PerformanceJens Gustedt, Stéphane Vialle, Patrick Mercier. 77-88 [doi]
- Point-to-Point and Congestion Bandwidth Estimation: Experimental Evaluation on PlanetLab DataLionel Eyraud-Dubois, Przemyslaw Uznanski. 89-96 [doi]
- Runtime Behavior Comparison of Modern Accelerators and CoprocessorsAyman Tarakji, Niels Ole Salscheider. 97-108 [doi]
- RAW Introduction and CommitteesJürgen Becker, Ramachandran Vaidyanathan, Marco D. Santambrogio, Jim Tørresen, Ron Sass, Philip Heng Wai Leong. 109-110 [doi]
- RAW 2014 KeynotesJoshua D. Walstrom, Maya Gokhale. 111 [doi]
- Twill: A Hybrid Microcontroller-FPGA Framework for Parallelizing Single-Threaded C ProgramsDoug Gallatin, Aaron W. Keen, Chris Lupo, John Oliver. 112-121 [doi]
- A New Dataflow Compiler IR for Accelerating Control-Intensive Code in Spatial HardwareAli Mustafa Zaidi, David J. Greaves. 122-131 [doi]
- Efficient Software-Based Runtime Binary Translation for Coarse-Grained Reconfigurable ArchitecturesToan X. Mai, Jongeun Lee. 132-140 [doi]
- A Dependable Coarse-Grain Reconfigurable Multicore ArrayGeorgios Smaragdos, Danish Anis Khan, Ioannis Sourdis, Christos Strydis, Alirad Malek, Stavros Tzilis. 141-150 [doi]
- Automated Hybrid Interconnect Design for FPGA Accelerators Using Data Communication ProfilingCuong Pham-Quoc, Zaid Al-Ars, Koen Bertels. 151-160 [doi]
- SmartBricks: A Visual Environment to Design and Explore Novel Custom Domain-Specific ArchitecturesAnil Kumar Sistla, Xiaozhong Luo, Mukund Malladi, Marc Reisner, Rajasekhar Ganduri, Gayatri Mehta. 161-169 [doi]
- A Framework for Mapping Dynamic Virtual Kernels onto Heterogeneous Reconfigurable PlatformsHarry Sidiropoulos, Kostas Siozios, Dimitrios Soudris. 170-175 [doi]
- A Hybrid ILP-CP Model for Mapping Directed Acyclic Task Graphs to Multicore ArchitecturesAndreas Emeretlis, George Theodoridis, Panayiotis Alefragis, Nikolaos Voros. 176-182 [doi]
- A Framework for Customizing Virtual 3-D Reconfigurable Platforms at Run-TimeKostas Siozios, Dimitrios Soudris, Michael Hübner. 183-188 [doi]
- Over-clocking of Linear Projection Designs through Device Specific OptimisationsRui Policarpo Duarte, Christos-Savvas Bouganis. 189-198 [doi]
- Influence of Magnetic Fields and X-Radiation on Ring Oscillators in FPGAsMichael Raitza, Markus Vogt, Christian Hochberger, Thilo Pionteck. 199-204 [doi]
- Radiation Tolerance of Color Configuration on an Optically Reconfigurable Gate ArrayTakumi Fujimori, Minoru Watanabe. 205-210 [doi]
- Adaptive Booth Algorithm for Three-Integers Multiplication for Reconfigurable MeshEsti Stein, Yosi Ben-Asher. 211-219 [doi]
- An FPGA Implementation of the Hestenes-Jacobi Algorithm for Singular Value DecompositionXinying Wang, Joseph Zambreno. 220-227 [doi]
- CyGraph: A Reconfigurable Architecture for Parallel Breadth-First SearchOsama G. Attia, Tyler Johnson, Kevin Townsend, Philip Jones, Joseph Zambreno. 228-235 [doi]
- Adaptive Raytracing Implementation Using Partial Dynamic ReconfigurationGianluca Durelli, Fabrizio Spada, Riccardo Cattaneo, Christian Pilato, Danilo Pau, Marco D. Santambrogio. 236-242 [doi]
- PaRA-Sched: A Reconfiguration-Aware Scheduler for Reconfigurable ArchitecturesRiccardo Cattaneo, Riccardo Bellini, Gianluca Durelli, Christian Pilato, Marco D. Santambrogio, Donatella Sciuto. 243-250 [doi]
- An ILP-Based Optimal Circuit Mapping Method for PLDsHiroki Nishiyama, Masato Inagi, Shin'ichi Wakabayashi, Shinobu Nagayama, Keisuke Inoue, Mineo Kaneko. 251-256 [doi]
- High-Level Synthesis from C vs. a DSL-Based ApproachCristiano Bacelar de Oliveira, João M. P. Cardoso, Eduardo Marques. 257-262 [doi]
- An Evaluation of User Satisfaction Driven Scheduling in a Polymorphic Embedded SystemZhang Zhang, Swamy D. Ponpandi, Akhilesh Tyagi. 263-268 [doi]
- A Low-Latency Algorithm and FPGA Design for the Min-Search of LDPC DecodersGeorgios Tzimpragos, Christoforos Kachris, Dimitrios Soudris, Ioannis Tomkos. 269-274 [doi]
- FPGA Redundancy Configurations: An Automated Design Space ExplorationJahanzeb Anwer, Marco Platzner, Sebastian Meisner. 275-280 [doi]
- Hierarchical Pipeline Optimization of Coarse Grained Reconfigurable Processor for Multimedia ApplicationsChen Mei, Peng Cao, Yang Zhang, Bo Liu, Leibo Liu. 281-286 [doi]
- Module Placement Using Constraint Programming in Run-Time Reconfigurable SystemsAlexander Wold, Andreas Agne, Jim Torresen. 287-292 [doi]
- An Efficient Heterogeneous Register File Implementation for FPGAsHasan Erdem Yantir, Arda Yurdakul. 293-298 [doi]
- Minimizing Scrubbing Effort through Automatic Netlist Partitioning and FloorplanningBernhard Schmidt, Daniel Ziener, Jürgen Teich. 299-304 [doi]
- Virtualization Support for FPGA-Based Coprocessors Connected via PCI Express to an Intel Multicore PlatformViet Vu Duy, Timo Sandmann, Steffen Baehr, Oliver Sander, Jürgen Becker. 305-310 [doi]
- HIPS Introduction and CommitteesJohn Cavazos. 311 [doi]
- Bohrium: A Virtual Machine Approach to Portable ParallelismMads Ruben Burgdorff Kristensen, Simon Andreas Frimann Lund, Troels Blum, Kenneth Skovhede, Brian Vinter. 312-321 [doi]
- HATI: Hardware Assisted Thread Isolation for Concurrent C/C++ ProgramsJuan Carlos Martinez Santos, Yunsi Fei. 322-331 [doi]
- A General Model Checking Framework for Various Memory Consistency ModelsTatsuya Abe, Toshiyuki Maeda. 332-341 [doi]
- Autotuning Tensor TranspositionLai Wei, John J. Mellor-Crummey. 342-351 [doi]
- Automatic MPI-IO Tuning with the Periscope Tuning FrameworkWeifeng Liu, Isaías A. Comprés Ureña, Michael Gerndt, Bin Gong. 352-360 [doi]
- Optimizing Collective Communication in UPCJithin Jose, Khaled Hamidouche, Jie Zhang, Akshay Venkatesh, Dhabaleswar K. Panda. 361-370 [doi]
- SWIFT: A Transparent and Flexible Communication Layer for PCIe-Coupled Accelerators and (Co-)ProcessorsSimon Pickartz, Pablo Reble, Carsten Clauss, Stefan Lankes. 371-380 [doi]
- Deterministic Synchronization of Multi-threaded Programs with Operational TransformationChristopher Boelmann, Lorenz Schwittmann, Torben Weis. 381-390 [doi]
- ABC2: Adaptively Balancing Computation and Communication in a DSM Cluster of Multicores for Irregular ApplicationsSai Charan Koduru, Keval Vora, Rajiv Gupta. 391-400 [doi]
- NIDISC Introduction and CommitteesPascal Bouvry, Franciszek Seredynski, El-Ghazali Talbi. 401 [doi]
- Using Physical Stigmergy in Decentralized Optimization under Multiple Non-separable Constraints: Formal Methods and an Intelligent Lighting ExampleTheodore P. Pavlic. 402-411 [doi]
- Hybrid Metaheuristic for Annual Hydropower Generation OptimizationA. Nakib, El-Ghazali Talbi, A. Fuser. 412-419 [doi]
- Machine-Learning-Based Identification of Defect Patterns in Semiconductor Wafer Maps: An Overview and ProposalFatima Adly, Paul D. Yoo, Sami Muhaidat, Yousof Al-Hammadi. 420-429 [doi]
- Data Quality, Consistency, and Interpretation Management for Wind Farms by Using Neural NetworksAlain Fuser, Florent Fontaine, Jack Copper. 430-438 [doi]
- Graph-Based Cellular Automata Approach to Maximum Lifetime Coverage Problem in Wireless Sensor NetworksAntonina Tretyakova, Franciszek Seredynski, Pascal Bouvry. 439-447 [doi]
- GPU Accelerated Nature Inspired Methods for Modelling Large Scale Bi-directional Pedestrian MovementSankha Baran Dutta, Robert D. McLeod, Marcia R. Friesen. 448-456 [doi]
- Improving Bus Ride Comfort Using GLOSA-Based Dynamic Speed OptimisationMarcin Seredynski, Patricia Ruiz, Krzysztof Szczypiorski, Djamel Khadraoui. 457-463 [doi]
- A Genetic Algorithm-Based Sparse Coverage over Urban VANETsHuang Cheng, Xin Fei, Azzedine Boukerche, Mohammed Almulla. 464-469 [doi]
- A Game-Theoretic Approach to Multiobjective Job Scheduling in Cloud Computing SystemsJakub Gasior, Franciszek Seredynski. 470-479 [doi]
- Multi-level and Multi-objective Survey on Cloud SchedulingYacine Kessaci, Nouredine Melab, El-Ghazali Talbi. 480-488 [doi]
- Comparison of Multi-objective Optimization Algorithms for the JShadObf JavaScript ObfuscatorBenoît Bertholon, Sébastien Varrette, Pascal Bouvry. 489-496 [doi]
- HiCOMB Introduction and CommitteesAlba Cristina Magalhaes Alves de Melo, Srinivas Aluru, David A. Bader. 497-498 [doi]
- HiCOMB Keynote and Invited TalksStephen Larson, Ümit V. Çatalyürek, Ananth Kalyanaraman. 499 [doi]
- Constructing Similarity Graphs from Large-Scale Biological Sequence CollectionsJaroslaw Zola. 500-507 [doi]
- Removing Sequential Bottlenecks in Analysis of Next-Generation Sequencing DataYi Wang, Gagan Agrawal, Hatice Gulcin Ozer, Kun Huang. 508-517 [doi]
- Efficient Computation of the Phylogenetic Likelihood Function on the Intel MIC ArchitectureAlexey M. Kozlov, Christian Goll, Alexandros Stamatakis. 518-527 [doi]
- Process Simulation of Complex Biochemical Pathways in Explicit 3D Space Enabled by Heterogeneous Computing PlatformJie Li, Amin Salighehdar, Narayan Ganesan. 528-535 [doi]
- Exploring Large Scale Receptor-Ligand Pairs in Molecular Docking Workflows in HPC CloudsKary A. C. S. Ocaña, Silvia Benza, Daniel de Oliveira, Jonas Dias, Marta Mattoso. 536-545 [doi]
- A Comparison of a Campus Cluster and Open Science Grid Platforms for Protein-Guided Assembly Using Pegasus Workflow Management SystemNatasha Pavlovikj, Kevin Begcy, Sairam Behera, Malachy Campbell, Harkamal Walia, Jitender S. Deogun. 546-555 [doi]
- Design and Optimization of a Metagenomics Analysis Workflow for NVRAMSasha Ames, Jonathan E. Allen, David A. Hysom, G. Scott Lloyd, Maya B. Gokhale. 556-565 [doi]
- Parallelization of the Trinity Pipeline for De Novo Transcriptome AssemblyVipin Sachdeva, C. S. Kim, Kirk E. Jordan, M. D. Winn. 566-575 [doi]
- HiPGA: A High Performance Genome Assembler for Short Read Sequence DataXiaohui Duan, Kun Zhao, Weiguo Liu. 576-584 [doi]
- APDCM Introduction and CommitteesOscar H. Ibarra. 585 [doi]
- Bulk Execution of Oblivious Algorithms on the Unified Memory Machine, with GPU ImplementationKazuya Tani, Daisuke Takafuji, Koji Nakano, Yasuaki Ito. 586-595 [doi]
- A Linear Performance-Breakdown Model for GPU Programming Optimization GuidanceMario A. Chapa M., Sato Hiroyuki. 596-603 [doi]
- A Hybrid Parallel Tridiagonal Solver on Multi-core ArchitecturesGuangping Tang, Kenli Li, Keqin Li, Hang Chen, Jiayi Du. 604-613 [doi]
- A Novel Computational Model for GPUs with Application to I/O Optimal Sorting AlgorithmsAtsushi Koike, Kunihiko Sadakane. 614-623 [doi]
- Predicting Cache Contention for Multithread Applications at Compile TimeMunara Tolubaeva, Yonghong Yan 0001, Barbara M. Chapman. 624-631 [doi]
- Parallelism Extraction Algorithm from Stream-Based Processing Flow Applying Spanning TreeGuyue Wang, Shinichi Yamagiwa, Koichi Wada. 632-641 [doi]
- EEWA: Energy-Efficient Workload-Aware Task Scheduling in Multi-core ArchitecturesQuan Chen, Long Zheng, Minyi Guo, Zhiyi Huang 0001. 642-651 [doi]
- A Platform-Specific Code Smell Alert System for High Performance Computing ApplicationsChunyan Wang, Shoichi Hirasawa, Hiroyuki Takizawa, Hiroaki Kobayashi. 652-661 [doi]
- Optimizing Buffer Sizes for Pipeline Workflow Scheduling with Setup TimesAnne Benoit, Jean-Marc Nicod, Veronika Rehn-Sonigo. 662-670 [doi]
- WECPAR: List Ranking Algorithm and Relative Computational PowerHatem M. El-Boghdadi. 671-678 [doi]
- Assessing the Impact of ABFT and Checkpoint Composite StrategiesGeorge Bosilca, Aurelien Bouteiller, Thomas Hérault, Yves Robert, Jack J. Dongarra. 679-688 [doi]
- Memory-Aware List Scheduling for Hybrid PlatformsJulien Herrmann, Loris Marchal, Yves Robert. 689-698 [doi]
- A Parallel Framework for Handling Non-determinism with Expressive Description LogicsJocelyne Faddoul, Wendy MacCaull. 699-708 [doi]
- Prototyping the MBTAC Processor for the REPLICA CMPMartti Forsell, Jussi Roivainen, Ville Leppänen. 709-716 [doi]
- Evaluation of the Global Address Space Programming Interface (GASPI)Jens Breitbart, Mareike Schmidtobreick, Vincent Heuveline. 717-726 [doi]
- GPS: Towards Simplified Communication on SGL ModelChong Li, Gaétan Hains. 727-736 [doi]
- Near-Optimal Location Tracking Using Sensor NetworksGokarna Sharma, Hari Krishnan, Costas Busch, Steven R. Brandt. 737-746 [doi]
- Self-Stabilizing Algorithm for Maximal 2-Packing with Safe Convergence in an Arbitrary GraphYihua Ding, James Zijun Wang, Pradip K. Srimani. 747-754 [doi]
- Minimum Set Cover of Sparsely Distributed Sensor Nodes by a Collection of Unit DisksSatoshi Fujita. 755-761 [doi]
- An Efficient Implementation of the Gradient-Based Hough Transform Using DSP Slices and Block RAMs on the FPGAXin Zhou, Yasuaki Ito, Koji Nakano. 762-770 [doi]
- HPPAC Introduction and CommitteesDong Li, Robert J. Fowler. 771-772 [doi]
- Characterizing the Impact of Program Optimizations on Power and Energy for Explicit HydrodynamicsEdgar A. León, Ian Karlin. 773-781 [doi]
- Application Power Signature AnalysisChung-Hsing Hsu, Jacob Combs, Jolie Nazor, Fabian Santiago, Rachelle Thysell, Suzanne Rivoire, Stephen W. Poole. 782-789 [doi]
- Metrics for Evaluating Energy Saving Techniques for Resilient HPC SystemsRyan E. Grant, Stephen L. Olivier, James H. Laros III, Ron Brightwell, Allan Porterfield. 790-797 [doi]
- Reducing Static and Dynamic Power of L1 Data Caches in GPGPUsEhsan Atoofian. 798-804 [doi]
- Exploiting DMA for Performance and Energy Optimized STREAM on a DSPGilbert Netzer, Lennart Johnsson, Daniel Ahlin, Eric Stotzer, Pekka Varis, Erwin Laure. 805-814 [doi]
- A Study of Energy and Locality Effects Using Space-Filling CurvesNico Reissman, Jan Christian Meyer, Magnus Jahre. 815-822 [doi]
- Energy-Aware Load Balancing Policies for the Cloud EcosystemAshkan Paya, Dan C. Marinescu. 823-832 [doi]
- Bag-of-Task Scheduling on Power-Aware Clusters Using a DVFS-Based MechanismGeorge Terzopoulos, Helen D. Karatza. 833-840 [doi]
- A Criticality-Aware DVFS Runtime Utility for Optimizing Power Efficiency of Multithreaded ApplicationsHaibo Zhang, Wenting Han, Feng Li, Songtao He, Yichao Cheng, Hong An, Zhitao Chen. 841-848 [doi]
- HPGC Introduction and CommitteesEric E. Aubanel, Virendrakumar C. Bhavsar, Michael A. Frumkin. 849 [doi]
- HPGC KeynotesRajkumar Buyya, Derek Murray. 850-851 [doi]
- Evaluating GPU Passthrough in Xen for High Performance Cloud ComputingAndrew J. Younge, John Paul Walters, Stephen P. Crago, Geoffrey Charles Fox. 852-859 [doi]
- Scalable System Environment Caching and Sharing for Distributed Virtual MachinesTeng Long, Il-Chul Yoon, Alan Sussman, Adam A. Porter, Atif M. Memon. 860-867 [doi]
- Mega Data Center for Elastic Internet ApplicationsHangwei Qian, Michael Rabinovich. 868-874 [doi]
- Cloud-Based Simulation of a Smart Power GridAshkan Paya, Dan C. Marinescu. 875-884 [doi]
- Analyzing Reliability of Virtual Machine Instances with Dynamic Pricing in the Public CloudSeung-Hwan Lim, Gautam S. Thakur, James L. Horey. 885-893 [doi]
- Security of Applications Involving Multiple Organizations and Order Preserving Encryption in Hybrid Cloud EnvironmentsMohammad Ahmadian, Ashkan Paya, Dan C. Marinescu. 894-903 [doi]
- AsHES Introduction and CommitteesYunquan Zhang. 904-906 [doi]
- AsHES KeynoteJeffrey Vetter. 907 [doi]
- Scalable Critical Path Analysis for Hybrid MPI-CUDA ApplicationsFelix Schmitt, Robert Dietrich, Guido Juckeland. 908-915 [doi]
- Dymaxion++: A Directive-Based API to Optimize Data Layout and Memory Mapping for Heterogeneous SystemsShuai Che, Jiayuan Meng, Kevin Skadron. 916-924 [doi]
- Comparison of Parallel Programming Models on Intel MIC Computer ClusterChenggang Lai, Zhijun Hao, Miaoqing Huang, Xuan Shi, Haihang You. 925-932 [doi]
- CoAdELL: Adaptivity and Compression for Improving Sparse Matrix-Vector Multiplication on GPUsMarco Maggioni, Tanya Y. Berger-Wolf. 933-940 [doi]
- Optimizing Krylov Subspace Solvers on Graphics Processing UnitsHartwig Anzt, William Sawyer, Stanimire Tomov, Piotr Luszczek, Ichitaro Yamazaki, Jack Dongarra. 941-949 [doi]
- XSW: Accelerating Biological Database Search on Xeon PhiLipeng Wang, Yuandong Chan, Xiaohui Duan, Haidong Lan, Xiangxu Meng, Weiguo Liu. 950-957 [doi]
- Dynamically Balanced Synchronization-Avoiding LU Factorization with Multicore and GPUsSimplice Donfack, Stanimire Tomov, Jack Dongarra. 958-965 [doi]
- Scalable Fast Multipole Accelerated Vortex MethodsQi Hu, Nail A. Gumerov, Rio Yokota, Lorena A. Barba, Ramani Duraiswami. 966-975 [doi]
- Infiniband-Verbs on GPU: A Case Study of Controlling an Infiniband Network Device from the GPULena Oden, Holger Fröning, Franz-Joseph Pfreundt. 976-983 [doi]
- Programming the Adapteva Epiphany 64-Core Network-on-Chip CoprocessorAnish Varghese, Bob Edwards, Gaurav Mitra, Alistair P. Rendell. 984-992 [doi]
- High-Performance Zonal Histogramming on Large-Scale Geospatial Rasters Using GPUs and GPU-Accelerated ClustersJianting Zhang, Dali Wang. 993-1000 [doi]
- PLC Introduction and CommitteesBarbara M. Chapman. 1001 [doi]
- Transparent GPU Execution of NumPy ApplicationsTroels Blum, Mads Ruben Burgdorff Kristensen, Brian Vinter. 1002-1010 [doi]
- KernelGen - The Design and Implementation of a Next Generation Compiler Platform for Accelerating Numerical Models on GPUsDmitry Mikushin, Nikolay Likhogrud, Eddy Z. Zhang, Christopher Bergstrom. 1011-1020 [doi]
- Using GPU Shared Memory with a Directive-Based ApproachWei Ding, Ligang Lu, Mauricio Araya-Polo, Amik St-Cyr, Detlef Hohl, Barbara M. Chapman. 1021-1028 [doi]
- CFD Builder: A Library Builder for Computational Fluid DynamicsJagan Jayaraj, Pei-Hung Lin, Paul R. Woodward, Pen-Chung Yew. 1029-1038 [doi]
- A Stream Processing Framework for On-Line Optimization of Performance and Energy Efficiency on Heterogeneous SystemsBenjamin Ranft, Oliver Denninger, Philip Pfaffe. 1039-1048 [doi]
- OpenMP Task Scheduling Analysis via OpenMP Runtime API and Tool VisualizationAhmad Qawasmeh, Abid Muslim Malik, Barbara M. Chapman. 1049-1058 [doi]
- A Case Study in Coordination Programming: Performance Evaluation of S-Net vs Intel's Concurrent CollectionsPavel Zaichenkov, Bert Gijsbers, Clemens Grelck, Olga Tveretina, Alex Shafarenko. 1059-1067 [doi]
- EduPar Introduction and CommitteesSushil K. Prasad. 1068-1069 [doi]
- EduPar KeynoteRandy H. Katz. 1070 [doi]
- Limited Time and Experience: Parallelism in CS1Steven Bogaerts. 1071-1078 [doi]
- NSF/IEEE-TCPP Curriculum Implementation at the State University of Nizhni NovgorodVictor P. Gergel, Alexey Liniov, Iosif Meyerov, Alexander Sysoyev. 1079-1084 [doi]
- Parallel and Distributed Computing across the Computer Science CurriculumDavid J. John, Stan J. Thomas. 1085-1090 [doi]
- Service-Oriented Computing and Software Integration in Computing CurriculumYinong Chen, Zhizheng Zhou. 1091-1098 [doi]
- EA: Research-Infused Teaching of Parallel Programming Concepts for Undergraduate Software Engineering StudentsNasser Giacaman, Oliver Sinnen. 1099-1105 [doi]
- Using Patterns to Teach Parallel ComputingClayton Ferner, Barry Wilkinson, Barbara Heath. 1106-1113 [doi]
- Teaching HDFS/MapReduce Systems Concepts to UndergraduatesLinh Bao Ngo, Edward B. Duffy, Amy W. Apon. 1114-1121 [doi]
- Interactively Exploring the Connection between Nested Dissection Orderings for Parallel Cholesky Factorization and Vertex SeparatorsH. Martin Bücker, M. Ali Rostami. 1122-1129 [doi]
- A Portable Cluster for Each StudentDavid Toth. 1130-1134 [doi]
- GABB IntroductionTim Mattson, David A. Bader, Aydin Buluç, John R. Gilbert, Joseph Gonzalez, Jeremy Kepner. 1135-1137 [doi]
- PDSEC Introduction and CommitteesPeter E. Strazdins, Raphaël Couturier, Michelle Mills Strout, Keita Teranishi, Thomas Rauber, Gudula Rünger, Laurence T. Yang. 1138-1139 [doi]
- llamaOS: A Solution for Virtualized High-Performance Computing ClustersWilliam A. Magato, Philip A. Wilsey. 1140-1149 [doi]
- New Algorithm for Computing Eigenvectors of the Symmetric Eigenvalue ProblemAzzam Haidar, Piotr Luszczek, Jack Dongarra. 1150-1159 [doi]
- Exhaustive Key Search on Clusters of GPUsDavide Barbieri, Valeria Cardellini, Salvatore Filippone. 1160-1168 [doi]
- Application Level Fault Recovery: Using Fault-Tolerant Open MPI in a PDE SolverMd. Mohsin Ali, James Southern, Peter E. Strazdins, Brendan Harding. 1169-1178 [doi]
- Nanoscale Cluster Detection in Massive Atom Probe Tomography DataSudip K. Seal, Srikanth B. Yoginath, Michael K. Miller. 1179-1188 [doi]
- Construction of Porous Networks Subjected to Geometric Restrictions by Using OpenMPAngel Gonzalez Mendez, Graciela Román-Alonso, Fernando Rojas-González, Miguel Alfonso Castro-García, Miguel Aguilar Cornejo, Salomon Cordero-Sánchez. 1189-1197 [doi]
- Integration and Evaluation of Decentralized Fairshare Prioritization (Aequus)Daniel Espling, Per-Olov Östberg, Erik Elmroth. 1198-1207 [doi]
- Coordination Languages and MPI Perturbation Theory: The FOX Tuple Space Framework for ResilienceJeremiah J. Wilke. 1208-1217 [doi]
- DisSLib: CC: A Library for Distributed Search with a Central Common Search StateTyson Kendon, Jörg Denzinger. 1218-1227 [doi]
- Improving I/O Performance with Adaptive Data Compression for Big Data ApplicationsHongbo Zou, Yongen Yu, Wei Tang, Hsuanwei Michelle Chen. 1228-1237 [doi]
- Analysis of MPI Shared-Memory Communication Performance from a Cache Coherence PerspectiveBertrand Putigny, Benoit Ruelle, Brice Goglin. 1238-1247 [doi]
- Acceleration of GPU-Based Ultrasound Simulation via Data CompressionAndrew A. Haigh, Eric C. McCreath. 1248-1255 [doi]
- Kd-Tree Based N-Body Simulations with Volume-Mass Heuristic on the GPUKlaus Kofler, Dominik Steinhauser, Biagio Cosenza, Ivan Grasso, Sabine Schindler, Thomas Fahringer. 1256-1265 [doi]
- Nuclear Fusion Simulation Code Optimization and Performance Evaluation on GPU ClusterNorihisa Fujita, Hideo Nuga, Taisuke Boku, Yasuhiro Idomura. 1266-1274 [doi]
- Acceleration of a Python-Based Tsunami Modelling Application via CUDA and OpenHMPPZhe Weng, Peter E. Strazdins. 1275-1284 [doi]
- GPU Enhanced Path Finding for an Unmanned Aerial VehicleRoksana Hossain, Sebastian Magierowski, Geoffrey G. Messier. 1285-1293 [doi]
- DPDNS Introduction and CommitteesDimiter Avresky, Erik Maehle, Salvatore Distefano. 1294-1295 [doi]
- DPDNS KeynoteEdgar Nett. 1296 [doi]
- Maintaining Dependable Communication Service for Mobile Stations in Wireless Mesh Networks by Tracking Capacity DemandsTimo Lindhorst, Burkhard Weseloh, Edgar Nett. 1297-1305 [doi]
- A Load Balancing Behavior for Underwater Robot Swarms to Increase Mission Time and Fault ToleranceAmmar Amory, Thomas Tosik, Erik Maehle. 1306-1313 [doi]
- ExCovery - A Framework for Distributed System Experiments and a Case Study of Service DiscoveryAndreas Dittrich, Stefan Wanja, Miroslaw Malek. 1314-1323 [doi]
- Managing Soft-Errors in Transactional SystemsMohamed Mohamedin, Roberto Palmieri, Binoy Ravindran. 1324-1329 [doi]
- Standby System Reliability through DRBDSalvatore Distefano. 1330-1337 [doi]
- Trust-Based Security for the Spanning Tree ProtocolYingxu Lai, Qiuyue Pan, Zenghui Liu, Yinong Chen, Zhizheng Zhou. 1338-1343 [doi]
- Autonomy Requirements Engineering for Self-Adaptive Science CloudsEmil Vassev, Mike Hinchey. 1344-1353 [doi]
- MTAAP Introduction and CommitteesLuiz DeRose. 1354 [doi]
- A New Parallel Algorithm for Two-Pass Connected Component LabelingSiddharth Gupta, Diana Palsetia, Md. Mostofa Ali Patwary, Ankit Agrawal, Alok N. Choudhary. 1355-1362 [doi]
- Position Paper: Locality-Driven Scheduling of Tasks for Data-Dependent MultithreadingJaime Arteaga, Stéphane Zuckerman, Elkin Garcia, Guang R. Gao. 1363-1367 [doi]
- Position Paper: Leveraging Strength-Based Dynamic Slicing to Identify Control Reconvergence InstructionsWalid J. Ghandour, Nadine J. Ghandour. 1368-1373 [doi]
- Parallel Heuristics for Scalable Community DetectionHao Lu, Mahantesh Halappanavar, Ananth Kalyanaraman, Sutanay Choudhury. 1374-1385 [doi]
- Hardware/Software Vectorization for Closeness Centrality on Multi-/Many-Core ArchitecturesAhmet Erdem Sariyüce, Erik Saule, Kamer Kaya, Ümit V. Çatalyürek. 1386-1395 [doi]
- Revisiting Edge and Node Parallelism for Dynamic GPU Graph AnalyticsAdam McLaughlin, David A. Bader. 1396-1406 [doi]
- A Validation Testsuite for OpenACC 1.0Cheng Wang, Rengan Xu, Sunita Chandrasekaran, Barbara M. Chapman, Oscar Hernandez. 1407-1416 [doi]
- Extracting Maximal Exact Matches on GPUAnas Abu-Doleh, Kamer Kaya, Mohamed Abouelhoda, Ümit V. Çatalyürek. 1417-1426 [doi]
- Predicting an Optimal Sparse Matrix Format for SpMV Computation on GPUB. Neelima, G. Ram Mohana Reddy, Prakash S. Raghavendra. 1427-1436 [doi]
- LSPP Introduction and CommitteesDarren J. Kerbyson, Ram Rajamony, Charles C. Weems. 1437 [doi]
- Higher Dimensional Gaussian NetworksArash Shamaei, Bella Bose, Mary Flahive. 1438-1447 [doi]
- The Power-Performance Tradeoffs of the Intel Xeon Phi on HPC ApplicationsBo Li, Hung-Ching Chang, Shuaiwen Song, Chun-Yi Su, Timmy Meyer, John Mooring, Kirk W. Cameron. 1448-1456 [doi]
- Performance Modeling for Hardware Thread-Level SpeculationYing-Chieh Wang, Che-Rung Lee, Yeh-Ching Chung, I-Hsin Chung, Michael Perrone. 1457-1464 [doi]
- HMC-Sim: A Simulation Framework for Hybrid Memory Cube DevicesJohn D. Leidel, Yong Chen. 1465-1474 [doi]
- Online Monitoring System for Performance Fault DetectionRoberto Gioiosa, Gokcen Kestor, Darren J. Kerbyson. 1475-1484 [doi]
- Towards Extreme-Scale Simulations with Next-Generation Trilinos: A Low Mach Fluid Application Case StudyPaul Lin, Matthew T. Bettencourt, Stefan Domino, Travis Fisher, Mark Hoemmen, Jonathan J. Hu, Eric T. Phipps, Andrey Prokopenko, Sivasankaran Rajamanickam, Christopher M. Siefert, Eric Cyr, Stephen Kennon. 1485-1494 [doi]
- Design and Implementation of a Large Scale Tree-Based QR Decomposition Using a 3D Virtual Systolic Array and a Lightweight RuntimeIchitaro Yamazaki, Jakub Kurzak, Piotr Luszczek, Jack Dongarra. 1495-1504 [doi]
- SupMR: Circumventing Disk and Memory Bandwidth Bottlenecks for Scale-up MapReduceMichael Sevilla, Ike Nassi, Kleoni Ioannidou, Scott A. Brandt, Carlos Maltzahn. 1505-1514 [doi]
- PCO Introduction and CommitteesDidier El Baz. 1515 [doi]
- Towards Energy Efficient Allocation for Applications in Volunteer CloudCongfeng Jiang, Jian Wan, Christophe Cérin, Paolo Gianessi, Yanik Ngoko. 1516-1525 [doi]
- Fast Generation of Large Task Network MappingsKarl-Eduard Berger, François Galea, Bertrand Le Cun, Renaud Sirdey. 1526-1530 [doi]
- Adaptive N to P Portfolio for Solving Constraint Programming Problems on Top of the Parallel Bobpp FrameworkTarek Menouer, Bertrand Le Cun. 1531-1540 [doi]
- Dependent Walks in Parallel Local SearchYves Caniou, Philippe Codognet. 1541-1546 [doi]
- A Parallel Large Neighborhood Search-Based Heuristic for the Disjunctively Constrained Knapsack ProblemMhand Hifi, Stéphane Nègre, Toufik Saadi, Sagvan Saleh, Lei Wu. 1547-1551 [doi]
- Solving Hard MIPLIB2003 Problems with ParaSCIP on Supercomputers: An UpdateYuji Shinano, Tobias Achterberg, Timo Berthold, Stefan Heinz, Thorsten Koch, Michael Winkler. 1552-1561 [doi]
- A Task Scheduling Algorithm Based on Replication for Maximizing Reliability on Heterogeneous Computing SystemsShuli Wang, Kenli Li, Jing Mei, Keqin Li, Yan Wang. 1562-1571 [doi]
- SkewControl: Gini Out of the BottleSi Zheng, Yunhuai Liu, Tian He, Shanshan Li, Xiangke Liao. 1572-1580 [doi]
- The Heuristic Static Load-Balancing Algorithm Applied to the Community Earth System ModelYuri Alexeev, Sheri A. Mickelson, Sven Leyffer, Robert L. Jacob, Anthony P. Craig. 1581-1590 [doi]
- A Distributed Algorithm for a Reconfigurable Modular SurfaceDidier El Baz, Benoît Piranda, Julien Bourgeois. 1591-1598 [doi]
- ParLearning Introduction and CommitteesAbhinav Vishnu, Yinglong Xia. 1599-1600 [doi]
- ParLearning KeynoteEric P. Xing. 1601 [doi]
- Wait-Free Primitives for Initializing Bayesian Network Structure Learning on Multicore ProcessorsHsuan-Yi Chu, Yinglong Xia, Anand V. Panangadan, Viktor K. Prasanna. 1602-1611 [doi]
- gpuRF and gpuERT: Efficient and Scalable GPU Algorithms for Decision Tree EnsemblesKarl Jansson, Håkan Sundell, Henrik Boström. 1612-1621 [doi]
- Training Large Scale Deep Neural Networks on the Intel Xeon Phi Many-Core CoprocessorLei Jin, Zhaokang Wang, Rong Gu, Chunfeng Yuan, Yihua Huang. 1622-1630 [doi]
- Parallel Bayesian Network Modelling for Pervasive Health Monitoring SystemXiujuan Qian, Yongli Wang, Xiaohui Jiang. 1631-1637 [doi]
- Portfolio-Based Selection of Robust Dynamic Loop Scheduling Algorithms Using Machine LearningNitin Sukhija, Brandon Malone, Srishti Srivastava, Ioana Banicescu, Florina M. Ciorba. 1638-1647 [doi]
- A General P2P Scheme for Constructing Large-Scale Virtual EnvironmentsWei Wang, Guisong Yang, Naixue Xiong, Xingyu He, Wenzhong Guo. 1648-1655 [doi]
- Large Scale Discriminative Metric LearningPeter D. Kirchner, Matthias Boehm, Berthold Reinwald, Daby M. Sow, Michael Schmidt, Deepak S. Turaga, Alain Biem. 1656-1663 [doi]
- YAFIM: A Parallel Frequent Itemset Mining Algorithm with SparkHongjian Qiu, Rong Gu, Chunfeng Yuan, Yihua Huang. 1664-1671 [doi]
- The Empirical Research of Virtual Enterprise Knowledge Transfer's Effectiveness Faced to the Independent Innovation AbilityYang Bo, Naixue Xiong, Wenzhong Guo. 1672-1679 [doi]
- A Distributed Speech Algorithm for Large Scale Data Communication SystemsNaixue Xiong, Guoxiang Tong, Wenzhong Guo, Jian Tan, Guanning Wu. 1680-1687 [doi]
- HPDIC Introduction and CommitteesChristophe Cérin, Congfeng Jiang. 1688 [doi]
- Compactor: Optimization Framework at Staging I/O NodesVishwanath Venkatesan, Mohamad Chaarawi, Quincey Koziol, Edgar Gabriel. 1689-1697 [doi]
- Hybrid BFS Approach Using Semi-external MemoryKeita Iwabuchi, Hitoshi Sato, Ryo Mizote, Yuichiro Yasui, Katsuki Fujisawa, Satoshi Matsuoka. 1698-1707 [doi]
- Model-Driven Data Layout Selection for Improving Read PerformanceJialin Liu, Surendra Byna, Bin Dong, Kesheng Wu, Yong Chen. 1708-1716 [doi]
- Scalable and Reliable Data Broadcast with KascadeStephane Martin, Tomasz Buchert, Pierric Willemet, Olivier Richard, Emmanuel Jeanvoine, Lucas Nussbaum. 1717-1726 [doi]
- SOM Clustering Using Spark-MapReduceTugdual Sarazin, Hanane Azzag, Mustapha Lebbah. 1727-1734 [doi]
- Optimizing the Join Operation on Hive to Accelerate Cross-Matching in AstronomyLiang Li, Dixin Tang, Taoying Liu, Hong Liu, Wei Li, Chenzhou Cui. 1735-1745 [doi]
- JSSPP Introduction and CommitteesWalfredo Cirne, Narayan Desai. 1746 [doi]
- CHIUW Introduction and CommitteesBrad Chamberlain. 1747-1749 [doi]