- HCW Introduction. Denis Trystram, Erik Saule. 1-2
- Message from the HCW Steering Committee Chair. Behrooz Shirazi. 3
- Message from the HCW General Chair. Denis Trystram. 4
- Message from the HCW Program Committee Chair. Erik Saule. 5
- HCW 2016 Keynote Talk. Mahmut T. Kandemir. 6
- Towards a Green, QoS-Enabled Heterogeneous Cloud Infrastructure. Julio Proaño, Carmen Carrión, María Blanca Caminero. 7-16
- Predicting Job Completion Time in Heterogeneous MapReduce Environments. Rekha Singhal, Abhishek Verma. 17-27
- Minimizing Rental Cost for Multiple Recipe Applications in the Cloud. Fouad Hanna, Loris Marchal, Jean-Marc Nicod, Laurent Philippe, Veronika Rehn-Sonigo, Hala Sabbah. 28-37
- Providing Fairness in Heterogeneous Multicores with a Predictive, Adaptive Scheduler. Saeid Barati, Hank Hoffmann. 38-49
- clCaffe: OpenCL Accelerated Caffe for Convolutional Neural Networks. Jeremy Bottleson, SungYe Kim, Jeff Andrews, Preeti Bindu, Deepak N. Murthy, Jingyi Jin. 50-57
- Parallel Graph Partitioning on a CPU-GPU Architecture. Bahareh Goodarzi, Martin Burtscher, Dhrubajyoti Goswami. 58-66
- Dynamic Resource Management for Parallel Tasks in an Oversubscribed Energy-Constrained Heterogeneous Environment. Dylan Machovec, Bhavesh Khemka, Sudeep Pasricha, Anthony A. Maciejewski, Howard Jay Siegel, Gregory A. Koenig, Michael Wright, Marcia Hilton, Rajendra Rambharos, Neena Imam. 67-78
- Analyzing the Energy Efficiency of the Fast Multipole Method Using a DVFS-Aware Energy Model. JeeWhan Choi, Richard W. Vuduc. 79-88
- Evaluation of Emerging Energy-Efficient Heterogeneous Computing Platforms for Biomolecular and Cellular Simulation Workloads. John E. Stone, Michael J. Hallock, James C. Phillips, Joseph R. Peterson, Zaida Luthey-Schulten, Klaus Schulten. 89-100
- RAW Introduction and Committees. Marco D. Santambrogio, Ramachandran Vaidyanathan, Diana Goehringer, Steven J. E. Wilton. 101-102
- RAW 2016 Keynotes. Peter Hofstee, Patrick Lysaght, Dirk van den Heuvel. 103-104
- Clustering and Mapping Algorithm for Application Distribution on a Scalable FPGA Cluster. Lester Kalms, Diana Göhringer. 105-113
- A Fast and Accurate Cost Model for FPGA Design Space Exploration in HPC Applications. Syed Waqar Nabi, Wim Vanderbauwhede. 114-123
- Latency, Power, and Security Optimization in Distributed Reconfigurable Embedded Systems. Hyunsuk Nam, Roman Lysecky. 124-131
- A Reconfigurable Fixed-Point Architecture for Adaptive Beamforming. Daniel Llamocca, Daniel N. Aloi. 132-138
- Parameterizable FPGA-Based Kalman Filter Coprocessor Using Piecewise Affine Modeling. Aaron Mills, Phillip H. Jones, Joseph Zambreno. 139-147
- High Throughput Large Scale Sorting on a CPU-FPGA Heterogeneous Platform. Chi Zhang, Ren Chen, Viktor K. Prasanna. 148-155
- An FPGA Architecture to Accelerate the Burrows Wheeler Transform by Using a Linear Sorter. Juan Andrés Pérez-Celis, José Martínez-Carranza, Alicia Morales-Reyes, Claudia Feregrino Uribe, René Cumplido. 156-161
- A 16-Bit Reconfigurable Encryption Processor for p-Cipher. Mohamed El-Hadedy, Hristina Mihajloska, Danilo Gligoroski, Amit Kulkarni, Dirk Stroobandt, Kevin Skadron. 162-171
- Dynamic Self-Reconfiguration of a MIPS-Based Soft-Processor Architecture. Stephan Nolting, Guillermo Payá Vayá, Florian Giesemann, Holger Blume, Sebastian Niemann, Christian Müller-Schloer. 172-180
- An Application-Specific Instruction Set Processor for Power Quality Monitoring. Steffen Vaas, Marc Reichenbach, Dietmar Fey. 181-188
- Resource-Efficient Scheduling for Partially-Reconfigurable FPGA-Based Systems. Andrea Purgato, Davide Tantillo, Marco Rabozzi, Donatella Sciuto, Marco D. Santambrogio. 189-197
- Scheduler for Inhomogeneous and Irregular CGRAs with Support for Complex Control Flow. Tajas Ruschke, Lukas Johannes Jung, Dennis Wolf, Christian Hochberger. 198-207
- LinROS: A Linux-Based Runtime System for Reconfigurable MPSoCs. Jens Rettkowski, Philipp Wehner, Evgheni Cutiscev, Diana Göhringer. 208-216
- On the Automation of High Level Synthesis of Convolutional Neural Networks. Emanuele Del Sozzo, Andrea Solazzo, Antonio Miele, Marco D. Santambrogio. 217-224
- Scala-Based Domain-Specific Language for Creating Accelerator-Based SoCs. Gianluca C. Durelli, Fabrizio Spada, Christian Pilato, Marco D. Santambrogio. 225-232
- OOGen: An Automated Generation Tool for Custom MPSoC Architectures Based on Object-Oriented Programming Methods. Hongyuan Ding, Sen Ma, Miaoqing Huang, David L. Andrews. 233-240
- A Hardware/Software Co-Design Approach for Control Applications with Static Real-Time Reallocation. Benedikt Janßen, Moataz Naserddin, Michael Hübner. 241-246
- On How to Improve FPGA-Based Systems Design Productivity via SDAccel. Giulia Guidi, Enrico Reggiani, Lorenzo Di Tucci, Gianluca Durelli, Michaela Blott, Marco D. Santambrogio. 247-252
- A Rapid Prototyping Method to Reduce the Design Time in Commercial High-Level Synthesis Tools. Jones Yudi Mori, André Werner, Florian Fricke, Michael Hübner. 253-258
- ARTNoCs: An Evaluation Framework for Hardware Architectures of Real-Time NoCs. Salma Hesham, Diana Göhringer, Mohamed A. Abd El ghany. 259-264
- A Fully Parameterized Virtual Coarse Grained Reconfigurable Array for High Performance Computing Applications. Amit Kulkarni, Elias Vansteenkiste, Dirk Stroobandt, Andreas Brokalakis, Antonis Nikitakis. 265-270
- Assessing Multi-task Placement Algorithms in RCUs. Anita Tino, Kaamran Raahemifar. 271-276
- Efficient Hardware Debugging Using Parameterized FPGA Reconfiguration. Alexandra Kourfali, Dirk Stroobandt. 277-282
- Enabling Dynamic Reconfiguration of Numerical Methods for the Robotic Motion Control Task. Fynn Schwiegelshohn, Florian Kastner, Michael Hübner. 283-288
- Hardware Architectures for Frequent Itemset Mining Based on Equivalence Classes Partitioning. Martin Letras, Raudel Hernández, René Cumplido. 289-294
- Parallel Protein Identification Using an FPGA-Based Solution. Fabiola Casasopra, Gea Bianchi, Gianluca C. Durelli, Marco D. Santambrogio. 295-299
- Face Recognition Using Local Binary Patterns Histograms (LBPH) on an FPGA-Based System on Chip (SoC). Nikolaos Stekas, Dirk van den Heuvel. 300-304
- HIPS Introduction and Committees. David Böhme, Xu Liu. 305-306
- HIPS 2016 Keynote. Tim Mattson. 307
- Detecting Anomalies in Concurrent Programs Based on Dynamic Control Flow Changes. Faheem Ullah, Thomas R. Gross. 308-317
- Controlling the Memory Subscription of Distributed Applications with a Task-Based Runtime System. Marc Sergent, David Goudin, Samuel Thibault, Olivier Aumage. 318-327
- Reducing Redundant Search in Parallel Graph Mining Using Exceptions. Shingo Okuno, Tasuku Hiraishi, Hiroshi Nakashima, Masahiro Yasugi, Jun Sese. 328-337
- Evaluating OpenMP 4.0's Effectiveness as a Heterogeneous Parallel Programming Model. Matt Martineau, Simon McIntosh-Smith, Wayne P. Gaudin. 338-347
- Employing Compression Solutions under OpenACC. Ebad Salehi, Ahmad Lashgar, Amirali Baniasadi. 348-356
- CAFe: Coarray Fortran Extensions for Heterogeneous Computing. Craig Edward Rasmussen, Matthew J. Sottile, Søren Rasmussen, Daniel Nagle, William Dumas. 357-365
- Embedding Concurrent Generators. Peter Mills, Clinton Jeffery. 366-375
- The Case for Binary Rewriting at Runtime for Efficient Implementation of High-Level Programming Models in HPC. Josef Weidendorfer, Jens Breitbart. 376-385
- PTRAM: A Parallel Topology- and Routing-Aware Mapping Framework for Large-Scale HPC Systems. Seyed Hessam Mirsadeghi, Ahmad Afsahi. 386-396
- A Comparison of High-Level Programming Choices for Incomplete Sparse Factorization Across Different Architectures. Joshua Dennis Booth, Kyungjoo Kim, Sivasankaran Rajamanickam. 397-406
- HiCOMB Introduction and Committees. Srinivas Aluru, David A. Bader, Ananth Kalyanaraman, Jaroslaw Zola. 407
- The Divisible Load Balance Problem with Shared Cost and Its Application to Phylogenetic Inference. Constantin Scholl, Kassian Kobert, Tomás Flouri, Alexandros Stamatakis. 408-417
- Efficient Computation of Linkage Disequilibria as Dense Linear Algebra Operations. Nikolaos Alachiotis, Doru-Thom Popovici, Tze Meng Low. 418-427
- Improving Reaction Kernel Performance in Lattice Microbes: Particle-Wise Propensities and Run-Time Generated Code. Michael J. Hallock, Zaida Luthey-Schulten. 428-434
- SparkScore: Leveraging Apache Spark for Distributed Genomic Inference. Amir Bahmani, Alexander B. Sibley, Mahmoud Parsian, Kouros Owzar, Frank Mueller. 435-442
- A Scalable Pipeline for Transcriptome Profiling Tasks with On-Demand Computing Clouds. Shayan Shams, Nayong Kim, Xiandong Meng, Ming Tai Ha, Shantenu Jha, Zhong Wang, Joohyun Kim. 443-452
- A Memory and Time Scalable Parallelization of the Reptile Error-Correction Code. Vipin Sachdeva, Srinivas Aluru, David A. Bader. 453-462
- Real-Time Agent-Based Modeling Simulation with in-Situ Visualization of Complex Biological Systems: A Case Study on Vocal Fold Inflammation and Healing. Nuttiiya Seekhao, Caroline Shung, Joseph JáJá, Luc Mongeau, Nicole Y. K. Li-Jessen. 463-472
- A Novel Associative Memory Based Architecture for Sequence Alignment. M. Ali Mirzaei, Francesco Crescioli, Sebastien Viret, William Tromeur, Giovanni Calderini, Giovanni Marchiori, Guillaume Baulieu, Geoffrey Galbit. 473-478
- APDCM Introduction and Committees. Oscar H. Ibarra, Koji Nakano, Akihiro Fujiwara, Susumu Matsumae. 479
- Stable Matching Beyond Bipartite Graphs. Jie Wu. 480-488
- Fine-Grained Task Migration for Graph Algorithms Using Processing in Memory. Paula Aguilera, Dong Ping Zhang, Nam Sung Kim, Nuwan Jayasena. 489-498
- Cross-Layered Security Approach with Compromised Nodes Detection in Cooperative Sensor Networks. Wei Chen, Liang Hong, Sachin Shetty, Dan Chia-Tien Lo, Reginald Cooper. 499-508
- Model Checking Techniques for State Space Reduction in MANET Protocol Verification. Hideharu Kojima, Yuta Nagashima, Tatsuhiro Tsuchiya. 509-516
- New Biology Inspired Anonymous Distributed Algorithms to Compute Dominating and Total Dominating Sets in Network Graphs. Feng Luo, Pradip K. Srimani. 517-524
- Performance of Causal Consistency Algorithms for Partially Replicated Systems. Ta Yuan Hsu, Ajay D. Kshemkalyani. 525-534
- Performance Analysis of an I/O-Intensive Workflow Executing on Google Cloud and Amazon Web Services. Hassan Nawaz, Gideon Juve, Rafael Ferreira da Silva, Ewa Deelman. 535-544
- Performance Models for Split-Execution Computing Systems. Travis S. Humble, Alexander J. McCaskey, Jonathan Schrock, Hadayat Seddiqi, Keith A. Britt, Neena Imam. 545-554
- A Model for Entropy of Parallel Execution. Ernesto Gomez, Keith E. Schubert, Ritchie Cai. 555-560
- FFT on XMT: Case Study of a Bandwidth-Intensive Regular Algorithm on a Highly-Parallel Many Core. James Alexander Edwards, Uzi Vishkin. 561-569
- Parallelization of Recursive Preorder Traversal Based on Building and Winding Call Stacks. Makoto Nakayama, Kenichi Yamazaki, Satoshi Tanaka. 570-579
- A GPU Based Maximum Common Subgraph Algorithm for Drug Discovery Applications. P. B. Jayaraj, K. Rahamathulla, G. Gopakumar. 580-588
- Bitwise Parallel Bulk Computation on the GPU, with Application to the CKY Parsing for Context-Free Grammars. Toru Fujita, Koji Nakano, Yasuaki Ito. 589-598
- An Efficient Implementation of LZW Decompression in the FPGA. Xin Zhou, Yasuaki Ito, Koji Nakano. 599-607
- AsHES Introduction and Committees. James Dinan. 608-609
- AsHES 2016 Keynote. Wen-mei Hwu. 610
- Heterogeneous Streaming. Chris J. Newburn, Gaurav Bansal, Michael Wood, Luis Crivelli, Judit Planas, Alejandro Duran, Paulo Souza, Leonardo Borges, Piotr Luszczek, Stanimire Tomov, Jack Dongarra, Hartwig Anzt, Mark Gates, Azzam Haidar, Yulu Jia, Khairul Kabir, Ichitaro Yamazaki, Jesús Labarta. 611-620
- HMC-Sim-2.0: A Simulation Platform for Exploring Custom Memory Cube Operations. John D. Leidel, Yong Chen. 621-630
- Alpaka - An Abstraction Library for Parallel Kernel Acceleration. Erik Zenker, Benjamin Worpitz, René Widera, Axel Huebl, Guido Juckeland, Andreas Knüpfer, Wolfgang E. Nagel, Michael Bussmann. 631-640
- A Tool for Bottleneck Analysis and Performance Prediction for GPU-Accelerated Applications. Souley Madougou, Ana Lucia Varbanescu, Cees de Laat, Rob van Nieuwpoort. 641-652
- Hessenberg Reduction with Transient Error Resilience on GPU-Based Hybrid Architectures. Yulu Jia, Piotr Luszczek, Jack Dongarra. 653-662
- Optimization of Block Sparse Matrix-Vector Multiplication on Shared-Memory Parallel Architectures. Ryan Eberhardt, Mark Hoemmen. 663-672
- Basker: A Threaded Sparse LU Factorization Utilizing Hierarchical Parallelism and Data Layouts. Joshua Dennis Booth, Sivasankaran Rajamanickam, Heidi Thornquist. 673-682
- Efficiency of General Krylov Methods on GPUs - An Experimental Study. Hartwig Anzt, Jack Dongarra, Moritz Kreutzer, Gerhard Wellein, Martin Koehler. 683-691
- Refactoring Conventional Task Schedulers to Exploit Asymmetric ARM big.LITTLE Architectures in Dense Linear Algebra. Luis Costero, Francisco D. Igual, Katzalin Olcoz, Sandra Catalán, Rafael Rodríguez-Sánchez, Enrique S. Quintana-Ortí. 692-701
- Heterogeneous CAF-Based Load Balancing on Intel Xeon Phi. Valeria Cardellini, Alessandro Fanfarillo, Salvatore Filippone. 702-711
- Topology-Aware GPU Selection on Multi-GPU Nodes. Iman Faraji, Seyed Hessam Mirsadeghi, Ahmad Afsahi. 712-720
- PCO Introduction and Committees. Didier El Baz, Bora Uçar. 721
- Scenario Decomposition for 0-1 Stochastic Programs: Improvements and Asynchronous Implementation. Kevin Ryan, Deepak Rajan, Shabbir Ahmed. 722-729
- PIPS-SBB: A Parallel Distributed-Memory Branch-and-Bound Algorithm for Stochastic Mixed-Integer Programs. Lluís-Miquel Munguía, Geoffrey Oxberry, Deepak Rajan. 730-739
- Counting Triangles in Large Graphs on GPU. Adam Polak. 740-746
- GPU-Based Two Level Parallel B&B for the Blocking Job Shop Scheduling Problem. Adel Dabah, Ahcène Bendjoudi, Didier El Baz, Abdelhakim AitZai. 747-755
- Parallel Ant Colony Optimization for Flow Shop Scheduling Subject to Limited Machine Availability. Yumei Huo, Jun Xiong Huang. 756-765
- GPGPU-Based Parallel Algorithms for Scheduling Against Due Date. Abhishek Awasthi, Jörg Lässig, Jens Leuschner, Thomas Weise. 766-775
- Performance Analysis of Bio-Inspired Scheduling Algorithms for Cloud Environments. Ali Al Buhussain, Robson Eduardo De Grande, Azzedine Boukerche. 776-785
- Optimizing Metaheuristics and Hyperheuristics through Multi-level Parallelism on a Many-Core System. José-Matías Cutillas-Lozano, Domingo Giménez, Luis-Pedro García. 786-795
- A Parallel Ant Colony Optimization for the Maximum-Weight Clique Problem. Didier El Baz, Mhand Hifi, Lei Wu, Xiaochuan Shi. 796-800
- Evaluating the Performance of A4SDN on Various Network Topologies. Giovanni Cammarata, Antonella Di Stefano, Giovanni Morana, Daniele Zito. 801-808
- Hybrid Heuristics for Mapping Task Problem on Large Scale Heterogeneous Platforms. Ania Kaci, Huy Nam Nguyen, Amir Nakib, Patrick Siarry. 809-816
- A Semi-Greedy Heuristic for the Mapping of Large Task Graphs. Karl-Eduard Berger, François Galea, Bertrand Le Cun, Renaud Sirdey. 817-824
- A High Performance Implementation of Spectral Clustering on CPU-GPU Platforms. Yu Jin, Joseph F. JáJá. 825-834
- Testing Fine-Grained Parallelism for the ADMM on a Factor-Graph. Ning Hao, Amirreza Oghbaee, Mohammad Rostami, Nate Derbinsky, José Bento. 835-844
- High Performance Parallel Graph Coloring on GPGPUs. Pingfan Li, Xuhao Chen, Zhe Quan, Jianbin Fang, Huayou Su, Tao Tang, Canqun Yang. 845-854
- GABB Introduction and Committees. Tim Mattson. 855
- GABB 2016 Keynote. David A. Bader. 856
- Array Types for a Graph Processing Language. Mark Tullsen, Matthew J. Sottile. 857-866
- The Right Way to Search Evolving Graphs. Jiahao Chen, Weijian Zhang. 867-876
- Updating PageRank for Streaming Graphs. E. Jason Riedy. 877-884
- Application of Graph Sparsification in Developing Parallel Algorithms for Updating Connected Components. Sriram Srinivasan, Sanjukta Bhowmick, Sajal K. Das. 885-891
- Towards a Distributed Large-Scale Dynamic Graph Data Store. Keita Iwabuchi, Scott Sallinen, Roger A. Pearce, Brian Van Essen, Maya Gokhale, Satoshi Matsuoka. 892-901
- Enforced Sparse Non-negative Matrix Factorization. Brendan Gavin, Vijay Gadepally, Jeremy Kepner. 902-911
- GBTL-CUDA: Graph Algorithms and Primitives for GPUs. Peter Zhang, Marcin Zalewski, Andrew Lumsdaine, Samantha Misurda, Scott McMillan. 912-920
- Jaccard Coefficients as a Potential Graph Benchmark. Peter M. Kogge. 921-928
- PageRank Pipeline Benchmark: Proposal for a Holistic System Benchmark for Big-Data Platforms. Patrick Dreher, Chansup Byun, Chris Hill, Vijay Gadepally, Bradley C. Kuszmaul, Jeremy Kepner. 929-937
- EduPar Introduction and Committees. Ramachandran Vaidyanathan, Sushil K. Prasad, Satish Puri. 938-940
- EduPar 2016 Keynote. Randal E. Bryant. 941
- WebGPU: A Scalable Online Development Platform for GPU Programming Courses. Abdul Dakkak, Carl Pearson, Wen-mei Hwu. 942-949
- Parallel Programming with Pictures in a Snap! Annette C. Feng, Wu-chun Feng. 950-957
- Modules to Teach Parallel and Distributed Computing Using MPI for Python and Disco. José R. Ortiz-Ubarri, Rafael A. Arce-Nazario, Edusmildo Orozco. 958-962
- VIPLE: Visual IoT/Robotics Programming Language Environment for Computer Science Education. Yinong Chen, Gennaro De Luca. 963-971
- Seeing Multithreaded Behavior Using TSGL. Joel C. Adams, Patrick A. Crain, Christopher P. Dilley. 972-977
- The Suzaku Pattern Programming Framework. Barry Wilkinson, Clayton Ferner. 978-986
- A Flipped Classroom Approach to Teaching Concurrency and Parallelism. Shirley Moore, Steven R. Dunlop. 987-995
- A Parallel Programming Course Based on an Execution Time-Energy Consumption Optimization Problem. Javier Cuenca, Domingo Giménez. 996-1003
- HPDAV Introduction and Committees. Wes Bethel. 1004-1005
- HPDAV 2016 Keynote. Jim Jeffers. 1006
- Visualization and Analysis for Near-Real-Time Decision Making in Distributed Workflows. David Pugmire, James Kress, Jong Youl Choi, Scott Klasky, Tahsin M. Kurç, Michael Churchill, Matthew Wolf, Greg Eisenhower, Hank Childs, Kesheng Wu, Alexander Sim, Junmin Gu, Jonathan Low. 1007-1013
- High Performance Molecular Visualization: In-Situ and Parallel Rendering with EGL. John E. Stone, Peter Messmer, Robert Sisneros, Klaus Schulten. 1014-1023
- Introducing Acacia-RDF: An X10-Based Scalable Distributed RDF Graph Database Engine. Miyuru Dayarathna, Isuru Herath, Yasima Dewmini, Gayan Mettananda, Sameera Nandasiri, Sanath Jayasena, Toyotaro Suzumura. 1024-1032
- Towards Asynchronous Many-Task in Situ Data Analysis Using Legion. Philippe P. Pébay, Janine C. Bennett, David S. Hollman, Sean Treichler, Patrick S. McCormick, Christine Sweeney, Hemanth Kolla, Alex Aiken. 1033-1037
- Coupling LAMMPS and the vl3 Framework for Co-Visualization of Atomistic Simulations. Silvio Rizzi, Mark Hereld, Joseph A. Insley, Preeti Malakar, Michael E. Papka, Thomas D. Uram, Venkatram Vishwanath. 1038-1042
- Developing a Scalable SNMP Monitor. Krishna Bharadwaj, Samuel Flores, Joshua Rodriguez, Lance Long, G. Elisabeta Marai. 1043-1047
- Immersive Molecular Visualization with Omnidirectional Stereoscopic Ray Tracing and Remote Rendering. John E. Stone, William R. Sherman, Klaus Schulten. 1048-1057
- Tuned to Terrible: A Study of Parallel Particle Advection State of the Practice. Robert Sisneros, David Pugmire. 1058-1067
- VarSys Introduction. Kirk W. Cameron, Todd Gamblin, Dimitrios S. Nikolopoulos. 1068
- Variability: A Tuning Headache. Allan Porterfield, Sridutt Bhalachandra, Wei Wang, Rob Fowler. 1069-1072
- Mitigating Processor Variation through Dynamic Load Balancing. Bilge Acun, Laxmikant V. Kalé. 1073-1076
- Characterizing and Reducing Cross-Platform Performance Variability Using OS-Level Virtualization. Ivo Jimenez, Carlos Maltzahn, Jay F. Lofstead, Adam Moody, Kathryn Mohror, Remzi H. Arpaci-Dusseau, Andrea C. Arpaci-Dusseau. 1077-1080
- Towards Managing Variability in the Cloud. Ali Anwar, Yue Cheng, Ali Raza Butt. 1081-1084
- Near Real-Time Tracking of IoT Device Users. Jin-Seong Kim, Jae J. Jang, Im Y. Jung. 1085-1088
- HPPAC Introduction and Committees. Barry Rountree, Shuaiwen Leon Song. 1089
- The Right Metric for Efficient Supercomputing: A Ten-Year Retrospective. Chung-Hsing Hsu, Wu-chun Feng. 1090-1093
- Overcoming Challenges in Scalable Power Monitoring with the Power API. Ryan E. Grant, Michael Levenhagen, Stephen L. Olivier, David Debonis, Kevin T. Pedretti, James H. Laros III. 1094-1097
- Achieving Safety for Power Shifting in Overprovisioned High Performance Computing Systems. Shirley Moore. 1098-1101
- POSITION PAPER: Countering the Noise-Induced Critical Path Problem. Rogelio Long, Shirley Moore. 1102-1105
- Re-Examining HPC Energy Efficiency Dashboard Elements. Natalie J. Bates, Chung-Hsing Hsu, Neena Imam, Torsten Wilde, Dale Sartor. 1106-1109
- A Power-Aware Cost Model for HPC Procurement. Neha Gholkar, Frank Mueller, Barry Rountree. 1110-1113
- Energy Claims at Scale: Decreasing the Energy Demand of HPC Workloads at OS Level. Christopher Eibel, Timo Hönig, Wolfgang Schröder-Preikschat. 1114-1117
- Systemwide Power Management with Argo. Daniel A. Ellsworth, Tapasya Patki, Swann Perarnau, Sangmin Seo, Abdelhalim Amer, Judicael A. Zounmevo, Rinku Gupta, Kazutomo Yoshii, Henry Hoffmann, Allen D. Malony, Martin Schulz, Peter H. Beckman. 1118-1121
- Best Practices for Scalable Power Measurement and Control. Scott Walker, Marty McFadden. 1122-1131
- LibPowerMon: A Lightweight Profiling Framework to Profile Program Context and System-Level Metrics. Aniruddha Marathe, Hormozd Gahvari, Jae-Seung Yeom, Abhinav Bhatele. 1132-1141
- Power Balancing in an Emulated Exascale Environment. Matthias Maiterth, Martin Schulz, Dieter Kranzlmüller, Barry Rountree. 1142-1149
- Combining Power and Performance Modeling for Application Analysis: A Case Study Using Aspen. Sand L. Correa, Mariam Umar, Kirk W. Cameron. 1150-1159
- Effective Utilization of CUDA Hyper-Q for Improved Power and Performance Efficiency. Ryan S. Luley, Qinru Qiu. 1160-1169
- Identification of critical parameters for MapReduce energy efficiency using statistical Design of Experiments. Nidhi Tiwari, Umesh Bellur, Santonu Sarkar, Maria Indrawan. 1170-1179
- Utilizing Hardware Performance Counters to Model and Optimize the Energy and Performance of Large Scale Scientific Applications on Power-Aware Supercomputers. Xingfu Wu, Valerie E. Taylor. 1180-1189
- Energy, Power, and Performance Characterization of GPGPU Benchmark Programs. Jared Coplin, Martin Burtscher. 1190-1199
- PDSEC Introduction and Committees. Peter Strazdins, Raphaël Couturier, Keita Teranishi, Alan Gray, Thomas Rauber, Gudula Rünger, Laurence T. Yang. 1200-1201
- Electron Dynamics Simulation with Time-Dependent Density Functional Theory on Large Scale Symmetric Mode Xeon Phi Cluster. Yuta Hirokawa, Taisuke Boku, Shunsuke Sato, Kazuhiro Yabana. 1202-1211
- Towards an Efficient Task-Based Parallelization over a Runtime System of an Explicit Finite-Volume CFD Code with Adaptive Time Stepping. Jean Marie Couteyen Carpaye, Jean Roman, Pierre Brenner. 1212-1221
- Radiative Heat Transfer Calculation on 16384 GPUs Using a Reverse Monte Carlo Ray Tracing Approach with Adaptive Mesh Refinement. Alan Humphrey, Daniel Sunderland, Todd Harman, Martin Berzins. 1222-1231
- Application Fault Tolerance for Shrinking Resources via the Sparse Grid Combination Technique. Peter E. Strazdins, Md. Mohsin Ali, Bert J. Debusschere. 1232-1238
- Two-Level Checkpointing and Verifications for Linear Task Graphs. Anne Benoit, Aurélien Cavelan, Yves Robert, Hongyang Sun. 1239-1248
- On the Development of Variable Size Batched Computation for Heterogeneous Parallel Architectures. Ahmad Abdelfattah, Azzam Haidar, Stanimire Tomov, Jack Dongarra. 1249-1258
- Synapse: Synthetic Application Profiler and Emulator. André Merzky, Shantenu Jha. 1259-1268
- DPDNS Introduction and Committees. Dimiter Avresky, Erik Maehle, Roberto Palmieri. 1269
- DPDNS 2016 Keynote. Shlomi Dolev. 1270
- Distributed Decentralized Domain Name Service. Brendan Benshoof, Andrew Rosen, Anu G. Bourgeois, Robert W. Harrison. 1279-1287
- Management Software for Protocol-level Adaptations in Dependable Network Services. Kaliappa Ravindran. 1288-1297
- Mitigating Routing Inefficiencies to Cloud-Storage Providers: A Case Study. Soham Sinha, Di Niu, Zhi Wang, Paul Lu. 1298-1306
- Leaderless Consensus: The State of the Art. Roberto Palmieri. 1307-1310
- Proactive Cloud Management for Highly Heterogeneous Multi-cloud Infrastructures. Alessandro Pellegrini, Pierangelo di Sanzo, Dimiter R. Avresky. 1311-1318
- Towards Resiliency Evaluation of Vector Programs. Vishal Chandra Sharma, Ganesh Gopalakrishnan, Sriram Krishnamoorthy. 1319-1328
- Analysis of Adaptive Mapping of Parallelized Application on Multicore System. Gilles Bizot, Dimiter Avresky, Fabien Chaix. 1329-1338
- LSPP Introduction and Committees. Kevin J. Barker, Christopher D. Carothers, Eric Van Hensbergen. 1339
- LSPP 2016 Keynote. Michael E. Papka. 1340
- Evaluating the Performance Impact of Multiple Streams on the MIC-Based Heterogeneous Platform. Zhaokui Li, Jianbin Fang, Tao Tang, Xuhao Chen, Cheng Chen, Canqun Yang. 1341-1350
- Parallel Implementation Strategies for Hierarchical Non-uniform Memory Access Systems by Example of the Scale-Invariant Feature Transform Algorithm. Max Plauth, Wieland Hagen, Frank Feinbube, Felix Eberhardt, Lena Feinbube, Andreas Polze. 1351-1359
- Efficient Genetic Algorithm Encoding for Large-Scale Multi-objective Resource Allocation. Ryan Friese. 1360-1369
- Toward an End-to-End Framework for Modeling, Monitoring and Anomaly Detection for Scientific Workflows. Anirban Mandal, Paul Ruth, Ilya Baldin, Dariusz Król, Gideon Juve, Rajiv Mayani, Rafael Ferreira da Silva, Ewa Deelman, Jeremy S. Meredith, Jeffrey S. Vetter, Vickie E. Lynch, Ben Mayer, James Wynne, Mark Blanco, Christopher D. Carothers, Justin M. LaPre, Brian Tierney. 1370-1379
- Modeling the Performance and Energy Impact of Dynamic Power Steering. Kevin J. Barker, Darren J. Kerbyson. 1380-1389
- ParLearning Introduction and Committees. Charalampos Chelmis, Sutanay Choudhury, Arindam Pal, Anand V. Panangadan, Weiqin Tong, Yinglong Xia. 1390-1391
- ParLearning 2016 Keynote. Peter M. Kogge. 1392
- A Novel Scalable DBSCAN Algorithm with Spark. Dianwei Han, Ankit Agrawal, Wei-keng Liao, Alok N. Choudhary. 1393-1402
- A Multi-Platform Evaluation of the Randomized CX Low-Rank Matrix Factorization in Spark. Alex Gittens, Jey Kottalam, Jiyan Yang, Michael F. Ringenburg, Jatin Chhugani, Evan Racah, Mohitdeep Singh, Yushu Yao, Curt Fischer, Oliver Rübel, Benjamin P. Bowen, Norman G. Lewis, Michael W. Mahoney, Venkat Krishnamurthy, Prabhat. 1403-1412
- Cache-Aware Approximate Computing for Decision Tree Learning. Orhan Kislal, Mahmut T. Kandemir, Jagadish Kotra. 1413-1422
- Accelerating Support Count for Association Rule Mining on GPUs. Vasileios Zois, Anand V. Panangadan, Viktor K. Prasanna. 1423-1432
- A Scheduling Algorithm for Hadoop MapReduce Workflows with Budget Constraints in the Heterogeneous Cloud. Andrew Wylie, Wei Shi, Jean-Pierre Corriveau, Yang Wang. 1433-1442
- An Automatic Tuning System for Solving NP-Hard Problems in Clouds. Yanik Ngoko, Denis Trystram, Valentin Reis, Christophe Cérin. 1443-1452
- GraQL: A Query Language for High-Performance Attributed Graph Databases. Daniel G. Chavarría-Miranda, Vito Giovanni Castellana, Alessandro Morari, David Haglin, John Feo. 1453-1462
- Scalable Overlapping Community Detection. Ismail El-Helw, Rutger F. H. Hofman, Wenzhe Li, Sungjin Ahn, Max Welling, Henri E. Bal. 1463-1472
- An Efficient Parallel Nonlinear Clustering Algorithm Using MapReduce. Xiang-You Peng, Yu-Bo Yang, Chang-Dong Wang, Dong Huang, Jian-Huang Lai. 1473-1476
- A New Evaluation System for Scholars and Majors Based on Big-Data Techniques. Wenhua Yu, Lei Zhao, Xiangyu He, Jiacheng Zhou, Tong Cheng, Chengzhao Xue, Fan Yang. 1477-1480
- Open Source Initiatives and Frameworks Addressing Distributed Real-Time Data Analytics. Sarwar Morshed, Juwel Rana, Marcelo Milrad. 1481-1484
- JSSPP Introduction and Committees. Walfredo Cirne, Narayan Desai. 1485
- iWAPT Introduction and Committees. Weichung Wang. 1486-1487
- Auto-Tuning of Hybrid MPI/OpenMP Execution with Code Selection by ppOpen-AT. Takahiro Katagiri, Masaharu Matsumoto, Satoshi Ohshima. 1488-1495
- Utilization and Expansion of ppOpen-AT for OpenACC. Satoshi Ohshima, Takahiro Katagiri, Masaharu Matsumoto. 1496-1505
- Measurement Bias from Address Aliasing. Lars Kirkholt Melhus, Rune Erlend Jensen. 1506-1515
- Blk-Tune: Blocking Parameter Auto-Tuning to Minimize Input-Output Traffic for Flash-Based Out-of-Core Stencil Computations. Hiroko Midorikawa. 1516-1526
- A Time-Cost Based Automatic Scheduling Framework for Matrix Computation on Various Distributed Computing Platforms. Rong Gu, Zhiqiang Liu, Chunfeng Yuan, Yihua Huang. 1527-1534
- Exploiting Performance Portability in Search Algorithms for Autotuning. Amit Roy, Prasanna Balaprakash, Paul D. Hovland, Stefan M. Wild. 1535-1544
- Search Space Generation and Pruning System for Autotuners. Piotr Luszczek, Mark Gates, Jakub Kurzak, Anthony Danalis, Jack Dongarra. 1545-1554
- CHIUW Introduction and Committees. Tom MacDonald, Greg Titus. 1555-1556
- CHIUW 2016 Keynote. Nikhil Padmanabhan. 1557
- Optimizing Chapel for Single-Node Environments. Richard B. Johnson, Jeffrey K. Hollingsworth. 1558-1567
- PGAS Access Overhead Characterization in Chapel. Engin Kayraklioglu, Olivier Serres, Ahmad Anbar, Hashem Elezabi, Tarek A. El-Ghazawi. 1568-1577
- Chplvis: A Communication and Task Visualization Tool for Chapel. Philip A. Nelson, Greg Titus. 1578-1585
- Transparently Resilient Task Parallelism for Chapel. Konstantina Panagiotopoulou, Hans-Wolfgang Loidl. 1586-1595
- HPBDC Introduction and Committees. Dhabaleswar K. Panda, Jianfeng Zhan, Xiaoyi Lu. 1596
- Evaluation of SMP Shared Memory Machines for Use with In-Memory and OpenMP Big Data Applications. Andrew J. Younge, Christopher Reidy, Robert Henschel, Geoffrey C. Fox. 1597-1606
- Hadoop on HPC: Integrating Hadoop and Pilot-Based Dynamic Resource Management. André Luckow, Ioannis Paraskevakos, George Chantzialexiou, Shantenu Jha. 1607-1616
- PACM: A Prediction-Based Auto-Adaptive Compression Model for HDFS. Ruijian Wang, Chao Wang, Li Zha. 1617-1626
- SamzaSQL: Scalable Fast Data Management with Streaming SQL. Milinda Pathirage, Julian Hyde, Yi Pan, Beth Plale. 1627-1636
- Towards High Performance Processing of Streaming Data in Large Data Centers. Supun Kamburugamuve, Saliya Ekanayake, Milinda Pathirage, Geoffrey C. Fox. 1637-1644
- Extracting Log Patterns from System Logs in LARGE. Yining Zhao, Haili Xiao. 1645-1652
- Exploring the Performance of Spark for a Scientific Use Case. Saba Sehrish, Jim Kowalkowski, Marc F. Paterno. 1653-1659
- Big Data for Medical Image Analysis: A Performance Study. Rui Zhang, Hongzhi Wang, Renu Tewari, Gero Schmidt, Deepika Kakrania. 1660-1664
- HPCMASPA Introduction and Committees. Benjamin A. Allan, Jim M. Brandt, Ann C. Gentile, Cory Lueninghoener, Nichamon Naksinehaboon, Boyana Norris, Narate Taerat. 1665-1666
- HPCMASPA 2016 Keynote. William T. C. Kramer. 1667
- Calltree-Controlled Instrumentation for Low-Overhead Survey Measurements. Christian Iwainsky, Christian H. Bischof. 1668-1677
- Automatically Instrumenting Scientific Applications to Produce Heartbeat Events. Mohammed Tanash, Nasim Ghazanfari, Omar Aaziz, Jonathan Cook. 1678-1686
- Defining Metrics to Distill Large-Scale HPC Platform and Application Performance Data into Actionable Quantities. Anthony Agelastos. 1687-1691
- Using Intrinsic Performance Counters to Assess Efficiency in Task-Based Parallel Applications. Patricia Grubel, Hartmut Kaiser, Kevin A. Huck, Jeanine Cook. 1692-1701
- Understanding Application and System Performance Through System-Wide Monitoring. R. Todd Evans, James C. Browne, William L. Barth. 1702-1710
- Large-Scale Persistent Numerical Data Source Monitoring System Experiences. Jim M. Brandt, Ann C. Gentile, Michael T. Showerman, Jeremy Enos, Joshi Fullop, Gregory H. Bauer. 1711-1720
- Design and Implementation of a Scalable HPC Monitoring System. Sam Sanchez, Amanda Bonnie, Graham van Heule, Conor Robinson, Adam DeConinck, Kathleen Kelly, Quellyn Snead, Jim M. Brandt. 1721-1725
- IPDRM Introduction and Committees. Shuaiwen Leon Song, Todd Gamblin. 1726
- IPDRM 2016 Keynote. Henry Hoffmann. 1727
- Non-intrusive Migration of MPI Processes in OS-Bypass Networks. Simon Pickartz, Carsten Clauss, Stefan Lankes, Stephan Krempel, Thomas Moschny, Antonello Monti. 1728-1735
- Photon: Remote Memory Access Middleware for High-Performance Runtime Systems. Ezra Kissel, Martin Swany. 1736-1743
- Asynchronous Runtimes in Action: An Introspective Framework for a Next Gen Runtime. Joshua Suetterlein, Joshua Landwehr, Andrès Márquez, Joseph B. Manzano, Guang R. Gao. 1744-1751
- OWBP: Flash-Aware Offline Write Buffer Policy. Alireza Haghdoost, David H. C. Du. 1752-1758
- Topology-Aware Rank Reordering for MPI Collectives. Seyed Hessam Mirsadeghi, Ahmad Afsahi. 1759-1768
- GPUShare: Fair-Sharing Middleware for GPU Clouds. Anshuman Goswami, Jeffrey Young, Karsten Schwan, Naila Farooqui, Ada Gavrilovska, Matthew Wolf, Greg Eisenhauer. 1769-1776
- Performance Characterization of Hypervisor- and Container-Based Virtualization for HPC on SR-IOV Enabled InfiniBand Clusters. Jie Zhang, Xiaoyi Lu, Dhabaleswar K. Panda. 1777-1784
- Macaca: A Scalable and Energy-Efficient Platform for Coupling Cloud Computing with Distributed Embedded Computing. Heng Zhang, Chunliang Hao, Yanjun Wu, Mingshu Li. 1785-1788
- Benchmarking Streaming Computation Engines: Storm, Flink and Spark Streaming. Sanket Chintapalli, Derek Dagit, Bobby Evans, Reza Farivar, Thomas Graves, Mark Holderbaugh, Zhuo Liu, Kyle Nusbaum, Kishorkumar Patil, Boyang Peng, Paul Poulosky. 1789-1792
- ParSocial Introduction and Committees. Eunice E. Santos, John Korah. 1793-1794
- ParSocial 2016 Keynote. George Cybenko. 1795
- Towards Reliable Social Sensing in Cyber-Physical-Social Systems. Chao Huang, Jermaine Marshall, Dong Wang, Mianxiong Dong. 1796-1802
- Toward the New Version of D-MASON: Efficiency, Effectiveness and Correctness in Parallel and Distributed Agent-Based Simulations. Gennaro Cordasco, Carmine Spagnuolo, Vittorio Scarano. 1803-1812
- Emergency-Driven Assured Information Sharing in Secure Online Social Networks: A Position Paper. Bhavani M. Thuraisingham, Murat Kantarcioglu, Latifur Khan, Barbara Carminati, Elena Ferrari, Leila Bahri. 1813-1820
- Efficient Anytime Anywhere Algorithms for Closeness Centrality in Large and Dynamic Graphs. Eunice E. Santos, John Korah, Vairavan Murugappan, Suresh Subramanian. 1821-1830
- Addressing Behavioral Uncertainty in Security Games: An Efficient Robust Strategic Solution for Defender Patrols. Thanh Hong Nguyen, Arunesh Sinha, Milind Tambe. 1831-1838
- Workshop 24-Roundtable I Introduction. Dick Brown, Suzanne Matthews. 1839