- FASTHash: FPGA-Based High Throughput Parallel Hash Table. Yang Yang, Sanmukh R. Kuppannagari, Ajitesh Srivastava, Rajgopal Kannan, Viktor K. Prasanna. 3-22
- Running a Pre-exascale, Geographically Distributed, Multi-cloud Scientific Simulation. Igor Sfiligoi, Frank Würthwein, Benedikt Riedel, David Schultz. 23-40
- TM Streaming-Aggregation Hardware Design and Evaluation. Richard L. Graham, Lion Levi, Devendar Bureddy, Gil Bloch, Gilad Shainer, David Cho, George Elias, Daniel Klein, Joshua Ladd, Ophir Maor, Ami Marelli, Valentin Petrov, Evyatar Romlet, Yong Qin, Ido Zemah. 41-59
- Predicting Job Power Consumption Based on RJMS Submission Data in HPC Systems. Théo Saillant, Jean-Christophe Weill, Mathilde Mougeot. 63-82
- HyPar-Flow: Exploiting MPI and Keras for Scalable Hybrid-Parallel DNN Training with TensorFlow. Ammar Ahmad Awan, Arpan Jain, Quentin Anthony, Hari Subramoni, Dhabaleswar K. Panda. 83-103
- Time Series Mining at Petascale Performance. Amir Raoofy, Roman Karlstetter, Dai Yang, Carsten Trinitis, Martin Schulz. 104-123
- Shared-Memory Parallel Probabilistic Graphical Modeling Optimization: Comparison of Threads, OpenMP, and Data-Parallel Primitives. Talita Perciano, Colleen Heinemann, David Camp, Brenton Lessley, E. Wes Bethel. 127-145
- Opportunities for Cost Savings with In-Transit Visualization. James Kress, Matthew Larsen, Jong Choi, Mark Kim, Matthew Wolf, Norbert Podhorszki, Scott Klasky, Hank Childs, David Pugmire. 146-165
- 6 Jobs. Eugen Betke, Julian M. Kunkel. 166-184
- Embedding Algorithms for Quantum Annealers with Chimera and Pegasus Connection Topologies. Stefanie Zbinden, Andreas Bärtschi, Hristo Djidjev, Stephan J. Eidenbenz. 187-206
- Solving Acoustic Boundary Integral Equations Using High Performance Tile Low-Rank LU Factorization. Noha Al-Harthi, Rabab Alomairy, Kadir Akbudak, Rui Chen, Hatem Ltaief, Hakan Bagci, David E. Keyes. 209-229
- DGEMM Using Tensor Cores, and Its Accurate and Reproducible Versions. Daichi Mukunoki, Katsuhisa Ozaki, Takeshi Ogita, Toshiyuki Imamura. 230-248
- Using High-Level Synthesis to Implement the Matrix-Vector Multiplication on FPGA. Alessandro Marongiu, Paolo Palazzari. 251-269
- Enabling Execution of a Legacy CFD Mini Application on Accelerators Using OpenMP. Ioannis Nompelis, Gabriele Jost, Alice Koniges, Christopher Daley, David Eder, Christopher Stone. 270-287
- Load-Balancing Parallel Relational Algebra. Sidharth Kumar, Thomas Gilray. 288-308
- Sparse Linear Algebra on AMD and NVIDIA GPUs - The Race Is On. Yuhsiang M. Tsai, Terry Cojean, Hartwig Anzt. 309-327
- Scaling Genomics Data Processing with Memory-Driven Computing to Accelerate Computational Biology. Matthias Becker, Umesh Worlikar, Shobhit Agrawal, Hartmut Schultze, Thomas Ulas, Sharad Singhal, Joachim L. Schultze. 328-344
- Footprint-Aware Power Capping for Hybrid Memory Based Systems. Eishi Arima, Toshihiro Hanawa, Carsten Trinitis, Martin Schulz. 347-369
- Offsite Autotuning Approach - Performance Model Driven Autotuning Applied to Parallel Explicit ODE Methods. Johannes Seiferth, Matthias Korch, Thomas Rauber. 370-390
- Desynchronization and Wave Pattern Formation in MPI-Parallel and Hybrid Memory-Bound Programs. Ayesha Afzal, Georg Hager, Gerhard Wellein. 391-411
- Understanding HPC Benchmark Performance on Intel Broadwell and Cascade Lake Processors. Christie L. Alappat, Johannes Hofmann, Georg Hager, Holger Fehske, Alan R. Bishop, Gerhard Wellein. 412-433
- Timemory: Modular Performance Analysis for HPC. Jonathan R. Madsen, Muaaz G. Awan, Hugo Brunie, Jack Deslippe, Rahulkumar Gayatri, Leonid Oliker, Yunsong Wang, Charlene Yang, Samuel Williams. 434-452
- TeaMPI - Replication-Based Resilience Without the (Performance) Pain. Philipp Samfass, Tobias Weinzierl, Benjamin Hazelwood, Michael Bader. 455-473
- Pattern-Aware Staging for Hybrid Memory Systems. Eishi Arima, Martin Schulz. 474-495
- Simplifying Communication Overlap in OpenSHMEM Through Integrated User-Level Thread Scheduling. Md. Wasi-ur-Rahman, David Ozog, James Dinan. 496-516
- Communication-Aware Hardware-Assisted MPI Overlap Engine. Mohammadreza Bayatpour, ahanzeb Maqbool Hashmil, Sourav Chakraborty, Kaushik Kandadi Suresh, Seyedeh Mahdieh Ghazimirsaeed, Bharath Ramesh, Hari Subramoni, Dhabaleswar K. Panda. 517-535
- ++: Evaluating the Performance of Global-Restart Recovery Methods for MPI Fault Tolerance. Giorgis Georgakoudis, Luanzheng Guo, Ignacio Laguna. 536-554