Abstract is missing.
- Analyzing the I/O Scalability of a Parallel Particle-in-Cell CodeSandra Méndez, Nicolay J. Hammer, Anupam Karmakar. 9-22 [doi]
- Cost and Performance Modeling for Earth System Data Management and BeyondJakob Lüttgau, Julian M. Kunkel. 23-35 [doi]
- I/O Interference Alleviation on Parallel File Systems Using Server-Side QoS-Based Load-BalancingYuichi Tsujita, Yoshitaka Furutani, Hajime Hida, Keiji Yamamoto, Atsuya Uno, Fumichika Sueyasu. 36-48 [doi]
- Tools for Analyzing Parallel I/OJulian Martin Kunkel, Eugen Betke, Matt Bryson, Philip H. Carns, Rosemary Francis, Wolfgang Frings, Roland Laifer, Sandra Méndez. 49-70 [doi]
- Understanding Metadata Latency with MDWorkbenchJulian Martin Kunkel, George S. Markomanolis. 75-88 [doi]
- From Application to Disk: Tracing I/O Through the Big Data StackRobert Schmidtke, Florian Schintke, Thorsten Schütt. 89-102 [doi]
- IOscope: A Flexible I/O Tracer for Workloads' I/O Pattern CharacterizationAbdulqawi Saif, Lucas Nussbaum, Ye-Qiong Song. 103-116 [doi]
- Exploring Scientific Application Performance Using Large Scale Object StorageSteven Wei Der Chien, Stefano Markidis, Rami Karim, Erwin Laure, Sai Narasimhamurthy. 117-130 [doi]
- Benefit of DDN's IME-FUSE for I/O Intensive HPC ApplicationsEugen Betke, Julian M. Kunkel. 131-144 [doi]
- Performance Study of Non-volatile Memories on a High-End SupercomputerLeonardo Bautista-Gomez, Kai Keller, Osman S. Unsal. 145-156 [doi]
- Self-optimization Strategy for IO Accelerator ParameterizationLionel Vincent, Mamady Nabe, Gaël Goret. 157-170 [doi]
- utmem: Towards Memory Elasticity in Cloud WorkloadsAimilios Tsalapatis, Stefanos Gerangelos, Stratos Psomadakis, Konstantinos Papazafeiropoulos, Nectarios Koziris. 173-183 [doi]
- Efficient Live Migration of Linux ContainersRadostin Stoyanov, Martin J. Kollingbaum. 184-193 [doi]
- Coupling the Uintah Framework and the VisIt Toolkit for Parallel In Situ Data Analysis and Visualization and Computational SteeringAllen Sanderson, Alan Humphrey, John A. Schmidt, Robert Sisneros. 201-214 [doi]
- Binning Based Data Reduction for Vector Field Data of a Particle-In-Cell Fusion SimulationJames Kress, Jong Choi, Scott Klasky, Michael Churchill, Hank Childs, David Pugmire. 215-229 [doi]
- In Situ Analysis and Visualization of Fusion Simulations: Lessons LearnedMark Kim, James Kress, Jong Choi, Norbert Podhorszki, Scott Klasky, Matthew Wolf, Kshitij Mehta, Kevin A. Huck, Berk Geveci, Sujin Phillip, Robert Maynard, Hanqi Guo, Tom Peterka, Kenneth Moreland, Choong-Seock Chang, Julien Dominski, Michael Churchill, David Pugmire. 230-242 [doi]
- Design of a Flexible In Situ Framework with a Temporal Buffer for Data Processing and Visualization of Time-Varying DatasetsKenji Ono, Jorji Nonaka, Hiroyuki Yoshikawa, Takeshi Nanri, Yoshiyuki Morie, Tomohiro Kawanabe, Fumiyoshi Shoji. 243-257 [doi]
- Streaming Live Neuronal Simulation Data into Visualization and AnalysisSimon Oehrl, Jan Müller, Jan Schnathmeier, Jochen Martin Eppler, Alexander Peyser, Hans Ekkehard Plesser, Benjamin Weyers, Bernd Hentschel 0001, Torsten W. Kuhlen, Tom Vierjahn. 258-272 [doi]
- Enabling Explorative Visualization with Full Temporal Resolution via In Situ Calculation of Temporal IntervalsNicole Marsaglia, Shaomeng Li, Hank Childs. 273-293 [doi]
- In-Situ Visualization of Solver Residual FieldsKai Sdeo, Boyan Zheng, Marian Piatkowski, Filip Sadlo. 294-309 [doi]
- An In-Situ Visualization Approach for the K Computer Using Mesa 3D and KVSKengo Hayashi, Naohisa Sakamoto, Jorji Nonaka, Motohiko Matsuda, Fumiyoshi Shoji. 310-322 [doi]
- Comparing Controlflow and Dataflow for Tensor Calculus: Speed, Power, Complexity, and MTBFMilos Kotlar, Veljko Milutinovic. 329-346 [doi]
- Supercomputer in a Laptop: Distributed Application and Runtime Development via Architecture SimulationSamuel Knight, Joseph P. Kenny, Jeremiah J. Wilke. 347-359 [doi]
- CGYRO Performance on Power9 CPUs and Volta GPUsIgor Sfiligoi, J. Candy, Mark Kostuk. 365-372 [doi]
- A 64-GB Sort at 28 GB/s on a 4-GPU POWER9 Node for Uniformly-Distributed 16-Byte Records with 8-Byte KeysGordon C. Fossum, Ting Wang, H. Peter Hofstee. 373-386 [doi]
- Early Experience on Running OpenStaPLE on DAVIDEClaudio Bonati, Enrico Calore, Massimo D'Elia, Michele Mesiti, Francesco Negro, Sebastiano Fabio Schifano, Giorgio Silvi, Raffaele Tripiccione. 387-401 [doi]
- Porting and Benchmarking of BWAKIT Pipeline on OpenPOWER ArchitectureNagarajan Kathiresan, Rashid Al-Ali, Puthen V. Jithesh, Ganesan Narayanasamy, Zaid Al-Ars. 402-410 [doi]
- Improving Performance and Energy Efficiency on OpenPower Systems Using Scalable Hardware-Software Co-designMilos Puzovic, Vadim Elisseev, Kirk E. Jordan, James McDonagh, Alexander Harrison, Robert Sawko. 411-417 [doi]
- Porting DMRG++ Scientific Application to OpenPOWERArghya Chatterjee, Gonzalo Alvarez, Eduardo F. D'Azevedo, Wael R. Elwasif, Oscar Hernandez, Vivek Sarkar. 418-431 [doi]
- Job Management with mpi_jmEvan Berkowitz, Gustav R. Jansen, Kenneth McElvain, André Walker-Loud. 432-439 [doi]
- Compile-Time Library Call Detection Using CAASCADE and XALTJisheng Zhao, Oscar R. Hernandez, Reuben D. Budiardja, M. Graham Lopez, Vivek Sarkar, Jack C. Wells. 440-447 [doi]
- NUMA-Aware Data-Transfer Measurements for Power/NVLink Multi-GPU SystemsCarl Pearson, I-Hsin Chung, Zehra Sura, Wen-mei Hwu, Jinjun Xiong. 448-454 [doi]
- Sparse CSB_Coo Matrix-Vector and Matrix-Matrix Performance on Intel Xeon ArchitecturesBrandon Cook, Charlene Yang, Thorsten Kurth, Jack Deslippe. 463-471 [doi]
- Lessons Learned from Optimizing Kernels for Adaptive Aggregation Multi-grid Solvers in Lattice QCDBálint Joó, Thorsten Kurth. 472-486 [doi]
- Distributed Training of Generative Adversarial Networks for Fast Detector SimulationSofia Vallecorsa, Federico Carminati, Gulrukh Khattak, Damian Podareanu, Valeriu Codreanu, Vikram A. Saletore, Hans Pabst. 487-503 [doi]
- Cache-Aware Roofline Model and Medical Image Processing Optimizations in GPUsEstefania Serrano, Aleksandar Ilic, Leonel Sousa, Javier García Blas, Jesús Carretero. 509-526 [doi]
- How Pre-multicore Methods and Algorithms Perform in Multicore EraAlexey L. Lastovetsky, Muhammad Fahad, Hamidreza Khaleghzadeh, Semyon Khokhriakov, Ravi Reddy, Arsalan Shahid, Lukasz Szustak, Roman Wyrzykowski. 527-539 [doi]
- Impact of Approximate Memory Data Allocation on a H.264 Software Video EncoderGiulia Stazi, Lorenzo Adani, Antonio Mastrandrea, Mauro Olivieri, Francesco Menichelli. 545-553 [doi]
- Residual Replacement in Mixed-Precision Iterative Refinement for Sparse Linear SystemsHartwig Anzt, Goran Flegar, Vedran Novakovic, Enrique S. Quintana-Ortí, Andrés E. Tomás. 554-561 [doi]
- Training Deep Neural Networks with Low Precision Input Data: A Hurricane Prediction Case StudyAlbert Kahira, Leonardo Bautista-Gomez, Rosa M. Badia. 562-569 [doi]
- A Transparent View on Approximate Computing Methods for Tuning ApplicationsMichael Bromberger, Wolfgang Karl. 570-578 [doi]
- Exploring the Effects of Code Optimizations on CPU Frequency MarginsKonstantinos Parasyris, Nikolaos Bellas, Christos D. Antonopoulos, Spyros Lalis. 579-587 [doi]
- Taking Gradients Through Experiments: LSTMs and Memory Proximal Policy Optimization for Black-Box Quantum ControlMoritz August, José Miguel Hernández-Lobato. 591-613 [doi]
- Towards Prediction of Turbulent Flows at High Reynolds Numbers Using High Performance Computing Data and Deep LearningMathis Bode, Michael Gauding, Jens Henrik Göbbert, Baohao Liao, Jenia Jitsev, Heinz Pitsch. 614-623 [doi]
- Using a Graph Visualization Tool for Parallel Program Dynamic Visualization and Communication AnalysisDenise Stringhini, Pedro Spoljaric Gomes, Alvaro Fazenda. 627-636 [doi]
- Offloading C++17 Parallel STL on System Shared Virtual Memory PlatformsPekka Jääskeläinen, John Glossner, Martin Jambor, Aleksi Tervo, Matti Rintala. 637-647 [doi]
- Lessons Learned from a Decade of Providing Interactive, On-Demand High Performance Computing to Scientists and EngineersJulia S. Mullen, Albert Reuther, William Arcand, Bill Bergeron, David Bestor, Chansup Byun, Vijay Gadepally, Michael Houle, Matthew Hubbell, Michael Jones, Anna Klein, Peter Michaleas, Lauren Milechin, Andrew Prout, Antonio Rosa, Siddharth Samsi, Charles Yee, Jeremy Kepner. 655-668 [doi]
- Enabling Interactive Supercomputing at JSC Lessons LearnedJens Henrik Göbbert, Tim Kreuzer, Alice Grosch, Andreas Lintermann, Morris Riedel. 669-677 [doi]
- Interactive Distributed Deep Learning with Jupyter NotebooksSteven Andrew Farrell, Aaron Vose, Oliver Evans, Matthew Henderson, Shreyas Cholia, Fernando Pérez, Wahid Bhimji, Shane Canon, Rollin C. Thomas, Prabhat. 678-687 [doi]
- Performance Portability of Earth System Models with User-Controlled GGDML Code TranslationNabeeh Jum'ah, Julian M. Kunkel. 693-710 [doi]
- Evaluating Performance Portability of Accelerator Programming Models using SPEC ACCEL 1.2 BenchmarksSwen Boehm, Swaroop Pophale, Verónica G. Vergara Larrea, Oscar Hernandez. 711-723 [doi]
- A Beginner's Guide to Estimating and Improving Performance PortabilityHenk Dreuning, Roel Heirman, Ana Lucia Varbanescu. 724-742 [doi]
- Profiling and Debugging Support for the Kokkos Programming ModelSimon D. Hammond, Christian R. Trott, Daniel Ibanez, Daniel Sunderland. 743-754 [doi]