- Towards millions of communicating threads. Hoang-Vu Dang, Marc Snir, William Gropp. 1-14
- Efficient Large Message Broadcast using NCCL and CUDA-Aware MPI for Deep Learning. A. A. Awan, Khaled Hamidouche, Akshay Venkatesh, Dhabaleswar K. Panda. 15-22
- Generalisation of Recursive Doubling for AllReduce. Martin Ruefenacht, Mark Bull, Stephen Booth. 23-31
- Space Performance Tradeoffs in Compressing MPI Group Data Structures. Sameer Kumar, Philip Heidelberger, Craig B. Stunkel. 32-40
- Modeling MPI Communication Performance on SMP Nodes: Is it Time to Retire the Ping Pong Test. William Gropp, Luke N. Olson, Philipp Samfass. 41-50
- Introducing Task-Containers as an Alternative to Runtime-Stacking. Jean-Baptiste Besnard, Julien Adam, Sameer Shende, Marc Pérache, Patrick Carribault, Julien Jaeger. 51-63
- The MIG Framework: Enabling Transparent Process Migration in Open MPI. Federico Reghenzani, Gianmario Pozzi, Giuseppe Massari, Simone Libutti, William Fornaciari. 64-73
- Architecting Malleable MPI Applications for Priority-driven Adaptive Scheduling. Pierre Lemarinier, Khalid Hasanov, Srikumar Venugopal, Kostas Katrinis. 74-81
- Infrastructure and API Extensions for Elastic Execution of MPI Applications. Isaías A. Comprés Ureña, Ao Mo-Hellenbrand, Michael Gerndt, Hans-Joachim Bungartz. 82-97
- A Library for Advanced Datatype Programming. Jesper Larsson Träff. 98-107
- On the Expected and Observed Communication Performance with MPI Derived Datatypes. Alexandra Carpen-Amarie, Sascha Hunold, Jesper Larsson Träff. 108-120
- MPI Sessions: Leveraging Runtime Infrastructure to Increase Scalability of Applications at Exascale. Daniel Holmes, Kathryn Mohror, Ryan E. Grant, Anthony Skjellum, Martin Schulz, Wesley Bland, Jeffrey M. Squyres. 121-129
- Distributed Memory Implementation Strategies for the kinetic Monte Carlo Algorithm. António Esteves, Alfredo Moura. 130-139
- How I Learned to Stop Worrying and Love In Situ Analytics: Leveraging Latent Synchronization in MPI Collective Algorithms. Scott Levy, Kurt B. Ferreira, Patrick M. Widener, Patrick G. Bridges, Oscar H. Mondragon. 140-153
- The Potential of Diffusive Load Balancing at Large Scale. Matthias Lieber, Kerstin Gößner, Wolfgang E. Nagel. 154-157
- Optimization of Message Passing Services on POWER8 InfiniBand Clusters. Sameer Kumar, Robert Blackmore, Sameh Sharkawi, K. A. Nysal Jan, Amith R. Mamidala, T. J. Chris Ward. 158-166
- Using InfiniBand Hardware Gather-Scatter Capabilities to Optimize MPI All-to-All. Ana Gainaru, Richard L. Graham, Artem Polyakov, Gilad Shainer. 167-179
- Revisiting RDMA Buffer Registration in the Context of Lightweight Multi-kernels. Balazs Gerofi, Masamichi Takagi, Yutaka Ishikawa. 180-183
- An Evaluation of the One-Sided Performance in Open MPI. Nathan Hjelm. 184-187
- Runtime Correctness Analysis of MPI-3 Nonblocking Collectives. Tobias Hilbrich, Matthias Weber, Joachim Protze, Bronis R. de Supinski, Wolfgang E. Nagel. 188-197
- CAF Events Implementation Using MPI-3 Capabilities. Alessandro Fanfarillo, Jeff Hammond. 198-207
- Allowing MPI tools builders to forget about Fortran. Søren Rasmussen, Martin Schulz, Kathryn Mohror. 208-211
- FFT data distribution in plane-waves DFT codes. A case study from Quantum ESPRESSO. Fabio Affinito, Carlo Cavazzoni. 212
- Optimizing PARSEC for Knights Landing. Alexey Malhanov, Ariel J. Biller, Michael Chuvelev. 213-214
- Effective Calculation with Halo communication using Halo Functions. Keiichiro Fukazawa, Toshiya Takami, Takeshi Soga, Yoshiyuki Morie, Takeshi Nanri. 215-216
- MPI usage at NERSC: Present and Future. Alice Koniges, Brandon Cook, Jack Deslippe, Thorsten Kurth, Hongzhang Shan. 217
- Performance comparison of Eulerian kinetic Vlasov code between flat-MPI parallelism and hybrid parallelism on Fujitsu FX100 supercomputer. Takayuki Umeda, Keiichiro Fukazawa. 218-221