Abstract is missing.
- Analyzing the Impact of Lossy Compressor Variability on Checkpointing Scientific SimulationsPavlo Triantafyllides, Tasmia Reza, Jon C. Calhoun. 1-5 [doi]
- Evaluating Burst Buffer Placement in HPC SystemsHarsh Khetawat, Christopher Zimmer, Frank Mueller, Scott Atchley, Sudharshan S. Vazhkudai, Misbah Mubarak. 1-11 [doi]
- Proxy or Imposter? A Method and Case Study to Determine the AnswerOmar Aaziz, Jeanine Cook, Courtenay Vaughan, David Richards. 1-9 [doi]
- FluentPS: A Parameter Server Design with Low-frequency Synchronization for Distributed Deep LearningXin Yao, Xueyu Wu, Cho-Li Wang. 1-12 [doi]
- Multi-physics simulations of particle tracking in arterial geometries with a scalable moving window algorithmGregory Herschlag, John Gounley, Sayan Roychowdhury, Erik W. Draeger, Amanda Randles. 1-11 [doi]
- SMQoS: Improving Utilization and Energy Efficiency with QoS Awareness on GPUsQingxiao Sun, Yi Liu, Hailong Yang, Zhongzhi Luan, Depei Qian. 1-5 [doi]
- MBECN: Enabling ECN with Micro-burst Traffic in Multi-queue Data CenterKexi Kang, Jinghui Zhang, Jiahui Jin, Dian Shen, Junzhou Luo, Wenxin Li, Zhiang Wu 0001. 1-12 [doi]
- Asynchronous Task-Based Execution of the Reverse Time Migration for the Oil and Gas IndustryAmani Alonazi, Hatem Ltaief, David E. Keyes, I. Said, Samuel Thibault. 1-11 [doi]
- μDBSCAN: An Exact Scalable DBSCAN Algorithm for Big Data Exploiting Spatial LocalityAditya Sarma, Poonam Goyal, Sonal Kumari, Anand Wani, Jagat Sesh Challa, Saiyedul Islam, Navneet Goyal. 1-11 [doi]
- LogGOPSC: A Parallel Computation Model Extending Network Contention into LogGOPSBaicheng Yan, Yi Zhou, Limin Xiao, Jiantong Huo, Zhaokai Wang. 1-2 [doi]
- Performance Characterization of DNN Training using TensorFlow and PyTorch on Modern ClustersArpan Jain, Ammar Ahmad Awan, Quentin Anthony, Hari Subramoni, Dhableswar K. D. K. Panda. 1-11 [doi]
- DP_Greedy: A Two-Phase Caching Algorithm for Mobile Cloud ServicesDong Huang, Xiaopeng Fan, Yang Wang, Shuibing He, Chengzhong Xu. 1-10 [doi]
- Leveraging Task-Based Polar Decomposition Using PARSEC on Massively Parallel SystemsDalal Sukkari, Hatem Ltaief, David E. Keyes, Mathieu Faverge. 1-12 [doi]
- Scheduling independent stochastic tasks on heterogeneous cloud platformsYiqin Gao, Louis-Claude Canon, Yves Robert, Frédéric Vivien. 1-11 [doi]
- mSMS: PGAS Runtime with Efficient Thread-based Communication for Global-view ProgrammingHiroko Midorikawa, Kenji Kitagawa, Yugo Sakaguchi. 1-2 [doi]
- HarpGBDT: Optimizing Gradient Boosting Decision Tree for Parallel EfficiencyBo Peng, Judy Qiu, Langshi Chen, Jiayu Li, Miao Jiang, Selahattin Akkas, Egor Smirnov, Ruslan Israfilov, Sergey Khekhnev, Andrey Nikolaev. 1-11 [doi]
- An Empirical Study of Cryptographic Libraries for MPI CommunicationsAbu-Naser, Mohsen Gavahi, Cong Wu, Viet Tung Hoang, Zhi Wang, Xin Yuan. 1-11 [doi]
- Engineering a Distributed Histogram SortRoger Kowalewski, Pascal Jungblut, Karl Fürlinger. 1-11 [doi]
- Scalable, High-Order Continuity Across Block Boundaries of Functional Approximations Computed in ParallelIulian Grindeanu, Tom Peterka, Vijay S. Mahadevan, Youssef S. G. Nashed. 1-9 [doi]
- Efficient User-Level Storage Disaggregation for Deep LearningYue Zhu, Weikuan Yu, Bing Jiao, Kathryn Mohror, Adam Moody, Fahim Chowdhury. 1-12 [doi]
- Cost-efficiency of Large-scale Electronic Structure Simulations with Intel Xeon Phi ProcessorsHoon Ryu, Seungmin Lee. 1-2 [doi]
- Parallelizing Training of Deep Generative Models on Massive Scientific DatasetsSam Ade Jacobs, Jim Gaffney, Tom Benson, Peter B. Robinson, Luc Peterson, Brian K. Spears, Brian Van Essen, David Hysom, Jae-Seung Yeom, Tim Moon, Rushil Anirudh, Jayaraman J. Thiagarajan, Shusen Liu, Peer-Timo Bremer. 1-10 [doi]
- Accelerating Hyperdimensional Classifier on Multiple GPUsZheming Jin, Hal Finkel. 1-2 [doi]
- Give MPI Threading a Fair Chance: A Study of Multithreaded MPI DesignsThananon Patinyasakdikul, David Eberius, George Bosilca, Nathan T. Hjelm. 1-11 [doi]
- Standardized Environment for Monitoring Heterogeneous ArchitecturesConnor Brown, Benjamin Schwaller, Nathan Gauntt, Benjamin Allan, Kevin Davis. 1-5 [doi]
- Workflows for Performance Predictable and Reproducible HPC ApplicationsKeira Haskins, Quincy Wofford, Patrick G. Bridges. 1-2 [doi]
- NORNS: Extending Slurm to Support Data-Driven Workflows through Asynchronous Data StagingAlberto Miranda, Adrian Jackson, Tommaso Tocci, Iakovos Panourgias, Ramon Nou. 1-12 [doi]
- RE-Store: Reliable and Efficient KV-Store with Erasure Coding and ReplicationYuzhe Li, Jiang Zhou, Weiping Wang, Yong Chen. 1-12 [doi]
- DiffTrace: Efficient Whole-Program Trace Analysis and Diffing for DebuggingSaeed Taheri, Ian Briggs, Martin Burtscher, Ganesh Gopalakrishnan. 1-12 [doi]
- ClusterCockpit - A web application for job-specific performance monitoringJan Eitzinger, Thomas Gruber, Ayesha Afzal, Thomas Zeiser, Gerhard Wellein. 1-7 [doi]
- MPI Sessions: Evaluation of an Implementation in Open MPINathan T. Hjelm, Howard Pritchard, Samuel K. Gutiérrez, Daniel J. Holmes, Ralph Castain, Anthony Skjellum. 1-11 [doi]
- Kube-Knots: Resource Harvesting through Dynamic Container Orchestration in GPU-based DatacentersPrashanth Thinakaran, Jashwant Raj Gunasekaran, Bikash Sharma, Mahmut Taylan Kandemir, Chita R. Das. 1-13 [doi]
- Propagation and Decay of Injected One-Off Delays on Clusters: A Case StudyAyesha Afzal, Georg Hager, Gerhard Wellein. 1-10 [doi]
- Improving Resource Utilization in Data Centers using an LSTM-based Prediction ModelKundjanasith Thonglek, Kohei Ichikawa, Keichi Takahashi, Hajimu Iida, Chawanat Nakasan. 1-8 [doi]
- Efficient Distributed Graph Analytics using Triply Compressed Sparse FormatMohammad Hasanzadeh-Mofrad, Rami G. Melhem, Yousuf Ahmad, Mohammad Hammoud. 1-11 [doi]
- X-RDMA: Effective RDMA Middleware in Large-scale Production EnvironmentsTeng Ma, Tao Ma, Zhuo Song, Jingxuan Li, Huaixin Chang, Kang Chen, Hai Jiang, Yongwei Wu. 1-12 [doi]
- Learning from Five-year Resource-Utilization Data of Titan SystemFeiyi Wang, Sarp Oral, Satyabrata Sen, Neena Imam. 1-6 [doi]
- STASH : Fast Hierarchical Aggregation Queries for Effective Visual Spatiotemporal ExplorationsSaptashwa Mitra, Paahuni Khandelwal, Shrideep Pallickara, Sangmi Lee Pallickara. 1-11 [doi]
- A Quantitative Study of Deep Learning Training on Heterogeneous SupercomputersJingoo Han, Luna Xu, M. Mustafa Rafique, Ali Raza Butt, Seung-Hwan Lim. 1-12 [doi]
- Diagnostic Analysis: Directional Relation GraphSandy Kaur, Eun-Kyung Lee. 1-5 [doi]
- Design Exploration of Multi-FPGAs for Accelerating Deep LearningTeng Wang, Lei Gong, Chao Wang, Xuehai Zhou, Huaping Chen. 1-2 [doi]
- Quantifying the Impact of Memory Errors in Deep LearningZhao Zhang 0007, Lei Huang, Ruizhu Huang, Weijia Xu, Daniel S. Katz. 1-12 [doi]
- Large-Scale Analysis of the Docker Hub DatasetNannan Zhao, Vasily Tarasov, Hadeel Albahar, Ali Anwar, Lukas Rupprecht, Dimitrios Skourtis, Amit S. Warke, Mohamed Mohamed 0001, Ali Raza Butt. 1-10 [doi]
- Leveraging Machine Learning for Anticipatory Data Delivery in Extreme Scale In-situ WorkflowsPradeep Subedi, Philip E. Davis, Manish Parashar. 1-11 [doi]
- Mitigating Inter-Job Interference via Process-Level Quality-of-ServiceLee Savoie, David K. Lowenthal, Bronis R. de Supinski, Kathryn Mohror, Nikhil Jain. 1-5 [doi]
- Harmony: An Approach for Geo-distributed Processing of Big-Data ApplicationsHan Zhang, Lavanya Ramapantulu, Yong Meng Teo. 1-11 [doi]
- Building Reliable High-Performance Storage Systems: An Empirical and Analytical StudyZhi Qiao, Song Fu, Hsing-bung Chen, Bradley W. Settlemyer. 1-10 [doi]
- Fast and Scalable Implementations of Influence Maximization AlgorithmsMarco Minutoli, Mahantesh Halappanavar, Ananth Kalyanaraman, Arun V. Sathanur, Ryan Mcclure, Jason McDermott. 1-12 [doi]
- On the Benefits of Anticipating Load Imbalance for Performance Optimization of Parallel ApplicationsAnthony Boulmier, Franck Raynaud, Nabil Abdennadher, Bastien Chopard. 1-9 [doi]
- FSMonitor: Scalable File System Monitoring for Arbitrary Storage SystemsArnab Kumar Paul, Ryan Chard, Kyle Chard, Steven Tuecke, Ali Raza Butt, Ian T. Foster. 1-11 [doi]
- Fast and Faithful Performance Prediction of MPI Applications: the HPL Case StudyTom Cornebize, Arnaud Legrand, Franz C. Heinrich. 1-11 [doi]
- Compact Filters for Fast Online Data PartitioningQing Zheng, Charles D. Cranor, Ankush Jain, Gregory R. Ganger, Garth A. Gibson, George Amvrosiadis, Bradley W. Settlemyer, Gary Grider. 1-12 [doi]
- Algorithm-Based Fault Tolerance for Parallel Stencil ComputationsAurélien Cavelan, Florina M. Ciorba. 1-11 [doi]
- Rapidly Measuring Loop FootprintsOzgur O. Kilic, Nathan R. Tallent, Ryan D. Friese. 1-9 [doi]
- Improving Access to HDFS using NVMeoFDaegyu Han, Beomseok Nam. 1-2 [doi]
- Automatic Power Saving Method by Energy Aware Job SchedulerHiroaki Imade, Takahiro Kagami, Tomohiro Otawa, Kouichi Hirai, Yoshio Sakaguchi, Naoyuki Fujita. 1-2 [doi]
- Training Google Neural Machine Translation on an Intel CPU ClusterDhiraj D. Kalamkar, Kunal Banerjee, Sudarshan Srinivasan, Srinivas Sridharan 0002, Evangelos Georganas, Mikhail E. Smorkalov, Cong Xu, Alexander Heinecke. 1-10 [doi]
- Improving Performance of Data Dumping with Lossy Compression for Scientific SimulationXin Liang, Sheng Di, Dingwen Tao, Sihuan Li, Bogdan Nicolae, Zizhong Chen, Franck Cappello. 1-11 [doi]