Abstract is missing.
- Role of HPC in next-generation AIAnimashree Anandkumar. [doi]
- Breaking the Scalability WallFabrizio Petrini. [doi]
- Computing and Data Challenges in Climate ChangeKatherine A. Yelick. [doi]
- SimGQ: Simultaneously Evaluating Iterative Graph QueriesChengshuo Xu, Abbas Mazloumi, Xiaolin Jiang, Rajiv Gupta 0001. 1-10 [doi]
- WarpCore: A Library for fast Hash Tables on GPUsDaniel Jünger, Robin Kobus, André Müller, Christian Hundt 0002, Kai Xu, Weiguo Liu, Bertil Schmidt. 11-20 [doi]
- Towards High Performance, Portability, and Productivity: Lightweight Augmented Neural Networks for Performance PredictionAjitesh Srivastava, Naifeng Zhang, Rajgopal Kannan, Viktor K. Prasanna. 21-30 [doi]
- Performance Optimization and Scalability Analysis of the MGB Hydrological ModelHenrique R. A. Freitas, Celso L. Mendes, Aleksandar Ilic. 31-40 [doi]
- Exploring Task Parallelism for the Multilevel Fast Multipole AlgorithmMichael P. Lingg, Stephen M. Hughey, Doga Dikbayir, Balasubramaniam Shanker, Hasan Metin Aktulga. 41-50 [doi]
- SparsePipe: Parallel Deep Learning for 3D Point CloudsKeke Zhai, Pan He, Tania Banerjee, Anand Rangarajan 0001, Sanjay Ranka. 51-61 [doi]
- HyPR: Hybrid Page Ranking on Evolving GraphsHemant Kumar Giri, Mridul Haque, Dip Sankar Banerjee. 62-71 [doi]
- Distributing Sparse Matrix/Graph Applications in Heterogeneous Clusters - an Experimental StudyCharilaos Tzovas, Maria Predari, Henning Meyerhenke. 72-81 [doi]
- Processor Pipelining Method for Efficient Deep Neural Network Inference on Embedded DevicesAkshay Parashar, Arun Abraham, Deepak Chaudhary, Vikram Nelvoy Rajendiran. 82-90 [doi]
- Avoiding Communication in Logistic RegressionAditya Devarakonda, James Demmel. 91-100 [doi]
- A Parallel and Scalable Framework for Insider Threat DetectionAbdoulaye Diop, Nahid Emad, Thierry Winter. 101-110 [doi]
- Blink: Towards Efficient RDMA-based Communication Coroutines for Parallel Python ApplicationsAamir Shafi, Jahanzeb Maqbool Hashmi, Hari Subramoni, Dhabaleswar K. Panda. 111-120 [doi]
- Content-defined Merkle Trees for Efficient Container DeliveryYuta Nakamura, Raza Ahmad, Tanu Malik. 121-130 [doi]
- Model Checking as a Service using Dynamic Resource ScalingSurya Teja Palavalasa, Yuvraj Singh, Adhish Singla, Suresh Purini, Venkatesh Choppella. 131-140 [doi]
- Parallel Hierarchical Clustering using Rank-Two Nonnegative Matrix FactorizationLawton Manning, Grey Ballard, Ramakrishnan Kannan, Haesun Park. 141-150 [doi]
- Pipelined Preconditioned Conjugate Gradient Methods for Distributed Memory SystemsManasi Tiwari, Sathish Vadhiyar. 151-160 [doi]
- Fair Allocation of Asymmetric Operations in Storage SystemsThomas Keller, Peter Varman. 161-170 [doi]
- A GPU Algorithm for Earliest Arrival Time Problem in Public Transport NetworksChirayu Anant Haryan, G. Ramakrishna, Rupesh Nasre, Allam Dinesh Reddy. 171-180 [doi]
- 2D Static Resource Allocation for Compressed Linear Algebra and Communication ConstraintsOlivier Beaumont, Lionel Eyraud-Dubois, Mathieu Vérité. 181-191 [doi]
- Algorithms for Preemptive Co-scheduling of Kernels on GPUsLionel Eyraud-Dubois, Cristiana Bentes. 192-201 [doi]
- Understanding HPC Application I/O Behavior Using System Level StatisticsArnab Kumar Paul, Olaf Faaland, Adam Moody, Elsa Gonsiorowski, Kathryn Mohror, Ali Raza Butt. 202-211 [doi]
- AMCilk: A Framework for Multiprogrammed Parallel WorkloadsZhe Wang, Chen Xu, Kunal Agrawal, Jing Li. 212-222 [doi]
- Extending SLURM for Dynamic Resource-Aware Adaptive Batch SchedulingMohak Chadha, Jophin John, Michael Gerndt. 223-232 [doi]
- On the Marriage of Asynchronous Many Task Runtimes and Big Data: A GlanceJoshua Suetterlein, Joseph B. Manzano, Andres Marquez, Guang R. Gao. 233-242 [doi]
- Exposing data locality in HPC-based systems by using the HDFS backendJosé Rivadeneira, Félix García Carballeira, Jesús Carretero 0001, Javier García Blas. 243-250 [doi]
- PufferFish: NUMA-Aware Work-stealing Library using Elastic TasksVivek Kumar. 251-260 [doi]
- Design and Study of Elastic Recovery in HPC ApplicationsKai Keller, Konstantinos Parasyris, Leonardo Bautista-Gomez. 261-270 [doi]
- Accelerating Force-directed Graph Layout with Processing-in-Memory ArchitectureRuihao Li, Shuang Song 0007, Qinzhe Wu, Lizy K. John. 271-282 [doi]
- Nonblocking Persistent Software Transactional MemoryH. Alan Beadle, Wentao Cai 0002, Haosen Wen, Michael L. Scott. 283-293 [doi]
- GPU-FPtuner: Mixed-precision Auto-tuning for Floating-point Applications on GPURuidong Gu, Michela Becchi. 294-304 [doi]
- Batched Small Tensor-Matrix Multiplications on GPUsKeke Zhai, Tania Banerjee, Adeesha Wijayasiri, Sanjay Ranka. 305-314 [doi]
- Temporal Based Intelligent LRU Cache ConstructionPavan Nittur, Anuradha Kanukotla, Narendra Mutyala. 315-322 [doi]
- Boosting LSTM Performance Through Dynamic Precision SelectionFranyell Silfa, José María Arnau, Antonio González 0001. 323-333 [doi]