Abstract is missing.
- Data Flow Execution Models - A Third OpinionVivek Sarkar. 1 [doi]
- HyDetect: A Hybrid CPU-GPU Algorithm for Community DetectionAnwesha Bhowmik, Sathish Vadhiyar. 2-11 [doi]
- Distributed Relational Algebra at ScaleThomas Gilray, Sidharth Kumar. 12-22 [doi]
- Optimizing Breadth-First Search at Scale Using Hardware-Accelerated Space ConsistencyKhaled Z. Ibrahim. 23-33 [doi]
- Shared-Memory Parallel Maximal Biclique EnumerationApurba Das, Srikanta Tirthapura. 34-43 [doi]
- A Deterministic Multi-layered Partitioning Tool for Wire-Length Reduction of Monolithic 3D-ICSoumendu Ghorui, Sabyasachee Banerjee, Subhashis Majumder. 44-51 [doi]
- Mapping Arbitrarily Sparse Two-Body Interactions on One-Dimensional Quantum CircuitsArif M. Khan, Mahantesh Halappanavar, Tobias Hagge, Karol Kowalski, Alex Pothen, Sriram Krishnamoorthy. 52-62 [doi]
- k-NN Sampling for Visualization of Dynamic Data Using LION-tSNEBheekya Dharamsotu, K. Swarupa Rani, Salman Abdul Moiz, C. Raghavendra Rao 0001. 63-72 [doi]
- Analysis in the Data Path of an Object-Centric Data Management SystemRichard Warren, Jérome Soumagne, Jingqing Mu, Houjun Tang, Suren Byna, Bin Dong 0002, Quincey Koziol. 73-82 [doi]
- Exploring Metadata Search Essentials for Scientific Data ManagementWei Zhang, Suren Byna, Chenxu Niu, Yong Chen 0001. 83-92 [doi]
- Designing a Profiling and Visualization Tool for Scalable and In-depth Analysis of High-Performance GPU ClustersPouya Kousha, Bharath Ramesh, Kaushik Kandadi Suresh, Ching-Hsiang Chu, Arpan Jain, Nick Sarkauskas, Hari Subramoni, Dhabaleswar K. Panda. 93-102 [doi]
- Tuning Object-Centric Data Management Systems for Large Scale Scientific ApplicationsHoujun Tang, Suren Byna, Stephen Bailey, Zarija Lukic, Jialin Liu 0002, Quincey Koziol, Bin Dong 0002. 103-112 [doi]
- Replaceability Based Web Service Selection ApproachLalit Purohit, Sandeep Kumar 0004. 113-122 [doi]
- Efficient Parallel Multi-bunch Beam-Beam Simulation in Particle CollidersIoannis Sakiotis, Kamesh Arumugam, Desh Ranjan, Balsa Terzic, Mohammad Zubair. 123-130 [doi]
- Bit-Wise and Multi-GPU Implementations of the DNA Recombination AlgorithmElnaz Tavakoli Yazdi, Ankur Limaye, Ali Akoglu, Tosiron Adegbija, Adam Buntzman. 131-140 [doi]
- Hierarchical Filter and Refinement System Over Large Polygonal Datasets on CPU-GPUYiming Liu, Jie Yang, Satish Puri. 141-151 [doi]
- Geostatistical Modeling and Prediction Using Mixed Precision Tile Cholesky FactorizationSameh Abdulah, Hatem Ltaief, Ying Sun 0002, Marc G. Genton, David E. Keyes. 152-162 [doi]
- Acceleration of Sparse Vector Autoregressive Modeling Using GPUsShreenivas Bharadwaj Venkataramanan, Rahul Garg, Yogish Sabharwal. 163-172 [doi]
- Fast and Accurate Learning of Knowledge Graph Embeddings at ScaleUdit Gupta, Sathish Vadhiyar. 173-182 [doi]
- Genome Sequencing for Disease Diagnosis: The Confluence of Biology and ComputingRamesh Hariharan. 183 [doi]
- On Linear Learning with Manycore ProcessorsEliza Wszola, Celestine Mendler-Dünner, Martin Jaggi, Markus Püschel. 184-194 [doi]
- SPEC2: SPECtral SParsE CNN Accelerator on FPGAsYue Niu, Hanqing Zeng, Ajitesh Srivastava, Kartik Lakhotia, Rajgopal Kannan, Yanzhi Wang, Viktor K. Prasanna. 195-204 [doi]
- Architecture-Centric Bottleneck Analysis for Deep Neural Network ApplicationsJihyun Ryoo, Mengran Fan, Xulong Tang, Huaipan Jiang, Meena Arunachalam, Sharada Naveen, Mahmut T. Kandemir. 205-214 [doi]
- Efficient Sparse Neural Networks Using Regularized Multi Block Sparsity Pattern on a GPUDharma Teja Vooturi, Kishore Kothapalli. 215-224 [doi]
- Memory and Interconnect Optimizations for Peta-Scale Deep Learning SystemsSwagath Venkataramani, Vijayalakshmi Srinivasan, Jungwook Choi, Philip Heidelberger, Leland Chang, Kailash Gopalakrishnan. 225-234 [doi]
- Accelerating Data Loading in Deep Neural Network TrainingChih-Chieh Yang, Guojing Cong. 235-245 [doi]
- Delivering the Future of High-Performance ComputingMark Papermaster. 246 [doi]
- IsoKV: An Isolation Scheme for Key-Value Stores by Exploiting Internal Parallelism in SSDHeerak Lim, Hwajung Kim, Kihyeon Myung, Heon Young Yeom, Yongseok Son. 247-256 [doi]
- SCOR-KV: SIMD-Aware Client-Centric and Optimistic RDMA-Based Key-Value Store for Emerging CPU ArchitecturesDipti Shankar, Xiaoyi Lu, Dhabaleswar K. Panda. 257-266 [doi]
- High-Performance Adaptive MPI Derived Datatype Communication for Modern Multi-GPU SystemsChing-Hsiang Chu, Jahanzeb Maqbool Hashmi, Kawthar Shafie Khorassani, Hari Subramoni, Dhabaleswar K. Panda. 267-276 [doi]
- Online Management of Hybrid DRAM-NVMM Memory for HPCReza Salkhordeh, André Brinkmann. 277-289 [doi]
- User-Level Scheduled Communications for MPIDerek Schafer, Sheikh Ghafoor, Daniel J. Holmes, Martin Ruefenacht, Anthony Skjellum. 290-300 [doi]
- Evaluating the Impact of Energy Efficient Networks on HPC WorkloadsGiorgis Georgakoudis, Nikhil Jain, Takatsugu Ono, Koji Inoue, Shinobu Miwa, Abhinav Bhatele. 301-310 [doi]
- The New World of Heterogeneous AI/ML High Performance Computing with Intel FPGAs MarkJosé Roberto Alvarez. 311 [doi]
- MLBS: Transparent Data Caching in Hierarchical Storage for Out-of-Core HPC ApplicationsTariq Alturkestani, Thierry Tonellot, Hatem Ltaief, Rached Abdelkhalak, Étienne Vincent, David E. Keyes. 312-322 [doi]
- Reducing False Node Failure Predictions in HPCAlvaro Frank, Dai Yang, André Brinkmann, Martin Schulz 0001, Tim Süß. 323-332 [doi]
- Ground-Truth Prediction to Accelerate Soft-Error Impact Analysis for Iterative MethodsBurcu Ozcelik Mutlu, Gokcen Kestor, Adrián Cristal, Osman S. Unsal, Sriram Krishnamoorthy. 333-344 [doi]
- Efficient Memory Pool Allocation Algorithm for CNN InferenceArun Abraham, Manas Sahni, Akshay Parashar. 345-352 [doi]
- A Linux Kernel Scheduler Extension for Multi-core SystemsAleix Roca Nonell, Samuel Rodríguez, Albert Segura, Kevin Marquet, Vicenç Beltran 0001. 353-362 [doi]
- uMMAP-IO: User-Level Memory-Mapped I/O for HPCSergio Rivas-Gomez, Alessandro Fanfarillo, Sébastien Valat, Christophe Laferriere, Philippe Couvee, Sai Narasimhamurthy, Stefano Markidis. 363-372 [doi]
- DeepSparse: A Task-Parallel Framework for SparseSolvers on Deep Memory ArchitecturesMd. Afibuzzaman, Fazlay Rabbi, M. Yusuf Özkaya, Hasan Metin Aktulga, Ümit V. Çatalyürek. 373-382 [doi]
- Worksharing Tasks: An Efficient Way to Exploit Irregular and Fine-Grained Loop ParallelismMarcos Maronas, Kevin Sala, Sergi Mateo, Eduard Ayguadé, Vicenç Beltran 0001. 383-394 [doi]
- Empirical Analysis of Hardware-Assisted GPU VirtualizationAnshuj Garg, Purushottam Kulkarni, Uday Kurkure, Hari Sivaraman, Lan Vu. 395-405 [doi]