- Regression Based WCET Analysis For Sampling Based Motion Planning. Hao Wen, Wei Zhang. 1-6
- Sparse Dual of the Density Peaks Algorithm for Cluster Analysis of High-dimensional Data. Dimitris Floros, Tiancheng Liu, Nikos Pitsianis, Xiaobai Sun. 1-14
- Evaluating an OpenCL FPGA Platform for HPC: a Case Study with the HACCmk Kernel. Zheming Jin, Hal Finkel. 1-6
- Fast Triangle Counting Using Cilk. Abdurrahman Yasar, Sivasankaran Rajamanickam, Michael M. Wolf, Jonathan W. Berry, Ümit V. Çatalyürek. 1-7
- A Multi-GPU PCISPH Implementation with Efficient Memory Transfers. Kevin Verma, Chong Peng, Kamil Szewc, Robert Wille. 1-7
- Exploiting GPU with 3D Stacked Memory to Boost Performance for Data-Intensive Applications. Hao Wen, Wei Zhang. 1-6
- Tangram: Colocating HPC Applications with Oversubscription. Qingqing Xiong, Emre Ates, Martin C. Herbordt, Ayse Kivilcim Coskun. 1-7
- Designing Algorithms for the EMU Migrating-threads-based Architecture. Mehmet E. Belviranli, Seyong Lee, Jeffrey S. Vetter. 1-7
- Performance of Graph Analytics Applications on Many-Core Processors. Jenna Wise, Emily Lederman, Manoj Kumar, Pratap Pattnaik. 1-7
- New Computing Frontiers Enabled via Photovoltaic Fiber Energy Generation. James Hanford, Andrew Weinert. 1-7
- High-Performance Triangle Counting on GPUs. Yang Hu, Hang Liu, H. Howie Huang. 1-5
- Performance Assessment of Hybrid Parallelism for Large-Scale Reservoir Simulation on Multi- and Many-core Architectures. Amani Alonazi, Marcin Rogowski, Ahmed Al-Zawawi, David E. Keyes. 1-7
- Characterizing I/O optimization opportunities for array-centric applications on HDFS. Donghe Kang, Vedang Patel, Kalyan Khandrika, Spyros Blanas, Yang Wang, Srinivasan Parthasarathy. 1-2
- GDP: GPU accelerated Detailed Placement. Shounak Dhar, David Z. Pan. 1-7
- WCET Analysis of GPU L1 Data Caches. Yijie Huangfu, Wei Zhang. 1-7
- An Ensemble Classifier Based on Feature Selection Using Ant Colony Optimization. Jianjun Cao, Guojun Lv, Yuling Shang, Nianfeng Weng, Chen Chang, Yi Liu. 1-7
- Preliminary Exploration of Large-Scale Triangle Counting on Shared-Memory Multicore System. Jiyuan Zhang, Daniele G. Spampinato, Scott McMillan, Franz Franchetti. 1-6
- SlimNets: An Exploration of Deep Model Compression and Acceleration. Ini Oguntola, Subby Olubeko, Christopher Sweeney. 1-6
- Unlocking Performance-Programmability by Penetrating the Intel FPGA OpenCL Toolflow. Ahmed Sanaullah, Martin C. Herbordt. 1-8
- Estimating Edge-Local Triangle Count Heavy Hitters in Edge-Linear Time and Almost-Vertex-Linear Space. Benjamin W. Priest, Roger Pearce, Geoffrey Sanders. 1-7
- Scaling Betweenness Centrality in Dynamic Graphs. Alok Tripathy, Oded Green. 1-7
- Implementing the Jaccard Index on the Migratory Memory-Side Processing Emu Architecture. Geraud P. Krawezik, Peter M. Kogge, Timothy J. Dysart, Shannon K. Kuntz, Janice O. McMahon. 1-6
- Discovering k-Trusses in Large-Scale Networks. Alessio Conte, Daniele De Sensi, Roberto Grossi, Andrea Marino, Luca Versari. 1-6
- Investigation of Spectral Clustering for Signed Graph Matrix Representations. Alyson Fox, Geoffrey Sanders, Andrew Knyazev. 1-7
- Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices. Tze Meng Low, Daniele G. Spampinato, Anurag Kutuluru, Upasana Sridhar, Doru-Thom Popovici, Franz Franchetti, Scott McMillan. 1-7
- The Robustness of Modern Deep Learning Architectures against Single Event Upset Errors. Austin P. Arechiga, Alan J. Michaels. 1-6
- GraphChallenge.org: Raising the Bar on Graph Analytic Performance. Siddharth Samsi, Vijay Gadepally, Michael B. Hurley, Michael Jones, Edward K. Kao, Sanjeev Mohindra, Paul Monticciolo, Albert Reuther, Steven Smith, William Song, Diane Staheli, Jeremy Kepner. 1-7
- Update on Static Graph Challenge on GPU. Mauro Bisson, Massimiliano Fatica. 1-8
- TabulaROSA: Tabular Operating System Architecture for Massively Parallel Heterogeneous Compute Engines. Jeremy Kepner, Ron Brightwell, Alan Edelman, Vijay Gadepally, Hayden Jananthan, Michael Jones, Sam Madden, Peter Michaleas, Hamed Okhravi, Kevin T. Pedretti, Albert Reuther, Thomas L. Sterling, Mike Stonebraker. 1-8
- Improving Performance and Scalability of Algebraic Multigrid through a Specialized MATVEC. Majid Rasouli, Vidhi Zala, Robert M. Kirby, Hari Sundar. 1-7
- Scalable RMA-based Communication Library Featuring Node-local NVMs. Ryo Matsumiya, Toshio Endo. 1-7
- Chapel HyperGraph Library (CHGL). Louis Jenkins, Tanveer Hossain Bhuiyan, Sarah Harun, Christopher Lightsey, David Mentgen, Sinan G. Aksoy, Timothy Stavenger, Marcin Zalewski, Hugh R. Medal, Cliff Joslyn. 1-6
- Interactive Launch of 16,000 Microsoft Windows Instances on a Supercomputer. Michael Jones, Jeremy Kepner, Bradley Orchard, Albert Reuther, William Arcand, David Bestor, Bill Bergeron, Chansup Byun, Vijay Gadepally, Michael Houle, Matthew Hubbell, Anna Klein, Lauren Milechin, Julia S. Mullen, Andrew Prout, Antonio Rosa, Siddharth Samsi, Charles Yee, Peter Michaleas. 1-6
- Energy-Efficient DNN Computing on GPUs Through Register File Management. Xin Wang, Wei Zhang. 1-7
- A SoPC based Fixed Point System for Spaceborne SAR Real-Time Imaging Processing. Bingyi Li, Changjin Li, Yizhuang Xie, Liang Chen, Hao Shi, Yi Deng. 1-6
- K-truss decomposition for Scale-Free Graphs at Scale in Distributed Memory. Roger Pearce, Geoffrey Sanders. 1-6
- Stripmap SAR Pulse Interleaved Scheduling. John Terragnoli, Miriam Leeser, Paul Monticciolo. 1-7
- Packed Compressed Sparse Row: A Dynamic Graph Representation. Brian Wheatman, Helen Xu. 1-7
- Efficient and Flexible 2-D Data Controller for SAR Imaging System. Tianyun Sun, Yizhuang Xie, Bingyi Li, He Chen, Xiaoning Liu, Liang Chen. 1-6
- Hornet: An Efficient Data Structure for Dynamic Sparse Graphs and Matrices on GPUs. Federico Busato, Oded Green, Nicola Bombieri, David A. Bader. 1-7
- An Access-Pattern-Aware On-Chip Vector Memory System with Automatic Loading for SIMD Architectures. Tong Geng, Erkan Diken, Tianqi Wang, Lech Józwiak, Martin C. Herbordt. 1-7
- Fast and Adaptive List Intersections on the GPU. James Fox, Oded Green, Kasimir Gabert, Xiaojing An, David A. Bader. 1-7
- Design and Implementation of a Dynamic Information Flow Tracking Architecture to Secure a RISC-V Core for IoT Applications. Christian Palmiero, Giuseppe Di Guglielmo, Luciano Lavagno, Luca P. Carloni. 1-7
- Benchmarking Heterogeneous HPC Systems Including Reconfigurable Fabrics: Community Aspirations for Ideal Comparisons. Peter Jamieson, Ahmed Sanaullah, Martin C. Herbordt. 1-6
- Collaborative (CPU + GPU) Algorithms for Triangle Counting and Truss Decomposition. Vikram S. Mailthody, Ketan Date, Zaid Qureshi, Carl Pearson, Rakesh Nagi, Jinjun Xiong, Wen-mei Hwu. 1-7
- Performance portability of a fluidized bed solver. V. M. Krushnarao Kotteda, Vinod Kumar, William F. Spotz, Daniel Sunderland. 1-7
- Accelerating Dijkstra's Algorithm Using Multiresolution Priority Queues. Jordi Ros-Giralt, Alan Commike, Peter Cullen, Richard Lethin. 1-7
- Graph algorithms via SuiteSparse: GraphBLAS: triangle counting and K-truss. Timothy A. Davis. 1-6
- Towards Triangle Counting on GPU using Stable Radix binning. Nishith Tirpankar, Hari Sundar. 1-6
- Parallel Counting of Triangles in Large Graphs: Pruning and Hierarchical Clustering Algorithms. Chun-Yen Kuo, Ching Nam Hang, Pei-Duo Yu, Chee-Wei Tan. 1-6
- Sparse Deep Neural Network Exact Solutions. Jeremy Kepner, Vijay Gadepally, Hayden Jananthan, Lauren Milechin, Sid Samsi. 1-8
- Triangle Counting with A Multi-Core Computer. Evan Donato, Ming Ouyang, Cristian Peguero-Isalguez. 1-7
- PageRank Acceleration for Large Graphs with Scalable Hardware and Two-Step SpMV. Fazle Sadi, Joe Sweeney, Scott McMillan, Tze Meng Low, James C. Hoe, Larry T. Pileggi, Franz Franchetti. 1-7
- Database Operations in D4M.jl. Lauren Milechin, Vijay Gadepally, Jeremy Kepner. 1-5
- Soft-Core, Multiple-Lane, FPGA-based ADCs for a Liquid Helium Environment. Zikun Xiang, Tianqi Wang, Tong Geng, Tian Xiang, Xi Jin, Martin C. Herbordt. 1-6
- Damping Effect on PageRank Distribution. Tiancheng Liu, Yuchen Qian, Xi Chen, Xiaobai Sun. 1-11
- Scalable Distributed Memory Community Detection Using Vite. Sayan Ghosh, Mahantesh Halappanavar, Antonino Tumeo, Ananth Kalyanaraman, Assefaw H. Gebremedhin. 1-7
- Performance Effects of Dynamic Graph Data Structures in Community Detection Algorithms. Rohit Varkey Thankachan, Brian Paul Swenson, James P. Fairbanks. 1-7
- AC922 Data Movement for CORAL. Steve Roberts, Pradeep Ramanna, John Walthour. 1-5
- A Novel 1D-Convolution Accelerator for Low-Power Real-time CNN processing on the Edge. Justin Sanchez, Nasim Soltani, Ramachandra Vikas Chamarthi, Adarsh Sawant, Hamed Tabkhi. 1-8
- All-at-once Decomposition of Coupled Billion-scale Tensors in Apache Spark. Aditya Gudibanda, Tom Henretty, Muthu Manikandan Baskaran, James R. Ezick, Richard Lethin. 1-8
- Simulation Approach to Sensor Placement Using Unity 3D. Kimberlee Chestnut Chang, Nicole Lane, Andrew Uhmeyer, Michael Jones, Matthew Hubbell, Albert Reuther, Robert Seater. 1-6
- Fault Tolerance Performance Evaluation of Large-Scale Distributed Storage Systems HDFS and Ceph Case Study. Yehia Arafa, Atanu Barai, Mai Zheng, Abdel-Hameed A. Badawy. 1-7
- Utilizing GPU Parallelism to Improve Fast Spherical Harmonic Transforms. Max Carlson, Hari Sundar. 1-6
- Optimizing GPU Kernels for Irregular Batch Workloads: A Case Study for Cholesky Factorization. Ahmad Abdelfattah, Azzam Haidar, Stanimire Tomov, Jack J. Dongarra. 1-7
- Application Aware Tuning of Reconfigurable Multi-Layer Perceptron Architectures. Ahmed Sanaullah, Chen Yang, Yuri Alexeev, Kazutomo Yoshii, Martin C. Herbordt. 1-9
- Dynamic Deployment of Communication Applications to Different Hardware Platforms using Ontological Representations. Yanji Chen, Mehmet Güngör, Shweta Singh, Alex Tazin, Mieczyslaw M. Kokar, Miriam Leeser. 1-6
- A Parallel Implementation of FANO using OpenMP and MPI. Plamen Krastev, Albert Reuther, Chansup Byun, Michael Chrisp. 1-5
- GoblinCore-64: A RISC-V Based Architecture for Data Intensive Computing. John D. Leidel, Xi Wang, Yong Chen. 1-8
- ®. Mark Barnell, Courtney Raymond, Christopher Capraro, Darrek Isereau, Chris Cicotta, Nathan Stokes. 1-4
- Measuring the Impact of Spectre and Meltdown. Andrew Prout, William Arcand, David Bestor, Bill Bergeron, Chansup Byun, Vijay Gadepally, Michael Houle, Matthew Hubbell, Michael Jones, Anna Klein, Peter Michaleas, Lauren Milechin, Julie Mullen, Antonio Rosa, Siddharth Samsi, Charles Yee, Albert Reuther, Jeremy Kepner. 1-5
- Hyperscaling Internet Graph Analysis with D4M on the MIT SuperCloud. Vijay Gadepally, Jeremy Kepner, Lauren Milechin, William Arcand, David Bestor, Bill Bergeron, Chansup Byun, Matthew Hubbell, Michael Houle, Michael Jones, Peter Michaleas, Julie Mullen, Andrew Prout, Antonio Rosa, Charles Yee, Siddharth Samsi, Albert Reuther. 1-6
- Computationally Efficient CP Tensor Decomposition Update Framework for Emerging Component Discovery in Streaming Data. Pierre-David Letourneau, Muthu Manikandan Baskaran, Tom Henretty, James R. Ezick, Richard Lethin. 1-8
- Accelerated Aperture Synthesis from Free-flying Collectors. Zachary K. Baker, Vinay Ramakrishnaiah, Josh Payne, Jon Woodring, Nicholas Dallmann, William Junor. 1-6
- High Performance Computing Techniques with Power Systems Simulations. Matthew Overlin, Christopher Smith. 1-8
- Towards Energy-Proportional Anomaly Detection in the Smart Grid. Spencer Drakontaidis, Michael Stanchi, Gabriel Glazer, Jason Hussey, Aaron St. Leger, Suzanne J. Matthews. 1-7
- Exploring Parallel Bitonic Sort on a Migratory Thread Architecture. Kaushik Velusamy, Thomas B. Rolinger, Janice McMahon, Tyler A. Simon. 1-7
- Server-class devices for Space Time Adaptive Processing. Jonas Larsson. 1-7
- A Distributed Framework for Low-Latency OpenVX over the RDMA NoC of a Clustered Manycore. Julien Hascoet, Benoît Dupont de Dinechin, Karol Desnos, Jean-François Nezan. 1-7
- Interactive Supercomputing on 40,000 Cores for Machine Learning and Data Analysis. Albert Reuther, Jeremy Kepner, Chansup Byun, Siddharth Samsi, William Arcand, David Bestor, Bill Bergeron, Vijay Gadepally, Michael Houle, Matthew Hubbell, Michael Jones, Anna Klein, Lauren Milechin, Julia S. Mullen, Andrew Prout, Antonio Rosa, Charles Yee, Peter Michaleas. 1-6
- Fast and accurate object detection in high resolution 4K and 8K video using GPUs. Vít Ruzicka, Franz Franchetti. 1-7
- Triangle Counting and Truss Decomposition using FPGA. Sitao Huang, Mohamed El-Hadedy, Cong Hao, Qin Li, Vikram S. Mailthody, Ketan Date, Jinjun Xiong, Deming Chen, Rakesh Nagi, Wen-mei Hwu. 1-7
- Fast Stochastic Block Partition for Streaming Graphs. Ahsen J. Uppal, H. Howie Huang. 1-6
- A Fast and Efficient Parallel Algorithm for Pruned Landmark Labeling. Qing Dong, Kartik Lakhotia, Hanqing Zeng, Rajgopal Kannan, Viktor K. Prasanna, Guna Seetharaman. 1-7
- Large-Scale Bayesian Kinship Analysis. Siddharth Samsi, Bea Yu, Darrell O. Ricke, Philip Fremont-Smith, Jeremy Kepner, Albert Reuther. 1-4
- Logarithmic Radix Binning and Vectorized Triangle Counting. Oded Green, James Fox, Alex Watkins, Alok Tripathy, Kasimir Gabert, Euna Kim, Xiaojing An, Kumar Aatish, David A. Bader. 1-7
- Too many secants: a hierarchical approach to secant-based dimensionality reduction on large data sets. Henry Kvinge, Elin Farnell, Michael Kirby, Chris Peterson. 1-7
- Functionality and Security Co-design Environment for Embedded Systems. Jacob Leemaster, Michael Vai, David Whelihan, Haley Whitman, Roger Khazan. 1-5
- Chameleon: A Generalized Reconfigurable Open-Source Architecture for Deep Neural Network Training. Mihailo Isakov, Alan Ehret, Michel A. Kinsy. 1-7