Abstract is missing.
- Automatic Parallelization to Asynchronous Task-Based Runtimes Through a Generic Runtime LayerCharles Jin, Muthu Baskaran, Benoît Meister, Jonathan Springer. 1-11 [doi]
- An FPGA Decision Tree Classifier to Supervise a Communication SoCAbdelrahman Elkanishy, Derrick T. Rivera, Abdel-Hameed A. Badawy, Paul M. Furth, Z. M. Saifullah, Christopher P. Michael. 1-6 [doi]
- Fast Triangle Counting on GPUChuangyi Gui, Long Zheng 0003, Pengcheng Yao, Xiaofei Liao, Hai Jin 0001. 1-7 [doi]
- Cyber Baselining: Statistical properties of cyber time series and the search for stabilityAlexia Schulz, Ethan Aubin, Pierre Trepagnier, Allan B. Wollaber. 1-7 [doi]
- Spaceland Embedding of Sparse Stochastic GraphsNikos Pitsianis, Alexandros-Stavros Iliopoulos, Dimitris Floros, Xiaobai Sun. 1-8 [doi]
- Securing HPC using Federated AuthenticationAndrew Prout, Anna Klein, Peter Michaleas, Lauren Milechin, Julie Mullen, Antonio Rosa, Siddharth Samsi, Charles Yee, Albert Reuther, Jeremy Kepner, William Arcand, David Bestor, Bill Bergeron, Chansup Byun, Vijay Gadepally, Michael Houle, Matthew Hubbell, Michael Jones 0001. 1-7 [doi]
- Large Scale Organization and Inference of an Imagery Dataset for Public SafetyJeffrey Liu, David Strohschein, Siddharth Samsi, Andrew J. Weinert. 1-6 [doi]
- Write Quick, Run Fast: Sparse Deep Neural Network in 20 Minutes of Development Time via SuiteSparse: GraphBLASTimothy A. Davis 0001, Mohsen Aznaveh, Scott P. Kolodziej. 1-6 [doi]
- Optimizing Xeon Phi for Interactive Data AnalysisChansup Byun, Anne Klein, Lauren Milechin, Peter Michaleas, Julie Mullen, Andrew Prout, Antonio Rosa, Siddharth Samsi, Charles Yee, Albert Reuther, Jeremy Kepner, William Arcand, David Bestor, William Bergeron, Matthew Hubbell, Vijay Gadepally, Michael Houle, Michael Jones 0001. 1-6 [doi]
- A GPU Implementation of the Sparse Deep Neural Network Graph ChallengeMauro Bisson, Massimiliano Fatica. 1-8 [doi]
- A Survey on Hardware Security Techniques Targeting Low-Power SoC DesignsAlan Ehret, Karen Gettings, Bruce R. Jordan, Michel A. Kinsy. 1-8 [doi]
- Deploying AI Frameworks on Secure HPC Systems with ContainersDavid Brayford, Sofia Vallecorsa, Atanas Atanasov, Fabio Baruffa, Walter Riviera. 1-6 [doi]
- Skip the Intersection: Quickly Counting Common Neighbors on Shared-Memory SystemsXiaojing An, Kasimir Gabert, James Fox, Oded Green, David A. Bader. 1-7 [doi]
- Large Scale Parallelization Using File-Based CommunicationsChansup Byun, Anna Klein, Peter Michaleas, Julie Mullen, Andrew Prout, Antonio Rosa, Siddharth Samsi, Charles Yee, Albert Reuther, Jeremy Kepner, William Arcand, David Bestor, Bill Bergeron, Vijay Gadepally, Michael Houle, Matthew Hubbell, Michael Jones 0001. 1-7 [doi]
- Accelerating DNN Inference with GraphBLAS and the GPUXiaoyun Wang, Zhongyi Lin, Carl Yang, John D. Owens. 1-6 [doi]
- Streaming 1.9 Billion Hypersparse Network Updates per Second with D4MJeremy Kepner, Michael Houle, Michael Jones 0001, Anne Klein, Peter Michaleas, Julie Mullen, Andrew Prout, Antonio Rosa, Charles Yee, Albert Reuther, Vijay Gadepally, Lauren Milechin, Siddharth Samsi, William Arcand, David Bestor, William Bergeron, Chansup Byun, Matthew Hubbell. 1-6 [doi]
- Fast Stochastic Block Partitioning via SamplingFrank Wanye, Vitaliy Gleyzer, Wu-chun Feng. 1-7 [doi]
- Low Power Computing and Simultaneous Electro-Optical/Radar Data Processing using IBM's NS16e 16-chip Neuromorphic HardwareMark Barnell, Courtney Raymond, Daniel Brown, Matthew Wilson, Éric Côté. 1-5 [doi]
- A Novel Design of Adaptive and Hierarchical Convolutional Neural Networks using Partial Reconfiguration on FPGAMohammad Farhadi, Mehdi Ghasemi, Yezhou Yang. 1-7 [doi]
- Graph Algorithms in PGAS: Chapel and UPC++Louis Jenkins, Jesun Sahariar Firoz, Marcin Zalewski, Cliff A. Joslyn, Mark Raugas. 1-6 [doi]
- Training Behavior of Sparse Neural Network TopologiesSimon Alford, Ryan A. Robinett, Lauren Milechin, Jeremy Kepner. 1-6 [doi]
- Multithreaded Layer-wise Training of Sparse Deep Neural Networks using Compressed Sparse ColumnMohammad Hasanzadeh-Mofrad, Rami G. Melhem, Yousuf Ahmad, Mohammad Hammoud. 1-6 [doi]
- Scaling and Quality of Modularity Optimization Methods for Graph ClusteringSayan Ghosh, Mahantesh Halappanavar, Antonino Tumeo, Ananth Kalyanaraman. 1-6 [doi]
- Heterogeneous Cache Hierarchy Management for Integrated CPU-GPU ArchitectureHao Wen, Wei Zhang 0002. 1-6 [doi]
- Combinatorial Multigrid: Advanced Preconditioners For Ill-Conditioned Linear SystemsM. Harper Langston, Mitchell Tong Harris, Pierre-David Letourneau, Richard Lethin, James R. Ezick. 1-7 [doi]
- Target-based Resource Allocation for Deep Learning Applications in a Multi-tenancy SystemWenjia Zheng, Yun Song, Zihao Guo, Yongchen Cui, Suwen Gu, Ying Mao, Long Cheng 0003. 1-7 [doi]
- Low Overhead Instruction Latency Characterization for NVIDIA GPGPUsYehia Arafa, Abdel-Hameed A. Badawy, Gopinath Chennupati, Nandakishore Santhi, Stephan J. Eidenbenz. 1-8 [doi]
- Scalable Solvers for Cone Complementarity Problems in Frictional Multibody DynamicsSaibal De, Eduardo Corona, Paramsothy Jayakumar, Shravan K. Veerapaneni. 1-7 [doi]
- Synthesis of Hardware Sandboxes for Trojan Mitigation in Systems on ChipChristophe Bobda, Taylor J. L. Whitaker, Joel Mandebi Mbongue, Sujan Kumar Saha. 1-6 [doi]
- FFTX for Micromechanical Stress-Strain AnalysisAnuva Kulkarni, Daniele G. Spampinato, Franz Franchetti. 1-2 [doi]
- Applying Neuromorphic Computing to Compressive SensingRonald Scrofano, Douglas P. Enright, George C. Valley. 1-2 [doi]
- Exploration of Fine-Grained Parallelism for Load Balancing Eager K-truss on GPU and CPUMark Blanco, Tze Meng Low, Kyungjoo Kim. 1-7 [doi]
- Exploring the Efficiency of OpenCL Pipe for Hiding Memory Latency on Cloud FPGAsArnab A. Purkayastha, Sai Raghavendran, Jhanani Thiagarajan, Hamed Tabkhi. 1-7 [doi]
- An Interactive LiDAR to Camera CalibrationYecheng Lyu, Lin Bai, Mahdi Elhousni, Xinming Huang 0001. 1-6 [doi]
- Breadth-First Search on Dynamic Graphs using Dynamic Parallelism on the GPUDominik Tödling, Martin Winter, Markus Steinberger. 1-7 [doi]
- A data-driven framework for uncertainty quantification of a fluidized bedV. M. Krushnarao Kotteda, Anitha Kommu, Vinod Kumar. 1-7 [doi]
- MeXT: A Flow for Multiprocessor ExplorationChristophe Bobda, Harold Ishebabi, Philipp Mahr, Joel Mandebi Mbongue, Sujan Kumar Saha. 1-7 [doi]
- DistTC: High Performance Distributed Triangle CountingLoc Hoang, Vishwesh Jatala, Xuhao Chen, Udit Agarwal, Roshan Dathathri, Gurbinder Gill, Keshav Pingali. 1-7 [doi]
- Progressive Optimization of Batched LU Factorization on GPUsAhmad Abdelfattah, Stanimire Tomov, Jack J. Dongarra. 1-6 [doi]
- TapirXLA: Embedding Fork-Join Parallelism into the XLA Compiler in TensorFlow Using TapirTao B. Schardl, Siddharth Samsi. 1-8 [doi]
- Improving Parallelism of Breadth First Search (BFS) Algorithm for Accelerated Performance on GPUsHao Wen, Wei Zhang. 1-7 [doi]
- IP Cores for Graph Kernels on FPGAsSanmukh R. Kuppannagari, Rachit Rajat, Rajgopal Kannan, Aravind Dasu, Viktor K. Prasanna. 1-7 [doi]
- Multistart Methods for Quantum Approximate optimizationRuslan Shaydulin, Ilya Safro, Jeffrey Larson. 1-8 [doi]
- ECG Feature Processing Performance Acceleration on SLURM Compute SystemsMichael Nolan, Mark Hernandez, Philip Fremont-Smith, Albert Swiston, Kajal Claypool. 1-4 [doi]
- Linear Algebra-Based Triangle Counting via Fine-Grained Tasking on Heterogeneous Environments : (Update on Static Graph Challenge)Abdurrahman Yasar, Sivasankaran Rajamanickam, Jonathan W. Berry, Michael M. Wolf, Jeffrey S. Young 0001, Ümit V. Çatalyürek. 1-4 [doi]
- Garbled Circuits in the Cloud using FPGA Enabled NodesKai Huang, Mehmet Güngör, Xin Fang, Stratis Ioannidis, Miriam Leeser. 1-6 [doi]
- One Quadrillion Triangles Queried on One Million ProcessorsRoger Pearce, Trevor Steil, Benjamin W. Priest, Geoffrey Sanders. 1-5 [doi]
- Update on k-truss Decomposition on GPUMohammad Almasri, Omer Anjum, Carl Pearson, Zaid Qureshi, Vikram S. Mailthody, Rakesh Nagi, Jinjun Xiong, Wen-mei W. Hwu. 1-7 [doi]
- Artificial Neural Network and Accelerator Co-design using Evolutionary AlgorithmsPhilip Colangelo, Oren Segal, Alexander Speicher, Martin Margala. 1-8 [doi]
- C to D-Wave: A High-level C Compilation Framework for Quantum AnnealersMohamed W. Hassan, Scott Pakin, Wu-chun Feng. 1-8 [doi]
- Improving Scheduling for Irregular Applications with Logarithmic Radix BinningJames Fox, Alok Tripathy, Oded Green. 1-7 [doi]
- Optimizing the Visualization Pipeline of a 3-D Monitoring and Management SystemRebecca Wild, Matthew Hubbell, Jeremy Kepner. 1-5 [doi]
- Distributed Direction-Optimizing Label Propagation for Community DetectionXu Liu 0001, Jesun Sahariar Firoz, Marcin Zalewski, Mahantesh Halappanavar, Kevin J. Barker, Andrew Lumsdaine, Assefaw H. Gebremedhin. 1-6 [doi]
- IdPrism: Rapid Analysis of Forensic DNA Samples Using MPS SNP ProfilesDarrell O. Ricke, James Watkins, Philip Fremont-Smith, Adam Michaleas. 1-5 [doi]
- Combining Tensor Decompositions and Graph Analytics to Provide Cyber Situational Awareness at HPC ScaleJames R. Ezick, Ben Parsons, William Glodek, Tom Henretty, Muthu Manikandan Baskaran, Richard Lethin, John Feo, Tai-Ching Tuan, Christopher Coley, Leslie Leonard, Rajeev Agrawal. 1-7 [doi]
- Hypersparse Neural Network Analysis of Large-Scale Internet TrafficJeremy Kepner, Kenjiro Cho, Kimberly C. Claffy, Vijay Gadepally, Peter Michaleas, Lauren Milechin. 1-11 [doi]
- Singularity for Machine Learning Applications - Analysis of Performance ImpactBruce R. Jordan, David J. Barrett, David Burke, Patrick Jardin, Amelia Littrell, Paul Monticciolo, Michael Newey, Jean Piou, Kara Warner. 1-6 [doi]
- Proactive Cyber Situation Awareness via High Performance ComputingAllan B. Wollaber, Jaime Peña, Benjamin Blease, Leslie Shing, Kenneth Alperin, Serge Vilvovsky, Pierre C. Trepagnier, Neal Wagner, Leslie Leonard. 1-7 [doi]
- Lossless Compression of Internal Files in Parallel Reservoir SimulationMarcin Rogowski, Suha N. Kayum, Florian Mannuss. 1-3 [doi]
- Emerging Applications of 3D Integration and Approximate Computing in High-Performance Computing Systems: Unique Security VulnerabilitiesPruthvy Yellu, Zhiming Zhang, Mohammad Mezanur Rahman Monjur, Ranuli Abeysinghe, Qiaoyan Yu. 1-7 [doi]
- Scalable Lazy-update Multigrid PreconditionersMajid Rasouli, Vidhi Zala, Robert M. Kirby, Hari Sundar. 1-7 [doi]
- Prototype Container-Based Platform for Extreme Quantum Computing Algorithm DevelopmentPatrick Dreher, Madhuvanti Ramasami. 1-7 [doi]
- Scalable Triangle Counting on Distributed-Memory SystemsSeher Acer, Abdurrahman Yasar, Sivasankaran Rajamanickam, Michael M. Wolf, Ümit V. Çatalyürek. 1-5 [doi]
- Introducing DyMonDS-as-a-Service (DyMaaS) for Internet of ThingsMarija Ilic, Rupamathi Jaddivada. 1-9 [doi]
- Fast and Scalable Distributed Tensor DecompositionsMuthu Manikandan Baskaran, Thomas Henretty, James R. Ezick. 1-7 [doi]
- Deep Learning-Based Nuclei Segmentation of Cleared Brain TissuePooya Khorrami, Kevin Brady, Mark Hernandez, Lars Gjesteby, Sara N. Burke, Damon G. Lamb, Matthew A. Melton, Kevin J. Otto, Laura J. Brattain. 1-2 [doi]
- Evaluation of the Imbalance Evolution in Parallel Reservoir SimulationMarcin Rogowski, Suha N. Kayum. 1-7 [doi]
- Hardware IP Classification through Weighted CharacteristicsBrendan McGeehan, Flora Smith, Thao Le, Hunter Nauman, Jia Di. 1-6 [doi]
- BLAST: Blockchain-based Trust Management in Smart Cities and Connected Vehicles SetupFarah Kandah, Brennan Huber, Amani Altarawneh, Sai Medury, Anthony Skjellum. 1-7 [doi]
- Optimal Resource Allocation for Parallel Reservoir SimulationSuha N. Kayum, Marcin Rogowski. 1-4 [doi]
- Towards Improving Rate-Distortion Performance of Transform-Based Lossy Compression for HPC DatasetsJialing Zhang, Aekyeung Moon, Xiaoyan Zhuo, Seung Woo Son 0001. 1-7 [doi]
- Many-target, Many-sensor Ship Tracking and ClassificationLeonard Kosta, John Irvine, Laura Seaman, Hongwei Xi. 1-7 [doi]
- Survey and Benchmarking of Machine Learning AcceleratorsAlbert Reuther, Peter Michaleas, Michael Jones 0001, Vijay Gadepally, Siddharth Samsi, Jeremy Kepner. 1-9 [doi]
- FPGA-Accelerated Spreading for Global PlacementShounak Dhar, Love Singhal, Mahesh A. Iyer, David Z. Pan. 1-7 [doi]
- Using Container Migration for HPC Workloads ResilienceMohamad Sindi, John R. Williams. 1-10 [doi]
- Increasing Accuracy of Iterative Refinement in Limited Floating-Point Arithmetic on Half-Precision AcceleratorsPiotr Luszczek, Ichitaro Yamazaki, Jack J. Dongarra. 1-6 [doi]
- On Computing with Diagonally Structured MatricesShahadat Hossain, Mohammad Sakib Mahmud. 1-6 [doi]
- Survey of Attacks and Defenses on Edge-Deployed Neural NetworksMihailo Isakov, Vijay Gadepally, Karen M. Gettings, Michel A. Kinsy. 1-8 [doi]
- Auxiliary Maximum Likelihood Estimation for Noisy Point Cloud RegistrationCole Campton, Xiaobai Sun. 1-7 [doi]
- A Parallel Simulation Approach to ACAS X DevelopmentAdam Gjersvik, Robert J. Moss. 1-6 [doi]
- H-INDEX: Hash-Indexing for Parallel Triangle Counting on GPUsSantosh Pandey, Xiaoye Sherry Li, Aydin Buluç, JieJun Xu, Hang Liu. 1-7 [doi]
- Performance of Training Sparse Deep Neural Networks on GPUsJianzong Wang, Zhangcheng Huang, Lingwei Kong, Jing Xiao, Pengyu Wang, Lu Zhang, Chao Li. 1-5 [doi]
- Accelerating Sparse Deep Neural Networks on FPGAsSitao Huang, Carl Pearson, Rakesh Nagi, Jinjun Xiong, Deming Chen, Wen-mei W. Hwu. 1-7 [doi]
- Fast Large-Scale Algorithm for Electromagnetic Wave Propagation in 3D MediaMitchell Tong Harris, M. Harper Langston, Pierre-David Létourneau, George Papanicolaou, James R. Ezick, Richard Lethin. 1-7 [doi]
- Scalable Inference for Sparse Deep Neural Networks using Kokkos KernelsJ. Austin Ellis, Sivasankaran Rajamanickam. 1-7 [doi]
- Update on Triangle Counting on GPUCarl Pearson, Mohammad Almasri, Omer Anjum, Vikram S. Mailthody, Zaid Qureshi, Rakesh Nagi, Jinjun Xiong, Wen-mei W. Hwu. 1-7 [doi]
- Embedded Processor-In-Memory Architecture for Accelerating Arithmetic OperationsRichard Muri, Paul Fortier. 1-7 [doi]
- Distributed Deep Learning for Precipitation NowcastingSiddharth Samsi, Christopher J. Mattioli, Mark S. Veillette. 1-7 [doi]
- COMET: A Distributed Metadata Service for Federated Cloud InfrastructuresCong Wang, Komal Thareja, Michael Stealey, Paul Ruth, Ilya Baldin. 1-7 [doi]
- Efficient implementation of sparse matrix-sparse vector multiplication for large scale graph analyticsMauricio J. Serrano. 1-7 [doi]
- Embedded GPU Cluster Computing Framework for Inference of Convolutional Neural NetworksEvan Kain, Diego Wildenstein, Andrew C. Pineda. 1-7 [doi]
- QxSQA: GPGPU-Accelerated Simulated Quantum Annealer within a Non-Linear Optimization and Boltzmann Sampling FrameworkDan Padilha, Serge Weinstock, Mark Hodson. 1-8 [doi]
- Overcoming Limitations of GPGPU-Computing in Scientific ApplicationsConnor Kenyon, Glenn Volkema, Gaurav Khanna 0001. 1-9 [doi]
- Multi-spectral Reuse Distance: Divining Spatial Information from Temporal DataAnthony M. Cabrera, Roger D. Chamberlain, Jonathan C. Beard. 1-8 [doi]
- Fast BFS-Based Triangle Counting on GPUsLeyuan Wang, John D. Owens. 1-6 [doi]
- An Efficient and Composable Parallel Task Programming LibraryChun-Xun Lin, Tsung-Wei Huang, Guannan Guo, Martin D. F. Wong. 1-7 [doi]
- Design and Implementation of Knowledge Base for Runtime Management of Software Defined HardwareHongkuan Zhou, Ajitesh Srivastava, Rajgopal Kannan, Viktor K. Prasanna. 1-7 [doi]
- Deep-Learning Inferencing with High-Performance Hardware AcceleratorsLuke Kljucaric, Alan D. George. 1-7 [doi]
- Concurrent Katz Centrality for Streaming GraphsChunxing Yin, E. Jason Riedy. 1-6 [doi]
- Application of Approximate Matrix Multiplication to Neural Networks and Distributed SLAMBrian Plancher, Camelia D. Brumar, Iulian Brumar, Lillian Pentecost, Saketh Rama, David Brooks 0001. 1-7 [doi]
- Sparse Deep Neural Network Graph ChallengeJeremy Kepner, Simon Alford, Vijay Gadepally, Michael Jones 0001, Lauren Milechin, Ryan A. Robinett, Sid Samsi. 1-7 [doi]
- Message Scheduling for Performant, Many-Core Belief PropagationMark Van der Merwe, Vinu Joseph, Ganesh Gopalakrishnan. 1-7 [doi]