Abstract is missing.
- Scheduler technologies in support of high performance data analysisAlbert Reuther, Chansup Byun, William Arcand, David Bestor, Bill Bergeron, Matthew Hubbell, Michael Jones, Peter Michaleas, Andrew Prout, Antonio Rosa, Jeremy Kepner. 1-6 [doi]
- Mathematical foundations of the GraphBLASJeremy Kepner, Peter Aaltonen, David A. Bader, Aydin Buluç, Franz Franchetti, John R. Gilbert, Dylan Hutchison, Manoj Kumar, Andrew Lumsdaine, Henning Meyerhenke, Scott McMillan, Carl Yang, John D. Owens, Marcin Zalewski, Timothy G. Mattson, José E. Moreira. 1-9 [doi]
- The Open Community Runtime: A runtime system for extreme scale computingTimothy G. Mattson, Romain Cledat, Vincent Cavé, Vivek Sarkar, Zoran Budimlic, Sanjay Chatterjee, Joshua B. Fryman, Ivan Ganev, Robin Knauerhase, Min Lee, Benoît Meister, Brian Nickerson, Nick Pepperling, Bala Seshasayee, Sagnak Tasirlar, Justin Teller, Nick Vrvilo. 1-7 [doi]
- Implementing Hilbert transform for Digital Signal Processing on epiphany many-core coprocessorKyle L. Labowski, Patrick W. Jungwirth, James A. Ross, David A. Richie. 1-6 [doi]
- Enhancing the performance and robustness of the FEAST eigensolverBrendan Gavin, Eric Polizzi. 1-6 [doi]
- Optimizing communication for a 2D-partitioned scalable BFSJeffrey Young, Julian Romera, Matthias Hauck, Holger Fröning. 1-7 [doi]
- A CUDA implementation of the pagerank pipeline benchmarkMauro Bisson, Everett H. Phillips, Massimiliano Fatica. 1-7 [doi]
- A quantum macro assemblerScott Pakin. 1-8 [doi]
- PERFECT case studies demonstrating order of magnitude reduction in power consumptionDavid K. Wittenberg, Edin Kadric, André DeHon, Jonathan Edwards, Jeffrey Smith, Silviu Chiricescu. 1-7 [doi]
- Analyzing heterogeneous computing architectures for ADAS and Mobile Imaging applicationsRafal Malewski, Markus Levy, Peter Torelli. 1-2 [doi]
- Unified and lightweight tasks and conduits: A high level parallel programming frameworkChao Liu, Miriam Leeser. 1-7 [doi]
- LLMapReduce: Multi-level map-reduce for high performance data analysisChansup Byun, Jeremy Kepner, William Arcand, David Bestor, Bill Bergeron, Vijay Gadepally, Matthew Hubbell, Peter Michaleas, Julie Mullen, Andrew Prout, Antonio Rosa, Charles Yee, Albert Reuther. 1-8 [doi]
- Associative array model of SQL, NoSQL, and NewSQL databasesJeremy Kepner, Vijay Gadepally, Dylan Hutchison, Hayden Jananthan, Timothy G. Mattson, Siddharth Samsi, Albert Reuther. 1-9 [doi]
- Novel graph processor architecture, prototype system, and resultsWilliam S. Song, Vitaliy Gleyzer, Alexei Lomakin, Jeremy Kepner. 1-7 [doi]
- High-throughput ingest of data provenance records into AccumuloThomas Moyer, Vijay Gadepally. 1-6 [doi]
- Adding scalability to Internet of Things gateways using parallel computation of edge device dataJanice Canedo, Anthony Skjellum. 1-5 [doi]
- Julia implementation of the Dynamic Distributed Dimensional Data ModelAlexander Chen, Alan Edelman, Jeremy Kepner, Vijay Gadepally, Dylan Hutchison. 1-7 [doi]
- Parameter setting for quantum annealersKristen L. Pudenz. 1-6 [doi]
- Node level power measurements on a petaflop systemDavid Brayford, Christoph Bernau, Carla Guillén, Carmen B. Navarrete. 1-6 [doi]
- CUDA implementation of an optimal online Gaussian-Signal-in-Gaussian-Noise detectorNir Nossenson, Ariel J. Jaffe. 1-7 [doi]
- Novo-G#: Large-scale reconfigurable computing with direct and programmable interconnectsAlan D. George, Martin C. Herbordt, Herman Lam, Abhijeet G. Lawande, Jiayi Sheng, Chen Yang. 1-7 [doi]
- Abstractions considered helpful: A tools architecture for quantum annealersMichael Booth, Edward Dahl, Mark Furtney, Steven P. Reinhardt. 1-2 [doi]
- Havens: Explicit reliable memory regions for HPC applicationsSaurabh Hukerikar, Christian Engelmann. 1-6 [doi]
- Integrating real-time and batch processing in a polystoreJohn Meehan, Stan Zdonik, Shaobo Tian, Yulong Tian, Nesime Tatbul, Adam Dziedzic, Aaron J. Elmore. 1-7 [doi]
- Benchmarking SciDB data import on HPC systemsSiddharth Samsi, Laura J. Brattain, William Arcand, David Bestor, Bill Bergeron, Chansup Byun, Vijay Gadepally, Matthew Hubbell, Michael Jones, Anna Klein, Peter Michaleas, Lauren Milechin, Julie Mullen, Andrew Prout, Antonio Rosa, Charles Yee, Jeremy Kepner, Albert Reuther. 1-5 [doi]
- The BigDAWG polystore system and architectureVijay Gadepally, Peinan Chen, Jennie Duggan, Aaron J. Elmore, Brandon Haynes, Jeremy Kepner, Samuel Madden, Tim Mattson, Michael Stonebraker. 1-6 [doi]
- Cross-institutional research cyberinfrastructure for data intensive scienceW. Christopher Lenhardt, Mike Conway, Erik Scott, Brian Blanton, Ashok Krishnamurthy, Mirsad Hadzikadic, Mladen A. Vouk, Alyson Wilson. 1-6 [doi]
- GPU-accelerated charge mappingAhmed Sanaullah, Saiful A. Mojumder, Kathleen M. Lewis, Martin C. Herbordt. 1-7 [doi]
- Generating massive complex networks with hyperbolic geometry faster in practiceMoritz von Looz, Mustafa Safa Özdayi, Sören Laue, Henning Meyerhenke. 1-6 [doi]
- I-vector speaker and language recognition system on AndroidChristian Vazquez-Machado, Pedro Colon-Hernandez, Pedro A. Torres-Carrasquillo. 1-6 [doi]
- Scalability of VM provisioning systemsMike Jones, Bill Arcand, Bill Bergeron, David Bestor, Chansup Byun, Lauren Milechin, Vijay Gadepally, Matthew Hubbell, Jeremy Kepner, Peter Michaleas, Julie Mullen, Andy Prout, Tony Rosa, Siddharth Samsi, Charles Yee, Albert Reuther. 1-5 [doi]
- Towards parallel implementation of associative inference for cogent confabulationZhe Li, Qinru Qiu, Mangesh Tamhankar. 1-6 [doi]
- BigDAWG polystore query optimization through semantic equivalencesZuohao She, Surabhi Ravishankar, Jennie Duggan. 1-6 [doi]
- Data transformation and migration in polystoresAdam Dziedzic, Aaron J. Elmore, Michael Stonebraker. 1-6 [doi]
- Cross-engine query execution in federated database systemsAnkush M. Gupta, Vijay Gadepally, Michael Stonebraker. 1-6 [doi]
- ToQ.jl: A high-level programming language for D-Wave machines based on JuliaDaniel O'Malley, Velimir V. Vesselinov. 1-7 [doi]
- Parallel motion estimation and GPU-based fast coding unit splitting mechanism for HEVCYih-Chuan Lin, Shang-Che Wu. 1-7 [doi]
- On-chip memory efficient data layout for 2D FFT on 3D memory integrated FPGAShreyas G. Singapura, Rajgopal Kannan, Viktor K. Prasanna. 1-7 [doi]
- High-performance algorithms and data structures to catch elephant flowsJordi Ros-Giralt, Alan Commike, Richard A. Lethin, Sourav Maji, Malathi Veeraraghavan. 1-7 [doi]
- Optimizing simulation speed of FPGA model-based synthesisJeffrey Caldwell, Bo Marr, David Bloom, Dan Thompson. 1-6 [doi]
- Rapid prototyping with symbolic computation: Fast development of quantum annealing solutionsMark Hodson, Duncan Fletcher, Dan Padilha, Tristan Cook. 1-5 [doi]
- Kokkos/Qthreads task-parallel approach to linear algebra based graph analyticsMichael M. Wolf, H. Carter Edwards, Stephen L. Olivier. 1-7 [doi]
- How naive is naive SpMV on the GPU?Markus Steinberger, Andreas Derlery, Rhaleb Zayer, Hans-Peter Seidel. 1-8 [doi]
- Advantages to modeling relational data using hypergraphs versus graphsMichael M. Wolf, Alicia M. Klinvex, Daniel M. Dunlavy. 1-7 [doi]
- LU, QR, and Cholesky factorizations: Programming model, performance analysis and optimization techniques for the Intel Knights Landing Xeon PhiAzzam Haidar, Stanimire Tomov, Konstantin Arturov, Murat Guney, Shane Story, Jack Dongarra. 1-7 [doi]
- The BigDawg monitoring frameworkPeinan Chen, Vijay Gadepally, Michael Stonebraker. 1-6 [doi]
- A scale-free structure for power-law graphsRichard Veras, Tze Meng Low, Franz Franchetti. 1-7 [doi]
- Silicon photonic memory interconnect for many-core architecturesKe Wen, Hang Guan, David M. Calhoun, David Donofrio, John Shalf. 1-7 [doi]
- Benchmarking the graphulo processing frameworkTimothy Weale, Vijay Gadepally, Dylan Hutchison, Jeremy Kepner. 1-5 [doi]
- Accelerated low-rank updates to tensor decompositionsMuthu Manikandan Baskaran, M. Harper Langston, Tahina Ramananandro, David Bruns-Smith, Tom Henretty, James R. Ezick, Richard Lethin. 1-7 [doi]
- Performance analysis and acceleration of explicit integration for large kinetic networks using batched GPU computationsAzzam Haidar, Benjamin Brock, Stanimire Tomov, Michael Guidry, Jay Jay Billings, Daniel Shyles, Jack Dongarra. 1-7 [doi]
- Systems design of cybersecurity in embedded systemsMichael Vai, David Whelihan, N. Evancich, K. J. Kwak, J. Li, M. Britton, J. Foley, M. Lynch, D. Schafer, J. DeMatteis. 1-6 [doi]
- Enhancing HPC security with a user-based firewallAndrew Prout, William Arcand, David Bestor, Bill Bergeron, Chansup Byun, Vijay Gadepally, Matthew Hubbell, Michael Houle, Michael Jones, Peter Michaleas, Lauren Milechin, Julie Mullen, Antonio Rosa, Siddharth Samsi, Albert Reuther, Jeremy Kepner. 1-4 [doi]
- Landmark routing for large graphs in fixed-memory environmentsNewton Campbell, Michael J. Laszlo, Sumitra Mukherjee. 1-7 [doi]
- Design space exploration of GPU Accelerated cluster systems for optimal data transfer using PCIe busJanki Bhimani, Miriam Leeser, Ningfang Mi. 1-7 [doi]
- A framework to integrate MFiX with Trilinos for high fidelity fluidized bed computationsV. M. Krushnarao Kotteda, Ashesh Chattopadhyay, Vinod Kumar, William F. Spotz. 1-6 [doi]
- 3D DRAM based application specific hardware accelerator for SpMVFazle Sadi, Larry T. Pileggi, Franz Franchetti. 1 [doi]
- On SDN-based extreme-scale networksHaitham Ghalwash, Chun-Hsi Huang. 1-7 [doi]
- Polyhedral compilation for energy efficiencyBenoît Pradelle, Muthu Manikandan Baskaran, Thomas Henretty, Benoît Meister, Athanasios Konstantinidis, Richard Lethin. 1-7 [doi]
- Distributed and configurable architecture for neuromorphic applications on heterogeneous clusterKhadeer Ahmed, Qinru Qiu, Mangesh Tamhankar. 1-7 [doi]
- Hypervisor performance analysis for real-time workloadsGeoffrey Phi C. Tran, Yu-An Chen, Dong-In Kang, John Paul Walters, Stephen P. Crago. 1-7 [doi]
- In-storage embedded accelerator for sparse pattern processingSang-Woo Jun, Huy T. Nguyen, Vijay Gadepally, Arvind. 1-7 [doi]
- Software systems for high-performance quantum computingTravis S. Humble, Keith A. Britt. 1-8 [doi]
- From NoSQL Accumulo to NewSQL Graphulo: Design and utility of graph algorithms inside a BigTable databaseDylan Hutchison, Jeremy Kepner, Vijay Gadepally, Bill Howe. 1-9 [doi]
- cuSTINGER: Supporting dynamic graph algorithms for GPUsOded Green, David A. Bader. 1-6 [doi]
- A hardware design for in-brain neural spike sortingYinan Liu, Jiayi Sheng, Martin C. Herbordt. 1-6 [doi]
- A sparse multi-dimensional Fast Fourier Transform with stability to noise in the context of image processing and change detectionPierre-David Letourneau, M. Harper Langston, Richard Lethin. 1-6 [doi]
- Efficient implementation of scatter-gather operations for large scale graph analyticsManoj Kumar, Mauricio J. Serrano, José E. Moreira, Pratap Pattnaik, William P. Horn, Joefon Jann, Gabriel Tanase. 1-7 [doi]
- GPU accelerated, robust method for voxelization of solid objectsCosmin Nita, Iulian Stroia, Lucian Mihai Itu, Constantin Suciu, Viorel Mihalef, Manasi Datar, Saikiran Rapaka, Puneet Sharma. 1-5 [doi]
- KNN in the Jaccard spaceMing Ouyang. 1-7 [doi]
- Computational and memory analysis of Tegra SoCsAndrew Milluzzi, Alan D. George, Herman Lam. 1-7 [doi]
- An approach to big data inspired by statistical mechanicsJohn A. Cortese. 1-6 [doi]
- Real-time, low-latency image processing with high throughput on a multi-core SoCBarath Ramesh, Alan D. George, Herman Lam. 1-7 [doi]