- Build Energy-Efficient GPU Computing Environment for Machine Learning Algorithms with Register File Packing Technique. Xin Wang, Wei Zhang 0265. 1-7
- Finding Your Niche: An Evolutionary Approach to HPC Topologies. Stephen J. Young, Joshua Suetterlein, Jesun Firoz, Joseph B. Manzano, Kevin J. Barker. 1-9
- Zero Trust Architecture Approach for Developing Mission Critical Embedded Systems. Michael Vai, David Whelihan, Eric Simpson, Donato Kava, Alice Lee, Huy Nguyen, Jeffrey J. Hughes, Gabriel Torres, Jeffery Lim, Ben Nahill, Roger Khazan, Fred B. Schneider. 1-5
- Mapping of Internet "Coastlines" via Large Scale Anonymized Network Source Correlations. Hayden Jananthan, Jeremy Kepner, Michael Jones 0001, William Arcand, David Bestor, William Bergeron, Chansup Byun, Timothy Davis, Vijay Gadepally, Daniel Grant, Michael Houle 0001, Matthew Hubbell, Anna Klein, Lauren Milechin, Guillermo Morales, Andrew Morris, Julie Mullen, Ritesh Patel, Alex Pentland, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Tyler Trigg, Gabriel Wachman, Charles Yee, Peter Michaleas. 1-9
- Parallel Algorithms for Computing Jaccard Weights on Graphs using Linear Algebra. Elaheh Hassani, Md Taufique Hussain, Ariful Azad. 1-7
- ProtoX: A First Look. Het Mankad, Sanil Rao, Phillip Colella, Brian van Straalen, Franz Franchetti. 1-6
- Machine Learning Across Network-Connected FPGAs. Dana Diaconu, Yanyue Xie, Mehmet Güngör, Suranga Handagala, Xue Lin 0001, Miriam Leeser. 1-7
- Multiarchitecture Hardware Acceleration of Hyperdimensional Computing. Ian Peitzsch, Mark Ciora, Alan D. George. 1-7
- Errant Beam Detection Using the AMD Versal ACAP and Vitis AI. Anthony M. Cabrera, Yigit A. Yucesan, Frank Y. Liu, Willem Blokland, Jeffrey S. Vetter. 1-6
- Hardware Root-of-Trust Support for Operational Technology Cybersecurity in Critical Infrastructures. Alan Ehret, Peter Moore, Milan Stojkov, Michel A. Kinsy. 1-7
- From Words to Watts: Benchmarking the Energy Costs of Large Language Model Inference. Siddharth Samsi, Dan Zhao 0007, Joseph McDonald, Baolin Li, Adam Michaleas, Michael Jones 0001, William Bergeron, Jeremy Kepner, Devesh Tiwari, Vijay Gadepally. 1-9
- Leveraging Mixed Precision in Exponential Time Integration Methods. Cody J. Balos, Steven Roberts, David J. Gardner. 1-8
- Parallel Clustering with Resolution Variation. Nikos Pitsianis, Dimitris Floros, Tiancheng Liu, Xiaobai Sun. 1-8
- High-Level Framework for Solving Systems of the PDEs on Distributed Systems. Yevhen Pankevych, Oleg Farenyuk. 1-5
- Deployment of Real-Time Network Traffic Analysis Using GraphBLAS Hypersparse Matrices and D4M Associative Arrays. Michael Jones 0001, Jeremy Kepner, Andrew Prout, Timothy Davis, William Arcand, David Bestor, William Bergeron, Chansup Byun, Vijay Gadepally, Micheal Houle, Matthew Hubbell, Hayden Jananthan, Anna Klein, Lauren Milechin, Guillermo Morales, Julie Mullen, Ritesh Patel, Sandeep Pisharody, Albert Reuther, Antonio Rosa, Siddharth Samsi, Charles Yee, Peter Michaleas. 1-8
- UNet Performance with Wafer Scale Engine (Optimization Case Study). Vyacheslav N. Romanov. 1-6
- A Massively Parallel BWP Algorithm for Solving Large-Scale Systems of Nonlinear Equations. Bruno Silva, Luiz Guerreiro Lopes. 1-6
- Modeling and Analyzing Wind Velocity at Entrance Doors to Avoid Accidents. Abu Asaduzzaman, Luke Mercer, Md. Raihan Uddin, Yoel Woldeyes. 1-5
- Photonic Accelerators for Image Segmentation in Autonomous Driving and Defect Detection. Lakshmi Nair, David P. Widemann, Brad Turcott, Nick Moore, Alexandra Wleklinski, Darius Bunandar, Ioannis Papavasileiou, Shihu Wang, Eric Logan. 1-9
- Exploiting Fusion Opportunities in Linear Algebraic Graph Query Engines. Yuttapichai Kerdcharoen, Upasana Sridhar, Tze Meng Low. 1-7
- Scalable and Portable Pipelines for Predicting 3D Protein Structures on Standalone and HPC Systems. Adam Michaleas, Darrell O. Ricke. 1-4
- Creating a Dataset for High-Performance Computing Code Translation using LLMs: A Bridge Between OpenMP Fortran and C++. Bin Lei, Caiwen Ding, Le Chen, Pei-Hung Lin, Chunhua Liao. 1-7
- RaftGP: Random Fast Graph Partitioning. Yu Gao, Meng Qin 0002, Yibin Ding, Li Zeng, Chaorui Zhang, Weixi Zhang, Wei Han 0004, Rongqian Zhao, Bo Bai 0001. 1-7
- G-MAP: A Graph Neural Network-Based Framework for Memory Access Prediction. Abhiram Rao Gorle, Pengmiao Zhang, Rajgopal Kannan, Viktor K. Prasanna. 1-7
- Accelerating Multi-Agent DDPG on CPU-FPGA Heterogeneous Platform. Samuel Wiggins, Yuan Meng 0001, Rajgopal Kannan, Viktor K. Prasanna. 1-7
- Optimization and Performance Analysis of Shor's Algorithm in Qiskit. Dewang Sun, Naifeng Zhang, Franz Franchetti. 1-7
- Decontentioned Stochastic Block Partition. Ahsen J. Uppal, Thomas B. Rolinger, H. Howie Huang. 1-6
- Parallel Quasi-Concave Set Function Optimization for Scalability Even Without Submodularity. Praneeth Vepakomma, Yulia Kempner, Rodmy Paredes Alfaro, Ramesh Raskar. 1-8
- Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures. Naifeng Zhang, Austin Ebel, Negar Neda, Patrick Brinich, Benedict Reynwar, Andrew G. Schmidt, Mike Franusich, Jeremy Johnson 0001, Brandon Reagen, Franz Franchetti. 1-7
- Parallel Longest Common SubSequence Analysis In Chapel. Soroush Vahidi, Baruch Schieber, Zhihui Du, David A. Bader. 1-6
- An Analysis of Energy Requirement for Computer Vision Algorithms. Daniel Edelman, Siddharth Samsi, Joseph McDonald, Adam Michaleas, Vijay Gadepally. 1-7
- PaCKD: Pattern-Clustered Knowledge Distillation for Compressing Memory Access Prediction Models. Neelesh Gupta, Pengmiao Zhang, Rajgopal Kannan, Viktor K. Prasanna. 1-7
- High-Level Frameworks: Effect on Transformer Inference Time and Power on Embedded GPU Devices. Marika E. Schubert, David Langerman, Alan D. George. 1-8
- Quantifying OpenMP: Statistical Insights into Usage and Adoption. Tal Kadosh, Niranjan Hasabnis, Timothy G. Mattson, Yuval Pinter, Gal Oren 0001. 1-7
- Focusing and Calibration of Large Scale Network Sensors Using GraphBLAS Anonymized Hypersparse Matrices. Jeremy Kepner, Michael Jones 0001, Phil Dykstra, Chansup Byun, Timothy Davis, Hayden Jananthan, William Arcand, David Bestor, William Bergeron, Vijay Gadepally, Micheal Houle, Matthew Hubbell, Anna Klein, Lauren Milechin, Guillermo Morales, Julie Mullen, Ritesh Patel, Alex Pentland, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Tyler Trigg, Charles Yee, Peter Michaleas. 1-9
- Benchmarking Deep Learning Classifiers for SAR Automatic Target Recognition. Jacob Fein-Ashley, Tian Ye 0002, Rajgopal Kannan, Viktor K. Prasanna, Carl E. Busart. 1-6
- SMOG: Accelerating Subgraph Matching on GPUs. Zhibin Wang, Ziheng Meng, Xue Li, Xi Lin, Long Zheng 0003, Chen Tian 0001, Sheng Zhong. 1-7
- ANEDA: Adaptable Node Embeddings for Shortest Path Distance Approximation. Frank Pacini, Allison Gunby-Mann, Sarel Cohen, Peter Chin 0001. 1-7
- IRIS-DMEM: Efficient Memory Management for Heterogeneous Computing. Narasinga Rao Miniskar, Mohammad Alaul Haque Monil, Pedro Valero-Lara, Frank Y. Liu, Jeffrey S. Vetter. 1-7
- 1. Sadasivan Shankar. 1-6
- Optimizing Compression Schemes for Parallel Sparse Tensor Algebra. Helen Xu 0001, Tao B. Schardl, Michael Pellauer, Joel S. Emer. 1-7
- Towards a Flexible Hardware Implementation for Mixed-Radix Fourier Transforms. Mario Vega, Xiaokun Yang, John Shalf, Doru-Thom Popovici. 1-7
- Performance Analysis of Graph Neural Network (GNN) for Manufacturing Feature Recognition Problem. Igor Betkier, Mateusz Oszczypala, Janusz Pobozniak, Sergiusz Sobieski. 1-6