Abstract is missing.
- Machine Learning Aboard the ADAPT Gamma-Ray TelescopeYe Htet, Marion Sudvarg, Andrew Butzel, Jeremy D. Buhler, Roger D. Chamberlain, James H. Buckley. 4-10 [doi]
- A Scalable Real-Time Data Assimilation Framework for Predicting Turbulent Atmosphere DynamicsJunqi Yin, Siming Liang, Siyan Liu, Feng Bao 0005, Hristo G. Chipilski, Dan Lu 0001, Guannan Zhang. 11-18 [doi]
- ChatBLAS: The First AI-Generated and Portable BLAS LibraryPedro Valero-Lara, William F. Godoy, Keita Teranishi, Prasanna Balaprakash, Jeffrey S. Vetter. 19-24 [doi]
- Echo State Networks: A Non-Intrusive Approach to Anomaly Detection in ManufacturingKendric Hood, Kelin Rumsey, Mark Cola, Bruce Madigan, Christina Scovel, Mike McAninch, Nathan DeBardeleben. 25-31 [doi]
- MelissaDL x Breed: Towards Data-Efficient On-line Supervised Training of Multi-parametric Surrogates with Active LearningSofya Dymchenko, Abhishek Purandare, Bruno Raffin. 32-40 [doi]
- Fourier neural operators for spatiotemporal dynamics in two-dimensional turbulenceMohammad Atif, Pulkit Dubey, Pratik P. Aghor, Vanessa López-Marrero, Tao Zhang, Abdullah Sharfuddin, Kwangmin Yu, Fan Yang, Foluso Ladeinde, Yangang Liu, Meifeng Lin, Lingda Li. 41-48 [doi]
- ChatVis: Automating Scientific Visualization with a Large Language ModelTanwi Mallick, Orcun Yildiz, David Lenz, Tom Peterka. 49-55 [doi]
- A Comparative Survey: Reusing Small Pre-Trained Models for Efficient Large Model TrainingDhroov Pandey, Jonah Ghebremichael, Zongqing Qi, Tong Shu. 56-63 [doi]
- Enhancing Electron Microscopy Image Classification Using Data AugmentationJordan A. Welsman, Gunther H. Weber, Oluwamayowa O. Amusat, Anna Giannakou, Lavanya Ramakrishnan. 64-71 [doi]
- SciTrust: Evaluating the Trustworthiness of Large Language Models for ScienceEmily Herron, Junqi Yin, Feiyi Wang. 72-78 [doi]
- AI Surrogate Model for Distributed Computing WorkloadsDavid K. Park, Yihui Ren 0001, Ozgur O. Kilic, Tatiana Korchuganova, Sairam Sri Vatsavai, Joseph Boudreau, Tasnuva Chowdhury, Shengyu Feng, Raees Khan, Jaehyung Kim, Scott Klasky, Tadashi Maeno, Paul Nilsson, Verena Ingrid Martinez Outschoorn, Norbert Podhorszki, Frédéric Suter, Wei Yang, Yiming Yang, Shinjae Yoo, Alexei Klimentov, Adolfy Hoisie. 79-86 [doi]
- AstroMLab 2: AstroLLaMA-2-70B Model and Benchmarking Specialised LLMs for AstronomyRui Pan, Tuan-Dung Nguyen, Hardik Arora, Alberto Accomazzi, Tirthankar Ghosal, Yuan-Sen Ting. 87-96 [doi]
- PULSE: Using Mixed-Quality Models for Reducing Serverless Keep-Alive CostKausalya Sankaranarayanan, Rohan Basu Roy, Devesh Tiwari. 99-109 [doi]
- Enabling HPC Scientific Workflows for ServerlessAnderson Andrei Da Silva, Rolando Pablo Hong Enriquez, Gourav Rattihalli, Vijay Thurimella, Rafael Ferreira da Silva, Dejan S. Milojicic. 110-125 [doi]
- Zero-consistency root emulation for unprivileged container image buildReid Priedhorsky, Michael Jennings, Megan Phinney. 126-136 [doi]
- The Fused Multiply-Add and Global Atmospheric Models: A Distributional Investigation into a Surprising Correctness ScenarioTeo Price-Broncucia, Allison H. Baker, Michael Duda. 140-147 [doi]
- Toward Automated Precision Tuning of Weather and Climate Models: A Case StudyJackson Vanover, Alper Altuntas, Cindy Rubio-González. 148-159 [doi]
- Towards Verifying Exact Conditions for Implementations of Density Functional ApproximationsSameerah Helal, Zhe Tao, Cindy Rubio-González, François Gygi, Aditya V. Thakur. 160-169 [doi]
- Impacts of floating-point non-associativity on reproducibility for HPC and deep learning applicationsSanjif Shanmugavelu, Mathieu Taillefumier, Christopher Culver, Oscar R. Hernandez, Mark Coletti, Ada Sedova. 170-179 [doi]
- Toward Automated Detection of Portability Bugs in Kokkos Parallel ProgramsVivek Kale, Hanru Yan, Shyamali Mukherjee, Jackson R. Mayo, Keita Teranishi, Richard Rutledge, Alessandro Orso. 180-188 [doi]
- Facilitating Bug Detection for OpenMP Offloading ApplicationsLechen Yu, Feiyang Jin, Joachim Jenke, Vivek Sarkar. 189-195 [doi]
- ompTest - Unit Testing with OMPTJan-Patrick Lehr, Michael Halkenhäuser, Dhruva R. Chakrabarti, Saiyedul Islam, Dan Palermo, Ron Lieberman. 196-203 [doi]
- Compiler-Aided Correctness Checking of CUDA-Aware MPI ApplicationsAlexander Hück, Tim Ziegler, Simon Schwitanski, Joachim Jenke, Christian H. Bischof. 204-213 [doi]
- Taskgrind: Heavyweight Dynamic Binary Instrumentation for Parallel Programs AnalysisRomain Pereira, George Stelle, Patrick Carribault. 214-221 [doi]
- Designing Quality MPI Correctness Benchmarks: Insights and MetricsTim Jammer, Simon Schwitanski, Emmanuelle Saillard, Alexander Hück, Joachim Jenke, Radjasouria Vinayagame, Christian H. Bischof. 222-226 [doi]
- Correctness Checking of MPI+OpenMP Applications Using Vector Clocks in MUSTCornelius Pätzold, Simon Schwitanski, Joachim Jenke, Felix Tomski, Matthias S. Müller. 227-231 [doi]
- OMPTBench - OpenMP Tool Interface Conformance TestingJan-Patrick Lehr, Michael Halkenhäuser, Dhruva R. Chakrabarti, Saiyedul Islam, Dan Palermo, Ron Lieberman. 232-236 [doi]
- FRSZ2 for In-Register Block Compression Inside GMRES on GPUsThomas Grützmacher, Robert Underwood, Sheng Di, Franck Cappello, Hartwig Anzt. 240-249 [doi]
- An Exploration of How Volume Rendering is Impacted by Lossy Data ReductionYanni Etchi, Daoce Wang, Pascal Grosset, Terece L. Turton, James P. Ahrens, David H. Rogers 0001. 250-259 [doi]
- SZOps: Scalar Operations for Error-bounded Lossy Compressor for Scientific DataTripti Agarwal, Sheng Di, Jiajun Huang, Yafan Huang, Ganesh Gopalakrishnan, Robert Underwood, Kai Zhao 0008, Xin Liang 0001, Guanpeng Li, Franck Cappello. 260-269 [doi]
- Enabling Data Reduction for Flash-X SimulationsRajeev Jain, Houjun Tang, Akash Dhruv, Suren Byna. 270-279 [doi]
- BCSR on GPU: A Way Forward Extreme-scale Graph Processing on Accelerator-enabled Frontier SupercomputerNaw Safrin Sattar, Hao Lu 0001, Feiyi Wang. 280-289 [doi]
- Filling the Void: Data-Driven Machine Learning-based Reconstruction of Sampled Spatiotemporal Scientific Simulation DataAyan Biswas, Aditi Mishra, Meghanto Majumder, Subhashis Hazarika, Alexander Most, Juan Castorena, Christopher Bryan, Patrick S. McCormick, James P. Ahrens, Earl Lawrence, Aric Hagberg. 290-299 [doi]
- Enhancing Lossy Compression Through Cross-Field Information for Scientific ApplicationsYouyuan Liu, Wenqi Jia 0003, Taolue Yang, Miao Yin, Sian Jin. 300-308 [doi]
- Shifting Between Compute and Memory Bounds: A Compression-Enabled Roofline ModelRamasoumya Naraparaju, Tianyu Zhao, Yanting Hu, Dongfang Zhao 0001, Luanzheng Guo, Nathan R. Tallent. 309-316 [doi]
- GPUFASTQLZ: An Ultra Fast Compression Methodology for Fastq Sequence Data on GPUsTaolue Yang, Youyuan Liu, Bo Jiang, Sian Jin. 317-325 [doi]
- Accelerating Viz Pipelines Using Near-Data Computing: An Early ExperienceQing Zheng, Brian Atkinson, Daoce Wang, Jason Lee 0004, John Patchett, Dominic Manno, Gary Grider. 326-335 [doi]
- Improving Input-Step PerformanceJoel C. Adams. 340-346 [doi]
- Introduction to Parallel and Distributed Programming using N-Body SimulationsAlexander Van Craen, Marcel Breyer, Dirk Pflüger. 347-354 [doi]
- Leveraging National Science Data Fabric Services to Train Data ScientistsMichela Taufer, Heberth F. Martinez, Aashish Panta, Paula Olaya, Jack Marquez, Amy Gooch, Giorgio Scorzelli, Valerio Pascucci. 355-362 [doi]
- Hands-On Learning: Teaching Parallel and Distributed Computing through Unplugged Activities in Undergraduate CS CoursesAnurag Dasgupta, Venkat Margapuri, Simon Shamoun, Shubbhi Taneja, Matthew Toups. 363-369 [doi]
- Scrutinizing Variables for Checkpoint Using Automatic DifferentiationXin Huang, Wubiao Xu, Shiman Meng, Weiping Zhang, Xiang Fu, Luanzheng Guo, Kento Sato. 372-379 [doi]
- A Scalable Training-Free Diffusion Model for Uncertainty QuantificationAli Haisam Muhammad Rafid, Junqi Yin, Yuwei Geng, Siming Liang, Feng Bao 0005, Lili Ju, Guannan Zhang. 380-386 [doi]
- Optimizing Uncertainty Estimation on Scientific Visualizations Using Learning ModelsClara J. Almeida, David Guerrero-Pantoja, Erik Pautsch, Silvio Rizzi 0001, George K. Thiruvathukal, Maria Pantoja. 387-393 [doi]
- Ensemble Simulations on Leadership Computing SystemsAntigoni Georgiadou, Henry Monge-Camacho, Tanvir Sohail, Swarnava Ghosh, Arjun Valiya Parambathu, Dilipkumar N. Asthagiri, Dmytro Bykov, Tushar Athawale, Thomas L. Beck. 394-401 [doi]
- Modes, Persistence and Orthogonality: Blowing MPI UpJesper Larsson Träff, Ioannis Vardas, Sascha Hunold. 404-413 [doi]
- Improving MPI Language Support Through Custom Datatype SerializationJake Tronge, Joseph Schuchart, Lisandro Dalcín, Howard Pritchard. 414-424 [doi]
- MPI Progress For AllHui Zhou, Robert Latham, Ken Raffenetti, Yanfei Guo, Rajeev Thakur. 425-435 [doi]
- Design and Implementation of MPI-Native GPU-Initiated MPI Partitioned CommunicationYiltan Hassan Temuçin, Whit Schonbein, Scott Levy, Amir Hossein Sojoodi, Ryan E. Grant, Ahmad Afsahi. 436-447 [doi]
- Beatnik: A Novel Global Communication Mini-ApplicationJason R. Stewart, Patrick G. Bridges. 448-456 [doi]
- Offloaded MPI message matching: an optimistic approachJerónimo S. García, Salvatore Di Girolamo, Sokol Kosta, J. J. Vegas Olmos, Rami Nudelman, Torsten Hoefler, Gil Bloch. 457-469 [doi]
- Achieving High-Performance Fault-Tolerant Routing in HyperX Interconnection NetworksCristóbal Camarero, Alejandro Cano, Carmen Martínez, Ramón Beivide. 472-483 [doi]
- From Failure to Insight: Analyzing Disk Breakdowns in Large-Scale HPC EnvironmentsAnjus George, Meng Wang, Jesse Hanley, Garrett Wilson Ransom, John Bent, Christopher Zimmer 0001. 484-495 [doi]
- Octopus: Experiences with a Hybrid Event-Driven Architecture for Distributed Scientific ComputingHaochen Pan, Ryan Chard, Sicheng Zhou, Alok Kamatar, Rafael Vescovi, Valérie Hayot-Sasson, André Bauer 0001, Maxime Gonthier, Kyle Chard, Ian T. Foster. 496-507 [doi]
- Checkpointing strategies for a fixed-length executionAnne Benoit, Lucas Perotin, Yves Robert, Frédéric Vivien. 508-518 [doi]
- ProTEA: Programmable Transformer Encoder Acceleration on FPGAEhsan Kabir, Jason D. Bakos, David Andrews 0001, Miaoqing Huang. 521-530 [doi]
- DeLiBA-K: Speeding-up Hardware-Accelerated Distributed Storage Access by Tighter Linux Kernel Integration and Use of a Modern APIBabar Khan, Andreas Koch 0001. 531-544 [doi]
- Testing GPU Numerics: Finding Numerical Differences Between NVIDIA and AMD GPUsAnwar Hossain Zahid, Ignacio Laguna, Wei Le. 547-557 [doi]
- Performance Analysis of Scientific Applications on an NVIDIA Grace SystemAmit Ruhela, John Cazes, John D. McCalpin, Carlos Del-Castillo-Negrete, Junjie Li, Hang Liu, Hanning Chen, Chun-Yaung Lu, Kent F. Milfeld, Wenyang Zhang, Ian Wang, Lars Koesterke, John DeSantis, Nic Lewis, Sean Hempel, Dan Stanzione. 558-566 [doi]
- Understanding Data Movement in AMD Multi-GPU Systems with Infinity FabricGabin Schieffer, Ruimin Shi, Stefano Markidis, Andreas Herten, Jennifer Faj, Ivy Peng. 567-576 [doi]
- Testing the Unknown: A Framework for OpenMP Testing via Random Program GenerationIgnacio Laguna, Patrick J. Chapman, Konstantinos Parasyris, Giorgis Georgakoudis, Cindy Rubio-González. 577-587 [doi]
- Benchmarking and Continuous Performance Monitoring of Ookami, an ARM Fujitsu A64FX Testbed ClusterNikolay A. Simakov, Joseph P. White, Matthew D. Jones, Eva Siegmann, Daniel G. Wood, Firat Coskun, Robert J. Harrison. 588-594 [doi]
- Establishing a High-Performance and Productive Ecosystem for Distributed Execution of Python Functions Using Globus ComputeRachana Ananthakrishnan, Yadu Babuji, Josh Bryan, Kyle Chard, Ryan Chard, Ben Clifford, Ian T. Foster, Lev Gorenstein, Kevin Hunter Kesling, Chris Janidlo, Daniel S. Katz, Reid Mello, J. Gregory Pauloski, Lei Wang. 597-606 [doi]
- Experiences in Managing High-performance Computing Management and Support Tools while Upgrading a Campus ClusterYuwu Chen, Trevor Cooper, Christopher Irving, Mahidhar Tatineni, Nicole Wolter, Dmitry Mishin, Subhashini Sivagnanam. 607-612 [doi]
- CANARI: A Monitoring Framework for Cluster Analysis and Node Assessment for Resource IntegrityRyan T. DeRue, Jacob Verburgt. 613-620 [doi]
- A Hierarchical Deep Learning Approach for Predicting Job Queue Times in HPC SystemsAustin Lovell, Philip Wisniewski, Sarah Rodenbeck, Ashish. 621-628 [doi]
- HPCAdvisor: A Tool for Assisting Users in Selecting HPC Resources in the CloudMarco A. S. Netto. 629-637 [doi]
- Batch Updates of Distributed Streaming Graphs using Linear AlgebraElaheh Hassani, Md Taufique Hussain, Ariful Azad. 645-649 [doi]
- Establish the basis for Breadth-First Search on Frontier System: XBFS on AMD GPUsHaoshen Yang, Hao Lu 0001, Naw Safrin Sattar, Hang Liu 0001, Feiyi Wang. 650-658 [doi]
- xBS-GNN: Accelerating Billion-Scale GNNTraining on FPGAYi-Chien Lin, Zhijie Xu, Viktor K. Prasanna. 659-666 [doi]
- Enhancing Scalability and Performance in Influence Maximization with Optimized Parallel ProcessingHanjiang Wu, Huan Xu, Joongun Park, Jesmin Jahan Tithi, Fabio Checconi, Jordi Wolfson-Pou, Fabrizio Petrini, Tushar Krishna. 667-676 [doi]
- Enhancing Small Message Aggregation with Directive-Based Deferred ExecutionAaron Welch, Oscar R. Hernandez, Stephen W. Poole, Wendy Poole. 677-681 [doi]
- Shared Memory-Aware Latency-Sensitive Message Aggregation for Fine-Grained CommunicationKavitha Chandrasekar, Laxmikant V. Kalé. 682-687 [doi]
- NEO-DNND: Communication-Optimized Distributed Nearest Neighbor Graph ConstructionKeita Iwabuchi, Trevor Steil, Benjamin W. Priest, Roger Pearce, Geoffrey Sanders. 688-696 [doi]
- An Adaptive Asynchronous Approach for the Single-Source Shortest Paths ProblemRitvik Rao, Kavitha Chandrasekar, Laxmikant V. Kalé. 697-702 [doi]
- GPU Accelerated Sparse Cholesky FactorizationM. Ozan Karsavuran, Esmond G. Ng, Barry W. Peyton. 703-707 [doi]
- Efficient Tree-based Parallel Algorithms for N-Body Simulations Using C++ Standard ParallelismThomas Lane Cassell, Tom Deakin, Aksel Alpay, Vincent Heuveline, Gonzalo Brito Gadeschi. 708-717 [doi]
- Linear Algebra Approach for Directed Triad Counting and EnumerationYuttapichai Kerdcharoen, Upasana Sridhar, Orathai Sangpetch, Tze Meng Low. 718-726 [doi]
- Performance evaluation and modelling of single-precision matrix multiplication on Cerebras CS-2Ryunosuke Matsuzaki, Daichi Mukunoki, Takaaki Miyajima. 727-731 [doi]
- Performance Analysis of the NICAM Benchmark on MN-Core ProcessorHikaru Takayashiki, Natsuko Saito, Hiroto Imachi, Ryo Sakamoto, Junichiro Makino. 732-736 [doi]
- Predicting Compute Node Unavailability in HPC: A Graph-Based Machine Learning ApproachRoy Krumpak, Joze M. Rozanec, Martin Molan, Matteo Angelinelli, Andrea Bartolini. 737-740 [doi]
- On Scaling Neuronal Network Simulations Using Distributed ComputingVladimir Omelyusik, Khawar Shehzad, Tyler J. Banks, Praveen Rao 0001, Satish S. Nair. 745-749 [doi]
- Jupyter Notebook Attacks Taxonomy: Ransomware, Data Exfiltration, and Security MisconfigurationPhuong Cao. 750-754 [doi]
- Secure Collaborative Model Training with Dynamic Federated Learning in Multi-Domain EnvironmentsAnestis Dalgkitsis, Alexandros Koufakis, Jorrit Stutterheim, Aleandro Mifsud, Priyanka Atwani, Leon Gommans, Cees de Laat, Chrysa Papagianni, Ana Oprescu. 755-759 [doi]
- LIDC: A Location Independent Multi-Cluster Computing Framework for Data Intensive ScienceSankalpa Timilsina, Susmit Shannigrahi. 760-764 [doi]
- SANReN's 100 Gbps Data Transfer Service: Transferring data fast!Kasandra Pillay, Johann Hugo, Ajay Makan, Thokozani Khwela, Thuso Bogopa, Manqoba Shabalala. 765-769 [doi]
- A Study of a Deterministic Networking Framework for Latency Critical Large Scientific Data TransfersVijeth Kumbarahally Lakshminarayana, Carolina Minami Oguchi, Alex Sim, Kesheng Wu, Dipak Ghosal. 770-774 [doi]
- Recent Linux Improvements that Impact TCP Throughput: Insights from R&E NetworksMarcos F. Schwarz, Brian Tierney, Kiran Vasu, Eli Dart, Christian Esteve Rothenberg, Jeronimo Bezerra, Italo Valcy S. Brito. 775-784 [doi]
- Improving transition to IPv6-only via RFC8925 and IPv4 DNS Interventions : A case study in implementing an IPv6-only testbed which informs IPv4-only clients why internet access is unavailableTom Costello, Nick Buraglio, Andy Fleming, Ben Tasker, Brandon Siegel. 785-792 [doi]
- Leveraging In-band Network Telemetry for Automated DDoS Detection in Production Programmable Networks: The AmLight Use CaseHadi Sahin, Jeronimo Bezerra, Italo Brito 0001, Renata Frez, Vasilka Chergarova, Luis Fernandez Lopez, Julio Ibarra. 793-802 [doi]
- Entanglement Capacity Estimates and Throughput Measurements of Quantum ChannelsNageswara S. V. Rao, Muneer Alshowkan, Joseph C. Chapman, Nicholas A. Peters, Hsuan-Hao Lu, Joseph M. Lukens, Saikat Guha. 803-809 [doi]
- QNTN: Establishing a Regional Quantum Network in TennesseeMohamed Shaban, Muhammad Ismail, Mariam Kiran. 810-818 [doi]
- New Techniques to Route in Folded-Clos Topology Data Center NetworksPeter Willis 0003, Nirmala Shenoy, Yin Pan, Bill Stackpole, John F. Hamilton. 819-828 [doi]
- Framework for Integrating Machine Learning Methods for Path-Aware Source RoutingAnees Al-Najjar, Domingos Paraiso, Mariam Kiran, Cristina K. Dominicini, Everson Scherrer Borges, Rafael S. Guimaraes, Magnos Martinello, Harvey Newman. 829-838 [doi]
- Bridging Gaps in Simulation Analysis through a General Purpose, Bidirectional Steering Interface with AscentAndres Sewell, Dimitrios K. Fytanidis, Victor A. Mateevitsi, Cyrus Harrison, Nicole Marsaglia, Thomas Marrinan, Silvio Rizzi 0001, Joseph A. Insley, Michael E. Papka, Steve Petruzza. 841-846 [doi]
- DyTwin: Federated Adaptive Digital Twins for Data Centers - Visualization and Anomaly DetectionEbad Taheri, Pedro Bruel, Pavana Prakash, Gourav Rattihalli, Ninad Hogade, Aditya Dhakal, Rolando P. Hong Enriquez, Torsten Wilde, Leo Popokh, Dejan S. Milojicic, Cullen E. Bash. 847-852 [doi]
- Scalable In-Situ Visualization for Extreme-Scale SPH SimulationsYiqing Zhu, Osman Seckin Simsek, Jean M. Favre, Rubén M. Cabezón, Florina M. Ciorba. 853-858 [doi]
- Can Current SDS Controllers Scale To Modern HPC Infrastructures?Mariana Miranda, Yusuke Tanimura, Jason Haga, Amit Ruhela, Stephen Lien Harrell, John Cazes, Ricardo Macedo, José Pereira 0001, João Paulo 0001. 861-868 [doi]
- Benchmarking Ethernet Interconnect for HPC/AI workloadsLorenzo Pichetti, Daniele De Sensi, Karthee Sivalingam, Stepan Nassyr, Daniele Cesarini, Matteo Turisini, Dirk Pleiter, Aldo Artigiani, Flavio Vella. 869-875 [doi]
- Performance analysis of a stencil code using modern C++Victor Eijkhout, Yojan Chitkara, Daksh Chaplot. 876-885 [doi]
- Protocol Buffer Deserialization DPU Offloading in the RPC DatapathRaphaël Frantz, Jerónimo Sánchez García, Marcin Copik, Idelfonso Tafur Monroy, Juan José Vegas Olmos, Gil Bloch, Salvatore Di Girolamo. 886-895 [doi]
- An Efficient Checkpointing System for Large Machine Learning Model TrainingWubiao Xu, Xin Huang, Shiman Meng, Weiping Zhang, Luanzheng Guo, Kento Sato. 896-900 [doi]
- Predicting Protein Folding on Intel's Data Center GPU Max Series Architecture (PVC)Dhani Ruhela, Madhavan Prasanna, Aaditya Saxena. 901-908 [doi]
- Modeling and Simulation of Collective Algorithms on HPC Network Topologies using Structural Simulation ToolkitSai P. Chenna, Michael Steyer, Nalini Kumar, Maria Garzaran 0001, Philippe Thierry. 909-916 [doi]
- Evaluating Tuning Opportunities of the LLVM/OpenMP RuntimeSmeet Chheda, Gaurav Verma, Shilei Tian, Barbara M. Chapman, Johannes Doerfert. 919-929 [doi]
- Pragma driven shared memory parallelism in Zig by supporting OpenMP loop directivesDavid Kacs, Joseph Lee, Justs Zarins, Nick Brown 0002. 930-938 [doi]
- Fully integrating the Flang Fortran compiler with standard MLIRNick Brown 0002. 939-949 [doi]
- Parallel Runtime Interface for Fortran (PRIF): A Multi-Image Solution for LLVM FlangDan Bonachea, Katherine Rasmussen, Brad Richardson, Damian W. I. Rouson. 950-960 [doi]
- A Flang Plugin for Fortran Feature CharacterizationAnthony M. Cabrera, David E. Bernholdt. 961-967 [doi]
- PIMnast: Balanced Data Placement for GEMV Acceleration with Processing-In-MemoryMohamed Assem Ibrahim, Mahzabeen Islam, Shaizeen Aga. 970-981 [doi]
- ACID Support for Compute eXpress Link Memory TransactionsEllis Giles, Peter J. Varman. 982-995 [doi]
- Multi-level Memory-Centric Profiling on ARM Processors with ARM SPESamuel Miksits, Ruimin Shi, Maya B. Gokhale, Jacob Wahlgren, Gabin Schieffer, Ivy Peng. 996-1005 [doi]
- Sum Reduction with OpenMP Offload on NVIDIA Grace-Hopper SystemZheming Jin. 1006-1013 [doi]
- UpDown: Combining Scalable Address Translation with Locality ControlYuqing Wang, Swann Perarnau, Andrew A. Chien. 1014-1024 [doi]
- Symmetric Locality: Definition and Initial ResultsGiordan Escalona, Dylan McKellips, Chen Ding 0001. 1025-1034 [doi]
- AM-DGCNN: Leveraging Graph Attention Networks and Edge Attributes for Link Classification in Knowledge GraphsDhroov Pandey, Tong Shu. 1037-1045 [doi]
- MDLoader: A Hybrid Model-Driven Data Loader for Distributed Graph Neural Network TrainingJonghyun Bae, Jong Youl Choi, Massimiliano Lupo Pasini, Kshitij Mehta, Pei Zhang, Khaled Z. Ibrahim. 1046-1057 [doi]
- Scalable and Consistent Graph Neural Networks for Distributed Mesh-based Data-driven ModelingShivam Barwey, Riccardo Balin, Bethany Lusch, Saumil Patel, Ramesh Balakrishnan, Pinaki Pal, Romit Maulik, Venkatram Vishwanath. 1058-1070 [doi]
- IRIS-GNN: Leveraging Graph Neural Networks for Scheduling on Truly Heterogeneous Runtime SystemsBeau Johnston, Thibault de Boissiere, Mohammad Alaul Haque Monil, Narasinga Rao Miniskar, Aaron R. Young, Seyong Lee, Jeffrey S. Vetter. 1071-1080 [doi]
- Acceleration of Graph Neural Networks with Heterogenous Accelerators ArchitectureKaiwen Cao, Archit Gajjar, Liad Gerstman, Kun Wu 0002, Sai Rahul Chalamalasetti, Aditya Dhakal, Giacomo Pedretti, Pavana Prakash, Wen-mei Hwu, Deming Chen, Dejan S. Milojicic. 1081-1089 [doi]
- Performance Modeling and Analysis of a de Bruijn Graph Based Local Assembly Kernel on Multiple Vendor GPUsLeAnn M. Lindsey, Nan Ding 0006, Jack Deslippe, Muaaz Awan. 1092-1105 [doi]
- Optimizing MILC-Dslash Performance on NVIDIA A100 GPU: Parallel Strategies using SYCLAmanda S. Dufek, Steven A. Gottlieb, Muaaz Gul Awan, Douglas Adriano Augusto, Jack Deslippe, Brandon Cook 0001. 1106-1116 [doi]
- Development of performance portable spline solver for exa-scale plasma turbulence simulationYuuichi Asahi, Baptiste Legouix, Emily Bourne, Thomas Padioleau, Julien Bigot, Virginie Grandgirard, Kevin Obrejan. 1117-1128 [doi]
- Autonomous Execution for Multi-GPU Systems: Compiler SupportJavid Baydamirli, Tal Ben-Nun, Didem Unat. 1129-1140 [doi]
- Performance Portable Optimizations of an Ice-sheet Modeling Code on GPU-supercomputersOscar Antepara, Samuel Williams 0001, Max Carlson, Jerry Watkins. 1141-1151 [doi]
- Performance portability via C++ PSTL, SYCL, OpenMP, and HIP: the Gaia AVU-GSR case studyGiulio Malenza, Valentina Cesare, Marco Edoardo Santimaria, Robert Birke, Alberto Vecchiato, Ugo Becciani, Marco Aldinucci. 1152-1163 [doi]
- Performance and Power: Systematic Evaluation of AI Workloads on Accelerators with CARAMLChelsea Maria John, Stepan Nassyr, Carolin Penke, Andreas Herten. 1164-1176 [doi]
- High-Performance, Scalable Geometric Multigrid via Fine-Grain Data Blocking for GPUsOscar Antepara, Samuel Williams 0001, Hans Johansen, Mary W. Hall. 1177-1191 [doi]
- A Metric for HPC Programming Model ProductivityWei-Chen Lin, Tom Deakin, Simon McIntosh-Smith. 1192-1205 [doi]
- RAJA Performance Suite: Performance Portability Analysis with Caliper and ThicketOlga Pearce, Jason Burmark, Rich Hornung, Befikir Bogale, Ian Lumsden, Michael McKinsey, Dewi Yokelson, David Böhme, Stephanie Brink, Michela Taufer, Tom Scogland. 1206-1218 [doi]
- Speeding-Up LULESH on HPX: Useful Tricks and Lessons Learned using a Many-Task-Based ApproachTorben Kalkhof, Andreas Koch 0001. 1223-1235 [doi]
- Lamellar: A Rust-based Asynchronous Tasking and PGAS Runtime for High Performance ComputingRyan D. Friese, Roberto Gioiosa, Joseph Cottam, Erdal Mutlu, Gregory Roek, Polykarpos Thomadakis, Mark Raugas. 1236-1251 [doi]
- Applying a Task-Based Approach to Distributed Machine Learning WorkflowsFernando Vázquez-Novoa, Daniele Lezzi, Francesc Lordan, Fatemeh Baghdadi, Davide Cirillo. 1252-1261 [doi]
- Accelerating Multi-GPU Embedding Retrieval with PGAS-Style Communication for Deep Learning Recommendation SystemsYuxin Chen, Aydin Buluç, Katherine A. Yelick, John D. Owens. 1262-1273 [doi]
- Mitigating synchronization bottlenecks in high-performance actor-model-based softwareKyle Klenk, Mohammad Mahdi Moayeri, Junwei Guo, Martyn P. Clark, Raymond J. Spiteri. 1274-1287 [doi]
- ® SHMEM: GPU-initiated OpenSHMEM using SYCLAlex Brooks, Philip Marshall, David Ozog, Md. Wasi-ur-Rahman, Lawrence Stewart, Rithwik Tom. 1288-1301 [doi]
- Initial Experiences with DAOS Object Storage on AuroraRobert Latham, Robert B. Ross, Philip H. Carns, Shane Snyder, Kevin Harms, Kaushik Velusamy, Paul Coffman, Gordon McPheeters. 1304-1310 [doi]
- Mosaic: Detection and Categorization of I/O Patterns in HPC ApplicationsThéo Jolivel, François Tessier, Julien Monniot, Guillaume Pallez. 1311-1319 [doi]
- Copper: Cooperative Caching Layer for Scalable Data Loading in Exascale SupercomputersNoah Lewis, Kaushik Velusamy, Kevin Harms, Huihuo Zheng. 1320-1329 [doi]
- Understanding and Predicting Cross-Application I/O Interference in HPC Storage SystemsChris Egersdoerfer, Md. Hasanur Rashid, Dong Dai 0001, Bo Fang, Tallent Nathan. 1330-1339 [doi]
- Exploring DAOS Interfaces and PerformanceNicolau Manubens, Johann Lombardi, Simon D. Smart, Emanuele Danovaro, Tiago Quintino, Dean Hildebrand, Adrian Jackson. 1340-1348 [doi]
- Fault-Tolerant Deep Learning Cache with Hash Ring for Load Balancing in HPC SystemsSeoyeong Lee, Awais Khan 0002, Yoochan Kim, Junghwan Park, Soon Hwang, Jae Kook Lee, Taeyoung Hong, Christopher Zimmer 0001, Youngjae Kim 0001. 1349-1357 [doi]
- LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI AcceleratorsKrishna Teja Chitty-Venkata, Siddhisanket Raskar, Bharat Kale, Farah Ferdaus, Aditya Tanikanti, Ken Raffenetti, Valerie Taylor 0001, Murali Emani, Venkatram Vishwanath. 1362-1379 [doi]
- Comprehensive Performance Modeling and System Design Insights for Foundation ModelsShashank Subramanian, Ermal Rrapaj, Peter Harrington, Smeet Chheda, Steven Farrell, Brian Austin, Samuel Williams 0001, Nicholas J. Wright, Wahid Bhimji. 1380-1397 [doi]
- System-Wide Roofline Profiling -a Case Study on NERSC's Perlmutter SupercomputerBrian Austin, Dhruva Kulkarni, Brandon Cook 0001, Samuel Williams 0001, Nicholas J. Wright. 1398-1404 [doi]
- Microarchitectural comparison and in-core modeling of state-of-the-art CPUs: Grace, Sapphire Rapids, and GenoaJan Laukemann, Georg Hager, Gerhard Wellein. 1405-1412 [doi]
- Benchmarking the Evolution of Performance and Energy Efficiency Across Recent Generations of Intel Xeon ProcessorsBalázs Drávai, István Z. Reguly. 1413-1419 [doi]
- Performance Analysis of Runtime Handling of Zero-Copy for OpenMP Programs on MI300A APUsCarlo Bertolli, Thorsten Blass, Lynd Stringer, Nicole Aschenbrenner, Jan-Patrick Lehr, Doru Bercea, Dhruva R. Chakrabarti, Lawrence Meadows, Ron Lieberman. 1420-1429 [doi]
- Ponte Vecchio Across the Atlantic: Single-Node Benchmarking of Two Intel GPU SystemsThomas Applencourt, Aditya Sadawarte, Servesh Muralidharan, Colleen Bertoni, JaeHyuk Kwack, Ye Luo, Esteban Rangel, John R. Tramm, Yasaman Ghadar, Arjen Tamerus, Chris Edsall, Tom Deakin. 1430-1442 [doi]
- Hello SME! Generating Fast Matrix Multiplication Kernels Using the Scalable Matrix ExtensionStefan Remke, Alexander Breuer. 1443-1454 [doi]
- AI-Assisted Design-Space Analysis of High-Performance Arm ProcessorsJoseph Moore, Tom Deakin, Simon McIntosh-Smith. 1455-1467 [doi]
- Impact of Varying BLAS Precision on DCMESHNariman Piroozan, S. John Pennycook, Taufeq Mohammed Razakh, Peter Caday, Nalini Kumar, Aiichiro Nakano. 1468-1480 [doi]
- Assessing the GPU Offload Threshold of GEMM and GEMV Kernels on Modern Heterogeneous HPC SystemsFinn Wilkinson, Alex Cockrean, Wei-Chen Lin, Simon McIntosh-Smith, Tom Deakin. 1481-1495 [doi]
- Understanding VASP Power Profiles on NVIDIA A100 GPUsZhengji Zhao, Brian Austin, Ermal Rrapaj, Nicholas J. Wright. 1496-1505 [doi]
- Workload-Adaptive Scheduling for Efficient Use of Parallel File Systems in High-Performance Computing ClustersAlexander V. Goponenko, Benjamin A. Allan, Jim M. Brandt, Damian Dechev. 1506-1516 [doi]
- Are Noise-Resilient Logical Timers Useful for Performance Analysis?Gregor Corbin, Nour Daoud, Bernd Mohr, Gustavo de Morais, Felix Wolf 0001. 1519-1530 [doi]
- P-MoVE: Performance Monitoring and Visualization with Encoded KnowledgeFatih Tasyaran, Osman Yasal, José A. Morgado, Aleksandar Ilic, Didem Unat, Kamer Kaya. 1531-1542 [doi]
- Analyzing HPC Utilization with PIKA and VampirWilliam R. Williams, Maximilian Knespel, Frank Winkler, Bert Wesarg. 1543-1550 [doi]
- Performance Measurement on Heterogeneous Processors with PAPIWillow E. Cunningham, Vincent M. Weaver. 1551-1561 [doi]
- Inspection of I/O Operations from System Call Traces using Directly-Follows-GraphAravind Sankaran, Ilya Zhukov, Wolfgang Frings, Paolo Bientinesi. 1562-1575 [doi]
- An Incremental Multi-Level, Multi-Scale Approach to Assessment of Multifidelity HPC SystemsShilpika, Bethany Lusch, Venkatram Vishwanath, Michael E. Papka. 1576-1587 [doi]
- Overhead-Guided Instrumentation RefinementSebastian Kreutzer, Peter Arzt, Jonas Rickert, Jan-Patrick Lehr, Christian Iwainsky, Christian H. Bischof. 1588-1598 [doi]
- ActorProf: A Framework for Profiling and Visualizing Fine-grained Asynchronous Bulk Synchronous Parallel ExecutionJiawei Yang, Shubhendra Pal Singhal, Jun Shirako, Akihiro Hayashi, Vivek Sarkar. 1599-1610 [doi]
- Examining the Viability of Row-Scale Disaggregation for Production ApplicationsCurtis Shorts, Ryan E. Grant. 1613-1621 [doi]
- Towards Disaggregated NDP Architectures for Large-scale Graph AnalyticsSuyeon Lee, Vishal Rao, Ada Gavrilovska. 1622-1629 [doi]
- Granularity- and Interference-Aware GPU Sharing with MPSAlex Weaver, Krishna Kavi, Dejan S. Milojicic, Rolando Pablo Hong Enriquez, Ninad Hogade, Alok Mishra 0002, Gayatri Mehta. 1630-1637 [doi]
- Multi-Host Sharing of a Single-Function NVMe Device in a PCIe ClusterJonas Markussen, Lars Bjørlykke Kristiansen, Håkon Kvale Stensland, Pål Halvorsen. 1638-1645 [doi]
- A Software Platform to Support Disaggregated Quantum AcceleratorsErcüment Kaya, Jorge Echavarria, Muhammad Nufail Farooqi, Aleksandra Swierkowska, Patrick Hopf, Burak Mete, Lukas Burgholzer, Robert Wille, Laura Brandon Schulz, Martin Schulz 0001. 1646-1653 [doi]
- Preparing for HPC on RISC-V: Examining Vectorization and Distributed Performance of an Astrophysics Application with HPX and KokkosPatrick Diehl, Panagiotis Syskakis, Gregor Daiß, Steven R. Brandt, Alireza Kheirkhahan, Srinivas Yadav Singanaboina, Dominic Marcello, Chris Taylor, John Leidel, Hartmut Kaiser. 1656-1665 [doi]
- Top-Down Microarchitecture Analysis Approximation Based on Performance Counter Architecture for SiFive RISC-V ProcessorsChan-Yu Mou, Chao-Chieh Hsiao, Jerry Chou. 1666-1675 [doi]
- Web-Based Simulator of Superscalar RISC-V ProcessorsJiri Jaros, Michal Majer, Jakub Horky, Jan Vavra. 1676-1684 [doi]
- Development of Fedora Linux Distribution for RISC-V (RV64G) ArchitectureSurendra Billa, Arif Badar, Rushikesh Jadhav, Yogeshwar Sonawane, Sanjay Wandhekar. 1685-1689 [doi]
- Accelerating stencils on the Tenstorrent Grayskull RISC-V acceleratorNick Brown, Ryan Barton. 1690-1700 [doi]
- High-Performance Eigensolver Combining EigenExa and Iterative RefinementYuki Uchino, Toshiyuki Imamura. 1703-1712 [doi]
- Leveraging Hybrid Classical-Quantum Methods for Efficient Load Rebalancing in HPCJustyna Zawalska, Minh Chung, Katarzyna Rycerz, Laura Brandon Schulz, Martin Schulz 0001, Dieter Kranzlmüller. 1713-1722 [doi]
- A Performance-Portable MultiGPU Implementation of 3D Euler Equations using ProtoX and IRISHet Mankad, Mohammad Alaul Haque Monil, Sanil Rao, Phillip Colella, Brian van Straalen, Franz Franchetti, Jeffrey S. Vetter. 1723-1731 [doi]
- Accelerating an overhead-sensitive atmospheric model on GPUs using asynchronous execution and kernel fusionKazuya Yamazaki. 1732-1739 [doi]
- Sequences of Distributed Matrix-Vector Product for Very Large and Very Sparse Irregular MatricesMaxence Vandromme, Nicolas Hochart, Serge G. Petiton, Jérôme Gurhem, Miwako Tsuji, Mitsuhisa Sato. 1740-1747 [doi]
- An Ising-based Decision Method for Intra Prediction Mode in Video CodingTakuto Momominami, Naoya Niwa, Masahito Kumagai, Kazuhiko Komatsu, Hiroaki Kobayashi, Hiroe Iwasaki. 1748-1754 [doi]
- Federated Single Sign-On and Zero Trust Co-design for AI and HPC Digital Research InfrastructuresSadaf R. Alam, Christopher J. Woods, Matt Williams, Dave Moore, Isaac Prior, Ethan Williams, Anna Price, James Womack, Simon McIntosh-Smith, Fan Yang-Turner, Matt Pryor, Ilja Livenson. 1756-1764 [doi]
- HPC with Enhanced User SeparationAndrew Prout, Albert Reuther, Michael Houle 0001, Michael Jones 0001, Peter Michaleas, LaToya Anderson, William Arcand, Bill Bergeron, David Bestor, Alex Bonn, Daniel Burrill, Chansup Byun, Vijay Gadepally, Matthew Hubbell, Hayden Jananthan, Piotr Luszczek, Lauren Milechin, Guillermo Morales, Julie Mullen, Antonio Rosa, Charles Yee, Jeremy Kepner. 1765-1772 [doi]
- Using Malware Detection Techniques for HPC Application ClassificationThomas Jakobsche, Florina M. Ciorba. 1773-1780 [doi]
- Security Testbed for Preempting Attacks against Supercomputing InfrastructurePhuong Cao, Zbigniew Kalbarczyk, Ravishankar K. Iyer. 1781-1788 [doi]
- Towards Sustainable Post-Exascale Leadership ComputingWoong Shin, James B. White, Wael R. Elwasif, Rafael Ferreira da Silva, Christopher Zimmer 0001, Bronson Messer, Reuben D. Budiardja, Antigoni Georgiadou, Verónica Melesse Vergara, Jack Lange, Matthias Maiterth, Tim Osborne, Leah Huk, John K. Holmen, Nick Hagerty, Ahmad Maroof Karimi, Thomas J. Naughton, Ryan Adamson, Ryan Prout, Feiyi Wang, Scott Atchley, Kevin G. Thach, Thomas Beck, Sarp Oral. 1790-1794 [doi]
- Navigating Exascale Operational Data Analytics: From Inundation to InsightWoong Shin, Tim Osborne, Ahmad Maroof Karimi, Rachel Palumbo, Alex May 0002, Corwin Lester, Jesse Hines, Naw Safrin Sattar, Leah Huk, Scott Simmerman, Wesley Brewer, Jeffrey Miller, Ryan Adamson, Olga A. Kuchar, Ryan Prout, Feiyi Wang, Scott Atchley, Sarp Oral. 1795-1804 [doi]
- Sustainable AI: Experiences, Challenges & RecommendationsEleanor Broadway, Joseph K. L. Lee, Michèle Weiland. 1805-1814 [doi]
- hws: A Tool for Monitoring Hardware Metrics Across Diverse Vendors: A Case Study on Hyperparameter Optimization AlgorithmsMarcel Breyer, Alexander Van Craen, Peter Domanski, Dirk Pflüger. 1815-1825 [doi]
- Increasing Energy Efficiency of Astrophysics Simulations Through GPU Frequency ScalingOsman Seckin Simsek, Jean-Guillaume Piccinali, Florina M. Ciorba. 1826-1834 [doi]
- Exploring the Frontiers of Energy Efficiency using Power Management at System ScaleAhmad Maroof Karimi, Matthias Maiterth, Woong Shin, Naw Safrin Sattar, Hao Lu 0001, Feiyi Wang. 1835-1844 [doi]
- Vendor-neutral and Production-grade Job Power Management in High Performance ComputingNaman Kulshreshtha, Tapasya Patki, Jim Garlick, Mark Grondona, Rong Ge. 1845-1855 [doi]
- Analysis of Power Consumption and GPU Power Capping for MILCFatih Acun, Zhengji Zhao, Brian Austin, Ayse K. Coskun, Nicholas J. Wright. 1856-1861 [doi]
- CEEMS: A Resource Manager Agnostic Energy and Emissions Monitoring StackMahendra Paipuri. 1862-1866 [doi]
- AIOps and Sustainability: Transforming Data Centers for a Greener FutureSubrahmanya Vinayak Joshi, Sergey Serebryakov, Deepak Nanjundaiah, Tejas Hegde. 1867-1871 [doi]
- Input-Dependent Power Usage in GPUsTheo Gregersen, Pratyush Patel, Esha Choukse. 1872-1877 [doi]
- EE-HPC a Framework for Energy Efficient HPC System ManagementChristian Terboven, Radita Liem, J. Gracia, Kingshuk Haldar, J. F. Engels, Pay Giesselmann, David Brayford, T. Wilde, C. Simmendinger, M. Marquardt, Jan Eitzinger, T. Gruber. 1878-1882 [doi]
- LLM4VV: Exploring LLM-as-a-Judge for Validation and Verification TestsuitesZachariah Sollenberger, Jay Patel, Christian Munley, Aaron Jarmusch, Sunita Chandrasekaran. 1885-1893 [doi]
- Towards an Optimized Heterogeneous Distributed Task Scheduler in OpenMP ClusterRémy Neveu, Rodrigo Ceccato, Gustavo Leite, Guido Araujo, Jose Manuel Monsalve Diaz, Hervé Yviquel. 1894-1903 [doi]
- Portability of Fortran's 'do concurrent' on GPUsRonald M. Caplan, Miko M. Stulajter, Jon A. Linker, Jeff Larkin, Henry A. Gabb, Shiquan Su, Ivan Rodriguez, Zachary S. Tschirhart, Nicholas Malaya. 1904-1913 [doi]
- Productive, Vendor-Neutral GPU Programming Using ChapelEngin Kayraklioglu, Andy Stone. 1914-1922 [doi]
- OpenACC offloading of the MFC compressible multiphase flow solver on AMD and NVIDIA GPUsBenjamin Wilfong, Anand Radhakrishnan, Henry Le Berre, Steve Abbott, Reuben D. Budiardja, Spencer H. Bryngelson. 1923-1933 [doi]
- Optimizing the Weather Research and Forecasting Model with OpenMP Offload and CodeeChayanon Namo Wichitrnithed, Woo-Sun Yang, Yun Helen He, Brad Richardson, Koichi Sakaguchi, Manuel Arenaz, William I. Gustafson Jr., Jacob Shpund, Ulises Costi Blanco, Álvaro Goldar Dieste. 1934-1942 [doi]
- Performance Portability of Electron Repulsion Integrals and Their Related Methods across Peta to Exascale ArchitecturesTosaporn Sattasathuchana, Peng Xu, Dossay Oryspayev, Colleen Bertoni, Luke B. Roskop, Mark S. Gordon. 1943-1954 [doi]
- JACC: Leveraging HPC Meta-Programming and Performance Portability with the Just-in-Time and LLVM-based Julia LanguagePedro Valero-Lara, William F. Godoy, Het Mankad, Keita Teranishi, Jeffrey S. Vetter, Johannes P. Blaschke, Michel Schanen. 1955-1966 [doi]
- Heterogeneous computing in a strongly-connected CPU-GPU environment: fast multiple time-evolution equation-based modeling accelerated using data-driven approachTsuyoshi Ichimura, Kohei Fujita, Muneo Hori, Lalith Maddegedara, Jack C. Wells, Alan Gray, Ian Karlin, John Linford. 1967-1978 [doi]
- ACC Saturator: Automatic Kernel Optimization for Directive-Based GPU CodeKazuaki Matsumura, Simon Garcia De Gonzalo, Antonio J. Peña. 1979-1990 [doi]
- Enabling Scientific Collaboration with JupyterHubMin Ragan-Kelley, Matthew L. Henderson, Fernando Pérez, Rollin C. Thomas, Shreyas Cholia, Lavanya Ramakrishnan. 1993-2002 [doi]
- JUmPER: Performance Data Monitoring, Instrumentation and Visualization for Jupyter NotebooksElias Werner, Anton Rygin, Andreas Gocht-Zech, Sebastian Döbel, Matthias Lieber. 2003-2011 [doi]
- Interactive HPC and the LUNARC Desktop EnvironmentJonas Lindemann, Anders Follin. 2012-2019 [doi]
- A software ecosystem for multi-level provenance management in large-scale scientific workflows for AI applicationsGabriele Padovani, Valentine Anantharaj, Ludovica Sacco, Takuya Kurihana, M. Bunino, K. Tsolaki, M. Girone, F. Antonio, C. Sopranzetti, M. Fronza, Sandro Fiore. 2024-2031 [doi]
- Performance Characterization and Provenance of Distributed Task-based Workflows on HPC PlatformsAmal Gueroudji, Chase Phelps, Tanzima Z. Islam, Philip H. Carns, Shane Snyder, Matthieu Dorier, Robert B. Ross, Line C. Pouchard. 2032-2039 [doi]
- Parsl+CWL: Towards Combining the Python and CWL EcosystemsNishchay Karle, Ben Clifford, Yadu Babuji, Ryan Chard, Daniel S. Katz, Kyle Chard. 2040-2047 [doi]
- Towards Generating Contracts for Scientific Data Analysis WorkflowsDuc Anh Vu 0001, Timo Kehrer. 2048-2055 [doi]
- Enabling Low-Overhead HT-HPC Workflows at Extreme Scale using GNU ParallelKetan Maheshwari, William Arndt, Ahmad Maroof Karimi, Junqi Yin, Frédéric Suter, Seth Johnson, Rafael Ferreira da Silva. 2056-2063 [doi]
- Managing Workflow Malleability in Urgent Computing for Earthquake AlertsJorge Ejarque, Marisol Monterrubio Velasco, Cedric Bhihe, Marta Pienkowska, Josep de la Puente, Rosa M. Badia. 2064-2071 [doi]
- A Microservices Architecture Toolkit for Interconnected Science EcosystemsMichael J. Brim, Lance Drane, Marshall T. McDonnell, Christian Engelmann, Addi Malviya-Thakur. 2072-2079 [doi]
- Shepherd: Seamless Integration of Service Workflows into Task-Based Workflows through Log MonitoringMd Saiful Islam, Douglas Thain. 2080-2087 [doi]
- Laminar 2.0: Serverless Stream Processing with Enhanced Code Search and RecommendationsDaniel Rotchford, Samuel Evans, Rosa Filgueira. 2088-2095 [doi]
- Serverless Computing for Dynamic HPC WorkflowsVijay Thurimella, Philipp Raith, Rolando P. Hong Enriquez, Anderson Andrei Da Silva, Gourav Rattihalli, Ada Gavrilovska, Dejan S. Milojicic. 2096-2103 [doi]
- Integrating ORNL's HPC and Neutron Facilities with a Performance-Portable CPU/GPU EcosystemSteven E. Hahn, Philip W. Fackler, William F. Godoy, Ketan Maheshwari, Zachary Morgan, Andrei T. Savici, Christina M. Hoffmann, Pedro Valero-Lara, Jeffrey S. Vetter, Rafael Ferreira da Silva. 2107-2117 [doi]
- Expediting Higher Fidelity Plasma State Reconstructions for the DIII-D National Fusion Facility Using Leadership Class Computing ResourcesSterling Paul Smith, Zichuan Anthony Xing, Torrin Bechtel Amara, Severin Sebastian Denk, Earl William DeShazer, Orso Meneghini, Tom Neiser, Laurie Stephey, Oscar Antepara, Christopher Mitchell Clark, Eli Dart, Pengfei Ding, Sean Flanagan, Raffi Nazikian, David Schissel, Christine Simpson, Nicholas Scoville Tyler, Thomas D. Uram, Samuel Webb Williams. 2118-2126 [doi]
- Scalable Multi-Facility Workflows for Artificial Intelligence Applications in Climate ResearchTakuya Kurihana, Tyler J. Skluzacek, Rafael Ferreira da Silva, Valentine Anantharaj. 2127-2134 [doi]
- Portable Cross-Facility Workflows for X-ray PtychographyAlbert Vong, Steven F. Henke, Oliver Hoidn, Hannah Parraga, Tao Zhou, Apurva Mehta, Nicholas Schwarz. 2135-2143 [doi]
- Matrix Sketching for Online Analysis of LCLS Imaging DatasetsJohn Winnicki, Frédéric Poitevin, Haoyuan Li, Eric Darve. 2144-2153 [doi]
- Active Learning Surrogates for Integrating Electron Microscopy and Computational Insights from Simulations in Autonomous ExperimentsGayathri Saranathan, Ayana Ghosh, Martin Foltin, Ann Mary Justine Koomthanam, Aalap Tripathy, Maxim A. Ziatdinov, Suparna Bhattacharya, Kevin Roccapriore, Paolo Faraboschi. 2154-2161 [doi]
- Bluesky and Raydata: An Integrated Platform for Adaptive Experiment OrchestrationHiran Wijesinghe, Andi M. Barbour, Lutz Wiegart, Evan Carlin, Joshua Einstein-Curtis, Paul Moeller, Robert Nagler, Raven O'Rourke, Nathan Cook, Maksim S. Rakitin. 2162-2167 [doi]
- Cavity in the LoopDavid Scheunert, Jonas Gehrunger, Christian Hochberger, Dieter Lens, Harald Klingbeil. 2168-2175 [doi]