Abstract is missing.
- MPI-RICAL: Data-Driven MPI Distributed Parallelism Assistance with TransformersNadav Schneider, Tal Kadosh, Niranjan Hasabnis, Timothy Mattson, Yuval Pinter, Gal Oren 0001. 1-10 [doi]
- VSCuda: LLM based CUDA extension for Visual Studio CodeBrian Chen, Nafis Mustakin, Alvin Hoang, Sakib Fuad, Daniel Wong. 11-17 [doi]
- A Comparison of Mesh-Free Differentiable Programming and Data-Driven Strategies for Optimal Control under PDE ConstraintsRoussel Desmond Nzoyem Ngueguin, David A. W. Barton, Tom Deakin. 18-28 [doi]
- Autotuning Apache TVM-based Scientific Applications Using Bayesian OptimizationXingfu Wu, Praveen Paramasivam, Valerie E. Taylor. 29-35 [doi]
- Enhancing Heterogeneous Federated Learning with Knowledge Extraction and Multi-Model FusionDuy Phuong Nguyen, Sixing Yu, J. Pablo Muñoz, Ali Jannesari. 36-43 [doi]
- Elastic deep learning through resilient collective operationsJiali Li, George Bosilca, Aurelien Bouteiller, Bogdan Nicolae. 44-50 [doi]
- Towards Foundation Models for Materials Science: The Open MatSci ML ToolkitKin Long Kelvin Lee, Carmelo Gonzales, Matthew Spellings, Mikhail Galkin 0001, Santiago Miret, Nalini Kumar. 51-59 [doi]
- Accelerating Particle and Fluid Simulations with Differentiable Graph Networks for Solving Forward and Inverse ProblemsKrishna Kumar, Yongjin Choi. 60-65 [doi]
- Machine Learning Applied to Single-Molecule Activity PredictionKendric Hood, Qiang Guan. 66-72 [doi]
- Entropy-driven Optimal Sub-sampling of Fluid Dynamics for Developing Machine-learned SurrogatesWesley Brewer, Daniel Martinez, Muralikrishnan Gopalakrishnan Meena, Aditya Kashi, Katarzyna Borowiec, Siyan Liu, Christopher Pilmaier, Greg W. Burgreen, Shanti Bhushan. 73-80 [doi]
- Towards Rapid Autonomous Electron Microscopy with Active Meta-LearningGayathri Saranathan, Martin Foltin, Aalap Tripathy, Maxim A. Ziatdinov, Ann Mary Justine Koomthanam, Suparna Bhattacharya, Ayana Ghosh, Kevin Roccapriore, Sreenivas Rangan Sukumar, Paolo Faraboschi. 81-87 [doi]
- Enabling Performant Thermal Conductivity Modeling with DeePMD and LAMMPS on CPUsNariman Piroozan, Nalini Kumar. 88-94 [doi]
- Protein Generation via Genome-scale Language Models with Bio-physical ScoringGautham Dharuman, Arvind Ramanathan. 95-101 [doi]
- Tencoder: tensor-product encoder-decoder architecture for predicting solutions of PDEs with variable boundary dataAditya Kashi. 102-108 [doi]
- Tournament-Based Pretraining to Accelerate Federated LearningMatt Baughman, Nathaniel Hudson, Ryan Chard, André Bauer 0001, Ian T. Foster, Kyle Chard. 109-115 [doi]
- AI/ML-Derived Whole-Genome Predictor Prospectively and Clinically Predicts Survival and Response to Treatment in Brain CancerSri Priya Ponnapalli, Penelope Miron, Kristy L. S. Miskimen, Kristin A. Waite, Nadiya Sosonkina, Sara E. Coppens, Anthony C. Bryan, Estevan P. Kiernan, Huanming Yang, Jay Bowen, Ghunwa A. Nakouzi, Jill S. Barnholtz-Sloan, Andrew E. Sloan, Tiffany R. Hodges, Orly Alter. 116-118 [doi]
- Optimized Patient-Specific Catheter Placement for Convection-Enhanced Nanoparticle Delivery in Recurrent GlioblastomaChengyue Wu, David A. Hormuth, Chase Christenson, Ryan T. Woodall, Michael R. A. Abdelmalik, William T. Phillips, Thomas J. R. Hughes, Andrew J. Brenner, Thomas E. Yankeelov. 119-120 [doi]
- Entropy-Based Regularization on Deep Learning Models for Anti-Cancer Drug Response PredictionOleksandr Narykov, Yitan Zhu, Thomas S. Brettin, Yvonne A. Evrard, Alexander Partin, Maulik Shukla, Priyanka Vasanthakumari, James H. Doroshow, Rick Stevens. 121-122 [doi]
- Scalable Lead Prediction with Transformers using HPC resourcesArchit Vasan, Thomas S. Brettin, Rick Stevens, Arvind Ramanathan, Venkatram Vishwanath. 123 [doi]
- Automated Whole-Body Tumor Segmentation and Prognosis of Cancer on PET/CTKevin H. Leung. 124-133 [doi]
- Charliecloud's layer-free, Git-based container build cacheReid Priedhorsky, Jordan Ogas, Claude H. Davis IV, Z. Noah Hounshel, Ashlyn Lee, Benjamin Stormer, R. Shane Goff. 134-146 [doi]
- Understanding Energy Performance of Containers Deployment on HPC-Based post-Moore PlatformsPablo Josue Rojas Yepes, Carlos Jaime Barrios Hernández, Luiz Angelo Steffenel. 147-154 [doi]
- Perspectives and Experiences Supporting Containers for Research Computing at the Texas Advanced Computing CenterErik S. Ferlanti, William J. Allen, Ernesto A. B. F. Lima, Yinzhi Wang, John M. Fonner. 155-164 [doi]
- Survey of adaptive containerization architectures for HPCNina Mujkanovic, Juan José Durillo, Nicolay Hammer, Tiziano Müller. 165-176 [doi]
- Mixed-Precision S/DGEMM Using the TF32 and TF64 Frameworks on Low-Precision AI Tensor CoresPedro Valero-Lara, Ian Jorquera, Frank Lui, Jeffrey S. Vetter. 177-186 [doi]
- Mapping High-Level Concurrency from OpenMP and MPI to ThreadSanitizer FibersJoachim Jenke, Simon Schwitanski, Isabel Thärigen, Matthias S. Müller. 187-195 [doi]
- Rethinking Data Race Detection in MPI-RMA ProgramsRadjasouria Vinayagame, Emmanuelle Saillard, Samuel Thibault, Van Man Nguyen, Marc Sergent. 196-204 [doi]
- RMARaceBench: A Microbenchmark Suite to Evaluate Race Detection Tools for RMA ProgramsSimon Schwitanski, Joachim Jenke, Sven Klotz, Matthias S. Müller. 205-214 [doi]
- Data Race Detection Using Large Language ModelsLe Chen, Xianzhong Ding, Murali Emani, Tristan Vanderbruggen, Pei-Hung Lin, Chunhua Liao. 215-223 [doi]
- Towards Correctness Checking of MPI Partitioned Communication in MUSTSimon Schwitanski, Niko Sakic, Joachim Jenke, Felix Tomski, Marc-André Hermanns. 224-227 [doi]
- Adding Microbenchmarks with SIMD Data Races to DataRaceBenchJoachim Jenke, Kaloyan Ignatov, Simon Schwitanski. 228-229 [doi]
- Investigating the Real-World Applicability of MPI Correctness BenchmarksAlexander Hück, Tim Jammer, Joachim Jenke, Christian H. Bischof. 230-233 [doi]
- Improve and Stabilize the Classification Results of DataRaceBenchJoachim Jenke, Simon Schwitanski. 234-237 [doi]
- Highlighting PARCOACH Improvements on MBIPhilippe Virouleau, Emmanuelle Saillard, Marc Sergent, Pierre Lemarinier. 238-241 [doi]
- Democratizing HPC Access and Use with Knowledge GraphsPouya Kousha, Vivekananda Sathu, Matthew Lieber, Hari Subramoni, Dhabaleswar K. Panda 0001. 242-251 [doi]
- What Operations can be Performed Directly on Compressed Arrays, and with What Error?Tripti Agarwal, Harvey Dam, Ponnuswamy Sadayappan, Ganesh Gopalakrishnan, Dorra Ben Khalifa, Matthieu Martel. 252-262 [doi]
- Analyzing Impact of Data Reduction Techniques on Visualization for AMR Applications Using AMReX FrameworkDaoce Wang, Jesus Pulido, Pascal Grosset, Jiannan Tian, James P. Ahrens, Dingwen Tao. 263-271 [doi]
- LibPressio-Predict: Flexible and Fast Infrastructure For Inferring Compression PerformanceRobert Underwood, Sheng Di, Sian Jin, Md Hasanur Rahman, Arham Khan, Franck Cappello. 272-280 [doi]
- Lossy and Lossless Compression for BioFilm Optical Coherence Tomography (OCT)Max Henry Faykus, Jon Calhoun 0001, Melissa C. Smith. 281-288 [doi]
- Streaming Hardware Compressor Generator FrameworkKazutomo Yoshii, Tomohiro Ueno, Kentaro Sano, Antonino Miceli, Franck Cappello. 289-297 [doi]
- Fast 2D Bicephalous Convolutional Autoencoder for Compressing 3D Time Projection Chamber DataYi Huang, Yihui Ren, Shinjae Yoo, Jin Huang. 298-305 [doi]
- Teaching Heterogeneous and Parallel Computing with Google Colab and Raspberry Pi ClustersZhiguang Xu. 306-313 [doi]
- Faculty Development Workshops for Integrating PDC in Early Undergraduate Curricula: An Experience ReportDavid W. Brown, Sheikh K. Ghafoor, Mike Rogers, Ada Haynes. 314-323 [doi]
- Infrastructure for Writing Fork-Join TestsPrasun Dewan. 324-334 [doi]
- Data-Driven Discovery of Anchor Points for PDC ContentMatthew Mcquaigue, Erik Saule, Kalpathi R. Subramanian, Jamie Payton. 335-342 [doi]
- An NSF REU Site Based on Trust and Reproducibility of Intelligent Computation: Experience ReportMary W. Hall, Ganesh Gopalakrishnan, Eric Eide, Johanna Cohoon, Jeff M. Phillips, Mu Zhang 0001, Shireen Y. Elhabian, Aditya Bhaskara, Harvey Dam, Artem Yadrov, Tushar Kataria, Amir Mohammad Tavakkoli, Sameeran Joshi, Mokshagna Sai Teja Karanam. 343-349 [doi]
- AutoLearn: Learning in the Edge to Cloud ContinuumAlicia Esquivel Morel, William Fowler, Kate Keahey, Kyle Zheng, Michael Sherman, Richard Anderson. 350-356 [doi]
- Performance Engineering for Graduate Students: a View from AmsterdamAna Lucia Varbanescu, Stephen Nicholas Swatman, Anuj Pathania. 357-365 [doi]
- Peachy Parallel Assignments (EduHPC 2023)H. Martin Bücker, Jeremiah Corrado, Daniel Fedorin, Diego Garcia-Alvarez, Arturo González-Escribano, John Li, Maria Pantoja, Erik Pautsch, Marieke Plesske, Marcelo Ponce, Silvio Rizzi, Erik Saule, Johannes Schoder, George K. Thiruvathukal, Ramses van Zon, Wolf Weber, David P. Bunde. 366-373 [doi]
- EduHPC Lightning Talk SummaryMichael Alexander, Sanjukta Bhowmick, Befikir Bogale, Gilberto Díaz, Anne C. Elster, Danielle A. Ellsworth, Carlos Jaime Barrios Hernández, Evan Jaffe, Jack Marquez, Alison Melton, Aashish Pandey, Suzanne Parete-Koon, Nigel Tan, Michela Taufer, Verónica G. Melesse Vergara, Lauren Whitnah, George K. Thiruvathukal. 374-378 [doi]
- Uncertainty Quantification of Metal Additive Manufacturing Processing Conditions Through the use of Exascale ComputingRobert Carson, Matthew Rolchigo, John Coleman, Mikhail Titov, James F. Belak, Matt Bement. 379-383 [doi]
- Efficient Probabilistic Tuning of Ensemble Forecasting MethodAlessandro Fanfarillo, Nicholas Malaya, Guido Cervone, Luca Delle Monache. 384-386 [doi]
- Uncertainty Quantification of Reduced-Precision Time Series in Turbulent Channel FlowMartin Karp, Felix Liu, Ronith Stanly, Saleh Rezaeiravesh, Niclas Jansson, Philipp Schlatter, Stefano Markidis. 387-390 [doi]
- Optimized Uncertainty Estimation for Vision Transformers: Enhancing Adversarial Robustness and Performance Using Selective ClassificationErik Pautsch, John Li, Silvio Rizzi, George K. Thiruvathukal, Maria Pantoja. 391-394 [doi]
- Localization of Gamma-ray Bursts in a Balloon-Borne TelescopeYe Htet, Marion Sudvarg, Jeremy Buhler, Roger D. Chamberlain, James H. Buckley. 395-398 [doi]
- Automatic Search Guided Code Optimization Framework for Mixed-Precision Scientific ApplicationsJienan Yao, Wei Xue. 399-403 [doi]
- Using Mixed-Radix Decomposition to Enumerate Computational Resources of Deeply Hierarchical ArchitecturesPhilippe Swartvagher, Sascha Hunold, Jesper Larsson Träff, Ioannis Vardas. 404-415 [doi]
- Efficient data redistribution for malleable applicationsIker Martín-Álvarez, José Ignacio Aliaga, Maribel Castillo, Sergio Iserte. 416-426 [doi]
- Optimizing Irregular Communication with Neighborhood Collectives and Locality-Aware ParallelismGerald Collom, Rui Peng Li, Amanda Bienz. 427-437 [doi]
- Embedding Rust within Open MPIJake Tronge, Howard Pritchard. 438-447 [doi]
- OpenSHMEM Queues: An abstraction for enhancing message rate, bandwidth utilization, and reducing tail latency in OpenSHMEM ApplicationsVishwanath Venkatesan, Manjunath Gorentla Venkata. 448-457 [doi]
- A Statistical Analysis of HPC Network TuningDonald A. Kruse, Whit Schonbein, Matthew G. F. Dosanjh. 458-465 [doi]
- When to checkpoint at the end of a fixed-length reservation?Quentin Barbut, Anne Benoit, Thomas Hérault, Yves Robert, Frédéric Vivien. 466-476 [doi]
- Evaluating the Resiliency of Posits for Scientific ComputingBenjamin Schlueter, Jon Calhoun 0001, Alexandra Poulos. 477-487 [doi]
- Dynamic Selective Protection of Sparse Iterative Solvers via ML Prediction of Soft Error ImpactsZizhao Chen, Thomas Verrecchia, Hongyang Sun 0001, Joshua Dennis Booth, Padma Raghavan. 488-491 [doi]
- Optimizing Write Performance for Checkpointing to Parallel File Systems Using LSM-TreesSerdar Bulut, Steven A. Wright. 492-501 [doi]
- Disk Failure Trends in Alpine Storage SystemAnjus George, Jesse Hanley, Sarp Oral. 502-506 [doi]
- Recovering Detectable Uncorrectable Errors via Spatial Data PredictionKristen Guernsey, Sarah Placke, Alexandra Poulos, Jon Calhoun 0001. 507-515 [doi]
- Using Benford's Law to Identify Unusual Failure RegionsKurt B. Ferreira, Scott Levy. 516-519 [doi]
- Tydi-lang: A Language for Typed Streaming HardwareYongding Tian, Matthijs A. Reukers, Zaid Al-Ars, H. Peter Hofstee, Matthijs Brobbel, Johan Peltenburg, Jeroen van Straten. 520-529 [doi]
- Enabling Communication with FPGA-based Network-attached Accelerators for HPC WorkloadsSteffen Christgau, Dylan Everingham, Florian Mikolajczak, Niklas Schelten, Bettina Schnor, Max Schrötter, Benno Stabernack, Fritjof Steinert. 530-538 [doi]
- OctoRay: Framework for Scalable FPGA Cluster Acceleration of Python Big Data ApplicationsZaid Al-Ars, Jakoba Petri-Koenig, Joost Hoozemans, Luc Dierick, H. Peter Hofstee. 539-546 [doi]
- Altis-SYCL: Migrating Altis Benchmarking Suite from CUDA to SYCL for GPUs and FPGAsChristoph Weckert, Leonardo Solis-Vasquez, Julian Oppermann, Andreas Koch 0001, Oliver Sinnen. 547-555 [doi]
- Stencil-HMLS: A multi-layered approach to the automatic optimisation of stencil codes on FPGAGabriel Rodriguez-Canal, Nick Brown 0002, Maurice Jamieson, Emilien Bauer, Anton Lydike, Tobias Grosser. 556-565 [doi]
- Report on Adaptable Open-Source Disaster Recovery Solution for Multi-Petabyte Storage SystemsHonwai Leong. 566-572 [doi]
- Self-service Monitoring of HPC and Openstack Jobs for UsersSimon Guilbault. 573-580 [doi]
- Heterogeneous Syslog Analysis: There Is HopeAndres Quan, Leah Howell, Hugh Greenberg. 581-587 [doi]
- Overcoming Active Directory Woes with Plain Text Caches and Replacing PasswordsJason St. John, Alex Younts. 588-590 [doi]
- ICE 2.0: Restructuring and Growing an Instructional HPC ClusterJ. Eric Coulter, Michael D. Weiner, Aaron Jezghani, Matthew Guidry, Rubén Lara, Fang (Cherry) Liu, Allan Metts, Ronald Rahaman, Kenneth J. Suda, Peter Wan, Gregory Willcox, Deirdre Womack, Dan Zhou. 591-597 [doi]
- Ramble: A flexible, extensible, and composable experimentation frameworkDouglas Jacobsen, Robert F. Bird. 598-608 [doi]
- Principles for Automated and Reproducible BenchmarkingTuomas Koskela, Ilektra Christidi, Mosè Giordano, Emily Dubrovska, Jamie Quinn, Christopher Maynard, Dave Case, Kaan Olgu, Tom Deakin. 609-618 [doi]
- Experiences Detecting Defective Hardware in Exascale SupercomputersNick Hagerty, Jordan Webb, Verónica G. Melesse Vergara, Matt Ezell. 619-626 [doi]
- Towards Collaborative Continuous Benchmarking for HPCOlga Pearce, Alec Scott, Gregory Becker, Riyaz Haque, Nathan Hanford, Stephanie Brink, Doug Jacobsen, Heidi Poxon, Jens Domke, Todd Gamblin. 627-635 [doi]
- Maximizing Data Utility for HPC Python Workflow ExecutionThanh Son Phung, Ben Clifford, Kyle Chard, Douglas Thain. 636-640 [doi]
- WisdomWombat: A polyglot dataflow CFD code using Python and DragonJulius Donnert, Lindsey Gordon, Christopher Nolting, Peter Mendygral. 641-643 [doi]
- Demonstration of Portable Performance of Scientific Machine Learning on High Performance Computing SystemsKhalid Hossain, Riccardo Balin, Corey Adams, Thomas D. Uram, Kalyan Kumaran, Venkatram Vishwanath, Tanima Dey, Subrata Goswami, Janghaeng Lee, Rebecca Ramer, Koichi Yamada. 644-647 [doi]
- Dragon Proxy Runtimes and Multi-system WorkflowsNick Radcliffe, Kent Lee, Pete Mendygral. 648-651 [doi]
- CaRV - Accelerating Program Optimization through Capture, Replay, ValidateParinaz Barakhshan, Rudolf Eigenmann. 652-662 [doi]
- REMORA Resource Monitor: Usability, Performance and User Interface ImprovementsChun-Yaung Lu, Kent F. Milfeld. 663-672 [doi]
- BaRRT: Buildtime and Runtime Reproducibility Tool for Software Development and TestingSamuel Khuvis. 673-676 [doi]
- PEAK: a Light-Weight Profiler for HPC SystemsYinzhi Wang, Junjie Li. 677-680 [doi]
- PTI-GPU: Kernel Profiling and Assessment on Intel GPUsMariam Umar, Maxwell Jong. 681-684 [doi]
- ZeroSum: User Space Monitoring of Resource Utilization and Contention on Heterogeneous HPC SystemsKevin A. Huck, Allen D. Malony. 685-695 [doi]
- msr-genie: Navigating Model Specific Registers Across Processor GenerationsKyle Fan, Barry Rountree, Tapasya Patki, Aniruddha Marathe, Stephanie Brink, Duncan McFarlane, Eric Green, Kathleen Shoga. 696-703 [doi]
- Centralized provisioning of large language models for a research communityDhruvil Shah, Gil Speyer, Jason Yalim. 704-707 [doi]
- A Fast and Responsive Web-based Framework for Visualizing HPC Application UsageVed Arora, Nayeli Gurrola, Amiya Maji, Guangzhen Jin. 708-711 [doi]
- NPAT - A Power Analysis Tool at NERSCAndy Zhang, Sridutt Bhalachandra, Siqi Deng, Zhengji Zhao. 712-719 [doi]
- Introducing Open OnDemand to Supercomputer FugakuMasahiro Nakao, Hidetomo Kaneyama, Masaru Nagaku, Ikki Fujiwara, Atsuko Takefusa, Shin'ichi Miura, Keiji Yamamoto. 720-727 [doi]
- Towards A Massive-scale Distributed Neighborhood Graph ConstructionKeita Iwabuchi, Trevor Steil, Benjamin Priest, Roger Pearce, Geoffrey Sanders. 728-738 [doi]
- A Parallel Algorithm for Updating a Multi-objective Shortest Path in Large Dynamic NetworksArindam Khanda, S. M. Shovan, Sajal K. Das 0001. 739-746 [doi]
- cuAlign: Scalable Network Alignment on GPU AcceleratorsLizhi Xiang, Arif M. Khan, S. M. Ferdous, Sr Aravind, Mahantesh Halappanavar. 747-755 [doi]
- A New Sparse GEneral Matrix-matrix Multiplication Method for Long Vector Architecture by Hierarchical Row MergingHikaru Takayashiki, Hotaka Yagi, Hiroki Nishimoto, Naoki Yoshifuji. 756-759 [doi]
- TANGO: A GPU optimized traceback approach for sequence alignment algorithmsLeAnn Lindsey, Muhammad Haseeb, Hari Sundar, Muaaz Gul Awan. 760-765 [doi]
- Accelerating Deep Neural Network guided MCTS using Adaptive ParallelismYuan Meng, Qian Wang, Tianxin Zu, Viktor K. Prasanna. 766-769 [doi]
- Filtering Wasteful Vertex Visits in Breadth-First SearchPrachatos Mitra, Alexandros Daglis. 770-773 [doi]
- Experimental Study of TCP Throughput Profiles and Dynamics Over Dedicated ConnectionsNageswara Rao. 774-784 [doi]
- Evaluation of SCION for User-driven Path Control: a Usability StudyAntonio Battipaglia, Leonardo Boldrini, Ralph Koning, Paola Grosso. 785-794 [doi]
- Throughput Optimization with a NUMA-Aware Runtime System for Efficient Scientific Data StreamingHasibul Jamil, Joaquín Chung, Tekin Bicer, Tevfik Kosar, Rajkumar Kettimuthu. 795-805 [doi]
- Elephants Sharing the Highway: Studying TCP Fairness in Large Transfers over High Throughput LinksImtiaz Mahmud, George Papadimitriou 0002, Cong Wang, Mariam Kiran, Anirban Mandal, Ewa Deelman. 806-818 [doi]
- Enhancing perfSONAR Measurement Capabilities using P4 Programmable Data PlanesAli Mazloum, Jose Gomez, Elie F. Kfoury, Jorge Crichigno. 819-829 [doi]
- Dask-Extended External Tasks for HPC/ML In transit WorkflowsAmal Gueroudji, Julien Bigot, Bruno Raffin, Robert Ross. 830-838 [doi]
- A gem5 Implementation of the Sequential Codelet Model: Reducing Overhead and Expanding the Software Memory InterfaceDawson Fox, Jose Manuel Monsalve Diaz, Xiaoming Li. 839-846 [doi]
- MPI-xCCL: A Portable MPI Library over Collective Communication Libraries for Various AcceleratorsChen-Chun Chen, Kawthar Shafie Khorassani, Pouya Kousha, Qinghua Zhou, Jinghan Yao, Hari Subramoni, Dhabaleswar K. Panda 0001. 847-854 [doi]
- Trigger Smart Data Saving Applied to CO2 Capture in Metal-Organic FrameworksEstelle Dirand, Ali Asad, Yann Magnin. 855-861 [doi]
- Scaling Computational Fluid Dynamics: In Situ Visualization of NekRS using SENSEIVictor A. Mateevitsi, Mathis Bode, Nicola J. Ferrier, Paul F. Fischer, Jens Henrik Göbbert, Joseph A. Insley, Yu-Hsiang Lan, Misun Min, Michael E. Papka, Saumil Patel, Silvio Rizzi, Jonathan Windgassen. 862-867 [doi]
- Extensions to the SENSEI In situ Framework for Heterogeneous ArchitecturesBurlen Loring, E. Wes Bethel, Gunther H. Weber, Michael W. Mahoney. 868-874 [doi]
- OpenMP Kernel Language Extensions for Performance Portable GPU CodesShilei Tian, Tom Scogland, Barbara M. Chapman, Johannes Doerfert. 875-883 [doi]
- DPU Offloading Programming with the OpenMP APIMuhammad Usman, Sergio Iserte, Roger Ferrer, Antonio José Peña. 884-891 [doi]
- Precision and Performance Analysis of C Standard Math Library Functions on GPUsAnton Rydahl, Joseph Huber, Ethan Luis Mcdonough, Johannes Doerfert. 892-903 [doi]
- Fortran performance optimisation and auto-parallelisation by leveraging MLIR-based domain specific abstractions in FlangNick Brown 0002, Maurice Jamieson, Anton Lydike, Emilien Bauer, Tobias Grosser. 904-913 [doi]
- An Analysis of Graph Neural Network Memory Access PatternsSatoshi Iwata, Remzi H. Arpaci-Dusseau, Akihiko Kasagi. 914-921 [doi]
- An Efficient Distributed Graph Engine for Deep Learning on GraphsGangda Deng, Ömer Faruk Akgül, Hongkuan Zhou, Hanqing Zeng, Yinglong Xia, Jianbo Li, Viktor Prasanna. 922-931 [doi]
- Addressing Stale Gradients in Scalable Federated Deep Reinforcement LearningJustin Stanley, Ali Jannesari. 932-940 [doi]
- DDStore: Distributed Data Store for Scalable Training of Graph Neural Networks on Large Atomistic Modeling DatasetsJong Youl Choi, Massimiliano Lupo Pasini, Pei Zhang, Kshitij Mehta, Frank Liu, Jonghyun Bae, Khaled Ibrahim. 941-950 [doi]
- HPC-GPT: Integrating Large Language Model for High-Performance ComputingXianzhong Ding, Le Chen, Murali Emani, Chunhua Liao, Pei-Hung Lin, Tristan Vanderbruggen, Zhen Xie, Alberto Cerpa, Wan Du. 951-960 [doi]
- GPU Graph Processing on CXL-Based Microsecond-Latency External MemoryShintaro Sano, Yosuke Bando, Kazuhiro Hiwada, Hirotsugu Kajihara, Tomoya Suzuki, Yu Nakanishi, Daisuke Taki, Akiyuki Kaneko, Tatsuo Shiozawa. 961-972 [doi]
- Dynamic Memory Provisioning on Disaggregated HPC SystemsFelippe Vieira Zacarias, Paul M. Carpenter, Vinicius Petrucci. 973-982 [doi]
- CXL Memory as Persistent Memory for Disaggregated HPC: A Practical ApproachYehonatan Fridman, Suprasad Mutalik Desai, Navneet Singh, Thomas Willhalm, Gal Oren 0001. 983-994 [doi]
- Accelerating In Situ Analysis using Non-volatile MemoryDeepak B. Hegde, Preeti Malakar. 995-1004 [doi]
- Performance Portability Evaluation of Blocked Stencil Computations on GPUsOscar Antepara, Samuel Williams 0001, Hans Johansen, Tuowen Zhao, Samantha Hirsch, Priya Goyal, Mary W. Hall. 1005-1018 [doi]
- Many Cores, Many Models: GPU Programming Model vs. Vendor Compatibility OverviewAndreas Herten. 1019-1026 [doi]
- Benchmarking a portable lattice quantum chromodynamics kernel written in Kokkos and MPISimon Schlepphorst, Stefan Krieg. 1027-1037 [doi]
- Evaluating the performance portability of SYCL across CPUs and GPUs on bandwidth-bound applicationsIstván Z. Reguly. 1038-1047 [doi]
- Porting Batched Iterative Solvers onto Intel GPUs with SYCLPhuong Nguyen, Pratik Nayak, Hartwig Anzt. 1048-1058 [doi]
- Evaluating the Performance of One-sided Communication on CPUs and GPUsNan Ding 0006, Muhammad Haseeb, Taylor L. Groves, Samuel Williams 0001. 1059-1069 [doi]
- Performance Portability of Programming Strategies for Nearest-Neighbor Communication with GPU-Aware MPIJames Buford White III. 1070-1080 [doi]
- MatRIS: Multi-level Math Library Abstraction for Heterogeneity and Performance Portability using IRIS RuntimeMohammad Alaul Haque Monil, Narasinga Rao Miniskar, Keita Teranishi, Jeffrey S. Vetter, Pedro Valero-Lara. 1081-1092 [doi]
- CuPBoP-AMD: Extending CUDA to AMD PlatformsJun Chen, Xule Zhou, Hyesoon Kim. 1093-1104 [doi]
- High-level GPU code: a case study examining JAX and OpenMPNestor Demeure, Theodore Kisner, Reijo Keskitalo, Rollin C. Thomas, Julian Borrill, Wahid Bhimji. 1105-1113 [doi]
- A Performance-Portable SYCL Implementation of CRK-HACC for ExascaleEsteban Miguel Rangel, Simon John Pennycook, Adrian Pope, Nicholas Frontiere, Zhiqiang Ma, Varsha Madananth. 1114-1125 [doi]
- Performance Evaluation of Heterogeneous GPU Programming Frameworks for Hemodynamic SimulationsAristotle X. Martin, Geng Liu, William Ladd, Seyong Lee, John Gounley, Jeffrey Vetter, Saumil Patel, Silvio Rizzi, Victor A. Mateevitsi, Joseph A. Insley, Amanda Randles. 1126-1137 [doi]
- Implementing scalable matrix-vector products for the exact diagonalization methods in quantum many-body physicsTom Westerhout, Bradford L. Chamberlain. 1138-1150 [doi]
- Design and Analysis of the Network Software Stack of an Asynchronous Many-task System - The LCI parcelport of HPXJiakun Yan, Hartmut Kaiser, Marc Snir. 1151-1161 [doi]
- High-Performance Programming and Execution of a Coral Biodiversity Mapping Algorithm Using ChapelScott Bachman, Rebecca Green, Anna Bakker, Helen Fox, Sam Purkis, Ben Harshbarger. 1162-1170 [doi]
- symPACK: A GPU-Capable Fan-Out Sparse Cholesky SolverJulian Bellavita, Mathias Jacquelin, Esmond G. Ng, Dan Bonachea, Johnny Corbino, Paul H. Hargrove. 1171-1184 [doi]
- shmem4py: High-Performance One-Sided Communication for Python ApplicationsMarcin Rogowski, Jeff R. Hammond, David E. Keyes, Lisandro Dalcín. 1185-1193 [doi]
- GrIOt: Graph-based Modeling of HPC Application I/O Call Stacks for Predictive PrefetchLouis-Marie Nicolas, Salim Mimouni, Philippe Couvée, Jalil Boukhobza. 1194-1201 [doi]
- PoliMOR: A Policy Engine "Made-to-Order" for Automated and Scalable Data Management in LustreAnjus George, Christopher Brumgard, Rick Mohr, Ketan Maheshwari, James Simmons, Sarp Oral, Jesse Hanley. 1202-1208 [doi]
- IOMax: Maximizing Out-of-Core I/O Analysis Performance on HPC SystemsIzzet Yildirim, Hariharan Devarajan, Anthony Kougkas, Xian-He Sun, Kathryn M. Mohror. 1209-1215 [doi]
- The I/O Trace Initiative: Building a Collaborative I/O Archive to Advance HPCNafiseh Moti, André Brinkmann, Marc-André Vef, Philippe Deniel, Jesús Carretero 0001, Philip H. Carns, Jean-Thomas Acquaviva, Reza Salkhordeh. 1216-1222 [doi]
- Enhancing Metadata Transfer Efficiency: Unlocking the Potential of DAOS in the ADIOS contextRanjan Sarpangala Venkatesh, Greg Eisenhauer, Scott Klasky, Ada Gavrilovska. 1223-1228 [doi]
- Physical Oscillator Model for SupercomputingAyesha Afzal, Georg Hager, Gerhard Wellein. 1229-1235 [doi]
- Comparative evaluation of bandwidth-bound applications on the Intel Xeon CPU MAX SeriesIstván Z. Reguly. 1236-1244 [doi]
- SPEChpc 2021 Benchmarks on Ice Lake and Sapphire Rapids Infiniband Clusters: A Performance and Energy Case StudyAyesha Afzal, Georg Hager, Gerhard Wellein. 1245-1254 [doi]
- Reducing Memory Requirements for the IPU using Butterfly FactorizationsS. Kazem Shekofteh, Christian Alles, Holger Fröning. 1255-1263 [doi]
- Verifying Performance Guidelines for MPI Collectives at ScaleSascha Hunold. 1264-1268 [doi]
- A Performance Model for Estimating the Cost of Scaling to Practical Quantum AdvantageDaan Camps, Katherine Klymko, Brian Austin, Nicholas J. Wright. 1269-1273 [doi]
- Hardware Specialization: Estimating Monte Carlo Cross-Section Lookup Kernel Performance and AreaKazutomo Yoshii, John R. Tramm, Bryce Allen, Tomohiro Ueno, Kentaro Sano, Andrew R. Siegel, Pete Beckman. 1274-1278 [doi]
- Power Analysis of NERSC Production WorkloadsZhengji Zhao, Ermal Rrapaj, Sridutt Bhalachandra, Brian Austin, Hai Ah Nam, Nicholas Wright. 1279-1287 [doi]
- Adaptive Stopping Rule for Performance MeasurementsViyom Mittal, Pedro Bruel, Dejan S. Milojicic, Eitan Frachtenberg. 1288-1297 [doi]
- Latency and Bandwidth Microbenchmarks of US Department of Energy Systems in the June 2023 Top 500 ListChristopher M. Siefert, Carl Pearson, Stephen L. Olivier, Andrey Prokopenko, Jonathan Hu, Timothy J. Fuller. 1298-1305 [doi]
- Risk-Aware Scheduling Algorithms for Variable Capacity ResourcesLucas Perotin, Chaojie Zhang, Rajini Wijayawardana, Anne Benoit, Yves Robert, Andrew A. Chien. 1306-1315 [doi]
- A Reinforcement Learning Based Backfilling Strategy for HPC Batch JobsElliot Kolker-Hicks, Di Zhang, Dong Dai 0001. 1316-1323 [doi]
- Evaluating the Potential of Elastic Jobs in HPC SystemsDavid Eberius, Md. Wasi-ur-Rahman, David Ozog. 1324-1333 [doi]
- Modelling Data Locality of Sparse Matrix-Vector Multiplication on the A64FXSergej Breiter, James D. Trotter, Karl Fürlinger. 1334-1342 [doi]
- Extra-Deep: Automated Empirical Performance Modeling for Distributed Deep LearningMarcus Ritter, Felix Wolf 0001. 1343-1356 [doi]
- An Event Model for Trace-Based Performance Analysis of MPI Partitioned Point-to-Point CommunicationIsabel Thärigen, Marc-André Hermanns, Markus Geimer. 1357-1367 [doi]
- Filtering and Ranking of Code Regions for Parallelization via Hotspot Detection and OpenMP Overhead AnalysisSeyed Ali Mohammadi, Lukas Rothenberger, Gustavo de Morais, Bertin Nico Görlich, Erik Lille, Hendrik Rüthers, Felix Wolf 0001. 1368-1379 [doi]
- Enabling Agile Analysis of I/O Performance Data with PyDarshanJakob Lüttgau, Shane Snyder, Tyler Reddy, Nikolaus Awtrey, Kevin Harms, Jean Luca Bez, Rui Wang, Robert Latham, Philip H. Carns. 1380-1391 [doi]
- GPUscout: Locating Data Movement-related Bottlenecks on GPUsSoumya Sen, Stepan Vanecek, Martin Schulz. 1392-1402 [doi]
- FROOM: A Framework of Operators for OTF2 ModificationJan Frenzel, Apurv Deepak Kulkarni, Sebastian Döbel, Bert Wesarg, Maximilian Knespel, Holger Brunst. 1403-1411 [doi]
- Using Azure Quantum Resource Estimator for Assessing Performance of Fault Tolerant Quantum ComputationWim van Dam, Mariia Mykhailova, Mathias Soeken. 1412-1419 [doi]
- A Reference Implementation for a Quantum Message Passing InterfaceYue Shi, Tommy Nguyen, Samuel Alexander Stein, Tim Stavenger, Marvin Warner, Martin Roetteler, Torsten Hoefler, Ang Li. 1420-1425 [doi]
- TISCC: A Surface Code Compiler and Resource Estimator for Trapped-Ion ProcessorsTyler LeBlond, Ryan S. Bennink, Justin G. Lietz, Christopher M. Seck. 1426-1435 [doi]
- BGLS: A Python Package for the Gate-by-Gate Sampling Algorithm to Simulate Quantum CircuitsAlex Shapiro, Ryan LaRose. 1436-1442 [doi]
- Fast Simulation of High-Depth QAOA CircuitsDanylo Lykov, Ruslan Shaydulin, Yue Sun, Yuri Alexeev, Marco Pistoia. 1443-1451 [doi]
- MEMQSim: Highly Memory-Efficient and Modularized Quantum State-Vector SimulationBoyuan Zhang, Bo Fang, Qiang Guan, Ang Li, Dingwen Tao. 1452-1453 [doi]
- JuliQAOA: Fast, Flexible QAOA SimulationJohn K. Golden, Andreas Bärtschi, Dan O'Malley, Elijah Pelofske, Stephan J. Eidenbenz. 1454-1459 [doi]
- Enabling Scalable VQE Simulation on Leading HPC SystemsMeng Wang, Fei Hua, Chenxu Liu, Nicholas P. Bauman, Karol Kowalski, Daniel Claudino, Travis S. Humble, Prashant Nair, Ang Li 0006. 1460-1467 [doi]
- QASMTrans: A QASM Quantum Transpiler Framework for NISQ DevicesFei Hua, Meng Wang, Gushu Li, Bo Peng, Chenxu Liu, Muqing Zheng, Samuel Alexander Stein, Yufei Ding, Eddy Z. Zhang, Travis Humble, Ang Li 0006. 1468-1477 [doi]
- Enabling Quantum Computer Simulations on AMD GPUs: a HIP Backend for Google's qsimStefano Markidis. 1478-1486 [doi]
- QArchSearch: A Scalable Quantum Architecture Search PackageAnkit Kulshrestha, Ilya Safro, Yuri Alexeev. 1487-1491 [doi]
- An Ising-based Model for Qubit MappingHayato Ushijima-Mwesigwa, Xiaoyuan Liu 0001. 1492-1498 [doi]
- Prototype of a Batched Quantum Circuit Simulator for the Vector EngineKeichi Takahashi, Toshio Mori, Hiroyuki Takizawa. 1499-1505 [doi]
- Sunfish: An Open Centralized Composable HPC Management FrameworkPhil Cayton, Michael Aguilar, Christian Pinto. 1506-1511 [doi]
- RISA: Round-Robin Intra-Rack Friendly Scheduling Algorithm for Disaggregated DatacentersRashadul Kabir, Ryan Gary Kim, Mahdi Nikdast. 1512-1520 [doi]
- Automatic Generation of Micro-kernels for Performance Portability of Matrix Multiplication on RISC-V Vector ProcessorsFrancisco D. Igual, Luis Piñuel, Sandra Catalán, Héctor Martínez, Adrián Castelló 0001, Enrique S. Quintana-Ortí. 1521-1532 [doi]
- Evaluating HPX and Kokkos on RISC-V using an astrophysics application Octo-TigerPatrick Diehl, Gregor Daiß, Steven R. Brandt, Alireza Kheirkhahan, Hartmut Kaiser, Christopher Taylor, John Leidel. 1533-1542 [doi]
- Short Reasons for Long Vectors in HPC CPUs: A Study Based on RISC-VPablo Vizcaino, Georgios Ieronymakis, Nikolaos Dimou, Vassilis Papaefstathiou, Jesús Labarta, Filippo Mantovani. 1543-1549 [doi]
- Challenges and Opportunities in the Co-design of Convolutions and RISC-V Vector ProcessorsSonia Rani Gupta, Nikela Papadopoulou, Miquel Pericàs. 1550-1556 [doi]
- An Empirical Comparison of the RISC-V and AArch64 Instruction SetsDaniel Weaver, Simon McIntosh-Smith. 1557-1565 [doi]
- Is RISC-V ready for HPC prime-time: Evaluating the 64-core Sophon SG2042 RISC-V CPUNick Brown 0002, Maurice Jamieson, Joseph K. L. Lee, Paul Wang. 1566-1574 [doi]
- RDARuntime: An OS for AI AcceleratorsBenjamin Glick, Arjun Sabnis, Renate Kempf, Arnav Goel, Aarti Lalwani, Guoyao Feng, Kiran Ranganath. 1575-1587 [doi]
- GPU Acceleration in Unikernels Using Cricket GPU VirtualizationNiklas Eiling, Martin Kröning, Jonathan Klimt, Philipp Fensch, Stefan Lankes, Antonello Monti. 1588-1595 [doi]
- CARAT KOP: Towards Protecting the Core HPC Kernel from Linux Kernel ModulesThomas Filipiuk, Nick Wanninger, Nadharm Dhiantravan, Carson Surmeier, Alex Bernat, Peter A. Dinda. 1596-1605 [doi]
- Fine-grained accelerator partitioning for Machine Learning and Scientific Computing in Function as a Service PlatformAditya Dhakal, Philipp Raith, Logan T. Ward, Rolando P. Hong Enriquez, Gourav Rattihalli, Kyle Chard, Ian T. Foster, Dejan S. Milojicic. 1606-1613 [doi]
- Analysis and Characterization of Performance Variability for OpenMP RuntimeMinyu Cui, Nikela Papadopoulou, Miquel Pericàs. 1614-1622 [doi]
- Vertical Scaling of Variational Multiscale Modeling for Fluid Dynamics: Successes, Challenges, and OpportunitiesChristopher J. Coley. 1623-1634 [doi]
- FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated CodeSanil Rao, Mohammad Alaul Haque Monil, Het Mankad, Jeffrey Vetter, Franz Franchetti. 1635-1641 [doi]
- Value-Based Resource Management at SoC ScaleSerhan Gener, Md Sahil Hassan, Ali Akoglu. 1642-1650 [doi]
- CHARM-SYCL: New Unified Programming Environment for Multiple Accelerator TypesNorihisa Fujita, Beau Johnston, Ryohei Kobayashi, Keita Teranishi, Seyong Lee, Taisuke Boku, Jeffrey S. Vetter. 1651-1661 [doi]
- Accelerator integration in a tile-based SoC: lessons learned with a hardware floating point compression engineXueyang Liu, Patricia Gonzalez-Guerrero, Ivy Peng, Ronald G. Minnich, Maya B. Gokhale. 1662-1669 [doi]
- GPU-based LU Factorization and Solve on Batches of Matrices with Band StructureAhmad Abdelfattah, Stanimire Tomov, Piotr Luszczek, Hartwig Anzt, Jack J. Dongarra. 1670-1679 [doi]
- Task-Based Polar Decomposition Using SLATE on Massively Parallel Systems with Hardware AcceleratorsDalal Sukkari, Mark Gates, Mohammed A. Al Farhan, Hartwig Anzt, Jack J. Dongarra. 1680-1687 [doi]
- Advancing the distributed Multi-GPU ChASE library through algorithm optimization and NCCL libraryXinzhe Wu, Edoardo Di Napoli. 1688-1696 [doi]
- Moment Representation of Regularized Lattice Boltzmann Methods on NVIDIA and AMD GPUsPedro Valero-Lara, Jeffrey S. Vetter, John Gounley, Amanda Randles. 1697-1704 [doi]
- Optimization of Ported CFD Kernels on Intel Data Center GPU Max 1550 using oneAPI ESIMDMohammad Zubair, Aaron Walden, Gabriel Nastac, Eric J. Nielsen, Christoph Bauinger, Xiao Zhu. 1705-1712 [doi]
- Massively Distributed Finite-Volume Flux ComputationRyuichi Sai, Mathias Jacquelin, François P. Hamon, Mauricio Araya-Polo, Randolph R. Settgast. 1713-1720 [doi]
- Parallel Symbolic Cholesky FactorizationTobias Ribizel, Hartwig Anzt. 1721-1727 [doi]
- Checkpoint/Restart for CUDA KernelsNiklas Eiling, Stefan Lankes, Antonello Monti. 1728-1737 [doi]
- Implementation-Oblivious Transparent Checkpoint-Restart for MPIYao Xu, Leonid Belyaev, Twinkle Jain, Derek Schafer, Anthony Skjellum, Gene Cooperman. 1738-1747 [doi]
- Asynchronous Multi-Level Checkpointing: An Enabler of Reproducibility using Checkpoint History AnalyticsKevin Assogba, Bogdan Nicolae, Hubertus van Dam, M. Mustafa Rafique. 1748-1756 [doi]
- Benchmarking and In-depth Performance Study of Large Language Models on Habana Gaudi ProcessorsChengming Zhang 0006, Baixi Sun, Xiaodong Yu 0001, Zhen Xie, Weijian Zheng, Kamil A. Iskra, Pete Beckman, Dingwen Tao. 1757-1766 [doi]
- Pareto Optimization of CNN Models via Hardware-Aware Neural Architecture Search for Drainage Crossing Classification on Resource-Limited DevicesYuke Li, Jiwon Baik, Md Marufi Rahman, Iraklis Anagnostopoulos, Ruopu Li, Tong Shu. 1767-1775 [doi]
- Short Paper: Accelerating Hyperparameter Optimization Algorithms with Mixed PrecisionMarcel Aach, Rakesh Sarma, Eray Inanc, Morris Riedel, Andreas Lintermann. 1776-1779 [doi]
- Accuracy-Constrained Efficiency Optimization and GPU Profiling of CNN Inference for Detecting Drainage Crossing LocationsYicheng Zhang, Dhroov Pandey, Di Wu, Turja Kundu, Ruopu Li, Tong Shu. 1780-1788 [doi]
- Domain-Specific Energy Modeling for Drug Discovery and Magnetohydrodynamics ApplicationsLorenzo Carpentieri, Marco D'Antonio, Kaijie Fan, Luigi Crisci, Biagio Cosenza, Federico Ficarelli, Daniele Cesarini, Gianmarco Accordi, Davide Gadioli, Gianluca Palermo, Peter Thoman, Philip Salzmann, Philipp Gschwandtner, Markus Wippler, Filippo Marchetti, Daniele Gregori, Andrea Rosario Beccari. 1789-1800 [doi]
- An End-to-End HPC Framework for Dynamic Power ObjectivesDaniel Curtis Wilson, Fatih Acun, Siddhartha Jana, Federico Ardanaz, Jonathan M. Eastep, Ioannis Ch. Paschalidis, Ayse K. Coskun. 1801-1811 [doi]
- PM100: A Job Power Consumption Dataset of a Large-scale Production HPC SystemFrancesco Antici, Mohsen Seyedkazemi Ardebili, Andrea Bartolini, Zeynep Kiziltan. 1812-1819 [doi]
- Augmenting ML-based Predictive Modelling with NLP to Forecast a Job's Power ConsumptionFrancesco Antici, Keiji Yamamoto, Jens Domke, Zeynep Kiziltan. 1820-1830 [doi]
- Automatic Energy-Efficient Job Scheduling in HPC: A Novel SLURM Plugin ApproachAnders Aaen Springborg, Michele Albano, Samuel Xavier de Souza. 1831-1838 [doi]
- Energy consumption comparison of parallel linear systems solver algorithms on HPC infrastructureSofia Montebugnoli, Anna Ciampolini. 1839-1848 [doi]
- Analyzing the Performance Impact of HPC Workloads with Gramine+SGX on 3rd Generation Xeon Scalable ProcessorsShinobu Miwa, Shin'ichiro Matsuo. 1849-1858 [doi]
- Reducing HPC energy footprint for large scale GPU accelerated workloadsGabriel Hautreux, Etienne Malaboeuf. 1859-1865 [doi]
- Emissions and energy efficiency on large-scale high performance computing facilities: ARCHER2 UK national supercomputing service case studyAdrian Jackson, Alan Simpson, Andrew Turner. 1866-1870 [doi]
- Energy Efficiency of Quantum Statevector Simulation at ScaleJakub Adamski, James P. Richings, Oliver Thomson Brown. 1871-1875 [doi]
- Sustainability in HPC: Vision and OpportunitiesMohak Chadha, Eishi Arima, Amir Raoofy, Michael Gerndt, Martin Schulz 0001. 1876-1880 [doi]
- Accurate Measurement of Application-level Energy Consumption for Energy-Aware Large-Scale SimulationsOsman Seckin Simsek, Jean-Guillaume Piccinali, Florina M. Ciorba. 1881-1884 [doi]
- Evaluating Total Environmental Impact for a Computing InfrastructureAdrian Jackson, Jon Hays, Alex Owen, Nicholas Walton, Alison Packer, Anish Mudaraddi. 1885-1889 [doi]
- Comparing Power Signatures of HPC Workloads: Machine Learning vs SimulationAnish Govind, Sridutt Bhalachandra, Zhengji Zhao, Ermal Rrapaj, Brian Austin, Hai Ah Nam. 1890-1893 [doi]
- ReAPER: Region Aware Power and Energy RegulatorAnna Yue, Torsten Wilde, Steven Martin, Pen-Chung Yew, Sanyam Mehta. 1894-1897 [doi]
- Porting and optimizing Meso-NH to AMD MI250X GPUsNaima Alaoui Ismaili, Philippe Wautelet, Juan Escobar Munoz, Gabriel Hautreux. 1898-1905 [doi]
- Comparing a Naive and a Tree-Based N-Body Algorithm using Different Standard SYCL Implementations on Various HardwareTim Thüring, Marcel Breyer, Dirk Pflüger. 1906-1917 [doi]
- Specialized Kernels for Optimizing GPU Offload in OpenMPDhruva Chakrabarti, Gregory Rodgers, Carlo Bertolli, Gheorghe-Teodor Bercea, Jan-Patrick Lehr, Lynd Stringer, Jan Leyonberg, Dan Palermo, Ron Lieberman. 1918-1928 [doi]
- Analysis of MURaM, a Solar Physics Application, for Scalability, Performance and PortabilityEric Wright, Cena Brown, Damien Przybylski, Matthias Rempel, Supreeth Suresh, Sunita Chandrasekaran. 1929-1938 [doi]
- Performance-Portable GPU Acceleration of the EFIT Tokamak Plasma Equilibrium Reconstruction CodeOscar Antepara, Samuel Williams, Scott Kruger, Torrin Bechtel, Joseph McClenaghan, Lang Lao. 1939-1948 [doi]
- Characterizing the Performance of Triangle Counting on Graphcore's IPU ArchitectureReet Barik, Siddhisanket Raskar, Murali Emani, Venkatram Vishwanath. 1949-1957 [doi]
- Memory Transfer Decomposition: Exploring Smart Data Movement Through Architecture-Aware StrategiesDiego A. Roa Perdomo, Rodrigo Ceccato, Rémy Neveu, Hervé Yviquel, Xiaoming Li, Jose M. Monsalve Diaz, Johannes Doerfert. 1958-1967 [doi]
- Accelerating Data-Intensive Seismic Research Through Parallel Workflow Optimization and Federated CyberinfrastructureMarcus Adair, Ivan Rodero, Manish Parashar, Diego Melgar. 1968-1977 [doi]
- TaskVine: Managing In-Cluster Storage for High-Throughput Data Intensive WorkflowsBarry Sly-Delgado, Thanh Son Phung, Colin Thomas, David Simonetti, Andrew Hennessee, Ben Tovar, Douglas Thain. 1978-1988 [doi]
- Julia as a unifying end-to-end workflow language on the Frontier exascale systemWilliam F. Godoy, Pedro Valero-Lara, Caira Anderson, Katrina W. Lee, Ana Gainaru, Rafael Ferreira da Silva, Jeffrey S. Vetter. 1989-1999 [doi]
- Delivering Rules-Based Workflows for ScienceDavid Marchant, Mark Blomqvist, Philip Jensen, Iben Lilholm, Martin Nørgaard. 2000-2008 [doi]
- Laminar: A New Serverless Stream-based Framework with Semantic Code Search and Code CompletionZaynab Zahra, Zihao Li, Rosa Filgueira. 2009-2020 [doi]
- Optimization towards Efficiency and Stateful of dispel4pyLiang Liang, Heting Zhang, Guang Yang, Thomas Heinis, Rosa Filgueira. 2021-2032 [doi]
- Scale Composite BaaS Services With AFCL WorkflowsThomas Larcher, Sashko Ristov. 2033-2041 [doi]
- End-to-End Workflows for Climate Science: Integrating HPC Simulations, Big Data Processing, and Machine LearningDonatello Elia, Sonia Scardigno, Jorge Ejarque, Alessandro D'Anca, Gabriele Accarino, Enrico Scoccimarro, Davide Donno, Daniele Peano, Francesco Immorlano, Giovanni Aloisio. 2042-2052 [doi]
- A data science pipeline synchronisation method for edge-fog-cloud continuumDante D. Sánchez-Gallegos, José Luis González Compeán, Jesús Carretero 0001, Heidy Marisol Marín Castro. 2053-2064 [doi]
- A Systematic Mapping Study of Italian Research on WorkflowsMarco Aldinucci, Elena Maria Baralis, Valeria Cardellini, Iacopo Colonnelli, Marco Danelutto, Sergio Decherchi, Giuseppe Di Modica, Luca Ferrucci, Marco Gribaudo, Francesco Iannone, Marco Lapegna, Doriana Medic, Giuseppa Muscianisi, Francesca Righetti, Eva Sciacca, Nicola Tonellotto, Mauro Tortonesi, Paolo Trunfio, Tullio Vardanega. 2065-2076 [doi]
- Fluxion: A Scalable Graph-Based Resource Model for HPC Scheduling ChallengesTapasya Patki, Dong H. Ahn, Daniel Milroy, Jae-Seung Yeom, Jim Garlick, Mark Grondona, Stephen Herbein, Thomas Scogland. 2077-2088 [doi]
- Distributed Data Locality-Aware Job AllocationAna Markovic, Dimitris S. Kolovos, Leandro Soares Indrusiak. 2089-2096 [doi]
- Novel Approaches Toward Scalable Composable Workflows in Hyper-Heterogeneous Computing EnvironmentsJonathan Bader, Jim Belak, Matthew T. Bement, Matthew Berry, Robert Carson, Daniela Cassol, Stephen Chan, John Coleman, Kastan Day, Alejandro Duque, Kjiersten Fagnan, Jeff Froula, Shantenu Jha, Daniel S. Katz, Piotr Kica, Volodymyr V. Kindratenko, Edward Kirton, Ramani Kothadia, Daniel E. Laney, Fabian Lehmann, Ulf Leser, Sabina Licholai, Maciej Malawski, Mario Melara, Elais Player Jackson, Matthew Rolchigo, Setareh Sarrafan, Seung-Jin Sul, Abdullah Syed, Lauritz Thamsen, Mikhail Titov, Matteo Turilli, Silvina Caíno-Lores, Anirban Mandal. 2097-2108 [doi]
- Streaming Data from Experimental Facilities to Supercomputers for Real-Time Data ProcessingSinisa Veseli, John Hammonds, Steven Henke, Hannah Parraga, Nicholas Schwarz. 2109-2117 [doi]
- Cross-Facility Orchestration of Electrochemistry Experiments and ComputationsAnees Al-Najjar, Nageswara S. V. Rao, Craig Bridges, Sheng Dai. 2118-2125 [doi]
- Empowering Scientific Discovery Through Computing at the Advanced Photon SourceHannah Parraga, John Hammonds, Steven Henke, Sinisa Veseli, William E. Allcock, Benoît Côté, Ryan Chard, Suresh Narayanan, Nicholas Schwarz. 2126-2132 [doi]
- Demonstrating Cross-Facility Data Processing at Scale With Laue MicrodiffractionMichael Prince, Doga Gürsoy, Dina Sheyfer, Ryan Chard, Benoît Côté, Hannah Parraga, Barbara Frosik, Jonathan Z. Tischler, Nicholas Schwarz. 2133-2139 [doi]
- Linking the Dynamic PicoProbe Analytical Electron-Optical Beam Line / Microscope to SupercomputersAlexander Brace, Rafael Vescovi, Ryan Chard, Nickolaus D. Saint, Arvind Ramanathan, Nestor J. Zaluzec, Ian T. Foster. 2140-2146 [doi]
- Exploring Benchmarks for Self-Driving Labs using Color MatchingTobias Ginsburg, Kyle Hippe, Ryan Lewis, Aileen Cleary, Doga Ozgulbas, Rory Butler, Casey Stone, Abraham Stroka, Rafael Vescovi, Ian T. Foster. 2147-2152 [doi]