- Efficient Process-to-Node Mapping Algorithms for Stencil Computations. Konrad von Kirchbach, Markus Lehr, Sascha Hunold, Christian Schulz, Jesper Larsson Träff. 1-11
- CuVPP: Filter-based Longest Prefix Matching in Software Data Planes. Minseok Kwon, Krishna Prasad Neupane, John Marshall, M. Mustafa Rafique. 12-22
- HAN: a Hierarchical AutotuNed Collective Communication Framework. Xi Luo, Wei Wu, George Bosilca, Yu Pei, Qinglei Cao, Thananon Patinyasakdikul, Dong Zhong, Jack J. Dongarra. 23-34
- DelveFS - An Event-Driven Semantic File System for Object Stores. Marc-André Vef, Rebecca Steiner, Reza Salkhordeh, Jörg Steinkamp, Florent Vennetier, Jean-François Smigielski, André Brinkmann. 35-46
- NeoMPX: Characterizing and Improving Estimation of Multiplexing Hardware Counters for PAPI. Yichao Wang, Jie Wang, Jin-Kun Chen, Sicheng Zuo, Xiao-Ming Su, James Lin. 47-56
- Grade10: A Framework for Performance Characterization of Distributed Graph Processing. Tim Hegeman, Animesh Trivedi, Alexandru Iosup. 57-68
- Evaluating Worksharing Tasks on Distributed Environments. Marcos Maroñas, Xavier Teruel, J. Mark Bull, Eduard Ayguadé, Vicenç Beltran. 69-80
- Resilient Scheduling of Moldable Jobs on Failure-Prone Platforms. Anne Benoit, Valentin Le Fèvre, Lucas Perotin, Padma Raghavan, Yves Robert, Hongyang Sun. 81-91
- Modeling the Performance of Scientific Workflow Executions on HPC Platforms with Burst Buffers. Loïc Pottier, Rafael Ferreira da Silva, Henri Casanova, Ewa Deelman. 92-103
- Co-scheML: Interference-aware Container Co-scheduling Scheme Using Machine Learning Application Profiles for GPU Clusters. Sejin Kim, Yoonhee Kim. 104-108
- Opportunities and limitations of Quality-of-Service in Message Passing applications on adaptively routed Dragonfly and Fat Tree networks. Jeremiah J. Wilke, Joseph P. Kenny. 109-118
- MonSTer: An Out-of-the-Box Monitoring Tool for High Performance Computing Systems. Jie Li, Ghazanfar Ali, Ngan Nguyen, Jon Hass, Alan Sill, Tommy Dang, Yang Chen. 119-129
- Dynamic Kernel Fusion for Bulk Non-contiguous Data Transfer on GPU Clusters. Ching-Hsiang Chu, Kawthar Shafie Khorassani, Qinghua Zhou, Hari Subramoni, Dhabaleswar K. Panda. 130-141
- Autoscaling High-Throughput Workloads on Container Orchestrators. Chao Zheng, Nathaniel Kremer-Herman, Tim Shaffer, Douglas Thain. 142-152
- SSP: Speeding up Small Flows for Proactive Transport in Datacenters. Yang Bai, Dezun Dong, Shan Huang, Zejia Zhou, Xiangke Liao. 153-161
- Quantifying the impact of network congestion on application performance and network metrics. Yijia Zhang, Taylor Groves, Brandon Cook, Nicholas J. Wright, Ayse K. Coskun. 162-168
- Analysis of Cooling Water Temperature Impact on Computing Performance and Energy Consumption. Jorji Nonaka, Toshihiro Hanawa, Fumiyoshi Shoji. 169-175
- E2Clab: Exploring the Computing Continuum through Repeatable, Replicable and Reproducible Edge-to-Cloud Experiments. Daniel Rosendo, Pedro Silva, Matthieu Simonin, Alexandru Costan, Gabriel Antoniu. 176-186
- Streaming File Transfer Optimization for Distributed Science Workflows. Davut Ucar, Engin Arslan. 187-197
- Exploring the Potential of Fast Delta Encoding: Marching to a Higher Compression Ratio. Haoliang Tan, Zhiyuan Zhang, Xiangyu Zou, Qing Liao, Wen Xia. 198-208
- Staging Based Task Execution for Data-driven, In-Situ Scientific Workflows. Zhe Wang, Pradeep Subedi, Matthieu Dorier, Philip E. Davis, Manish Parashar. 209-220
- Flexible Data Redistribution in a Task-Based Runtime System. Qinglei Cao, George Bosilca, Wei Wu, Dong Zhong, Aurelien Bouteiller, Jack J. Dongarra. 221-225
- DeepClone: Lightweight State Replication of Deep Learning Models for Data Parallel Training. Bogdan Nicolae, Justin M. Wozniak, Matthieu Dorier, Franck Cappello. 226-236
- Exploring Non-Volatility of Non-Volatile Memory for High Performance Computing Under Failures. Jie Ren, Kai Wu, Dong Li. 237-247
- HCL: Distributing Parallel Data Structures in Extreme Scales. Hariharan Devarajan, Anthony Kougkas, Keith Bateman, Xian-He Sun. 248-258
- Predicting MPI Collective Communication Performance Using Machine Learning. Sascha Hunold, Abhinav Bhatele, George Bosilca, Peter Knees. 259-269
- Decomposing MPI Collectives for Exploiting Multi-lane Communication. Jesper Larsson Träff, Sascha Hunold. 270-280
- Power Budgeting of Big Data Applications in Container-based Clusters. Jonatan Enes, Guillaume Fieni, Roberto R. Expósito, Romain Rouvoy, Juan Touriño. 281-287
- Estimating Power Consumption of Containers and Virtual Machines in Data Centers. Xusheng Zhang, Ziyu Shen, Bin Xia, Zheng Liu, Yun Li. 288-293
- Fast Scalable Approximate Nearest Neighbor Search for High-dimensional Data. K. G. Renga Bashyam, Sathish Vadhiyar. 294-302
- A Hybrid MPI+PGAS Approach to Improve Strong Scalability Limits of Finite Element Solvers. Niclas Jansson. 303-313
- Towards Data-Flow Parallelization for Adaptive Mesh Refinement Applications. Kevin Sala, Alejandro Rico, Vicenç Beltran. 314-325
- Towards End-to-end SDC Detection for HPC Applications Equipped with Lossy Compression. Sihuan Li, Sheng Di, Kai Zhao, Xin Liang, Zizhong Chen, Franck Cappello. 326-336
- Efficient Execution of Dynamic Programming Algorithms on Apache Spark. Mohammad Mahdi Javanmard, Zafar Ahmad, Jaroslaw Zola, Louis-Noël Pouchet, Rezaul Chowdhury, Robert J. Harrison. 337-348
- ECS2: A Fast Erasure Coding Library for GPU-Accelerated Storage Systems with Parallel & Direct IO. ChanJung Chang, Jerry Chou, Yu-Ching Chou, I-Hsin Chung. 349-358
- tf-Darshan: Understanding Fine-grained I/O Performance in Machine Learning Workloads. Steven Wei Der Chien, Artur Podobas, Ivy Bo Peng, Stefano Markidis. 359-370
- Extending High-Level Synthesis with High-Performance Computing Performance Visualization. Jens Huthmann, Artur Podobas, Lukas Sommer, Andreas Koch, Kentaro Sano. 371-380
- Parallel Particle Advection Bake-Off for Scientific Visualization Workloads. Roba Binyahib, David Pugmire, Abhishek Yenpure, Hank Childs. 381-391
- Data Life Aware Model Updating Strategy for Stream-based Online Deep Learning. Wei Rang, Donglin Yang, Dazhao Cheng, Kun Suo, Wei Chen. 392-398
- Optimizing GPU Memory Transactions for Convolution Operations. Gangzhao Lu, Weizhe Zhang, Zheng Wang. 399-403
- System-Level vs. Application-Level Checkpointing. Jonas Posner. 404-405
- An HPC-based Prediction on the Practicality of Long-distance Quantum Key Distributions. Hoon Ryu, Ji-Hoon Kang. 406-407
- Performance Evaluation of Supercomputer Fugaku using Breadth-First Search Benchmark in Graph500. Masahiro Nakao, Koji Ueno, Katsuki Fujisawa, Yuetsu Kodama, Mitsuhisa Sato. 408-409
- OctCNN: An Energy-Efficient FPGA Accelerator for CNNs using Octave Convolution Algorithm. Wenqi Lou, Chao Wang, Lei Gong, Xuehai Zhou. 410-411
- ChOWDER: A New Approach for Viewing 3D Web GIS on Ultra-High-Resolution Scalable Display. Tomohiro Kawanabe, Kazuma Hatta, Kenji Ono. 412-413
- An FPGA-based Sound Field Rendering System. Yiyu Tan, Toshiyuki Imamura. 414-415
- The Case for Better Integrating Scalable Data Stores and Stream-Processing Systems. Antonis Papaioannou, Chrysostomos Zeginis, Kostas Magoutis. 416-417
- Prompt Report on Exa-Scale HPL-AI Benchmark. Shuhei Kudo, Keigo Nitadori, Takuya Ina, Toshiyuki Imamura. 418-419
- Implementing a Comprehensive Networks-on-Chip Generator with Optimal Configurations. Hao Zhang, Itta Ohmura, Makoto Taiji. 420-421
- Toward OpenACC-enabled GPU-FPGA Accelerated Computing. Norihisa Fujita, Ryohei Kobayashi, Yoshiki Yamaguchi, Kohji Yoshikawa, Makito Abe, Masayuki Umemura. 422-423
- PIKA: Center-Wide and Job-Aware Cluster Monitoring. Robert Dietrich, Frank Winkler, Andreas Knüpfer, Wolfgang E. Nagel. 424-432
- HPC System Data Pipeline to Enable Meaningful Insights through Analysis-Driven Visualizations. Benjamin Schwaller, Nick Tucker, Tom Tucker, Benjamin A. Allan, Jim M. Brandt. 433-441
- MAP: A Visual Analytics System for Job Monitoring and Analysis. Ashish Pal, Preeti Malakar. 442-448
- Towards workload-adaptive scheduling for HPC clusters. Alexander V. Goponenko, Ramin Izadpanah, Jim M. Brandt, Damian Dechev. 449-453
- Democratizing Parallel Filesystem Monitoring. Richard Todd Evans. 454-458
- LDMS Monitoring of EDR InfiniBand Networks. Benjamin A. Allan, Michael Aguilar, Benjamin Schwaller, Steven Langer. 459-463
- Energy Optimization and Analysis with EAR. Julita Corbalán, Lluis Alonso, Jordi Aneas, Luigi Brochard. 464-472
- Toward an End-to-End Auto-tuning Framework in HPC PowerStack. Xingfu Wu, Aniruddha Marathe, Siddhartha Jana, Ondrej Vysocky, Jophin John, Andrea Bartolini, Lubomir Riha, Michael Gerndt, Valerie E. Taylor, Sridutt Bhalachandra. 473-483
- Evaluation of Power Management Control on the Supercomputer Fugaku. Yuetsu Kodama, Tetsuya Odajima, Eishi Arima, Mitsuhisa Sato. 484-493
- HUD-Oden: A Practical Evaluation Environment for Analyzing Hot-Water Cooled Processors. Jorji Nonaka, Fumiyoshi Shoji. 494-498
- Global Experiences with HPC Operational Data Measurement, Collection and Analysis. Michael Ott, Woong Shin, Norman Bourassa, Torsten Wilde, Stefan Ceballos, Melissa Romanus, Natalie J. Bates. 499-508
- A Study of Operational Impact on Power Usage Effectiveness using Facility Metrics and Server Operation Logs in the K Computer. Masaaki Terai, Fumiyoshi Shoji, Toshiyuki Tsukamoto, Yukihiro Yamochi. 509-513
- A Supercomputing Center Experience With Cooling Control Design. Michael Kercher, Gary New. 514-518
- Investigative Report on Electrical Commissioning in HPC Data Centers. Joseph Prisco, Grant Stewart, Herbert Huber, Randy Rannow, Jason Hick, Dave Martinez, Brandon Hong, Aditya M. Deshpande. 519-522
- Preliminary Performance Evaluation of the Fujitsu A64FX Using HPC Applications. Tetsuya Odajima, Yuetsu Kodama, Miwako Tsuji, Motohiko Matsuda, Yutaka Maruyama, Mitsuhisa Sato. 523-530
- The Effects of Wide Vector Operations on Processor Caches. Andrei Poenaru, Simon McIntosh-Smith. 531-539
- CoreNEURON: Performance and Energy Efficiency Evaluation on Intel and Arm CPUs. Joel Criado, Marta Garcia-Gasulla, Pramod S. Kumbhar, Omar Awile, Ioannis Magkanaris, Filippo Mantovani. 540-548
- Investigating Applications on the A64FX. Adrian Jackson, Michèle Weiland, Nick Brown, Andrew Turner, Mark Parsons. 549-558
- Porting Applications to Arm-based Processors. Bine Brank, Stepan Nassyr, Fatemeh Pouyan, Dirk Pleiter. 559-566
- Performance Evaluation of ParalleX Execution model on Arm-based Platforms. Nikunj Gupta, Rohit Ashiwal, Bine Brank, Sateesh K. Peddoju, Dirk Pleiter. 567-575
- On the Usage of the Arm C Language Extensions for a High-Order Finite-Element Kernel. Sylvain Jubertie, Guillaume Quintin, Fabrice Dupros. 576-579