Abstract is missing.
- TCUDA: A QoS-based GPU Sharing Framework for Autonomous Navigation SystemsPangbo Sun, Hao Wu, Jiangming Jin, Ziyue Jiang, Yifan Gong 0003. 1-10 [doi]
- Seriema: RDMA-based Remote Invocation with a Case-Study on Monte-Carlo Tree SearchHammurabi Mendes, Bryce Wiedenbeck, Aidan O'Neill. 11-20 [doi]
- Exploring the Effects of Silent Data Corruption in Distributed Deep Learning TrainingElvis Rojas, Diego Pérez, Esteban Meneses. 21-30 [doi]
- Mixed and Multi-Precision SpMV for GPUs with Row-wise Precision SelectionErhan Tezcan, Tugba Torun, Fahrican Kosar, Kamer Kaya, Didem Unat. 31-40 [doi]
- gem5-ndp: Near-Data Processing Architecture Simulation From Low Level Caches to DRAMJoão Vieira, Nuno Roma, Gabriel Falcão 0001, Pedro Tomás. 41-50 [doi]
- Approximate Memory with Protected Static AllocationJoão Fabrício Filho, Isaías B. Felzmann, Lucas Wanner 0001. 51-59 [doi]
- Dynamic Set Stealing to Improve Cache PerformanceBrady Testa, Samira Mirbagher Ajorpaz, Daniel A. Jiménez. 60-70 [doi]
- Avoiding Unnecessary Caching with History-Based Preemptive BypassingArthur M. Krause, Paulo C. Santos 0001, Philippe O. A. Navaux. 71-80 [doi]
- Memory-Side Acceleration and Sparse Compression for Quantized Packed ConvolutionsAlex Weaver, Krishna Kavi, Pranathi Vasireddy, Gayatri Mehta. 81-90 [doi]
- NUMA-Aware Dense Matrix Factorizations and Inversion with Look-Ahead on Multicore ProcessorsSandra Catalán, Francisco D. Igual, Rafael Rodríguez-Sánchez, José R. Herrero, Enrique S. Quintana-Ortí. 91-99 [doi]
- An MPI-Parallel Algorithm for Static and Dynamic Top-k Harmonic CentralityAlexander van der Grinten, Geert Custers, Duy Le Thanh, Henning Meyerhenke. 100-109 [doi]
- Efficient Strategies for Graph Pattern Mining Algorithms on GPUsSamuel Ferraz, Vinícius Vitor dos Santos Dias, Carlos H. C. Teixeira, George Teodoro, Wagner Meira Jr.. 110-119 [doi]
- A predictive approach for dynamic replication of operators in distributed stream processing systemsDaniel Wladdimiro, Luciana Arantes, Pierre Sens 0001, Nicolas Hidalgo. 120-129 [doi]
- Convergence of HPC and Big Data in extreme-scale data analysis through the DCEx programming modelJavier García Blas, Javier Fernández Muñoz, Jesús Carretero 0001, Fabrizio Marozzo, Domenico Talia, Paolo Trunfio, Alberto Fernández-Pena, Daniel Martín de Blas. 130-139 [doi]
- A Multi-GPU Python Solver for Low-Temperature Non-Equilibrium PlasmasJames Almgren-Bell, Nader Al Awar, Dilip S. Geethakrishnan, Milos Gligoric, George Biros. 140-149 [doi]
- Ion-Molecule Collision Cross-Section Simulation using Linked-cell and Trajectory ParallelizationSamuel Cajahuaringa, Leandro N. Zanotto, Daniel L. Z. Caetano, Sandro Rigo, Hervé Yviquel, Munir S. Skaf, Guido Araujo. 150-159 [doi]
- Convolution Operators for Deep Learning Inference on the Fujitsu A64FX ProcessorManuel F. Dolz, Héctor Martínez, Pedro Alonso 0002, Enrique S. Quintana-Ortí. 160-169 [doi]
- Characterizing Prefetchers using CacheObserverGuillaume Didier 0001, Clémentine Maurice, Antoine Geimer, Walid J. Ghandour. 170-179 [doi]
- FiBHA: Fixed Budget Hybrid CNN AcceleratorFareed Qararyah, Muhammad Waqar Azhar, Pedro Trancoso. 180-190 [doi]
- Setting up an experimental framework for analysing an immersion cooling systemThierry Arrabal, Lucas Betencourt, Eddy Caron, Laurent Lefèvre. 191-200 [doi]
- Prof5: A RISC-V profiler toolJonathas Silveira, Lucas Castro, Victor Araújo, Rodrigo Zeli, Daniel Lazari, Marcelo Guedes, Rodolfo Azevedo, Lucas Wanner 0001. 201-210 [doi]
- Study of the Processor and Memory Power and Energy Consumption of Coupled Sparse/Dense SolversEmmanuel Agullo, Marek Felsöci, Amina Guermouche, Hervé Mathieu, Guillaume Sylvand, Bastien Tagliaro. 211-220 [doi]
- A Test for FLOPs as a Discriminant for Linear Algebra AlgorithmsAravind Sankaran, Paolo Bientinesi. 221-230 [doi]
- IntP: Quantifying cross-application interference via system-level instrumentationMiguel G. Xavier, Carlos H. C. Cano, Vinícius Meyer, César A. F. De Rose. 231-240 [doi]
- Metrics for Packing Efficiency and Fairness of HPC Cluster Batch Job SchedulingAlexander V. Goponenko, Kenneth Lamar, Christina L. Peterson, Benjamin A. Allan, Jim M. Brandt, Damian Dechev. 241-252 [doi]
- Optimizing Execution Time and Costs of Cross-Silo Federated Learning Applications with Datasets on different Cloud ProvidersRafaela C. Brum, Pierre Sens 0001, Luciana Arantes, Maria Clicia Stelling de Castro, Lúcia Maria de A. Drummond. 253-262 [doi]
- Strategies for Fault-Tolerant Tightly-Coupled HPC Workloads Running on Low-Budget Spot Cloud InfrastructuresVanderlei Munhoz, Márcio Castro, Odorico Mendizabal. 263-272 [doi]
- Performance Improvements of Parallel Applications thanks to MPI-4.0 HintsMaxim Moraru, Adrien Roussel, Hugo Taboada, Christophe Jaillet, Marc Pérache, Michaël Krajecki. 273-282 [doi]
- Taming the Big Data Monster: Managing Petabytes of Data with Multi-Model DatabasesYang Chen, Feng Zhang, Yinhao Hong, Yunpeng Chai, Wei Lu, Hong Chen, Xiaoyong Du 0001, Peipei Wang, Le Mi, Jintao Li, Xilin Tang, Yanliang Zhou, Wei Zhou, Peng Zhang, Fengyi Chen, Pengfei Li, Yu Li. 283-292 [doi]
- Parallelizing Git Checkout: a Case Study of I/O ParallelismMatheus Tavares Bernardino, Alfredo Goldman. 293-304 [doi]
- Analyzing Power Decisions in Data Center Powered by Renewable SourcesIgor Fontana De Nardin, Patricia Stolf, Stéphane Caux. 305-314 [doi]
- Automatic aggregation of subtask accesses for nested OpenMP-style tasksOmar Shaaban, Jimmy Aguilar Mena, Vicenç Beltran 0001, Paul M. Carpenter, Eduard Ayguadé, Jesús Labarta Mancho. 315-325 [doi]
- STEER: Asymmetry-aware Energy Efficient Task Scheduler for Cluster-based Multicore ArchitecturesJing Chen, Madhavan Manivannan, Bhavishya Goel, Mustafa Abduljabbar, Miquel Pericàs. 326-335 [doi]
- Mitigating Unnecessary Throttling in Linux CFS Bandwidth ControlOdin Ugedal, Rakesh Kumar. 336-345 [doi]