Abstract is missing.
- Exploiting Non-conventional DVFS on GPUs: Application to Deep LearningFrancisco Mendes, Pedro Tomás, Nuno Roma. 1-9 [doi]
- Design Space Exploration of Accelerators and End-to-End DNN Evaluation with TFLITE-SOCNicolas Bohm Agostini, Shi Dong, Elmira Karimi, Marti Torrents Lapuerta, José Cano, José L. Abellán, David R. Kaeli. 10-19 [doi]
- Hardware Multiversioning for Fail-Operational Multithreaded ApplicationsRico Amslinger, Christian Piatka, Florian Haas, Sebastian Weis, Theo Ungerer, Sebastian Altmeyer. 20-27 [doi]
- On-chip Parallel Photonic Reservoir Computing using Multiple Delay LinesSyed Ali Hasnain, Rabi Mahapatra. 28-34 [doi]
- Online Sharing-Aware Thread Mapping in Software Transactional MemoryDouglas Pereira Pasqualin, Matthias Diener, André Rauber Du Bois, Maurício Lima Pilla. 35-42 [doi]
- Optically Connected Memory for Disaggregated Data CentersJorge González, Alexander Gazman, Maarten Hattink, Mauricio G. Palma, Meisam Bahadori, Ruth Rubio-Noriega, Lois Orosa 0001, Madeleine Glick, Onur Mutlu, Keren Bergman, Rodolfo Azevedo. 43-50 [doi]
- AIR: A Light-Weight Yet High-Performance Dataflow Engine based on Asynchronous Iterative RoutingVinu E. Venugopal, Martin Theobald, Samira Chaychi, Amal Tawakuli. 51-58 [doi]
- An Optimal Model for Optimizing the Placement and Parallelism of Data Stream Processing Applications on Cloud-Edge ComputingFelipe Rodrigo de Souza, Marcos Dias de Assunção, Eddy Caron, Alexandre Da Silva Veith. 59-66 [doi]
- Evaluating Computation and Data Placements in Edge Infrastructures through a Common SimulatorAnderson Andrei Da Silva, Clément Mommessin, Pierre Neyron, Denis Trystram, Adwait Bauskar, Adrien Lebre, Alexandre van Kempen, Yanik Ngoko, Yoann Ricordel. 67-74 [doi]
- Optimizing Green Energy Consumption of Fog Computing ArchitecturesAdrien Gougeon, Benjamin Camus, Anne-Cécile Orgerie. 75-82 [doi]
- Energy-Efficient Time Series Analysis Using Transprecision ComputingIvan Fernandez, Ricardo Quislant, Eladio Gutiérrez, Oscar G. Plata. 83-90 [doi]
- High Performance and Portable Convolution Operators for Multicore ProcessorsPablo San Juan, Adrián Castelló, Manuel F. Dolz, Pedro Alonso-Jordá, Enrique S. Quintana-Ortí. 91-98 [doi]
- High-Performance Low-Memory Lowering: GEMM-based Algorithms for DNN ConvolutionAndrew Anderson 0001, Aravind Vasudevan, Cormac Keane, David Gregg. 99-106 [doi]
- Optimized Transactional Data Structure Approach to Concurrency Control for In-Memory DatabasesChristina L. Peterson, Amalee Wilson, Peter Pirkelbauer, Damian Dechev. 107-115 [doi]
- Reliable and Energy-aware Mapping of Streaming Series-parallel Applications onto Hierarchical PlatformsChangjiang Gou, Anne Benoit, Mingsong Chen, Loris Marchal, Tongquan Wei. 116-123 [doi]
- Scalable and Efficient Spatial-Aware Parallelization Strategies for Multimedia RetrievalGuilherme Andrade, George Teodoro, Renato Ferreira 0001. 124-131 [doi]
- Scheduling Methods to Reduce Response Latency of Function as a ServicePawel Zuk, Krzysztof Rzadca. 132-140 [doi]
- Selective Protection for Sparse Iterative Solvers to Reduce the Resilience OverheadHongyang Sun, Ana Gainaru, Manu Shantharam, Padma Raghavan. 141-148 [doi]
- sputniPIC: An Implicit Particle-in-Cell Code for Multi-GPU SystemsSteven Wei Der Chien, Jonas Nylund, Gabriel Bengtsson, Ivy Bo Peng, Artur Podobas, Stefano Markidis. 149-156 [doi]
- Using Skip Graphs for Increased NUMA LocalitySamuel Thomas, Roxana Hayne, Jonad Pulaj, Hammurabi Mendes. 157-166 [doi]
- A Fast and Concise Parallel Implementation of the 8x8 2D IDCT using HalideMartin Johnson, Daniel P. Playne. 167-174 [doi]
- Controlling Garbage Collection and Request Admission to Improve Performance of FaaS ApplicationsDavid Quaresma, Daniel Fireman, Thiago Emmanuel Pereira. 175-182 [doi]
- On the Memory Underutilization: Exploring Disaggregated Memory on HPC SystemsIvy Bo Peng, Roger Pearce, Maya B. Gokhale. 183-190 [doi]
- Predicting the Energy Consumption of CUDA Kernels using SimGridDorra Boughzala, Laurent Lefèvre, Anne-Cécile Orgerie. 191-198 [doi]
- TASO: Time and Space Optimization for Memory-Constrained DNN InferenceYuan Wen, Andrew Anderson 0001, Valentin Radu, Michael F. P. O'Boyle, David Gregg. 199-208 [doi]
- XPySom: High-Performance Self-Organizing MapsRiccardo Mancini, Antonio Ritacco, Giacomo Lanciano, Tommaso Cucinotta. 209-216 [doi]
- A Robotic Communication Middleware Combining High Performance and High ReliabilityWei Liu, Hao Wu, Ziyue Jiang, Yifan Gong, Jiangming Jin. 217-224 [doi]
- MASA-StarPU: Parallel Sequence Comparison with Multiple Scheduling Policies and PruningRafael A. Lopes, Samuel Thibault, Alba C. M. A. Melo. 225-232 [doi]
- PSU: A Framework for Dynamic Software Updates in Multi-threaded C-Language ProgramsMarcus Karpoff, José Nelson Amaral, Kai-Ting Amy Wang, Rayson Ho, Brice Dobry. 233-240 [doi]
- Towards Communication Profile, Topology and Node Failure Aware Process PlacementIoannis Vardas, Manolis Ploumidis, Manolis Marazakis. 241-248 [doi]
- OmpTracing: Easy Profiling of OpenMP ProgramsVitoria Pinho, Hervé Yviquel, Marcio Machado Pereira, Guido Araujo. 249-256 [doi]
- Analyzing the Loop Scheduling Mechanisms on Julia MultithreadingDiana A. Barros, Cristiana Bentes. 257-264 [doi]
- Performance Analysis and Optimization of the Vector-Kronecker Product MultiplicationAlexandre Azevedo, Cristiana Bentes, Maria Clicia Stelling de Castro, Claude Tadonki. 265-272 [doi]
- JAMPI: A C++ Parallel Programming Interface Allowing the Implementation of Custom and Generic Scheduling MechanismsDaniel Di Domenico, Gerson G. H. Cavalheiro. 273-280 [doi]
- Towards Pervasive Containerization of HPC Job SchedulersChristophe Cérin, Nicolas Grenèche, Tarek Menouer. 281-288 [doi]
- Towards Profile-Guided Optimization for Safe and Efficient Parallel Stream Processing in RustStefan Sydow, Mohannad Nabelsee, Sabine Glesner, Paula Herber. 289-296 [doi]
- Re-evaluation of Atomic Operations and Graph Coloring for Unstructured Finite Volume GPU SimulationsXi Zhang, Xu Sun, Xiaohu Guo, Yunfei Du, Yutong Lu, Yang Liu. 297-304 [doi]
- Extending Heterogeneous Applications to Remote Co-processors with rOpenCLRui Alves, José Rufino. 305-312 [doi]
- FFT Optimizations and Performance Assessment Targeted towards Satellite and Airborne Radar ProcessingMaron Schlemon, Jamin Naghmouchi. 313-320 [doi]
- A Highly Efficient SGEMM Implementation using DMA on the Intel/Movidius Myriad-2Suyash Bakshi, Lennart Johnsson. 321-328 [doi]