Journal: TACO

Volume 17, Issue 4

Xinfeng Xie, Xing Hu 0001, Peng Gu, Shuangchen Li, Yu Ji 0002, Yuan Xie 0001. NNBench-X: A Benchmarking Methodology for Neural Network Accelerator Designs
Sam (Likun) Xi, Yuan Yao 0006, Kshitij Bhardwaj, Paul N. Whatmough, Gu-Yeon Wei, David Brooks 0001. SMAUG: End-to-End Full-Stack Simulation Infrastructure for Deep Learning Workloads
George Christou, Giorgos Vasiliadis, Vassilis Papaefstathiou, Antonis Papadogiannakis, Sotiris Ioannidis. On Architectural Support for Instruction Set Randomization
Cristóbal Ramírez, César-Alejandro Hernández-Calderón, Oscar Palomar, Osman S. Unsal, Marco A. Ramírez, Adrián Cristal. A RISC-V Simulator and Benchmark Suite for Designing and Evaluating Vector Architectures
Yemao Xu, Dezun Dong, Yawei Zhao, Weixia Xu, Xiangke Liao. OD-SGD: One-Step Delay Stochastic Gradient Descent for Distributed Training
Gokul Subramanian Ravi, Joshua San Miguel, Mikko H. Lipasti. SHASTA: Synergic HW-SW Architecture for Spatio-temporal Approximation
Rolando Brondolin, Marco D. Santambrogio. A Black-box Monitoring Approach to Measure Microservices Runtime Performance
Albin Eldstål-Ahrens, Ioannis Sourdis. MemSZ: Squeezing Memory Traffic with Lossy Compression
Athanasios Stratikopoulos, Christos Kotselidis, John Goodacre, Mikel Luján. FastPath_MP: Low Overhead & Energy-efficient FPGA-based Storage Multi-paths
Utpal Bora 0001, Santanu Das, Pankaj Kukreja, Saurabh Joshi, Ramakrishna Upadrasta, Sanjay Rajopadhye. LLOV: A Fast Static Data-Race Checker for OpenMP Programs
Anchu Rajendran, V. Krishna Nandivada. DisGCo: A Compiler for Distributed Graph Analytics
Steffen Maass, Mohan Kumar Kumar, Taesoo Kim, Tushar Krishna, Abhishek Bhattacharjee. ECOTLB: Eventually Consistent TLBs
Dennis Pinto, José María Arnau, Antonio González 0001. Design and Evaluation of an Ultra Low-power Human-quality Speech Recognition System
Yu Zhang 0027, Xiaofei Liao, Lin Gu 0002, Hai Jin 0001, Kan Hu, Haikun Liu, Bingsheng He. AsynGraph: Maximizing Data Parallelism for Efficient Iterative Graph Processing on GPUs
Aravind Acharya, Uday Bondhugula, Albert Cohen 0001. Effective Loop Fusion in Polyhedral Compilation Using Fusion Conflict Graphs
S. VenkataKeerthy, Rohit Aggarwal, Shalini Jain 0002, Maunendra Sankar Desarkar, Ramakrishna Upadrasta, Y. N. Srikant. IR2VEC: LLVM IR Based Scalable Program Embeddings
Jhe-Yu Liou, Xiaodong Wang 0020, Stephanie Forrest, Carole-Jean Wu. GEVO: GPU Code Optimization Using Evolutionary Computation

Volume 17, Issue 3

Arnab Das, Sriram Krishnamoorthy, Ian Briggs, Ganesh Gopalakrishnan, Ramakrishna Tipireddy. FPDetect: Efficient Reasoning About Stencil Programs Using Selective Direct Evaluation
Karel Adámek, Sofia Dimoudi, Mike Giles, Wesley Armour. GPU Fast Convolution via the Overlap-and-Save Method in Shared Memory
Ram Rangan, Mark W. Stephenson, Aditya Ukarande, Shyam Murthy, Virat Agarwal, Marc Blackstein. Zeroploit: Exploiting Zero Valued Operands in Interactive Gaming Applications
Tarek S. Abdelrahman. Cooperative Software-hardware Acceleration of K-means on a Tightly Coupled CPU-FPGA System
Muhammad Huzaifa, Johnathan Alsop, Abdulrahman Mahmoud, Giordano Salvador, Matthew D. Sinclair, Sarita V. Adve. Inter-kernel Reuse-aware Thread Block Scheduling
Savvas Sioutas, Sander Stuijk, Twan Basten, Henk Corporaal, Lou J. Somers. Schedule Synthesis for Halide Pipelines on GPUs
Luca Cerina, Marco D. Santambrogio, Giuseppe Franco, Claudio Gallicchio, Alessio Micheli. EchoBay: Design and Optimization of Echo State Networks under Memory and Time Constraints
David R. Kaeli. Editorial: A Message from the Editor-in-Chief
Jaekyu Lee, Yasuo Ishii, Dam Sunwoo. Securing Branch Predictors with Two-Level Encryption

Volume 17, Issue 2

Anita Tino, Caroline Collange, André Seznec. SIMT-X: Extending Single-Instruction Multi-Threading to Out-of-Order Cores
Stefano Cherubin, Daniele Cattaneo, Michele Chiari, Giovanni Agosta. Dynamic Precision Autotuning with TAFFO
Jiachen Xue, T. N. Vijaykumar, Mithuna Thottethodi. Network Interface Architecture for Remote Indirect Memory Access (RIMA) in Datacenters
Qinggang Wang, Long Zheng 0003, Jieshan Zhao, Xiaofei Liao, Hai Jin 0001, Jingling Xue. A Conflict-free Scheduler for High-performance Graph Processing on Multi-pipeline FPGAs
Amir Hossein Nodehi Sabet, Junqiao Qiu, Zhijia Zhao 0001, Sriram Krishnamoorthy. Reliability Analysis for Unreliable FSM Computations
Ahmet Erdem, Cristina Silvano, Thomas Boesch, Andrea C. Ornstein, Surinder-pal Singh, Giuseppe Desoli. Runtime Design Space Exploration and Mapping of DCNNs for the Ultra-Low-Power Orlando SoC
Charu Kalra, Fritz Previlon, Norm Rubin, David R. Kaeli. ArmorAll: Compiler-based Resilience Targeting GPU Applications

Volume 17, Issue 1

Hao Wu, Weizhi Liu, Huanxin Lin, Cho-Li Wang. A Model-Based Software Solution for Simultaneous Multiple Kernels on GPUs
Xuanhua Shi, Wei Liu, Ligang He, Hai Jin 0001, Ming Li, Yong Chen 0001. Optimizing the SSD Burst Buffer by Traffic Detection
Nikolaos Tampouratzis, Ioannis Papaefstathiou, Antonios Nikitakis, Andreas Brokalakis, Stamatis Andrianakis, Apostolos Dollas, Marco Marcon, Emanuele Plebani. A Novel, Highly Integrated Simulator for Parallel and Distributed Systems
Yuhao Li, Dan Sun, Benjamin C. Lee. Dynamic Colocation Policies with Reinforcement Learning
Lijuan Jiang, Chao Yang 0002, Wenjing Ma. Enabling Highly Efficient Batched Matrix Multiplications on SW26010 Many-core Processor
Yang Song 0006, Bill Lin. Improving Memory Efficiency in Heterogeneous MPSoCs through Row-Buffer Locality-aware Forwarding
Yohann Uguen, Florent de Dinechin, Victor Lezaud, Steven Derrien. Application-Specific Arithmetic in High-Level Synthesis Tools
Mustafa Cavus, Resit Sendag, Joshua J. Yi. Informed Prefetching for Indirect Memory Accesses