Abstract is missing.
- Exploring Bit-Slice Sparsity in Deep Neural Networks for Efficient ReRAM-Based DeploymentJingyang Zhang, Huanrui Yang, Fan Chen 0001, Yitu Wang, Hai Li. 1-5 [doi]
- Discovering Low-Precision Networks Close to Full-Precision Networks for Efficient InferenceJeffrey L. McKinstry, Steven K. Esser, Rathinakumar Appuswamy, Deepika Bablani, John V. Arthur, Izzet B. Yildiz, Dharmendra S. Modha. 6-9 [doi]
- QPyTorch: A Low-Precision Arithmetic Simulation FrameworkTianyi Zhang, Zhiqiu Lin, Guandao Yang, Christopher De Sa. 10-13 [doi]
- Trained Rank Pruning for Efficient Deep Neural NetworksYuhui Xu, Yuxi Li, Shuai Zhang 0009, Wei Wen, Botao Wang, Wenrui Dai, Yingyong Qi, Yiran Chen, Weiyao Lin, Hongkai Xiong. 14-17 [doi]
- Pushing the limits of RNN CompressionUrmish Thakker, Igor Fedorov, Jesse G. Beu, Dibakar Gope, Chu Zhou, Ganesh Dasika, Matthew Mattina. 18-21 [doi]
- YOLO Nano: a Highly Compact You Only Look Once Convolutional Neural Network for Object DetectionAlexander Wong, Mahmoud Famouri, Mohammad Javad Shafiee, Francis Li, Brendan Chwyl, Jonathan Chung. 22-25 [doi]
- Instant Quantization of Neural Networks using Monte Carlo MethodsGonçalo Mordido, Matthijs Van Keirsbilck, Alexander Keller 0001. 26-30 [doi]
- Progressive Stochastic Binarization of Deep NetworksDavid Hartmann, Michael Wand 0001. 31-35 [doi]
- Q8BERT: Quantized 8Bit BERTOfir Zafrir, Guy Boudoukh, Peter Izsak, Moshe Wasserblat. 36-39 [doi]
- Towards Co-designing Neural Network Function Approximators with In-SRAM ComputingShamma Nasrin, Diaa Badawi, Ahmet Enis Çetin, Wilfred Gomes, Amit Ranjan Trivedi. 40-43 [doi]
- Training Compact Models for Low Resource Entity Tagging using Pre-trained Language ModelsPeter Izsak, Shira Guskin, Moshe Wasserblat. 44-47 [doi]
- Algorithm-hardware Co-design for Deformable ConvolutionQijing Huang 0001, Dequan Wang, Yizhao Gao, Yaohui Cai, Zhen Dong, Bichen Wu, Kurt Keutzer, John Wawrzynek. 48-51 [doi]
- Bit Efficient Quantization for Deep Neural NetworksPrateeth Nayak, David Zhang, Sek Chai. 52-56 [doi]
- Spoken Language Understanding on the EdgeAlaa Saade, Joseph Dureau, David Leroy, Francesco Caltagirone, Alice Coucke, Adrien Ball, Clément Doumouro, Thibaut Lavril, Alexandre Caulier, Théodore Bluche, Thibault Gisselbrecht, Maël Primet. 57-61 [doi]
- Neural Networks Weights Quantization: Target None-retraining Ternary (TNT)Tianyu Zhang, Lei Zhu, Qian Zhao, Kilho Shin. 62-65 [doi]
- On Hardware-Aware Probabilistic Frameworks for Resource Constrained Embedded ApplicationsLaura Isabel Galindez Olascoaga, Wannes Meert, Nimish Shah, Guy Van den Broeck, Marian Verhelst. 66-70 [doi]