- Impact of Memory DoS Attacks on Cloud Applications and Real-Time Detection Schemes. Zhuozhao Li, Tanmoy Sen, Haiying Shen, Mooi Choo Chuah.
- Adaptive Bulk Search: Solving Quadratic Unconstrained Binary Optimization Problems on Multiple GPUs. Ryota Yasudo, Koji Nakano, Yasuaki Ito, Masaru Tatekawa, Ryota Katsuki, Takashi Yazane, Yoko Inaba.
- SPECcast: A Methodology for Fast Performance Evaluation with SPEC CPU 2017 Multiprogrammed Workloads. Pablo Prieto, Pablo Abad Fidalgo, Jose Angel Herrero, José-Ángel Gregorio, Valentin Puente.
- Developing a Loss Prediction-based Asynchronous Stochastic Gradient Descent Algorithm for Distributed Training of Deep Neural Networks. Junyu Li, Ligang He, Shenyuan Ren, Rui Mao.
- URSA: Precise Capacity Planning and Fair Scheduling based on Low-level Statistics for Public Clouds. Wei Zhang 0149, Ningxin Zheng, Quan Chen 0002, Yong Yang, Zhuo Song, Tao Ma, Jingwen Leng, Minyi Guo.
- Polo: Receiver-Driven Congestion Control for Low Latency over Commodity Network Fabric. Chang Ruan, Jianxin Wang, Wanchun Jiang, Tao Zhang.
- Optimizing Flow Bandwidth Consumption with Traffic-diminishing Middlebox Placement. Yang Chen, Jie Wu, Bo Ji.
- Large-scale Simulations of Peridynamics on Sunway Taihulight Supercomputer. Xinyuan Li, Huang Ye, Jian Zhang.
- XShot: Light-weight Link Failure Localization using Crossed Probing Cycles in SDN. Hongyun Gao, Laiping Zhao, Huanbin Wang, Zhao Tian, Lihai Nie, Keqiu Li.
- Fast Spectral Graph Layout on Multicore Platforms. Ashirbad Mishra, Shad Kirmani, Kamesh Madduri.
- Adaptive Distributed Convolutional Neural Network Inference at the Network Edge with ADCNN. Sai Qian Zhang, Jieyu Lin, Qi Zhang.
- Robustness of the Young/Daly formula for stochastic iterative applications. Yishu Du, Loris Marchal, Guillaume Pallez Aupy, Yves Robert.
- SWMapper: Scalable Read Mapper on SunWay TaihuLight. Kai Xu, Xiaohui Duan, Xiangxu Meng, Xin Li, Bertil Schmidt, Weiguo Liu.
- GraBi: Communication-Efficient and Workload-Balanced Partitioning for Bipartite Graphs. Feng Sheng, Qiang Cao, Hong Jiang 0001, Jie Yao.
- CARD: A Congestion-Aware Request Dispatching Scheme for Replicated Metadata Server Cluster. Shangming Cai, Dongsheng Wang, Zhanye Wang, Haixia Wang.
- Rendering Server Allocation for MMORPG Players in Cloud Gaming. Iryanto Jaya, Wentong Cai 0001, Yusen Li.
- OVERSEE: Outsourcing Verification to Enable Resource Sharing in Edge Environment. Xiaoqing Cai, Jiuchen Shi, Rui Yuan, Chang Liu, Wenli Zheng, Quan Chen 0002, Chao Li 0009, Jingwen Leng, Minyi Guo.
- Huffman Coding with Gap Arrays for GPU Acceleration. Naoya Yamamoto, Koji Nakano, Yasuaki Ito, Daisuke Takafuji, Akihiko Kasagi, Tsuguchika Tabaru.
- Balancing Fairness and Efficiency for Cache Sharing in Semi-external Memory System. Shanjiang Tang, Qifei Chai, Ce Yu, Yusen Li, Chao Sun.
- Dual-Way Gradient Sparsification for Asynchronous Distributed Deep Learning. Zijie Yan, Danyang Xiao, Mengqiang Chen, Jieying Zhou, Weigang Wu.
- AMRT: Anti-ECN Marking to Improve Utilization of Receiver-driven Transmission in Data Center. Jinbin Hu, Jiawei Huang, Zhaoyi Li, Jianxin Wang, Tian He 0001.
- ParSecureML: An Efficient Parallel Secure Machine Learning Framework on GPUs. Zheng Chen, Feng Zhang, Amelie Chi Zhou, Jidong Zhai, Chenyang Zhang, Xiaoyong Du 0001.
- The Art of CPU-Pinning: Evaluating and Improving the Performance of Virtualization and Containerization Platforms. Davood Ghatreh Samani, Chavit Denninnart, Josef Bacik, Mohsen Amini Salehi.
- HCAPP: Scalable Power Control for Heterogeneous 2.5D Integrated Systems. Kramer Straube, Jason Lowe-Power, Christopher Nitta, Matthew K. Farrens, Venkatesh Akella.
- An Efficient Wear-level Architecture using Self-adaptive Wear Leveling. Jianming Huang, Yu Hua 0001, Pengfei Zuo, Wen Zhou, Fangting Huang.
- An Adaptive Erasure-Coded Storage Scheme with an Efficient Code-Switching Algorithm. Zizhong Wang, Haixia Wang, Airan Shao, Dongsheng Wang.
- DeepHop on Edge: Hop-by-hop Routing by Distributed Learning with Semantic Attention. Bo He, Jingyu Wang, Qi Qi 0001, Haifeng Sun, Zirui Zhuang, Cong Liu, Jianxin Liao.
- Graffix: Efficient Graph Processing with a Tinge of GPU-Specific Approximations. Somesh Singh 0001, Rupesh Nasre.
- CCHL: Compression-Consolidation Hardware Logging for Efficient Failure-Atomic Persistent Memory Updates. Xueliang Wei, Dan Feng 0001, Wei Tong, Jingning Liu, Chengning Wang, Liuqing Ye.
- Detailed Analysis and Optimization of CUDA K-means Algorithm. Martin Krulis, Miroslav Kratochvíl.
- GOSH: Embedding Big Graphs on Small Hardware. Taha Atahan Akyildiz, Amro Alabsi Aljundi, Kamer Kaya.
- Cooperative Game for Multiple Chargers with Dynamic Network Topology. Chi Lin, Ziwei Yang, Yu Sun, Jing Deng 0001, Lei Wang, Guowei Wu.
- Federated Learning with Proximal Stochastic Variance Reduced Gradient Algorithms. Canh T. Dinh, Nguyen H. Tran, Tuan-Dung Nguyen, Wei Bao, Albert Y. Zomaya, Bing Bing Zhou.
- Delta-DNN: Efficiently Compressing Deep Neural Networks via Exploiting Floats Similarity. Zhenbo Hu, Xiangyu Zou, Wen Xia, Sian Jin, Dingwen Tao, Yang Liu, Weizhe Zhang, Zheng Zhang.
- Energy-aware strategies for reliability-oriented real-time task allocation on heterogeneous platforms. Li Han 0001, Yiqin Gao, Jing Liu, Yves Robert, Frédéric Vivien.
- Generating Robust Parallel Programs via Model Driven Prediction of Compiler Optimizations for Non-determinism. Girish Mururu, Kaushik Ravichandran, Ada Gavrilovska, Santosh Pande.
- Mass: Workload-Aware Storage Policy for OpenStack Swift. Yu Chen, Wei Tong, Dan Feng 0001, Zike Wang.
- DNNARA: A Deep Neural Network Accelerator using Residue Arithmetic and Integrated Photonics. Jiaxin Peng, Yousra Al-Kabani, Shuai Sun, Volker J. Sorger, Tarek A. El-Ghazawi.
- An Online Learning-Based Task Offloading Framework for 5G Small Cell Networks. Xueying Zhang, Ruiting Zhou, Zhi Zhou, John C. S. Lui, Zongpeng Li.
- Balancing Graph Processing Workloads Using Work Stealing on Heterogeneous CPU-FPGA Systems. Matthew Agostini, Francis O'Brien, Tarek Abdelrahman.
- Prune the Unnecessary: Parallel Pull-Push Louvain Algorithms with Automatic Edge Pruning. Jesmin Jahan Tithi, Andrzej Stasiak, Sriram Aananthakrishnan, Fabrizio Petrini.
- Reducing Latency in Multi-Tenant Data Centers via Cautious Congestion Watch. Ahmed M. Abdelmoniem, Hengky Susanto, Brahim Bensaou.
- A Reinforcement Learning Based System for Minimizing Cloud Storage Service Cost. Haoyu Wang, Haiying Shen, Qi Liu, Kevin Zheng, Jie Xu 0004.
- Parallel Shift-Invert Spectrum Slicing on Distributed Architectures with GPU Accelerators. David B. Williams-Young, Chao Yang 0001.
- Extremely Low-bit Convolution Optimization for Quantized Neural Network on Modern Computer Architectures. Qingchang Han, Yongmin Hu, Fengwei Yu, Hailong Yang, Bing Liu, Peng Hu, Ruihao Gong, Yanfei Wang, Rui Wang, Zhongzhi Luan, Depei Qian.
- A Rack-Aware Pipeline Repair Scheme for Erasure-Coded Distributed Storage Systems. Tong Liu, Shakeel Alibhai, Xubin He.
- Improving Load Balance via Resource Exchange in Large-Scale Search Engines. Kaiyue Duan, Yusen Li, Trent G. Marbach, Gang Wang, Xiaoguang Liu.
- First Time Miss: Low Overhead Mitigation for Shared Memory Cache Side Channels. Kartik Ramkrishnan, Stephen McCamant, Pen-Chung Yew, Antonia Zhai.
- Revisiting Sparse Dynamic Programming for the 0/1 Knapsack Problem. Tarequl Islam Sifat, Nirmal Prajapati, Sanjay V. Rajopadhye.
- Efficient Block Algorithms for Parallel Sparse Triangular Solve. Zhengyang Lu, Yuyao Niu, Weifeng Liu 0001.
- Safe, Fast Sharing of memcached as a Protected Library. Chris Kjellqvist, Mohammad Hedayati, Michael L. Scott.
- ShadowTutor: Distributed Partial Distillation for Mobile Video DNN Inference. Jae-won Chung, Jae Yun Kim, Soo-Mook Moon.
- DIESEL: A Dataset-Based Distributed Storage and Caching System for Large-Scale Deep Learning Training. Lipeng Wang, Songgao Ye, Baichen Yang, Youyou Lu, Hequan Zhang, Shengen Yan, Qiong Luo 0001.
- E-LAS: Design and Analysis of Completion-Time Agnostic Scheduling for Distributed Deep Learning Cluster. Abeda Sultana, Li Chen, Fei Xu, Xu Yuan.
- CapelliniSpTRSV: A Thread-Level Synchronization-Free Sparse Triangular Solve on GPUs. Jiya Su, Feng Zhang, Weifeng Liu 0001, Bingsheng He, Ruofan Wu, Xiaoyong Du 0001, Rujia Wang.
- Algorithm-Based Checkpoint-Recovery for the Conjugate Gradient Method. Carlos Pachajoa, Christina Pacher, Markus Levonyak, Wilfried N. Gansterer.
- FEEL: A Federated Edge Learning System for Efficient and Privacy-Preserving Mobile Healthcare. Yeting Guo, Fang Liu, Zhiping Cai, Li Chen, Nong Xiao.
- Selective Coflow Completion for Time-sensitive Distributed Applications with Poco. Shouxi Luo, Pingzhi Fan, Huanlai Xing, Hongfang Yu.
- Vector Forward Mode Automatic Differentiation on SIMD/SIMT architectures. Jan Hückelheim, Michel Schanen, Sri Hari Krishna Narayanan, Paul D. Hovland.
- Towards High-Efficiency Data Centers via Job-Aware Network Scheduling. Yang Shi, Mei Wen, Chunyuan Zhang.
- OPS: Optimized Shuffle Management System for Apache Spark. Yuchen Cheng, Chunghsuan Wu, Yanqiang Liu, Rui Ren, Hong Xu, Bin Yang, Zhengwei Qi.
- Experiences on the characterization of parallel applications in embedded systems with Extrae/Paraver. Adrian Munera, Sara Royuela, Germán Llort, Estanislao Mercadal, Franck Wartel, Eduardo Quiñones.
- Automatic Identification and Precise Attribution of DRAM Bandwidth Contention. Christian Helm, Kenjiro Taura.
- SeRW: Adaptively Separating Read and Write upon SSDs of Hybrid Storage Server in Clouds. Fan Deng, Qiang Cao, Shucheng Wang, Shuyang Liu, Jie Yao, Yuanyuan Dong, Puyuan Yang.
- SkyChain: A Deep Reinforcement Learning-Empowered Dynamic Blockchain Sharding System. Jianting Zhang, Zicong Hong, Xiaoyu Qiu, Yufeng Zhan, Song Guo 0001, Wuhui Chen.
- Reliability Augmentation of Requests with Service Function Chain Requirements in Mobile Edge-Cloud Networks. Weifa Liang, Yu Ma 0001, Wenzheng Xu, Xiaohua Jia, Sid Chi-Kin Chau.
- Performance Portable Supernode-based Sparse Triangular Solver for Manycore Architectures. Ichitaro Yamazaki, Sivasankaran Rajamanickam, Nathan D. Ellingwood.
- Toward Large-Scale Image Segmentation on Summit. Sudip K. Seal, Seung-Hwan Lim, Dali Wang, Jacob D. Hinkle, Dalton D. Lunga, Aristeidis Tsaris.
- Deep Reinforcement Learning based Elasticity-compatible Heterogeneous Resource Management for Time-critical Computing. Zixia Liu, Liqiang Wang, Gang Quan.
- Detecting Anomalous Computation with RNNs on GPU-Accelerated HPC Machines. Pengfei Zou, Ang Li, Kevin J. Barker, Rong Ge 0002.
- Optimizing Linearizable Bulk Operations on Data Structures. Matthew Rodriguez, Michael Spear.
- Scalable Coordination of Hierarchical Parallelism. Vinay Devadas, Matthew Curtis-Maury.
- A GPU Register File using Static Data Compression. Alexandra Angerd, Erik Sintorn, Per Stenström.
- Enabling performance portability of data-parallel OpenMP applications on asymmetric multicore processors. Juan Carlos Saez, Fernando Castro, Manuel Prieto-Matías.
- DQEMU: A Scalable Emulator with Retargetable DBT on Distributed Platforms. Ziyi Zhao, Zhang Jiang, Ximing Liu, Xiaoli Gong, Wenwen Wang, Pen-Chung Yew.
- PS: Periodic Strategy for the 40-100Gbps Energy Efficient Ethernet. Wanchun Jiang, Kaiqin Liao, Yulong Yan, Jianxin Wang.
- On Network Locality in MPI-Based HPC Applications. Felix Zahn, Holger Fröning.
- Memory-Centric Communication Mechanism for Real-time Autonomous Navigation Applications. Wei Liu, Yifan Gong, Hao Wu, Jidong Zhai, Jiangming Jin.