Abstract is missing.
- H2Cloud: Maintaining the Whole Filesystem in an Object Storage CloudMinghao Zhao, Zhenhua Li 0001, Ennan Zhai, Gareth Tyson, Chen Qian, Zhenyu Li 0001, Leiyu Zhao. [doi]
- Scalable Solutions for Automated Single Pulse Identification and Classification in Radio AstronomyThomas R. Devine, Katerina Goseva-Popstojanova, Di Pang. [doi]
- Characterizing the Impact of Soft Errors Affecting Floating-point ALUs using RTL-Ievel Fault InjectionOmer Subasi, Chun-Kai Chang, Mattan Erez, Sriram Krishnamoorthy. [doi]
- Charging Task Scheduling for Directional Wireless Charger NetworksHaipeng Dai, Ke Sun, Alex X. Liu, Lijun Zhang, Jiaqi Zheng, Guihai Chen. [doi]
- A Multilevel Subtree Method for Single and Batched Sparse Cholesky FactorizationMeng Tang, Mohamed Gadou, Steven C. Rennich, Timothy A. Davis 0001, Sanjay Ranka. [doi]
- PBCS: An Efficient Parallel Characteristic Set Method for Solving Boolean Polynomial SystemsJuan Zhao, Junqiang Song, Min Zhu, Jincai Li, Zhenyu Huang, Xiaoyong Li, Xiaoli Ren. [doi]
- CAMPS: Conflict-Aware Memory-Side Prefetching Scheme for Hybrid Memory CubeMuhammad M. Rafique, Zhichun Zhu. [doi]
- GLP4NN: A Convergence-invariant and Network-agnostic Light-weight Parallelization Framework for Deep Neural Networks on Modern GPUsHao Fu, Shanjiang Tang, Bingsheng He, Ce Yu, Jizhou Sun. [doi]
- NFV Middlebox Placement with Balanced Set-up Cost and Bandwidth ConsumptionYang Chen, Jie Wu. [doi]
- Modeling Application Resilience in Large-scale Parallel ExecutionKai Wu, Wenqian Dong, Qiang Guan, Nathan DeBardeleben, Dong Li. [doi]
- ParaPLL: Fast Parallel Shortest-path Distance Query on Large-scale Weighted GraphsKun Qiu, Yuanyang Zhu, Jing Yuan, Jin Zhao, Xin Wang, Tilman Wolf. [doi]
- Using Static Allocation Algorithms for Matrix Matrix Multiplication on Multicores and GPUsLionel Eyraud-Dubois, Thomas Lambert. [doi]
- ran-GJS: Orchestrating Data Analytics for Heterogeneous Geo-distributed EdgesYibo Jin, Zhuzhong Qian, Song Guo 0001, Sheng Zhang 0001, Xiaoliang Wang, Sanglu Lu. [doi]
- Scalable Behavioral Emulation of Extreme-Scale Systems Using Structural Simulation ToolkitAjay Ramaswamy, Nalini Kumar, Aravind Neelakantan, Herman Lam, Greg Stitt. [doi]
- Reducing Communication in Proximal Newton Methods for Sparse Least Squares ProblemsSaeed Soori, Aditya Devarakonda, Zachary Blanco, James Demmel, Mert Gürbüzbalaban, Maryam Mehri Dehnavi. [doi]
- SPECTR: Scalable Parallel Short Read Error Correction on Multi-core and Many-core ArchitecturesKai Xu, Robin Kobus, Yuandong Chan, Ping Gao, Xiangxu Meng, Yanjie Wei, Bertil Schmidt, Weiguo Liu. [doi]
- Massively Parallel Huffman Decoding on GPUsAndré Weißenberger, Bertil Schmidt. [doi]
- Interference between I/O and MPI Traffic on Fat-tree NetworksKevin A. Brown, Nikhil Jain, Satoshi Matsuoka, Martin Schulz 0001, Abhinav Bhatele. [doi]
- Balanced k-means for Parallel Geometric PartitioningMoritz von Looz, Charilaos Tzovas, Henning Meyerhenke. [doi]
- Leverage Redundancy in Hardware Transactional Memory to Improve Cache ReliabilityZhichao Yan, Hong Jiang 0001, Witawas Srisa-an, Sharad C. Seth, Yujuan Tan. [doi]
- Power Efficient High Performance Packet I/OXuesong Li, Wenxue Cheng, Tong Zhang 0018, Jing Xie, Fengyuan Ren, BaiLong Yang. [doi]
- HUS-Graph: I/O-Efficient Out-of-Core Graph Processing with Hybrid Update StrategyXianghao Xu, Fang Wang 0001, Hong Jiang, Yongli Cheng, Dan Feng 0001, Yongxuan Zhang. [doi]
- Joint Optimization of MapReduce Scheduling and Network Policy in Hierarchical CloudsDonglin Yang, Wei Rang, Dazhao Cheng. [doi]
- Accelerating FM-index Search for Genomic Data ProcessingYuanrong Wang, Xueqi Li, Dawei Zang, Guangming Tan, Ninghui Sun. [doi]
- Varbench: an Experimental Framework to Measure and Characterize Performance VariabilityBrian Kocoloski, John R. Lange. [doi]
- Partitioning and Communication Strategies for Sparse Non-negative Matrix FactorizationOguz Kaya, Ramakrishnan Kannan, Grey Ballard. [doi]
- Optimizing for KNL Usage Modes When Data Doesn't Fit in MCDRAMNeil Butcher, Stephen L. Olivier, Jonathan W. Berry, Simon D. Hammond, Peter M. Kogge. [doi]
- Nemo: NUMA-aware Concurrency Control for Scalable Transactional MemoryMohamed Mohamedin, Sebastiano Peluso, Masoomeh Javidi Kishi, Ahmed Hassan, Roberto Palmieri. [doi]
- Efficient SSD Caching by Avoiding Unnecessary Writes using Machine LearningHua Wang, Xinbo Yi, Ping Huang, Bin Cheng, Ke Zhou 0001. [doi]
- A Communication-Efficient Causal Broadcast ProtocolJoão Paulo de Araujo, Luciana Arantes, Elias Procópio Duarte Jr., Luiz A. Rodrigues, Pierre Sens. [doi]
- NumLock: Towards Optimal Multi-Granularity Locking in HierarchiesSaurabh Kalikar, Rupesh Nasre. [doi]
- Disk Failure Prediction in Data Centers via Online LearningJiang Xiao, Zhuang Xiong, Song Wu, Yusheng Yi, Hai Jin 0001, Kan Hu. [doi]
- Massively Scaling the Metal Microscopic Damage Simulation on Sunway TaihuLight SupercomputerShigang Li 0002, Baodong Wu, Yunquan Zhang, Xianmeng Wang, Jianjiang Li, Changjun Hu, Jue Wang, Yangde Feng, Ningming Nie. [doi]
- Efficient Search for Free Blocks in the WAFL File SystemRam Kesavan, Matthew Curtis-Maury, Mrinal K. Bhattacharjee. [doi]
- PRIONN: Predicting Runtime and IO using Neural NetworksMichael R. Wyatt II, Stephen Herbein, Todd Gamblin, Adam Moody, Dong H. Ahn, Michela Taufer. [doi]
- Vectorized Parallel Sparse Matrix-Vector Multiplication in PETSc Using AVX-512Hong Zhang, Richard Tran Mills, Karl Rupp, Barry F. Smith. [doi]
- A Framework for Auto-Parallelization and Code Generation: An Integrative Case Study with Legacy FORTRAN CodesKonstantinos Krommydas, Paul Sathre, Ruchira Sasanka, Wu-chun Feng. [doi]
- Duchy: Achieving Both SSD Durability and Controllable SMR Cleaning Overhead in Hybrid Storage SystemsXuchao Xie, Tianye Yang, Qiong Li, Dengping Wei, Liquan Xiao. [doi]
- Optimization of the Spherical Harmonics Transform based Tree Traversals in the Helmholtz FMM AlgorithmMichael P. Lingg, Stephen M. Hughey, Hasan Metin Aktulga. [doi]
- Heterogeneous Wireless Charger Placement with ObstaclesXiaoyu Wang, Haipeng Dai, Weijun Wang, Jiaqi Zheng, Guihai Chen, Wanchun Dou, Xiaobing Wu. [doi]
- Bandwidth Reduced Parallel SpMV on the SW26010 Many-Core PlatformQiao Sun, Changyou Zhang, Changmao Wu, Jiajia Zhang, Leisheng Li. [doi]
- Cache Assisted Randomized Sharing Counters in Network MeasurementQian Liu, Haipeng Dai, Alex X. Liu, Qi Li, Xiaoyu Wang, Jiaqi Zheng. [doi]
- A Comprehensive Study on Bugs in Actor SystemsBrandon Hedden, Xinghui Zhao. [doi]
- Topology-induced Enhancement of MappingsRoland Glantz, Maria Predari, Henning Meyerhenke. [doi]
- Efficient Runtime Support for a Partitioned Global Logical Address SpaceD. Brian Larkins, John Snyder, James Dinan. [doi]
- Cross-Rack-Aware Updates in Erasure-Coded Data CentersZhirong Shen, Patrick P. C. Lee. [doi]
- IS-ASGD: Accelerating Asynchronous SGD using Importance SamplingFei Wang, Xiaofeng Gao, Jun Ye, Guihai Chen. [doi]
- Performance & Energy Tradeoffs for Dependent Distributed Applications Under System-wide Power CapsHuazhe Zhang, Henry Hoffmann. [doi]
- The Case for Semi-Permanent Cache Occupancy: Understanding the Impact of Data Locality on Network ProcessingMatthew G. F. Dosanjh, S. Mahdieh Ghazimirsaeed, Ryan E. Grant, Whit Schonbein, Michael J. Levenhagen, Patrick G. Bridges, Ahmad Afsahi. [doi]
- Constructing Dynamic Policies for Paging Mode SelectionJason Hiebel, Laura E. Brown, Zhenlin Wang. [doi]
- Memory Coalescing for Hybrid Memory CubeXi Wang, John D. Leidel, Yong Chen. [doi]
- DAG-SFC: Minimize the Embedding Cost of SFC with Parallel VNFsXu Lin, Deke Guo, Yulong Shen, Guoming Tang, Bangbang Ren. [doi]
- Task-parallel Analysis of Molecular Dynamics TrajectoriesIoannis Paraskevakos, André Luckow, Mahzad Khoshlessan, George Chantzialexiou, Thomas E. Cheatham, Oliver Beckstein, Geoffrey C. Fox, Shantenu Jha. [doi]
- KeyBin2: Distributed Clustering for Scalable and In-Situ AnalysisXinyu Chen, Jeremy Benson, Matt Peterson, Michela Taufer, Trilce Estrada. [doi]
- Revisiting Multi-pass Scatter and Gather on GPUsZhuohang Lai, Qiong Luo 0001, Xiaoying Jia 0001. [doi]
- Load-Balanced Slim Fly NetworksMd Shafayat Rahman, Md Atiqul Mollah, Peyman Faizian, Xin Yuan 0001. [doi]
- Integrating Low-latency Analysis into HPC System MonitoringRamin Izadpanah, Nichamon Naksinehaboon, Jim M. Brandt, Ann C. Gentile, Damian Dechev. [doi]
- Improving Resource Utilization through Demand Aware Process SchedulingBrandon Nesterenko, Qing Yi, Jia Rao. [doi]
- Improving MPI Multi-threaded RMA Communication PerformanceNathan Hjelm, Matthew G. F. Dosanjh, Ryan E. Grant, Taylor L. Groves, Patrick G. Bridges, Dorian C. Arnold. [doi]
- Energy-Efficient Speculative Execution using Advanced Reservation for Heterogeneous ClustersAmelie Chi Zhou, Tien-Dat Phan, Shadi Ibrahim, Bingsheng He. [doi]
- Unveiling Thread Communication Bottlenecks Using Hardware-Independent MetricsArya Mazaheri, Felix Wolf 0001, Ali Jannesari. [doi]
- A Generic Approach to Scheduling and Checkpointing WorkflowsLi Han, Valentin Le Fèvre, Louis-Claude Canon, Yves Robert, Frédéric Vivien. [doi]
- Vectorised Computation of Diverging EnsemblesJan Hückelheim, Paul D. Hovland, Sri Hari Krishna Narayanan, Paulius Velesko. [doi]
- Less Provisioning: A Fine-grained Resource Scaling Engine for Long-running Services with Tail Latency GuaranteesBinlei Cai, Rongqi Zhang, Laiping Zhao, Keqiu Li. [doi]
- Toward Performant and Energy-efficient Queries in Three-tier Wireless Sensor NetworksJiayao Wang, Abdullah Al Mamun, Tonglin Li, Linhua Jiang, Dongfang Zhao. [doi]
- Dual-Paradigm Stream ProcessingSong Wu 0001, Zhiyi Liu, Shadi Ibrahim, Lin Gu, Hai Jin 0001, Fei Chen. [doi]
- Matrix Factorization on GPUs with Memory Optimization and Approximate ComputingWei Tan, Shiyu Chang, Liana Fong, Cheng Li, Zijun Wang, Liangliang Cao. [doi]
- A Write-efficient and Consistent Hashing Scheme for Non-Volatile MemoryXiaoyi Zhang, Dan Feng 0001, Yu Hua 0001, Jianxi Chen, Mandi Fu. [doi]
- Improving First Level Cache Efficiency for GPUs Using Dynamic Line ProtectionXian Zhu, Robert Wernsman, Joseph Zambreno. [doi]
- ImageNet Training in MinutesYang You, Zhao Zhang, Cho-Jui Hsieh, James Demmel, Kurt Keutzer. [doi]
- Energy-efficient Application Resource Scheduling using Machine Learning ClassifiersConnor Imes, Steven A. Hofmeyr, Henry Hoffmann. [doi]
- MPI-Vector-IO: Parallel I/O and Partitioning for Geospatial Vector DataSatish Puri, Anmol Paudel, Sushil K. Prasad. [doi]
- FFS-VA: A Fast Filtering System for Large-scale Video AnalyticsChen Zhang, Qiang Cao, Hong Jiang, Wenhui Zhang, Jingjun Li, Jie Yao. [doi]
- A Fast Sparse Triangular Solver for Structured-grid Problems on Sunway Many-core Processor SW26010Xinliang Wang, Ping Xu, Wei Xue, Yulong Ao, Chao Yang 0002, Haohuan Fu, Lin Gan, Guangwen Yang, Weimin Zheng. [doi]
- UHCL-Darknet: An OpenCL-based Deep Neural Network Framework for Heterogeneous Multi-/Many-core ClustersLonglong Liao, Kenli Li, Keqin Li, Canqun Yang, Qi Tian 0001. [doi]
- Implementing Push-Pull Efficiently in GraphBLASCarl Yang, Aydin Buluç, John D. Owens. [doi]
- NumaMMA: NUMA MeMory AnalyzerFrançois Trahay, Manuel Selva, Lionel Morel, Kevin Marquet. [doi]
- MND-MST: A Multi-Node Multi-Device Parallel Boruvka's MST AlgorithmRintu Panja, Sathish Vadhiyar. [doi]
- A Performance Model to Execute Workflows on High-Bandwidth-Memory ArchitecturesAnne Benoit, Swann Perarnau, Loïc Pottier, Yves Robert. [doi]
- Combining Task-based Parallelism and Adaptive Mesh Refinement Techniques in Molecular Dynamics SimulationsRaphaël Prat, Laurent Colombet, Raymond Namyst. [doi]
- C-Graph: A Highly Efficient Concurrent Graph Reachability Query FrameworkLi Zhou, Ren Chen, Yinglong Xia, Radu Teodorescu. [doi]
- Parallelizing Pruning-based Graph Structural ClusteringYulin Che, Shixuan Sun, Qiong Luo 0001. [doi]
- Learning Driven Parallelization for Large-Scale Video Workload in Hybrid CPU-GPU ClusterHaitao Zhang, Bingchang Tang, Xin Geng, Huadong Ma. [doi]
- Click-Based Asynchronous Mesh Network with Bounded Bundled DataAnping He, Guangbo Feng, Jilin Zhang, Pengfei Li, Yong Hei, Hong Chen. [doi]
- FULT: Fast User-Level Thread Scheduling Using Bit-VectorsHoang-Vu Dang, Marc Snir. [doi]
- Reference-distance Eviction and Prefetching for Cache Management in SparkTiago B. G. Perez, Xiaobo Zhou 0002, Dazhao Cheng. [doi]
- Communication-Avoiding for Dynamical Core of Atmospheric General Circulation ModelJunmin Xiao, Shigang Li 0002, Baodong Wu, He Zhang, Kun Li, Erlin Yao, Yunquan Zhang, Guangming Tan. [doi]
- A Distributed Infomap Algorithm for Scalable and High-Quality Community DetectionJianping Zeng, Hongfeng Yu. [doi]
- An Empirical Comparison of k-Shortest Simple Path Algorithms on MulticoresDeepak Ajwani, Erika Duriakova, Neil Hurley, Ulrich Meyer 0001, Alexander Schickedanz. [doi]
- Index Shard Replication Strategies for Improving Resource Utilization in Large Scale Search EnginesYusen Li, Xueyan Tang, Wentong Cai, Jiancong Tong, Xiaoguang Liu, Gang Wang 0001, Chuansong Gao, Xuan Cao, Guanhui Geng, Minghui Li. [doi]
- CSTF: Large-Scale Sparse Tensor Factorizations on Distributed PlatformsZachary Blanco, Bangtian Liu, Maryam Mehri Dehnavi. [doi]