Abstract is missing.
- Coarse grain parallelization of deep neural networksMarc González Tallada. 1 [doi]
- High performance model based image reconstructionXiao Wang, Amit Sabne, Sherman J. Kisner, Anand Raghunathan, Charles A. Bouman, Samuel P. Midkiff. 2 [doi]
- Exploiting accelerators for efficient high dimensional similarity searchSandeep R. Agrawal, Christopher M. Dee, Alvin R. Lebeck. 3 [doi]
- Declarative coordination of graph-based parallel programsFlávio Cruz, Ricardo Rocha, Seth Copen Goldstein. 4 [doi]
- Distributed HalideTyler Denniston, Shoaib Kamil, Saman P. Amarasinghe. 5 [doi]
- Parallel type-checking with haskell using saturating LVars and stream generatorsRyan R. Newton, Ömer S. Agacan, Peter P. Fogg, Sam Tobin-Hochstadt. 6 [doi]
- Articulation points guided redundancy elimination for betweenness centralityLei Wang 0004, Fan Yang, Liangji Zhuang, Huimin Cui, Fang Lv, Xiaobing Feng. 7 [doi]
- Multi-core on-the-fly SCC decompositionVincent Bloemen, Alfons Laarman, Jaco van de Pol. 8 [doi]
- A high-performance parallel algorithm for nonnegative matrix factorizationRamakrishnan Kannan, Grey Ballard, Haesun Park. 9 [doi]
- AUTOGEN: automatic discovery of cache-oblivious parallel recursive algorithms for solving dynamic programsRezaul Alam Chowdhury, Pramod Ganapathi, Jesmin Jahan Tithi, Charles Bachmeier, Bradley C. Kuszmaul, Charles E. Leiserson, Armando Solar-Lezama, Yuan Tang. 10 [doi]
- Gunrock: a high-performance graph processing library on the GPUYangzihao Wang, Andrew A. Davidson, Yuechao Pan, Yuduo Wu, Andy Riffel, John D. Owens. 11 [doi]
- GPU multisplitSaman Ashkiani, Andrew A. Davidson, Ulrich Meyer, John D. Owens. 12 [doi]
- Keep calm and react with foresight: strategies for low-latency and energy-efficient elastic data stream processingTiziano De Matteis, Gabriele Mencagli. 13 [doi]
- Work stealing for interactive services to meet target latencyJing Li, Kunal Agrawal, Sameh Elnikety, Yuxiong He, I-Ting Angelina Lee, Chenyang Lu, Kathryn S. McKinley. 14 [doi]
- Adding approximate countersGuy L. Steele Jr., Jean-Baptiste Tristan. 15 [doi]
- A wait-free queue as fast as fetch-and-addChaoran Yang, John M. Mellor-Crummey. 16 [doi]
- Lease/release: architectural support for scaling contended data structuresSyed Kamran Haider, William Hasenplaugh, Dan Alistarh. 17 [doi]
- Optimistic concurrency with OPTIKRachid Guerraoui, Vasileios Trigonakis. 18 [doi]
- Refined transactional lock elisionDave Dice, Alex Kogan, Yossi Lev. 19 [doi]
- Drinking from both glasses: combining pessimistic and optimistic tracking of cross-thread dependencesMan Cao, Minjia Zhang, Aritra Sengupta, Michael D. Bond. 20 [doi]
- Be my guest: MCS lock now welcomes guestsTianzheng Wang, Milind Chabbi, Hideaki Kimura. 21 [doi]
- Contention-conscious, locality-preserving locksMilind Chabbi, John M. Mellor-Crummey. 22 [doi]
- DomLock: a new multi-granularity locking technique for hierarchiesSaurabh Kalikar, Rupesh Nasre. 23 [doi]
- Benchmarking weak memory modelsCarl G. Ritson, Scott Owens. 24 [doi]
- The virtues of conflict: analysing modern concurrencyGanesh Narayanaswamy, Saurabh Joshi 0001, Daniel Kroening. 25 [doi]
- Causal consistency: beyond memoryMatthieu Perrin, Achour Mostéfaoui, Claude Jard. 26 [doi]
- ESTIMA: extrapolating scalability of in-memory applicationsGeorgios Chatzopoulos, Aleksandar Dragojevic, Rachid Guerraoui. 27 [doi]
- Grain graphs: OpenMP performance analysis made easyAnanya Muddukrishna, Peter A. Jonsson, Artur Podobas, Mats Brorsson. 28 [doi]
- Production-guided concurrency debuggingNuno Machado, Brandon Lucia, Luís E. T. Rodrigues. 29 [doi]
- Affinity-aware work-stealing for integrated CPU-GPU processorsNaila Farooqui, Rajkishore Barik, Brian T. Lewis, Tatiana Shpeisman, Karsten Schwan. 30 [doi]
- An interval constrained memory allocator for the Givy GAS runtimeFrançois Gindraud, Fabrice Rastello, Albert Cohen 0001, François Broquedis. 31 [doi]
- A programming system for future proofing performance critical librariesLi-Wen Chang, Izzat El Hajj, Hee-Seok Kim, Juan Gómez-Luna, Abdul Dakkak, Wen-mei W. Hwu. 32 [doi]
- A scalable lock-free hash table with open addressingJesper Puge Nielsen, Sven Karlsson. 33 [doi]
- and general?(!)Tobias Maier, Peter Sanders, Roman Dementiev. 34 [doi]
- CUDA acceleration for Xen virtual machines in infiniband clusters with rCUDAJavier Prades, Carlos Reaño, Federico Silla. 35 [doi]
- Effect of portable fine-grained locality on energy efficiency and performance in concurrent search treesIbrahim Umar, Otto J. Anshus, Phuong Hoai Ha. 36 [doi]
- Efficient distributed workstealing via matchmakingHrushit Parikh, Vinit Deodhar, Ada Gavrilovska, Santosh Pande. 37 [doi]
- Data-centric combinatorial optimization of parallel codeHao Luo, Guoyang Chen, Pengcheng Li, Chen Ding, Xipeng Shen. 38 [doi]
- DSMR: a shared and distributed memory algorithm for single-source shortest path problemSaeed Maleki, Donald Nguyen, Andrew Lenharth, María Jesús Garzarán, David A. Padua, Keshav Pingali. 39 [doi]
- Generic messages: capability-based shared memory parallelism for event-loop systemsLuca Salucci, Daniele Bonetta, Stefan Marr, Walter Binder. 40 [doi]
- Hybrid CPU-GPU scheduling and execution of tree traversalsJianqiao Liu, Nikhil Hegde, Milind Kulkarni. 41 [doi]
- Improving efficacy of internal binary search trees using local recoveryArunmoezhi Ramachandran, Neeraj Mittal. 42 [doi]
- Merge-based sparse matrix-vector multiplication (SpMV) using the CSR storage formatDuane Merrill, Michael Garland. 43 [doi]
- NUMA-aware scheduling and memory allocation for data-flow task-parallel applicationsAndi Drebes, Antoniu Pop, Karine Heydemann, Nathalie Drach, Albert Cohen 0001. 44 [doi]
- On designing NUMA-aware concurrency control for scalable transactional memoryMohamed Mohamedin, Roberto Palmieri, Sebastiano Peluso, Binoy Ravindran. 45 [doi]
- On ordering transaction commitMohamed M. Saad, Roberto Palmieri, Binoy Ravindran. 46 [doi]
- OPR: deterministic group replay for one-sided communicationXuehai Qian, Koushik Sen, Paul Hargrove, Costin Iancu. 47 [doi]
- Preemption-aware planning on big-data systemsMarco Rabozzi, Matteo Mazzucchelli, Roberto Cordone, Giovanni Matteo Fumarola, Marco D. Santambrogio. 48 [doi]
- Samsara parallel: a non-BSP parallel-in-time modelYifeng Chen, Kun Huang, Bei Wang, Guohui Li, Xiang Cui. 49 [doi]
- Scalable adaptive NUMA-aware lock: combining local locking and remote locking for efficient concurrencyMingzhe Zhang, Francis C. M. Lau, Cho-Li Wang, Luwei Cheng, Haibo Chen. 50 [doi]
- SPIRIT: a runtime system for distributed irregular tree applicationsNikhil Hegde, Jianqiao Liu, Milind Kulkarni. 51 [doi]
- Tidex: a mutual exclusion lockPedro Ramalhete, Andreia Correia. 52 [doi]
- Unifying fixed code and fixed data mapping of load-imbalanced pipelined loopsAristeidis Mastoras, Thomas R. Gross. 53 [doi]
- User-assisted storage reuse determination for dynamic task graphsMehmet Can Kurt, Bin Ren, Sriram Krishnamoorthy, Gagan Agrawal. 54 [doi]
- Verification of MPI Java programs using software model checkingWaqas ur Rehman, Muhammad Sohaib Ayub, Junaid Haroon Siddiqui. 55 [doi]