Abstract is missing.
- Julia Cloud Matrix Machine: Dynamic Matrix Language Acceleration on Multicore Clusters in the CloudJay Hwan Lee, Yeonsoo Kim, Younghyun Ryu, Wasuwee Sodsong, Hyunjun Jeon, Jinsik Park, Bernd Burgstaller, Bernhard Scholz. 1-10 [doi]
- Distributed Cell Set : A Library for Space-Dependent Communication/Computation Overlap on Manycore ClusterYoshiki Kawanishi, Patrick Finnerty, Tomio Kamada, Chikara Ohta. 11-19 [doi]
- Towards Maximum Throughput of Dataflow Software Pipeline under Resource ConstraintsSiddhisanket Raskar, Thomas Applencourt, Kalyan Kumaran, Guang Gao. 20-28 [doi]
- Studying the expressiveness and performance of parallelization abstractions for linear pipelinesAristeidis Mastoras, Albert-Jan Nicholas Yzelman. 29-38 [doi]
- Harmonic CUDA: Asynchronous Programming on GPUsJonathan D. Wapman, Sean Treichler, Serban D. Porumbescu, John D. Owens. 39-49 [doi]
- MPI-based Remote OpenMP Offloading: A More Efficient and Easy-to-use ImplementationBaodi Shan, Mauricio Araya-Polo, Abid M. Malik, Barbara M. Chapman. 50-59 [doi]
- Exploring OpenMP GPU Offloading for Implementing Convolutional Neural NetworksKewei Yan, Yaying Shi, Yonghong Yan 0001. 60-69 [doi]