Optimizing Data Shuffling in Data-Parallel Computation by Understanding User-Defined Functions

Jiaxing Zhang, Hucheng Zhou, Rishan Chen, Xuepeng Fan, Zhenyu Guo, Haoxiang Lin, Jack Y. Li, Wei Lin, Jingren Zhou, Lidong Zhou. Optimizing Data Shuffling in Data-Parallel Computation by Understanding User-Defined Functions. In Steven D. Gribble, Dina Katabi, editors, Proceedings of the 9th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2012, San Jose, CA, USA, April 25-27, 2012. pages 295-308, USENIX Association, 2012.

Abstract

Abstract is missing.