Intermediate Data Placement Strategy for Different Data Skew Levels Based on Random Sampling in Spark

Xueqian Gong, Chunlin Li 0001, Youlong Luo. Intermediate Data Placement Strategy for Different Data Skew Levels Based on Random Sampling in Spark. In Proceedings of the 4th International Conference on Big Data and Computing, ICBDC 2019, Guangzhou, China, May 10-12, 2019. pages 17-23, ACM, 2019. [doi]

Abstract

Abstract is missing.