RRPlib: A spark library for representing HDFS blocks as a set of random sample data blocks

Tamer Z. Emara, Joshua Zhexue Huang. RRPlib: A spark library for representing HDFS blocks as a set of random sample data blocks. Science of Computer Programming, 184, 2019. [doi]

Abstract

Abstract is missing.