RRPlib: A spark library for representing HDFS blocks as a set of random sample data blocks

Tamer Z. Emara, Joshua Zhexue Huang. RRPlib: A spark library for representing HDFS blocks as a set of random sample data blocks. Science of Computer Programming, 184, 2019. [doi]

@article{EmaraH19-0,
  title = {RRPlib: A spark library for representing HDFS blocks as a set of random sample data blocks},
  author = {Tamer Z. Emara and Joshua Zhexue Huang},
  year = {2019},
  doi = {10.1016/j.scico.2019.102301},
  url = {https://doi.org/10.1016/j.scico.2019.102301},
  researchr = {https://researchr.org/publication/EmaraH19-0},
  cites = {0},
  citedby = {0},
  journal = {Science of Computer Programming},
  volume = {184},
}