The impact of columnar file formats on SQL-on-hadoop engine performance: A study on ORC and Parquet

Todor Ivanov, Matteo Pergolesi. The impact of columnar file formats on SQL-on-hadoop engine performance: A study on ORC and Parquet. Concurrency - Practice and Experience, 32(5), 2020. [doi]

@article{IvanovP20,
  title = {The impact of columnar file formats on SQL-on-hadoop engine performance: A study on ORC and Parquet},
  author = {Todor Ivanov and Matteo Pergolesi},
  year = {2020},
  doi = {10.1002/cpe.5523},
  url = {https://doi.org/10.1002/cpe.5523},
  researchr = {https://researchr.org/publication/IvanovP20},
  cites = {0},
  citedby = {0},
  journal = {Concurrency - Practice and Experience},
  volume = {32},
  number = {5},
}