Large Scale Distributed Data Science from scratch using Apache Spark 2.0

James Shanahan, Liang Dai. Large Scale Distributed Data Science from scratch using Apache Spark 2.0. In Rick Barrett, Rick Cummings, Eugene Agichtein, Evgeniy Gabrilovich, editors, Proceedings of the 26th International Conference on World Wide Web Companion, Perth, Australia, April 3-7, 2017. pages 955-957, ACM, 2017. [doi]

@inproceedings{ShanahanD17,
  title = {Large Scale Distributed Data Science from scratch using Apache Spark 2.0},
  author = {James Shanahan and Liang Dai},
  year = {2017},
  doi = {10.1145/3041021.3051108},
  url = {http://doi.acm.org/10.1145/3041021.3051108},
  researchr = {https://researchr.org/publication/ShanahanD17},
  cites = {0},
  citedby = {0},
  pages = {955-957},
  booktitle = {Proceedings of the 26th International Conference on World Wide Web Companion, Perth, Australia, April 3-7, 2017},
  editor = {Rick Barrett and Rick Cummings and Eugene Agichtein and Evgeniy Gabrilovich},
  publisher = {ACM},
  isbn = {978-1-4503-4914-7},
}