DataHub: Collaborative Data Science & Dataset Version Management at Scale

Anant P. Bhardwaj, Souvik Bhattacherjee, Amit Chavan, Amol Deshpande, Aaron J. Elmore, Samuel Madden, Aditya G. Parameswaran. DataHub: Collaborative Data Science & Dataset Version Management at Scale. In CIDR 2015, Seventh Biennial Conference on Innovative Data Systems Research, Asilomar, CA, USA, January 4-7, 2015, Online Proceedings. www.cidrdb.org, 2015. [doi]

Abstract

Abstract is missing.