Gregory Zynda, Niall Gaffney, Mehmet Dalkilic, Matthew W. Vaughn. Feature frequency profiles for automatic sample identification using PySpark. In Proceedings of the 5th Workshop on Python for High-Performance and Scientific Computing, PyHPC 2015, Austin, Texas, USA, November 15, 2015. ACM, 2015.