A Dataset and an Approach for Identity Resolution of 38 Million Author IDs extracted from 2B Git Commits

Tanner Fry, Tapajit Dey, Andrey Karnauch, Audris Mockus. A Dataset and an Approach for Identity Resolution of 38 Million Author IDs extracted from 2B Git Commits. In Sunghun Kim 0001, Georgios Gousios, Sarah Nadi, Joseph Hejderup, editors, MSR '20: 17th International Conference on Mining Software Repositories, Seoul, Republic of Korea, 29-30 June, 2020. pages 518-522, ACM, 2020. [doi]

Abstract

Abstract is missing.