MSR 2021
Mon 17 - Wed 19 May 2021
co-located with ICSE 2021
Mon 17 May 2021 10:16 - 10:19 at MSR Room 1 - Resources for MSR Research Chair(s): Felipe Ebert

Name disambiguation is a complex but highly relevant challenge when analysing real-world user data, such as data from version control systems. We propose gambit, a rule-based disambiguation tool that relies only on name and email information. We evaluate its performance against two commonly used algorithms with similar characteristics, using manually disambiguated ground-truth data from the Gnome GTK project. Our results show that gambit significantly outperforms both algorithms in terms of precision as well as F1 score.
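The abstract reports precision and F1 score against manually disambiguated ground truth. As a rough illustration of how such scores can be computed for a disambiguation result, the sketch below uses pairwise scoring: every pair of aliases placed in the same identity counts as one prediction. This is an assumption made for illustration, not necessarily the evaluation procedure used by the authors, and it does not reproduce gambit's rules. The function names and the alias data are hypothetical.

```python
from itertools import combinations


def same_identity_pairs(labels):
    """Return all alias pairs that share an identity label.

    `labels` maps an alias (e.g. a name/email string) to an identity label.
    """
    return {
        frozenset((a, b))
        for a, b in combinations(sorted(labels), 2)
        if labels[a] == labels[b]
    }


def pairwise_scores(predicted, truth):
    """Compute pairwise precision, recall and F1 of `predicted` against `truth`."""
    pred_pairs = same_identity_pairs(predicted)
    true_pairs = same_identity_pairs(truth)
    true_positives = len(pred_pairs & true_pairs)

    # Convention chosen here: with no pairs to judge, the score defaults to 1.0.
    precision = true_positives / len(pred_pairs) if pred_pairs else 1.0
    recall = true_positives / len(true_pairs) if true_pairs else 1.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical aliases: three belong to one developer, one to another.
truth = {
    "Alice A <alice@example.org>": "alice",
    "alice <alice@example.org>": "alice",
    "A. Alice <aa@example.com>": "alice",
    "Bob B <bob@example.org>": "bob",
}
# A hypothetical tool output that merges one alias with the wrong developer.
predicted = {
    "Alice A <alice@example.org>": 0,
    "alice <alice@example.org>": 0,
    "A. Alice <aa@example.com>": 1,
    "Bob B <bob@example.org>": 1,
}

precision, recall, f1 = pairwise_scores(predicted, truth)
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```

In this toy case the tool proposes two merged pairs, of which one is correct, while the ground truth contains three pairs. A wrong merge lowers precision, and a missed merge lowers recall.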

Mon 17 May

Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna

10:00 - 10:50
Resources for MSR Research
Technical Papers / Data Showcase at MSR Room 1
Chair(s): Felipe Ebert Eindhoven University of Technology
10:01
3m
Talk
PSIMiner: A Tool for Mining Rich Abstract Syntax Trees from Code
Technical Papers
Egor Spirin JetBrains Research; National Research University Higher School of Economics, Egor Bogomolov JetBrains Research, Vladimir Kovalenko JetBrains Research, Timofey Bryksin JetBrains Research, Saint Petersburg State University
Pre-print
10:04
3m
Talk
Mining DEV for social and technical insights about software development
Technical Papers
Maria Papoutsoglou Aristotle University of Thessaloniki, Johannes Wachs Vienna University of Economics and Business & Complexity Science Hub Vienna, Georgia Kapitsaki University of Cyprus
Pre-print
10:07
3m
Talk
TNM: A Tool for Mining of Socio-Technical Data from Git Repositories
Technical Papers
Nikolai Sviridov ITMO University, Mikhail Evtikhiev JetBrains Research, Vladimir Kovalenko JetBrains Research
Pre-print
10:10
3m
Talk
Identifying Versions of Libraries used in Stack Overflow Code Snippets
Technical Papers
Ahmed Zerouali Vrije Universiteit Brussel, Camilo Velázquez-Rodríguez Vrije Universiteit Brussel, Coen De Roover Vrije Universiteit Brussel
Pre-print Media Attached
10:13
3m
Talk
Sampling Projects in GitHub for MSR Studies
Data Showcase
Ozren Dabic Software Institute, Università della Svizzera italiana (USI), Switzerland, Emad Aghajani Software Institute, USI Università della Svizzera italiana, Gabriele Bavota Software Institute, USI Università della Svizzera italiana
Pre-print
10:16
3m
Talk
gambit – An Open Source Name Disambiguation Tool for Version Control Systems
Technical Papers
Christoph Gote Chair of Systems Design, ETH Zurich, Christian Zingg Chair of Systems Design, ETH Zurich
Pre-print Media Attached
10:19
31m
Live Q&A
Discussions and Q&A
Technical Papers

