What Makes a Popular Academic AI Repository? (Journal-First)
Wed 26 May 2021 03:20 - 03:40 at Blended Sessions Room 3 - 1.3.3. Open Science
Many AI researchers publish the code, data, and other resources that accompany their papers in GitHub repositories. In this paper, we refer to these repositories as academic AI repositories. Our preliminary study shows that highly cited papers are more likely to have popular academic AI repositories (and vice versa). Hence, we perform an empirical study of academic AI repositories to highlight, for AI researchers, the good software engineering practices of the popular ones.
We collect 1,149 academic AI repositories. We label the 20% of repositories with the most stars as popular and the bottom 70% as unpopular; the remaining 10% serve as a gap between the popular and unpopular groups. We propose 21 features to characterize the software engineering practices of academic AI repositories. Our experimental results show that popular and unpopular academic AI repositories differ statistically significantly in 11 of the studied features, indicating that the two groups follow significantly different software engineering practices. Furthermore, we find that the number of links to other GitHub repositories in the README file, the number of images in the README file, and the inclusion of a license are the most important features for differentiating the two groups. Our dataset and code are publicly available to share with the community.
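The labeling scheme described above can be illustrated with a minimal sketch. This is not the authors' code: the function name `label_by_stars`, the rounding of group sizes, and the handling of ties (input order) are assumptions made only for illustration.

```python
from typing import Dict


def label_by_stars(
    stars: Dict[str, int],
    top_frac: float = 0.20,
    bottom_frac: float = 0.70,
) -> Dict[str, str]:
    """Label each repository 'popular', 'unpopular', or 'gap' by star rank.

    Hypothetical helper: group sizes are rounded to the nearest integer,
    and ties in star count keep their input order (sorted() is stable).
    """
    if top_frac < 0 or bottom_frac < 0 or top_frac + bottom_frac > 1:
        raise ValueError("fractions must be non-negative and sum to at most 1")

    # Rank repositories from most to fewest stars.
    ranked = sorted(stars, key=lambda name: stars[name], reverse=True)
    n = len(ranked)
    n_top = round(n * top_frac)
    n_bottom = round(n * bottom_frac)

    labels: Dict[str, str] = {}
    for i, name in enumerate(ranked):
        if i < n_top:
            labels[name] = "popular"
        elif i >= n - n_bottom:
            labels[name] = "unpopular"
        else:
            # Repositories in between form the gap separating the two groups.
            labels[name] = "gap"
    return labels


if __name__ == "__main__":
    # Made-up star counts, purely for demonstration.
    example = {"repo-a": 950, "repo-b": 40, "repo-c": 12, "repo-d": 7, "repo-e": 3}
    for name, label in label_by_stars(example).items():
        print(name, label)
```

Leaving a gap between the two groups keeps repositories with near-identical star counts from ending up on opposite sides of a single cutoff.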
Tue 25 May (displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna)
15:20 - 16:15 | 1.3.3. Open Science | NIER - New Ideas and Emerging Results / Journal-First Papers, at Blended Sessions Room 3 +12h | Chair(s): Sira Vegas (Universidad Politecnica de Madrid)
15:20 (20m) | Paper | What Makes a Popular Academic AI Repository? | Journal-First Papers | Yuanrui Fan (College of Computer Science and Technology, Zhejiang University); Xin Xia (Huawei Software Engineering Application Technology Lab); David Lo (Singapore Management University); Ahmed E. Hassan (School of Computing, Queen's University); Shanping Li (Zhejiang University)
15:40 (20m) | Paper | Publish or Perish, but do not Forget your Software Artifacts | Journal-First Papers | Robert Heumüller (Chair of Software Engineering, Faculty of Computer Science, Otto-von-Guericke-University Magdeburg); Sebastian Nielebock (Otto-von-Guericke University Magdeburg, Germany); Jacob Krüger (Otto von Guericke University Magdeburg); Frank Ortmeier (Otto-von-Guericke-University Magdeburg, Faculty of Computer Science, Chair of Software Engineering)
16:00 (15m) | Paper | Stop Building Castles on a Swamp! The Crisis of Reproducing Automatic Search in Evidence-based Software Engineering | NIER - New Ideas and Emerging Results | Zheng Li (University of Concepción)
Wed 26 May (displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna)
03:20 - 04:15 | 1.3.3. Open Science | Journal-First Papers / NIER - New Ideas and Emerging Results, at Blended Sessions Room 3
03:20 (20m) | Paper | What Makes a Popular Academic AI Repository? | Journal-First Papers | Yuanrui Fan (College of Computer Science and Technology, Zhejiang University); Xin Xia (Huawei Software Engineering Application Technology Lab); David Lo (Singapore Management University); Ahmed E. Hassan (School of Computing, Queen's University); Shanping Li (Zhejiang University)
03:40 (20m) | Paper | Publish or Perish, but do not Forget your Software Artifacts | Journal-First Papers | Robert Heumüller (Chair of Software Engineering, Faculty of Computer Science, Otto-von-Guericke-University Magdeburg); Sebastian Nielebock (Otto-von-Guericke University Magdeburg, Germany); Jacob Krüger (Otto von Guericke University Magdeburg); Frank Ortmeier (Otto-von-Guericke-University Magdeburg, Faculty of Computer Science, Chair of Software Engineering)
04:00 (15m) | Paper | Stop Building Castles on a Swamp! The Crisis of Reproducing Automatic Search in Evidence-based Software Engineering | NIER - New Ideas and Emerging Results | Zheng Li (University of Concepción)