Evaluating SZZ Implementations Through a Developer-informed OracleTechnical Track
Thu 27 May 2021 02:50 - 03:10 at Blended Sessions Room 1 - 2.3.1. Defect Prediction: Automation #1
The SZZ algorithm for identifying bug-inducing changes has been widely used to evaluate defect prediction techniques and to empirically investigate when, how, and by whom bugs are introduced. Over the years, researchers have proposed several heuristics to improve the SZZ accuracy, providing various implementations of SZZ. However, fairly evaluating those implementations on a reliable oracle is an open problem: SZZ evaluations usually rely on (i) the manual analysis of the SZZ output to classify the identified bug-inducing commits as true or false positives; or (ii) a golden set linking bug-fixing and bug-inducing commits. In both cases, these manual evaluations are performed by researchers with limited knowledge of the studied subject systems. Ideally, there should be a golden set created by the original developers of the studied systems.
We propose a methodology to build a “developer-informed” oracle for the evaluation of SZZ variants. We use Natural Language Processing (NLP) to identify bug-fixing commits in which developers explicitly reference the commit(s) that introduced a fixed bug. This was followed by a manual filtering step aimed at ensuring the quality and accuracy of the oracle. Once built, we used the oracle to evaluate several widely used variants of the SZZ algorithm in terms of their accuracy. Our evaluation helped us to distill a set of important insights and lessons learned to further improve the SZZ algorithm.
Wed 26 MayDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change
14:30 - 15:30 | 2.3.1. Defect Prediction: Automation #1Technical Track / SEIP - Software Engineering in Practice at Blended Sessions Room 1 +12h Chair(s): Carolyn Seaman University of Maryland Baltimore County | ||
14:30 20mPaper | Automatic Web Testing using Curiosity-Driven Reinforcement LearningTechnical Track Technical Track YAN ZHENG Nanyang Technological University, Yi Liu Southern University of Science and Technology, Xiaofei Xie Nanyang Technological University, Yepang Liu Southern University of Science and Technology, China, Lei Ma University of Alberta, Jianye Hao Tianjin University, Yang Liu Nanyang Technological University Pre-print Media Attached | ||
14:50 20mPaper | Evaluating SZZ Implementations Through a Developer-informed OracleTechnical Track Technical Track Giovanni Rosa University of Molise, Luca Pascarella Delft University of Technology, Simone Scalabrino University of Molise, Rosalia Tufano Università della Svizzera Italiana, Gabriele Bavota Software Institute, USI Università della Svizzera italiana, Michele Lanza Software Institute, USI Università della Svizzera italiana, Rocco Oliveto University of Molise Pre-print Media Attached | ||
15:10 20mPaper | D2A: A Dataset Built for AI-Based Vulnerability Detection Methods Using Differential AnalysisSEIP SEIP - Software Engineering in Practice Yunhui Zheng IBM Research, Saurabh Pujar IBM Research, Burn Lewis IBM Research, Luca Buratti IBM Research, Edward Epstein IBM Research, Bo Yang IBM Research, Jim A. Laredo IBM Research, USA, Alessandro Morari IBM Research, Zhong Su IBM Research Pre-print Media Attached |
Thu 27 MayDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change
02:30 - 03:30 | 2.3.1. Defect Prediction: Automation #1SEIP - Software Engineering in Practice / Technical Track at Blended Sessions Room 1 | ||
02:30 20mPaper | Automatic Web Testing using Curiosity-Driven Reinforcement LearningTechnical Track Technical Track YAN ZHENG Nanyang Technological University, Yi Liu Southern University of Science and Technology, Xiaofei Xie Nanyang Technological University, Yepang Liu Southern University of Science and Technology, China, Lei Ma University of Alberta, Jianye Hao Tianjin University, Yang Liu Nanyang Technological University Pre-print Media Attached | ||
02:50 20mPaper | Evaluating SZZ Implementations Through a Developer-informed OracleTechnical Track Technical Track Giovanni Rosa University of Molise, Luca Pascarella Delft University of Technology, Simone Scalabrino University of Molise, Rosalia Tufano Università della Svizzera Italiana, Gabriele Bavota Software Institute, USI Università della Svizzera italiana, Michele Lanza Software Institute, USI Università della Svizzera italiana, Rocco Oliveto University of Molise Pre-print Media Attached | ||
03:10 20mPaper | D2A: A Dataset Built for AI-Based Vulnerability Detection Methods Using Differential AnalysisSEIP SEIP - Software Engineering in Practice Yunhui Zheng IBM Research, Saurabh Pujar IBM Research, Burn Lewis IBM Research, Luca Buratti IBM Research, Edward Epstein IBM Research, Bo Yang IBM Research, Jim A. Laredo IBM Research, USA, Alessandro Morari IBM Research, Zhong Su IBM Research Pre-print Media Attached |