Learning Autocompletion from Real-World DatasetsSEIP
Thu 27 May 2021 07:30 - 07:50 at Blended Sessions Room 3 - 2.5.3. Code Completion
Code completion is a popular software development tool integrated into all major IDEs. Many neural language models have achieved promising results in completion suggestion prediction on synthetic benchmarks. However, a recent study When Code Completion Fails: a Case Study on Real-World Completions demonstrates that these results may not translate to improvements in real-world performance. To combat this effect, we train models on real-world code completion examples and find that these models outperform models trained on committed source code and working version snapshots by 12.8% and 13.8% accuracy respectively. We observe this improvement across modeling technologies and show through A/B testing that it corresponds to a 6.2% increase in programmers’ actual autocompletion usage. Furthermore, our study characterizes a large corpus of logged autocompletion usages to investigate why training on real-world examples leads to stronger models.
Wed 26 MayDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change
18:50 - 19:50 | 2.5.3. Code CompletionSEIP - Software Engineering in Practice / Technical Track at Blended Sessions Room 3 +12h Chair(s): Marsha Chechik University of Toronto | ||
18:50 20mPaper | Siri, Write the Next MethodTechnical Track Technical Track Fengcai Wen Software Institute, USI Università della Svizzera italiana, Emad Aghajani Software Institute, USI Università della Svizzera italiana, Csaba Nagy Software Institute, USI Università della Svizzera italiana, Michele Lanza Software Institute, USI Università della Svizzera italiana, Gabriele Bavota Software Institute, USI Università della Svizzera italiana Pre-print Media Attached | ||
19:10 20mPaper | Code Prediction by Feeding Trees to TransformersTechnical Track Technical Track Seohyun Kim Facebook, Jinman Zhao University of Wisconsin-Madison, USA, Yuchi Tian Columbia University, Satish Chandra Facebook, USA Pre-print Media Attached | ||
19:30 20mPaper | Learning Autocompletion from Real-World DatasetsSEIP SEIP - Software Engineering in Practice Pre-print Media Attached |
Thu 27 MayDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change
06:50 - 07:50 | 2.5.3. Code CompletionTechnical Track / SEIP - Software Engineering in Practice at Blended Sessions Room 3 | ||
06:50 20mPaper | Siri, Write the Next MethodTechnical Track Technical Track Fengcai Wen Software Institute, USI Università della Svizzera italiana, Emad Aghajani Software Institute, USI Università della Svizzera italiana, Csaba Nagy Software Institute, USI Università della Svizzera italiana, Michele Lanza Software Institute, USI Università della Svizzera italiana, Gabriele Bavota Software Institute, USI Università della Svizzera italiana Pre-print Media Attached | ||
07:10 20mPaper | Code Prediction by Feeding Trees to TransformersTechnical Track Technical Track Seohyun Kim Facebook, Jinman Zhao University of Wisconsin-Madison, USA, Yuchi Tian Columbia University, Satish Chandra Facebook, USA Pre-print Media Attached | ||
07:30 20mPaper | Learning Autocompletion from Real-World DatasetsSEIP SEIP - Software Engineering in Practice Pre-print Media Attached |