Learning Autocompletion from Real-World Datasets (ICSE 2021 - SEIP - Software Engineering in Practice) - ICSE 2021

Write a Blog >>

Mon 17 May - Sat 5 June 2021

Who

Gareth Aye, Seohyun Kim, Hongyu Li

Track

ICSE 2021 SEIP - Software Engineering in Practice

Time Zone

The program is currently displayed in (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna.

Use conference time zone: (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, ViennaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

When

Wed 26 May 2021 19:30 - 19:50 at Blended Sessions Room 3 - 2.5.3. Code Completion Chair(s): Marsha Chechik
Thu 27 May 2021 07:30 - 07:50 at Blended Sessions Room 3 - 2.5.3. Code Completion

Abstract

Code completion is a popular software development tool integrated into all major IDEs. Many neural language models have achieved promising results in completion suggestion prediction on synthetic benchmarks. However, a recent study When Code Completion Fails: a Case Study on Real-World Completions demonstrates that these results may not translate to improvements in real-world performance. To combat this effect, we train models on real-world code completion examples and find that these models outperform models trained on committed source code and working version snapshots by 12.8% and 13.8% accuracy respectively. We observe this improvement across modeling technologies and show through A/B testing that it corresponds to a 6.2% increase in programmers’ actual autocompletion usage. Furthermore, our study characterizes a large corpus of logged autocompletion usages to investigate why training on real-world examples leads to stronger models.

Link to Preprint

https://arxiv.org/abs/2011.04542

Gareth Aye

Facebook, Inc.

United States

Seohyun Kim

Facebook

United States

Hongyu Li

Facebook, Inc.

YT Video

Time Zone

The program is currently displayed in (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna.

Use conference time zone: (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, ViennaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Session Program

Wed 26 May
Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change

	18:50 - 19:50	2.5.3. Code CompletionSEIP - Software Engineering in Practice / Technical Track at Blended Sessions Room 3 +12h Chair(s): Marsha Chechik University of Toronto

	18:50 20m Paper		Siri, Write the Next MethodTechnical Track Technical Track Fengcai Wen Software Institute, USI Università della Svizzera italiana, Emad Aghajani Software Institute, USI Università della Svizzera italiana, Csaba Nagy Software Institute, USI Università della Svizzera italiana, Michele Lanza Software Institute, USI Università della Svizzera italiana, Gabriele Bavota Software Institute, USI Università della Svizzera italiana Pre-print Media Attached
	19:10 20m Paper		Code Prediction by Feeding Trees to TransformersTechnical Track Technical Track Seohyun Kim Facebook, Jinman Zhao University of Wisconsin-Madison, USA, Yuchi Tian Columbia University, Satish Chandra Facebook, USA Pre-print Media Attached
	19:30 20m Paper		Learning Autocompletion from Real-World DatasetsSEIP SEIP - Software Engineering in Practice Gareth Aye Facebook, Inc., Seohyun Kim Facebook, Hongyu Li Facebook, Inc. Pre-print Media Attached

Thu 27 May
Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change

	06:50 - 07:50	2.5.3. Code CompletionTechnical Track / SEIP - Software Engineering in Practice at Blended Sessions Room 3

	06:50 20m Paper		Siri, Write the Next MethodTechnical Track Technical Track Fengcai Wen Software Institute, USI Università della Svizzera italiana, Emad Aghajani Software Institute, USI Università della Svizzera italiana, Csaba Nagy Software Institute, USI Università della Svizzera italiana, Michele Lanza Software Institute, USI Università della Svizzera italiana, Gabriele Bavota Software Institute, USI Università della Svizzera italiana Pre-print Media Attached
	07:10 20m Paper		Code Prediction by Feeding Trees to TransformersTechnical Track Technical Track Seohyun Kim Facebook, Jinman Zhao University of Wisconsin-Madison, USA, Yuchi Tian Columbia University, Satish Chandra Facebook, USA Pre-print Media Attached
	07:30 20m Paper		Learning Autocompletion from Real-World DatasetsSEIP SEIP - Software Engineering in Practice Gareth Aye Facebook, Inc., Seohyun Kim Facebook, Hongyu Li Facebook, Inc. Pre-print Media Attached