CODIT: Code Editing with Tree-Based Neural Models (ICSE 2021 - Journal-First Papers)

Who

Saikat Chakraborty, Yangruibo Ding, Miltiadis Allamanis, Baishakhi Ray

Track

ICSE 2021 Journal-First Papers

Time Zone

The program is currently displayed in (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna.

Use conference time zone: (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, ViennaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Tue 25 May 2021 15:20 - 15:40 at Blended Sessions Room 2 - 1.3.2. Deep Neural Networks: Supporting SE Tasks #1 Chair(s): Ayse Tosun
Wed 26 May 2021 03:20 - 03:40 at Blended Sessions Room 2 - 1.3.2. Deep Neural Networks: Supporting SE Tasks #1

Abstract

The way developers edit day-to-day code tends to be repetitive, often using existing code elements. Many researchers have tried to automate repetitive code changes by learning from specific change templates which are applied to limited scope. The advancement of deep neural networks and the availability of vast open-source evolutionary data opens up the possibility of automatically learning those templates from the wild. However, deep neural network based modeling for code changes and code in general introduces some specific problems that needs specific attention from research community. For instance, compared to natural language, source code vocabulary can be significantly larger. Further, good changes in code do not break its syntactic structure. Thus, deploying state-of-the-art neural network models without adapting the methods to the source code domain yields sub-optimal results. To this end, we propose a novel tree-based neural network system to model source code changes and learn code change patterns from the wild. Specifically, we propose a tree-based neural machine translation model to learn the probability distribution of changes in code. We realize our model with a change suggestion engine, CODIT, and train the model with more than 30k real-world changes and evaluate it on 6k patches. Our evaluation shows the effectiveness of CODIT in learning and suggesting patches. CODIT can also learn specific bug fix pattern from bug fixing patches and can fix 27 bugs out of 75 one line bugs in Defects4J.

Link to Publication

https://ieeexplore.ieee.org/document/9181462

Link to Preprint

https://arxiv.org/abs/1810.00314

DOI

https://doi.org/10.1109/TSE.2020.3020502

Saikat Chakraborty

Columbia University

United States

Yangruibo Ding

Columbia University

Miltiadis Allamanis

Microsoft Research, UK

United Kingdom

Baishakhi Ray

Columbia University, USA

United States

YT Video