The International Conference on Mining Software Repositories (MSR) has hosted a mining challenge since 2006. With this challenge, we dare researchers and practitioners to put their mining tools and approaches to the test on a common dataset.
The important dates for the Mining Challenge are:
- Abstracts due: February 1, 2019 (AOE)
- Papers due: February 6, 2019 (AOE)
- Author notification: March 1, 2019 (AOE)
- Camera-ready: March 15, 2019 (AOE)
Please see the Call for Mining Challenge Papers for all details.
Call for Mining Challenge Papers
This year, the challenge is about mining SOTorrent, a dataset providing the version history of Stack Overflow posts at the level of whole posts and individual text and code blocks. Moreover, the dataset connects Stack Overflow posts to other platforms by aggregating URLs from text blocks and comments, and by collecting references from GitHub files to Stack Overflow posts. Analyses can be based on SOTorrent alone or expanded to also include data from other resources such as GHTorrent. The overall goal is to study the origin, evolution, and usage of Stack Overflow code snippets. Questions that are, to the best of our knowledge, not sufficiently answered yet include:
- How are code snippets on Stack Overflow maintained?
- How many clones of code snippets exist inside Stack Overflow?
- How can we detect buggy versions of Stack Overflow code snippets and find them in GitHub projects?
- How frequently are code snippets copied from external sources into Stack Overflow, and how do they co-evolve there afterwards?
- How do snippets copied from Stack Overflow to GitHub co-evolve?
- Does the evolution of Stack Overflow code snippets follow patterns?
- Do these patterns differ between programming languages?
- Are the licenses of external sources compatible with Stack Overflow’s license (CC BY-SA 3.0)?
- How many code blocks on Stack Overflow do not contain source code (and are only used for markup)?
- Can we reliably predict bug-fixing edits to code on Stack Overflow?
- Can we reliably predict popularity of Stack Overflow code snippets on GitHub?
These are just some of the questions that could be answered using SOTorrent. We encourage challenge participants to adapt the above questions or formulate their own research questions about the origin, evolution, and usage of content on Stack Overflow.
How to Participate in the Challenge
First, familiarize yourself with the SOTorrent dataset:
- Read our MSR 2018 paper about SOTorrent and the preprint of our mining challenge proposal, which contains example queries.
- Study the project page of SOTorrent, which includes the most recent database layout and links to the online and download versions of the dataset.
- Create a new issue in the SOTorrent issue tracker if you run into problems with the dataset or want to suggest improvements.
Then, use the dataset to answer your research questions, report your findings in a four-page challenge paper (see information below), submit your abstract by February 1, 2019, and your final paper by February 6, 2019. If your paper is accepted, present your results at MSR 2019 in Montreal, Canada!
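Concretely, research questions like the ones above can be explored with SQL queries against the public BigQuery copy of SOTorrent. The sketch below, which assumes the `google-cloud-bigquery` client and Google Cloud credentials, counts how often the code blocks of a post were edited; the release id `2018_12_09` and the column names are illustrative — check the SOTorrent project page for the exact layout of the release you use.

```python
# Sketch: counting edits to code blocks per Stack Overflow post via
# SOTorrent on BigQuery. Dataset/release id and column names are
# illustrative assumptions; verify them against the SOTorrent project page.

# In SOTorrent's PostBlockVersion table, PostBlockTypeId = 2 marks code
# blocks (1 marks text blocks), so this query approximates how actively
# the code in each post is maintained.
QUERY = """
SELECT PostId, COUNT(*) AS EditCount
FROM `sotorrent-org.2018_12_09.PostBlockVersion`
WHERE PostBlockTypeId = 2
GROUP BY PostId
ORDER BY EditCount DESC
LIMIT 10
"""


def run(query: str = QUERY):
    """Execute the query on BigQuery; requires Cloud credentials."""
    # Imported lazily so the module can be inspected without the
    # google-cloud-bigquery package installed.
    from google.cloud import bigquery

    client = bigquery.Client()
    return list(client.query(query).result())
```

Calling `run()` returns the ten most-edited posts as BigQuery row objects; from there, the results can be joined with other tables (e.g., `Posts`) or exported for further analysis.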
Submission
A challenge paper should describe the results of your work by providing an introduction to the problem you address and why it is worth studying, the version of the dataset you used, the approach and tools you used, your results and their implications, and conclusions. Make sure your report highlights the contributions and the importance of your work. See also our open science policy regarding the publication of software and additional data you used for the challenge.
Challenge papers must not exceed 4 pages, plus 1 additional page containing only references, and must conform to the MSR 2019 format and submission guidelines. Each submission will be reviewed by at least three members of the program committee. Submissions should follow the IEEE Conference Proceedings Formatting Guidelines, with the title in 24pt font and the full text in 10pt type. LaTeX users must use \documentclass[10pt,conference]{IEEEtran} without including the compsoc or compsocconf option.
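For LaTeX users, a minimal skeleton satisfying these requirements might look as follows; the title, section, and abstract text are placeholders, and the author block is deliberately omitted for the double-blind submission version:

```latex
% MSR 2019 challenge paper skeleton: IEEEtran, 10pt, conference option,
% without the compsoc/compsocconf option.
\documentclass[10pt,conference]{IEEEtran}

\begin{document}

\title{Your Challenge Paper Title}
% No \author block in the submitted version (double-blind reviewing).
\maketitle

\begin{abstract}
One-paragraph summary of your findings.
\end{abstract}

\section{Introduction}
Body text in 10pt type.

\end{document}
```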
IMPORTANT: The mining challenge track of MSR 2019 follows the double-blind submission model. Submissions should not reveal the identity of the authors in any way. This means that authors should:
- leave out author names and affiliations from the body and metadata of the submitted pdf
- ensure that any citations to related work by themselves are written in the third person, for example “the prior work of XYZ” as opposed to “our prior work [2]”
- not refer to their personal, lab, or university websites; similarly, care should be taken with personal accounts on GitHub, Bitbucket, Google Drive, etc.
- not upload unblinded versions of their paper to archival websites during bidding/reviewing; however, uploading unblinded versions prior to submission is allowed and sometimes unavoidable (e.g., a thesis)
Authors with further questions about double-blind reviewing are encouraged to contact the Mining Challenge Chairs via email.
Papers must be submitted electronically through EasyChair, should not have been published elsewhere, and should not be under review or submitted for review elsewhere for the duration of consideration. ACM plagiarism policy and procedures shall be followed for cases of double submission. The submission must also comply with the IEEE Policy on Authorship.
Upon notification of acceptance, all authors of accepted papers will be asked to complete a copyright form and will receive further instructions for preparing their camera ready versions. At least one author of each accepted paper is expected to register and present the results at MSR 2019 in Montreal, Canada. All accepted contributions will be published in the electronic conference proceedings.
The official publication date is the date the proceedings are made available in the ACM or IEEE Digital Libraries. This date may be up to two weeks prior to the first day of ICSE 2019. The official publication date affects the deadline for any patent filings related to the published work. Purchase of additional pages in the proceedings is not allowed.
If you use the SOTorrent dataset, please cite our challenge proposal:
@inproceedings{msr2019challenge,
  title={SOTorrent: Studying the Origin, Evolution, and Usage of Stack Overflow Code Snippets},
  author={Baltes, Sebastian and Treude, Christoph and Diehl, Stephan},
  year={2019},
  booktitle={Proceedings of the 16th International Conference on Mining Software Repositories (MSR 2019)},
  preprint={http://empirical-software.engineering/assets/pdf/msr19-sotorrent.pdf}
}
Open Science Policy
Openness in science is key to fostering progress via transparency, reproducibility and replicability. Our steering principle is that all research output should be accessible to the public and that empirical studies should be reproducible. In particular, we actively support the adoption of open data and open source principles. To increase reproducibility and replicability, we encourage all contributing authors to disclose:
- the source code of the software they used to retrieve and analyze the data
- the (anonymized and curated) empirical data they retrieved in addition to the SOTorrent dataset
- a document with instructions for other researchers describing how to reproduce or replicate the results
Upon submission, authors can already privately share their anonymized data and software on preserved archives such as Zenodo or Figshare (a tutorial is available online). Zenodo accepts up to 50GB per dataset (more upon request); there is no need to use Dropbox or Google Drive. After acceptance, data and software should be made public so that they receive a DOI and become citable. Zenodo and Figshare accounts can easily be linked with GitHub repositories to automatically archive software releases. In the unlikely case that authors need to upload terabytes of data, Archive.org may be used.
We encourage authors to self-archive pre- and postprints of their papers in open, preserved repositories such as arXiv.org. This is legal and allowed by all major publishers, including ACM and IEEE, and it lets anyone in the world access your paper. Note that you are usually not allowed to self-archive the PDF of the published article (that is, the publisher proof or the Digital Library version).
Please note that the success of the open science initiative depends on the willingness (and possibilities) of authors to disclose their data and that all submissions will undergo the same review process independent of whether or not they disclose their analysis code or data. We encourage authors who cannot disclose industrial or otherwise non-public data, for instance due to non-disclosure agreements, to provide an explicit (short) statement in the paper.
Best Mining Challenge Paper Award
As mentioned above, all submissions will undergo the same review process independent of whether or not they disclose their analysis code or data. However, only accepted papers for which code and data are available on preserved archives, as described in the open science policy, will be considered by the program committee for the best mining challenge paper award.
Best Student Presentation Award
As in previous years, there will be a public vote during the conference to select the best mining challenge presentation. This award often goes to authors of compelling work who present an engaging story to the audience. To increase student involvement, starting with MSR 2019, only students can compete for this award.
Organization
Sebastian Baltes, University of Trier, Germany
Christoph Treude, The University of Adelaide, Australia
Stephan Diehl, University of Trier, Germany