A preview of the MSR 2013 conference, May 18-19, 2013, in San Francisco, CA. REGISTER NOW! Early registration discounts until April 14. http://msrconf.org
2. FANTASTIC KEYNOTES
Gail C. Murphy: “What is Software Development Productivity, Anyway?”
This talk will focus on the way software productivity has been characterized in
software engineering and how mining can support such a characterization.
Gail C. Murphy is a Professor of Computer Science at the University of British Columbia. She is also a Co-founder and
Chief Science Officer at Tasktop Technologies Inc. Her research interests are in improving the productivity of software
developers and knowledge workers by giving them tools to identify, manage and coordinate the information that really
matters for their work.
Brian Doll: “Striking Gold: Building Software Better, Together”
This talk will tell us how, by exploring GitHub, one can answer questions
about the success of software projects and of software development teams.
Bring your pan and pick. We're going to get our hands dirty.
Brian Doll is a technology & business hacker who has been building things on the web for 17 years. Brian works at
GitHub, helping millions of people build a better world with code.
3. CUTTING-EDGE RESEARCH
Out of over 100 submissions the Program Committee selected the best new
research results to uncover insight from software repositories. Find out about the
latest trends in analytics and data science for software development.
4. MSR TECHNICAL TRACK
Ten technical sessions with 36 presentations on the latest trends in mining, such
as mobile apps, unstructured data, classification and prediction of changes/bugs,
code review, social mining, search-driven development and software evolution.
Bug Triaging MSR Goes Mobile
Why So Complicated? Simple Term Filtering and Asking for (and about) Permissions
Weighting for Location-Based Bug Report Used by Android Apps
Assignment Recommendation Ryan Stevens, Jonathan Ganz, Premkumar Devanbu,
Ramin Shokripour, John Anvik, Zarinah Mohd Kasirun, Hao Chen, and Vladimir Filkov
and Sima Zamani
Retrieving and Analyzing Mobile Apps Feature
Which Work-Item Updates Need Your Response? Requests from Online Reviews
Debdoot Mukherjee and Malika Garg Claudia Iacob and Rachel Harrison
Bug Report Assignee Recommendation
using Activity Profile
Hoda Naguib, Nitesh Narayan, Bernd Brügge, and
Dina Helal
5. MSR TECHNICAL TRACK
Ten technical sessions with 36 presentations on the latest trends in mining, such
as mobile apps, unstructured data, classification and prediction of changes/bugs,
code review, social mining, search-driven development and software evolution.
Changes and Fixes Software Evolution
Understanding the Evolution of Type-3 Clones
Will My Patch Make It? And How Fast?
Ripon K. Saha, Chanchal K. Roy, Kevin A. Schneider, and
Case Study on the Linux Kernel
Dewayne E. Perry
Yujuan Jiang, Bram Adams, and Daniel M. German
An Empirical Study of the Fault-Proneness of Clone
Linux Variability Anomalies: Mutation and Clone Migration
What Causes Them and How Do They Get Fixed? Shuai Xie, Foutse Khomh, and Ying Zou
Sarah Nadi, Christian Dietrich, Reinhard Tartler, Intensive Metrics for the Study of the Evolution of Open
Richard C. Holt, and Daniel Lohmann Source Projects: Case Studies from Apache Software
Foundation Projects
The Impact of Tangled Code Changes Santiago Gala, Gregorio Robles, Jesús González-Barahona,
Kim Herzig and Andreas Zeller and Israel Herraiz
A Preliminary Investigation of Using Age and Distance
Measures in the Detection of Evolutionary Couplings
Abdulkareem Alali, Brian Bartman, Christian Newman, and
Jonathan Maletic
6. MSR TECHNICAL TRACK
Ten technical sessions with 36 presentations on the latest trends in mining, such
as mobile apps, unstructured data, classification and prediction of changes/bugs,
code review, social mining, search-driven development and software evolution.
Analysis of Bug Reports Software Ecosystems, Big Data
Search-Based Duplicate Defect Detection: Mining Source Code Repositories at Massive Scale
An Industrial Experience using Language Modeling
Mehdi Amoui, Nilam Kaushik, Abraham Al-Dabbagh, Miltiadis Allamanis and Charles Sutton
Ladan Tahvildari, Shimin Li, and Weining Liu
Do Software Categories Impact Coupling Metrics?
A Contextual Approach towards More Accurate Lucas Batista Leite De Souza and
Duplicate Bug Report Detection Marcelo De Almeida Maia
Anahita Alipour, Abram Hindle, and Eleni Stroulia
Bug Resolution Catalysts: Identifying Essential
Non-committers from Bug Repositories
Senthil Mani, Seema Nagar, Debdoot Mukherjee,
Ramasuri Narayanam, Vibha Singhal Sinha, and
Amit A. Nanavati
7. MSR TECHNICAL TRACK
Ten technical sessions with 36 presentations on the latest trends in mining, such
as mobile apps, unstructured data, classification and prediction of changes/bugs,
code review, social mining, search-driven development and software evolution.
Classification and Localization of Social Mining
Bugs and Changes
Fixing the 'Out of Sight Out of Mind' Problem: One Year of
Mood Based Microblogging in a Distributed Software Team
Discovering, Reporting, and Fixing Performance Bugs
Kevin Dullemond, Ben Van Gameren,
Adrian Nistor, Tian Jiang, and Lin Tan Arie van Deursen, and Margaret-Anne Storey
Improving Bug Localization using Correlations in Crash Communication in Open Source Software Development
Reports Mailing Lists
Shaohua Wang, Foutse Khomh, and Ying Zou Anja Guzzi, Alberto Bacchelli, Michele Lanza,
Martin Pinzger, and Arie van Deursen
Testing Principles, Current Practices, and Effects of
Change Localization Tag Recommendation In Software Information Sites
Steven Raemaekers, Gabriela Nane, Arie van Deursen, Xin Xia, David Lo, Xinyu Wang, and Bo Zhou
and Joost Visser
Using Developer Interaction Data to Compare Expertise
Metrics
Romain Robbes and David Röthlisberger
8. MSR TECHNICAL TRACK
Ten technical sessions with 36 presentations on the latest trends in mining, such
as mobile apps, unstructured data, classification and prediction of changes/bugs,
code review, social mining, search-driven development and software evolution.
Search-Driven Development 10 Years of MSR
Assisting Code Search with Automatic Query The MSR Cookbook: Mining a Decade of Research
Reformulation for Bug Localization Hadi Hemmati, Sarah Nadi, Olga Baysal,
Bunyamin Sisman and Avinash C. Kak Oleksii Kononenko, Wei Wang, Reid Holmes, and
Michael W. Godfrey
Mining Succinct and High-Coverage API Usage
Patterns from Source Code Happy Birthday! A Trend Analysis on Past MSR Papers
Jue Wang, Yingnong Dang, Hongyu Zhang, Kai Chen, Kevin Wyckmans, Alessandro Murgia, Ahmed Lamkanfi,
Tao Xie, and Dongmei Zhang and Serge Demeyer
Rendezvous: A Search Engine for Binary Code Replicating Mining Studies with SOFAS
Wei Ming Khoo, Alan Mycroft, and Ross Anderson Giacomo Ghezzi and Harald C. Gall
9. MSR TECHNICAL TRACK
Ten technical sessions with 36 presentations on the latest trends in mining, such
as mobile apps, unstructured data, classification and prediction of changes/bugs,
code review, social mining, search-driven development and software evolution.
Mining Unstructured Data Predictor Models
Automatically Mining Software-Based, Semantically- Better Cross Company Defect Prediction
Similar Words from Comment-Code Mappings Fayola Peters, Tim Menzies, and Andrian Marcus
Matthew J. Howard, Lori Pollock, K. Vijay-Shanker, and
Samir Gupta Using Citation Influence to Predict Software Defects
Wei Hu and Kenny Wong
Strategies for Avoiding Text Fixture Smells during
Software Evolution Revisiting Software Development Effort Estimation
Michaela Greiler, Andy Zaidman, Arie van Deursen, Based on Early Phase Development Activities
and Margaret-Anne Storey Masateru Tsunoda, Koji Toda, Kyohei Fushida,
Yasutaka Kamei, Meiyappan Nagappan, and
Contextual Analysis of Program Logs for Understanding Naoyasu Ubayashi
System Behaviors
Qiang Fu, Jian-Guang Lou, Qingwei Lin, Rui Ding,
Dongmei Zhang, and Tao Xie
11. MSR DATA SHOWCASE
Out of 27 submissions the Data Committee selected 15 carefully curated datasets
ready at your fingertips. You name it, we have it! Android, Apache, Eclipse, Gnome,
Maven, Mozilla, and Ruby. Change histories, bug reports, code reviews, and more.
Gerrit Software Code Review Data from Android A Network of Rails: A Graph Dataset of Ruby on Rails
Murtuza Mukadam, Christian Bird, and Peter C. Rigby and Associated Projects
Patrick Wagstrom, Corey Jergensen, and Anita Sarma
Who does what during a Code Review?
Datasets of OSS Peer Review Repositories The GHTorent Dataset and Tool Suite
Kazuki Hamasaki, Raula Gaikovina Kula, Norihiro Yoshida, Ana Erika Georgios Gousios
Camargo Cruz, Kenji Fujiwara, and Hajimu Iida
Project Roles in the Apache Software Foundation: A Dataset
A Dataset from Change History to Support Evaluation of Megan Squire
Software Maintenance Tasks
Apache-Affiliated Twitter Screen Names: A Dataset
Bogdan Dit, Andrew Holtzhauer, Denys Poshyvanyk, and Huzefa Kagdi
Megan Squire
Apache Commits: Social Network Dataset
An Unabridged Source Code Dataset for Research in Software Reuse
Alexander C. MacLean and Charles D. Knutson
Werner Janjic, Oliver Hummel, Marcus Schumacher, and Colin Atkinson
The Eclipse and Mozilla Defect Tracking Dataset:
A Historical Dataset of Software Engineering Conferences
A Genuine Dataset for Mining Bug Information
Bogdan Vasilescu, Alexander Serebrenik, and Tom Mens
Ahmed Lamkanfi, Javier Pérez, and Serge Demeyer
A Dataset for Evaluating Identifier Splitters
The Maven Repository Dataset of Metrics, Changes, and Dependencies
David Binkley, Dawn Lawrie, Lori Pollock, Emily Hill, and Vijay Shanker
Steven Raemaekers, Arie van Deursen, and Joost Visser
INVocD: Identifier Name Vocabulary Dataset
A Historical Dataset for the Gnome Ecosystem
Simon Butler, Michel Wermelinger, Yijun Yu, and Helen Sharp
Mathieu Goeminne, Maëlick Claes, and Tom Mens
13. MSR MINING CHALLENGE
The task: Mine Stack Overflow data for the best insight. Out of 29 submissions
the Challenge Committee selected the 12 best to compete at the conference.
You choose who wins and gets a Microsoft Surface with Windows RT.
Why, When, and What: Analyzing Stack Overflow Questions Exploring Activeness of Users in Q&A Forums
by Topic, Type, and Code Vibha Singhal Sinha, Senthil Mani, and Monika Gupta
Miltiadis Allamanis and Charles Sutton
A Study of Innovation Diffusion through Link Sharing
on Stack Overflow
Detecting API Usage Obstacles: A Study of iOS and
Carlos Gomez, Brendan Cleary, and Leif Singer
Android Developer Questions
Wei Wang and Michael W. Godfrey
Deficient Documentation Detection: A Methodology to
Locate Deficient Project Documentation using Topic Analysis
Making Sense of Online Code Snippets
Joshua Campbell, Chenlei Zhang, Zhen Xu, Abram Hindle,
Siddharth Subramanian and Reid Holmes
and James Miller
Encouraging User Behaviour with Achievements: Building Reputation in StackOverflow: An Empirical Investigation
An Empirical Study Amiangshu Bosu, Christopher S. Corley, Dustin Heaton, Debarshi
Scott Grant and Buddy Betts Chatterji, Jeffrey C. Carver, and Nicholas A. Kraft
Is Programming Knowledge Related to Age? An Exploratory Analysis of Mobile Development Issues
An Exploration of Stack Overflow using Stack Overflow
Patrick Morrison and Emerson Murphy-Hill Mario Linares-Vásquez, Bogdan Dit, and Denys Poshyvanyk
Answering Questions about Unanswered Questions of Stack Overflow
A Discriminative Model Approach for Suggesting Tags Automatically
Muhammad Asaduzzaman, Ahmed Mashiyat, Chanchal K. Roy, and
for Stack Overflow Questions Kevin A. Schneider
Avigit K. Saha, Ripon K. Saha, and Kevin A. Schneider