Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

MSR 2016 data showcase - Mining Code Review Repositories

4,449 views

Published on

The presentation slides of our data paper in MSR 2016 - "Mining the Modern Code Review Repositories: A Dataset of People, Process and Product".

Published in: Software
  • You have to choose carefully. ⇒ www.WritePaper.info ⇐ offers a professional writing service. I highly recommend them. The papers are delivered on time and customers are their first priority. This is their website: ⇒ www.WritePaper.info ⇐
       Reply 
    Are you sure you want to  Yes  No
    Your message goes here
  • Have you ever used the help of ⇒ www.HelpWriting.net ⇐? They can help you with any type of writing - from personal statement to research paper. Due to this service you'll save your time and get an essay without plagiarism.
       Reply 
    Are you sure you want to  Yes  No
    Your message goes here
  • Be the first to like this

MSR 2016 data showcase - Mining Code Review Repositories

  1. 1. Mining the Modern Code Review Repositories: A Dataset of People, Process and Product Xin Yang Raula G. Kula Norihiro Yoshida Hajimu Iida May 14–15, 2016. Austin, Texas MSR 2016 data showcase Osaka University Japan Nagoya University Japan NAIST Japan NAIST Japan
  2. 2. A Code Review Dataset 1 Code Review Source Code Human / Social (anonymized usernames and email addresses)
  3. 3. Why we made this dataset? 2 * Hamasaki et al., “Who does what during a code review? datasets of OSS peer review repositories”. MSR '13 Our previous work (Hamasaki et al. MSR '13)*
  4. 4. Our previous work (Hamasaki et al. MSR '13)* Why we made this dataset? 2 Some feedback: “Hard to query...” “Hard to convert...” “Unable to access the source code...” * Hamasaki et al., “Who does what during a code review? datasets of OSS peer review repositories”. MSR '13
  5. 5. Our previous work (Hamasaki et al. MSR '13)* Why we made this dataset? 2 Some feedback: “Hard to query...” “Hard to convert...” “Unable to access the source code...” * Hamasaki et al., “Who does what during a code review? datasets of OSS peer review repositories”. MSR '13
  6. 6. * Hamasaki et al., “Who does what during a code review? datasets of OSS peer review repositories”. MSR '13 Our previous work (Hamasaki et al. MSR '13)* Some feedback: “Hard to query...” “Hard to convert...” “Unable to access the source code...” Why we made this dataset? 2 ★ Easy to query / analyze ★ Easy to export / convert ★ Able to access the source code
  7. 7. 3 Modern Code Review (MCR) Key Attributes Large Codebases (Repositories) High Volume Submissions (Patches) Large Communities (Participants) </></></></></></> </></></>
  8. 8. The Concept 4
  9. 9. Process Product People The Concept 4
  10. 10. 4 years 3 years 7 years 4 years 3 years 611 20 567 111 189 173,749 13,597 63,610 110,172 9,168 5,091 437 3,334 1,437 759 Dataset Statistics (updated to May 2015) 5 </></></>
  11. 11. 6 Dataset Schema (Check our wiki for details)
  12. 12. ★ Promote peer review research and link to other research topics ★ Encourage researchers to use this as a benchmark of techniques and different approaches 7 Our Goals
  13. 13. goo.gl/Wi4UoJ Get Your Copy Now!!!
  14. 14. Thanks! Any questions? Contact: Xin Yang kin-y@is.naist.jp @seeleather

×