TA-RE: An Exchange Language for Mining Software Repositories

Thomas Zimmermann
Thomas ZimmermannResearcher at Microsoft Research
TA-RE 1 :  An Exchange Language for Mining Software Repositories   Sunghun Kim,  Thomas Zimmermann , Miryung Kim, Ahmed Hassan,  Audris Mockus, Tudor Girba,  Martin Pinzger, E. James Whitehead, Jr., and Andreas Zeller 1 TA-RE is a Korean word and means “group” or “cluster”.
Software repositories have been getting a lot of attention... ,   but Extraction Intermediate Data Analysis SCM  Repository
Software repositories have been getting a lot of attention..., but Analysis SCM  Repository Extraction requires a non-trivial effort Extracted data depend on the heuristics Difficult to  reproduce  existing repository mining results Extraction Intermediate Data
Our proposal TA-RE Corpus: Extracted Data transactions changes snapshots nature counts references Change statistic Change pattern analysis Origin analysis Co-change analysis Code clone analysis Bug prediction
TA-RE is a work in progress ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
TA-RE is a work in progress ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],WE NEED YOU!!!
1 of 6

More Related Content

Similar to TA-RE: An Exchange Language for Mining Software Repositories(20)

More from Thomas Zimmermann(20)

Software Analytics = Sharing InformationSoftware Analytics = Sharing Information
Software Analytics = Sharing Information
Thomas Zimmermann3.3K views
MSR 2013 PreviewMSR 2013 Preview
MSR 2013 Preview
Thomas Zimmermann21.8K views
Analytics for smarter software development Analytics for smarter software development
Analytics for smarter software development
Thomas Zimmermann2.6K views
Klingon Countdown TimerKlingon Countdown Timer
Klingon Countdown Timer
Thomas Zimmermann1.3K views
Data driven games user researchData driven games user research
Data driven games user research
Thomas Zimmermann1.5K views
Security trend analysis with CVE topic modelsSecurity trend analysis with CVE topic models
Security trend analysis with CVE topic models
Thomas Zimmermann1.5K views
Analytics for software developmentAnalytics for software development
Analytics for software development
Thomas Zimmermann4.6K views
Cross-project defect predictionCross-project defect prediction
Cross-project defect prediction
Thomas Zimmermann1.9K views
Quality of Bug Reports in Open SourceQuality of Bug Reports in Open Source
Quality of Bug Reports in Open Source
Thomas Zimmermann1.6K views
Meet Tom and his FishMeet Tom and his Fish
Meet Tom and his Fish
Thomas Zimmermann1.5K views
Got Myth? Myths in Software EngineeringGot Myth? Myths in Software Engineering
Got Myth? Myths in Software Engineering
Thomas Zimmermann5.9K views

Recently uploaded(20)

TA-RE: An Exchange Language for Mining Software Repositories

  • 1. TA-RE 1 : An Exchange Language for Mining Software Repositories Sunghun Kim, Thomas Zimmermann , Miryung Kim, Ahmed Hassan, Audris Mockus, Tudor Girba, Martin Pinzger, E. James Whitehead, Jr., and Andreas Zeller 1 TA-RE is a Korean word and means “group” or “cluster”.
  • 2. Software repositories have been getting a lot of attention... , but Extraction Intermediate Data Analysis SCM Repository
  • 3. Software repositories have been getting a lot of attention..., but Analysis SCM Repository Extraction requires a non-trivial effort Extracted data depend on the heuristics Difficult to reproduce existing repository mining results Extraction Intermediate Data
  • 4. Our proposal TA-RE Corpus: Extracted Data transactions changes snapshots nature counts references Change statistic Change pattern analysis Origin analysis Co-change analysis Code clone analysis Bug prediction
  • 5.
  • 6.