Slides of the paper:
Muhammad Ilyas Azeem, Sebastiano Panichella, Andrea Di Sorbo, Alexander Serebrenik, and Qing Wang: Action-based Recommendation in Pull-request Development. International Conference on Software and System Processes (ICSSP 2020).
Action-based Recommendation in Pull-request Development
1. Action-based Recommendation in Pull-request Development
Muhammad Ilyas Azeem, Sebastiano Panichella, Andrea Di Sorbo, Alexander Serebrenik, Qing Wang
Institute of Software, Chinese Academy of Sciences
2. Popular GitHub Open-Source Projects
Popular open-source projects receive numerous pull requests daily
E.g., Kubernetes receives more than 500 pull requests daily
3. Issues for Integrators
The job of the integrator is critical:
Ensuring software quality
Communicating with contributors
Manual selection of PRs:
Requires more effort & time
Especially when integrators have a large workload & limited resources
4. Proposed Solution
CARTESIAN (aCceptance And Response classificaTion-based requESt IdentificAtioN)
CARTESIAN recommends one of three actions on PRs:
Accept, Respond, and Reject
To implement CARTESIAN we followed two steps:
Feature extraction process
Classification model
5. Feature Extraction Process
PRs were crawled from 19 popular GitHub projects
Features were extracted along the following four dimensions:
Pull request, Project, Contributor, and Integrator
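To make the four dimensions concrete, here is a minimal sketch of what assembling a per-PR feature vector along them could look like. This is not the authors' implementation: every feature name and input field below is a hypothetical example of the kind of signal each dimension may contain.

```python
def extract_features(pr):
    """Flatten a crawled PR record into one feature vector,
    grouped by the four dimensions named on the slide.
    All field/feature names are illustrative assumptions."""
    return {
        # Pull request dimension: properties of the change itself
        "num_commits": pr["commits"],
        "lines_changed": pr["additions"] + pr["deletions"],
        # Project dimension: properties of the target project
        "project_age_days": pr["project_age_days"],
        # Contributor dimension: track record of the submitter
        "prev_prs_accepted": pr["contributor_accepted"],
        # Integrator dimension: current load of the reviewer
        "integrator_workload": pr["open_prs_assigned"],
    }

sample_pr = {
    "commits": 3, "additions": 120, "deletions": 40,
    "project_age_days": 900, "contributor_accepted": 7,
    "open_prs_assigned": 25,
}
features = extract_features(sample_pr)
```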
7. Classification Model
CARTESIAN models PR recommendation as a multi-class classification problem.
CARTESIAN recommends one of three actions on PRs:
Accept: PRs accepted without any discussion
Respond: PRs accepted after discussion with the contributors
Reject: PRs that have not been accepted
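The three classes above can be derived from a PR's observed outcome. A hedged sketch of such a labeling rule follows (the function name and its two inputs are assumptions, not the paper's code):

```python
def label_pr(merged: bool, num_discussion_comments: int) -> str:
    """Map a PR's observed outcome to one of the three
    CARTESIAN classes described on the slide."""
    if merged and num_discussion_comments == 0:
        return "accept"    # accepted without any discussion
    if merged:
        return "respond"   # accepted after discussion with the contributors
    return "reject"        # not accepted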
8. Experimental Design
Dataset Overview
Crawled popular GitHub projects belonging to various domains and programming languages
Pull request time span: from each project's creation to February 2018
Data collected via the GitHub REST API v3
10. Experiment I (RQ1)
Seven classifiers were trained:
Logistic Regression, SVM, Random Forest, Decision Trees, Naive Bayes, K-Nearest Neighbor, and XGBoost
Feature selection: via feature importance analysis
Evaluation metrics: Accuracy, Recall, Precision, F-Measure
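The evaluation metrics named above can be computed per class and macro-averaged across the three classes. The sketch below shows one standard way to do that from true and predicted labels; it is illustrative, not the study's evaluation code.

```python
from collections import Counter

def macro_scores(y_true, y_pred):
    """Accuracy plus macro-averaged precision, recall, and F-measure
    for a multi-class prediction task."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted p, but true class was t
            fn[t] += 1  # true class t was missed
    prec, rec = [], []
    for c in labels:
        prec.append(tp[c] / (tp[c] + fp[c]) if tp[c] + fp[c] else 0.0)
        rec.append(tp[c] / (tp[c] + fn[c]) if tp[c] + fn[c] else 0.0)
    precision = sum(prec) / len(labels)
    recall = sum(rec) / len(labels)
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
    return accuracy, precision, recall, f1
```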
11. Experiment II (RQ2)
CARTESIAN Assessment:
1. First, we compared CARTESIAN with baseline models based on the prioritizing criteria studied by Gousios et al.:
the FIFO model and the Size-Based Model (SBM)
2. Second, we performed a qualitative analysis of the top-20 recommended PRs
Evaluation metrics: Mean Average Precision (MAP) and Average Recall (AR)
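Since MAP is the key ranking metric here, a short sketch of how it is computed over ranked PR lists may help; the relevance judgments in the test are toy data, not the study's.

```python
def average_precision(ranked_relevant):
    """AP for one ranked list; ranked_relevant is a list of booleans,
    top of the ranking first."""
    hits, score = 0, 0.0
    for i, rel in enumerate(ranked_relevant, start=1):
        if rel:
            hits += 1
            score += hits / i  # precision at this cut-off
    return score / hits if hits else 0.0

def mean_average_precision(rankings):
    """MAP: mean of AP over several ranked lists
    (e.g., one per project)."""
    return sum(average_precision(r) for r in rankings) / len(rankings)
```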
12. Results for RQ1
XGBoost outperformed the rest of the classifiers
XGBoost was selected as the final classifier for CARTESIAN
CARTESIAN achieved an average precision and recall of 86%
13. Features Importance Analysis
The number of review & discussion comments, the role of the submitter, and the number of participants in the discussion are the most relevant features
The classification accuracy is largely driven by features in the Contributor and Integrator dimensions
15. Results for RQ2
Qualitative analysis shows that CARTESIAN recommends useful PRs to the integrator, e.g., bug fixes and new feature requests
16. Conclusion
CARTESIAN can be helpful for integrators of popular GitHub projects
It achieved good results: an average precision and recall of about 86%
Besides, CARTESIAN prioritizes useful PRs at the top of the list
17. Future Work
Our plan is to:
Integrate CARTESIAN into GitHub
Evaluate its usefulness, and
Discover additional factors (quality metrics) that can be used to improve its performance