SlideShare a Scribd company logo
1 of 45
Understanding the Factors for Fast Answers in
Technical Q&A Websites: An Empirical Study of Four
Stack Exchange Websites
Journal First Presentation - Empirical Software Engineering
Shaowei
Wang
Tse-Hsun
(Peter) Chen
Ahmed E.
Hassan
1
Developers are always facing problems
2
Technical Q&A websites provide platforms for developers to
seek help from others
3
4
~10,000 new
questions per
day
5
~10,000 new
questions per
day
~50 million
monthly visitors
6
~10,000 new
questions per
day
~50 million
monthly visitors
~13 million
questions and
~24 million
answers
7
Almost one million questions get their accepted
answers after more than one week on Stack Overflow
8
What factors impact the speed of questions getting
accepted answers?
9
We study the top four most popular Q&A websites in
Stack Exchange network
10
• Selection criteria for studied questions:
• Questions that have an accepted answer
• Questions that have at least a score of 1
• Questions that are not self-answered
We study the top four most popular Q&A websites in
Stack Exchange network
11
We study the top four most popular Q&A websites in
Stack Exchange network
55,853 questions
70,336 questions
7,134 questions
10,776 questions
12
We study the relationship between the studied factors and
the speed of getting an accepted answer
Metrics
calculation
Model
construction
Model
interpretation
Model
assessment
13
We study the relationship between the studied factors and
the speed of getting an accepted answer
Metrics
calculation
Model
construction
Model
interpretation
Model
assessment
Question
(16 factors)
Answer
(4 factors)
Asker
(20 factors)
Answerer
(6 factors)
14
Fast-answered
questions (top
20%)
Slow-answered
questions
(bottom 20%)
Correlation&
redundancy
analysis
Non-linear
logistic regression
model building
We study the relationship between the studied factors and
the speed of getting an accepted answer
Metrics
calculation
Model
construction
Model
interpretation
Model
assessment
15
We study the relationship between the studied factors and
the speed of getting an accepted answer
AUC
Metrics
calculation
Model
construction
Model
interpretation
Model
assessment
16
Explanatory power
(Wald χ2 test)
Relationship visualization
We study the relationship between the studied factors and
the speed of getting an accepted answer
Metrics
calculation
Model
construction
Model
interpretation
Model
assessment
17
Our models achieve an AUC of 0.85-0.95
AUC=0.95
AUC=0.94
AUC=0.85
AUC=0.86
18
Our models achieve an AUC of 0.85-0.95
AUC=0.95
AUC=0.94
AUC=0.85
AUC=0.86
Our models have a good enough
fit for interpretation.
19
Top 1 factor: past speed of answering questions of an
answerer
Past speed of
answering questions
of an answerer
20
A question tends to receive a fast accepted answer from
answerers who previously answered questions fast
Probabilityofgetting
aslowacceptedanswer
Past speed of answering questions
of an answerer before (hours in
logarithm scale)
21
A question tends to receive a fast accepted answer from
answerers who previously answered questions fast
A wide confidence interval indicates
that the relationship is less clear due
to the lack of data points in
that data range.
Probabilityofgetting
aslowacceptedanswer
Probabilityofgetting
aslowacceptedanswer
Past speed of answering questions
of an answerer before (hours in
logarithm scale)
22
Past speed of answering questions of an answerer (hours in logarithm scale)
Probabilityofgetting
aslowacceptedanswer
A question tends to receive a fast accepted answer from
answerers who previously answered questions fast
23
Top 2 factor: length of body of a question
Past speed of
answering questions
of an answerer
Length of body of a
question
24
Probabilityofgetting
aslowacceptedanswer
A long question tends to receive a slow accepted answer
Length of body of a question (characters in logarithmic scale)
25
Top 3 factor: past speed of getting accepted answers
of tags of a question
Past speed of
answering questions
of an answerer
Past speed of getting
accepted answers of
tags of a question
Length of body of a
question
26
Probabilityofgetting
aslowacceptedanswer
A question with tags that received accepted answers fast
tends to receive a fast accepted answer
Time of getting accepted answers of tags of a question in the past (hours in logarithm scale)
27
Fast accepted answers rely heavily on the answerer
0
10
20
30
40
50
60
70
Stack Overflow Mathematics Ask Ubuntu Super User
%ofexplanatorypower
Question Asker Answer Answerer
28
0
10
20
30
40
50
60
70
Stack Overflow Mathematics Ask Ubuntu Super User
%ofexplanatorypower
Question Asker Answer Answerer
Fast accepted answers rely heavily on the answerer
29
Suggestions for Technical Q&A website designers
Deliver questions to the right answerers and
motivate them to answer questions faster.
30
86% - 96% of the accepted answers are posted by
answerers that answered more than 5 questions before
31
• Non-frequent answerers (<= 5 answers)
• People that answered no more than 5 answers in the past
• Frequent answerers (> 5 answers)
• People that answered more than 5 answers in the past
Non-frequent answerers vs. Frequent answerers
32
Non-frequent answerers are the bottleneck for fast answersMeantimeofposting
anacceptedanswer(hours)
33
34
The current incentive system only motivates frequent answerers
well, but not non-frequent answerers
Non-frequent answerers are answering questions that are
as important as ones answered by non-frequent answerers
Meanscoreofquestions
35
Suggestions for Technical Q&A website designers
Deliver questions to the right answerers and
motivate them to answer questions faster.
Improve the incentive system to attract the non-
frequent answerers to become more active.
36
Frequent answerers tend to answer shorter questions
37
Frequent answerers probably game the incentive system
Yeah, some folks are going to specialize in super-fast answers
to easy questions and get more rep points than deserved,…
The bigger problem is that this has the side effect of causing
interesting but more difficult questions to get ignored. Typical
example: someone asks a question that gets a lot of views and two or more upvotes,
but it's hard enough that no one can answer within an hour or so.
38
Suggestions for Technical Q&A website designers
Deliver questions to the right answerers and
motivate them to answer questions faster.
Improve the incentive system to attract the non-
frequent answerers to become more active.
Improve the incentive system to factor in the value
and difficulty of questions.
39
41
42
43
44
Shaowei Wang
shaowei@cs.queensu.ca
51

More Related Content

Similar to Factors Impacting Speed of Answers on Technical Q&A Sites

Effective Business Communication with Precision Questioning and Answering
Effective Business Communication with Precision Questioning and AnsweringEffective Business Communication with Precision Questioning and Answering
Effective Business Communication with Precision Questioning and AnsweringSociety of Women Engineers
 
Traditional versus adaptive techniques
Traditional versus adaptive techniquesTraditional versus adaptive techniques
Traditional versus adaptive techniquesAravind Ganesh
 
Surveys that work: training course for Rosenfeld Media, day 3
Surveys that work: training course for Rosenfeld Media, day 3 Surveys that work: training course for Rosenfeld Media, day 3
Surveys that work: training course for Rosenfeld Media, day 3 Caroline Jarrett
 
Surveys that work: training course for Rosenfeld media, day 2
Surveys that work: training course for Rosenfeld media, day 2Surveys that work: training course for Rosenfeld media, day 2
Surveys that work: training course for Rosenfeld media, day 2Caroline Jarrett
 
Surveys that work:training course for Rosenfeld Media, day 1
Surveys that work:training course for Rosenfeld Media, day 1Surveys that work:training course for Rosenfeld Media, day 1
Surveys that work:training course for Rosenfeld Media, day 1Caroline Jarrett
 
CrowdSourcing- Location based Quries
CrowdSourcing- Location based QuriesCrowdSourcing- Location based Quries
CrowdSourcing- Location based Quriespurushottam02468
 
Exploring Capturable Everyday Memory for Autobiographical Authentication, at ...
Exploring Capturable Everyday Memory for Autobiographical Authentication, at ...Exploring Capturable Everyday Memory for Autobiographical Authentication, at ...
Exploring Capturable Everyday Memory for Autobiographical Authentication, at ...Jason Hong
 
Towards Discovering the Role of Emotions in Stack Overflow
Towards Discovering the Role of Emotions in Stack OverflowTowards Discovering the Role of Emotions in Stack Overflow
Towards Discovering the Role of Emotions in Stack OverflowNicole Novielli
 
Répondre à la question automatique avec le web
Répondre à la question automatique avec le webRépondre à la question automatique avec le web
Répondre à la question automatique avec le webAhmed Hammami
 
Online Question and Answers Resources for the Bioinformatics Community
Online Question and Answers Resources for the Bioinformatics CommunityOnline Question and Answers Resources for the Bioinformatics Community
Online Question and Answers Resources for the Bioinformatics CommunityHoffman Lab
 
Addictive links, Keynote talk at WWW 2014 workshop
Addictive links, Keynote talk at WWW 2014 workshopAddictive links, Keynote talk at WWW 2014 workshop
Addictive links, Keynote talk at WWW 2014 workshopPeter Brusilovsky
 
Asking Clarifying Questions in Open-Domain Information-Seeking Conversations
Asking Clarifying Questions in Open-Domain Information-Seeking ConversationsAsking Clarifying Questions in Open-Domain Information-Seeking Conversations
Asking Clarifying Questions in Open-Domain Information-Seeking ConversationsMohammad Aliannejadi
 
Real-Time Community Question Answering: Exploring Content Recommendation and ...
Real-Time Community Question Answering: Exploring Content Recommendation and ...Real-Time Community Question Answering: Exploring Content Recommendation and ...
Real-Time Community Question Answering: Exploring Content Recommendation and ...Jinho Choi
 
Knowledge Management for Real
Knowledge Management for RealKnowledge Management for Real
Knowledge Management for RealCherwell Software
 
Four ways to make a better survey WebExpo2023
Four ways to make a better survey WebExpo2023Four ways to make a better survey WebExpo2023
Four ways to make a better survey WebExpo2023Caroline Jarrett
 
Using surveys to uncover audience insights
Using surveys to uncover audience insightsUsing surveys to uncover audience insights
Using surveys to uncover audience insightsCaroline Jarrett
 
QuestionPro Integrates with TryMyUI to Launch the Survey Respondent Score
QuestionPro Integrates with TryMyUI to Launch the Survey Respondent ScoreQuestionPro Integrates with TryMyUI to Launch the Survey Respondent Score
QuestionPro Integrates with TryMyUI to Launch the Survey Respondent ScoreJames Wirth
 
Breakthrough Thinking Begins By Asking the Right Questions
Breakthrough Thinking Begins By Asking the Right QuestionsBreakthrough Thinking Begins By Asking the Right Questions
Breakthrough Thinking Begins By Asking the Right QuestionsClarity Thinker
 

Similar to Factors Impacting Speed of Answers on Technical Q&A Sites (20)

Effective Business Communication with Precision Questioning and Answering
Effective Business Communication with Precision Questioning and AnsweringEffective Business Communication with Precision Questioning and Answering
Effective Business Communication with Precision Questioning and Answering
 
Traditional versus adaptive techniques
Traditional versus adaptive techniquesTraditional versus adaptive techniques
Traditional versus adaptive techniques
 
Surveys that work: training course for Rosenfeld Media, day 3
Surveys that work: training course for Rosenfeld Media, day 3 Surveys that work: training course for Rosenfeld Media, day 3
Surveys that work: training course for Rosenfeld Media, day 3
 
Surveys that work: training course for Rosenfeld media, day 2
Surveys that work: training course for Rosenfeld media, day 2Surveys that work: training course for Rosenfeld media, day 2
Surveys that work: training course for Rosenfeld media, day 2
 
Surveys that work:training course for Rosenfeld Media, day 1
Surveys that work:training course for Rosenfeld Media, day 1Surveys that work:training course for Rosenfeld Media, day 1
Surveys that work:training course for Rosenfeld Media, day 1
 
CrowdSourcing- Location based Quries
CrowdSourcing- Location based QuriesCrowdSourcing- Location based Quries
CrowdSourcing- Location based Quries
 
Exploring Capturable Everyday Memory for Autobiographical Authentication, at ...
Exploring Capturable Everyday Memory for Autobiographical Authentication, at ...Exploring Capturable Everyday Memory for Autobiographical Authentication, at ...
Exploring Capturable Everyday Memory for Autobiographical Authentication, at ...
 
Towards Discovering the Role of Emotions in Stack Overflow
Towards Discovering the Role of Emotions in Stack OverflowTowards Discovering the Role of Emotions in Stack Overflow
Towards Discovering the Role of Emotions in Stack Overflow
 
Répondre à la question automatique avec le web
Répondre à la question automatique avec le webRépondre à la question automatique avec le web
Répondre à la question automatique avec le web
 
Online Question and Answers Resources for the Bioinformatics Community
Online Question and Answers Resources for the Bioinformatics CommunityOnline Question and Answers Resources for the Bioinformatics Community
Online Question and Answers Resources for the Bioinformatics Community
 
Addictive links, Keynote talk at WWW 2014 workshop
Addictive links, Keynote talk at WWW 2014 workshopAddictive links, Keynote talk at WWW 2014 workshop
Addictive links, Keynote talk at WWW 2014 workshop
 
Addictive links: Adaptive Navigation Support in College-Level Courses
Addictive links: Adaptive Navigation Support in College-Level CoursesAddictive links: Adaptive Navigation Support in College-Level Courses
Addictive links: Adaptive Navigation Support in College-Level Courses
 
Asking Clarifying Questions in Open-Domain Information-Seeking Conversations
Asking Clarifying Questions in Open-Domain Information-Seeking ConversationsAsking Clarifying Questions in Open-Domain Information-Seeking Conversations
Asking Clarifying Questions in Open-Domain Information-Seeking Conversations
 
ISEC-2021-Presentation-Saikat-Mondal
ISEC-2021-Presentation-Saikat-MondalISEC-2021-Presentation-Saikat-Mondal
ISEC-2021-Presentation-Saikat-Mondal
 
Real-Time Community Question Answering: Exploring Content Recommendation and ...
Real-Time Community Question Answering: Exploring Content Recommendation and ...Real-Time Community Question Answering: Exploring Content Recommendation and ...
Real-Time Community Question Answering: Exploring Content Recommendation and ...
 
Knowledge Management for Real
Knowledge Management for RealKnowledge Management for Real
Knowledge Management for Real
 
Four ways to make a better survey WebExpo2023
Four ways to make a better survey WebExpo2023Four ways to make a better survey WebExpo2023
Four ways to make a better survey WebExpo2023
 
Using surveys to uncover audience insights
Using surveys to uncover audience insightsUsing surveys to uncover audience insights
Using surveys to uncover audience insights
 
QuestionPro Integrates with TryMyUI to Launch the Survey Respondent Score
QuestionPro Integrates with TryMyUI to Launch the Survey Respondent ScoreQuestionPro Integrates with TryMyUI to Launch the Survey Respondent Score
QuestionPro Integrates with TryMyUI to Launch the Survey Respondent Score
 
Breakthrough Thinking Begins By Asking the Right Questions
Breakthrough Thinking Begins By Asking the Right QuestionsBreakthrough Thinking Begins By Asking the Right Questions
Breakthrough Thinking Begins By Asking the Right Questions
 

More from SAIL_QU

Studying the Integration Practices and the Evolution of Ad Libraries in the G...
Studying the Integration Practices and the Evolution of Ad Libraries in the G...Studying the Integration Practices and the Evolution of Ad Libraries in the G...
Studying the Integration Practices and the Evolution of Ad Libraries in the G...SAIL_QU
 
Studying the Dialogue Between Users and Developers of Free Apps in the Google...
Studying the Dialogue Between Users and Developers of Free Apps in the Google...Studying the Dialogue Between Users and Developers of Free Apps in the Google...
Studying the Dialogue Between Users and Developers of Free Apps in the Google...SAIL_QU
 
Improving the testing efficiency of selenium-based load tests
Improving the testing efficiency of selenium-based load testsImproving the testing efficiency of selenium-based load tests
Improving the testing efficiency of selenium-based load testsSAIL_QU
 
Studying User-Developer Interactions Through the Distribution and Reviewing M...
Studying User-Developer Interactions Through the Distribution and Reviewing M...Studying User-Developer Interactions Through the Distribution and Reviewing M...
Studying User-Developer Interactions Through the Distribution and Reviewing M...SAIL_QU
 
Studying online distribution platforms for games through the mining of data f...
Studying online distribution platforms for games through the mining of data f...Studying online distribution platforms for games through the mining of data f...
Studying online distribution platforms for games through the mining of data f...SAIL_QU
 
Understanding the Factors for Fast Answers in Technical Q&A Websites: An Empi...
Understanding the Factors for Fast Answers in Technical Q&A Websites: An Empi...Understanding the Factors for Fast Answers in Technical Q&A Websites: An Empi...
Understanding the Factors for Fast Answers in Technical Q&A Websites: An Empi...SAIL_QU
 
Investigating the Challenges in Selenium Usage and Improving the Testing Effi...
Investigating the Challenges in Selenium Usage and Improving the Testing Effi...Investigating the Challenges in Selenium Usage and Improving the Testing Effi...
Investigating the Challenges in Selenium Usage and Improving the Testing Effi...SAIL_QU
 
Mining Development Knowledge to Understand and Support Software Logging Pract...
Mining Development Knowledge to Understand and Support Software Logging Pract...Mining Development Knowledge to Understand and Support Software Logging Pract...
Mining Development Knowledge to Understand and Support Software Logging Pract...SAIL_QU
 
Which Log Level Should Developers Choose For a New Logging Statement?
Which Log Level Should Developers Choose For a New Logging Statement?Which Log Level Should Developers Choose For a New Logging Statement?
Which Log Level Should Developers Choose For a New Logging Statement?SAIL_QU
 
Towards Just-in-Time Suggestions for Log Changes
Towards Just-in-Time Suggestions for Log ChangesTowards Just-in-Time Suggestions for Log Changes
Towards Just-in-Time Suggestions for Log ChangesSAIL_QU
 
The Impact of Task Granularity on Co-evolution Analyses
The Impact of Task Granularity on Co-evolution AnalysesThe Impact of Task Granularity on Co-evolution Analyses
The Impact of Task Granularity on Co-evolution AnalysesSAIL_QU
 
A Framework for Evaluating the Results of the SZZ Approach for Identifying Bu...
A Framework for Evaluating the Results of the SZZ Approach for Identifying Bu...A Framework for Evaluating the Results of the SZZ Approach for Identifying Bu...
A Framework for Evaluating the Results of the SZZ Approach for Identifying Bu...SAIL_QU
 
How are Discussions Associated with Bug Reworking? An Empirical Study on Open...
How are Discussions Associated with Bug Reworking? An Empirical Study on Open...How are Discussions Associated with Bug Reworking? An Empirical Study on Open...
How are Discussions Associated with Bug Reworking? An Empirical Study on Open...SAIL_QU
 
A Study of the Relation of Mobile Device Attributes with the User-Perceived Q...
A Study of the Relation of Mobile Device Attributes with the User-Perceived Q...A Study of the Relation of Mobile Device Attributes with the User-Perceived Q...
A Study of the Relation of Mobile Device Attributes with the User-Perceived Q...SAIL_QU
 
A Large-Scale Study of the Impact of Feature Selection Techniques on Defect C...
A Large-Scale Study of the Impact of Feature Selection Techniques on Defect C...A Large-Scale Study of the Impact of Feature Selection Techniques on Defect C...
A Large-Scale Study of the Impact of Feature Selection Techniques on Defect C...SAIL_QU
 
Studying the Dialogue Between Users and Developers of Free Apps in the Google...
Studying the Dialogue Between Users and Developers of Free Apps in the Google...Studying the Dialogue Between Users and Developers of Free Apps in the Google...
Studying the Dialogue Between Users and Developers of Free Apps in the Google...SAIL_QU
 
What Do Programmers Know about Software Energy Consumption?
What Do Programmers Know about Software Energy Consumption?What Do Programmers Know about Software Energy Consumption?
What Do Programmers Know about Software Energy Consumption?SAIL_QU
 
Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...
Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...
Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...SAIL_QU
 
Revisiting the Experimental Design Choices for Approaches for the Automated R...
Revisiting the Experimental Design Choices for Approaches for the Automated R...Revisiting the Experimental Design Choices for Approaches for the Automated R...
Revisiting the Experimental Design Choices for Approaches for the Automated R...SAIL_QU
 
Measuring Program Comprehension: A Large-Scale Field Study with Professionals
Measuring Program Comprehension: A Large-Scale Field Study with ProfessionalsMeasuring Program Comprehension: A Large-Scale Field Study with Professionals
Measuring Program Comprehension: A Large-Scale Field Study with ProfessionalsSAIL_QU
 

More from SAIL_QU (20)

Studying the Integration Practices and the Evolution of Ad Libraries in the G...
Studying the Integration Practices and the Evolution of Ad Libraries in the G...Studying the Integration Practices and the Evolution of Ad Libraries in the G...
Studying the Integration Practices and the Evolution of Ad Libraries in the G...
 
Studying the Dialogue Between Users and Developers of Free Apps in the Google...
Studying the Dialogue Between Users and Developers of Free Apps in the Google...Studying the Dialogue Between Users and Developers of Free Apps in the Google...
Studying the Dialogue Between Users and Developers of Free Apps in the Google...
 
Improving the testing efficiency of selenium-based load tests
Improving the testing efficiency of selenium-based load testsImproving the testing efficiency of selenium-based load tests
Improving the testing efficiency of selenium-based load tests
 
Studying User-Developer Interactions Through the Distribution and Reviewing M...
Studying User-Developer Interactions Through the Distribution and Reviewing M...Studying User-Developer Interactions Through the Distribution and Reviewing M...
Studying User-Developer Interactions Through the Distribution and Reviewing M...
 
Studying online distribution platforms for games through the mining of data f...
Studying online distribution platforms for games through the mining of data f...Studying online distribution platforms for games through the mining of data f...
Studying online distribution platforms for games through the mining of data f...
 
Understanding the Factors for Fast Answers in Technical Q&A Websites: An Empi...
Understanding the Factors for Fast Answers in Technical Q&A Websites: An Empi...Understanding the Factors for Fast Answers in Technical Q&A Websites: An Empi...
Understanding the Factors for Fast Answers in Technical Q&A Websites: An Empi...
 
Investigating the Challenges in Selenium Usage and Improving the Testing Effi...
Investigating the Challenges in Selenium Usage and Improving the Testing Effi...Investigating the Challenges in Selenium Usage and Improving the Testing Effi...
Investigating the Challenges in Selenium Usage and Improving the Testing Effi...
 
Mining Development Knowledge to Understand and Support Software Logging Pract...
Mining Development Knowledge to Understand and Support Software Logging Pract...Mining Development Knowledge to Understand and Support Software Logging Pract...
Mining Development Knowledge to Understand and Support Software Logging Pract...
 
Which Log Level Should Developers Choose For a New Logging Statement?
Which Log Level Should Developers Choose For a New Logging Statement?Which Log Level Should Developers Choose For a New Logging Statement?
Which Log Level Should Developers Choose For a New Logging Statement?
 
Towards Just-in-Time Suggestions for Log Changes
Towards Just-in-Time Suggestions for Log ChangesTowards Just-in-Time Suggestions for Log Changes
Towards Just-in-Time Suggestions for Log Changes
 
The Impact of Task Granularity on Co-evolution Analyses
The Impact of Task Granularity on Co-evolution AnalysesThe Impact of Task Granularity on Co-evolution Analyses
The Impact of Task Granularity on Co-evolution Analyses
 
A Framework for Evaluating the Results of the SZZ Approach for Identifying Bu...
A Framework for Evaluating the Results of the SZZ Approach for Identifying Bu...A Framework for Evaluating the Results of the SZZ Approach for Identifying Bu...
A Framework for Evaluating the Results of the SZZ Approach for Identifying Bu...
 
How are Discussions Associated with Bug Reworking? An Empirical Study on Open...
How are Discussions Associated with Bug Reworking? An Empirical Study on Open...How are Discussions Associated with Bug Reworking? An Empirical Study on Open...
How are Discussions Associated with Bug Reworking? An Empirical Study on Open...
 
A Study of the Relation of Mobile Device Attributes with the User-Perceived Q...
A Study of the Relation of Mobile Device Attributes with the User-Perceived Q...A Study of the Relation of Mobile Device Attributes with the User-Perceived Q...
A Study of the Relation of Mobile Device Attributes with the User-Perceived Q...
 
A Large-Scale Study of the Impact of Feature Selection Techniques on Defect C...
A Large-Scale Study of the Impact of Feature Selection Techniques on Defect C...A Large-Scale Study of the Impact of Feature Selection Techniques on Defect C...
A Large-Scale Study of the Impact of Feature Selection Techniques on Defect C...
 
Studying the Dialogue Between Users and Developers of Free Apps in the Google...
Studying the Dialogue Between Users and Developers of Free Apps in the Google...Studying the Dialogue Between Users and Developers of Free Apps in the Google...
Studying the Dialogue Between Users and Developers of Free Apps in the Google...
 
What Do Programmers Know about Software Energy Consumption?
What Do Programmers Know about Software Energy Consumption?What Do Programmers Know about Software Energy Consumption?
What Do Programmers Know about Software Energy Consumption?
 
Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...
Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...
Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...
 
Revisiting the Experimental Design Choices for Approaches for the Automated R...
Revisiting the Experimental Design Choices for Approaches for the Automated R...Revisiting the Experimental Design Choices for Approaches for the Automated R...
Revisiting the Experimental Design Choices for Approaches for the Automated R...
 
Measuring Program Comprehension: A Large-Scale Field Study with Professionals
Measuring Program Comprehension: A Large-Scale Field Study with ProfessionalsMeasuring Program Comprehension: A Large-Scale Field Study with Professionals
Measuring Program Comprehension: A Large-Scale Field Study with Professionals
 

Recently uploaded

Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)OPEN KNOWLEDGE GmbH
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...OnePlan Solutions
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number SystemsJheuzeDellosa
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...aditisharan08
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningVitsRangannavar
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackVICTOR MAESTRE RAMIREZ
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...gurkirankumar98700
 
buds n tech IT solutions
buds n  tech IT                solutionsbuds n  tech IT                solutions
buds n tech IT solutionsmonugehlot87
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureDinusha Kumarasiri
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...Christina Lin
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 

Recently uploaded (20)

Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number Systems
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learning
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
 
buds n tech IT solutions
buds n  tech IT                solutionsbuds n  tech IT                solutions
buds n tech IT solutions
 
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with Azure
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 

Factors Impacting Speed of Answers on Technical Q&A Sites

  • 1. Understanding the Factors for Fast Answers in Technical Q&A Websites: An Empirical Study of Four Stack Exchange Websites Journal First Presentation - Empirical Software Engineering Shaowei Wang Tse-Hsun (Peter) Chen Ahmed E. Hassan 1
  • 2. Developers are always facing problems 2
  • 3. Technical Q&A websites provide platforms for developers to seek help from others 3
  • 4. 4
  • 6. ~10,000 new questions per day ~50 million monthly visitors 6
  • 7. ~10,000 new questions per day ~50 million monthly visitors ~13 million questions and ~24 million answers 7
  • 8. Almost one million questions get their accepted answers after more than one week on Stack Overflow 8
  • 9. What factors impact the speed of questions getting accepted answers? 9
  • 10. We study the top four most popular Q&A websites in Stack Exchange network 10
  • 11. • Selection criteria for studied questions: • Questions that have an accepted answer • Questions that have at least a score of 1 • Questions that are not self-answered We study the top four most popular Q&A websites in Stack Exchange network 11
  • 12. We study the top four most popular Q&A websites in Stack Exchange network 55,853 questions 70,336 questions 7,134 questions 10,776 questions 12
  • 13. We study the relationship between the studied factors and the speed of getting an accepted answer Metrics calculation Model construction Model interpretation Model assessment 13
  • 14. We study the relationship between the studied factors and the speed of getting an accepted answer Metrics calculation Model construction Model interpretation Model assessment Question (16 factors) Answer (4 factors) Asker (20 factors) Answerer (6 factors) 14
  • 15. Fast-answered questions (top 20%) Slow-answered questions (bottom 20%) Correlation& redundancy analysis Non-linear logistic regression model building We study the relationship between the studied factors and the speed of getting an accepted answer Metrics calculation Model construction Model interpretation Model assessment 15
  • 16. We study the relationship between the studied factors and the speed of getting an accepted answer AUC Metrics calculation Model construction Model interpretation Model assessment 16
  • 17. Explanatory power (Wald χ2 test) Relationship visualization We study the relationship between the studied factors and the speed of getting an accepted answer Metrics calculation Model construction Model interpretation Model assessment 17
  • 18. Our models achieve an AUC of 0.85-0.95 AUC=0.95 AUC=0.94 AUC=0.85 AUC=0.86 18
  • 19. Our models achieve an AUC of 0.85-0.95 AUC=0.95 AUC=0.94 AUC=0.85 AUC=0.86 Our models have a good enough fit for interpretation. 19
  • 20. Top 1 factor: past speed of answering questions of an answerer Past speed of answering questions of an answerer 20
  • 21. A question tends to receive a fast accepted answer from answerers who previously answered questions fast Probabilityofgetting aslowacceptedanswer Past speed of answering questions of an answerer before (hours in logarithm scale) 21
  • 22. A question tends to receive a fast accepted answer from answerers who previously answered questions fast A wide confidence interval indicates that the relationship is less clear due to the lack of data points in that data range. Probabilityofgetting aslowacceptedanswer Probabilityofgetting aslowacceptedanswer Past speed of answering questions of an answerer before (hours in logarithm scale) 22
  • 23. Past speed of answering questions of an answerer (hours in logarithm scale) Probabilityofgetting aslowacceptedanswer A question tends to receive a fast accepted answer from answerers who previously answered questions fast 23
  • 24. Top 2 factor: length of body of a question Past speed of answering questions of an answerer Length of body of a question 24
  • 25. Probabilityofgetting aslowacceptedanswer A long question tends to receive a slow accepted answer Length of body of a question (characters in logarithmic scale) 25
  • 26. Top 3 factor: past speed of getting accepted answers of tags of a question Past speed of answering questions of an answerer Past speed of getting accepted answers of tags of a question Length of body of a question 26
  • 27. Probabilityofgetting aslowacceptedanswer A question with tags that received accepted answers fast tends to receive a fast accepted answer Time of getting accepted answers of tags of a question in the past (hours in logarithm scale) 27
  • 28. Fast accepted answers rely heavily on the answerer 0 10 20 30 40 50 60 70 Stack Overflow Mathematics Ask Ubuntu Super User %ofexplanatorypower Question Asker Answer Answerer 28
  • 29. 0 10 20 30 40 50 60 70 Stack Overflow Mathematics Ask Ubuntu Super User %ofexplanatorypower Question Asker Answer Answerer Fast accepted answers rely heavily on the answerer 29
  • 30. Suggestions for Technical Q&A website designers Deliver questions to the right answerers and motivate them to answer questions faster. 30
  • 31. 86% - 96% of the accepted answers are posted by answerers that answered more than 5 questions before 31
  • 32. • Non-frequent answerers (<= 5 answers) • People that answered no more than 5 answers in the past • Frequent answerers (> 5 answers) • People that answered more than 5 answers in the past Non-frequent answerers vs. Frequent answerers 32
  • 33. Non-frequent answerers are the bottleneck for fast answersMeantimeofposting anacceptedanswer(hours) 33
  • 34. 34 The current incentive system only motivates frequent answerers well, but not non-frequent answerers
  • 35. Non-frequent answerers are answering questions that are as important as ones answered by non-frequent answerers Meanscoreofquestions 35
  • 36. Suggestions for Technical Q&A website designers Deliver questions to the right answerers and motivate them to answer questions faster. Improve the incentive system to attract the non- frequent answerers to become more active. 36
  • 37. Frequent answerers tend to answer shorter questions 37
  • 38. Frequent answerers probably game the incentive system Yeah, some folks are going to specialize in super-fast answers to easy questions and get more rep points than deserved,… The bigger problem is that this has the side effect of causing interesting but more difficult questions to get ignored. Typical example: someone asks a question that gets a lot of views and two or more upvotes, but it's hard enough that no one can answer within an hour or so. 38
  • 39. Suggestions for Technical Q&A website designers Deliver questions to the right answerers and motivate them to answer questions faster. Improve the incentive system to attract the non- frequent answerers to become more active. Improve the incentive system to factor in the value and difficulty of questions. 39
  • 40. 41
  • 41. 42
  • 42. 43
  • 43. 44
  • 45. 51

Editor's Notes

  1. hi, thanks for the introduction and for your coming. I am shaowei, a postdoc from queen’s university. Today I will present our paper, which is understanding the factors for faster answers in technique q&a website. This paper is down together with peter from Concordia and Ahmed from queen’s.
  2. Developers keep facing problems, whenever they do development, testing, maintenance. Problems fill developers’ life.
  3. To help
  4. Developers spend 58% of their time on comprehension activities. ~50 million monthly visitors
  5. Developers spend 58% of their time on comprehension activities. ~50 million monthly visitors
  6. Developers spend 58% of their time on comprehension activities. ~50 million monthly visitors
  7. In other words, developers ask questions very frequently.
  8. The median waiting time of a question to get answer is 0.5 hour in general. How to shorten the waiting time to get an accepted answer is an interesting question to study.
  9. To understand the factors that impact the speed of ,, and provide insights for users and websites designers to improve their system.
  10. To achieve this goal, we study four most popular websites in stack exchange network .
  11. We select the questions that have at least 1 score, cos we want to make sure the question has enough attention from the community and the quality is reasonable
  12. 55k from stack overflow
  13. To understand the factors that may impact the speed of getting an accepted answer for a question.
  14. The reason we only select the top 20% and bottom 20% is that we want to find the factors that really the impact the really and fast questions.
  15. Remove oopti and table as well.
  16. Inset equation x2
  17. Logo and auc
  18. Logo and auc
  19. Wide gray area means larger confidence interval. the relationship is less clear probability of getting a slow answer increases significantly when the value of A Median Speed Answer increases up until an inflection point with a small confidence interval (i.e., the gray bands are narrow). After the inflection point, the curve goes down gradually but with a wide confidence interval. After the inflection point, the probability goes down slowly with a larger uncertainty (i.e., the relationship is less clear due to the lack of data points in that data range).
  20. Wide gray area means larger confidence interval. the relationship is less clear probability of getting a slow answer increases significantly when the value of A Median Speed Answer increases up until an inflection point with a small confidence interval (i.e., the gray bands are narrow). After the inflection point, the curve goes down gradually but with a wide confidence interval. After the inflection point, the probability goes down slowly with a larger uncertainty (i.e., the relationship is less clear due to the lack of data points in that data range).
  21. Wide gray area means larger confidence interval. the relationship is less clear probability of getting a slow answer increases significantly when the value of A Median Speed Answer increases up until an inflection point with a small confidence interval (i.e., the gray bands are narrow). After the inflection point, the curve goes down gradually but with a wide confidence interval. After the inflection point, the probability goes down slowly with a larger uncertainty (i.e., the relationship is less clear due to the lack of data points in that data range). More importantly, this finding is hold across the different sites.
  22. Speed for an answerer to answer questions in the past Length of an answer body (controlling factor) Length of an question body
  23. Wide gray area means larger confidence interval. the relationship is less clear probability of getting a slow answer increases significantly when the value of A Median Speed Answer increases up until an inflection point with a small confidence interval (i.e., the gray bands are narrow). After the inflection point, the curve goes down gradually but with a wide confidence interval. After the inflection point, the probability goes down slowly with a larger uncertainty (i.e., the relationship is less clear due to the lack of data points in that data range).
  24. Speed for an answerer to answer questions in the past Length of an answer body (controlling factor) Length of an question body
  25. Tag also matters.
  26. Logo and number
  27. In general, fast accepted answer rely on the people who answer the question.
  28. We look at the improvement of reputation score for people with different reputation score.
  29. There are non-frequent answers
  30. The questions that are answered by non-frequent answerers are as important as these are answered by frequent answerer. However, Non-frequent answerers are the bottleneck for fast answers. So the possible explanation is that some new questions require concert knowledge that only such non-frequent answerers have. such non-frequent answerers do not actively stay on SO, therefore delay the answers.
  31. Long title
  32. To find the possible reason of this, we explore the posts on stack overflow meta
  33. select top 5 * from posts as a join posts as q on q.acceptedanswerid = a.id where DATEDIFF(week, a.creationdate, q.creationdate) > 1
  34. Developers spend 58% of their time on comprehension activities.
  35. There are non-frequent answers