A Query Routing Model to Rank
Expert Candidates on Twitter
Cleyton Souza, Jonathas Magalhães, Evandro Costa and
Joseana Fechine
LIA - Laboratory of Artificial Intelligence
UFCG - Federal University of Campina Grande
Campina Grande - Brazil
Introduction
• What is Social Query?
– It is the process of asking questions through social
media (e.g., Twitter, Facebook) [Morris et al.]
– The common strategy is to share the question with
everyone, but this gives no guarantee that you
will receive a good and quick answer
• Directing questions to someone is more efficient.

• What is Query Routing?
– It is the process of directing questions to appropriate
answerers (people able to help)!
cleyton.caetano.souza@copin.ufcg.edu.br

Introduction
• What are we proposing?
– A Query Routing Model: a technique that finds
the most suitable person to help you based on
knowledge, trust and activity.
– We focus on the Twitter context!

Agenda
• Introduction
• Related Work
• Proposal
• Evaluation
– Methodology
– Results
– Threats to Validity
• Conclusion & Future Work
Related Work (1/2)
• How does our proposal differ from previous work?
– Context – We focus on a social network context;
• Previous work focused on the Community Question
Answering context…
• Why did we choose Twitter?
– It is one of the most popular online social networks;
– Fewer than 18% of questions asked on Twitter are
answered [Paul et al.];
– [Nichols and Kang] confirmed that directing questions
significantly improves the response rate;
Related Work (2/2)
• How does our proposal differ from previous work?
– Problem – We treat Query Routing as a
Multi-criteria Decision Making problem, solved with
the Weighted Product Model (WPM);
• Previous work mainly applied probabilistic
models…
• Why did we choose WPM?
– [Triantaphyllou and Mann] confirmed that, for problems that
depend on up to three variables, WPM achieves the best
performance

Proposal
• A Twitter user has a question
• Our model analyzes the question and ranks the user's
followers on three criteria (further details in
[Souza et al.])
– Knowledge (K) – using a bag-of-words strategy;
– Trust (T) – a combination of similarity and
conversation rate;
– Activity (A) – mean latency between
consecutive messages;

• What do we want?
– We want to find the best combination of K, T and A!
Knowledge
• We want to ask someone who knows about the
topic of the question
• We used the Vector Space Model
– Users and questions are each represented by a vector of
terms
– We match users and questions using the cosine
similarity between these vectors
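
The matching step above can be sketched as follows. This is a minimal illustration, assuming raw term counts as vector weights and plain whitespace tokenization; the slides do not specify the weighting scheme or preprocessing, and the function names are hypothetical.

```python
import math
from collections import Counter


def term_vector(text):
    """Build a bag-of-words vector (term -> count) from lowercase whitespace tokens."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse term vectors; 0.0 if either is empty."""
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical question and follower text (e.g., the follower's concatenated tweets).
question = term_vector("how much does a trip to disneyland cost")
follower = term_vector("just back from disneyland the trip was great")
knowledge = cosine_similarity(question, follower)
```

A score near 1 means the follower's vocabulary overlaps heavily with the question, and a score of 0 means no shared terms.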

Trust/Closeness
• Sometimes we want to receive answers from
people close to us
• How do we automatically discover these people?
– We analyze the conversation rate between the
questioner and each follower
– We analyze the similarity between the follower sets of
the questioner and each follower
– We define trust as the product of
conversation rate and follower-set similarity
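
A minimal sketch of the trust score as a product of the two factors. The slides do not define either factor precisely, so two assumptions are made here: follower-set similarity is taken to be the Jaccard index, and conversation rate is taken to be the share of the questioner's messages addressed to the follower. All names are hypothetical.

```python
def jaccard(a, b):
    """Jaccard index of two sets; defined here as 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def conversation_rate(messages_to_follower, total_messages):
    """Assumed definition: share of the questioner's messages sent to this follower."""
    return messages_to_follower / total_messages if total_messages else 0.0


def trust(messages_to_follower, total_messages, questioner_followers, follower_followers):
    """Trust as the product of conversation rate and follower-set similarity."""
    rate = conversation_rate(messages_to_follower, total_messages)
    return rate * jaccard(questioner_followers, follower_followers)


# Hypothetical data: 5 of 20 messages went to this follower, 2 of 4 followers are shared.
score = trust(5, 20, {"ana", "bob", "carl"}, {"bob", "carl", "dana"})
```

Because the factors are multiplied, a follower scores zero trust if either factor is zero, for example when the two users have never talked.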
Activity
• Sometimes we prefer a quick, lower-quality
answer to a slow, high-quality one
• Our assumption is that people who produce
a lot of content in a short time will provide
quick answers
• Activity is the mean latency between
consecutive posts
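
The activity measure can be sketched as below, assuming post times are given as timestamps in seconds. Returning infinity for users with fewer than two posts is an assumption of this sketch, not something stated in the slides.

```python
def mean_latency(timestamps):
    """Mean gap between consecutive posts; timestamps are in seconds, in any order."""
    if len(timestamps) < 2:
        return float("inf")  # assumption: too few posts to estimate activity
    ordered = sorted(timestamps)
    gaps = [later - earlier for earlier, later in zip(ordered, ordered[1:])]
    return sum(gaps) / len(gaps)


# Hypothetical posting times: gaps of 60, 120 and 240 seconds.
latency = mean_latency([0, 60, 180, 420])
```

Note that a lower latency means a more active user, so the value would need to be inverted or otherwise mapped before it is used as a "higher is better" criterion.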

Proposal
• How do we compare the criteria configurations of
the followers?
– We use the Weighted Product Model: we compare two
users using the following function:
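$$\mathit{comp}(u, v) = \left(\frac{\mathit{map}(K_u)}{\mathit{map}(K_v)}\right)^{w_k} \cdot \left(\frac{\mathit{map}(T_u)}{\mathit{map}(T_v)}\right)^{w_t} \cdot \left(\frac{\mathit{map}(A_u)}{\mathit{map}(A_v)}\right)^{w_a}$$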

– The result of the comparison tells us which user is
better!
– We sum the victories of each user and rank the users
by their total number of victories!
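
A minimal sketch of the pairwise comparison and victory count. It assumes the criteria values have already been passed through the mapping function (so they are strictly positive and higher is better) and that a user wins a comparison when the weighted product ratio exceeds 1, which is the usual Weighted Product Model convention. The scores, weights and names below are hypothetical.

```python
from itertools import permutations


def comp(u, v, weights):
    """Weighted product ratio of u over v across the criteria K, T and A."""
    result = 1.0
    for criterion, weight in weights.items():
        result *= (u[criterion] / v[criterion]) ** weight
    return result


def rank_by_victories(candidates, weights):
    """Order candidate names by how many pairwise comparisons each one wins."""
    victories = {name: 0 for name in candidates}
    for a, b in permutations(candidates, 2):
        if comp(candidates[a], candidates[b], weights) > 1.0:
            victories[a] += 1
    return sorted(victories, key=victories.get, reverse=True)


# Hypothetical, already mapped scores (strictly positive, higher is better).
candidates = {
    "ana": {"K": 0.9, "T": 0.5, "A": 0.4},
    "bob": {"K": 0.3, "T": 0.8, "A": 0.6},
    "carl": {"K": 0.2, "T": 0.2, "A": 0.3},
}
weights = {"K": 1.0, "T": 1.0, "A": 1.0}  # hypothetical equal weights
ranking = rank_by_victories(candidates, weights)
```

In this example "ana" beats both other users and "bob" beats only "carl", so the ranking is ana, bob, carl. Raising a weight above 1 amplifies that criterion's ratio, and a weight of 0 removes the criterion from the comparison.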
Evaluation
• We used a Quantitative Approach!
• Methodology
1. We selected 160 questions and their answers
published on Twitter
2. We manually ranked the answers of each
question based on their utility

Evaluation
• We manually ranked the answers to each
question based on their utility

Question: How much does it cost to go to Disneyland?

Answer                              Answer Type                          Utility
I don’t know                        An unhelpful answer                  1
I think @someone knows              Pointing to someone or some source   2
Between $1000 and $2000             An uncertain answer                  3
I went last year and I spent $700   A direct answer                      4

• We used the order in which the
answers were given as a tie-breaker
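
The ranking rule above can be expressed as a two-part sort key. This sketch assumes that, among answers with equal utility, the earlier answer ranks higher; the slides name arrival order as the tie-breaker but do not state its direction. The records are hypothetical.

```python
def rank_answers(answers):
    """Sort answers by utility (highest first), breaking ties by arrival order."""
    return sorted(answers, key=lambda a: (-a["utility"], a["order"]))


# Hypothetical answers; "order" is the position in which each answer arrived.
answers = [
    {"user": "u1", "utility": 1, "order": 0},
    {"user": "u2", "utility": 4, "order": 1},
    {"user": "u3", "utility": 4, "order": 2},
    {"user": "u4", "utility": 3, "order": 3},
]
ideal = [a["user"] for a in rank_answers(answers)]
```

Here "u2" and "u3" tie on utility, and "u2" is placed first because it answered earlier.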
Evaluation
• Methodology
4. We crawled information about their questioners and
answerers (user profile, followers set, following set, tweets);
5. We ranked the answerers using our proposal
6. We compared both ranks using nDCG

• Our aim is to answer the following questions
– Does our model predict the utility of
the answers well?
– Does WPM perform better than each
criterion used individually?
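
The rank comparison in step 6 can be sketched as follows. The slides do not say which nDCG variant was used, so this assumes the common form with linear gain (the utility itself) and a log2 position discount. The utility lists are hypothetical.

```python
import math


def dcg(utilities):
    """Discounted cumulative gain with linear gain and a log2 position discount."""
    return sum(u / math.log2(pos + 2) for pos, u in enumerate(utilities))


def ndcg(predicted_utilities):
    """DCG of the predicted order divided by the DCG of the ideal (sorted) order."""
    ideal = dcg(sorted(predicted_utilities, reverse=True))
    return dcg(predicted_utilities) / ideal if ideal > 0 else 0.0


# Utilities of the answers, listed in the order the model ranked their authors.
perfect = ndcg([4, 3, 2, 1])
swapped = ndcg([3, 4, 2, 1])
```

A model ranking that matches the manual ranking scores exactly 1, while swapping the top two answerers lowers the score, because mistakes near the top of the list are discounted least.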
Results
Question Type [Morris et al.]   Amount of Questions   Mean of nDCG
Recommendation                  56                    0.92 ± 0.23
Opinion                         17                    0.83 ± 0.31
Factual Knowledge               40                    0.91 ± 0.26
Rhetorical                      15                    0.90 ± 0.25
Invitation                      3                     0.99 ± 0.01
Favor                           8                     1.00 ± 0.00
Social connection               12                    0.87 ± 0.28
Offer                           9                     0.84 ± 0.31
Total / Mean                    160                   0.90
Does our model predict the aptitude of the
expert candidates well?
• Promising results
– We reached a mean nDCG above 0.9;
– A one-tailed binomial test statistically confirmed that the
QR model predicted the ideal rank in more than
64% of cases (p-value = 0.03219, α = 5%);
• An improvement over [Souza et al. 2012]
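
A sketch of how a one-tailed exact binomial test of this kind can be computed, with the null hypothesis that the ideal rank is predicted in 64% of cases and the alternative that the rate is higher. The success count below is hypothetical, since the slides report only the p-value, not how many of the 160 rankings were ideal.

```python
from math import comb


def binomial_upper_tail(successes, trials, p0):
    """P(X >= successes) for X ~ Binomial(trials, p0): the one-tailed p-value."""
    return sum(
        comb(trials, k) * p0**k * (1 - p0) ** (trials - k)
        for k in range(successes, trials + 1)
    )


# Hypothetical count: the slides do not say how many rankings were ideal.
p_value = binomial_upper_tail(successes=115, trials=160, p0=0.64)
reject_null = p_value < 0.05  # compare against the significance level alpha = 5%
```

The null hypothesis is rejected when the probability of seeing at least the observed number of ideal rankings, under a true rate of 64%, falls below α.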

Does WPM perform better than
the individual criteria?

Figure 1: Boxplot comparing WPM with each individual criterion
Does WPM perform better than
the individual criteria?
• We performed pairwise comparisons using the
Wilcoxon Signed Rank Test (α=5%)

Hypothesis                                           P-value     Conclusion
WPM has a better nDCG distribution than Knowledge    1.357e-15   True
WPM has a better nDCG distribution than Activity     6.701e-16   True
WPM has a better nDCG distribution than Trust        4.025e-16   True

Threats to Validity
• Evaluation methodology
• Few questions
• Manually ordered answers

Conclusion & Future Work
• We proposed a QR Model for Twitter
– We achieved promising results in a young field
– We confirmed the superiority of WPM over the individual criteria
– We created a public dataset for future research in the
area

• Future Work
– Is directing questions to experts more effective than
sharing questions?
– What is the relationship between the weights given to
the criteria and the qualities (truth, intimacy, speed) of
the received answer?
References
• M. Morris, J. Teevan, and K. Panovich, “What do people ask their social networks, and
why?: a survey study of status message q&a behavior”, Proceedings of the 28th ACM
International Conference on Human Factors in Computing Systems, 2010, pp. 1739–1748.
• J. Nichols and J. Kang, “Asking questions of targeted strangers on social networks”,
Proceedings of the ACM Conference on Computer Supported Cooperative Work, 2012,
pp. 999–1002.
• S. Paul, L. Hong, and E. Chi, “Is Twitter a good place for asking questions? a
characterization study”, Proceedings of the 5th International AAAI Conference on
Weblogs and Social Media, 2011, pp. 578–581.
• C. Souza, J. Magalhães, and E. Costa, “A Formal Model to the Routing Questions Problem
in the Context of Twitter”, Proceedings of the IADIS International Conference
WWW/Internet, 2011.
• C. Souza, J. Magalhães, E. Costa, and J. Fechine, “Predicting Potential Responders in
Twitter: A Query Routing Algorithm”, Proceedings of the 12th International Conference
on Computational Science and Its Applications, 2012, pp. 714–729.
• E. Triantaphyllou and S. Mann, “An examination of the effectiveness of
multi-dimensional decision-making methods: A decision-making paradox”, Decision Support
Systems, vol. 5, 1989, pp. 303–312.
Questions?

