SlideShare a Scribd company logo
1 of 30
Download to read offline
A formal model to the routing
questions problem in the context of
              twitter
       Cleyton Caetano de Souza
Schedule
1. Introduction
  1. Problem
2. Related Works
3. The model
  1. The problem
  2. Details
4. A solution to the model
5. Conclusion
6. Future Works      Cleyton-UFCG   2
Introduction
• Web has became essential
  – Web, a repository of information
• Search Engines
  – Looking answers
• Social Networks
  – Waiting answers




                       Cleyton-UFCG    3
Problem
• Could occurs problems when you publish your
  question
  – None answer
  – None see
  – Many answers
• Direct the answer to someone
  – You ensure a answer, but will be a good one?



                       Cleyton-UFCG                4
Problem
• Informally, the problem that we proposes to
  solve is given a question posted by a user
  (asker) in Twitter, find among his followers
  that user with the characteristics:
  – (1) knows the answer
  – (2) has the trust of the questioner
  – (3) provide the answer quickly



                        Cleyton-UFCG             5
Related Works
• (Morris, Teevan e Panovich 2010a)
  – 93.5% of users received answers to their question
    after post them and these responses
  – in 90.1% of cases, were provided within one day
• Applications
  – Aardvark (Horowitz and Kamvar 2010)
  – Q-Sabe (Andrade et al 2003)
• The differential of our research

                       Cleyton-UFCG                     6
The Model
• The twitter is defined by the tuple
                   𝑇 = {𝑈, 𝑅}
• Where 𝑈 = {𝑢1 , … , 𝑢 𝑈 } is a set of users
• And 𝑅 is the set of all relationships
  𝑟𝑖,𝑗 between two users 𝑖 and 𝑗.
  – The existence of 𝑟𝑖,𝑗 means that i follows j, this
    way                    𝑟𝑖,𝑗 ≠ 𝑟𝑗,𝑖


                         Cleyton-UFCG                    7
The Model
• Each useru has the attributes
  – 𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠 𝑢 that contains all users which follows 𝑢
  – 𝐹𝑜𝑙𝑙𝑜𝑤𝑖𝑛𝑔 𝑢 that contains all users which are followed
    by 𝑢
  – 𝑀 𝑢 = 𝑚1 , … , 𝑚 𝑀 a ordered list that contains all
   messages posted for 𝑢
• Each message 𝑚 has the attributes
  – 𝑑 𝑚 - the post date
  – 𝑠 𝑚 - the string posted

                              Cleyton-UFCG                8
The Problem
   Given a query 𝑞 posted by 𝑢,
   𝑓 ∈ 𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠 𝑢 and 𝑝 𝑓,𝑞 a function
   that tell us the chances of
    𝑓 provides a good answer
– Find: 𝑓
– To: 𝑀𝑎𝑥 𝑝 𝑓,𝑞
– Over: 𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠 𝑢


                      Cleyton-UFCG        9
The problem
• We believe that 𝑝 𝑓,𝑞 has a correlation with
  three things
  – 𝑘 𝑓,𝑞 – the knowledge that 𝑓 in relation with 𝑞
  – 𝑡 𝑢,𝑓 – the trust of 𝑢 has in 𝑓
  – 𝑎 𝑓 – the level of activity of 𝑓
• That way will actually want to find the best
  combination of: 𝑘 𝑓,𝑞 , 𝑡 𝑢,𝑓 and 𝑎 𝑓


                          Cleyton-UFCG                10
Knowledge
• Each message 𝑚 𝑢 corresponds a fraction of
  the total expertise of 𝑢
                 𝑘𝑢 =                  𝑘   𝑚𝑢
                         𝑚 𝑢 ∈𝑀 𝑢
• In IR we represent this fraction as a vector of
  the words/token contained in 𝑚 𝑢
• So the 𝑘 𝑢 is a vector where each coordinate
  represents a token and its value is the
  frequency of this token in all messages 𝑚 𝑢
                        Cleyton-UFCG                11
Knowledge
• If 𝑡 𝑞 is the frequency of the token 𝑡 in 𝑞, the
  knowledge needed to answer satisfactorily the
  question is calculated as a inner product
  between the vector that represent the
  follower and the vector that represent the
  question
                𝑘 𝑓,𝑞 =             𝑡𝑞 ∗ 𝑡𝑘𝑢
                            𝑡∈𝑞

                          Cleyton-UFCG           12
Trust
• Trust is related to
  – Friendship [Schenkel et al 2008]
  – Similarity [Kuter and Golbeck 2010]
• So we believe (and simplify)
             𝑡 𝑢,𝑣 = 𝑓 𝑢,𝑣 ∗ 𝑠𝑖𝑚 𝑢, 𝑣




                        Cleyton-UFCG      13
Friendship
• Friendship measures the importance of a user
  to another
• In Twitter a good estimative of friendship
  should consider the mentions (connections)
  between 𝑢 and 𝑣, so
                      |𝑚𝑒𝑛𝑡𝑖𝑜𝑛𝑠 𝑢 𝑣 |
            𝑓 𝑢,𝑣   =
                        𝑚𝑒𝑛𝑡𝑖𝑜𝑛𝑠 𝑢



                         Cleyton-UFCG        14
Similarity
• The similarity measures how to users are
  equal under some criterion
• Appears intuitively that the similarity is
  related to equality among the attributes
                   𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠 𝑢 ∩ 𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠 𝑣
    𝑠𝑖𝑚1 𝑢, 𝑣 ∝
                   𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠 𝑢 ∪ 𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠 𝑣
                   𝐹𝑜𝑙𝑙𝑜𝑤𝑖𝑛𝑔 𝑢 ∩ 𝐹𝑜𝑙𝑙𝑜𝑤𝑖𝑛𝑔 𝑣
    𝑠𝑖𝑚2 𝑢, 𝑣 ∝
                   𝐹𝑜𝑙𝑙𝑜𝑤𝑖𝑛𝑔 𝑢 ∪ 𝐹𝑜𝑙𝑙𝑜𝑤𝑖𝑛𝑔 𝑣
            𝑠𝑖𝑚3 𝑢, 𝑣 ∝ 𝑠𝑖𝑚(𝑘 𝑢 , 𝑘 𝑣 )
                     Cleyton-UFCG              15
Similarity
• Any combination of this equations could be
  used
• We choose use

             𝑠𝑖𝑚1 𝑢, 𝑣     𝑠𝑖𝑚2 𝑢, 𝑣     𝑠𝑖𝑚3 𝑢, 𝑣
𝑠𝑖𝑚 𝑢, 𝑣 =              ∗             ∗
           1 − 𝑠𝑖𝑚1 𝑢, 𝑣 1 − 𝑠𝑖𝑚2 𝑢, 𝑣 1 − 𝑠𝑖𝑚3 𝑢, 𝑣




                       Cleyton-UFCG                    16
Activity
• Users not interact with the same intensity
• It seems intuitive that the activity level of a
  user depends on the frequency with he/she
  post new tweets




                       Cleyton-UFCG                 17
Activity
• Activity means the mean time between the
  messages posted by 𝑢
                             |𝑀|
        𝑡𝑜𝑑𝑎𝑦 − 𝑑 𝑚, 𝑀 𝑢 + 𝑖=1 𝑑 𝑚,𝑖+1 − 𝑑 𝑚,𝑖
  𝑎𝑢 =
                         𝑀𝑢 +1
• As lower this value, most active is the user and
  bigger the chances of him give a answer
  quickly


                      Cleyton-UFCG               18
Solving the Model
• Calculate the tuples (𝑘 𝑓,𝑞 , 𝑡 𝑢,𝑓 , 𝑎 𝑓 ) to each
  user is a simple task
• But, how decides who is the best?




                         Cleyton-UFCG                   19
Solving the Model
• We consider this is a problem of decision
  making with multiple criteria
• We decide to use the Weight Product Model
  to solve based on [Triantaphyllou and Mann
  1989]




                    Cleyton-UFCG               20
Solving the Model-Step 1
• The resolution of the model starts calculating
  the tuple (𝑘 𝑓,𝑞 , 𝑡 𝑢,𝑓 , 𝑎 𝑓 ) to each user
   𝑓 𝑢 ∈ 𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠 𝑢




                      Cleyton-UFCG                 21
Solving the Model-Step 2
• The we display this users in a matrix
   𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠 𝑢 𝑥|𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠 𝑢 |




                      Cleyton-UFCG        22
Solving the Model-Step 3
• We create a function 𝑚𝑎𝑝 𝑥 which will map
  the values of (𝑘 𝑓,𝑞 , 𝑡 𝑢,𝑓 , 𝑎 𝑓 ) in a same scale




                        Cleyton-UFCG                     23
Solving the Model-Step 4
• For each pair 𝑓1 , 𝑓2 |𝑓1 ≠ 𝑓2 we calculate
                      𝑥                  𝑦              𝑧
            𝑘 𝑓1 ,𝑞           𝑡 𝑢,𝑓1             𝑎 𝑓1
𝑝 𝑓1,𝑓2 =                 ∗                  *
            𝑘 𝑓2 ,𝑞           𝑡 𝑢,𝑓2             𝑎 𝑓2

• The values 𝑥,𝑦 and 𝑧 are factors of importance
  and must be between 0 and 1, besides that
   𝑥+ 𝑦+ 𝑧=1



                                Cleyton-UFCG                24
Solving the Model-Step 5
• If 𝑝 𝑓1,𝑓2 > 0 we put 1 in position (𝑓1 , 𝑓2 ) and 0
  in position (𝑓2 , 𝑓1 )
• If 𝑝 𝑓1,𝑓2 < 0 we put 0 in position (𝑓1 , 𝑓2 ) and 1
  in position (𝑓2 , 𝑓1 )
• If 𝑝 𝑓1,𝑓2 = 0 we put 1 in position (𝑓1 , 𝑓2 ) and 1
  in position (𝑓2 , 𝑓1 )



                        Cleyton-UFCG                 25
Solving the Model-Step 5




          Cleyton-UFCG     26
Solving the Model-Step 6 (End)
• We calculate the sum of each line of the
  matrix, this number represents the number of
  victories of each user
• In the end we have
• The question will be
  routed to the user
  with more victories


                    Cleyton-UFCG             27
Conclusion
• The differential of our research
  – We focus in a successful network
  – We treat the problem over a new perspective
  – We lead with a recent and interesting problem




                       Cleyton-UFCG                 28
Future Works
• The model was already implemented
• We are investigating if our heuristics are
  coherent
• We will investigating
  – If the indications of the model are accurate
  – If direct questions is more effective
  – What factor of importance is most important


                       Cleyton-UFCG                29
Thank You
• Any Question?




                    Cleyton-UFCG   30

More Related Content

Viewers also liked

Capitalismo, socialismo y comunismo
Capitalismo, socialismo y comunismoCapitalismo, socialismo y comunismo
Capitalismo, socialismo y comunismo
Rafael Verde)
 
doc_marc_pee_gener2010
doc_marc_pee_gener2010doc_marc_pee_gener2010
doc_marc_pee_gener2010
Mari Nuñez
 
Trabajo inlg
Trabajo inlgTrabajo inlg
Trabajo inlg
rjoymille
 

Viewers also liked (8)

Case For Organizational Web 2.0 Ss Version
Case For Organizational Web 2.0 Ss VersionCase For Organizational Web 2.0 Ss Version
Case For Organizational Web 2.0 Ss Version
 
Capitalismo, socialismo y comunismo
Capitalismo, socialismo y comunismoCapitalismo, socialismo y comunismo
Capitalismo, socialismo y comunismo
 
doc_marc_pee_gener2010
doc_marc_pee_gener2010doc_marc_pee_gener2010
doc_marc_pee_gener2010
 
SOCIO-DEMOGRAPHIC DIFFERENCES IN THE PERCEPTIONS OF LEARNING MANAGEMENT SYSTE...
SOCIO-DEMOGRAPHIC DIFFERENCES IN THE PERCEPTIONS OF LEARNING MANAGEMENT SYSTE...SOCIO-DEMOGRAPHIC DIFFERENCES IN THE PERCEPTIONS OF LEARNING MANAGEMENT SYSTE...
SOCIO-DEMOGRAPHIC DIFFERENCES IN THE PERCEPTIONS OF LEARNING MANAGEMENT SYSTE...
 
Modulo iii
Modulo iiiModulo iii
Modulo iii
 
Introducción a las startups
Introducción a las startupsIntroducción a las startups
Introducción a las startups
 
2013 10-04 Vruchtgebruikconstructies
2013 10-04 Vruchtgebruikconstructies2013 10-04 Vruchtgebruikconstructies
2013 10-04 Vruchtgebruikconstructies
 
Trabajo inlg
Trabajo inlgTrabajo inlg
Trabajo inlg
 

Similar to A formal model to the routing questions problem

Validity and Reliability of Cranfield-like Evaluation in Information Retrieval
Validity and Reliability of Cranfield-like Evaluation in Information RetrievalValidity and Reliability of Cranfield-like Evaluation in Information Retrieval
Validity and Reliability of Cranfield-like Evaluation in Information Retrieval
Julián Urbano
 
Caching strategies for in memory neighborhood-based recommender systems
Caching strategies for in memory neighborhood-based recommender systemsCaching strategies for in memory neighborhood-based recommender systems
Caching strategies for in memory neighborhood-based recommender systems
Simon Dooms
 
Optimal Learning for Fun and Profit with MOE
Optimal Learning for Fun and Profit with MOEOptimal Learning for Fun and Profit with MOE
Optimal Learning for Fun and Profit with MOE
Yelp Engineering
 
Scott Clark, Software Engineer, Yelp at MLconf SF
Scott Clark, Software Engineer, Yelp at MLconf SFScott Clark, Software Engineer, Yelp at MLconf SF
Scott Clark, Software Engineer, Yelp at MLconf SF
MLconf
 

Similar to A formal model to the routing questions problem (20)

Recommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative FilteringRecommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative Filtering
 
Validity and Reliability of Cranfield-like Evaluation in Information Retrieval
Validity and Reliability of Cranfield-like Evaluation in Information RetrievalValidity and Reliability of Cranfield-like Evaluation in Information Retrieval
Validity and Reliability of Cranfield-like Evaluation in Information Retrieval
 
Incremental collaborative filtering via evolutionary co clustering
Incremental collaborative filtering via evolutionary co clusteringIncremental collaborative filtering via evolutionary co clustering
Incremental collaborative filtering via evolutionary co clustering
 
adversarial robustness lecture
adversarial robustness lectureadversarial robustness lecture
adversarial robustness lecture
 
Least Cost Influence in Multiplex Social Networks
Least Cost Influence in Multiplex Social NetworksLeast Cost Influence in Multiplex Social Networks
Least Cost Influence in Multiplex Social Networks
 
Neural collaborative filtering-발표
Neural collaborative filtering-발표Neural collaborative filtering-발표
Neural collaborative filtering-발표
 
Dueling network architectures for deep reinforcement learning
Dueling network architectures for deep reinforcement learningDueling network architectures for deep reinforcement learning
Dueling network architectures for deep reinforcement learning
 
Online Machine Learning: introduction and examples
Online Machine Learning:  introduction and examplesOnline Machine Learning:  introduction and examples
Online Machine Learning: introduction and examples
 
Bottle sum
Bottle sumBottle sum
Bottle sum
 
Caching strategies for in memory neighborhood-based recommender systems
Caching strategies for in memory neighborhood-based recommender systemsCaching strategies for in memory neighborhood-based recommender systems
Caching strategies for in memory neighborhood-based recommender systems
 
ERF Training Workshop Panel Data 5
ERF Training WorkshopPanel Data 5ERF Training WorkshopPanel Data 5
ERF Training Workshop Panel Data 5
 
more loops lecture by Professor Evan korth
more loops  lecture by Professor Evan korth more loops  lecture by Professor Evan korth
more loops lecture by Professor Evan korth
 
DS M1 full - KQB KtuQbank.pdf
DS M1 full - KQB KtuQbank.pdfDS M1 full - KQB KtuQbank.pdf
DS M1 full - KQB KtuQbank.pdf
 
Optimal Learning for Fun and Profit with MOE
Optimal Learning for Fun and Profit with MOEOptimal Learning for Fun and Profit with MOE
Optimal Learning for Fun and Profit with MOE
 
Proximal Policy Optimization Algorithms, Schulman et al, 2017
Proximal Policy Optimization Algorithms, Schulman et al, 2017Proximal Policy Optimization Algorithms, Schulman et al, 2017
Proximal Policy Optimization Algorithms, Schulman et al, 2017
 
Scott Clark, Software Engineer, Yelp at MLconf SF
Scott Clark, Software Engineer, Yelp at MLconf SFScott Clark, Software Engineer, Yelp at MLconf SF
Scott Clark, Software Engineer, Yelp at MLconf SF
 
Paper Study: Melding the data decision pipeline
Paper Study: Melding the data decision pipelinePaper Study: Melding the data decision pipeline
Paper Study: Melding the data decision pipeline
 
Predicting Potential Responders in Twitter: A Query Routing Algorithm
Predicting Potential Responders in Twitter: A Query Routing AlgorithmPredicting Potential Responders in Twitter: A Query Routing Algorithm
Predicting Potential Responders in Twitter: A Query Routing Algorithm
 
Lessons learnt at building recommendation services at industry scale
Lessons learnt at building recommendation services at industry scaleLessons learnt at building recommendation services at industry scale
Lessons learnt at building recommendation services at industry scale
 
Paper Study: OptNet: Differentiable Optimization as a Layer in Neural Networks
Paper Study: OptNet: Differentiable Optimization as a Layer in Neural NetworksPaper Study: OptNet: Differentiable Optimization as a Layer in Neural Networks
Paper Study: OptNet: Differentiable Optimization as a Layer in Neural Networks
 

Recently uploaded

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Recently uploaded (20)

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 

A formal model to the routing questions problem

  • 1. A formal model to the routing questions problem in the context of twitter Cleyton Caetano de Souza
  • 2. Schedule 1. Introduction 1. Problem 2. Related Works 3. The model 1. The problem 2. Details 4. A solution to the model 5. Conclusion 6. Future Works Cleyton-UFCG 2
  • 3. Introduction • Web has became essential – Web, a repository of information • Search Engines – Looking answers • Social Networks – Waiting answers Cleyton-UFCG 3
  • 4. Problem • Could occurs problems when you publish your question – None answer – None see – Many answers • Direct the answer to someone – You ensure a answer, but will be a good one? Cleyton-UFCG 4
  • 5. Problem • Informally, the problem that we proposes to solve is given a question posted by a user (asker) in Twitter, find among his followers that user with the characteristics: – (1) knows the answer – (2) has the trust of the questioner – (3) provide the answer quickly Cleyton-UFCG 5
  • 6. Related Works • (Morris, Teevan e Panovich 2010a) – 93.5% of users received answers to their question after post them and these responses – in 90.1% of cases, were provided within one day • Applications – Aardvark (Horowitz and Kamvar 2010) – Q-Sabe (Andrade et al 2003) • The differential of our research Cleyton-UFCG 6
  • 7. The Model • The twitter is defined by the tuple 𝑇 = {𝑈, 𝑅} • Where 𝑈 = {𝑢1 , … , 𝑢 𝑈 } is a set of users • And 𝑅 is the set of all relationships 𝑟𝑖,𝑗 between two users 𝑖 and 𝑗. – The existence of 𝑟𝑖,𝑗 means that i follows j, this way 𝑟𝑖,𝑗 ≠ 𝑟𝑗,𝑖 Cleyton-UFCG 7
  • 8. The Model • Each useru has the attributes – 𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠 𝑢 that contains all users which follows 𝑢 – 𝐹𝑜𝑙𝑙𝑜𝑤𝑖𝑛𝑔 𝑢 that contains all users which are followed by 𝑢 – 𝑀 𝑢 = 𝑚1 , … , 𝑚 𝑀 a ordered list that contains all messages posted for 𝑢 • Each message 𝑚 has the attributes – 𝑑 𝑚 - the post date – 𝑠 𝑚 - the string posted Cleyton-UFCG 8
  • 9. The Problem Given a query 𝑞 posted by 𝑢, 𝑓 ∈ 𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠 𝑢 and 𝑝 𝑓,𝑞 a function that tell us the chances of 𝑓 provides a good answer – Find: 𝑓 – To: 𝑀𝑎𝑥 𝑝 𝑓,𝑞 – Over: 𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠 𝑢 Cleyton-UFCG 9
  • 10. The problem • We believe that 𝑝 𝑓,𝑞 has a correlation with three things – 𝑘 𝑓,𝑞 – the knowledge that 𝑓 in relation with 𝑞 – 𝑡 𝑢,𝑓 – the trust of 𝑢 has in 𝑓 – 𝑎 𝑓 – the level of activity of 𝑓 • That way will actually want to find the best combination of: 𝑘 𝑓,𝑞 , 𝑡 𝑢,𝑓 and 𝑎 𝑓 Cleyton-UFCG 10
  • 11. Knowledge • Each message 𝑚 𝑢 corresponds a fraction of the total expertise of 𝑢 𝑘𝑢 = 𝑘 𝑚𝑢 𝑚 𝑢 ∈𝑀 𝑢 • In IR we represent this fraction as a vector of the words/token contained in 𝑚 𝑢 • So the 𝑘 𝑢 is a vector where each coordinate represents a token and its value is the frequency of this token in all messages 𝑚 𝑢 Cleyton-UFCG 11
  • 12. Knowledge • If 𝑡 𝑞 is the frequency of the token 𝑡 in 𝑞, the knowledge needed to answer satisfactorily the question is calculated as a inner product between the vector that represent the follower and the vector that represent the question 𝑘 𝑓,𝑞 = 𝑡𝑞 ∗ 𝑡𝑘𝑢 𝑡∈𝑞 Cleyton-UFCG 12
  • 13. Trust • Trust is related to – Friendship [Schenkel et al 2008] – Similarity [Kuter and Golbeck 2010] • So we believe (and simplify) 𝑡 𝑢,𝑣 = 𝑓 𝑢,𝑣 ∗ 𝑠𝑖𝑚 𝑢, 𝑣 Cleyton-UFCG 13
  • 14. Friendship • Friendship measures the importance of a user to another • In Twitter a good estimative of friendship should consider the mentions (connections) between 𝑢 and 𝑣, so |𝑚𝑒𝑛𝑡𝑖𝑜𝑛𝑠 𝑢 𝑣 | 𝑓 𝑢,𝑣 = 𝑚𝑒𝑛𝑡𝑖𝑜𝑛𝑠 𝑢 Cleyton-UFCG 14
  • 15. Similarity • The similarity measures how to users are equal under some criterion • Appears intuitively that the similarity is related to equality among the attributes 𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠 𝑢 ∩ 𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠 𝑣 𝑠𝑖𝑚1 𝑢, 𝑣 ∝ 𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠 𝑢 ∪ 𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠 𝑣 𝐹𝑜𝑙𝑙𝑜𝑤𝑖𝑛𝑔 𝑢 ∩ 𝐹𝑜𝑙𝑙𝑜𝑤𝑖𝑛𝑔 𝑣 𝑠𝑖𝑚2 𝑢, 𝑣 ∝ 𝐹𝑜𝑙𝑙𝑜𝑤𝑖𝑛𝑔 𝑢 ∪ 𝐹𝑜𝑙𝑙𝑜𝑤𝑖𝑛𝑔 𝑣 𝑠𝑖𝑚3 𝑢, 𝑣 ∝ 𝑠𝑖𝑚(𝑘 𝑢 , 𝑘 𝑣 ) Cleyton-UFCG 15
  • 16. Similarity • Any combination of this equations could be used • We choose use 𝑠𝑖𝑚1 𝑢, 𝑣 𝑠𝑖𝑚2 𝑢, 𝑣 𝑠𝑖𝑚3 𝑢, 𝑣 𝑠𝑖𝑚 𝑢, 𝑣 = ∗ ∗ 1 − 𝑠𝑖𝑚1 𝑢, 𝑣 1 − 𝑠𝑖𝑚2 𝑢, 𝑣 1 − 𝑠𝑖𝑚3 𝑢, 𝑣 Cleyton-UFCG 16
  • 17. Activity • Users not interact with the same intensity • It seems intuitive that the activity level of a user depends on the frequency with he/she post new tweets Cleyton-UFCG 17
  • 18. Activity • Activity means the mean time between the messages posted by 𝑢 |𝑀| 𝑡𝑜𝑑𝑎𝑦 − 𝑑 𝑚, 𝑀 𝑢 + 𝑖=1 𝑑 𝑚,𝑖+1 − 𝑑 𝑚,𝑖 𝑎𝑢 = 𝑀𝑢 +1 • As lower this value, most active is the user and bigger the chances of him give a answer quickly Cleyton-UFCG 18
  • 19. Solving the Model • Calculate the tuples (𝑘 𝑓,𝑞 , 𝑡 𝑢,𝑓 , 𝑎 𝑓 ) to each user is a simple task • But, how decides who is the best? Cleyton-UFCG 19
  • 20. Solving the Model • We consider this is a problem of decision making with multiple criteria • We decide to use the Weight Product Model to solve based on [Triantaphyllou and Mann 1989] Cleyton-UFCG 20
  • 21. Solving the Model-Step 1 • The resolution of the model starts calculating the tuple (𝑘 𝑓,𝑞 , 𝑡 𝑢,𝑓 , 𝑎 𝑓 ) to each user 𝑓 𝑢 ∈ 𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠 𝑢 Cleyton-UFCG 21
  • 22. Solving the Model-Step 2 • The we display this users in a matrix 𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠 𝑢 𝑥|𝐹𝑜𝑙𝑙𝑜𝑤𝑒𝑟𝑠 𝑢 | Cleyton-UFCG 22
  • 23. Solving the Model-Step 3 • We create a function 𝑚𝑎𝑝 𝑥 which will map the values of (𝑘 𝑓,𝑞 , 𝑡 𝑢,𝑓 , 𝑎 𝑓 ) in a same scale Cleyton-UFCG 23
  • 24. Solving the Model-Step 4 • For each pair 𝑓1 , 𝑓2 |𝑓1 ≠ 𝑓2 we calculate 𝑥 𝑦 𝑧 𝑘 𝑓1 ,𝑞 𝑡 𝑢,𝑓1 𝑎 𝑓1 𝑝 𝑓1,𝑓2 = ∗ * 𝑘 𝑓2 ,𝑞 𝑡 𝑢,𝑓2 𝑎 𝑓2 • The values 𝑥,𝑦 and 𝑧 are factors of importance and must be between 0 and 1, besides that 𝑥+ 𝑦+ 𝑧=1 Cleyton-UFCG 24
  • 25. Solving the Model-Step 5 • If 𝑝 𝑓1,𝑓2 > 0 we put 1 in position (𝑓1 , 𝑓2 ) and 0 in position (𝑓2 , 𝑓1 ) • If 𝑝 𝑓1,𝑓2 < 0 we put 0 in position (𝑓1 , 𝑓2 ) and 1 in position (𝑓2 , 𝑓1 ) • If 𝑝 𝑓1,𝑓2 = 0 we put 1 in position (𝑓1 , 𝑓2 ) and 1 in position (𝑓2 , 𝑓1 ) Cleyton-UFCG 25
  • 26. Solving the Model-Step 5 Cleyton-UFCG 26
  • 27. Solving the Model-Step 6 (End) • We calculate the sum of each line of the matrix, this number represents the number of victories of each user • In the end we have • The question will be routed to the user with more victories Cleyton-UFCG 27
  • 28. Conclusion • The differential of our research – We focus in a successful network – We treat the problem over a new perspective – We lead with a recent and interesting problem Cleyton-UFCG 28
  • 29. Future Works • The model was already implemented • We are investigating if our heuristics are coherent • We will investigating – If the indications of the model are accurate – If direct questions is more effective – What factor of importance is most important Cleyton-UFCG 29
  • 30. Thank You • Any Question? Cleyton-UFCG 30