SlideShare a Scribd company logo
►Dataset:
 INEX IMDb Dataset.
 30 INEX IMDb Topics and their relevance judgments.
 7 social signals from 5 social networks.
►Using language model to estimate the relevance of document D to a query Q.
𝑷 𝑫 is a document prior. 𝑤𝑖 represents words of query Q.
►Signals are grouped according to their property 𝑥 ∈ 𝑃: 𝑃𝑜𝑝𝑢𝑙𝑎𝑟𝑖𝑡𝑦, 𝑅: 𝑅𝑒𝑝𝑢𝑡𝑎𝑡𝑖𝑜𝑛
►The priors are estimated by a counting of actions 𝑎𝑖 associated with D.
►Smoothing 𝑃(𝑎𝑖
𝑥
) by collection C using Dirichlet :
Where 𝑷 𝒙 𝑫 represents the a priori probability of D. 𝑥 ∈ 𝑃, 𝑅 refers to the
social property estimated from a set of specific actions. 𝐶𝑜𝑢𝑛𝑡(𝑎𝑖
𝑥
, 𝐷) represents
number of occurrence of action 𝑎𝑖
𝑥
on resource D. 𝑎𝑖
𝑥
designs action 𝑎𝑖 used to
estimate 𝑥 property. 𝑎•
𝑥
is the total number of signals.
►Estimating signals diversity in a resource using diversity clue of Shannon-Wiener:
Where 𝑚 represents the total number of signals.
►The Shannon clue is often accompanied by Pielou evenness clue :
►The general formula of 𝑷 𝒙 𝑫 becomes as follows:
2. Social Signals Diversity
►Context:
 Exploiting social signals to enhance a search.
 Do the quality and diversity of signals matter to capture relevant documents?
►Hypothesis 1: Diversity of signals associated with a resource is a clue that may
indicate an interest beyond a social network or a community, i.e., a resource
dominated by a single signal should be disadvantaged versus a resource with an
equitable distribution of the signals.
►Hypothesis 2: Origin of social signals might impact the retrieval.
►Research Questions:
 How to estimate the signals diversity of a resource?
 What is the impact of signals diversity on IR system?
 Is there an influence of the social networks origin on the quality of their signals?
1. Introduction
Web ResourcesSocial Networks
Like (Frequency)
Comment (Frequency)
Share (Frequency)
+1 (Frequency)
…
User’s Actions
(Social Signals)
Social Relevance Topical Relevance
Global Relevance
Figure 1. Global presentation of our approach
Signals Diversity
Ismail Badache and Mohand Boughanem
IRIT - Paul Sabatier University,Toulouse, France
{Badache, Boughanem}@irit.fr
A Priori Relevance Based On Quality and Diversity of Social Signals
𝑃 𝐷 𝑄 = 𝑅𝑎𝑛𝑘
𝑃 𝐷 ∙ 𝑃 𝑄 𝐷 = 𝑷 𝑫 ∙
𝑤 𝑖 𝜖𝑄
𝑃(𝑤𝑖 |𝑄) (1)
𝑷 𝒙 𝑫 =
𝑎 𝑖
𝑥
∈𝐴
𝑃𝑥(𝑎𝑖
𝑥
) (2)
𝑷 𝒙 𝑫 =
𝑎 𝑖
𝑥
∈𝐴
𝐶𝑜𝑢𝑛𝑡 𝑎𝑖
𝑥
, 𝐷 + 𝜇 ∙ 𝑃(𝑎𝑖
𝑥
|𝐶)
𝐶𝑜𝑢𝑛𝑡 𝑎•
𝑥
, 𝐷 + 𝜇 (3)
Santiago, Chile
August 9-13, 2015
The 38th Annual ACM SIGIR Conference
3. Experimental Evaluation
𝐷𝑖𝑣𝑒𝑟𝑠𝑖𝑡𝑦𝑠(𝐷) = −
𝑖=1
𝑚
𝑃𝑥 𝑎𝑖
𝑥
∙ log(𝑃𝑥 𝑎𝑖
𝑥
) (4)
𝐷𝑖𝑣𝑒𝑟𝑠𝑖𝑡𝑦𝑠
𝑒𝑣𝑒𝑛𝑛𝑒𝑠𝑠
𝐷 =
𝐷𝑖𝑣𝑒𝑟𝑠𝑖𝑡𝑦𝑠(𝐷)
𝑀𝐴𝑋(𝐷𝑖𝑣𝑒𝑟𝑠𝑖𝑡𝑦𝑠 𝐷 )
=
𝐷𝑖𝑣𝑒𝑟𝑠𝑖𝑡𝑦𝑠(𝐷)
log(𝑚)
(5)
Like Share Comment Tweet +1 Bookmark Share(LIn)
P@10 0,3938 0,4061 0,3857 0,3879 0,3826 0,373 0,3739
P@20 0,362 0,3649 0,3551 0,3512 0,3468 0,3414 0,3432
nDCG 0,513 0,5262 0,5121 0,4769 0,5017 0,4621 0,4566
MAP 0,2832 0,2905 0,2813 0,2735 0,2704 0,26 0,2515
0
0,1
0,2
0,3
0,4
0,5
0,6
(B) Baselines: Single Priors
VSM ML.Hiemstra
P@10 0,3411 0,37
P@20 0,3122 0,3403
nDCG 0,3919 0,4325
MAP 0,1782 0,2402
0
0,1
0,2
0,3
0,4
0,5
(A) Baselines: Without Priors
TotalFacebook Popularity Reputation All Criteria All Properties
P@10 0,4227 0,4403 0,448 0,4463 0,4689
P@20 0,4187 0,4288 0,4306 0,4318 0,4563
nDCG 0,5713 0,5983 0,611 0,6174 0,6245
MAP 0,3167 0,332 0,3319 0,3325 0,3571
0
0,1
0,2
0,3
0,4
0,5
0,6
0,7
(D) With Considering Signals Diversity
TotalFacebook Popularity Reputation All Criteria All Properties
P@10 0,4209 0,4316 0,4405 0,4408 0,4629
P@20 0,4102 0,4264 0,4272 0,4262 0,4509
nDCG 0,5681 0,5801 0,59 0,5974 0,6203
MAP 0,3125 0,3221 0,326 0,33 0,3557
0
0,1
0,2
0,3
0,4
0,5
0,6
0,7
(C) Baselines: Combination Priors
Relevant documents containing signals Relevant documents without signals Irrelevant documents
Number of documents Number of actions Average Number of documents Number of actions Average
Like 2210 800458 362,1981 555 1678040 61,6133
Share 2357 856009 363,1774 408 1862909 68,4012
Comment 1988 944023 474,8607 777 1901146 69,8052
Tweet 1735 168448 97,0884 1030 330784 12,1455
+1 790 23665 29,9556 1975 49727 1,8258
Bookmark 429 5654 13,1794 2336 20489 0,7523
Share (LIn) 601 40446 67,2985 2164 2341 0,0859
Total relevant: 2765 Total irrelevant: 27235
Table 3. Statistics on the distribution of the signals in the documents (relevant and irrelevant)
80% 85%
72%
63%
22%
29%
16%
Figure 3. Relevant documents % containing signals
32% 31% 33% 34%
95%
32%
22%
Figure 2. Signals % in the relevant documents
►Results:
Property Social signal Social Network
Popularity
Number of Comment Facebook
Number of Tweet Twitter
Number of Share(LIn) LinkedIn
Number of Share Facebook
Reputation
Number of Like Facebook
Number of +1 Google+
Number of Bookmark Delicious
4. Quantitative and Qualitative Analysis
Table 1. Exploited social signals in quantification
Document id Like Share Comment +1
tt1730728 30 11 2 0
Bookmark Tweet Share(LIn)
0 2 0
Table 2. Instance of document with social signals
𝑷 𝒙 𝑫 =
𝑎 𝑖
𝑥
∈𝐴
𝑃𝑥(𝑎𝑖
𝑥
) ∙ 𝐷𝑖𝑣𝑒𝑟𝑠𝑖𝑡𝑦𝑠
𝑒𝑣𝑒𝑛𝑛𝑒𝑠𝑠
𝐷 (6)

More Related Content

What's hot

Graph Neural Networks for Recommendations
Graph Neural Networks for RecommendationsGraph Neural Networks for Recommendations
Graph Neural Networks for Recommendations
WQ Fan
 
Learning to Classify Users in Online Interaction Networks
Learning to Classify Users in Online Interaction NetworksLearning to Classify Users in Online Interaction Networks
Learning to Classify Users in Online Interaction Networks
Symeon Papadopoulos
 
Link prediction
Link predictionLink prediction
Link prediction
Carlos Castillo (ChaTo)
 
Behind the Mask: Understanding the Structural Forces That Make Social Graphs ...
Behind the Mask: Understanding the Structural Forces That Make Social Graphs ...Behind the Mask: Understanding the Structural Forces That Make Social Graphs ...
Behind the Mask: Understanding the Structural Forces That Make Social Graphs ...
Sameera Horawalavithana
 
Introduction to Recommender System
Introduction to Recommender SystemIntroduction to Recommender System
Introduction to Recommender System
WQ Fan
 
Master defence 2020 - Andrew Kurochkin - Meme Generation for Social Media Aud...
Master defence 2020 - Andrew Kurochkin - Meme Generation for Social Media Aud...Master defence 2020 - Andrew Kurochkin - Meme Generation for Social Media Aud...
Master defence 2020 - Andrew Kurochkin - Meme Generation for Social Media Aud...
Lviv Data Science Summer School
 
02 Network Data Collection
02 Network Data Collection02 Network Data Collection
02 Network Data Collection
dnac
 
Fundamentals of Deep Recommender Systems
 Fundamentals of Deep Recommender Systems Fundamentals of Deep Recommender Systems
Fundamentals of Deep Recommender Systems
WQ Fan
 
Who to follow and why: link prediction with explanations
Who to follow and why: link prediction with explanationsWho to follow and why: link prediction with explanations
Who to follow and why: link prediction with explanations
Nicola Barbieri
 
Social and economical networks from (big-)data - Esteban Moro II
Social and economical networks from (big-)data - Esteban Moro IISocial and economical networks from (big-)data - Esteban Moro II
Social and economical networks from (big-)data - Esteban Moro II
Lake Como School of Advanced Studies
 
DIE 20130724
DIE 20130724DIE 20130724
DIE 20130724
Tokyo Tech
 
Data-driven Studies on Social Networks: Privacy and Simulation
Data-driven Studies on Social Networks: Privacy and SimulationData-driven Studies on Social Networks: Privacy and Simulation
Data-driven Studies on Social Networks: Privacy and Simulation
Sameera Horawalavithana
 
finalized poster
finalized posterfinalized poster
finalized poster
Tong Wu
 
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Jonathan Stray
 
CSE509 Lecture 5
CSE509 Lecture 5CSE509 Lecture 5
Searching for Interestingness in Wikipedia and Yahoo! Answers
Searching for Interestingness in Wikipedia and Yahoo! AnswersSearching for Interestingness in Wikipedia and Yahoo! Answers
Searching for Interestingness in Wikipedia and Yahoo! Answers
Gabriela Agustini
 
Frontiers of Computational Journalism week 3 - Information Filter Design
Frontiers of Computational Journalism week 3 - Information Filter DesignFrontiers of Computational Journalism week 3 - Information Filter Design
Frontiers of Computational Journalism week 3 - Information Filter Design
Jonathan Stray
 
Yuntech present
Yuntech presentYuntech present
Yuntech present
Tunghai University
 
NDU Present
NDU PresentNDU Present
NDU Present
Tunghai University
 

What's hot (19)

Graph Neural Networks for Recommendations
Graph Neural Networks for RecommendationsGraph Neural Networks for Recommendations
Graph Neural Networks for Recommendations
 
Learning to Classify Users in Online Interaction Networks
Learning to Classify Users in Online Interaction NetworksLearning to Classify Users in Online Interaction Networks
Learning to Classify Users in Online Interaction Networks
 
Link prediction
Link predictionLink prediction
Link prediction
 
Behind the Mask: Understanding the Structural Forces That Make Social Graphs ...
Behind the Mask: Understanding the Structural Forces That Make Social Graphs ...Behind the Mask: Understanding the Structural Forces That Make Social Graphs ...
Behind the Mask: Understanding the Structural Forces That Make Social Graphs ...
 
Introduction to Recommender System
Introduction to Recommender SystemIntroduction to Recommender System
Introduction to Recommender System
 
Master defence 2020 - Andrew Kurochkin - Meme Generation for Social Media Aud...
Master defence 2020 - Andrew Kurochkin - Meme Generation for Social Media Aud...Master defence 2020 - Andrew Kurochkin - Meme Generation for Social Media Aud...
Master defence 2020 - Andrew Kurochkin - Meme Generation for Social Media Aud...
 
02 Network Data Collection
02 Network Data Collection02 Network Data Collection
02 Network Data Collection
 
Fundamentals of Deep Recommender Systems
 Fundamentals of Deep Recommender Systems Fundamentals of Deep Recommender Systems
Fundamentals of Deep Recommender Systems
 
Who to follow and why: link prediction with explanations
Who to follow and why: link prediction with explanationsWho to follow and why: link prediction with explanations
Who to follow and why: link prediction with explanations
 
Social and economical networks from (big-)data - Esteban Moro II
Social and economical networks from (big-)data - Esteban Moro IISocial and economical networks from (big-)data - Esteban Moro II
Social and economical networks from (big-)data - Esteban Moro II
 
DIE 20130724
DIE 20130724DIE 20130724
DIE 20130724
 
Data-driven Studies on Social Networks: Privacy and Simulation
Data-driven Studies on Social Networks: Privacy and SimulationData-driven Studies on Social Networks: Privacy and Simulation
Data-driven Studies on Social Networks: Privacy and Simulation
 
finalized poster
finalized posterfinalized poster
finalized poster
 
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
 
CSE509 Lecture 5
CSE509 Lecture 5CSE509 Lecture 5
CSE509 Lecture 5
 
Searching for Interestingness in Wikipedia and Yahoo! Answers
Searching for Interestingness in Wikipedia and Yahoo! AnswersSearching for Interestingness in Wikipedia and Yahoo! Answers
Searching for Interestingness in Wikipedia and Yahoo! Answers
 
Frontiers of Computational Journalism week 3 - Information Filter Design
Frontiers of Computational Journalism week 3 - Information Filter DesignFrontiers of Computational Journalism week 3 - Information Filter Design
Frontiers of Computational Journalism week 3 - Information Filter Design
 
Yuntech present
Yuntech presentYuntech present
Yuntech present
 
NDU Present
NDU PresentNDU Present
NDU Present
 

Similar to A Priori Relevance Based On Quality and Diversity Of Social Signals

SocialCom09-tutorial.pdf
SocialCom09-tutorial.pdfSocialCom09-tutorial.pdf
SocialCom09-tutorial.pdf
BalasundaramSr
 
Open Grid Forum workshop on Social Networks, Semantic Grids and Web
Open Grid Forum workshop on Social Networks, Semantic Grids and WebOpen Grid Forum workshop on Social Networks, Semantic Grids and Web
Open Grid Forum workshop on Social Networks, Semantic Grids and Web
Noshir Contractor
 
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
BAINIDA
 
User Identity Linkage: Data Collection, DataSet Biases, Method, Control and A...
User Identity Linkage: Data Collection, DataSet Biases, Method, Control and A...User Identity Linkage: Data Collection, DataSet Biases, Method, Control and A...
User Identity Linkage: Data Collection, DataSet Biases, Method, Control and A...
IIIT Hyderabad
 
THE REACTION DATA ANALYSIS OFCOVID-19 VACCINATIONS
THE REACTION DATA ANALYSIS OFCOVID-19 VACCINATIONSTHE REACTION DATA ANALYSIS OFCOVID-19 VACCINATIONS
THE REACTION DATA ANALYSIS OFCOVID-19 VACCINATIONS
ManishReddy706923
 
Online Communities in Citizen Science
Online Communities in Citizen ScienceOnline Communities in Citizen Science
Online Communities in Citizen Science
Andrea Wiggins
 
IIR 2017, Lugano Switzerland
IIR 2017, Lugano SwitzerlandIIR 2017, Lugano Switzerland
IIR 2017, Lugano Switzerland
Marco Polignano
 
How people talk about health?
How people talk about health?How people talk about health?
How people talk about health?
Nicola Procopio
 
2010 Catalyst Conference - Trends in Social Network Analysis
2010 Catalyst Conference - Trends in Social Network Analysis2010 Catalyst Conference - Trends in Social Network Analysis
2010 Catalyst Conference - Trends in Social Network Analysis
Marc Smith
 
My Dissertation Defense
My Dissertation Defense My Dissertation Defense
My Dissertation Defense
Laura Pasquini
 
Conference talk: On the Privacy of Frequently Visited User Locations
Conference talk: On the Privacy of Frequently Visited User LocationsConference talk: On the Privacy of Frequently Visited User Locations
Conference talk: On the Privacy of Frequently Visited User Locations
Zohaib Riaz
 
From Research to Applications: What Can We Extract with Social Media Sensing?
From Research to Applications: What Can We Extract with Social Media Sensing?From Research to Applications: What Can We Extract with Social Media Sensing?
From Research to Applications: What Can We Extract with Social Media Sensing?
Yiannis Kompatsiaris
 
2009 - Connected Action - Marc Smith - Social Media Network Analysis
2009 - Connected Action - Marc Smith - Social Media Network Analysis2009 - Connected Action - Marc Smith - Social Media Network Analysis
2009 - Connected Action - Marc Smith - Social Media Network Analysis
Marc Smith
 
Enrique RCODI presentation symposium 2017
Enrique RCODI presentation symposium 2017Enrique RCODI presentation symposium 2017
Enrique RCODI presentation symposium 2017
Jesus Enrique Aldana S.
 
Hotspot Analysis with QGIS - FOSS4G-IT 2017
Hotspot Analysis with QGIS  - FOSS4G-IT 2017Hotspot Analysis with QGIS  - FOSS4G-IT 2017
Hotspot Analysis with QGIS - FOSS4G-IT 2017
Daniele Oxoli
 
巨量與開放資料之創新機會與關鍵挑戰-曾新穆
巨量與開放資料之創新機會與關鍵挑戰-曾新穆巨量與開放資料之創新機會與關鍵挑戰-曾新穆
巨量與開放資料之創新機會與關鍵挑戰-曾新穆
台灣資料科學年會
 
Curriculum_Amoroso_EN_28_07_2016
Curriculum_Amoroso_EN_28_07_2016Curriculum_Amoroso_EN_28_07_2016
Curriculum_Amoroso_EN_28_07_2016
Nicola Amoroso
 
2012: Natural Computing - The Grand Challenges and Two Case Studies
2012: Natural Computing - The Grand Challenges and Two Case Studies2012: Natural Computing - The Grand Challenges and Two Case Studies
2012: Natural Computing - The Grand Challenges and Two Case Studies
Leandro de Castro
 
Opportunities and Challenges in Crisis Informatics
Opportunities and Challenges in Crisis InformaticsOpportunities and Challenges in Crisis Informatics
Opportunities and Challenges in Crisis Informatics
Lea Shanley
 
InSTEDD HISA Conference
InSTEDD HISA ConferenceInSTEDD HISA Conference
InSTEDD HISA Conference
Eduardo Jezierski
 

Similar to A Priori Relevance Based On Quality and Diversity Of Social Signals (20)

SocialCom09-tutorial.pdf
SocialCom09-tutorial.pdfSocialCom09-tutorial.pdf
SocialCom09-tutorial.pdf
 
Open Grid Forum workshop on Social Networks, Semantic Grids and Web
Open Grid Forum workshop on Social Networks, Semantic Grids and WebOpen Grid Forum workshop on Social Networks, Semantic Grids and Web
Open Grid Forum workshop on Social Networks, Semantic Grids and Web
 
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
 
User Identity Linkage: Data Collection, DataSet Biases, Method, Control and A...
User Identity Linkage: Data Collection, DataSet Biases, Method, Control and A...User Identity Linkage: Data Collection, DataSet Biases, Method, Control and A...
User Identity Linkage: Data Collection, DataSet Biases, Method, Control and A...
 
THE REACTION DATA ANALYSIS OFCOVID-19 VACCINATIONS
THE REACTION DATA ANALYSIS OFCOVID-19 VACCINATIONSTHE REACTION DATA ANALYSIS OFCOVID-19 VACCINATIONS
THE REACTION DATA ANALYSIS OFCOVID-19 VACCINATIONS
 
Online Communities in Citizen Science
Online Communities in Citizen ScienceOnline Communities in Citizen Science
Online Communities in Citizen Science
 
IIR 2017, Lugano Switzerland
IIR 2017, Lugano SwitzerlandIIR 2017, Lugano Switzerland
IIR 2017, Lugano Switzerland
 
How people talk about health?
How people talk about health?How people talk about health?
How people talk about health?
 
2010 Catalyst Conference - Trends in Social Network Analysis
2010 Catalyst Conference - Trends in Social Network Analysis2010 Catalyst Conference - Trends in Social Network Analysis
2010 Catalyst Conference - Trends in Social Network Analysis
 
My Dissertation Defense
My Dissertation Defense My Dissertation Defense
My Dissertation Defense
 
Conference talk: On the Privacy of Frequently Visited User Locations
Conference talk: On the Privacy of Frequently Visited User LocationsConference talk: On the Privacy of Frequently Visited User Locations
Conference talk: On the Privacy of Frequently Visited User Locations
 
From Research to Applications: What Can We Extract with Social Media Sensing?
From Research to Applications: What Can We Extract with Social Media Sensing?From Research to Applications: What Can We Extract with Social Media Sensing?
From Research to Applications: What Can We Extract with Social Media Sensing?
 
2009 - Connected Action - Marc Smith - Social Media Network Analysis
2009 - Connected Action - Marc Smith - Social Media Network Analysis2009 - Connected Action - Marc Smith - Social Media Network Analysis
2009 - Connected Action - Marc Smith - Social Media Network Analysis
 
Enrique RCODI presentation symposium 2017
Enrique RCODI presentation symposium 2017Enrique RCODI presentation symposium 2017
Enrique RCODI presentation symposium 2017
 
Hotspot Analysis with QGIS - FOSS4G-IT 2017
Hotspot Analysis with QGIS  - FOSS4G-IT 2017Hotspot Analysis with QGIS  - FOSS4G-IT 2017
Hotspot Analysis with QGIS - FOSS4G-IT 2017
 
巨量與開放資料之創新機會與關鍵挑戰-曾新穆
巨量與開放資料之創新機會與關鍵挑戰-曾新穆巨量與開放資料之創新機會與關鍵挑戰-曾新穆
巨量與開放資料之創新機會與關鍵挑戰-曾新穆
 
Curriculum_Amoroso_EN_28_07_2016
Curriculum_Amoroso_EN_28_07_2016Curriculum_Amoroso_EN_28_07_2016
Curriculum_Amoroso_EN_28_07_2016
 
2012: Natural Computing - The Grand Challenges and Two Case Studies
2012: Natural Computing - The Grand Challenges and Two Case Studies2012: Natural Computing - The Grand Challenges and Two Case Studies
2012: Natural Computing - The Grand Challenges and Two Case Studies
 
Opportunities and Challenges in Crisis Informatics
Opportunities and Challenges in Crisis InformaticsOpportunities and Challenges in Crisis Informatics
Opportunities and Challenges in Crisis Informatics
 
InSTEDD HISA Conference
InSTEDD HISA ConferenceInSTEDD HISA Conference
InSTEDD HISA Conference
 

More from Ismail BADACHE

Recherche d'Information Sociale en Langue Arabe : Cas de Facebook
Recherche d'Information Sociale en Langue Arabe : Cas de FacebookRecherche d'Information Sociale en Langue Arabe : Cas de Facebook
Recherche d'Information Sociale en Langue Arabe : Cas de Facebook
Ismail BADACHE
 
Predicting Contradiction Intensity: Low, Strong or Very Strong?
Predicting Contradiction Intensity: Low, Strong or Very Strong?Predicting Contradiction Intensity: Low, Strong or Very Strong?
Predicting Contradiction Intensity: Low, Strong or Very Strong?
Ismail BADACHE
 
Prédire l’intensité de contradiction dans les commentaires : faible, forte ou...
Prédire l’intensité de contradiction dans les commentaires : faible, forte ou...Prédire l’intensité de contradiction dans les commentaires : faible, forte ou...
Prédire l’intensité de contradiction dans les commentaires : faible, forte ou...
Ismail BADACHE
 
Intensité de contradiction dans les commentaires (Séminaire à l'EHESS 04 avri...
Intensité de contradiction dans les commentaires (Séminaire à l'EHESS 04 avri...Intensité de contradiction dans les commentaires (Séminaire à l'EHESS 04 avri...
Intensité de contradiction dans les commentaires (Séminaire à l'EHESS 04 avri...
Ismail BADACHE
 
Contradiction in Reviews: is it Strong or Low?
Contradiction in Reviews: is it Strong or Low?Contradiction in Reviews: is it Strong or Low?
Contradiction in Reviews: is it Strong or Low?
Ismail BADACHE
 
LES CONTENUS SOCIAUX : Quel impact sur le processus de RI et la quantificatio...
LES CONTENUS SOCIAUX : Quel impact sur le processus de RI et la quantificatio...LES CONTENUS SOCIAUX : Quel impact sur le processus de RI et la quantificatio...
LES CONTENUS SOCIAUX : Quel impact sur le processus de RI et la quantificatio...
Ismail BADACHE
 
Harnessing Ratings and Aspect-Sentiment to Estimate Contradiction Intensity i...
Harnessing Ratings and Aspect-Sentiment to Estimate Contradiction Intensity i...Harnessing Ratings and Aspect-Sentiment to Estimate Contradiction Intensity i...
Harnessing Ratings and Aspect-Sentiment to Estimate Contradiction Intensity i...
Ismail BADACHE
 
Finding and Quantifying Temporal-Aware Contradiction in Reviews
Finding and Quantifying Temporal-Aware Contradiction in ReviewsFinding and Quantifying Temporal-Aware Contradiction in Reviews
Finding and Quantifying Temporal-Aware Contradiction in Reviews
Ismail BADACHE
 
Les Signaux Sociaux Émotionnels : Quel impact sur la RI ?
Les Signaux Sociaux Émotionnels : Quel impact sur la RI ? Les Signaux Sociaux Émotionnels : Quel impact sur la RI ?
Les Signaux Sociaux Émotionnels : Quel impact sur la RI ?
Ismail BADACHE
 
Détection de contradiction dans les commentaires
Détection de contradiction dans les commentairesDétection de contradiction dans les commentaires
Détection de contradiction dans les commentaires
Ismail BADACHE
 
Social Signals: Any Impacts in Search?
Social Signals: Any Impacts in Search?Social Signals: Any Impacts in Search?
Social Signals: Any Impacts in Search?
Ismail BADACHE
 
Multimodal Social Book Search
Multimodal Social Book SearchMultimodal Social Book Search
Multimodal Social Book Search
Ismail BADACHE
 
Pertinence a Priori Basée sur la Diversité et la Temporalité des Signaux Sociaux
Pertinence a Priori Basée sur la Diversité et la Temporalité des Signaux SociauxPertinence a Priori Basée sur la Diversité et la Temporalité des Signaux Sociaux
Pertinence a Priori Basée sur la Diversité et la Temporalité des Signaux Sociaux
Ismail BADACHE
 
Social Networks Statistics 2014
Social Networks Statistics 2014Social Networks Statistics 2014
Social Networks Statistics 2014
Ismail BADACHE
 
Exploitation de signaux sociaux pour estimer la pertinence a priori d’une res...
Exploitation de signaux sociaux pour estimer la pertinence a priori d’une res...Exploitation de signaux sociaux pour estimer la pertinence a priori d’une res...
Exploitation de signaux sociaux pour estimer la pertinence a priori d’une res...
Ismail BADACHE
 
Poster Recherche d'Information Sociale
Poster Recherche d'Information SocialePoster Recherche d'Information Sociale
Poster Recherche d'Information SocialeIsmail BADACHE
 

More from Ismail BADACHE (16)

Recherche d'Information Sociale en Langue Arabe : Cas de Facebook
Recherche d'Information Sociale en Langue Arabe : Cas de FacebookRecherche d'Information Sociale en Langue Arabe : Cas de Facebook
Recherche d'Information Sociale en Langue Arabe : Cas de Facebook
 
Predicting Contradiction Intensity: Low, Strong or Very Strong?
Predicting Contradiction Intensity: Low, Strong or Very Strong?Predicting Contradiction Intensity: Low, Strong or Very Strong?
Predicting Contradiction Intensity: Low, Strong or Very Strong?
 
Prédire l’intensité de contradiction dans les commentaires : faible, forte ou...
Prédire l’intensité de contradiction dans les commentaires : faible, forte ou...Prédire l’intensité de contradiction dans les commentaires : faible, forte ou...
Prédire l’intensité de contradiction dans les commentaires : faible, forte ou...
 
Intensité de contradiction dans les commentaires (Séminaire à l'EHESS 04 avri...
Intensité de contradiction dans les commentaires (Séminaire à l'EHESS 04 avri...Intensité de contradiction dans les commentaires (Séminaire à l'EHESS 04 avri...
Intensité de contradiction dans les commentaires (Séminaire à l'EHESS 04 avri...
 
Contradiction in Reviews: is it Strong or Low?
Contradiction in Reviews: is it Strong or Low?Contradiction in Reviews: is it Strong or Low?
Contradiction in Reviews: is it Strong or Low?
 
LES CONTENUS SOCIAUX : Quel impact sur le processus de RI et la quantificatio...
LES CONTENUS SOCIAUX : Quel impact sur le processus de RI et la quantificatio...LES CONTENUS SOCIAUX : Quel impact sur le processus de RI et la quantificatio...
LES CONTENUS SOCIAUX : Quel impact sur le processus de RI et la quantificatio...
 
Harnessing Ratings and Aspect-Sentiment to Estimate Contradiction Intensity i...
Harnessing Ratings and Aspect-Sentiment to Estimate Contradiction Intensity i...Harnessing Ratings and Aspect-Sentiment to Estimate Contradiction Intensity i...
Harnessing Ratings and Aspect-Sentiment to Estimate Contradiction Intensity i...
 
Finding and Quantifying Temporal-Aware Contradiction in Reviews
Finding and Quantifying Temporal-Aware Contradiction in ReviewsFinding and Quantifying Temporal-Aware Contradiction in Reviews
Finding and Quantifying Temporal-Aware Contradiction in Reviews
 
Les Signaux Sociaux Émotionnels : Quel impact sur la RI ?
Les Signaux Sociaux Émotionnels : Quel impact sur la RI ? Les Signaux Sociaux Émotionnels : Quel impact sur la RI ?
Les Signaux Sociaux Émotionnels : Quel impact sur la RI ?
 
Détection de contradiction dans les commentaires
Détection de contradiction dans les commentairesDétection de contradiction dans les commentaires
Détection de contradiction dans les commentaires
 
Social Signals: Any Impacts in Search?
Social Signals: Any Impacts in Search?Social Signals: Any Impacts in Search?
Social Signals: Any Impacts in Search?
 
Multimodal Social Book Search
Multimodal Social Book SearchMultimodal Social Book Search
Multimodal Social Book Search
 
Pertinence a Priori Basée sur la Diversité et la Temporalité des Signaux Sociaux
Pertinence a Priori Basée sur la Diversité et la Temporalité des Signaux SociauxPertinence a Priori Basée sur la Diversité et la Temporalité des Signaux Sociaux
Pertinence a Priori Basée sur la Diversité et la Temporalité des Signaux Sociaux
 
Social Networks Statistics 2014
Social Networks Statistics 2014Social Networks Statistics 2014
Social Networks Statistics 2014
 
Exploitation de signaux sociaux pour estimer la pertinence a priori d’une res...
Exploitation de signaux sociaux pour estimer la pertinence a priori d’une res...Exploitation de signaux sociaux pour estimer la pertinence a priori d’une res...
Exploitation de signaux sociaux pour estimer la pertinence a priori d’une res...
 
Poster Recherche d'Information Sociale
Poster Recherche d'Information SocialePoster Recherche d'Information Sociale
Poster Recherche d'Information Sociale
 

Recently uploaded

原版制作(Hull毕业证书)赫尔大学毕业证Offer一模一样
原版制作(Hull毕业证书)赫尔大学毕业证Offer一模一样原版制作(Hull毕业证书)赫尔大学毕业证Offer一模一样
原版制作(Hull毕业证书)赫尔大学毕业证Offer一模一样
7lkkjxt
 
快速办理(BCR毕业证书)加州大学河滨分校毕业证文凭证书一模一样
快速办理(BCR毕业证书)加州大学河滨分校毕业证文凭证书一模一样快速办理(BCR毕业证书)加州大学河滨分校毕业证文凭证书一模一样
快速办理(BCR毕业证书)加州大学河滨分校毕业证文凭证书一模一样
ryxqoswi
 
Maximize Your Twitch Potential!..........
Maximize Your Twitch Potential!..........Maximize Your Twitch Potential!..........
Maximize Your Twitch Potential!..........
SocioCosmos
 
HOW TO USE FACEBOOK _ by Clarissa Credito
HOW TO USE FACEBOOK _ by Clarissa CreditoHOW TO USE FACEBOOK _ by Clarissa Credito
HOW TO USE FACEBOOK _ by Clarissa Credito
ClarissaAlanoCredito
 
Transportation_Channel_Investor_Presentation_April_2024_ Final .pdf
Transportation_Channel_Investor_Presentation_April_2024_ Final .pdfTransportation_Channel_Investor_Presentation_April_2024_ Final .pdf
Transportation_Channel_Investor_Presentation_April_2024_ Final .pdf
Matthewperry105
 
UR BHATTI ACADEMY AND ONLINE COURSES.pdf
UR BHATTI ACADEMY AND ONLINE COURSES.pdfUR BHATTI ACADEMY AND ONLINE COURSES.pdf
UR BHATTI ACADEMY AND ONLINE COURSES.pdf
urbhattiacademy
 
快速办理(worcester毕业证书)伍斯特大学毕业证PDF成绩单一模一样
快速办理(worcester毕业证书)伍斯特大学毕业证PDF成绩单一模一样快速办理(worcester毕业证书)伍斯特大学毕业证PDF成绩单一模一样
快速办理(worcester毕业证书)伍斯特大学毕业证PDF成绩单一模一样
9u4xjk4w
 
STUDY ON THE DEVELOPMENT STRATEGY OF HUZHOU TOURISM
STUDY ON THE DEVELOPMENT STRATEGY OF HUZHOU TOURISMSTUDY ON THE DEVELOPMENT STRATEGY OF HUZHOU TOURISM
STUDY ON THE DEVELOPMENT STRATEGY OF HUZHOU TOURISM
AJHSSR Journal
 
LORRAINE ANDREI_LEQUIGAN_HOW TO USE TELEGRAM
LORRAINE ANDREI_LEQUIGAN_HOW TO USE TELEGRAMLORRAINE ANDREI_LEQUIGAN_HOW TO USE TELEGRAM
LORRAINE ANDREI_LEQUIGAN_HOW TO USE TELEGRAM
lorraineandreiamcidl
 
EASY TUTORIAL OF HOW TO USE G-TEAMS BY: FEBLESS HERNANE
EASY TUTORIAL OF HOW TO USE G-TEAMS BY: FEBLESS HERNANEEASY TUTORIAL OF HOW TO USE G-TEAMS BY: FEBLESS HERNANE
EASY TUTORIAL OF HOW TO USE G-TEAMS BY: FEBLESS HERNANE
Febless Hernane
 
HMS Facebook Stories All V1 06092024.docx
HMS Facebook Stories All V1 06092024.docxHMS Facebook Stories All V1 06092024.docx
HMS Facebook Stories All V1 06092024.docx
Charles Bayless
 
Dominate Reddit Discussions.............
Dominate Reddit Discussions.............Dominate Reddit Discussions.............
Dominate Reddit Discussions.............
SocioCosmos
 
Your LinkedIn Success Starts Here.......
Your LinkedIn Success Starts Here.......Your LinkedIn Success Starts Here.......
Your LinkedIn Success Starts Here.......
SocioCosmos
 

Recently uploaded (13)

原版制作(Hull毕业证书)赫尔大学毕业证Offer一模一样
原版制作(Hull毕业证书)赫尔大学毕业证Offer一模一样原版制作(Hull毕业证书)赫尔大学毕业证Offer一模一样
原版制作(Hull毕业证书)赫尔大学毕业证Offer一模一样
 
快速办理(BCR毕业证书)加州大学河滨分校毕业证文凭证书一模一样
快速办理(BCR毕业证书)加州大学河滨分校毕业证文凭证书一模一样快速办理(BCR毕业证书)加州大学河滨分校毕业证文凭证书一模一样
快速办理(BCR毕业证书)加州大学河滨分校毕业证文凭证书一模一样
 
Maximize Your Twitch Potential!..........
Maximize Your Twitch Potential!..........Maximize Your Twitch Potential!..........
Maximize Your Twitch Potential!..........
 
HOW TO USE FACEBOOK _ by Clarissa Credito
HOW TO USE FACEBOOK _ by Clarissa CreditoHOW TO USE FACEBOOK _ by Clarissa Credito
HOW TO USE FACEBOOK _ by Clarissa Credito
 
Transportation_Channel_Investor_Presentation_April_2024_ Final .pdf
Transportation_Channel_Investor_Presentation_April_2024_ Final .pdfTransportation_Channel_Investor_Presentation_April_2024_ Final .pdf
Transportation_Channel_Investor_Presentation_April_2024_ Final .pdf
 
UR BHATTI ACADEMY AND ONLINE COURSES.pdf
UR BHATTI ACADEMY AND ONLINE COURSES.pdfUR BHATTI ACADEMY AND ONLINE COURSES.pdf
UR BHATTI ACADEMY AND ONLINE COURSES.pdf
 
快速办理(worcester毕业证书)伍斯特大学毕业证PDF成绩单一模一样
快速办理(worcester毕业证书)伍斯特大学毕业证PDF成绩单一模一样快速办理(worcester毕业证书)伍斯特大学毕业证PDF成绩单一模一样
快速办理(worcester毕业证书)伍斯特大学毕业证PDF成绩单一模一样
 
STUDY ON THE DEVELOPMENT STRATEGY OF HUZHOU TOURISM
STUDY ON THE DEVELOPMENT STRATEGY OF HUZHOU TOURISMSTUDY ON THE DEVELOPMENT STRATEGY OF HUZHOU TOURISM
STUDY ON THE DEVELOPMENT STRATEGY OF HUZHOU TOURISM
 
LORRAINE ANDREI_LEQUIGAN_HOW TO USE TELEGRAM
LORRAINE ANDREI_LEQUIGAN_HOW TO USE TELEGRAMLORRAINE ANDREI_LEQUIGAN_HOW TO USE TELEGRAM
LORRAINE ANDREI_LEQUIGAN_HOW TO USE TELEGRAM
 
EASY TUTORIAL OF HOW TO USE G-TEAMS BY: FEBLESS HERNANE
EASY TUTORIAL OF HOW TO USE G-TEAMS BY: FEBLESS HERNANEEASY TUTORIAL OF HOW TO USE G-TEAMS BY: FEBLESS HERNANE
EASY TUTORIAL OF HOW TO USE G-TEAMS BY: FEBLESS HERNANE
 
HMS Facebook Stories All V1 06092024.docx
HMS Facebook Stories All V1 06092024.docxHMS Facebook Stories All V1 06092024.docx
HMS Facebook Stories All V1 06092024.docx
 
Dominate Reddit Discussions.............
Dominate Reddit Discussions.............Dominate Reddit Discussions.............
Dominate Reddit Discussions.............
 
Your LinkedIn Success Starts Here.......
Your LinkedIn Success Starts Here.......Your LinkedIn Success Starts Here.......
Your LinkedIn Success Starts Here.......
 

A Priori Relevance Based On Quality and Diversity Of Social Signals

  • 1. ►Dataset:  INEX IMDb Dataset.  30 INEX IMDb Topics and their relevance judgments.  7 social signals from 5 social networks. ►Using language model to estimate the relevance of document D to a query Q. 𝑷 𝑫 is a document prior. 𝑤𝑖 represents words of query Q. ►Signals are grouped according to their property 𝑥 ∈ 𝑃: 𝑃𝑜𝑝𝑢𝑙𝑎𝑟𝑖𝑡𝑦, 𝑅: 𝑅𝑒𝑝𝑢𝑡𝑎𝑡𝑖𝑜𝑛 ►The priors are estimated by a counting of actions 𝑎𝑖 associated with D. ►Smoothing 𝑃(𝑎𝑖 𝑥 ) by collection C using Dirichlet : Where 𝑷 𝒙 𝑫 represents the a priori probability of D. 𝑥 ∈ 𝑃, 𝑅 refers to the social property estimated from a set of specific actions. 𝐶𝑜𝑢𝑛𝑡(𝑎𝑖 𝑥 , 𝐷) represents number of occurrence of action 𝑎𝑖 𝑥 on resource D. 𝑎𝑖 𝑥 designs action 𝑎𝑖 used to estimate 𝑥 property. 𝑎• 𝑥 is the total number of signals. ►Estimating signals diversity in a resource using diversity clue of Shannon-Wiener: Where 𝑚 represents the total number of signals. ►The Shannon clue is often accompanied by Pielou evenness clue : ►The general formula of 𝑷 𝒙 𝑫 becomes as follows: 2. Social Signals Diversity ►Context:  Exploiting social signals to enhance a search.  Do the quality and diversity of signals matter to capture relevant documents? ►Hypothesis 1: Diversity of signals associated with a resource is a clue that may indicate an interest beyond a social network or a community, i.e., a resource dominated by a single signal should be disadvantaged versus a resource with an equitable distribution of the signals. ►Hypothesis 2: Origin of social signals might impact the retrieval. ►Research Questions:  How to estimate the signals diversity of a resource?  What is the impact of signals diversity on IR system?  Is there an influence of the social networks origin on the quality of their signals? 1. Introduction Web ResourcesSocial Networks Like (Frequency) Comment (Frequency) Share (Frequency) +1 (Frequency) … User’s Actions (Social Signals) Social Relevance Topical Relevance Global Relevance Figure 1. Global presentation of our approach Signals Diversity Ismail Badache and Mohand Boughanem IRIT - Paul Sabatier University,Toulouse, France {Badache, Boughanem}@irit.fr A Priori Relevance Based On Quality and Diversity of Social Signals 𝑃 𝐷 𝑄 = 𝑅𝑎𝑛𝑘 𝑃 𝐷 ∙ 𝑃 𝑄 𝐷 = 𝑷 𝑫 ∙ 𝑤 𝑖 𝜖𝑄 𝑃(𝑤𝑖 |𝑄) (1) 𝑷 𝒙 𝑫 = 𝑎 𝑖 𝑥 ∈𝐴 𝑃𝑥(𝑎𝑖 𝑥 ) (2) 𝑷 𝒙 𝑫 = 𝑎 𝑖 𝑥 ∈𝐴 𝐶𝑜𝑢𝑛𝑡 𝑎𝑖 𝑥 , 𝐷 + 𝜇 ∙ 𝑃(𝑎𝑖 𝑥 |𝐶) 𝐶𝑜𝑢𝑛𝑡 𝑎• 𝑥 , 𝐷 + 𝜇 (3) Santiago, Chile August 9-13, 2015 The 38th Annual ACM SIGIR Conference 3. Experimental Evaluation 𝐷𝑖𝑣𝑒𝑟𝑠𝑖𝑡𝑦𝑠(𝐷) = − 𝑖=1 𝑚 𝑃𝑥 𝑎𝑖 𝑥 ∙ log(𝑃𝑥 𝑎𝑖 𝑥 ) (4) 𝐷𝑖𝑣𝑒𝑟𝑠𝑖𝑡𝑦𝑠 𝑒𝑣𝑒𝑛𝑛𝑒𝑠𝑠 𝐷 = 𝐷𝑖𝑣𝑒𝑟𝑠𝑖𝑡𝑦𝑠(𝐷) 𝑀𝐴𝑋(𝐷𝑖𝑣𝑒𝑟𝑠𝑖𝑡𝑦𝑠 𝐷 ) = 𝐷𝑖𝑣𝑒𝑟𝑠𝑖𝑡𝑦𝑠(𝐷) log(𝑚) (5) Like Share Comment Tweet +1 Bookmark Share(LIn) P@10 0,3938 0,4061 0,3857 0,3879 0,3826 0,373 0,3739 P@20 0,362 0,3649 0,3551 0,3512 0,3468 0,3414 0,3432 nDCG 0,513 0,5262 0,5121 0,4769 0,5017 0,4621 0,4566 MAP 0,2832 0,2905 0,2813 0,2735 0,2704 0,26 0,2515 0 0,1 0,2 0,3 0,4 0,5 0,6 (B) Baselines: Single Priors VSM ML.Hiemstra P@10 0,3411 0,37 P@20 0,3122 0,3403 nDCG 0,3919 0,4325 MAP 0,1782 0,2402 0 0,1 0,2 0,3 0,4 0,5 (A) Baselines: Without Priors TotalFacebook Popularity Reputation All Criteria All Properties P@10 0,4227 0,4403 0,448 0,4463 0,4689 P@20 0,4187 0,4288 0,4306 0,4318 0,4563 nDCG 0,5713 0,5983 0,611 0,6174 0,6245 MAP 0,3167 0,332 0,3319 0,3325 0,3571 0 0,1 0,2 0,3 0,4 0,5 0,6 0,7 (D) With Considering Signals Diversity TotalFacebook Popularity Reputation All Criteria All Properties P@10 0,4209 0,4316 0,4405 0,4408 0,4629 P@20 0,4102 0,4264 0,4272 0,4262 0,4509 nDCG 0,5681 0,5801 0,59 0,5974 0,6203 MAP 0,3125 0,3221 0,326 0,33 0,3557 0 0,1 0,2 0,3 0,4 0,5 0,6 0,7 (C) Baselines: Combination Priors Relevant documents containing signals Relevant documents without signals Irrelevant documents Number of documents Number of actions Average Number of documents Number of actions Average Like 2210 800458 362,1981 555 1678040 61,6133 Share 2357 856009 363,1774 408 1862909 68,4012 Comment 1988 944023 474,8607 777 1901146 69,8052 Tweet 1735 168448 97,0884 1030 330784 12,1455 +1 790 23665 29,9556 1975 49727 1,8258 Bookmark 429 5654 13,1794 2336 20489 0,7523 Share (LIn) 601 40446 67,2985 2164 2341 0,0859 Total relevant: 2765 Total irrelevant: 27235 Table 3. Statistics on the distribution of the signals in the documents (relevant and irrelevant) 80% 85% 72% 63% 22% 29% 16% Figure 3. Relevant documents % containing signals 32% 31% 33% 34% 95% 32% 22% Figure 2. Signals % in the relevant documents ►Results: Property Social signal Social Network Popularity Number of Comment Facebook Number of Tweet Twitter Number of Share(LIn) LinkedIn Number of Share Facebook Reputation Number of Like Facebook Number of +1 Google+ Number of Bookmark Delicious 4. Quantitative and Qualitative Analysis Table 1. Exploited social signals in quantification Document id Like Share Comment +1 tt1730728 30 11 2 0 Bookmark Tweet Share(LIn) 0 2 0 Table 2. Instance of document with social signals 𝑷 𝒙 𝑫 = 𝑎 𝑖 𝑥 ∈𝐴 𝑃𝑥(𝑎𝑖 𝑥 ) ∙ 𝐷𝑖𝑣𝑒𝑟𝑠𝑖𝑡𝑦𝑠 𝑒𝑣𝑒𝑛𝑛𝑒𝑠𝑠 𝐷 (6)