SlideShare a Scribd company logo
1 of 22
Download to read offline
Graph Cluster Randomization:
Network Exposure to Multiple Universes
Authors:
 Johan Ugander, Cornell University
 Brian Karrer, Facebook
 Lars Backstrom, Facebook
 Jon Kleinberg, Cornell University

Presented by:
Subhashis Hazarika,
Ohio State University
Motivation
• To estimate “average effect” of a treatment on a sample when the
treatment of individuals in the sample spills over to the neighboring
individuals via an underlying social network.
• A/B testing is so far the standard approach for “average effect” estimation
of a treatment on sample population.
• But A/B testing doesn’t take into account the social interference of the
sample being treated.

17-10-2013

2
A/B testing
• Assumption : SUTVA (single unit treatment value assumption)
New page A

Default page B

• Treatment group

• Control group

• Individuals respond
independently

• Independent response

• Universe A and Universe B are treated as two separate parallel universes.

17-10-2013

3
Proposed Solution
Graph Cluster Randomization
– Formulate Average Treatment and Network Exposure w.r.t graphtheoretic conditions
– Apply graph cluster randomization algorithms on the formulated
model
– Come up with an unbiased estimator i.e; Horvitz-Thompson estimator,
with an upper bound on the estimator variance that is linear in the
degrees of the graph.

17-10-2013

4
Average Treatment
• Given by Aronow and Samii equation without taking into consideration
SUTVA.
• Let
be the treatment assignment vector.
• Let
be the potential outcome of user i under the treatment
assignment vector z .
• Then the avg. treatment effect is given by:

17-10-2013

5
Network Exposure
• User i is “network exposed to a treatment” (with assignment vector say z)
if i’s response under z is same as i’s response in the assignment vector 1.
• So there can be the following exposure (or conditions )for the experiment:
o Full exposure
o Absolute k exposure
o Fractional q exposure

17-10-2013

6
Graph Cluster Randomization
• At a high level GCR is a technique in which the graph is partitioned into
clusters and then randomization between treatment and control is
performed at cluster level.
• We just need to know about the intersection of the set of clusters with the
local graph structure near the vertex.

17-10-2013

7
Exposure Models
• Exposure Condition of an individual determines how they experience the
intervention in full conjunction with how the world experiences the
intervention.
• Let
be the set of all assignment vector z for which i experiences
outcome x. which is basically the exposure condition for i.
• Exposure Model for user i is a set of exposure conditions that completely
partitions the possible assignment vectors z.
• Here we are interested only with

17-10-2013

and

.

8
Exposure Conditions
• Neighborhood Exposure( local exposure conditions ):
 Full neighborhood exposure
 Absolute k- neighborhood exposure
 Fractional q- neighborhood exposure

• Core Exposure(global dependency):
 Component exposure
 Absolute k-core exposure
 Fractional q-core exposure

Note:: assignment vectors of core exposure are entirely contained in the
associated neighborhood exposure.

17-10-2013

9
Randomization and Estimation
Select assignment vector z at random from Z in the range of

.

is distribution of Z.
is probability of network exposure to treatment.
Therefore avg. treatment effect is given by Horvitz-Thompson estimator,

The expectation over Z gives the actual avg. treatment effect.

17-10-2013

10
Exposure Probabilities
Model : Full neighborhood exposure + independent vertex randomization
– Probability of exposure to treatment will be

– Probability of exposure to control will be

– Exposure prob. for high degree vertex will be exponentially small in di and this will
dramatically increase the variance of HT estimator.

17-10-2013

11
Exposure Probabilities
For absolute and fractional neighborhood models we have the following
probabilities.

17-10-2013

12
Exposure Probabilities
• This model has an upper bound given by

.

• This also gives an upper bound on the core exposure probabilities, given
by the following proposition.

17-10-2013

13
Estimator Variance

The variance of effect estimator is given by:

Final variance:

17-10-2013

14
Estimator Variance
Final co-variance:

17-10-2013

15
Estimator Variance
• Thus we achieve O(1/n) bound on variance but only when the maximum
degree is bounded.
• Variance can grow exponentially with the degree.
• Hence they try to introduce a condition on the graph clustering such that
the degree remain bounded and we still have the variance growth.

17-10-2013

16
Restricted-Growth Graph

• Let Br(v) be the set of vertices within r hops of a vertex v.

17-10-2013

17
Variance in Restricted-Growth Graph
• Consider single cycle (k=1) graph of n vertices with basic cluster size c=2

• For c = 2
• For c >= 2

17-10-2013

18
Variance in Restricted-Growth Graph

17-10-2013

19
Clustering Restricted-Growth Graph
• Using 3-net for the shortest path metric of graph G.
 Initially all vertices are unmarked.
 While there are unmarked vertices, in step j find an arbitrary unmarked vertex v,
selecting v to be vertex vj and marking all vertices in B2(vj).
 Suppose k such vertices are defined and let S = {v1,v2,…..vk}
 For every vertex w of G assign w to the closest vertex vi belonging to S, breaking ties
consistently.
 For every vj, let Cj be the set of all vertices assigned to vj.

17-10-2013

20
Variance Bounds

17-10-2013

21
Thank You

17-10-2013

22

More Related Content

Similar to Graph cluster randomization

Graph Analysis Beyond Linear Algebra
Graph Analysis Beyond Linear AlgebraGraph Analysis Beyond Linear Algebra
Graph Analysis Beyond Linear AlgebraJason Riedy
 
Knowledge Graph Convolutional Networks for Recommender Systems.pptx
Knowledge Graph Convolutional Networks for Recommender Systems.pptxKnowledge Graph Convolutional Networks for Recommender Systems.pptx
Knowledge Graph Convolutional Networks for Recommender Systems.pptxssuser2624f71
 
1.a UV visible spec.pptx.................
1.a UV visible spec.pptx.................1.a UV visible spec.pptx.................
1.a UV visible spec.pptx.................YogeshSalunke22
 
A new development in the hierarchical clustering of repertory grid data
A new development in the hierarchical clustering of repertory grid dataA new development in the hierarchical clustering of repertory grid data
A new development in the hierarchical clustering of repertory grid dataMark Heckmann
 
Introduction geostatistic for_mineral_resources
Introduction geostatistic for_mineral_resourcesIntroduction geostatistic for_mineral_resources
Introduction geostatistic for_mineral_resourcesAdi Handarbeni
 
Observational parameters of Inflation in Holographic cosmology
Observational parameters of Inflation in Holographic cosmologyObservational parameters of Inflation in Holographic cosmology
Observational parameters of Inflation in Holographic cosmologyMilan Milošević
 
Ec 2401 wireless communication unit 2
Ec 2401 wireless communication   unit 2Ec 2401 wireless communication   unit 2
Ec 2401 wireless communication unit 2JAIGANESH SEKAR
 
Cluster randomization trial presentation
Cluster randomization trial presentationCluster randomization trial presentation
Cluster randomization trial presentationRanadip Chowdhury
 
Anomaly detection: Core Techniques and Advances in Big Data and Deep Learning
Anomaly detection: Core Techniques and Advances in Big Data and Deep LearningAnomaly detection: Core Techniques and Advances in Big Data and Deep Learning
Anomaly detection: Core Techniques and Advances in Big Data and Deep LearningQuantUniversity
 
Maxillofacial Pathology Detection Using an Extended a Contrario Approach Comb...
Maxillofacial Pathology Detection Using an Extended a Contrario Approach Comb...Maxillofacial Pathology Detection Using an Extended a Contrario Approach Comb...
Maxillofacial Pathology Detection Using an Extended a Contrario Approach Comb...sipij
 
From Competition to Complementarity: Comparative Influence Diffusion and Maxi...
From Competition to Complementarity: Comparative Influence Diffusion and Maxi...From Competition to Complementarity: Comparative Influence Diffusion and Maxi...
From Competition to Complementarity: Comparative Influence Diffusion and Maxi...Wei Lu
 
A presentation for Multiple linear regression.ppt
A presentation for Multiple linear regression.pptA presentation for Multiple linear regression.ppt
A presentation for Multiple linear regression.pptvigia41
 
[ICLR2021 (spotlight)] Benefit of deep learning with non-convex noisy gradien...
[ICLR2021 (spotlight)] Benefit of deep learning with non-convex noisy gradien...[ICLR2021 (spotlight)] Benefit of deep learning with non-convex noisy gradien...
[ICLR2021 (spotlight)] Benefit of deep learning with non-convex noisy gradien...Taiji Suzuki
 
Anomaly detection Meetup Slides
Anomaly detection Meetup SlidesAnomaly detection Meetup Slides
Anomaly detection Meetup SlidesQuantUniversity
 

Similar to Graph cluster randomization (20)

Graph Analysis Beyond Linear Algebra
Graph Analysis Beyond Linear AlgebraGraph Analysis Beyond Linear Algebra
Graph Analysis Beyond Linear Algebra
 
Climate Extremes Workshop - Assessing models for estimation and methods for u...
Climate Extremes Workshop - Assessing models for estimation and methods for u...Climate Extremes Workshop - Assessing models for estimation and methods for u...
Climate Extremes Workshop - Assessing models for estimation and methods for u...
 
Knowledge Graph Convolutional Networks for Recommender Systems.pptx
Knowledge Graph Convolutional Networks for Recommender Systems.pptxKnowledge Graph Convolutional Networks for Recommender Systems.pptx
Knowledge Graph Convolutional Networks for Recommender Systems.pptx
 
1.a UV visible spec.pptx.................
1.a UV visible spec.pptx.................1.a UV visible spec.pptx.................
1.a UV visible spec.pptx.................
 
A new development in the hierarchical clustering of repertory grid data
A new development in the hierarchical clustering of repertory grid dataA new development in the hierarchical clustering of repertory grid data
A new development in the hierarchical clustering of repertory grid data
 
Aussem
AussemAussem
Aussem
 
J integral report
J integral reportJ integral report
J integral report
 
50134 10
50134 1050134 10
50134 10
 
Introduction geostatistic for_mineral_resources
Introduction geostatistic for_mineral_resourcesIntroduction geostatistic for_mineral_resources
Introduction geostatistic for_mineral_resources
 
Observational parameters of Inflation in Holographic cosmology
Observational parameters of Inflation in Holographic cosmologyObservational parameters of Inflation in Holographic cosmology
Observational parameters of Inflation in Holographic cosmology
 
Ec 2401 wireless communication unit 2
Ec 2401 wireless communication   unit 2Ec 2401 wireless communication   unit 2
Ec 2401 wireless communication unit 2
 
Cluster randomization trial presentation
Cluster randomization trial presentationCluster randomization trial presentation
Cluster randomization trial presentation
 
Nachman - Electromagnetics - Spring Review 2013
Nachman - Electromagnetics - Spring Review 2013Nachman - Electromagnetics - Spring Review 2013
Nachman - Electromagnetics - Spring Review 2013
 
Anomaly detection: Core Techniques and Advances in Big Data and Deep Learning
Anomaly detection: Core Techniques and Advances in Big Data and Deep LearningAnomaly detection: Core Techniques and Advances in Big Data and Deep Learning
Anomaly detection: Core Techniques and Advances in Big Data and Deep Learning
 
Maxillofacial Pathology Detection Using an Extended a Contrario Approach Comb...
Maxillofacial Pathology Detection Using an Extended a Contrario Approach Comb...Maxillofacial Pathology Detection Using an Extended a Contrario Approach Comb...
Maxillofacial Pathology Detection Using an Extended a Contrario Approach Comb...
 
From Competition to Complementarity: Comparative Influence Diffusion and Maxi...
From Competition to Complementarity: Comparative Influence Diffusion and Maxi...From Competition to Complementarity: Comparative Influence Diffusion and Maxi...
From Competition to Complementarity: Comparative Influence Diffusion and Maxi...
 
A presentation for Multiple linear regression.ppt
A presentation for Multiple linear regression.pptA presentation for Multiple linear regression.ppt
A presentation for Multiple linear regression.ppt
 
Linear regression analysis
Linear regression analysisLinear regression analysis
Linear regression analysis
 
[ICLR2021 (spotlight)] Benefit of deep learning with non-convex noisy gradien...
[ICLR2021 (spotlight)] Benefit of deep learning with non-convex noisy gradien...[ICLR2021 (spotlight)] Benefit of deep learning with non-convex noisy gradien...
[ICLR2021 (spotlight)] Benefit of deep learning with non-convex noisy gradien...
 
Anomaly detection Meetup Slides
Anomaly detection Meetup SlidesAnomaly detection Meetup Slides
Anomaly detection Meetup Slides
 

More from Subhashis Hazarika

Deep_Learning_Frameworks_CNTK_PyTorch
Deep_Learning_Frameworks_CNTK_PyTorchDeep_Learning_Frameworks_CNTK_PyTorch
Deep_Learning_Frameworks_CNTK_PyTorchSubhashis Hazarika
 
Word2Vec Network Structure Explained
Word2Vec Network Structure ExplainedWord2Vec Network Structure Explained
Word2Vec Network Structure ExplainedSubhashis Hazarika
 
Probabilistic Graph Layout for Uncertain Network Visualization
Probabilistic Graph Layout for Uncertain Network VisualizationProbabilistic Graph Layout for Uncertain Network Visualization
Probabilistic Graph Layout for Uncertain Network VisualizationSubhashis Hazarika
 
An analysis of_machine_and_human_analytics_in_classification
An analysis of_machine_and_human_analytics_in_classificationAn analysis of_machine_and_human_analytics_in_classification
An analysis of_machine_and_human_analytics_in_classificationSubhashis Hazarika
 
Uncertainty aware multidimensional ensemble data visualization and exploration
Uncertainty aware multidimensional ensemble data visualization and explorationUncertainty aware multidimensional ensemble data visualization and exploration
Uncertainty aware multidimensional ensemble data visualization and explorationSubhashis Hazarika
 
CSE5559::Visualizing the Life and Anatomy of Cosmic Particles
CSE5559::Visualizing the Life and Anatomy of Cosmic ParticlesCSE5559::Visualizing the Life and Anatomy of Cosmic Particles
CSE5559::Visualizing the Life and Anatomy of Cosmic ParticlesSubhashis Hazarika
 
Visualizing the variability of gradient in uncertain 2d scalarfield
Visualizing the variability of gradient in uncertain 2d scalarfieldVisualizing the variability of gradient in uncertain 2d scalarfield
Visualizing the variability of gradient in uncertain 2d scalarfieldSubhashis Hazarika
 
Sparse PDF Volumes for Consistent Multi-resolution Volume Rendering
Sparse PDF Volumes for Consistent Multi-resolution Volume RenderingSparse PDF Volumes for Consistent Multi-resolution Volume Rendering
Sparse PDF Volumes for Consistent Multi-resolution Volume RenderingSubhashis Hazarika
 
Visualization of uncertainty_without_a_mean
Visualization of uncertainty_without_a_meanVisualization of uncertainty_without_a_mean
Visualization of uncertainty_without_a_meanSubhashis Hazarika
 
Semi automatic vortex extraction in 4 d pc-mri cardiac blood flow data using ...
Semi automatic vortex extraction in 4 d pc-mri cardiac blood flow data using ...Semi automatic vortex extraction in 4 d pc-mri cardiac blood flow data using ...
Semi automatic vortex extraction in 4 d pc-mri cardiac blood flow data using ...Subhashis Hazarika
 
Linear programming in computational geometry
Linear programming in computational geometryLinear programming in computational geometry
Linear programming in computational geometrySubhashis Hazarika
 

More from Subhashis Hazarika (13)

DNN Model Interpretability
DNN Model InterpretabilityDNN Model Interpretability
DNN Model Interpretability
 
Deep_Learning_Frameworks_CNTK_PyTorch
Deep_Learning_Frameworks_CNTK_PyTorchDeep_Learning_Frameworks_CNTK_PyTorch
Deep_Learning_Frameworks_CNTK_PyTorch
 
Word2Vec Network Structure Explained
Word2Vec Network Structure ExplainedWord2Vec Network Structure Explained
Word2Vec Network Structure Explained
 
Probabilistic Graph Layout for Uncertain Network Visualization
Probabilistic Graph Layout for Uncertain Network VisualizationProbabilistic Graph Layout for Uncertain Network Visualization
Probabilistic Graph Layout for Uncertain Network Visualization
 
An analysis of_machine_and_human_analytics_in_classification
An analysis of_machine_and_human_analytics_in_classificationAn analysis of_machine_and_human_analytics_in_classification
An analysis of_machine_and_human_analytics_in_classification
 
Uncertainty aware multidimensional ensemble data visualization and exploration
Uncertainty aware multidimensional ensemble data visualization and explorationUncertainty aware multidimensional ensemble data visualization and exploration
Uncertainty aware multidimensional ensemble data visualization and exploration
 
CSE5559::Visualizing the Life and Anatomy of Cosmic Particles
CSE5559::Visualizing the Life and Anatomy of Cosmic ParticlesCSE5559::Visualizing the Life and Anatomy of Cosmic Particles
CSE5559::Visualizing the Life and Anatomy of Cosmic Particles
 
Visualizing the variability of gradient in uncertain 2d scalarfield
Visualizing the variability of gradient in uncertain 2d scalarfieldVisualizing the variability of gradient in uncertain 2d scalarfield
Visualizing the variability of gradient in uncertain 2d scalarfield
 
Sparse PDF Volumes for Consistent Multi-resolution Volume Rendering
Sparse PDF Volumes for Consistent Multi-resolution Volume RenderingSparse PDF Volumes for Consistent Multi-resolution Volume Rendering
Sparse PDF Volumes for Consistent Multi-resolution Volume Rendering
 
Visualization of uncertainty_without_a_mean
Visualization of uncertainty_without_a_meanVisualization of uncertainty_without_a_mean
Visualization of uncertainty_without_a_mean
 
Semi automatic vortex extraction in 4 d pc-mri cardiac blood flow data using ...
Semi automatic vortex extraction in 4 d pc-mri cardiac blood flow data using ...Semi automatic vortex extraction in 4 d pc-mri cardiac blood flow data using ...
Semi automatic vortex extraction in 4 d pc-mri cardiac blood flow data using ...
 
Linear programming in computational geometry
Linear programming in computational geometryLinear programming in computational geometry
Linear programming in computational geometry
 
CERN summer presentation
CERN summer presentationCERN summer presentation
CERN summer presentation
 

Recently uploaded

Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfchloefrazer622
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...Pooja Nehwal
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 

Recently uploaded (20)

Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 

Graph cluster randomization

  • 1. Graph Cluster Randomization: Network Exposure to Multiple Universes Authors:  Johan Ugander, Cornell University  Brian Karrer, Facebook  Lars Backstrom, Facebook  Jon Kleinberg, Cornell University Presented by: Subhashis Hazarika, Ohio State University
  • 2. Motivation • To estimate “average effect” of a treatment on a sample when the treatment of individuals in the sample spills over to the neighboring individuals via an underlying social network. • A/B testing is so far the standard approach for “average effect” estimation of a treatment on sample population. • But A/B testing doesn’t take into account the social interference of the sample being treated. 17-10-2013 2
  • 3. A/B testing • Assumption : SUTVA (single unit treatment value assumption) New page A Default page B • Treatment group • Control group • Individuals respond independently • Independent response • Universe A and Universe B are treated as two separate parallel universes. 17-10-2013 3
  • 4. Proposed Solution Graph Cluster Randomization – Formulate Average Treatment and Network Exposure w.r.t graphtheoretic conditions – Apply graph cluster randomization algorithms on the formulated model – Come up with an unbiased estimator i.e; Horvitz-Thompson estimator, with an upper bound on the estimator variance that is linear in the degrees of the graph. 17-10-2013 4
  • 5. Average Treatment • Given by Aronow and Samii equation without taking into consideration SUTVA. • Let be the treatment assignment vector. • Let be the potential outcome of user i under the treatment assignment vector z . • Then the avg. treatment effect is given by: 17-10-2013 5
  • 6. Network Exposure • User i is “network exposed to a treatment” (with assignment vector say z) if i’s response under z is same as i’s response in the assignment vector 1. • So there can be the following exposure (or conditions )for the experiment: o Full exposure o Absolute k exposure o Fractional q exposure 17-10-2013 6
  • 7. Graph Cluster Randomization • At a high level GCR is a technique in which the graph is partitioned into clusters and then randomization between treatment and control is performed at cluster level. • We just need to know about the intersection of the set of clusters with the local graph structure near the vertex. 17-10-2013 7
  • 8. Exposure Models • Exposure Condition of an individual determines how they experience the intervention in full conjunction with how the world experiences the intervention. • Let be the set of all assignment vector z for which i experiences outcome x. which is basically the exposure condition for i. • Exposure Model for user i is a set of exposure conditions that completely partitions the possible assignment vectors z. • Here we are interested only with 17-10-2013 and . 8
  • 9. Exposure Conditions • Neighborhood Exposure( local exposure conditions ):  Full neighborhood exposure  Absolute k- neighborhood exposure  Fractional q- neighborhood exposure • Core Exposure(global dependency):  Component exposure  Absolute k-core exposure  Fractional q-core exposure Note:: assignment vectors of core exposure are entirely contained in the associated neighborhood exposure. 17-10-2013 9
  • 10. Randomization and Estimation Select assignment vector z at random from Z in the range of . is distribution of Z. is probability of network exposure to treatment. Therefore avg. treatment effect is given by Horvitz-Thompson estimator, The expectation over Z gives the actual avg. treatment effect. 17-10-2013 10
  • 11. Exposure Probabilities Model : Full neighborhood exposure + independent vertex randomization – Probability of exposure to treatment will be – Probability of exposure to control will be – Exposure prob. for high degree vertex will be exponentially small in di and this will dramatically increase the variance of HT estimator. 17-10-2013 11
  • 12. Exposure Probabilities For absolute and fractional neighborhood models we have the following probabilities. 17-10-2013 12
  • 13. Exposure Probabilities • This model has an upper bound given by . • This also gives an upper bound on the core exposure probabilities, given by the following proposition. 17-10-2013 13
  • 14. Estimator Variance The variance of effect estimator is given by: Final variance: 17-10-2013 14
  • 16. Estimator Variance • Thus we achieve O(1/n) bound on variance but only when the maximum degree is bounded. • Variance can grow exponentially with the degree. • Hence they try to introduce a condition on the graph clustering such that the degree remain bounded and we still have the variance growth. 17-10-2013 16
  • 17. Restricted-Growth Graph • Let Br(v) be the set of vertices within r hops of a vertex v. 17-10-2013 17
  • 18. Variance in Restricted-Growth Graph • Consider single cycle (k=1) graph of n vertices with basic cluster size c=2 • For c = 2 • For c >= 2 17-10-2013 18
  • 19. Variance in Restricted-Growth Graph 17-10-2013 19
  • 20. Clustering Restricted-Growth Graph • Using 3-net for the shortest path metric of graph G.  Initially all vertices are unmarked.  While there are unmarked vertices, in step j find an arbitrary unmarked vertex v, selecting v to be vertex vj and marking all vertices in B2(vj).  Suppose k such vertices are defined and let S = {v1,v2,…..vk}  For every vertex w of G assign w to the closest vertex vi belonging to S, breaking ties consistently.  For every vj, let Cj be the set of all vertices assigned to vj. 17-10-2013 20