SlideShare a Scribd company logo
Comparative Influence Maximization:
From Competition to Complementarity
Wei Lu (LinkedIn)
Wei Chen (Microsoft Research)
Laks V.S. Lakshmanan (UBC)
NDA’16 Workshop, SIGMOD
To appear in VLDB’16, New Delhi, India
Social influence
• Ubiquitous in life
• Fueled by the widespread popularity of online
social networks and social media
• Computational Social Influence (CSI)
– Viral Marketing
– Influence Maximization
– The applications and extensions to the above
Computational Social Influence
• Social networks with edge weights (influence
probabilities or weights)
• Stochastic influence/information propagation models
– Single-item vs. Multiple-item models
• Diffusion dynamics depend heavily on the
relationship of the propagating entities
• Pure Competition: Each user adopts at most one
item
– Competitive Independent Cascade Model (CIC)
– K-LT Model
– WPCLT Model …
Limitations of Pure Competition
Models: Example
Item Relationships
• Propagating items can be of any relationship:
– Compete (iPhone vs Nexus)
– Complement (iPhone vs Apple Watch, iPhone vs
iPhone cases)
• Natural and well-studied in economics
– Substitute goods and complementary goods
• Item relationship may be asymmetric
• Item relationship may be to an arbitrary degree
(not “pure”)
Motivations and Challenges
“One model that works for all kinds of item
relationships”: Not existent until this work
Challenges:
• Unified model with great expressive power
• Compact and manageable representation
• Allows room to develop tractable solutions for
natural influence optimization problems
• Model validation, data
Main Contributions
• Comparative Independent Cascade (ComIC):
Capturing both competition and
complementarity, to any arbitrary degree
• Problem: Self Influence Maximization
• Problem: Complementary Influence
Maximization
• Algorithm: Generalized Reverse Reachable Sets
• Algorithm: Sandwich Approximation
Model Overview
• Focusing on two items
– Challenges abundant already
– Future work: extended to an arbitrary number of items
• Edge-level influence/information propagation
– Similar to the classic IC model
• Node-level Decision-making controlled by
Node-Level Automata (NLA)
– Global Adoption Probabilities (GAP)
Global Adoption Probabilities
• Key parameters measuring the degree to which two
items compete with or complement each other
• q(A|0): probability of adopting A when the user has
not yet adopted any other items
• q(A|B): probability of adopting A when the user has
already adopted B
• q(A|0) >= q(A|B): B competes with A
• q(A|0) <= q(A|B): B complements A
Transition diagram
For each item, each node may be of the following status:
• Idle (inactive)
• Informed (influenced)
• Suspended / Adopted / Rejected
Diffusion dynamics
• Initially,every node is inactive/idle wrt both items
• When any node adopts the first item, its
outgoing edges are tested for information
propagation to neighbors (“info channel”)
– Each edge (u,v) becomes open w.p.p(u,v)
• If u is A-adopted, and info channel on edge (u,v)
is open, then v decides to adopt A based on:
– w.p. q(A|0) if v has not adopted B
– w.p. q(A|B) if v has adopted B
Node tie-breaking
• What if there are multiple in-neighbors active in
the last time step t-1?
• Generate a random permutation of those in-
neighbors, and follow that order to test activation
• If one such neighbor adopted both items at t-1,
following the same order for informing
• If a seed is targeted with both items, decide the
order randomly (0.5 and 0.5 prob.)
Node Reconsideration
• Suppose B complements A: q(A|0) <= q(A|B)
• User v was informed of A, but did not adopt with
probability 1 – q(A|0)
• Once v adopts B, since B complements A, user
may want to revisit the decision with a
reconsideration probability:
General Properties of ComIC
model
• Neither submodularity nor monotonicity holds in
an arbitrary instance of the model
• Influence maximization may be intractable
• Overall strategy:
– Identify a parameter subspace such that
submodularity is satisfied
– Develop efficient approximation algorithm
(Generalized RR-set) for submodular cases
– “Sandwich Approximation” for non-submodular
cases
Submodularity: Complementary Case
Possible World Definition
• An equivalent representation of the model and
the propagation dynamics
– Propagation in a possible world is deterministic, easy
to reason about
• Equivalent Possible World model for ComIC
– For each edge (u,v), remove w.p. 1-p(u,v)
– For each node v, randomly generate α(v,A) and α(v,B)
for testing with adoption probabilities.
– Adoption happens when α <= adoption prob.
Influence Maximization Problems
• Self Influence Maximization (SIM): Fix B-seed
set, find the best A-seed set of size k to
maximize A’s expected influence spread
• Complementary Influence Maximization
(CIM): Fix A-seed set, find the best B-seed set of
size k to maximize the boost B gives to A’s
expected influence spread
• Both NP-hard under ComIC model
Algorithm Design for SIM and CIM
• Generalized Reverse-Reachable Set (RR-set):
RR-set based algorithms are the state-of-the-art
for classical influence maximization with single-
item propagation models (IC and LT)
• Sandwich Approximation to achieve
approximation guarantees in non-submodular
cases
• Both techniques are generic and applicable to
any non-submodular maximization problems
Recap: Reverse-Reachable Set
• If u can reach v (in a deterministic directed
graph), then u is in a RR-set rooted at v [Borgs et
al., SODA’14]
• Random RR-set: root v is randomly chosen
• Two-phase Inf. Max. (TIM) [Tang et al 2014]
– Estimate the minimum number of random RR-sets
required, for probabilistic approx. guarantees
• 1-1/e-ε: smaller ε requires more RR-sets to be generated
– Generate random RR-sets using backward BFS
– Seed selection (deterministic max-cover problem)
Recap: TIM Algorithm
• (1-1/e-ε)-approximation with high probability
– Same as greedy, modulo probabilistic part
• Orders of magnitude faster than Greedy + Monte
Carlo simulations
• Scalable to billion-edge graphs
• Applies to a large family of stochastic
propagation models
Generalized RR-set and TIM
Algorithms
• Works for any stochastic propagation models
satisfying monotonicity and submodularity
– Has (1-1/e-ε)-approximation with high probability
• General RR-set (in a deterministic possible
world): u belongs to the RR-set rooted at v if the
singleton seed set {u} can activate v
– Note difference from “reaching”
– Random RR-set: root v is sampled uniformly at
random from the graph
RR-set generation for SIM (RR-
SIM)
• Problem definition and submodular setting
– Fix B-seed set, find A-seed set (size k)
– A is complemented by B: q(A|0) <= q(A|B)
– B is indifferent to A: q(B|0) = q(B|A)
• Phase 1: Forward Labeling: Start from B-seed
set, label node status w.r.t. B
• Phase 2: Backward BFS (details next)
Phase 2: Backward BFS
• Randomly choose root v from the graph
• Enqueue v into a FIFO queue Q
• Until empty, repeatedly dequeue from Q
• Let’s say we get a node u from Q
• Enqueue u’s in-neighbours (with edge test) if
either is true
– u is B-adopted and α(A,u) <= q(A|B)
– u is not B-adopted and α(A,u) <= q(A|0)
RR-Set generation for CIM (RR-
CIM)
• Given A-seed set, find best complementing B set
• Cross-submodularity holds q(B|A) = 1
• Forward Labeling: Start from A-seed set, identify
nodes can be A-adopted potentially
• Backward BFS: Two passes required
Sandwich Approximation
• Given any non-submodular set functions, how to
leverage submodular maximization (e.g., greedy,
local search) to achieve provable approximation
guarantees?
• Answer:
– Derive upper/lower bound submodular functions
(“sandwiched”)
– Use the best of the three solutions, which gives a
data-dependent approximation ratio
Sandwich Approximation
non-submodular, function we
want to maximize
lower bound, submodular
upper bound, submodular
Remarks
• Applicable to any non-submodular function
maximization
• If monotone, run Greedy on the upper bound,
lower bound, and the actual function
• If non-monotone, run Local Search
• Upper/lower bound should be reasonably tight to
be meaningful
Experiments: Datasets
Also have synthetic dataset up to 1 million nodes
Learning Global Adoption Probabilities
Dataset: Flixster
• Signals for adoption: rated a movie
• Signals for informed: “Want to See”, “Not Interested”
Effects of εin General TIM algorithm: Tradeoff
between seed set quality and running time
SIM experiments: spread
CIM experiments: spread
Running time
Sandwich Approximation Bounds
Thank you!
See you in VLDB’16!

More Related Content

Similar to From Competition to Complementarity: Comparative Influence Diffusion and Maximization

Graph Analysis Beyond Linear Algebra
Graph Analysis Beyond Linear AlgebraGraph Analysis Beyond Linear Algebra
Graph Analysis Beyond Linear Algebra
Jason Riedy
 
ProbabilisticModeling20080411
ProbabilisticModeling20080411ProbabilisticModeling20080411
ProbabilisticModeling20080411
Clay Stanek
 
Bounded Model Checking
Bounded Model CheckingBounded Model Checking
Bounded Model Checking
Ilham Amezzane
 
1629 stochastic subgradient approach for solving linear support vector
1629 stochastic subgradient approach for solving linear support vector1629 stochastic subgradient approach for solving linear support vector
1629 stochastic subgradient approach for solving linear support vector
Dr Fereidoun Dejahang
 
Sampling and Markov Chain Monte Carlo Techniques
Sampling and Markov Chain Monte Carlo TechniquesSampling and Markov Chain Monte Carlo Techniques
Sampling and Markov Chain Monte Carlo Techniques
Tomasz Kusmierczyk
 
lecture_16.pptx
lecture_16.pptxlecture_16.pptx
lecture_16.pptx
ObaidUllah693733
 
Machine Learning workshop by GDSC Amity University Chhattisgarh
Machine Learning workshop by GDSC Amity University ChhattisgarhMachine Learning workshop by GDSC Amity University Chhattisgarh
Machine Learning workshop by GDSC Amity University Chhattisgarh
Poorabpatel
 
Iterative Graph Computation in the Big Data Era
Iterative Graph Computation in the Big Data EraIterative Graph Computation in the Big Data Era
Iterative Graph Computation in the Big Data Era
Wenlei Xie
 
QX Simulator and quantum programming - 2020-04-28
QX Simulator and quantum programming - 2020-04-28QX Simulator and quantum programming - 2020-04-28
QX Simulator and quantum programming - 2020-04-28
Aritra Sarkar
 
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
The Statistical and Applied Mathematical Sciences Institute
 
Towards quantum machine learning calogero zarbo - meet up
Towards quantum machine learning  calogero zarbo - meet upTowards quantum machine learning  calogero zarbo - meet up
Towards quantum machine learning calogero zarbo - meet up
Deep Learning Italia
 
Support Vector Machines USING MACHINE LEARNING HOW IT WORKS
Support Vector Machines USING MACHINE LEARNING HOW IT WORKSSupport Vector Machines USING MACHINE LEARNING HOW IT WORKS
Support Vector Machines USING MACHINE LEARNING HOW IT WORKS
rajalakshmi5921
 
Distributed Systems in the Post-Moore Era.pptx
Distributed Systems in the Post-Moore Era.pptxDistributed Systems in the Post-Moore Era.pptx
Distributed Systems in the Post-Moore Era.pptx
Förderverein Technische Fakultät
 
SVD and the Netflix Dataset
SVD and the Netflix DatasetSVD and the Netflix Dataset
SVD and the Netflix Dataset
Ben Mabey
 
Financial Networks III. Centrality and Systemic Importance
Financial Networks III. Centrality and Systemic ImportanceFinancial Networks III. Centrality and Systemic Importance
Financial Networks III. Centrality and Systemic Importance
Kimmo Soramaki
 
Online machine learning in Streaming Applications
Online machine learning in Streaming ApplicationsOnline machine learning in Streaming Applications
Online machine learning in Streaming Applications
Stavros Kontopoulos
 
Reinforcement Learning and Artificial Neural Nets
Reinforcement Learning and Artificial Neural NetsReinforcement Learning and Artificial Neural Nets
Reinforcement Learning and Artificial Neural Nets
Pierre de Lacaze
 
Declarative data analysis
Declarative data analysisDeclarative data analysis
Declarative data analysis
South West Data Meetup
 
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
The Statistical and Applied Mathematical Sciences Institute
 
1 chayes
1 chayes1 chayes
1 chayes
Yandex
 

Similar to From Competition to Complementarity: Comparative Influence Diffusion and Maximization (20)

Graph Analysis Beyond Linear Algebra
Graph Analysis Beyond Linear AlgebraGraph Analysis Beyond Linear Algebra
Graph Analysis Beyond Linear Algebra
 
ProbabilisticModeling20080411
ProbabilisticModeling20080411ProbabilisticModeling20080411
ProbabilisticModeling20080411
 
Bounded Model Checking
Bounded Model CheckingBounded Model Checking
Bounded Model Checking
 
1629 stochastic subgradient approach for solving linear support vector
1629 stochastic subgradient approach for solving linear support vector1629 stochastic subgradient approach for solving linear support vector
1629 stochastic subgradient approach for solving linear support vector
 
Sampling and Markov Chain Monte Carlo Techniques
Sampling and Markov Chain Monte Carlo TechniquesSampling and Markov Chain Monte Carlo Techniques
Sampling and Markov Chain Monte Carlo Techniques
 
lecture_16.pptx
lecture_16.pptxlecture_16.pptx
lecture_16.pptx
 
Machine Learning workshop by GDSC Amity University Chhattisgarh
Machine Learning workshop by GDSC Amity University ChhattisgarhMachine Learning workshop by GDSC Amity University Chhattisgarh
Machine Learning workshop by GDSC Amity University Chhattisgarh
 
Iterative Graph Computation in the Big Data Era
Iterative Graph Computation in the Big Data EraIterative Graph Computation in the Big Data Era
Iterative Graph Computation in the Big Data Era
 
QX Simulator and quantum programming - 2020-04-28
QX Simulator and quantum programming - 2020-04-28QX Simulator and quantum programming - 2020-04-28
QX Simulator and quantum programming - 2020-04-28
 
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
 
Towards quantum machine learning calogero zarbo - meet up
Towards quantum machine learning  calogero zarbo - meet upTowards quantum machine learning  calogero zarbo - meet up
Towards quantum machine learning calogero zarbo - meet up
 
Support Vector Machines USING MACHINE LEARNING HOW IT WORKS
Support Vector Machines USING MACHINE LEARNING HOW IT WORKSSupport Vector Machines USING MACHINE LEARNING HOW IT WORKS
Support Vector Machines USING MACHINE LEARNING HOW IT WORKS
 
Distributed Systems in the Post-Moore Era.pptx
Distributed Systems in the Post-Moore Era.pptxDistributed Systems in the Post-Moore Era.pptx
Distributed Systems in the Post-Moore Era.pptx
 
SVD and the Netflix Dataset
SVD and the Netflix DatasetSVD and the Netflix Dataset
SVD and the Netflix Dataset
 
Financial Networks III. Centrality and Systemic Importance
Financial Networks III. Centrality and Systemic ImportanceFinancial Networks III. Centrality and Systemic Importance
Financial Networks III. Centrality and Systemic Importance
 
Online machine learning in Streaming Applications
Online machine learning in Streaming ApplicationsOnline machine learning in Streaming Applications
Online machine learning in Streaming Applications
 
Reinforcement Learning and Artificial Neural Nets
Reinforcement Learning and Artificial Neural NetsReinforcement Learning and Artificial Neural Nets
Reinforcement Learning and Artificial Neural Nets
 
Declarative data analysis
Declarative data analysisDeclarative data analysis
Declarative data analysis
 
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
 
1 chayes
1 chayes1 chayes
1 chayes
 

More from Wei Lu

Refutations on "Debunking the Myths of Influence Maximization: An In-Depth Be...
Refutations on "Debunking the Myths of Influence Maximization: An In-Depth Be...Refutations on "Debunking the Myths of Influence Maximization: An In-Depth Be...
Refutations on "Debunking the Myths of Influence Maximization: An In-Depth Be...
Wei Lu
 
Social Recommendation with Strong and Weak Ties
Social Recommendation with Strong and Weak TiesSocial Recommendation with Strong and Weak Ties
Social Recommendation with Strong and Weak Ties
Wei Lu
 
VLDB 2015 poster: Revenue Maximization in Recommender Systems
VLDB 2015 poster: Revenue Maximization in Recommender SystemsVLDB 2015 poster: Revenue Maximization in Recommender Systems
VLDB 2015 poster: Revenue Maximization in Recommender Systems
Wei Lu
 
Revenue Maximization in Recommeder Systems
Revenue Maximization in Recommeder SystemsRevenue Maximization in Recommeder Systems
Revenue Maximization in Recommeder Systems
Wei Lu
 
Profit Maximization over Social Networks
Profit Maximization over Social NetworksProfit Maximization over Social Networks
Profit Maximization over Social Networks
Wei Lu
 
Time Critical Influence Maximization
Time Critical Influence MaximizationTime Critical Influence Maximization
Time Critical Influence Maximization
Wei Lu
 
Optimal Recommendations under Attraction, Aversion, and Social Influence
Optimal Recommendations under Attraction, Aversion, and Social InfluenceOptimal Recommendations under Attraction, Aversion, and Social Influence
Optimal Recommendations under Attraction, Aversion, and Social Influence
Wei Lu
 

More from Wei Lu (7)

Refutations on "Debunking the Myths of Influence Maximization: An In-Depth Be...
Refutations on "Debunking the Myths of Influence Maximization: An In-Depth Be...Refutations on "Debunking the Myths of Influence Maximization: An In-Depth Be...
Refutations on "Debunking the Myths of Influence Maximization: An In-Depth Be...
 
Social Recommendation with Strong and Weak Ties
Social Recommendation with Strong and Weak TiesSocial Recommendation with Strong and Weak Ties
Social Recommendation with Strong and Weak Ties
 
VLDB 2015 poster: Revenue Maximization in Recommender Systems
VLDB 2015 poster: Revenue Maximization in Recommender SystemsVLDB 2015 poster: Revenue Maximization in Recommender Systems
VLDB 2015 poster: Revenue Maximization in Recommender Systems
 
Revenue Maximization in Recommeder Systems
Revenue Maximization in Recommeder SystemsRevenue Maximization in Recommeder Systems
Revenue Maximization in Recommeder Systems
 
Profit Maximization over Social Networks
Profit Maximization over Social NetworksProfit Maximization over Social Networks
Profit Maximization over Social Networks
 
Time Critical Influence Maximization
Time Critical Influence MaximizationTime Critical Influence Maximization
Time Critical Influence Maximization
 
Optimal Recommendations under Attraction, Aversion, and Social Influence
Optimal Recommendations under Attraction, Aversion, and Social InfluenceOptimal Recommendations under Attraction, Aversion, and Social Influence
Optimal Recommendations under Attraction, Aversion, and Social Influence
 

Recently uploaded

Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
muralinath2
 
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdfwaterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
LengamoLAppostilic
 
molar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptxmolar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptx
Anagha Prasad
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
HongcNguyn6
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
by6843629
 
Basics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different formsBasics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different forms
MaheshaNanjegowda
 
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
AbdullaAlAsif1
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills MN
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
Gokturk Mehmet Dilci
 
aziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobelaziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobel
İsa Badur
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
kejapriya1
 
Bob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdfBob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdf
Texas Alliance of Groundwater Districts
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
David Osipyan
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
yqqaatn0
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
TinyAnderson
 
Thornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdfThornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdf
European Sustainable Phosphorus Platform
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
Sérgio Sacani
 
Immersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths ForwardImmersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths Forward
Leonel Morgado
 
Medical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptxMedical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptx
terusbelajar5
 

Recently uploaded (20)

Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
 
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdfwaterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
 
molar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptxmolar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptx
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
 
Basics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different formsBasics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different forms
 
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
 
aziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobelaziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobel
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
 
Bob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdfBob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdf
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
 
Thornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdfThornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdf
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
 
Immersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths ForwardImmersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths Forward
 
Medical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptxMedical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptx
 

From Competition to Complementarity: Comparative Influence Diffusion and Maximization

  • 1. Comparative Influence Maximization: From Competition to Complementarity Wei Lu (LinkedIn) Wei Chen (Microsoft Research) Laks V.S. Lakshmanan (UBC) NDA’16 Workshop, SIGMOD To appear in VLDB’16, New Delhi, India
  • 2. Social influence • Ubiquitous in life • Fueled by the widespread popularity of online social networks and social media • Computational Social Influence (CSI) – Viral Marketing – Influence Maximization – The applications and extensions to the above
  • 3. Computational Social Influence • Social networks with edge weights (influence probabilities or weights) • Stochastic influence/information propagation models – Single-item vs. Multiple-item models • Diffusion dynamics depend heavily on the relationship of the propagating entities • Pure Competition: Each user adopts at most one item – Competitive Independent Cascade Model (CIC) – K-LT Model – WPCLT Model …
  • 4. Limitations of Pure Competition Models: Example
  • 5. Item Relationships • Propagating items can be of any relationship: – Compete (iPhone vs Nexus) – Complement (iPhone vs Apple Watch, iPhone vs iPhone cases) • Natural and well-studied in economics – Substitute goods and complementary goods • Item relationship may be asymmetric • Item relationship may be to an arbitrary degree (not “pure”)
  • 6. Motivations and Challenges “One model that works for all kinds of item relationships”: Not existent until this work Challenges: • Unified model with great expressive power • Compact and manageable representation • Allows room to develop tractable solutions for natural influence optimization problems • Model validation, data
  • 7. Main Contributions • Comparative Independent Cascade (ComIC): Capturing both competition and complementarity, to any arbitrary degree • Problem: Self Influence Maximization • Problem: Complementary Influence Maximization • Algorithm: Generalized Reverse Reachable Sets • Algorithm: Sandwich Approximation
  • 8. Model Overview • Focusing on two items – Challenges abundant already – Future work: extended to an arbitrary number of items • Edge-level influence/information propagation – Similar to the classic IC model • Node-level Decision-making controlled by Node-Level Automata (NLA) – Global Adoption Probabilities (GAP)
  • 9. Global Adoption Probabilities • Key parameters measuring the degree to which two items compete with or complement each other • q(A|0): probability of adopting A when the user has not yet adopted any other items • q(A|B): probability of adopting A when the user has already adopted B • q(A|0) >= q(A|B): B competes with A • q(A|0) <= q(A|B): B complements A
  • 10. Transition diagram For each item, each node may be of the following status: • Idle (inactive) • Informed (influenced) • Suspended / Adopted / Rejected
  • 11. Diffusion dynamics • Initially,every node is inactive/idle wrt both items • When any node adopts the first item, its outgoing edges are tested for information propagation to neighbors (“info channel”) – Each edge (u,v) becomes open w.p.p(u,v) • If u is A-adopted, and info channel on edge (u,v) is open, then v decides to adopt A based on: – w.p. q(A|0) if v has not adopted B – w.p. q(A|B) if v has adopted B
  • 12. Node tie-breaking • What if there are multiple in-neighbors active in the last time step t-1? • Generate a random permutation of those in- neighbors, and follow that order to test activation • If one such neighbor adopted both items at t-1, following the same order for informing • If a seed is targeted with both items, decide the order randomly (0.5 and 0.5 prob.)
  • 13. Node Reconsideration • Suppose B complements A: q(A|0) <= q(A|B) • User v was informed of A, but did not adopt with probability 1 – q(A|0) • Once v adopts B, since B complements A, user may want to revisit the decision with a reconsideration probability:
  • 14. General Properties of ComIC model • Neither submodularity nor monotonicity holds in an arbitrary instance of the model • Influence maximization may be intractable • Overall strategy: – Identify a parameter subspace such that submodularity is satisfied – Develop efficient approximation algorithm (Generalized RR-set) for submodular cases – “Sandwich Approximation” for non-submodular cases
  • 16. Possible World Definition • An equivalent representation of the model and the propagation dynamics – Propagation in a possible world is deterministic, easy to reason about • Equivalent Possible World model for ComIC – For each edge (u,v), remove w.p. 1-p(u,v) – For each node v, randomly generate α(v,A) and α(v,B) for testing with adoption probabilities. – Adoption happens when α <= adoption prob.
  • 17. Influence Maximization Problems • Self Influence Maximization (SIM): Fix B-seed set, find the best A-seed set of size k to maximize A’s expected influence spread • Complementary Influence Maximization (CIM): Fix A-seed set, find the best B-seed set of size k to maximize the boost B gives to A’s expected influence spread • Both NP-hard under ComIC model
  • 18. Algorithm Design for SIM and CIM • Generalized Reverse-Reachable Set (RR-set): RR-set based algorithms are the state-of-the-art for classical influence maximization with single- item propagation models (IC and LT) • Sandwich Approximation to achieve approximation guarantees in non-submodular cases • Both techniques are generic and applicable to any non-submodular maximization problems
  • 19. Recap: Reverse-Reachable Set • If u can reach v (in a deterministic directed graph), then u is in a RR-set rooted at v [Borgs et al., SODA’14] • Random RR-set: root v is randomly chosen • Two-phase Inf. Max. (TIM) [Tang et al 2014] – Estimate the minimum number of random RR-sets required, for probabilistic approx. guarantees • 1-1/e-ε: smaller ε requires more RR-sets to be generated – Generate random RR-sets using backward BFS – Seed selection (deterministic max-cover problem)
  • 20. Recap: TIM Algorithm • (1-1/e-ε)-approximation with high probability – Same as greedy, modulo probabilistic part • Orders of magnitude faster than Greedy + Monte Carlo simulations • Scalable to billion-edge graphs • Applies to a large family of stochastic propagation models
  • 21. Generalized RR-set and TIM Algorithms • Works for any stochastic propagation models satisfying monotonicity and submodularity – Has (1-1/e-ε)-approximation with high probability • General RR-set (in a deterministic possible world): u belongs to the RR-set rooted at v if the singleton seed set {u} can activate v – Note difference from “reaching” – Random RR-set: root v is sampled uniformly at random from the graph
  • 22. RR-set generation for SIM (RR- SIM) • Problem definition and submodular setting – Fix B-seed set, find A-seed set (size k) – A is complemented by B: q(A|0) <= q(A|B) – B is indifferent to A: q(B|0) = q(B|A) • Phase 1: Forward Labeling: Start from B-seed set, label node status w.r.t. B • Phase 2: Backward BFS (details next)
  • 23. Phase 2: Backward BFS • Randomly choose root v from the graph • Enqueue v into a FIFO queue Q • Until empty, repeatedly dequeue from Q • Let’s say we get a node u from Q • Enqueue u’s in-neighbours (with edge test) if either is true – u is B-adopted and α(A,u) <= q(A|B) – u is not B-adopted and α(A,u) <= q(A|0)
  • 24. RR-Set generation for CIM (RR- CIM) • Given A-seed set, find best complementing B set • Cross-submodularity holds q(B|A) = 1 • Forward Labeling: Start from A-seed set, identify nodes can be A-adopted potentially • Backward BFS: Two passes required
  • 25. Sandwich Approximation • Given any non-submodular set functions, how to leverage submodular maximization (e.g., greedy, local search) to achieve provable approximation guarantees? • Answer: – Derive upper/lower bound submodular functions (“sandwiched”) – Use the best of the three solutions, which gives a data-dependent approximation ratio
  • 26. Sandwich Approximation non-submodular, function we want to maximize lower bound, submodular upper bound, submodular
  • 27. Remarks • Applicable to any non-submodular function maximization • If monotone, run Greedy on the upper bound, lower bound, and the actual function • If non-monotone, run Local Search • Upper/lower bound should be reasonably tight to be meaningful
  • 28. Experiments: Datasets Also have synthetic dataset up to 1 million nodes
  • 29. Learning Global Adoption Probabilities Dataset: Flixster • Signals for adoption: rated a movie • Signals for informed: “Want to See”, “Not Interested”
  • 30. Effects of εin General TIM algorithm: Tradeoff between seed set quality and running time
  • 35. Thank you! See you in VLDB’16!

Editor's Notes

  1. SIM: fix q_B|A = 1 and vary q_B|0 CIM: fix q_B|0 = 0.1 and vary q_B|A