SlideShare a Scribd company logo
X1 X4
X2
X3
𝒴 = {yx=0, yx=1}
p (yx=0) p (yx=1)
p (y|do (x = 0)) p (y|do (x = 1))
ATE := 𝔼 (yx=1) − 𝔼 (yx=0)
ATE := 𝔼 (y|do (x = 1)) − 𝔼 (y|do (x = 0))
Pr (X1, X2, X3, X4) = Pr (X1 |X2, X3) Pr (X2 |X3) Pr (X3) Pr (X4 |X1, X2) =
4
∏
i=1
Pr (Xi | 𝒫Xi)
X1 X4
X2
X3
do (X1 = 1)
X1
X2 X3 X1
X1 X4
X2
X3
G
𝒳 = {X1, . . . , XN}
Xi 𝒫Xi
𝒜
a = do (X = x) ∈ 𝒜
x = {x1, . . . , xn} X = {X1, . . . , Xn}
do () ∈ 𝒜
a = do (X = x) μa = 𝔼[Y|do(X = x)]
μ* := max
a∈𝒜
μa
T
t at = do(Xt = xt) ∈ 𝒜
Xc
t ∼ Pr (Xc
t |do (Xt = xt)) Xc
t = 𝒳 − Xt
Yt ∈ {0,1}
T
̂a*T
∈ 𝒜
RT
RT = μ* − μ ̂a*T
G = (𝒳, E)
𝒳
E
𝒜
T
̂a*T
∈ 𝒜 T
X K
𝒜 = {do (X = k)} : k ∈ {1,...,K}
Y ∈ {0,1} Pr{Y = 1|X}
N X = {X1, . . . , XN}
Y ∈ {0,1}
𝒜 = {do()} ∪ {do(Xi = j) : 1 ≤ i ≤ N, j ∈ {0,1}}
Xi ∼ Bernoulli (qi)
q = (q1, . . . , qN) ∈ [0,1]
N
Pr (Y = 1|X)
do()
Pr (Y|do (Xi = j)) = Pr (Y|Xi = j)
Pr (Xi = j) (Xi = j)
do (Xi = j)
Pr (Xi = j) (Xi = j)
q
qi (1 − qi)
τ ∈ [2...N] Iτ =
{
i : min{qi, 1 − qi} <
1
τ }
τ |Iτ |
m (q) : ℝN
→ ℝ
m (q) = min{τ : |Iτ | ≤ τ}
m (q)
q =
(
1
2
,
1
2
, …
1
2)
I2 =
{
i : min{qi, 1 − qi} <
1
2}
= {}, Iτ:τ>2 = {}
m (q) = min{τ : |Iτ | ≤ τ} = 2
q = (0,0,…,0)
I2 =
{
i : min{qi, 1 − qi} <
1
2}
= {1,…, N}, Iτ:τ>2 = {1,…, N}
m (q) = min{τ : |Iτ | ≤ τ} = N
RT ∈ 𝒪
m (q)
T
log
(
NT
m )
T q
RT ∈ Ω
m (q)
T
2N
RT ∈ Ω
(
N
T )
N > m
Pr{Y|Xi = j} ≠ Pr{Y|do (Xi = j)}
Pr{X1, . . . XN |do (Xi = j)}
Pr{X1, . . . XN |do (Xi = j)} =
∏
k≠i
Pr{Xk | 𝒫Xk
}δ (Xi − j)
Pr {Y|a}
Y = XN, Xk = Xk−1
Pr(X1) = 0
do(Xi = 1) do(X1 = 1)
do()
Pr (Y|do (X2 = j)) =
∑
X1
Pr (X1) Pr (Y|X1, X2 = j)
X1 = X2 Pr(X2 = 1)
(X1 = 0,X2 = 1)
Pr (Y|do (X2 = 1))
∀a ∈ 𝒜 Pr ( 𝒫Y |a) Pr (Y|a)
η ∈ [0,1]
|𝒜|
∑
a∈𝒜
ηa = 1
Q :=
∑
a∈𝒜
ηaPr { 𝒫Y |a}
η
η μa
∀a ∈ 𝒜
a
Ra (X) =
Pr{𝒫Y(X)|a}
Q{𝒫Y(X)}
=
Pr{𝒫Y(X)|a}
∑a∈𝒜
ηaPr { 𝒫Y(X)|a}
𝒫Y(X)
a
̂μa =
1
T
T
∑
t=1
Yt Ra(Xt) 1 {Ra(Xt) ≤ Ba}
Ba ≥ 0
η
m(η)
m(η) = max
a∈𝒜
𝔼a [
Pr{𝒫Y(X)|a}
Q{𝒫Y(X)} ]
B ∈ ℝ|𝒜|
Ba =
m (η) T
log (2T| 𝒜|)
RT ∈ 𝒪
m (η)
T
log (2T| 𝒜|)
Q
Q η
m (η)
η* = arg min
η
m (η) = arg min
η
max
a∈𝒜
𝔼a [
Pr{𝒫Y(X)|a}
Q{𝒫Y(X)} ]
= arg min
η
max
a∈𝒜
𝔼a
[
Pr{𝒫Y(X)|a}
∑b∈𝒜
ηbPr{𝒫Y(X)} ]
m (η) η
η*
Yt ∼ Bernouli
(
1
2
+ ϵ
)
if X1 = 1
Yt ∼ Bernouli
(
1
2
− ϵ′
)
if X1 = 0
𝔼[Y|do(X1 = 1)] =
1
2
+ ϵ
𝔼[Y|do(X1 = 0)] =
1
2
− ϵ′
a ∈ 𝒜{do(X1 = 1), do (X1 = 0)} 𝔼[Y|a] =
1
2
qi = 0 i ≤ m qi =
1
2
Xi ∼ Bernouli (qi)
ϵ′ =
q1ϵ
1 − q1
Xi:i>1
X1
q
ϵ
[Paper Reading] Causal Bandits: Learning Good Interventions via Causal Inference
[Paper Reading] Causal Bandits: Learning Good Interventions via Causal Inference

More Related Content

What's hot

基礎からのベイズ統計学 輪読会資料 第8章 「比率・相関・信頼性」
基礎からのベイズ統計学 輪読会資料  第8章 「比率・相関・信頼性」基礎からのベイズ統計学 輪読会資料  第8章 「比率・相関・信頼性」
基礎からのベイズ統計学 輪読会資料 第8章 「比率・相関・信頼性」
Ken'ichi Matsui
 
Quantum fourier transformation
Quantum fourier transformationQuantum fourier transformation
Quantum fourier transformation
takumitano
 
Comuter graphics bresenhams circle drawing algorithm
Comuter graphics bresenhams circle drawing algorithmComuter graphics bresenhams circle drawing algorithm
Comuter graphics bresenhams circle drawing algorithm
Rachana Marathe
 
Diff formula with audio1
Diff formula with audio1Diff formula with audio1
Diff formula with audio1
Athi Arunachalam
 
Integrales
Integrales Integrales
Integrales
Jordy Reyes
 
13 - 06 Feb - Dynamic Programming
13 - 06 Feb - Dynamic Programming13 - 06 Feb - Dynamic Programming
13 - 06 Feb - Dynamic Programming
Neeldhara Misra
 
capstone magic squares
capstone magic squarescapstone magic squares
capstone magic squaresCara Colotti
 
Notes.on.popularity.versus.similarity.model
Notes.on.popularity.versus.similarity.modelNotes.on.popularity.versus.similarity.model
Notes.on.popularity.versus.similarity.model
sun peiyuan
 
8.2 integration by parts
8.2 integration by parts8.2 integration by parts
8.2 integration by partsdicosmo178
 
Introduction to Neural Networks and Deep Learning from Scratch
Introduction to Neural Networks and Deep Learning from ScratchIntroduction to Neural Networks and Deep Learning from Scratch
Introduction to Neural Networks and Deep Learning from Scratch
Ahmed BESBES
 
脳の計算論 第3章「リズム活動と位相応答」
脳の計算論 第3章「リズム活動と位相応答」脳の計算論 第3章「リズム活動と位相応答」
脳の計算論 第3章「リズム活動と位相応答」
Kohei Ichikawa
 
Interpolation
InterpolationInterpolation
Interpolation
Brijesh Padhiyar
 
Vector calculus
Vector calculusVector calculus
Vector calculus
sujathavvv
 
15 integrals of trig products-i-x
15 integrals of trig products-i-x15 integrals of trig products-i-x
15 integrals of trig products-i-x
math266
 

What's hot (17)

基礎からのベイズ統計学 輪読会資料 第8章 「比率・相関・信頼性」
基礎からのベイズ統計学 輪読会資料  第8章 「比率・相関・信頼性」基礎からのベイズ統計学 輪読会資料  第8章 「比率・相関・信頼性」
基礎からのベイズ統計学 輪読会資料 第8章 「比率・相関・信頼性」
 
Quantum fourier transformation
Quantum fourier transformationQuantum fourier transformation
Quantum fourier transformation
 
Comuter graphics bresenhams circle drawing algorithm
Comuter graphics bresenhams circle drawing algorithmComuter graphics bresenhams circle drawing algorithm
Comuter graphics bresenhams circle drawing algorithm
 
Diff formula with audio1
Diff formula with audio1Diff formula with audio1
Diff formula with audio1
 
Integrales
Integrales Integrales
Integrales
 
13 - 06 Feb - Dynamic Programming
13 - 06 Feb - Dynamic Programming13 - 06 Feb - Dynamic Programming
13 - 06 Feb - Dynamic Programming
 
capstone magic squares
capstone magic squarescapstone magic squares
capstone magic squares
 
Notes.on.popularity.versus.similarity.model
Notes.on.popularity.versus.similarity.modelNotes.on.popularity.versus.similarity.model
Notes.on.popularity.versus.similarity.model
 
8.2 integration by parts
8.2 integration by parts8.2 integration by parts
8.2 integration by parts
 
Introduction to Neural Networks and Deep Learning from Scratch
Introduction to Neural Networks and Deep Learning from ScratchIntroduction to Neural Networks and Deep Learning from Scratch
Introduction to Neural Networks and Deep Learning from Scratch
 
脳の計算論 第3章「リズム活動と位相応答」
脳の計算論 第3章「リズム活動と位相応答」脳の計算論 第3章「リズム活動と位相応答」
脳の計算論 第3章「リズム活動と位相応答」
 
Interpolation
InterpolationInterpolation
Interpolation
 
Vector calculus
Vector calculusVector calculus
Vector calculus
 
Lec39
Lec39Lec39
Lec39
 
15 integrals of trig products-i-x
15 integrals of trig products-i-x15 integrals of trig products-i-x
15 integrals of trig products-i-x
 
Lec38
Lec38Lec38
Lec38
 
Complex numbers polynomial multiplication
Complex numbers polynomial multiplicationComplex numbers polynomial multiplication
Complex numbers polynomial multiplication
 

Similar to [Paper Reading] Causal Bandits: Learning Good Interventions via Causal Inference

HMM, MEMM, CRF メモ
HMM, MEMM, CRF メモHMM, MEMM, CRF メモ
HMM, MEMM, CRF メモ
Takeshi Arabiki
 
統計的学習の基礎 4章 前半
統計的学習の基礎 4章 前半統計的学習の基礎 4章 前半
統計的学習の基礎 4章 前半
Ken'ichi Matsui
 
Signals and Systems Formula Sheet
Signals and Systems Formula SheetSignals and Systems Formula Sheet
Signals and Systems Formula Sheet
Haris Hassan
 
ch9.pdf
ch9.pdfch9.pdf
ch9.pdf
KavS14
 
A neural attention model for sentence summarization
A neural attention model for sentence summarizationA neural attention model for sentence summarization
A neural attention model for sentence summarization
Akihiko Watanabe
 
Query Suggestion @ tokyotextmining#2
Query Suggestion @ tokyotextmining#2Query Suggestion @ tokyotextmining#2
Query Suggestion @ tokyotextmining#2
ybenjo
 
Unit 1 Operation on signals
Unit 1  Operation on signalsUnit 1  Operation on signals
Unit 1 Operation on signals
Dr.SHANTHI K.G
 
University of manchester mathematical formula tables
University of manchester mathematical formula tablesUniversity of manchester mathematical formula tables
University of manchester mathematical formula tablesGaurav Vasani
 
Implementation of parallel randomized algorithm for skew-symmetric matrix game
Implementation of parallel randomized algorithm for skew-symmetric matrix gameImplementation of parallel randomized algorithm for skew-symmetric matrix game
Implementation of parallel randomized algorithm for skew-symmetric matrix game
Ajay Bidyarthy
 
数式処理ソフトMathematicaで数学の問題を解く
数式処理ソフトMathematicaで数学の問題を解く数式処理ソフトMathematicaで数学の問題を解く
数式処理ソフトMathematicaで数学の問題を解く
Yoshihiro Mizoguchi
 
関西NIPS+読み会発表スライド
関西NIPS+読み会発表スライド関西NIPS+読み会発表スライド
関西NIPS+読み会発表スライド
Yuchi Matsuoka
 
Fuzzy calculation
Fuzzy calculationFuzzy calculation
Fuzzy calculation
Amir Rafati
 
Mathematical formula tables
Mathematical formula tablesMathematical formula tables
Mathematical formula tables
Saravana Selvan
 
A new Perron-Frobenius theorem for nonnegative tensors
A new Perron-Frobenius theorem for nonnegative tensorsA new Perron-Frobenius theorem for nonnegative tensors
A new Perron-Frobenius theorem for nonnegative tensors
Francesco Tudisco
 
Comparison Theorems for SDEs
Comparison Theorems for SDEs Comparison Theorems for SDEs
Comparison Theorems for SDEs
Ilya Gikhman
 
Interpolation functions
Interpolation functionsInterpolation functions
Interpolation functions
Tarun Gehlot
 
cps170_bayes_nets.ppt
cps170_bayes_nets.pptcps170_bayes_nets.ppt
cps170_bayes_nets.ppt
FaizAbaas
 

Similar to [Paper Reading] Causal Bandits: Learning Good Interventions via Causal Inference (20)

HMM, MEMM, CRF メモ
HMM, MEMM, CRF メモHMM, MEMM, CRF メモ
HMM, MEMM, CRF メモ
 
統計的学習の基礎 4章 前半
統計的学習の基礎 4章 前半統計的学習の基礎 4章 前半
統計的学習の基礎 4章 前半
 
Signals and Systems Formula Sheet
Signals and Systems Formula SheetSignals and Systems Formula Sheet
Signals and Systems Formula Sheet
 
ch9.pdf
ch9.pdfch9.pdf
ch9.pdf
 
A neural attention model for sentence summarization
A neural attention model for sentence summarizationA neural attention model for sentence summarization
A neural attention model for sentence summarization
 
Query Suggestion @ tokyotextmining#2
Query Suggestion @ tokyotextmining#2Query Suggestion @ tokyotextmining#2
Query Suggestion @ tokyotextmining#2
 
Unit 1 Operation on signals
Unit 1  Operation on signalsUnit 1  Operation on signals
Unit 1 Operation on signals
 
University of manchester mathematical formula tables
University of manchester mathematical formula tablesUniversity of manchester mathematical formula tables
University of manchester mathematical formula tables
 
Implementation of parallel randomized algorithm for skew-symmetric matrix game
Implementation of parallel randomized algorithm for skew-symmetric matrix gameImplementation of parallel randomized algorithm for skew-symmetric matrix game
Implementation of parallel randomized algorithm for skew-symmetric matrix game
 
Hog processing
Hog processingHog processing
Hog processing
 
数式処理ソフトMathematicaで数学の問題を解く
数式処理ソフトMathematicaで数学の問題を解く数式処理ソフトMathematicaで数学の問題を解く
数式処理ソフトMathematicaで数学の問題を解く
 
関西NIPS+読み会発表スライド
関西NIPS+読み会発表スライド関西NIPS+読み会発表スライド
関西NIPS+読み会発表スライド
 
Fuzzy calculation
Fuzzy calculationFuzzy calculation
Fuzzy calculation
 
Kisi2+met num
Kisi2+met numKisi2+met num
Kisi2+met num
 
Mathematical formula tables
Mathematical formula tablesMathematical formula tables
Mathematical formula tables
 
A new Perron-Frobenius theorem for nonnegative tensors
A new Perron-Frobenius theorem for nonnegative tensorsA new Perron-Frobenius theorem for nonnegative tensors
A new Perron-Frobenius theorem for nonnegative tensors
 
Comparison Theorems for SDEs
Comparison Theorems for SDEs Comparison Theorems for SDEs
Comparison Theorems for SDEs
 
Basic m4-2-chapter1
Basic m4-2-chapter1Basic m4-2-chapter1
Basic m4-2-chapter1
 
Interpolation functions
Interpolation functionsInterpolation functions
Interpolation functions
 
cps170_bayes_nets.ppt
cps170_bayes_nets.pptcps170_bayes_nets.ppt
cps170_bayes_nets.ppt
 

More from Daiki Tanaka

[Paper Reading] Theoretical Analysis of Self-Training with Deep Networks on U...
[Paper Reading] Theoretical Analysis of Self-Training with Deep Networks on U...[Paper Reading] Theoretical Analysis of Self-Training with Deep Networks on U...
[Paper Reading] Theoretical Analysis of Self-Training with Deep Networks on U...
Daiki Tanaka
 
カーネル法:正定値カーネルの理論
カーネル法:正定値カーネルの理論カーネル法:正定値カーネルの理論
カーネル法:正定値カーネルの理論
Daiki Tanaka
 
[Paper reading] L-SHAPLEY AND C-SHAPLEY: EFFICIENT MODEL INTERPRETATION FOR S...
[Paper reading] L-SHAPLEY AND C-SHAPLEY: EFFICIENT MODEL INTERPRETATION FOR S...[Paper reading] L-SHAPLEY AND C-SHAPLEY: EFFICIENT MODEL INTERPRETATION FOR S...
[Paper reading] L-SHAPLEY AND C-SHAPLEY: EFFICIENT MODEL INTERPRETATION FOR S...
Daiki Tanaka
 
Selective inference
Selective inferenceSelective inference
Selective inference
Daiki Tanaka
 
Anomaly Detection with VAEGAN and Attention [JSAI2019 report]
Anomaly Detection with VAEGAN and Attention [JSAI2019 report]Anomaly Detection with VAEGAN and Attention [JSAI2019 report]
Anomaly Detection with VAEGAN and Attention [JSAI2019 report]
Daiki Tanaka
 
オンライン学習 : Online learning
オンライン学習 : Online learningオンライン学習 : Online learning
オンライン学習 : Online learning
Daiki Tanaka
 
[Paper Reading] Attention is All You Need
[Paper Reading] Attention is All You Need[Paper Reading] Attention is All You Need
[Paper Reading] Attention is All You Need
Daiki Tanaka
 
Local Outlier Detection with Interpretation
Local Outlier Detection with InterpretationLocal Outlier Detection with Interpretation
Local Outlier Detection with Interpretation
Daiki Tanaka
 
Interpretability of machine learning
Interpretability of machine learningInterpretability of machine learning
Interpretability of machine learning
Daiki Tanaka
 
The Million Domain Challenge: Broadcast Email Prioritization by Cross-domain ...
The Million Domain Challenge: Broadcast Email Prioritization by Cross-domain ...The Million Domain Challenge: Broadcast Email Prioritization by Cross-domain ...
The Million Domain Challenge: Broadcast Email Prioritization by Cross-domain ...
Daiki Tanaka
 
The Limits of Popularity-Based Recommendations, and the Role of Social Ties
The Limits of Popularity-Based Recommendations, and the Role of Social TiesThe Limits of Popularity-Based Recommendations, and the Role of Social Ties
The Limits of Popularity-Based Recommendations, and the Role of Social Ties
Daiki Tanaka
 
Learning Deep Representation from Big and Heterogeneous Data for Traffic Acci...
Learning Deep Representation from Big and Heterogeneous Data for Traffic Acci...Learning Deep Representation from Big and Heterogeneous Data for Traffic Acci...
Learning Deep Representation from Big and Heterogeneous Data for Traffic Acci...
Daiki Tanaka
 
Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data
Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series DataToeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data
Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data
Daiki Tanaka
 

More from Daiki Tanaka (13)

[Paper Reading] Theoretical Analysis of Self-Training with Deep Networks on U...
[Paper Reading] Theoretical Analysis of Self-Training with Deep Networks on U...[Paper Reading] Theoretical Analysis of Self-Training with Deep Networks on U...
[Paper Reading] Theoretical Analysis of Self-Training with Deep Networks on U...
 
カーネル法:正定値カーネルの理論
カーネル法:正定値カーネルの理論カーネル法:正定値カーネルの理論
カーネル法:正定値カーネルの理論
 
[Paper reading] L-SHAPLEY AND C-SHAPLEY: EFFICIENT MODEL INTERPRETATION FOR S...
[Paper reading] L-SHAPLEY AND C-SHAPLEY: EFFICIENT MODEL INTERPRETATION FOR S...[Paper reading] L-SHAPLEY AND C-SHAPLEY: EFFICIENT MODEL INTERPRETATION FOR S...
[Paper reading] L-SHAPLEY AND C-SHAPLEY: EFFICIENT MODEL INTERPRETATION FOR S...
 
Selective inference
Selective inferenceSelective inference
Selective inference
 
Anomaly Detection with VAEGAN and Attention [JSAI2019 report]
Anomaly Detection with VAEGAN and Attention [JSAI2019 report]Anomaly Detection with VAEGAN and Attention [JSAI2019 report]
Anomaly Detection with VAEGAN and Attention [JSAI2019 report]
 
オンライン学習 : Online learning
オンライン学習 : Online learningオンライン学習 : Online learning
オンライン学習 : Online learning
 
[Paper Reading] Attention is All You Need
[Paper Reading] Attention is All You Need[Paper Reading] Attention is All You Need
[Paper Reading] Attention is All You Need
 
Local Outlier Detection with Interpretation
Local Outlier Detection with InterpretationLocal Outlier Detection with Interpretation
Local Outlier Detection with Interpretation
 
Interpretability of machine learning
Interpretability of machine learningInterpretability of machine learning
Interpretability of machine learning
 
The Million Domain Challenge: Broadcast Email Prioritization by Cross-domain ...
The Million Domain Challenge: Broadcast Email Prioritization by Cross-domain ...The Million Domain Challenge: Broadcast Email Prioritization by Cross-domain ...
The Million Domain Challenge: Broadcast Email Prioritization by Cross-domain ...
 
The Limits of Popularity-Based Recommendations, and the Role of Social Ties
The Limits of Popularity-Based Recommendations, and the Role of Social TiesThe Limits of Popularity-Based Recommendations, and the Role of Social Ties
The Limits of Popularity-Based Recommendations, and the Role of Social Ties
 
Learning Deep Representation from Big and Heterogeneous Data for Traffic Acci...
Learning Deep Representation from Big and Heterogeneous Data for Traffic Acci...Learning Deep Representation from Big and Heterogeneous Data for Traffic Acci...
Learning Deep Representation from Big and Heterogeneous Data for Traffic Acci...
 
Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data
Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series DataToeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data
Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data
 

Recently uploaded

Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
JoytuBarua2
 
Investor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptxInvestor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptx
AmarGB2
 
Recycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part IIIRecycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part III
Aditya Rajan Patra
 
space technology lecture notes on satellite
space technology lecture notes on satellitespace technology lecture notes on satellite
space technology lecture notes on satellite
ongomchris
 
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdfHybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
fxintegritypublishin
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
zwunae
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
gerogepatton
 
Railway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdfRailway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdf
TeeVichai
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
Kamal Acharya
 
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
ydteq
 
road safety engineering r s e unit 3.pdf
road safety engineering  r s e unit 3.pdfroad safety engineering  r s e unit 3.pdf
road safety engineering r s e unit 3.pdf
VENKATESHvenky89705
 
English lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdfEnglish lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdf
BrazilAccount1
 
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
MdTanvirMahtab2
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
Kamal Acharya
 
DfMAy 2024 - key insights and contributions
DfMAy 2024 - key insights and contributionsDfMAy 2024 - key insights and contributions
DfMAy 2024 - key insights and contributions
gestioneergodomus
 
Forklift Classes Overview by Intella Parts
Forklift Classes Overview by Intella PartsForklift Classes Overview by Intella Parts
Forklift Classes Overview by Intella Parts
Intella Parts
 
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
obonagu
 
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdfWater Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation & Control
 
ML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptxML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptx
Vijay Dialani, PhD
 
CW RADAR, FMCW RADAR, FMCW ALTIMETER, AND THEIR PARAMETERS
CW RADAR, FMCW RADAR, FMCW ALTIMETER, AND THEIR PARAMETERSCW RADAR, FMCW RADAR, FMCW ALTIMETER, AND THEIR PARAMETERS
CW RADAR, FMCW RADAR, FMCW ALTIMETER, AND THEIR PARAMETERS
veerababupersonal22
 

Recently uploaded (20)

Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
 
Investor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptxInvestor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptx
 
Recycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part IIIRecycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part III
 
space technology lecture notes on satellite
space technology lecture notes on satellitespace technology lecture notes on satellite
space technology lecture notes on satellite
 
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdfHybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
 
Railway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdfRailway Signalling Principles Edition 3.pdf
Railway Signalling Principles Edition 3.pdf
 
Cosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdfCosmetic shop management system project report.pdf
Cosmetic shop management system project report.pdf
 
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
一比一原版(UofT毕业证)多伦多大学毕业证成绩单如何办理
 
road safety engineering r s e unit 3.pdf
road safety engineering  r s e unit 3.pdfroad safety engineering  r s e unit 3.pdf
road safety engineering r s e unit 3.pdf
 
English lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdfEnglish lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdf
 
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
 
DfMAy 2024 - key insights and contributions
DfMAy 2024 - key insights and contributionsDfMAy 2024 - key insights and contributions
DfMAy 2024 - key insights and contributions
 
Forklift Classes Overview by Intella Parts
Forklift Classes Overview by Intella PartsForklift Classes Overview by Intella Parts
Forklift Classes Overview by Intella Parts
 
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
在线办理(ANU毕业证书)澳洲国立大学毕业证录取通知书一模一样
 
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdfWater Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdf
 
ML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptxML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptx
 
CW RADAR, FMCW RADAR, FMCW ALTIMETER, AND THEIR PARAMETERS
CW RADAR, FMCW RADAR, FMCW ALTIMETER, AND THEIR PARAMETERSCW RADAR, FMCW RADAR, FMCW ALTIMETER, AND THEIR PARAMETERS
CW RADAR, FMCW RADAR, FMCW ALTIMETER, AND THEIR PARAMETERS
 

[Paper Reading] Causal Bandits: Learning Good Interventions via Causal Inference

  • 1.
  • 2.
  • 3.
  • 5.
  • 6.
  • 7. 𝒴 = {yx=0, yx=1} p (yx=0) p (yx=1) p (y|do (x = 0)) p (y|do (x = 1)) ATE := 𝔼 (yx=1) − 𝔼 (yx=0) ATE := 𝔼 (y|do (x = 1)) − 𝔼 (y|do (x = 0))
  • 8. Pr (X1, X2, X3, X4) = Pr (X1 |X2, X3) Pr (X2 |X3) Pr (X3) Pr (X4 |X1, X2) = 4 ∏ i=1 Pr (Xi | 𝒫Xi) X1 X4 X2 X3 do (X1 = 1) X1 X2 X3 X1 X1 X4 X2 X3
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16. G 𝒳 = {X1, . . . , XN} Xi 𝒫Xi 𝒜 a = do (X = x) ∈ 𝒜 x = {x1, . . . , xn} X = {X1, . . . , Xn} do () ∈ 𝒜 a = do (X = x) μa = 𝔼[Y|do(X = x)] μ* := max a∈𝒜 μa
  • 17. T t at = do(Xt = xt) ∈ 𝒜 Xc t ∼ Pr (Xc t |do (Xt = xt)) Xc t = 𝒳 − Xt Yt ∈ {0,1} T ̂a*T ∈ 𝒜 RT RT = μ* − μ ̂a*T
  • 18. G = (𝒳, E) 𝒳 E 𝒜 T ̂a*T ∈ 𝒜 T
  • 19. X K 𝒜 = {do (X = k)} : k ∈ {1,...,K} Y ∈ {0,1} Pr{Y = 1|X}
  • 20.
  • 21. N X = {X1, . . . , XN} Y ∈ {0,1} 𝒜 = {do()} ∪ {do(Xi = j) : 1 ≤ i ≤ N, j ∈ {0,1}} Xi ∼ Bernoulli (qi) q = (q1, . . . , qN) ∈ [0,1] N Pr (Y = 1|X)
  • 22. do() Pr (Y|do (Xi = j)) = Pr (Y|Xi = j) Pr (Xi = j) (Xi = j) do (Xi = j) Pr (Xi = j) (Xi = j)
  • 23. q qi (1 − qi) τ ∈ [2...N] Iτ = { i : min{qi, 1 − qi} < 1 τ } τ |Iτ | m (q) : ℝN → ℝ m (q) = min{τ : |Iτ | ≤ τ}
  • 24. m (q) q = ( 1 2 , 1 2 , … 1 2) I2 = { i : min{qi, 1 − qi} < 1 2} = {}, Iτ:τ>2 = {} m (q) = min{τ : |Iτ | ≤ τ} = 2 q = (0,0,…,0) I2 = { i : min{qi, 1 − qi} < 1 2} = {1,…, N}, Iτ:τ>2 = {1,…, N} m (q) = min{τ : |Iτ | ≤ τ} = N
  • 25.
  • 26. RT ∈ 𝒪 m (q) T log ( NT m ) T q RT ∈ Ω m (q) T 2N RT ∈ Ω ( N T ) N > m
  • 27. Pr{Y|Xi = j} ≠ Pr{Y|do (Xi = j)} Pr{X1, . . . XN |do (Xi = j)} Pr{X1, . . . XN |do (Xi = j)} = ∏ k≠i Pr{Xk | 𝒫Xk }δ (Xi − j) Pr {Y|a}
  • 28. Y = XN, Xk = Xk−1 Pr(X1) = 0 do(Xi = 1) do(X1 = 1) do() Pr (Y|do (X2 = j)) = ∑ X1 Pr (X1) Pr (Y|X1, X2 = j) X1 = X2 Pr(X2 = 1) (X1 = 0,X2 = 1) Pr (Y|do (X2 = 1))
  • 29. ∀a ∈ 𝒜 Pr ( 𝒫Y |a) Pr (Y|a) η ∈ [0,1] |𝒜| ∑ a∈𝒜 ηa = 1 Q := ∑ a∈𝒜 ηaPr { 𝒫Y |a} η
  • 30. η μa ∀a ∈ 𝒜 a Ra (X) = Pr{𝒫Y(X)|a} Q{𝒫Y(X)} = Pr{𝒫Y(X)|a} ∑a∈𝒜 ηaPr { 𝒫Y(X)|a} 𝒫Y(X) a ̂μa = 1 T T ∑ t=1 Yt Ra(Xt) 1 {Ra(Xt) ≤ Ba} Ba ≥ 0
  • 31. η
  • 32. m(η) m(η) = max a∈𝒜 𝔼a [ Pr{𝒫Y(X)|a} Q{𝒫Y(X)} ] B ∈ ℝ|𝒜| Ba = m (η) T log (2T| 𝒜|) RT ∈ 𝒪 m (η) T log (2T| 𝒜|)
  • 33. Q Q η m (η) η* = arg min η m (η) = arg min η max a∈𝒜 𝔼a [ Pr{𝒫Y(X)|a} Q{𝒫Y(X)} ] = arg min η max a∈𝒜 𝔼a [ Pr{𝒫Y(X)|a} ∑b∈𝒜 ηbPr{𝒫Y(X)} ] m (η) η η*
  • 34.
  • 35. Yt ∼ Bernouli ( 1 2 + ϵ ) if X1 = 1 Yt ∼ Bernouli ( 1 2 − ϵ′ ) if X1 = 0 𝔼[Y|do(X1 = 1)] = 1 2 + ϵ 𝔼[Y|do(X1 = 0)] = 1 2 − ϵ′ a ∈ 𝒜{do(X1 = 1), do (X1 = 0)} 𝔼[Y|a] = 1 2 qi = 0 i ≤ m qi = 1 2 Xi ∼ Bernouli (qi) ϵ′ = q1ϵ 1 − q1 Xi:i>1 X1
  • 36. q
  • 37. ϵ