7. Group
A
Group
B
Split
them
randomly
trafficofusers
Variant for A
Variant for B
Expose to one of
two variants of the service
e.g., the current production version
e.g., an evaluated update
Calculate a key
measure for each user
X(uA1)
…
Calculate the OEC for each
group as the mean value
e.g., X(u) is the number
of sessions of the user u
X(uA2)
X(uA3)
X(uA4)
X(uA5)
X(uB1)
…
X(uB2)
X(uB3)
X(uB4)
X(uB5)
µA(X)=avgu in AX(u)
µB(X)=avgu in BX(u)
Overall Evaluation Criterion
(OEC) for the group B
Overall Evaluation Criterion
(OEC) for the group A
8. µA(X)=avgu in AX(u)
µB(X)=avgu in BX(u)
Overall Evaluation Criterion
(OEC) for the group B
Overall Evaluation Criterion
(OEC) for the group A
Calculate the OEC for each
group as the mean value
Δ(x) VS 0
Δ(X) = µB(X) – µA(X)
the evaluated update is
positive or negative
Statistical
significance test
the difference is caused by
a noise or
the treatment effect
(e.g., Student’s t-test)
Overall Evaluation
Criterion (OEC)
[Kohavi et al., DMKD’2009]
Overall Acceptance
Criterion (OAC)
[Drutsa et al., CIKM’2015]
Sensitivity
Directionality
9. OEC levels
1. User level metrics
2. Non-user level metric
For example ratio OEC:
20. Our paper, WSDM 2018
21
Consistent Transformation of Ratio Metrics for Efficient Online
Controlled Experiments
Roman Budylin, Alexey Drutsa, Ilya Katsev, Valeriya Tsoy
Data Quality metrics • OEC metrics • Guard rail metrics • Local feature/Diagnosticmetrics
21. Our paper, WSDM 2018
22
Consistent Transformation of Ratio Metrics for Efficient Online
Controlled Experiments
Roman Budylin, Alexey Drutsa, Ilya Katsev, Valeriya Tsoy
The best comment: «This is too good to be true!» (someone from
Facebook)
Data Quality metrics • OEC metrics • Guard rail metrics • Local feature/Diagnosticmetrics
22. We found a transformation such that
Our contribution
23. We found a transformation such that
Ratio OEC User level OEC
Our contribution
24. We found a transformation such that
Ratio OEC User level OEC
NB: Preserve directionality and significance level!
Our contribution
25. Let we have a ratio OEC:
Consider the next expression:
And let us use its average
as a metric:
Now we got a linearization OEC:
Linearization
26. Let A and B be the control and the experiment. Let us denote
Theorem 1: Let be positive. Then for any
the next is true:
Theorems: directionality
27. Theorem 2: Let be positive and . Let
be the t-statistic applied for the OEC and let be the be the asymt.
standard normal statistic of obtained via the Delta method.
If then then under the null hypothesis that
1. the t-statistics is asymptotically normal
2. converges to 1 by probability
Theorems: significance level