Nikolay Novozhilov gave a presentation on common problems with A/B testing and statistics. He discussed how peeking at the data before a test has finished can invalidate results, and showed through a Monte Carlo simulation that variants can appear to "win" purely by chance. Multivariate testing and multiple comparisons were also cited as issues. Novozhilov recommended starting with a clear hypothesis, replicating tests, and considering sample size, significance, effect size, and power to obtain more reliable A/B test results.
2. A/B TESTING AND THE MOST COMMON PROBLEMS: A LOOK AT STATISTICS AND THE WAYS TO GET IT WRONG
Nikolay Novozhilov - Product director of data
platforms @ Wego.com.
Nikolay is building big data capabilities at Wego.com, the leading travel metasearch engine in Asia Pacific and the Middle East. He has 7+ years of experience in data analytics, working for IT startups and previously in consulting. Nikolay received an MBA from INSEAD in Singapore and before that lived and worked in Moscow.
3. A/B testing and problems with statistics
Web Analytics Wednesday, Singapore
Nikolay Novozhilov, Wego.com
www.novozhilov.co
7. Lies, damned lies, and statistics
All different! All based on assumptions!!!
Tool              | Test used
Optimizely        | Two-tailed sequential likelihood ratio test with false discovery rate controls
Google Analytics  | Bayes estimate with uniform Beta prior
VWO               | Intersection of confidence intervals for the binomial distribution
Leanplum          | Confidence intervals at p = 5%; underlying statistic unknown
Usereffect        | Chi-square statistic
Commerce Sciences | Welch's t-test
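The table's point is that different tools can reach different conclusions on the same data. A minimal sketch contrasting two of the listed approaches, a chi-square test (Usereffect) and a Bayes estimate with a uniform Beta prior (Google Analytics); all conversion counts below are made up for illustration:

```python
import random
from statistics import NormalDist

def chi2_pvalue(conv_a, n_a, conv_b, n_b):
    """Two-sided p-value for a 2x2 chi-square test (1 df), computed via
    the equivalent two-proportion z statistic (chi-square = z**2)."""
    p = (conv_a + conv_b) / (n_a + n_b)               # pooled conversion rate
    se = (p * (1 - p) * (1 / n_a + 1 / n_b)) ** 0.5   # std. error of the difference
    z = (conv_a / n_a - conv_b / n_b) / se
    return 2 * (1 - NormalDist().cdf(abs(z)))

def prob_b_beats_a(conv_a, n_a, conv_b, n_b, draws=20_000, seed=0):
    """P(CR_B > CR_A) under independent Beta(1, 1) (uniform) priors,
    estimated by sampling from the two posterior distributions."""
    rng = random.Random(seed)
    wins = sum(
        rng.betavariate(conv_b + 1, n_b - conv_b + 1)
        > rng.betavariate(conv_a + 1, n_a - conv_a + 1)
        for _ in range(draws)
    )
    return wins / draws

# Hypothetical data: 20/1000 conversions in A, 35/1000 in B.
print(chi2_pvalue(20, 1000, 35, 1000))       # frequentist answer
print(prob_b_beats_a(20, 1000, 35, 1000))    # Bayesian answer
```

The two numbers answer different questions ("how surprising is this if there's no difference?" vs. "how likely is B better than A?"), which is one reason tools disagree.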
8. What is a p-value and why is it 5%?
All tests are based on assumptions!
Assumption #1: you don't peek at the data before the test ends
9. What happens if you look?
I played Monte Carlo in Excel. And here is the result:
• 5% significance level (p-value threshold)
• 1,000 "users" in each sample
• CR of 2%
• A "wins" over A 29% of the time!
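The Excel workbook isn't shown, but the peeking effect it demonstrates can be sketched in Python: run repeated A/A tests (same 2% conversion rate in both arms, 1,000 users per arm) and check for "significance" after every new pair of users. The parameters below, including the minimum sample before the first peek, are assumptions for illustration; the exact rate will differ from the slide's 29%, but it should land well above the nominal 5%.

```python
import random

def aa_test_with_peeking(n_users=1000, cr=0.02, z_crit=1.96, min_n=100,
                         n_runs=300, seed=0):
    """Fraction of A/A tests (identical arms) that look 'significant'
    at least once when a z-test is run after every new pair of users."""
    rng = random.Random(seed)
    false_positives = 0
    for _ in range(n_runs):
        conv_a = conv_b = 0
        for n in range(1, n_users + 1):
            conv_a += rng.random() < cr
            conv_b += rng.random() < cr
            if n < min_n:                      # don't peek on tiny samples
                continue
            p = (conv_a + conv_b) / (2 * n)    # pooled conversion rate
            se = (2 * p * (1 - p) / n) ** 0.5  # std. error of the difference
            if se > 0 and abs(conv_a - conv_b) / n > z_crit * se:
                false_positives += 1           # declared a 'winner' by chance
                break
    return false_positives / n_runs

print(aa_test_with_peeking())  # far above the nominal 0.05
```

Each individual peek has a 5% false-positive rate, but with hundreds of correlated peeks the chance of *ever* crossing the threshold compounds, which is the trap the slide is illustrating.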
10. What do you do about it?
Don’t look! (just kidding)
Google "O'Brien & Fleming interim analysis" (no, still kidding)
Keep calm, more stuff coming!
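The summary mentions sample size, effect size, and power among the recommendations. A standard normal-approximation formula (not from the slides) for the sample needed per arm in a two-proportion test can be sketched as:

```python
from math import ceil
from statistics import NormalDist

def sample_size_per_arm(p_base, p_target, alpha=0.05, power=0.8):
    """Users needed per arm to detect a move from p_base to p_target
    with a two-sided two-proportion z-test (normal approximation)."""
    nd = NormalDist()
    z_alpha = nd.inv_cdf(1 - alpha / 2)   # e.g. 1.96 for alpha = 0.05
    z_beta = nd.inv_cdf(power)            # e.g. 0.84 for 80% power
    variance = p_base * (1 - p_base) + p_target * (1 - p_target)
    return ceil((z_alpha + z_beta) ** 2 * variance / (p_target - p_base) ** 2)

# Detecting a lift from 2.0% to 2.5% CR takes on the order of
# ten thousand users per arm -- far more than the 1,000 in the A/A demo.
print(sample_size_per_arm(0.02, 0.025))
```

This makes the slide's point concrete: small effects on small conversion rates need much larger samples than people expect, and underpowered tests are exactly where chance "winners" thrive.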