Blake Commagere @commagere
A/B Testing
Technically Split or Bucket Testing, but
nobody calls it that.
Who the hell am I?
It’s a fair question
● Started seven companies
  o Five of those have been bought
● Raised over 12M Angel / VC
● I build viral things
  o Plaxo
  o Causes
  o Vampires
● I’m obsessed with human behavior
My Fucking Talks
Analogies Rule
Dave McClure:
● Makes sailors blush with his swearing
● Occasionally uses hyperbole in his talks
● Is funny
Blake Commagere:
● Makes Dave McClure blush with his swearing
● The most egregious abuse of hyperbole
● Thinks he’s funny
This talk is for you.
Interrupt at any point.
Don’t be shy.
If it sucks and you never say anything, I’m blaming you.
What I’m covering
● What the hell is A/B testing?
● Why should you care?
● How should you A/B test?
● How can A/B testing fuck you over?
● A few ancillary benefits of A/B testing
What I’m NOT covering
AKA “Shit you should Google if you don’t already know”
● Familiarize yourself with concepts like
  o Confidence Intervals
  o Comparative Error
  o Statistical Significance
  o Sample Size requirements
    ▪ FYI, there are online calculators for all these things
● How to fucking do math
  o Yes, there’s a lot of math in A/B Testing
  o Chances are, the tools you use will do the math for you
    ▪ If not, your code better do the math for you
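If your tools don't do the math, here is a rough sketch of what "your code doing the math" might look like: a two-proportion z-test plus a confidence interval on the difference in conversion rates, using only the Python standard library. The function name, its parameters, and the counts in the usage note are made up for illustration; they are not from the talk or from any particular tool.

```python
import math
from statistics import NormalDist


def two_proportion_z_test(conv_a, n_a, conv_b, n_b, confidence=0.95):
    """Compare conversion rates of a control (A) and a variant (B).

    conv_* are conversion counts, n_* are users exposed to each arm.
    Assumes both arms have at least one conversion and one non-conversion.
    """
    p_a = conv_a / n_a
    p_b = conv_b / n_b

    # Pooled rate under the null hypothesis that A and B convert equally.
    pooled = (conv_a + conv_b) / (n_a + n_b)
    se_pooled = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se_pooled

    # Two-sided p-value from the standard normal distribution.
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))

    # Unpooled standard error for the interval on the difference itself.
    se_diff = math.sqrt(p_a * (1 - p_a) / n_a + p_b * (1 - p_b) / n_b)
    z_crit = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    diff = p_b - p_a
    ci = (diff - z_crit * se_diff, diff + z_crit * se_diff)

    return {
        "z": z,
        "p_value": p_value,
        "ci": ci,
        "significant": p_value < 1 - confidence,
    }
```

With hypothetical counts such as `two_proportion_z_test(100, 1000, 150, 1000)`, the result reports whether the difference clears the chosen confidence level and how wide the plausible range of the lift is. A normal approximation like this is only reasonable when both arms have plenty of users and conversions.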
What the hell is A/B testing?
What the hell is A/B Testing?
Will it hurt?
● Randomized Experiment
  o Control & Variant
  o Tests against a specific goal
  o Typically you want to establish Statistical Significance with a Confidence Level of 95%
● Bucket or Split Testing
  o Technically, this is what you want most of the time
  o Multiple variants
  o People usually say A/B when they mean Bucket/Split
  o Just go with it
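One common way to do the "randomized" part with multiple variants is to hash the user ID together with the experiment name, so each user lands in the same bucket every time without storing anything. This is a minimal sketch of that idea, not how any specific tool does it; `assign_bucket`, the experiment name, and the variant labels are hypothetical.

```python
import hashlib


def assign_bucket(user_id, experiment,
                  variants=("control", "variant_a", "variant_b")):
    """Deterministically map a user to one variant of one experiment."""
    # Including the experiment name keeps bucket assignments independent
    # across experiments for the same user.
    key = f"{experiment}:{user_id}".encode("utf-8")
    digest = hashlib.sha256(key).digest()

    # Treat the first 8 bytes as an integer; modulo picks the bucket.
    index = int.from_bytes(digest[:8], "big") % len(variants)
    return variants[index]
```

Calling `assign_bucket(42, "make_vs_create")` repeatedly returns the same variant for user 42, and across many users the buckets come out close to evenly sized.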
Why should you care?
Do I really need A/B Testing?
Technically, you may not need it
● Do you have users?
● If not, do you plan on having users?
● If not… congrats! You don’t need A/B Testing because you don’t have a business!
● For everyone else, you need A/B Testing.
You are not your user
Your users are much, much stupider
You:
● Busy as hell
● Dedicated
● Very knowledgeable
● Eager to learn
● Not Stupid
Your User:
● Lazy
● Impatient
● Stupid
● Stupid
● Real. Stupid.
The Result?
You’re building the product for you
● You are not your user
● Your instincts are probably wrong
● Your wants/needs are probably different
● You view the market differently
Still not convinced?
● “Create” and “Make” are synonyms
● “Make an Ad” vs “Create an Ad”
● Software prefers “Create” because of REST and CRUD
● “Make” outperformed “Create” by up to 20% for non-artists
● There is a mental barrier to “Create” - it sounds more involved/complex. Making is easier.
Create vs Make - one word makes all the difference
How should you A/B test?
Blockers on your A/B testing
If this is happening, all your tests will suck
● Shit is broken
  o FIX IT NOW. WHY ARE YOU FUCKING READING THIS?
● Shit is slow
  o PageSpeed on frontend
  o Profilers on backend / mobile
● Official definitions of slow*
  o A webpage that takes > ~2 seconds & shows no progress indicator
  o An app that isn’t immediately responsive
*according to me
Ensuring tests maximize impact
AKA the most obvious advice ever
1) ABT. Always Be Testing
2) Win as frequently as possible
3) In areas that matter
● Any moron can get #1 right
● Most morons can’t do #2 or #3
● Essentially, you need to know what to test and how to test.
Only a few features matter
That’s it
● Users are either engaged or not
● Data on your userbase tells you which 2-3 features matter
● Optimize these features
● Make onboarding focus on these features
● Facebook example:
  o Friends, Photos, Status Updates
  o New User Experience focused on these
  o Network leverage to help new users
Similarly, only a few flows matter
● Is a flow critical?
  o Does it get a user engaged with a critical feature?
  o Does it help another user engage with a critical feature?
● Is every step necessary?
  o Almost always, the answer is no
● Where are the leaks?
  o Almost always, a leak is the result of bad messaging
  o Find the worst leak, fix it, then move to the next one
That’s also it
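"Find the worst leak" can be made concrete by computing the drop-off between consecutive steps of a flow and picking the largest. The sketch below assumes you already have a count of users reaching each step; the function name, step names, and numbers are hypothetical.

```python
def worst_leak(funnel):
    """Return (from_step, to_step, drop_rate) for the biggest drop-off.

    funnel is an ordered list of (step_name, users_reaching_step).
    """
    leaks = []
    # Pair each step with the one that follows it.
    for (prev_name, prev_count), (name, count) in zip(funnel, funnel[1:]):
        # Share of users lost between the two steps.
        drop_rate = 1 - count / prev_count if prev_count else 0.0
        leaks.append((prev_name, name, drop_rate))
    return max(leaks, key=lambda leak: leak[2])


# Hypothetical onboarding flow with made-up counts.
onboarding = [
    ("landing", 1000),
    ("signup_form", 600),
    ("email_confirmed", 540),
    ("first_action", 200),
]
print(worst_leak(onboarding))
```

Here the biggest loss is between `email_confirmed` and `first_action`, so that is the step whose messaging you would test first, then rerun the numbers and move on to the next leak.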
Your messaging is never perfect
AKA test your fucking messaging
● In most cases it is better to:
  o Use fewer words - your users are lazy
  o Be colloquial - your users are dumb
  o Target the user - your users are selfish
● You can always target better
  o Some data is implicit in your existing data
  o Your users only care about what they get
● Language is always evolving
  o Language in every channel is evolving as well
  o Channels change & language in a channel changes!
Tools of the trade
● Web:
  o Google Analytics (Content Experiments)
  o Optimizely
  o Mixpanel
● Mobile:
  o Mixpanel
  o Optimizely
  o Swrve
Which tool should I use?
● The best one for your company:
  o It depends
  o On. so. many. things.
● You’ll probably end up doing some coding too.
● The worst one for your company?
  o Not having one
  o Creating your own from scratch
How A/B Testing can fuck you over
Pitfalls
Some very common mistakes
● Work out the sample size needed for statistical significance in advance
  o You want a 95% confidence level
  o With 20,000 users, this can be as few as 377 people. NOT a 50/50 split of your users
● Do not take convenience samples
  o Running a test for 2-4 weeks is ideal
● Test in 7-, 14-, or 28-day intervals
  o Human behavior depends on the day of the week
  o Your results will skew if you run 8-day tests
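The 377-out-of-20,000 figure matches the standard sample-size formula for estimating a proportion with a finite population correction, if you assume a 5% margin of error and the worst-case proportion of 0.5. The slide states neither of those, so treat them as assumptions in this sketch. The second helper rounds a planned test length up to whole weeks; both function names are hypothetical.

```python
import math
from statistics import NormalDist


def sample_size(population, confidence=0.95, margin_of_error=0.05,
                proportion=0.5):
    """Sample size for estimating a proportion in a finite population.

    margin_of_error=0.05 and proportion=0.5 are assumed defaults,
    not values taken from the slides.
    """
    # z-score for a two-sided interval at the requested confidence level.
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    # Sample size for an effectively infinite population.
    n0 = z ** 2 * proportion * (1 - proportion) / margin_of_error ** 2
    # Finite population correction shrinks it for smaller user bases.
    n = n0 / (1 + (n0 - 1) / population)
    return math.ceil(n)


def round_up_to_weeks(days):
    """Round a test duration up to a whole number of weeks."""
    return math.ceil(days / 7) * 7


print(sample_size(20_000))    # 377
print(round_up_to_weeks(8))   # 14
```

This formula sizes a sample for measuring one rate, which is what the online calculators mentioned earlier typically do. Detecting a small difference between two variants generally needs a power calculation and a larger sample.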
The Lies We Tell Ourselves
Sometimes we like lying because we’re lazy
● You can make the numbers lie
● Avoiding a channel that you hate
  o Email works
  o SEO works
  o Twitter works
  o Facebook works
  o If a channel doesn’t work for you, you’re probably doing it wrong
Farmville
How to optimize for suck
[Images: Farmville on launch vs. Farmville after ~2 yrs]
Ancillary Benefits
Designer vs Engineer
Your users are much, much stupider
Designer:
● Better not suck at design
● Better have design experience
● Better respect design
● Will not always agree with engineers on design
● Not Stupid
Engineer:
● Probably sucks at design
● May not have design experience
● Should respect design
● Will not always agree with designers on design
● Not Stupid
Using A/B Testing for Team Bonding
● The loudest voice usually gets their way
● This can stifle good ideas
● Sometimes it’s ok to test a bad idea
● Becomes a teachable moment
● You could be wrong (GASP!)
Want more swearing and/or advice?
blake.commagere@gmail.com
@commagere

A/B Testing That Matters

  • 1. Blake Commagere @commagere A/B Testing. Technically Split or Bucket Testing, but nobody calls it that.
  • 2. Who the hell am I? It’s a fair question ● Started seven companies ○ Five of those have been bought ● Raised over 12M Angel / VC ● I build viral things ○ Plaxo ○ Causes ○ Vampires ● I’m obsessed with human behavior
  • 3. My Fucking Talks. Analogies Rule. Dave McClure: ● Makes sailors blush with his swearing ● Occasionally uses hyperbole in his talks ● Is funny. Blake Commagere: ● Makes Dave McClure blush with his swearing ● The most egregious abuse of hyperbole ● Thinks he’s funny
  • 4. This talk is for you. Interrupt at any point. Don’t be shy. If it sucks and you never say anything, I’m blaming you.
  • 5. What I’m covering ● What the hell is A/B testing? ● Why should you care? ● How should you A/B test? ● How can A/B testing fuck you over? ● A few ancillary benefits of A/B testing
  • 6. What I’m NOT covering, AKA “Shit you should Google if you don’t already know” ● Familiarize yourself with concepts like ○ Confidence Intervals ○ Comparative Error ○ Statistical Significance ○ Sample Size requirements ▪ FYI, there are online calculators for all these things ● How to fucking do math ○ Yes there’s a lot of math in A/B Testing ○ Chances are, the tools you use will do the math for you ▪ If not, your code better do the math for you
  • 7. What the hell is A/B testing?
  • 8. What the hell is A/B Testing? Will it hurt? ● Randomized Experiment ○ Control & Variant ○ Tests against a specific goal ○ Typically you want to establish Statistical Significance with a Confidence Level of 95% ● Bucket or Split Testing ○ Technically, this is what you want most of the time ○ Multiple variants ○ People usually say A/B when they mean Bucket/Split ○ Just go with it
  • 9. Why should you care?
  • 10. Do I really need A/B Testing? Technically, you may not need it ● Do you have users? ● If not, do you plan on having users? ● If not… congrats! You don’t need A/B Testing because you don’t have a business! ● For everyone else, you need A/B Testing.
  • 11. You are not your user. Your users are much, much stupider. You: ● Busy as hell ● Dedicated ● Very knowledgeable ● Eager to learn ● Not Stupid. Your User: ● Lazy ● Impatient ● Stupid ● Stupid ● Real. Stupid.
  • 12. The Result? You’re building the product for you ● You are not your user ● Your instincts are probably wrong ● Your wants/needs are probably different ● You view the market differently
  • 13. Still not convinced? Create vs Make - one word makes all the difference ● Create and Make are synonyms ● “Make an Ad” vs “Create an Ad” ● Software prefers ‘Create’ because of REST, CRUD ● ‘Make’ outperformed ‘Create’ by up to 20% for non-artists ● There is a mental barrier to Create - it sounds more involved/complex. Making is easier.
  • 14. How should you A/B Test?
  • 15. Blockers on your A/B testing. If this is happening, all your tests will suck ● Shit is broken ○ FIX IT NOW WHY ARE YOU FUCKING READING THIS ● Shit is slow ○ PageSpeed on frontend ○ Profilers on backend / mobile ● Official definitions of slow* ○ A webpage that takes more than ~2 seconds to load and shows no progress indicator ○ An app that isn’t immediately responsive *according to me
  • 16. Ensuring tests maximize impact, AKA the most obvious advice ever 1) ABT. Always Be Testing 2) Win as frequently as possible 3) In areas that matter ● Any moron can get #1 right ● Most morons can’t do #2 or #3 ● Essentially, you need to know what to test and how to test.
  • 17. Only a few features matter. That’s it ● Users are either engaged or not ● Data on your userbase tells you which 2-3 features matter ● Optimize these features ● Make Onboarding focus on these features ● Facebook example: ○ Friends, Photos, Status Updates ○ New User Experience focused on these ○ Network leverage to help new users
  • 18. Similarly, only a few flows matter. That’s also it ● Is a flow critical? ○ Does it get a user engaged with a critical feature? ○ Does it help another user engage with a critical feature? ● Is every step necessary? ○ Almost always, the answer is no ● Where are the leaks? ○ Almost always, this is the result of bad messaging ○ Find the worst leak, fix it, then move to the next one
  • 19. Your messaging is never perfect, AKA test your fucking messaging ● In most cases it is better to: ○ Use fewer words - your users are lazy ○ Be colloquial - your users are dumb ○ Target the user - your users are selfish ● You can always target better ○ Some data is implicit in your existing data ○ Your users only care about what they get ● Language is always evolving ○ Language in every channel is evolving as well ○ Channels change & language in a channel changes!
  • 20. Tools of the trade ● Web: ○ Google Analytics (Content Experiments) ○ Optimizely ○ Mixpanel ● Mobile: ○ Mixpanel ○ Optimizely ○ Swrve
  • 21. Which tool should I use? ● The best one for your company: ○ It depends ○ On. so. many. things. ● You’ll probably end up doing some coding too. ● The worst one for your company? ○ Not having one ○ Creating your own from scratch
  • 22. How A/B Testing can fuck you over
  • 23. Pitfalls. Some very common mistakes ● Work out the sample size needed for statistical significance in advance ○ You want a 95% confidence level ○ With 20,000 users, this can be 377 people. NOT 50/50 ● Do not take convenience samples ○ 2-4 weeks is ideal ● Test in 7, 14, 28 day intervals ○ Human behavior is day dependent, tied to the week ○ Your results will skew if you do 8 day tests
  • 24. The Lies We Tell Ourselves. Sometimes we like lying because we’re lazy ● You can make the numbers lie ● Avoiding a channel that you hate ○ Email works ○ SEO works ○ Twitter works ○ Facebook works ○ If a channel doesn’t work for you, you’re probably doing it wrong
  • 25. Farmville. How to optimize for suck. Farmville on Launch. Farmville after ~2 yrs
  • 27. Designer vs Engineer. Your users are much, much stupider. Designer: ● Better not suck at design ● Better have design exp ● Better respect design ● Will not always agree with engineers on design ● Not Stupid. Engineer: ● Probably sucks at design ● May not have design exp ● Should respect design ● Will not always agree with designers on design ● Not Stupid
  • 28. Using A/B Testing for Team Bonding ● The loudest voice usually gets their way ● This can stifle good ideas ● Sometimes it’s ok to test a bad idea ● Becomes a teachable moment ● You could be wrong (GASP!)
  • 29. Want more swearing and/or advice? blake.commagere@gmail.com @commagere
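
Slide 23 quotes 377 people for a 20,000-user base at a 95% confidence level but does not say how that number was reached. A rough sketch of one calculation that lands on the same figure: Cochran's sample-size formula with a finite population correction. The 5% margin of error and the worst-case proportion of 0.5 are assumptions, not something the deck states, and `required_sample_size` is a made-up helper name.

```python
import math


def required_sample_size(population, margin_of_error=0.05, z=1.96, p=0.5):
    """Estimate how many users to sample from a finite user base.

    z=1.96 corresponds to a 95% confidence level.
    margin_of_error=0.05 and p=0.5 are assumed defaults (p=0.5 is the
    most conservative choice), not values taken from the slides.
    """
    # Cochran's formula for an effectively infinite population.
    n0 = (z ** 2) * p * (1 - p) / margin_of_error ** 2
    # Finite population correction shrinks the requirement for small user bases.
    n = n0 / (1 + (n0 - 1) / population)
    return math.ceil(n)


print(required_sample_size(20_000))  # → 377
```

Under these assumptions the requirement barely grows with the user base (it tops out at 385), which fits the slide's point that you do not need to split your users 50/50.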

Editor's Notes

  1. Ok, obviously I’m a huge fan of Dave. I just like giving him attitude and poking fun at him.
  2. What is NOT covered is important to stress - you can look this shit up. If you don’t know this stuff already, don’t worry, I’m not going to waste everyone’s time making them do formulas. But someone on the team better be familiar with these concepts so that you A/B test correctly.
  3. Your messaging will be largely rendered irrelevant if it takes 7 seconds to load
  4. Let’s let the customer decide
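
The notes above say someone on the team had better know the statistics, and that your code should do the math if your tools do not. As a minimal sketch of what "doing the math" can look like, here is a pooled two-proportion z-test using only the standard library. The function name and the conversion counts are hypothetical, chosen to mimic a 20% relative lift; they are not data from the talk, and a real tool may use a different test.

```python
import math


def two_proportion_z_test(conv_a, n_a, conv_b, n_b):
    """Return (z, two-sided p-value) for control A vs variant B.

    conv_* are conversion counts, n_* are users per bucket.
    """
    p_a = conv_a / n_a
    p_b = conv_b / n_b
    # Pooled conversion rate under the null hypothesis of no difference.
    pooled = (conv_a + conv_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    if se == 0:
        # Both buckets converted at 0% or 100%; nothing to distinguish.
        return 0.0, 1.0
    z = (p_b - p_a) / se
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Hypothetical counts: 10% vs 12% conversion with 2,000 users per bucket.
z, p_value = two_proportion_z_test(200, 2000, 240, 2000)
print(f"z = {z:.2f}, p = {p_value:.3f}, significant at 95%: {p_value < 0.05}")
```

With these made-up counts the difference just clears the 95% threshold. The same 20% relative lift on a few hundred users per bucket would not, which is why the sample size has to be worked out before the test starts.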