Product Experimentation Pitfalls and
How to Avoid Them
JON NOROHNA
Director, Product Management
OPTIMIZELY
D A N S I R O K E R
C O - F O U N D E R & E X E C U T I V E C H A I R M A N
Building a Culture of
Product Experimentation
P E T E K O O M E N
C O - F O U N D E R & C H I E F T E C H N O L O G Y O F F I C E R
J O N N O R O N H A
D I R E C T O R , P R O D U C T M A N A G E M E N T
Building a Culture of Experimentation
Avoiding the Pitfalls
T W I T T E R : @ T H A T S J O N S E N S E E M A I L : J O N @ O P T I M I Z E L Y . C O M
LAUNCH
BUILD
DESIGN PRAY
EXPERIMENT
ITERATE
LAUNCH
BUILDDESIGN PRAY
EXPERIMENT
ITERATE
LAUNCHBUILDDESIGN
PRAY
EXPERIMENT
ITERATE
LAUNCHBUILDDESIGN PRAY
EXPERIMENT
ITERATE
LAUNCHBUILDDESIGN PRAY
EXPERIMENT
ITERATE
DESIGN BUILD EXPERIMENT ITERATE
LAUNCH
PRAY
Product Experimentation
Think hypothetically

• Painted doors
• Validating MVPs
• Qualitative research
• Exploratory A/B testing
Mitigate development risk
• Frequent, small releases
• Feature flagging
• Staged rollouts
• Safe rollbacks
Quantify product impact
• Feature validation
• Multivariate tests
• Iterative adjustments
• Measuring business impact
Experimentation
Old Reality Culture of Experimentation
Top-Down Innovation
Embrace Success
Make Decisions
Follow Orders
Bottom-Up Innovation
Embrace Failure
Validate Decisions
Follow Data
A/B
TESTING
CULTURE OF
EXPERIMENTATION
CULTURE OF
EXPERIMENTATION
CULTURE OF
EXPERIMENTATION
0
10
100
1000
10000
VELOCITY
M AT U R I T Y
EXPERIMENTATION
HERO
0
10
100
1000
10000
VELOCITY
M AT U R I T Y
EXPERIMENTATION
PROGRAM
EXPERIMENTATION
HERO
0
10
100
1000
10000
VELOCITY
M AT U R I T Y
EXPERIMENTATION
PROGRAM
CULTURE OF
EXPERIMENTATION
EXPERIMENTATION
HERO
0
10
100
1000
10000
VELOCITY
M AT U R I T Y
EXPERIMENTATION
HERO
EXPERIMENTATION
PROGRAM
CULTURE OF
EXPERIMENTATION
The World’s Largest Digital Laboratory
1,000,000+ Experiments and Counting
0
10
100
1000
10000
VELOCITY
M AT U R I T Y
EXPERIMENTATION
HERO
EXPERIMENTATION
PROGRAM
CULTURE OF
EXPERIMENTATION
Experimentation Pitfalls
#1
Optimizing the wrong metrics
Bookings per Session Repeat Bookings per Visitor
Queries per Unique User
clicks that trigger a new query
clicks that don’t count as a new search
“We’re the only website on earth that
tries to get rid of our users as quickly as
possible”
Queries per Unique User
Queries per Session
Sessions per User
Tips for choosing the right metrics
Put yourself in the user’s shoes
Ask: what if this went up and nothing else?
Constantly re-evaluate
Trust your gut!
Experimentation Pitfalls
#2
Thinking too small
Theproblemwith“A/Btesting”
Theproblemwith“A/Btesting”
2 Variations
A B
3 Variations
C
4 Variations
D
5 Variations
E
>6 Variations
FA B
Three-fourthsofallexperimentsonlyhave2variations
2 Variations
3 Variations
4 Variations
5 Variations
>6 Variations
77%
14%
5%
2%
1%
Testing5ormorevariationscanimproveyourwinrateby75%
+71%
+75%
+48%
+32%
25%
33%
37%
44%
43%
+75%
2 Variations
3 Variations
4 Variations
5 Variations
>6 Variations
Significant uplift
Significant reduction
Inconclusive
“Every day, we run a thousand concurrent experiments
to quickly validate new ideas. These experiments run
across all our products, from mobile apps and tools
used by hoteliers to customer service phone lines and
internal systems. Experimentation has become so
ingrained in Booking.com culture that every change,
from entire redesigns and infrastructure changes to
bug fixes, is wrapped in an experiment.”
Experimental Treatment
Server-Side Experimentation
Optimizely Full Stack
Client-Side Experimentation
Optimizely Web
Client
Snippet Experiment
Dashboard
Default Treatment
Server
Experimental Treatment
Experimental Treatment
Client
Server SDK Experiment
Dashboard
User User
Experimentation Pitfalls
#3
Hoarding insights
HYPOTHESIS
CREATIVE
DEVELOPMENT
SETUP & QA
TESTING
ANALYSIS
SHARE
Remove all
contact
buttons for
EMEA
+1
Pop-Up
Promoting
the live
stream
Add NYC +
Atlanta
User Groups
to Log-in
10/17
Navigation
Experiment
(oct release)
AB test
EXO
handbook LP
Test new
hero
nav on
Blog
Scroll PopupBlog Posts
? Personalize
Promo cards
based on
behavior
August
Product
Release
Butterbar
DONE?
Test making
"try it free"
button green
Add PhoneTrackingfor
Leads
Test old
homepage vs.new homepage
Atlanta
User Groups
to Log-in
10/17
gation
eriment
(oct release)
st
dbook LP
Test new
hero
nav on
Blog
Scroll PopupBlog Posts
? Personalize
Promo cards
based on
behavior
Test making
"try it free"
button green
Add PhoneTrackingfor
Leads
Experimentation Hero
10’s experiments / year
Experimentation Program
100’s experiments / year
Culture of Experimentation
1000’s experiments / year
Scaling the Experimentation Program
0
10
100
1000
10000
VELOCITY
M AT U R I T Y
EXPERIMENTATION
HERO
EXPERIMENTATION
PROGRAM
CULTURE OF
EXPERIMENTATION
Product Experimentation
Think hypothetically

• Painted doors
• Validating MVPs
• Qualitative research
• Exploratory A/B testing
Mitigate development risk
• Frequent, small releases
• Feature flagging
• Staged rollouts
• Safe rollbacks
Quantify product impact
• Feature validation
• Multivariate tests
• Iterative adjustments
• Measuring business impact
Experimentation
J O N N O R O N H A
D I R E C T O R , P R O D U C T M A N A G E M E N T
Thank you!
Questions?
T W I T T E R : @ T H A T S J O N S E N S E E M A I L : J O N @ O P T I M I Z E L Y . C O M
THANK YOU
#Modernize2018
Algorithms
Optimize your search results,
recommendations, promotions, or any other
algorithms to deliver the most relevant content
to your customers
Redesigns
Test entirely new site or app designs before
you launch them, without any performance
impact to your customers
Feature Rollouts
Test features as you roll them out, to validate
your hypothesis and monitor your key business
metrics
Price Testing
Experiment with different prices, plans, or fees
personalized to your customer segments to
drive revenue
Backend Performance
Don’t just test the customer experience: test
APIs, microservices, clusters, and architecture
designs to improve performance & reliability
Omnichannel
Test experiences on mobile devices, over-the-
top apps, IoT, conversational interfaces, and
any other experience that runs in code, and
test across devices.
Server-side Experimentation Use Cases
A
B

Modernize 2018: Product Experimentation Pitfalls and How to Avoid Them, Optimizely