LondonSEO Meetup - Cutting through the noise in SEO with data - Reina Hanada

Cutting through the
noise in SEO with data.

10 million organic visits
£8 billion every month
13+ million customers
6+ million indexed URLs
@reinahanada @Wise

‘I’m so grateful that this
hell will be over
tomorrow. I can
proudly say, I do not
want to do this ever
again.’
- Reina, 2015

Data show us
what human eyes
struggle to see 👀

My goal:
By the end of this, you’ll be able to:
● Run a before & after test 🧪
  Causal Impact + TensorFlow Probability
  E.g. measuring the effect of an algo update
● Remove noise from data 📊
  Prophet with regressors
  E.g. removing noise from growth factors

“It’s statistically
significant.”
🧪

Statistically significant =
very unlikely to have occurred
simply by chance
You can feel confident that it’s real, not
that you just got lucky or unlucky

Before & After test with
Causal Impact.

Causal impact = Reality vs ‘what if’?

Suitable for: any change on a single day
● Algorithm update: Did the Google algo update impact our traffic or position?
● Changing CTA / design: Did updating the CTA improve our conversion?
● On-page optimisation: Did optimisation help improve our position?

January 10, 2022 - Migration to new landing page

First visits and new users are dropping 🧪
MIGRATION

Is this our fault? Do we revert back?

What could have gone wrong?
Conversions down?
Traffic down? CVR down?
Rankings drop?
Search volume drop?
Design change
Tracking issue (secret option)
Seasonality
External factors
Algo update
SERP change
On-page change / error
Competitor activity
Do nothing

Data + exact date → Causal Impact (Python package) → Graph + report

Ingredients = data + the exact date 🍳

Ranking?
Search volume?
Conversion rate?

First visits = YEP, we are losing visits 😭
…and it’s proven to be statistically significant

Real data vs predicted data
Orange shade = 95% credible interval of the prediction

Difference between real vs predicted

Accumulated differences
Overall impact = negative

Report: “it’s statistically significant”
p-value = 0.01 (only a 1% chance of a result this extreme if nothing had changed)

Always check: p-value
p-value = the probability of getting a result at least this
extreme by chance alone. The closer it gets to 0, the less
likely it is that the result is just chance
p-value < 0.05 = it’s statistically significant
p-value ≥ 0.05 = it’s NOT (= could be just chance)
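
To make "by chance" concrete, here is a minimal sketch of a p-value computed by shuffling. It is a plain permutation test, not the Bayesian method Causal Impact uses, and it ignores trend and seasonality. The function name and the visit numbers are invented for illustration.

```python
import random
from statistics import mean

def permutation_p_value(before, after, n_shuffles=2000, seed=0):
    """One-sided p-value: how often does randomly relabelling the days
    produce a drop at least as large as the one we observed?"""
    rng = random.Random(seed)
    observed = mean(after) - mean(before)
    pooled = list(before) + list(after)
    hits = 0
    for _ in range(n_shuffles):
        rng.shuffle(pooled)
        fake_before = pooled[:len(before)]
        fake_after = pooled[len(before):]
        if mean(fake_after) - mean(fake_before) <= observed:
            hits += 1
    # The +1 keeps the estimate away from an impossible p-value of exactly 0.
    return (hits + 1) / (n_shuffles + 1)

# Hypothetical daily visits, invented for illustration only.
before = [102, 98, 105, 99, 101, 97, 103, 100, 104, 96]
after = [91, 88, 94, 90, 87, 93, 89, 92, 86, 95]

p = permutation_p_value(before, after)
print(f"p-value = {p:.4f}")
print("statistically significant" if p < 0.05 else "not statistically significant")
```

If shuffling almost never reproduces the drop, the p-value is small and the drop is unlikely to be luck.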

New users = Drop is not statistically significant 😇

2 weeks lag = Drop is statistically significant 😭

4 weeks lag = Drop is statistically significant 💩

Maybe drop in search trend? 🧪
Google Trends = ‘transfert argent’ (remittance)

New users (4 weeks lag) with search trend
Drop is not statistically significant 😮‍💨
p-value: more than 5%

We are losing new users,
but it might be due to a drop in
search demand 🧪

Next step =
Keep the new design,
but optimise

Judgement =
Use common sense +
a pinch of pessimism

Python package: tfcausalimpact
Language: Python
Author: WillianFuks (CausalImpact by Google + TensorFlow by Google)
Ingredients: Data + exact date of the change
Regressor: Optional (as many as you want)
Used for: Impact of the change on a single day
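
A minimal sketch of how the ingredients fit together, assuming the data is a pandas DataFrame with a date index, the metric in the first column and any regressors (e.g. a Google Trends series) in the columns after it. The helper names are hypothetical. The migration date comes from the slides; the start and end dates are assumptions.

```python
from datetime import date, timedelta

def split_periods(first_day, change_day, last_day):
    """Return (pre_period, post_period) as ISO date strings.
    The pre-period ends the day before the change; the post-period starts on it."""
    day_before = change_day - timedelta(days=1)
    pre_period = [first_day.isoformat(), day_before.isoformat()]
    post_period = [change_day.isoformat(), last_day.isoformat()]
    return pre_period, post_period

def run_causal_impact(data, pre_period, post_period):
    """data: pandas DataFrame with a DatetimeIndex; first column is the
    metric, any further columns are regressors."""
    # Imported lazily so the date helper works without the package installed.
    from causalimpact import CausalImpact  # pip install tfcausalimpact
    ci = CausalImpact(data, pre_period, post_period)
    print(ci.summary())                 # summary table, including the p-value
    print(ci.summary(output="report"))  # written report
    ci.plot()                           # real vs predicted, difference, cumulative
    return ci

# Migration date from the slides; first and last day are assumed.
pre, post = split_periods(date(2021, 10, 1), date(2022, 1, 10), date(2022, 2, 6))
print(pre, post)
```

Calling run_causal_impact(data, pre, post) then gives the graph and report in one go.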

Troubleshooting 🧪🔥
Google spreadsheet
● Dates should be in ascending order, in YYYY-MM-DD
● Numbers should not have thousands separators (1,000 → 1000)
● Make sure there are no empty cells
● Useful formula for Google Trends: =if(len(B2),B2,C1)
Python
● When it fails - start from importing data again
● Check how data looks with data.head() or print(data)
● Check that all data types are floats or integers with data.dtypes
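
The spreadsheet checks above can also be done in code. This is a sketch in plain Python, assuming rows of (date, value) text and a comma as the thousands separator; clean_rows is a hypothetical helper, not part of any package.

```python
from datetime import datetime

def clean_rows(rows):
    """rows: list of (date_text, value_text) pairs as exported from a sheet.
    Returns rows sorted by date, with numbers parsed and empty cells
    filled from the previous row."""
    parsed = []
    for date_text, value_text in rows:
        day = datetime.strptime(date_text.strip(), "%Y-%m-%d").date()
        text = value_text.strip().replace(",", "")  # 1,000 -> 1000
        value = float(text) if text else None
        parsed.append((day, value))
    parsed.sort(key=lambda pair: pair[0])  # ascending dates
    cleaned, previous = [], None
    for day, value in parsed:
        if value is None:
            value = previous  # forward fill, like =if(len(B2),B2,C1)
        if value is None:
            raise ValueError(f"no earlier value to fill {day}")
        cleaned.append((day.isoformat(), value))
        previous = value
    return cleaned

# Made-up rows: out of order, with a comma and an empty cell.
rows = [("2022-01-03", "1,200"), ("2022-01-01", "1,000"), ("2022-01-02", "")]
print(clean_rows(rows))
```

In pandas the same ideas are sort_index() for ordering and ffill() for empty cells.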

Causal impact =
measuring the impact of
1 change on 1 date

Removing external noise
with Prophet.

Prophet = forecasting based on trend 📈
FORECAST
(We won’t use)
TREND

Suitable for: overall trend
● Growth (+ relationship with price changes, etc): Is the traffic growing? Is it due to search demand?
● Growth in blogs: Is there steady growth as we publish more blog articles?

Our US blog is growing 🚀
Are we actually growing???

Or is it because… USD is at an all-time high? 🧪

Data + regressors → Prophet (Python package) → Graphs

Growth trend without the impact of USD FX rate
We are growing! 😍
TREND

Trend with the impact of USD FX rate
…We are stagnating! 💩
TREND

We are growing,
but growth might be coming
from exchange rate 😅

We are growing,
but we should be
growing more 🧪

REMINDER:
Use common sense +
a pinch of pessimism

Python package: Prophet
Language: Python
Author: Facebook
Ingredients: Data (12+ months)
Regressor: Optional (as many as you want)
Used for: Forecast / time series trend
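
A minimal sketch of Prophet with a regressor, assuming a pandas DataFrame with the columns Prophet expects ('ds' for the date, 'y' for the metric) plus one column per regressor. The helper names and the 'usd_fx' column are hypothetical, and the reserved-name set is partial and illustrative.

```python
# Some names Prophet keeps for its own columns (partial list, for illustration).
RESERVED = {"trend", "yhat", "ds", "y"}

def check_regressor_names(names):
    """Fail early if a regressor column clashes with a reserved name."""
    clashes = sorted(set(names) & RESERVED)
    if clashes:
        raise ValueError(f"rename these columns: {clashes}")
    return list(names)

def fit_trend(df, regressors):
    """df: pandas DataFrame with 'ds', 'y' and one column per regressor."""
    # Imported lazily so the name check works without the package installed.
    from prophet import Prophet  # pip install prophet
    model = Prophet()
    for name in check_regressor_names(regressors):
        model.add_regressor(name)
    model.fit(df)
    fitted = model.predict(df)     # in-sample: we want the trend, not the forecast
    model.plot_components(fitted)  # trend, seasonality and regressor panels
    return fitted[["ds", "trend"]]

print(check_regressor_names(["usd_fx"]))
```

Running fit_trend once with no regressors and once with ['usd_fx'] gives the two trend lines to compare.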

Troubleshooting 🧪🔥
Google spreadsheet
● You can have missing data - but preferably not
● Do not name a column ‘trend’ - it’s reserved
● (Everything I said for Causal Impact)
Python
● (Everything I said for Causal Impact)

Prophet =
measuring trend over time

Important concept:
Multicollinearity.

Always check this before
running Causal Impact /
Prophet! 🚨

Multicollinearity = two or more regressors are strongly correlated
Adds noise + confuses the model 😵‍💫

How to spot multicollinearity 🔎
● Common sense: ✅ No coding needed ❌ Inaccurate
● VIF (variance inflation factor): ✅ Accurate ❌ Coding needed
VIF Colab notebook + testing method in the slide

VIF - Remove any variables with VIF > 1.5
(1.5 is VERY conservative; can be 2-5)

Python package: VIF from statsmodels
Language: Python
Author: Open source
Data: Data

How to run VIF in 3 easy steps 💥
1. Google sheet with all the data (variables)
2. Open Colab Notebook and change here
3. Click buttons

Keep on running until you get all < 1.5
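
For anyone curious what the notebook buttons compute, here is a sketch under stated assumptions. With exactly two regressors, both have the same VIF, 1 / (1 - r²), where r is their correlation. The general case uses statsmodels on a pandas DataFrame. The function names and numbers are invented for illustration and may differ from the notebook.

```python
from math import sqrt

def pearson_r(x, y):
    """Correlation between two equally long lists of numbers."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def vif_pair(x, y):
    """With exactly two regressors, both share the same VIF: 1 / (1 - r^2)."""
    r = pearson_r(x, y)
    return 1.0 / (1.0 - r * r)

def vif_table(df):
    """General case: VIF per column of a pandas DataFrame of regressors."""
    # Imported lazily so vif_pair works without statsmodels installed.
    from statsmodels.stats.outliers_influence import variance_inflation_factor
    from statsmodels.tools.tools import add_constant
    exog = add_constant(df)  # VIF assumes the regression has an intercept
    return {
        col: variance_inflation_factor(exog.values, i)
        for i, col in enumerate(exog.columns)
        if col != "const"
    }

# Hypothetical regressors, invented for illustration.
search_trend = [1, 2, 3, 4, 5]
usd_fx = [2, 1, 4, 3, 5]
print(round(vif_pair(search_trend, usd_fx), 2))  # r = 0.8, so 1 / (1 - 0.64)
```

One way to "keep on running": drop the variable with the highest VIF, recompute, and stop once everything is under your threshold.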

My goal:
And now, you should be able to:
✅ Run a before & after test 🧪
  Causal Impact + TensorFlow Probability
  E.g. measuring the effect of an algo update
✅ Remove noise in data 📊
  Prophet with regressors
  E.g. removing noise from growth factors

Cheatsheet
Causal impact
● Changed template
● Changed CTA
● Google Algorithm update
● Fee change of product
● Russian war on Ukraine
Prophet
● Traffic & Google Trends relationship
● Overall trend with seasonality
● Effect of inflation

Limitations + cautions
Causal impact
● Highly dependent on the data points (provide as much data as possible; consider time lag)
● Not good for multiple changes happening at once / changes happening over time
● Very slow to run
● Small numbers (e.g. CVR) - hard to detect
Prophet
● Sensitive to seasonality - should provide multiple seasons
● Automatically tries to ‘fit the model’ - so we cannot specify to ‘prioritise’ one regressor over another
● Weak on outliers / large-impact events

Colab notebook links
Causal impact: https://colab.research.google.com/drive/1MdYf-78Lt1NicCPQ3ax2Orxfgzy_nsUn#scrollTo=ZzAIjGO0OHjd
Prophet: https://colab.research.google.com/drive/1RdDND2I81KeFhIXejCg0h4V3h3zo0t0m#scrollTo=3kyjYeMK2DzK
VIF: https://colab.research.google.com/drive/1b18CT9bEcGqaqPibb_b8BSIPPa7lHZpo#scrollTo=7JU8VAPOj6Z2

Isn’t this too easy to be true?
🧪

‘Sometimes the simplest tools are pure and
effective. You only use a complex technique if
there is no simpler way.
It is the principles of analysis (the logic, the
conclusions) that are the most powerful.’

Data analysis is easy,
coming up with variables
is difficult.

Data analysis is easy,
coming up with a story
is difficult.
