SlideShare a Scribd company logo
1 of 95
@patrickstox @ahrefs #pubcon
How Search Works
Presented by:
Patrick Stox
@patrickstox @ahrefs #pubcon
Product Advisor, Technical SEO, &
Brand Ambassador at
• I write for Ahrefs blog but have written for many industry
publications in the past
• I speak at some conferences like SMX, Pubcon, UnGagged, DMO
Advanced, TechSEO Boost, BrightonSEO
• Organizer for the Raleigh SEO Meetup (most successful in US) and
the Beer & SEO Meetup
• We also run a conference, the Raleigh SEO Conference
• Founder Technical SEO Slack Group
• Moderator /r/TechSEO on Reddit
• Helped define the role of Search Marketing Strategist for the US
Department of Labor
• Lead author for the SEO Chapter of the 2021 Web Almanac, reviewer
for the 2022 Chapter
• Technical Review Editor for The Art of SEO 4th Edition
Who is Patrick Stox?
@patrickstox @ahrefs #pubcon
Disclaimer
This is my understanding of systems and is based on a lot of public statements
from Google and my own knowledge.
Warning: It’s not going to be 100% complete or accurate.
@patrickstox @ahrefs #pubcon
How Many Domains Exist?
Q3 2022 according to Verisign: 349.9 million registered
January 2023 according to Netcraft: 270.9 million unique domains responded
Ahrefs 213.1 million (after removing spam domains)
@patrickstox @ahrefs #pubcon
How Many Pages?
Google in 2016: 130T known
@patrickstox @ahrefs #pubcon
How Big Is The Index?
Google: hundreds of billions of pages indexed
100 PB in size
Ahrefs: ~380B pages
@patrickstox @ahrefs #pubcon
A Fraction Of The Web Is Useful Content
Rough math:
(400B / 130T) * 100 = 0.3%
@patrickstox @ahrefs #pubcon
https://twitter.com/
lilyraynyc/status/150
9176261884747781
@patrickstox @ahrefs #pubcon
Spam
Google 2021: “every day, we discover 40 billion spammy pages”
That’s 14.6T spam pages a year.
@patrickstox @ahrefs #pubcon
Googlebot
Googlebot is a lot of systems (1000+) and there are multiple Googlebots.
• Googlebot Image
• Googlebot News
• Googlebot Video
• Googlebot Desktop
• Googlebot Mobile
• +Ads and more
https://developers.google.com/search/docs/crawling-indexing/overview-
google-crawlers
@patrickstox @ahrefs #pubcon
Googlebot Is A Protocol Buffer
It stores structured data.
Similar to JSON, but smaller and faster.
@patrickstox @ahrefs #pubcon
Googlebot Rendering Pipeline (Simplified)
@patrickstox @ahrefs #pubcon
URL Sources
• Links on pages, or anything that even looks like a link
• Sitemaps
• Request indexing in GSC
• Indexing API (limited use cases)
• RSS Feeds
• WebSub (formerly PubSubHubbub)
@patrickstox @ahrefs #pubcon
Crawler Queue / Scheduler
Determines what URLs to crawl and when.
2 main purposes:
• Discovery
• Refresh
@patrickstox @ahrefs #pubcon
What SEOs Call Crawl Budget, Google Calls
Crawl demand
How much Google wants to crawl your site.
Crawl rate limit
How much crawling your website can support.
@patrickstox @ahrefs #pubcon
What Counts Against Your Crawl Budget?
All URLs and requests including:
• Pages/files
• Alternate URLs like AMP or m-dot pages, hreflang
• CSS
• JavaScript, including XHR requests
• Embedded content
***All Googlebots share the same crawl budget, including the ones for Ads,
images, etc.
@patrickstox @ahrefs #pubcon
Crawl Demand Factors
• PageRank
• How often pages change (freshness/staleness)
• When it was last crawled
• Any major changes
@patrickstox @ahrefs #pubcon
Crawl Rate Factors
• Stability / crawl health
• Slow responses
• Errors. 5xx (server errors) or 429 (too many requests) HTTP status codes.
They don’t want to crash the sites and the crawlers will generally back down if
they start seeing issues.
@patrickstox @ahrefs #pubcon
Crawl Rate Settings GSC
@patrickstox @ahrefs #pubcon
Crawling
The little spider is named Crawley.
@patrickstox @ahrefs #pubcon
Crawling
Mostly from Mountain View, CA, USA.
Every request needs to respect robots.txt.
15MB max HTML size.
@patrickstox @ahrefs #pubcon
Google Doesn’t Navigate Like Users
Sends requests for the files individually, doesn’t navigate between pages like a
user.
@patrickstox @ahrefs #pubcon
Caching Files
They Crawl more than HTML:
• Pages and other file types
• JavaScript
• CSS
@patrickstox @ahrefs #pubcon
Caching Files
Files are stored for use in rendering.
Google will ignore your cache timings and fetch a new copy when they want to.
JS
HTML HTML
HTML JS CSS CSS CSS
Cache
Cache
@patrickstox @ahrefs #pubcon
Processing – We’ll Cover This Shortly
@patrickstox @ahrefs #pubcon
Web Rendering Service (WRS)
Needed to process JavaScript
Evergreen (up-to-date) Googlebot
Headless (no Graphical User Interface)
@patrickstox @ahrefs #pubcon
Web Rendering Service (WRS)
• Stateless (storage and cookies cleared between loads)
• Denies Permissions
• Flattens light DOM and shadow DOM
• Date / Time functions adjusted
• Service workers rejected
• Animations may differ
• Random may not be random
@patrickstox @ahrefs #pubcon
Myth: 5 Second Limit
I think this started with a test from Max Prin on the time when the testing tools
took a screenshot. They need to have reasonable time limits for testing tools.
https://maxprin.com/tests/js-timer/
@patrickstox @ahrefs #pubcon
No 5 Second Limit
They’ll try to wait for pages to finish, something like networkidle0 (no more
activity).
Eventually cuts off in case something gets stuck or someone is trying to mine
bitcoin.
@patrickstox @ahrefs #pubcon
It Doesn’t Even Make Sense
They’re basically loading a page with everything cached already.
WRS
JS
HTML HTML
HTML
JS CSS CSS CSS
Cache
Cache
@patrickstox @ahrefs #pubcon
This System Causes Other Issues
Impossible states – previous file versions used when rendering.
File versioning /fingerprinting should help.
XHR requests are done in real time.
@patrickstox @ahrefs #pubcon
Myth: Weeks To Render
All pages go through the renderer.
The average wait time is 5 seconds according to Google’s Martin Splitt.
The 90th percentile is only minutes, not weeks.
Probably comes from pages not being prioritized
for crawling.
@patrickstox @ahrefs #pubcon
Rendering At Web Scale
The 8th wonder of the world.
@patrickstox @ahrefs #pubcon
They Use Some Hacks
“In Google search we don’t really care about the pixels because we don’t
really want to show it to someone. We want to process the information
and the semantic information so we need something in the intermediate
state. We don’t have to actually paint the pixels.” – Martin Splitt
@patrickstox @ahrefs #pubcon
What That Looks Like
Gray = downloads
Blue = HTML
Yellow = JavaScript
Purple = Layout
Green = Painting
@patrickstox @ahrefs #pubcon
They Won’t Render Noindexed Pages
<meta name="robots" content="noindex">
<meta name="robots" content="none">
None = noindex, nofollow
@patrickstox @ahrefs #pubcon
They’re Not Taking Actions
They don’t scroll.
They generally don’t click.
@patrickstox @ahrefs #pubcon
Mobile Desktop
@patrickstox @ahrefs #pubcon
They Don’t Click
Load content into the Document Object Model (DOM) by default. They won’t
see the content if it requires a click that makes an XHR request to pull it in.
DOM Tree and CSS Object Model (CSSOM) form the Render Tree. That’s what
gets indexed.
@patrickstox @ahrefs #pubcon
DOM Tree (pictured)
CSSOM (not pictured) would add info
like font size, weight, color, etc. to
each element.
Render Tree
@patrickstox @ahrefs #pubcon
Collapser
• Error handling
• Retries
• Soft 404s
@patrickstox @ahrefs #pubcon
Processing – Now We’ll Talk About It
@patrickstox @ahrefs #pubcon
Processing - Duplicates
Duplicate detection - content hashes or checksum
They’ll remove boilerplate content (nav, footer) for the checksum.
@patrickstox @ahrefs #pubcon
Near Duplicates
@patrickstox @ahrefs #pubcon
Processing – Duplicate Elimination
Canonicalization
@patrickstox @ahrefs #pubcon
~20 Canonicalization Signals
• Duplicates
• Redirects (high weight)
• Canonical link elements - multiple will be ignored
• Sitemap URLs
• Links (Internal/External, PageRank)
• Alternates – mobile, AMP, print, Hreflang
• HTTPS pages over HTTP
• Shorter URLs over longer URLs
• Where content was first published / seen
• Site level signals like a history of scraped content
• Pages over PDFs
Machine learning system
@patrickstox @ahrefs #pubcon
301 = Permanent, 302 = Temporary
Holds true for other perm and temp redirects
@patrickstox @ahrefs #pubcon
Warning! Speculation
@patrickstox @ahrefs #pubcon
Processing – Link Parser
Good:
<a> tag with an href attribute.
<a href=”/page”>simple is good</a>
<a href=”/page” onclick=”goTo(‘page’)”>still okay</a>
@patrickstox @ahrefs #pubcon
Processing – Link Parser
Bad (but may be parsed):
<a routerLink="products/category">no href</a>
<a onclick=”goTo(‘page’)”>no href</a>
<a href=”javascript:goTo(‘page’)”>kind of nested</a>
<a href=”javascript:void(0)”>missing link</a>
<span onclick=”goTo(‘page’)”>not the right HTML element or href</span>
<span href=“page">not the right HTML element</span>
<option value="page">not the right HTML element</option>
<a href=”#”>no link</a>
Button, ng-click, there are many more ways this can be done incorrectly.
@patrickstox @ahrefs #pubcon
Processing – Link Parser
• Link location, where it goes
• Anchor text
• Surrounding text
• …
@patrickstox @ahrefs #pubcon
Link Tagging
• Penguin
• Location on page (footer, main content)
• Disavow
• …
@patrickstox @ahrefs #pubcon
Processing – Content Parser
• Content – tokenized, vectorized. Words become numbers.
• Content language
• Content location
• Extract meta tags
• Extract Schema
• HTML Lexer – normalize the HTML
• Topic analysis. Content on other topics may be weighted less in ranking.
• Semantic analysis. Linguistic, knowledge graph, address extraction
• …
@patrickstox @ahrefs #pubcon
Content Tagging
• YMYL
• Adult / safe search
• Mobile-friendly
• …
@patrickstox @ahrefs #pubcon
Signal Collectors
• PageRank
• Spam
• Page Experience
• Freshness
• …
@patrickstox @ahrefs #pubcon
A Lot More In Processing Like
Drop anything after # in URLs.
(some exceptions to this)
Most Restrictive Directives
index + noindex + index = noindex
They’ll drop low quality content
@patrickstox @ahrefs #pubcon
Other Files May Be Processed Differently
Adobe Portable Document Format (.pdf)
•Adobe PostScript (.ps)
•Google Earth (.kml, .kmz)
•GPS eXchange Format (.gpx)
•Hancom Hanword (.hwp)
•HTML (.htm, .html, other file extensions)
•Lotus
•Microsoft Excel (.xls, .xlsx)
•Microsoft PowerPoint (.ppt, .pptx)
•Microsoft Word (.doc, .docx)
•OpenOffice presentation (.odp)
•OpenOffice spreadsheet (.ods)
•OpenOffice text (.odt)
•Rich Text Format (.rtf)
•Scalable Vector Graphics (.svg)
•TeX/LaTeX (.tex)
•Text (.txt, .text, other file extensions), including
source code in common programming languages:
• Basic source code (.bas)
• C/C++ source code (.c, .cc, .cpp, .cxx, .h, .hpp)
• C# source code (.cs)
• Java source code (.java)
• Perl source code (.pl)
• Python source code (.py)
•Wireless Markup Language (.wml, .wap)
•XML (.xml)
@patrickstox @ahrefs #pubcon
Image Processing
• Text around the image
• Content of the image. They tag what is in the image. Not super reliable.
• Alt attribute
• Image name (minimal weight)
• Webpage title and description
Photo from a Gary Illyes Presentation
at Pubcon.
@patrickstox @ahrefs #pubcon
Robots.txt for Images
Blocking Googlebot Image from crawling means that your images will not
be indexed.
@patrickstox @ahrefs #pubcon
Video Processing
• OCR to get text
• Objects identified from visuals
• Speech converted to text
• Structured data
• Text and other signals from the page, URL, title, description
@patrickstox @ahrefs #pubcon
PDFs
• PDFs are converted and indexed as HTML
• OCR to get text
• Images get indexed
• Links get picked up
• Title
• File name
• …
@patrickstox @ahrefs #pubcon
Google Index
Named Caffeine
@patrickstox @ahrefs #pubcon
Data Infrastructure
Many data centers around the world.
Each has a copy of the index.
Millions of servers and hard drives.
Index is an inverted index.
Maps things like words to documents.
Index shards are split into words and phrases.
Other shards for metadata.
@patrickstox @ahrefs #pubcon
Indexing Tiers – Based On Doc Popularity
• Ram (fastest)
• SSD (fast)
• Hard drives (slowest)
@patrickstox @ahrefs #pubcon
Mobile Version Is Indexed (Mostly)
Some sites may remain on desktop-only indexing.
They don’t work on mobile.
@patrickstox @ahrefs #pubcon
Life Of A Query
@patrickstox @ahrefs #pubcon
Fun Fact
15% of queries have never been seen before
@patrickstox @ahrefs #pubcon
Start Typing - Autocomplete
Powered by real search data
and patterns across the web +
• The language of the query
• The location a query is coming from
• Trending interest in a query
• Your past searches
Probably reduces misspellings
@patrickstox @ahrefs #pubcon
Query parsing and understanding
BERT (DeepRank) – combinations of words express different
meanings and intents. They won’t drop important words from
the queries.
Neural matching – words to searches.
“For example, neural matching helps Google understand that a
search for “why does my TV look strange” is related to the
concept of “the soap opera effect.” We can then return pages
about the soap opera effect, even if the exact words aren’t used.”
@patrickstox @ahrefs #pubcon
Misspelling
1/10 searches are misspelled
@patrickstox @ahrefs #pubcon
Google Training Misspelling Example
Over 600 ways people misspelled Britney Spears.
http://archive.google.com/jobs/britney.html
@patrickstox @ahrefs #pubcon
Spelling Old Vs New
Old way:
How often terms were searched
+probability of typos from neighboring keys
New way:
Deep neural net with 680M parameters
@patrickstox @ahrefs #pubcon
Query Expansion
When the query is sent, it’s going to also pull pages with terms that include:
• Synonyms
• Antonyms
• Acronyms
• Plural/singular
• Stemming – root words
• Diacritical expansion - accent characters other versions
These will mostly get lower weights in scoring than the main term used.
@patrickstox @ahrefs #pubcon
Concepts & Entities
People, places, things
“RankBrain helps Google better relate pages to concepts – This
means Google can better return relevant pages even if they
don’t contain the exact words used in a search, by
understanding the page is related to other words and
concepts.”
@patrickstox @ahrefs #pubcon
Speculation
All the query expansion things may not
be necessary anymore. They may just
pull close terms in vector space.
@patrickstox @ahrefs #pubcon
Stop Words
The, is, and, of, a, are, an, if, etc.
Removed for some queries.
Used for other queries, like when it matches a concept.
@patrickstox @ahrefs #pubcon
Segmenter
Splits up strings (languages without spaces).
'上海浦东开发与建设同步' → ['上海', '浦东', '开发', ‘与', ’建设', '同步']
@patrickstox @ahrefs #pubcon
Retrieval – Posting List
Remember that inverted index?
Map of terms to pages that contain those terms. Get all those.
@patrickstox @ahrefs #pubcon
Sum Of The Total Pages From All Shards
@patrickstox @ahrefs #pubcon
Popular Queries Are Cached
@patrickstox @ahrefs #pubcon
Make A Smaller List - Ranking
Google is going to cut all those results down to the top 1000 by ranking them.
@patrickstox @ahrefs #pubcon
Ranking / Scoring – Query Dependent
Feature of a page & query
• Keyword hits
• All those other versions from the query expansion like synonyms
• Proximity
• Content relevance, topicality
• …
@patrickstox @ahrefs #pubcon
Ranking / Scoring – Query Independent
Feature of a page
• PageRank, site queries, mentions,
& other E-E-A-T signals
• Language
• Mobile-friendliness
• Page experience
• …
Numbers multiplied by other numbers in the scoring
@patrickstox @ahrefs #pubcon
They’re Like Nah, We Can Do Better
@patrickstox @ahrefs #pubcon
Reranking / Post-Retrieval Adjustments
Has a smaller number of results - 1000
With the smaller number, they can run more intelligent but resource intensive
systems to re-order the results.
@patrickstox @ahrefs #pubcon
RankBrain & BERT - Again
“Based on its complex language understanding, BERT can very quickly rank
documents for relevance.”
Depending on the search, Google’s algorithm can use either RankBrain, BERT,
or both.
@patrickstox @ahrefs #pubcon
Host Clustering
Limits the results you see from the same domain.
Add &filter=0 to your search URL to see unfiltered results.
@patrickstox @ahrefs #pubcon
Hreflang
Tries to swap to the most relevant country/language version of a page.
@patrickstox @ahrefs #pubcon
DMCA, Privacy Removals, URL Removal Tool
@patrickstox @ahrefs #pubcon
Spelling Corrections
@patrickstox @ahrefs #pubcon
Trending Topics Are Promoted
@patrickstox @ahrefs #pubcon
Spam
Spam demotions
Manual actions
@patrickstox @ahrefs #pubcon
Query Other Systems - Universal results
News, Maps, Images, Videos, etc.
Results are bidding for their position
@patrickstox @ahrefs #pubcon
@patrickstox @ahrefs #pubcon

More Related Content

What's hot

HELP! I've Been Hit By An Algorithm Update - Jess Maloney - BrightonSEO Apri...
HELP! I've Been Hit By An Algorithm Update - Jess Maloney - BrightonSEO  Apri...HELP! I've Been Hit By An Algorithm Update - Jess Maloney - BrightonSEO  Apri...
HELP! I've Been Hit By An Algorithm Update - Jess Maloney - BrightonSEO Apri...Jessica Maloney
 
I Am A Donut - How To Avoid International SEO Mistakes
I Am A Donut - How To Avoid International SEO MistakesI Am A Donut - How To Avoid International SEO Mistakes
I Am A Donut - How To Avoid International SEO MistakesTom Brennan
 
Canonicalization for SEO BrightonSEO April 2023 Patrick Stox
Canonicalization for SEO BrightonSEO April 2023 Patrick StoxCanonicalization for SEO BrightonSEO April 2023 Patrick Stox
Canonicalization for SEO BrightonSEO April 2023 Patrick StoxAhrefs
 
How to Use Search Intent to Dominate Google Discover
How to Use Search Intent to Dominate Google DiscoverHow to Use Search Intent to Dominate Google Discover
How to Use Search Intent to Dominate Google DiscoverFelipe Bazon
 
eCommerce Internal Linking - Into the Spider-Verse (BrightonSEO edition)
eCommerce Internal Linking - Into the Spider-Verse (BrightonSEO edition)eCommerce Internal Linking - Into the Spider-Verse (BrightonSEO edition)
eCommerce Internal Linking - Into the Spider-Verse (BrightonSEO edition)Kristina Azarenko
 
Why your tech optimisations are still sat in the backlog
Why your tech optimisations are still sat in the backlogWhy your tech optimisations are still sat in the backlog
Why your tech optimisations are still sat in the backlogVicky481083
 
Probabilistic Thinking in SEO - BrightonSEO October 2022
Probabilistic Thinking in SEO - BrightonSEO October 2022Probabilistic Thinking in SEO - BrightonSEO October 2022
Probabilistic Thinking in SEO - BrightonSEO October 2022Andrew Charlton
 
Core Web Vitals Audit - Sophie Gibson - PDF - BrightonSEO.pdf
Core Web Vitals Audit - Sophie Gibson - PDF - BrightonSEO.pdfCore Web Vitals Audit - Sophie Gibson - PDF - BrightonSEO.pdf
Core Web Vitals Audit - Sophie Gibson - PDF - BrightonSEO.pdfSophie Gibson
 
Creating Search Quality Algorithms - Richard Lawrence - BrightonSEO.pdf
Creating Search Quality Algorithms - Richard Lawrence - BrightonSEO.pdfCreating Search Quality Algorithms - Richard Lawrence - BrightonSEO.pdf
Creating Search Quality Algorithms - Richard Lawrence - BrightonSEO.pdfRichard Lawrence
 
Kleecks - AI-Martech as a game changer-DEF.pdf
Kleecks - AI-Martech as a game changer-DEF.pdfKleecks - AI-Martech as a game changer-DEF.pdf
Kleecks - AI-Martech as a game changer-DEF.pdfKleecks
 
How to produce great multilingual content, even when you can't read it | Laur...
How to produce great multilingual content, even when you can't read it | Laur...How to produce great multilingual content, even when you can't read it | Laur...
How to produce great multilingual content, even when you can't read it | Laur...Oban International
 
Most Valuable SEO Presentation - Advanced Search Summit - DMO Advanced 2021 -...
Most Valuable SEO Presentation - Advanced Search Summit - DMO Advanced 2021 -...Most Valuable SEO Presentation - Advanced Search Summit - DMO Advanced 2021 -...
Most Valuable SEO Presentation - Advanced Search Summit - DMO Advanced 2021 -...Ahrefs
 
Google Sheets For SEO - Tom Pool - London SEO Meetup XL
Google Sheets For SEO - Tom Pool - London SEO Meetup XLGoogle Sheets For SEO - Tom Pool - London SEO Meetup XL
Google Sheets For SEO - Tom Pool - London SEO Meetup XLTom Pool
 
Turning A Neglected YouTube Channel into a Traffic Generation Machine
Turning A Neglected YouTube Channel into a Traffic Generation MachineTurning A Neglected YouTube Channel into a Traffic Generation Machine
Turning A Neglected YouTube Channel into a Traffic Generation MachinePhil Nottingham
 
Hreflang tags: everything you need to know to start implementing them
Hreflang tags: everything you need to know to start implementing themHreflang tags: everything you need to know to start implementing them
Hreflang tags: everything you need to know to start implementing themSara Moccand-Sayegh
 
GretaMunari - The redemption of content automation
GretaMunari - The redemption of content automationGretaMunari - The redemption of content automation
GretaMunari - The redemption of content automationGretaMunari1
 
BrightonSEO - NLP for SEOs - How to optimise your content for BERT.pptx
BrightonSEO - NLP for SEOs - How to optimise your content for BERT.pptxBrightonSEO - NLP for SEOs - How to optimise your content for BERT.pptx
BrightonSEO - NLP for SEOs - How to optimise your content for BERT.pptxJosephineHaagen
 
BrightonSEO-Pres.pdf
BrightonSEO-Pres.pdfBrightonSEO-Pres.pdf
BrightonSEO-Pres.pdfNicolas Basoalto
 
Data Driven Approach to Scale SEO at BrightonSEO 2023
Data Driven Approach to Scale SEO at BrightonSEO 2023Data Driven Approach to Scale SEO at BrightonSEO 2023
Data Driven Approach to Scale SEO at BrightonSEO 2023Nitin Manchanda
 
Goodbye SEO fck ups! Learn to set an SEO Quality Assurance Framework
Goodbye SEO fck ups! Learn to set an SEO Quality Assurance FrameworkGoodbye SEO fck ups! Learn to set an SEO Quality Assurance Framework
Goodbye SEO fck ups! Learn to set an SEO Quality Assurance FrameworkAleyda SolĂ­s
 

What's hot (20)

HELP! I've Been Hit By An Algorithm Update - Jess Maloney - BrightonSEO Apri...
HELP! I've Been Hit By An Algorithm Update - Jess Maloney - BrightonSEO  Apri...HELP! I've Been Hit By An Algorithm Update - Jess Maloney - BrightonSEO  Apri...
HELP! I've Been Hit By An Algorithm Update - Jess Maloney - BrightonSEO Apri...
 
I Am A Donut - How To Avoid International SEO Mistakes
I Am A Donut - How To Avoid International SEO MistakesI Am A Donut - How To Avoid International SEO Mistakes
I Am A Donut - How To Avoid International SEO Mistakes
 
Canonicalization for SEO BrightonSEO April 2023 Patrick Stox
Canonicalization for SEO BrightonSEO April 2023 Patrick StoxCanonicalization for SEO BrightonSEO April 2023 Patrick Stox
Canonicalization for SEO BrightonSEO April 2023 Patrick Stox
 
How to Use Search Intent to Dominate Google Discover
How to Use Search Intent to Dominate Google DiscoverHow to Use Search Intent to Dominate Google Discover
How to Use Search Intent to Dominate Google Discover
 
eCommerce Internal Linking - Into the Spider-Verse (BrightonSEO edition)
eCommerce Internal Linking - Into the Spider-Verse (BrightonSEO edition)eCommerce Internal Linking - Into the Spider-Verse (BrightonSEO edition)
eCommerce Internal Linking - Into the Spider-Verse (BrightonSEO edition)
 
Why your tech optimisations are still sat in the backlog
Why your tech optimisations are still sat in the backlogWhy your tech optimisations are still sat in the backlog
Why your tech optimisations are still sat in the backlog
 
Probabilistic Thinking in SEO - BrightonSEO October 2022
Probabilistic Thinking in SEO - BrightonSEO October 2022Probabilistic Thinking in SEO - BrightonSEO October 2022
Probabilistic Thinking in SEO - BrightonSEO October 2022
 
Core Web Vitals Audit - Sophie Gibson - PDF - BrightonSEO.pdf
Core Web Vitals Audit - Sophie Gibson - PDF - BrightonSEO.pdfCore Web Vitals Audit - Sophie Gibson - PDF - BrightonSEO.pdf
Core Web Vitals Audit - Sophie Gibson - PDF - BrightonSEO.pdf
 
Creating Search Quality Algorithms - Richard Lawrence - BrightonSEO.pdf
Creating Search Quality Algorithms - Richard Lawrence - BrightonSEO.pdfCreating Search Quality Algorithms - Richard Lawrence - BrightonSEO.pdf
Creating Search Quality Algorithms - Richard Lawrence - BrightonSEO.pdf
 
Kleecks - AI-Martech as a game changer-DEF.pdf
Kleecks - AI-Martech as a game changer-DEF.pdfKleecks - AI-Martech as a game changer-DEF.pdf
Kleecks - AI-Martech as a game changer-DEF.pdf
 
How to produce great multilingual content, even when you can't read it | Laur...
How to produce great multilingual content, even when you can't read it | Laur...How to produce great multilingual content, even when you can't read it | Laur...
How to produce great multilingual content, even when you can't read it | Laur...
 
Most Valuable SEO Presentation - Advanced Search Summit - DMO Advanced 2021 -...
Most Valuable SEO Presentation - Advanced Search Summit - DMO Advanced 2021 -...Most Valuable SEO Presentation - Advanced Search Summit - DMO Advanced 2021 -...
Most Valuable SEO Presentation - Advanced Search Summit - DMO Advanced 2021 -...
 
Google Sheets For SEO - Tom Pool - London SEO Meetup XL
Google Sheets For SEO - Tom Pool - London SEO Meetup XLGoogle Sheets For SEO - Tom Pool - London SEO Meetup XL
Google Sheets For SEO - Tom Pool - London SEO Meetup XL
 
Turning A Neglected YouTube Channel into a Traffic Generation Machine
Turning A Neglected YouTube Channel into a Traffic Generation MachineTurning A Neglected YouTube Channel into a Traffic Generation Machine
Turning A Neglected YouTube Channel into a Traffic Generation Machine
 
Hreflang tags: everything you need to know to start implementing them
Hreflang tags: everything you need to know to start implementing themHreflang tags: everything you need to know to start implementing them
Hreflang tags: everything you need to know to start implementing them
 
GretaMunari - The redemption of content automation
GretaMunari - The redemption of content automationGretaMunari - The redemption of content automation
GretaMunari - The redemption of content automation
 
BrightonSEO - NLP for SEOs - How to optimise your content for BERT.pptx
BrightonSEO - NLP for SEOs - How to optimise your content for BERT.pptxBrightonSEO - NLP for SEOs - How to optimise your content for BERT.pptx
BrightonSEO - NLP for SEOs - How to optimise your content for BERT.pptx
 
BrightonSEO-Pres.pdf
BrightonSEO-Pres.pdfBrightonSEO-Pres.pdf
BrightonSEO-Pres.pdf
 
Data Driven Approach to Scale SEO at BrightonSEO 2023
Data Driven Approach to Scale SEO at BrightonSEO 2023Data Driven Approach to Scale SEO at BrightonSEO 2023
Data Driven Approach to Scale SEO at BrightonSEO 2023
 
Goodbye SEO fck ups! Learn to set an SEO Quality Assurance Framework
Goodbye SEO fck ups! Learn to set an SEO Quality Assurance FrameworkGoodbye SEO fck ups! Learn to set an SEO Quality Assurance Framework
Goodbye SEO fck ups! Learn to set an SEO Quality Assurance Framework
 

Similar to How Search Works

JavaScript SEO Ungagged 2019 Patrick Stox
JavaScript SEO Ungagged 2019 Patrick StoxJavaScript SEO Ungagged 2019 Patrick Stox
JavaScript SEO Ungagged 2019 Patrick Stoxpatrickstox
 
React JS and Search Engines - Patrick Stox at Triangle ReactJS Meetup
React JS and Search Engines - Patrick Stox at Triangle ReactJS MeetupReact JS and Search Engines - Patrick Stox at Triangle ReactJS Meetup
React JS and Search Engines - Patrick Stox at Triangle ReactJS Meetuppatrickstox
 
SMX Advanced 2018 SEO for Javascript Frameworks by Patrick Stox
SMX Advanced 2018 SEO for Javascript Frameworks by Patrick StoxSMX Advanced 2018 SEO for Javascript Frameworks by Patrick Stox
SMX Advanced 2018 SEO for Javascript Frameworks by Patrick Stoxpatrickstox
 
Everything That Can Go Wrong Will Go Wrong - Tech SEO Boost 2017 - Patrick Stox
Everything That Can Go Wrong Will Go Wrong - Tech SEO Boost 2017 - Patrick StoxEverything That Can Go Wrong Will Go Wrong - Tech SEO Boost 2017 - Patrick Stox
Everything That Can Go Wrong Will Go Wrong - Tech SEO Boost 2017 - Patrick Stoxpatrickstox
 
International SEO: The Weird Technical Parts - Pubcon Vegas 2019 Patrick Stox
International SEO: The Weird Technical Parts - Pubcon Vegas 2019 Patrick StoxInternational SEO: The Weird Technical Parts - Pubcon Vegas 2019 Patrick Stox
International SEO: The Weird Technical Parts - Pubcon Vegas 2019 Patrick Stoxpatrickstox
 
SMX Advanced 2018 Solving Complex SEO Problems by Patrick Stox
SMX Advanced 2018 Solving Complex SEO Problems by Patrick StoxSMX Advanced 2018 Solving Complex SEO Problems by Patrick Stox
SMX Advanced 2018 Solving Complex SEO Problems by Patrick Stoxpatrickstox
 
Google's Search Signals For Page Experience - SMX Advanced 2021 Patrick Stox
Google's Search Signals For Page Experience - SMX Advanced 2021 Patrick StoxGoogle's Search Signals For Page Experience - SMX Advanced 2021 Patrick Stox
Google's Search Signals For Page Experience - SMX Advanced 2021 Patrick StoxAhrefs
 
Page Experience Update TMC June 2021 Patrick Stox
Page Experience Update TMC June 2021 Patrick StoxPage Experience Update TMC June 2021 Patrick Stox
Page Experience Update TMC June 2021 Patrick Stoxpatrickstox
 
Website Migrations at SMX Munich 2019 - Patrick Stox
Website Migrations at SMX Munich 2019 - Patrick StoxWebsite Migrations at SMX Munich 2019 - Patrick Stox
Website Migrations at SMX Munich 2019 - Patrick Stoxpatrickstox
 
Migration Best Practices - SMX West 2019
Migration Best Practices - SMX West 2019Migration Best Practices - SMX West 2019
Migration Best Practices - SMX West 2019Bastian Grimm
 
Troubleshooting SEO for JS Frameworks - Patrick Stox - DTD 2018
Troubleshooting SEO for JS Frameworks - Patrick Stox - DTD 2018Troubleshooting SEO for JS Frameworks - Patrick Stox - DTD 2018
Troubleshooting SEO for JS Frameworks - Patrick Stox - DTD 2018patrickstox
 
Migration Best Practices - Search Y 2019, Paris
Migration Best Practices - Search Y 2019, ParisMigration Best Practices - Search Y 2019, Paris
Migration Best Practices - Search Y 2019, ParisBastian Grimm
 
Troubleshooting Technical SEO Problems - Patrick Stox - Raleigh SEO Meetup
Troubleshooting Technical SEO Problems - Patrick Stox - Raleigh SEO MeetupTroubleshooting Technical SEO Problems - Patrick Stox - Raleigh SEO Meetup
Troubleshooting Technical SEO Problems - Patrick Stox - Raleigh SEO Meetuppatrickstox
 
Nofollow UGC Sponsored SMX West 2020 Patrick Stox
Nofollow UGC Sponsored SMX West 2020 Patrick StoxNofollow UGC Sponsored SMX West 2020 Patrick Stox
Nofollow UGC Sponsored SMX West 2020 Patrick Stoxpatrickstox
 
Nofollow, UGC, and Sponsored Attributes
Nofollow, UGC, and Sponsored AttributesNofollow, UGC, and Sponsored Attributes
Nofollow, UGC, and Sponsored AttributesAhrefs
 
On-page SEO for Drupal
On-page SEO for DrupalOn-page SEO for Drupal
On-page SEO for DrupalSvilen Sabev
 
Advanced data-driven technical SEO - SMX London 2019
Advanced data-driven technical SEO - SMX London 2019Advanced data-driven technical SEO - SMX London 2019
Advanced data-driven technical SEO - SMX London 2019Bastian Grimm
 
Max Prin - TechSEO Boost 2017 - SEO Best Practices For JavaScript-Based Websites
Max Prin - TechSEO Boost 2017 - SEO Best Practices For JavaScript-Based WebsitesMax Prin - TechSEO Boost 2017 - SEO Best Practices For JavaScript-Based Websites
Max Prin - TechSEO Boost 2017 - SEO Best Practices For JavaScript-Based WebsitesMax Prin
 

Similar to How Search Works (20)

JavaScript SEO Ungagged 2019 Patrick Stox
JavaScript SEO Ungagged 2019 Patrick StoxJavaScript SEO Ungagged 2019 Patrick Stox
JavaScript SEO Ungagged 2019 Patrick Stox
 
React JS and Search Engines - Patrick Stox at Triangle ReactJS Meetup
React JS and Search Engines - Patrick Stox at Triangle ReactJS MeetupReact JS and Search Engines - Patrick Stox at Triangle ReactJS Meetup
React JS and Search Engines - Patrick Stox at Triangle ReactJS Meetup
 
SMX Advanced 2018 SEO for Javascript Frameworks by Patrick Stox
SMX Advanced 2018 SEO for Javascript Frameworks by Patrick StoxSMX Advanced 2018 SEO for Javascript Frameworks by Patrick Stox
SMX Advanced 2018 SEO for Javascript Frameworks by Patrick Stox
 
Everything That Can Go Wrong Will Go Wrong - Tech SEO Boost 2017 - Patrick Stox
Everything That Can Go Wrong Will Go Wrong - Tech SEO Boost 2017 - Patrick StoxEverything That Can Go Wrong Will Go Wrong - Tech SEO Boost 2017 - Patrick Stox
Everything That Can Go Wrong Will Go Wrong - Tech SEO Boost 2017 - Patrick Stox
 
International SEO: The Weird Technical Parts - Pubcon Vegas 2019 Patrick Stox
International SEO: The Weird Technical Parts - Pubcon Vegas 2019 Patrick StoxInternational SEO: The Weird Technical Parts - Pubcon Vegas 2019 Patrick Stox
International SEO: The Weird Technical Parts - Pubcon Vegas 2019 Patrick Stox
 
SMX Advanced 2018 Solving Complex SEO Problems by Patrick Stox
SMX Advanced 2018 Solving Complex SEO Problems by Patrick StoxSMX Advanced 2018 Solving Complex SEO Problems by Patrick Stox
SMX Advanced 2018 Solving Complex SEO Problems by Patrick Stox
 
BrightonSEO
BrightonSEOBrightonSEO
BrightonSEO
 
Google's Search Signals For Page Experience - SMX Advanced 2021 Patrick Stox
Google's Search Signals For Page Experience - SMX Advanced 2021 Patrick StoxGoogle's Search Signals For Page Experience - SMX Advanced 2021 Patrick Stox
Google's Search Signals For Page Experience - SMX Advanced 2021 Patrick Stox
 
Page Experience Update TMC June 2021 Patrick Stox
Page Experience Update TMC June 2021 Patrick StoxPage Experience Update TMC June 2021 Patrick Stox
Page Experience Update TMC June 2021 Patrick Stox
 
Website Migrations at SMX Munich 2019 - Patrick Stox
Website Migrations at SMX Munich 2019 - Patrick StoxWebsite Migrations at SMX Munich 2019 - Patrick Stox
Website Migrations at SMX Munich 2019 - Patrick Stox
 
Migration Best Practices - SMX West 2019
Migration Best Practices - SMX West 2019Migration Best Practices - SMX West 2019
Migration Best Practices - SMX West 2019
 
Troubleshooting SEO for JS Frameworks - Patrick Stox - DTD 2018
Troubleshooting SEO for JS Frameworks - Patrick Stox - DTD 2018Troubleshooting SEO for JS Frameworks - Patrick Stox - DTD 2018
Troubleshooting SEO for JS Frameworks - Patrick Stox - DTD 2018
 
Migration Best Practices - Search Y 2019, Paris
Migration Best Practices - Search Y 2019, ParisMigration Best Practices - Search Y 2019, Paris
Migration Best Practices - Search Y 2019, Paris
 
Troubleshooting Technical SEO Problems - Patrick Stox - Raleigh SEO Meetup
Troubleshooting Technical SEO Problems - Patrick Stox - Raleigh SEO MeetupTroubleshooting Technical SEO Problems - Patrick Stox - Raleigh SEO Meetup
Troubleshooting Technical SEO Problems - Patrick Stox - Raleigh SEO Meetup
 
Nofollow UGC Sponsored SMX West 2020 Patrick Stox
Nofollow UGC Sponsored SMX West 2020 Patrick StoxNofollow UGC Sponsored SMX West 2020 Patrick Stox
Nofollow UGC Sponsored SMX West 2020 Patrick Stox
 
Nofollow, UGC, and Sponsored Attributes
Nofollow, UGC, and Sponsored AttributesNofollow, UGC, and Sponsored Attributes
Nofollow, UGC, and Sponsored Attributes
 
On-page SEO for Drupal
On-page SEO for DrupalOn-page SEO for Drupal
On-page SEO for Drupal
 
Advanced data-driven technical SEO - SMX London 2019
Advanced data-driven technical SEO - SMX London 2019Advanced data-driven technical SEO - SMX London 2019
Advanced data-driven technical SEO - SMX London 2019
 
Technical Site Audits
Technical Site AuditsTechnical Site Audits
Technical Site Audits
 
Max Prin - TechSEO Boost 2017 - SEO Best Practices For JavaScript-Based Websites
Max Prin - TechSEO Boost 2017 - SEO Best Practices For JavaScript-Based WebsitesMax Prin - TechSEO Boost 2017 - SEO Best Practices For JavaScript-Based Websites
Max Prin - TechSEO Boost 2017 - SEO Best Practices For JavaScript-Based Websites
 

More from Ahrefs

What I learned from auditing over 1,000,000 websites - SERP Conf 2024 Patrick...
What I learned from auditing over 1,000,000 websites - SERP Conf 2024 Patrick...What I learned from auditing over 1,000,000 websites - SERP Conf 2024 Patrick...
What I learned from auditing over 1,000,000 websites - SERP Conf 2024 Patrick...Ahrefs
 
What's new at Ahrefs [end of 2023]
What's new at Ahrefs [end of 2023]What's new at Ahrefs [end of 2023]
What's new at Ahrefs [end of 2023]Ahrefs
 
10 quick wins to improve your rankings using Ahrefs.pptx
10 quick wins to improve your rankings using Ahrefs.pptx10 quick wins to improve your rankings using Ahrefs.pptx
10 quick wins to improve your rankings using Ahrefs.pptxAhrefs
 
The Elusive ROI of Content Marketing (by Tim Soulo)
The Elusive ROI of Content Marketing (by Tim Soulo)The Elusive ROI of Content Marketing (by Tim Soulo)
The Elusive ROI of Content Marketing (by Tim Soulo)Ahrefs
 
Machine Learning use cases for Technical SEO Automation Brighton SEO Patrick ...
Machine Learning use cases for Technical SEO Automation Brighton SEO Patrick ...Machine Learning use cases for Technical SEO Automation Brighton SEO Patrick ...
Machine Learning use cases for Technical SEO Automation Brighton SEO Patrick ...Ahrefs
 
Why Scaling (Great) Content Is So Bloody Hard
Why Scaling (Great) Content Is So Bloody HardWhy Scaling (Great) Content Is So Bloody Hard
Why Scaling (Great) Content Is So Bloody HardAhrefs
 
What's Next for Page Experience - SMX Next 2021 - Patrick Stox
What's Next for Page Experience - SMX Next 2021 - Patrick StoxWhat's Next for Page Experience - SMX Next 2021 - Patrick Stox
What's Next for Page Experience - SMX Next 2021 - Patrick StoxAhrefs
 
SEO Data Mythbusting - Are You Analyzing the Right Data?
SEO Data Mythbusting - Are You Analyzing the Right Data?SEO Data Mythbusting - Are You Analyzing the Right Data?
SEO Data Mythbusting - Are You Analyzing the Right Data?Ahrefs
 
Local Link Building - Pubcon Local 2021 - Patrick Stox
Local Link Building - Pubcon Local 2021 - Patrick StoxLocal Link Building - Pubcon Local 2021 - Patrick Stox
Local Link Building - Pubcon Local 2021 - Patrick StoxAhrefs
 
How to find other affiliates most successful content TPAS Patrick Stox Ahrefs
How to find other affiliates most successful content TPAS Patrick Stox AhrefsHow to find other affiliates most successful content TPAS Patrick Stox Ahrefs
How to find other affiliates most successful content TPAS Patrick Stox AhrefsAhrefs
 
Nofollow UGC Sponsored SEOFromHome Patrick Stox Ahrefs
Nofollow UGC Sponsored SEOFromHome Patrick Stox AhrefsNofollow UGC Sponsored SEOFromHome Patrick Stox Ahrefs
Nofollow UGC Sponsored SEOFromHome Patrick Stox AhrefsAhrefs
 
Data Visualization for SEO
Data Visualization for SEOData Visualization for SEO
Data Visualization for SEOAhrefs
 
Blogging for Business: How to Build a Million Dollar Business With One Article
Blogging for Business: How to Build a Million Dollar Business With One ArticleBlogging for Business: How to Build a Million Dollar Business With One Article
Blogging for Business: How to Build a Million Dollar Business With One ArticleAhrefs
 
Advanced Ways to Use Ahrefs (That You Didn't Know About)
Advanced Ways to Use Ahrefs (That You Didn't Know About)Advanced Ways to Use Ahrefs (That You Didn't Know About)
Advanced Ways to Use Ahrefs (That You Didn't Know About)Ahrefs
 
11 Things That ONLY Ahrefs Can Do
11 Things That ONLY Ahrefs Can Do11 Things That ONLY Ahrefs Can Do
11 Things That ONLY Ahrefs Can DoAhrefs
 
What you need to know about backlinks in 2019
What you need to know about backlinks in 2019What you need to know about backlinks in 2019
What you need to know about backlinks in 2019Ahrefs
 
Rethinking The Fundamentals of Keyword Research With The Insights From Big Da...
Rethinking The Fundamentals of Keyword Research With The Insights From Big Da...Rethinking The Fundamentals of Keyword Research With The Insights From Big Da...
Rethinking The Fundamentals of Keyword Research With The Insights From Big Da...Ahrefs
 
How To Get Backlinks For Free: the Unlinked Brand Mentions method
How To Get Backlinks For Free: the Unlinked Brand Mentions methodHow To Get Backlinks For Free: the Unlinked Brand Mentions method
How To Get Backlinks For Free: the Unlinked Brand Mentions methodAhrefs
 
How to Build Your Website Traffic with Evergreen Content and Social Media
How to Build Your Website Traffic with Evergreen Content and Social MediaHow to Build Your Website Traffic with Evergreen Content and Social Media
How to Build Your Website Traffic with Evergreen Content and Social MediaAhrefs
 

More from Ahrefs (19)

What I learned from auditing over 1,000,000 websites - SERP Conf 2024 Patrick...
What I learned from auditing over 1,000,000 websites - SERP Conf 2024 Patrick...What I learned from auditing over 1,000,000 websites - SERP Conf 2024 Patrick...
What I learned from auditing over 1,000,000 websites - SERP Conf 2024 Patrick...
 
What's new at Ahrefs [end of 2023]
What's new at Ahrefs [end of 2023]What's new at Ahrefs [end of 2023]
What's new at Ahrefs [end of 2023]
 
10 quick wins to improve your rankings using Ahrefs.pptx
10 quick wins to improve your rankings using Ahrefs.pptx10 quick wins to improve your rankings using Ahrefs.pptx
10 quick wins to improve your rankings using Ahrefs.pptx
 
The Elusive ROI of Content Marketing (by Tim Soulo)
The Elusive ROI of Content Marketing (by Tim Soulo)The Elusive ROI of Content Marketing (by Tim Soulo)
The Elusive ROI of Content Marketing (by Tim Soulo)
 
Machine Learning use cases for Technical SEO Automation Brighton SEO Patrick ...
Machine Learning use cases for Technical SEO Automation Brighton SEO Patrick ...Machine Learning use cases for Technical SEO Automation Brighton SEO Patrick ...
Machine Learning use cases for Technical SEO Automation Brighton SEO Patrick ...
 
Why Scaling (Great) Content Is So Bloody Hard
Why Scaling (Great) Content Is So Bloody HardWhy Scaling (Great) Content Is So Bloody Hard
Why Scaling (Great) Content Is So Bloody Hard
 
What's Next for Page Experience - SMX Next 2021 - Patrick Stox
What's Next for Page Experience - SMX Next 2021 - Patrick StoxWhat's Next for Page Experience - SMX Next 2021 - Patrick Stox
What's Next for Page Experience - SMX Next 2021 - Patrick Stox
 
SEO Data Mythbusting - Are You Analyzing the Right Data?
SEO Data Mythbusting - Are You Analyzing the Right Data?SEO Data Mythbusting - Are You Analyzing the Right Data?
SEO Data Mythbusting - Are You Analyzing the Right Data?
 
Local Link Building - Pubcon Local 2021 - Patrick Stox
Local Link Building - Pubcon Local 2021 - Patrick StoxLocal Link Building - Pubcon Local 2021 - Patrick Stox
Local Link Building - Pubcon Local 2021 - Patrick Stox
 
How to find other affiliates most successful content TPAS Patrick Stox Ahrefs
How to find other affiliates most successful content TPAS Patrick Stox AhrefsHow to find other affiliates most successful content TPAS Patrick Stox Ahrefs
How to find other affiliates most successful content TPAS Patrick Stox Ahrefs
 
Nofollow UGC Sponsored SEOFromHome Patrick Stox Ahrefs
Nofollow UGC Sponsored SEOFromHome Patrick Stox AhrefsNofollow UGC Sponsored SEOFromHome Patrick Stox Ahrefs
Nofollow UGC Sponsored SEOFromHome Patrick Stox Ahrefs
 
Data Visualization for SEO
Data Visualization for SEOData Visualization for SEO
Data Visualization for SEO
 
Blogging for Business: How to Build a Million Dollar Business With One Article
Blogging for Business: How to Build a Million Dollar Business With One ArticleBlogging for Business: How to Build a Million Dollar Business With One Article
Blogging for Business: How to Build a Million Dollar Business With One Article
 
Advanced Ways to Use Ahrefs (That You Didn't Know About)
Advanced Ways to Use Ahrefs (That You Didn't Know About)Advanced Ways to Use Ahrefs (That You Didn't Know About)
Advanced Ways to Use Ahrefs (That You Didn't Know About)
 
11 Things That ONLY Ahrefs Can Do
11 Things That ONLY Ahrefs Can Do11 Things That ONLY Ahrefs Can Do
11 Things That ONLY Ahrefs Can Do
 
What you need to know about backlinks in 2019
What you need to know about backlinks in 2019What you need to know about backlinks in 2019
What you need to know about backlinks in 2019
 
Rethinking The Fundamentals of Keyword Research With The Insights From Big Da...
Rethinking The Fundamentals of Keyword Research With The Insights From Big Da...Rethinking The Fundamentals of Keyword Research With The Insights From Big Da...
Rethinking The Fundamentals of Keyword Research With The Insights From Big Da...
 
How To Get Backlinks For Free: the Unlinked Brand Mentions method
How To Get Backlinks For Free: the Unlinked Brand Mentions methodHow To Get Backlinks For Free: the Unlinked Brand Mentions method
How To Get Backlinks For Free: the Unlinked Brand Mentions method
 
How to Build Your Website Traffic with Evergreen Content and Social Media
How to Build Your Website Traffic with Evergreen Content and Social MediaHow to Build Your Website Traffic with Evergreen Content and Social Media
How to Build Your Website Traffic with Evergreen Content and Social Media
 

Recently uploaded

Forecast of Content Marketing through AI
Forecast of Content Marketing through AIForecast of Content Marketing through AI
Forecast of Content Marketing through AIRinky
 
How videos can elevate your Google rankings and improve your EEAT - Benjamin ...
How videos can elevate your Google rankings and improve your EEAT - Benjamin ...How videos can elevate your Google rankings and improve your EEAT - Benjamin ...
How videos can elevate your Google rankings and improve your EEAT - Benjamin ...Benjamin Szturmaj
 
Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Netflix Ads The Game Changer in Video Ads – Who Needs YouTube.pptx (Chester Y...
Netflix Ads The Game Changer in Video Ads – Who Needs YouTube.pptx (Chester Y...Netflix Ads The Game Changer in Video Ads – Who Needs YouTube.pptx (Chester Y...
Netflix Ads The Game Changer in Video Ads – Who Needs YouTube.pptx (Chester Y...ChesterYang6
 
How To Utilize Calculated Properties in your HubSpot Setup
How To Utilize Calculated Properties in your HubSpot SetupHow To Utilize Calculated Properties in your HubSpot Setup
How To Utilize Calculated Properties in your HubSpot Setupssuser4571da
 
Do More with Less: Navigating Customer Acquisition Challenges for Today's Ent...
Do More with Less: Navigating Customer Acquisition Challenges for Today's Ent...Do More with Less: Navigating Customer Acquisition Challenges for Today's Ent...
Do More with Less: Navigating Customer Acquisition Challenges for Today's Ent...Search Engine Journal
 
Red bull marketing presentation pptxxxxx
Red bull marketing presentation pptxxxxxRed bull marketing presentation pptxxxxx
Red bull marketing presentation pptxxxxx216310017
 
Mastering SEO in the Evolving AI-driven World
Mastering SEO in the Evolving AI-driven WorldMastering SEO in the Evolving AI-driven World
Mastering SEO in the Evolving AI-driven WorldScalenut
 
SORA AI: Will It Be the Future of Video Creation?
SORA AI: Will It Be the Future of Video Creation?SORA AI: Will It Be the Future of Video Creation?
SORA AI: Will It Be the Future of Video Creation?Searchable Design
 
DIGITAL MARKETING COURSE IN BTM -Influencer Marketing Strategy
DIGITAL MARKETING COURSE IN BTM -Influencer Marketing StrategyDIGITAL MARKETING COURSE IN BTM -Influencer Marketing Strategy
DIGITAL MARKETING COURSE IN BTM -Influencer Marketing StrategySouvikRay24
 
Avoid the 2025 web accessibility rush: do not fear WCAG compliance
Avoid the 2025 web accessibility rush: do not fear WCAG complianceAvoid the 2025 web accessibility rush: do not fear WCAG compliance
Avoid the 2025 web accessibility rush: do not fear WCAG complianceDamien ROBERT
 
DGR_Digital Advertising Strategies for a Cookieless World_Presentation.pdf
DGR_Digital Advertising Strategies for a Cookieless World_Presentation.pdfDGR_Digital Advertising Strategies for a Cookieless World_Presentation.pdf
DGR_Digital Advertising Strategies for a Cookieless World_Presentation.pdfDemandbase
 
CALL ON ➥8923113531 🔝Call Girls Hazratganj Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Hazratganj Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Hazratganj Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Hazratganj Lucknow best sexual service Onlineanilsa9823
 
2024 SEO Trends for Business Success (WSA)
2024 SEO Trends for Business Success (WSA)2024 SEO Trends for Business Success (WSA)
2024 SEO Trends for Business Success (WSA)Jomer Gregorio
 
Russian Call Girls Nagpur Swara 8617697112 Independent Escort Service Nagpur
Russian Call Girls Nagpur Swara 8617697112 Independent Escort Service NagpurRussian Call Girls Nagpur Swara 8617697112 Independent Escort Service Nagpur
Russian Call Girls Nagpur Swara 8617697112 Independent Escort Service NagpurCall girls in Ahmedabad High profile
 
Social Samosa Guidebook for SAMMIES 2024.pdf
Social Samosa Guidebook for SAMMIES 2024.pdfSocial Samosa Guidebook for SAMMIES 2024.pdf
Social Samosa Guidebook for SAMMIES 2024.pdfSocial Samosa
 
Call Us ➥9654467111▻Call Girls In Delhi NCR
Call Us ➥9654467111▻Call Girls In Delhi NCRCall Us ➥9654467111▻Call Girls In Delhi NCR
Call Us ➥9654467111▻Call Girls In Delhi NCRSapana Sha
 

Recently uploaded (20)

Forecast of Content Marketing through AI
Forecast of Content Marketing through AIForecast of Content Marketing through AI
Forecast of Content Marketing through AI
 
How videos can elevate your Google rankings and improve your EEAT - Benjamin ...
How videos can elevate your Google rankings and improve your EEAT - Benjamin ...How videos can elevate your Google rankings and improve your EEAT - Benjamin ...
How videos can elevate your Google rankings and improve your EEAT - Benjamin ...
 
Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝
 
Netflix Ads The Game Changer in Video Ads – Who Needs YouTube.pptx (Chester Y...
Netflix Ads The Game Changer in Video Ads – Who Needs YouTube.pptx (Chester Y...Netflix Ads The Game Changer in Video Ads – Who Needs YouTube.pptx (Chester Y...
Netflix Ads The Game Changer in Video Ads – Who Needs YouTube.pptx (Chester Y...
 
No Cookies No Problem - Steve Krull, Be Found Online
No Cookies No Problem - Steve Krull, Be Found OnlineNo Cookies No Problem - Steve Krull, Be Found Online
No Cookies No Problem - Steve Krull, Be Found Online
 
How To Utilize Calculated Properties in your HubSpot Setup
How To Utilize Calculated Properties in your HubSpot SetupHow To Utilize Calculated Properties in your HubSpot Setup
How To Utilize Calculated Properties in your HubSpot Setup
 
Do More with Less: Navigating Customer Acquisition Challenges for Today's Ent...
Do More with Less: Navigating Customer Acquisition Challenges for Today's Ent...Do More with Less: Navigating Customer Acquisition Challenges for Today's Ent...
Do More with Less: Navigating Customer Acquisition Challenges for Today's Ent...
 
How to Create a Social Media Plan Like a Pro - Jordan Scheltgen
How to Create a Social Media Plan Like a Pro - Jordan ScheltgenHow to Create a Social Media Plan Like a Pro - Jordan Scheltgen
How to Create a Social Media Plan Like a Pro - Jordan Scheltgen
 
Red bull marketing presentation pptxxxxx
Red bull marketing presentation pptxxxxxRed bull marketing presentation pptxxxxx
Red bull marketing presentation pptxxxxx
 
Mastering SEO in the Evolving AI-driven World
Mastering SEO in the Evolving AI-driven WorldMastering SEO in the Evolving AI-driven World
Mastering SEO in the Evolving AI-driven World
 
The Future of Brands on LinkedIn - Alison Kaltman
The Future of Brands on LinkedIn - Alison KaltmanThe Future of Brands on LinkedIn - Alison Kaltman
The Future of Brands on LinkedIn - Alison Kaltman
 
SORA AI: Will It Be the Future of Video Creation?
SORA AI: Will It Be the Future of Video Creation?SORA AI: Will It Be the Future of Video Creation?
SORA AI: Will It Be the Future of Video Creation?
 
DIGITAL MARKETING COURSE IN BTM -Influencer Marketing Strategy
DIGITAL MARKETING COURSE IN BTM -Influencer Marketing StrategyDIGITAL MARKETING COURSE IN BTM -Influencer Marketing Strategy
DIGITAL MARKETING COURSE IN BTM -Influencer Marketing Strategy
 
Avoid the 2025 web accessibility rush: do not fear WCAG compliance
Avoid the 2025 web accessibility rush: do not fear WCAG complianceAvoid the 2025 web accessibility rush: do not fear WCAG compliance
Avoid the 2025 web accessibility rush: do not fear WCAG compliance
 
DGR_Digital Advertising Strategies for a Cookieless World_Presentation.pdf
DGR_Digital Advertising Strategies for a Cookieless World_Presentation.pdfDGR_Digital Advertising Strategies for a Cookieless World_Presentation.pdf
DGR_Digital Advertising Strategies for a Cookieless World_Presentation.pdf
 
CALL ON ➥8923113531 🔝Call Girls Hazratganj Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Hazratganj Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Hazratganj Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Hazratganj Lucknow best sexual service Online
 
2024 SEO Trends for Business Success (WSA)
2024 SEO Trends for Business Success (WSA)2024 SEO Trends for Business Success (WSA)
2024 SEO Trends for Business Success (WSA)
 
Russian Call Girls Nagpur Swara 8617697112 Independent Escort Service Nagpur
Russian Call Girls Nagpur Swara 8617697112 Independent Escort Service NagpurRussian Call Girls Nagpur Swara 8617697112 Independent Escort Service Nagpur
Russian Call Girls Nagpur Swara 8617697112 Independent Escort Service Nagpur
 
Social Samosa Guidebook for SAMMIES 2024.pdf
Social Samosa Guidebook for SAMMIES 2024.pdfSocial Samosa Guidebook for SAMMIES 2024.pdf
Social Samosa Guidebook for SAMMIES 2024.pdf
 
Call Us ➥9654467111▻Call Girls In Delhi NCR
Call Us ➥9654467111▻Call Girls In Delhi NCRCall Us ➥9654467111▻Call Girls In Delhi NCR
Call Us ➥9654467111▻Call Girls In Delhi NCR
 

How Search Works

  • 1. @patrickstox @ahrefs #pubcon How Search Works Presented by: Patrick Stox
  • 2. @patrickstox @ahrefs #pubcon Product Advisor, Technical SEO, & Brand Ambassador at • I write for Ahrefs blog but have written for many industry publications in the past • I speak at some conferences like SMX, Pubcon, UnGagged, DMO Advanced, TechSEO Boost, BrightonSEO • Organizer for the Raleigh SEO Meetup (most successful in US) and the Beer & SEO Meetup • We also run a conference, the Raleigh SEO Conference • Founder Technical SEO Slack Group • Moderator /r/TechSEO on Reddit • Helped define the role of Search Marketing Strategist for the US Department of Labor • Lead author for the SEO Chapter of the 2021 Web Almanac, reviewer for the 2022 Chapter • Technical Review Editor for The Art of SEO 4th Edition Who is Patrick Stox?
  • 3. @patrickstox @ahrefs #pubcon Disclaimer This is my understanding of systems and is based on a lot of public statements from Google and my own knowledge. Warning: It’s not going to be 100% complete or accurate.
  • 4. @patrickstox @ahrefs #pubcon How Many Domains Exist? Q3 2022 according to Verisign: 349.9 million registered January 2023 according to Netcraft: 270.9 million unique domains responded Ahrefs 213.1 million (after removing spam domains)
  • 5. @patrickstox @ahrefs #pubcon How Many Pages? Google in 2016: 130T known
  • 6. @patrickstox @ahrefs #pubcon How Big Is The Index? Google: hundreds of billions of pages indexed 100 PB in size Ahrefs: ~380B pages
  • 7. @patrickstox @ahrefs #pubcon A Fraction Of The Web Is Useful Content Rough math: (400B / 130T) * 100 = 0.3%
  • 9. @patrickstox @ahrefs #pubcon Spam Google 2021: “every day, we discover 40 billion spammy pages” That’s 14.6T spam pages a year.
  • 10. @patrickstox @ahrefs #pubcon Googlebot Googlebot is a lot of systems (1000+) and there are multiple Googlebots. • Googlebot Image • Googlebot News • Googlebot Video • Googlebot Desktop • Googlebot Mobile • +Ads and more https://developers.google.com/search/docs/crawling-indexing/overview- google-crawlers
  • 11. @patrickstox @ahrefs #pubcon Googlebot Is A Protocol Buffer It stores structured data. Similar to JSON, but smaller and faster.
  • 12. @patrickstox @ahrefs #pubcon Googlebot Rendering Pipeline (Simplified)
  • 13. @patrickstox @ahrefs #pubcon URL Sources • Links on pages, or anything that even looks like a link • Sitemaps • Request indexing in GSC • Indexing API (limited use cases) • RSS Feeds • WebSub (formerly PubSubHubbub)
  • 14. @patrickstox @ahrefs #pubcon Crawler Queue / Scheduler Determines what URLs to crawl and when. 2 main purposes: • Discovery • Refresh
  • 15. @patrickstox @ahrefs #pubcon What SEOs Call Crawl Budget, Google Calls Crawl demand How much Google wants to crawl your site. Crawl rate limit How much crawling your website can support.
  • 16. @patrickstox @ahrefs #pubcon What Counts Against Your Crawl Budget? All URLs and requests including: • Pages/files • Alternate URLs like AMP or m-dot pages, hreflang • CSS • JavaScript, including XHR requests • Embedded content ***All Googlebots share the same crawl budget, including the ones for Ads, images, etc.
  • 17. @patrickstox @ahrefs #pubcon Crawl Demand Factors • PageRank • How often pages change (freshness/staleness) • When it was last crawled • Any major changes
  • 18. @patrickstox @ahrefs #pubcon Crawl Rate Factors • Stability / crawl health • Slow responses • Errors. 5xx (server errors) or 429 (too many requests) HTTP status codes. They don’t want to crash the sites and the crawlers will generally back down if they start seeing issues.
  • 20. @patrickstox @ahrefs #pubcon Crawling The little spider is named Crawley.
  • 21. @patrickstox @ahrefs #pubcon Crawling Mostly from Mountain View, CA, USA. Every request needs to respect robots.txt. 15MB max HTML size.
  • 22. @patrickstox @ahrefs #pubcon Google Doesn’t Navigate Like Users Sends requests for the files individually, doesn’t navigate between pages like a user.
  • 23. @patrickstox @ahrefs #pubcon Caching Files They Crawl more than HTML: • Pages and other file types • JavaScript • CSS
  • 24. @patrickstox @ahrefs #pubcon Caching Files Files are stored for use in rendering. Google will ignore your cache timings and fetch a new copy when they want to. JS HTML HTML HTML JS CSS CSS CSS Cache Cache
  • 25. @patrickstox @ahrefs #pubcon Processing – We’ll Cover This Shortly
  • 26. @patrickstox @ahrefs #pubcon Web Rendering Service (WRS) Needed to process JavaScript Evergreen (up-to-date) Googlebot Headless (no Graphical User Interface)
  • 27. @patrickstox @ahrefs #pubcon Web Rendering Service (WRS) • Stateless (storage and cookies cleared between loads) • Denies Permissions • Flattens light DOM and shadow DOM • Date / Time functions adjusted • Service workers rejected • Animations may differ • Random may not be random
  • 28. @patrickstox @ahrefs #pubcon Myth: 5 Second Limit I think this started with a test from Max Prin on the time when the testing tools took a screenshot. They need to have reasonable time limits for testing tools. https://maxprin.com/tests/js-timer/
  • 29. @patrickstox @ahrefs #pubcon No 5 Second Limit They’ll try to wait for pages to finish, something like networkidle0 (no more activity). Eventually cuts off in case something gets stuck or someone is trying to mine bitcoin.
  • 30. @patrickstox @ahrefs #pubcon It Doesn’t Even Make Sense They’re basically loading a page with everything cached already. WRS JS HTML HTML HTML JS CSS CSS CSS Cache Cache
  • 31. @patrickstox @ahrefs #pubcon This System Causes Other Issues Impossible states – previous file versions used when rendering. File versioning /fingerprinting should help. XHR requests are done in real time.
  • 32. @patrickstox @ahrefs #pubcon Myth: Weeks To Render All pages go through the renderer. The average wait time is 5 seconds according to Google’s Martin Splitt. The 90th percentile is only minutes, not weeks. Probably comes from pages not being prioritized for crawling.
  • 33. @patrickstox @ahrefs #pubcon Rendering At Web Scale The 8th wonder of the world.
  • 34. @patrickstox @ahrefs #pubcon They Use Some Hacks “In Google search we don’t really care about the pixels because we don’t really want to show it to someone. We want to process the information and the semantic information so we need something in the intermediate state. We don’t have to actually paint the pixels.” – Martin Splitt
  • 35. @patrickstox @ahrefs #pubcon What That Looks Like Gray = downloads Blue = HTML Yellow = JavaScript Purple = Layout Green = Painting
  • 36. @patrickstox @ahrefs #pubcon They Won’t Render Noindexed Pages <meta name="robots" content="noindex"> <meta name="robots" content="none"> None = noindex, nofollow
  • 37. @patrickstox @ahrefs #pubcon They’re Not Taking Actions They don’t scroll. They generally don’t click.
  • 39. @patrickstox @ahrefs #pubcon They Don’t Click Load content into the Document Object Model (DOM) by default. They won’t see the content if it requires a click that makes an XHR request to pull it in. DOM Tree and CSS Object Model (CSSOM) form the Render Tree. That’s what gets indexed.
  • 40. @patrickstox @ahrefs #pubcon DOM Tree (pictured) CSSOM (not pictured) would add info like font size, weight, color, etc. to each element. Render Tree
  • 41. @patrickstox @ahrefs #pubcon Collapser • Error handling • Retries • Soft 404s
  • 42. @patrickstox @ahrefs #pubcon Processing – Now We’ll Talk About It
  • 43. @patrickstox @ahrefs #pubcon Processing - Duplicates Duplicate detection - content hashes or checksum They’ll remove boilerplate content (nav, footer) for the checksum.
  • 45. @patrickstox @ahrefs #pubcon Processing – Duplicate Elimination Canonicalization
  • 46. @patrickstox @ahrefs #pubcon ~20 Canonicalization Signals • Duplicates • Redirects (high weight) • Canonical link elements - multiple will be ignored • Sitemap URLs • Links (Internal/External, PageRank) • Alternates – mobile, AMP, print, Hreflang • HTTPS pages over HTTP • Shorter URLs over longer URLs • Where content was first published / seen • Site level signals like a history of scraped content • Pages over PDFs Machine learning system
  • 47. @patrickstox @ahrefs #pubcon 301 = Permanent, 302 = Temporary Holds true for other perm and temp redirects
  • 49. @patrickstox @ahrefs #pubcon Processing – Link Parser Good: <a> tag with an href attribute. <a href=”/page”>simple is good</a> <a href=”/page” onclick=”goTo(‘page’)”>still okay</a>
  • 50. @patrickstox @ahrefs #pubcon Processing – Link Parser Bad (but may be parsed): <a routerLink="products/category">no href</a> <a onclick=”goTo(‘page’)”>no href</a> <a href=”javascript:goTo(‘page’)”>kind of nested</a> <a href=”javascript:void(0)”>missing link</a> <span onclick=”goTo(‘page’)”>not the right HTML element or href</span> <span href=“page">not the right HTML element</span> <option value="page">not the right HTML element</option> <a href=”#”>no link</a> Button, ng-click, there are many more ways this can be done incorrectly.
  • 51. @patrickstox @ahrefs #pubcon Processing – Link Parser • Link location, where it goes • Anchor text • Surrounding text • …
  • 52. @patrickstox @ahrefs #pubcon Link Tagging • Penguin • Location on page (footer, main content) • Disavow • …
  • 53. @patrickstox @ahrefs #pubcon Processing – Content Parser • Content – tokenized, vectorized. Words become numbers. • Content language • Content location • Extract meta tags • Extract Schema • HTML Lexer – normalize the HTML • Topic analysis. Content on other topics may be weighted less in ranking. • Semantic analysis. Linguistic, knowledge graph, address extraction • …
  • 54. @patrickstox @ahrefs #pubcon Content Tagging • YMYL • Adult / safe search • Mobile-friendly • …
  • 55. @patrickstox @ahrefs #pubcon Signal Collectors • PageRank • Spam • Page Experience • Freshness • …
  • 56. @patrickstox @ahrefs #pubcon A Lot More In Processing Like Drop anything after # in URLs. (some exceptions to this) Most Restrictive Directives index + noindex + index = noindex They’ll drop low quality content
  • 57. @patrickstox @ahrefs #pubcon Other Files May Be Processed Differently Adobe Portable Document Format (.pdf) •Adobe PostScript (.ps) •Google Earth (.kml, .kmz) •GPS eXchange Format (.gpx) •Hancom Hanword (.hwp) •HTML (.htm, .html, other file extensions) •Lotus •Microsoft Excel (.xls, .xlsx) •Microsoft PowerPoint (.ppt, .pptx) •Microsoft Word (.doc, .docx) •OpenOffice presentation (.odp) •OpenOffice spreadsheet (.ods) •OpenOffice text (.odt) •Rich Text Format (.rtf) •Scalable Vector Graphics (.svg) •TeX/LaTeX (.tex) •Text (.txt, .text, other file extensions), including source code in common programming languages: • Basic source code (.bas) • C/C++ source code (.c, .cc, .cpp, .cxx, .h, .hpp) • C# source code (.cs) • Java source code (.java) • Perl source code (.pl) • Python source code (.py) •Wireless Markup Language (.wml, .wap) •XML (.xml)
  • 58. @patrickstox @ahrefs #pubcon Image Processing • Text around the image • Content of the image. They tag what is in the image. Not super reliable. • Alt attribute • Image name (minimal weight) • Webpage title and description Photo from a Gary Illyes Presentation at Pubcon.
  • 59. @patrickstox @ahrefs #pubcon Robots.txt for Images Blocking Googlebot Image from crawling means that your images will not be indexed.
  • 60. @patrickstox @ahrefs #pubcon Video Processing • OCR to get text • Objects identified from visuals • Speech converted to text • Structured data • Text and other signals from the page, URL, title, description
  • 61. @patrickstox @ahrefs #pubcon PDFs • PDFs are converted and indexed as HTML • OCR to get text • Images get indexed • Links get picked up • Title • File name • …
  • 62. @patrickstox @ahrefs #pubcon Google Index Named Caffeine
  • 63. @patrickstox @ahrefs #pubcon Data Infrastructure Many data centers around the world. Each has a copy of the index. Millions of servers and hard drives. Index is an inverted index. Maps things like words to documents. Index shards are split into words and phrases. Other shards for metadata.
  • 64. @patrickstox @ahrefs #pubcon Indexing Tiers – Based On Doc Popularity • Ram (fastest) • SSD (fast) • Hard drives (slowest)
  • 65. @patrickstox @ahrefs #pubcon Mobile Version Is Indexed (Mostly) Some sites may remain on desktop-only indexing. They don’t work on mobile.
  • 67. @patrickstox @ahrefs #pubcon Fun Fact 15% of queries have never been seen before
  • 68. @patrickstox @ahrefs #pubcon Start Typing - Autocomplete Powered by real search data and patterns across the web + • The language of the query • The location a query is coming from • Trending interest in a query • Your past searches Probably reduces misspellings
  • 69. @patrickstox @ahrefs #pubcon Query parsing and understanding BERT (DeepRank) – combinations of words express different meanings and intents. They won’t drop important words from the queries. Neural matching – words to searches. “For example, neural matching helps Google understand that a search for “why does my TV look strange” is related to the concept of “the soap opera effect.” We can then return pages about the soap opera effect, even if the exact words aren’t used.”
  • 71. @patrickstox @ahrefs #pubcon Google Training Misspelling Example Over 600 ways people misspelled Britney Spears. http://archive.google.com/jobs/britney.html
  • 72. @patrickstox @ahrefs #pubcon Spelling Old Vs New Old way: How often terms were searched +probability of typos from neighboring keys New way: Deep neural net with 680M parameters
  • 73. @patrickstox @ahrefs #pubcon Query Expansion When the query is sent, it’s going to also pull pages with terms that include: • Synonyms • Antonyms • Acronyms • Plural/singular • Stemming – root words • Diacritical expansion - accent characters other versions These will mostly get lower weights in scoring than the main term used.
  • 74. @patrickstox @ahrefs #pubcon Concepts & Entities People, places, things “RankBrain helps Google better relate pages to concepts – This means Google can better return relevant pages even if they don’t contain the exact words used in a search, by understanding the page is related to other words and concepts.”
  • 75. @patrickstox @ahrefs #pubcon Speculation All the query expansion things may not be necessary anymore. They may just pull close terms in vector space.
  • 76. @patrickstox @ahrefs #pubcon Stop Words The, is, and, of, a, are, an, if, etc. Removed for some queries. Used for other queries, like when it matches a concept.
  • 77. @patrickstox @ahrefs #pubcon Segmenter Splits up strings (languages without spaces). '上海浦东开发与建设同步' → ['上海', '浦东', '开发', ‘与', ’建设', '同步']
  • 78. @patrickstox @ahrefs #pubcon Retrieval – Posting List Remember that inverted index? Map of terms to pages that contain those terms. Get all those.
  • 79. @patrickstox @ahrefs #pubcon Sum Of The Total Pages From All Shards
  • 81. @patrickstox @ahrefs #pubcon Make A Smaller List - Ranking Google is going to cut all those results down to the top 1000 by ranking them.
  • 82. @patrickstox @ahrefs #pubcon Ranking / Scoring – Query Dependent Feature of a page & query • Keyword hits • All those other versions from the query expansion like synonyms • Proximity • Content relevance, topicality • …
  • 83. @patrickstox @ahrefs #pubcon Ranking / Scoring – Query Independent Feature of a page • PageRank, site queries, mentions, & other E-E-A-T signals • Language • Mobile-friendliness • Page experience • … Numbers multiplied by other numbers in the scoring
  • 84. @patrickstox @ahrefs #pubcon They’re Like Nah, We Can Do Better
  • 85. @patrickstox @ahrefs #pubcon Reranking / Post-Retrieval Adjustments Has a smaller number of results - 1000 With the smaller number, they can run more intelligent but resource intensive systems to re-order the results.
  • 86. @patrickstox @ahrefs #pubcon RankBrain & BERT - Again “Based on its complex language understanding, BERT can very quickly rank documents for relevance.” Depending on the search, Google’s algorithm can use either RankBrain, BERT, or both.
  • 87. @patrickstox @ahrefs #pubcon Host Clustering Limits the results you see from the same domain. Add &filter=0 to your search URL to see unfiltered results.
  • 88. @patrickstox @ahrefs #pubcon Hreflang Tries to swap to the most relevant country/language version of a page.
  • 89. @patrickstox @ahrefs #pubcon DMCA, Privacy Removals, URL Removal Tool
  • 92. @patrickstox @ahrefs #pubcon Spam Spam demotions Manual actions
  • 93. @patrickstox @ahrefs #pubcon Query Other Systems - Universal results News, Maps, Images, Videos, etc. Results are bidding for their position