12. Site Health Monitoring
● Out-of-the-box solutions, scheduled crawls, custom extractions, etc.
● Custom solution - tips and ideas
○ Monitor only a few URLs per page template
○ No database needed, only keep 3 days of
data in .txt of .json files
○ Free API for Domain-level info:
whoisxmlapi.com (screenshot, DNS,
categorization, SSL cert, etc.)
○ Gmail API + Puppeteer:
parse GSC notifications and login GSC to get
“# of URLs impacted”
13. URL Inspector on Steroid
“I want to know everything about that URL…”
● Fetch and render (custom extractions for
rendering issues)
○ cURL + puppeteer
● On-page elements (meta data, headings, etc.)
● hreflang tags validation
● Schema markup extraction
○ Free Yandex API
● GSC’s URL inspection API
● PageSpeed Insights API (Lighthouse reports)
● Content: entities, classification, sentiment
○ Google Natural Language API
● Traffic: GSC + GA APIs
● CMS API for internal data
● Geotargeting: anonymous-proxies.net
https://technicalseo.com/tools/fetch-render/
22. More Insights (Just) with Search Console Data
Combine “dimensions” with the API and/or data from multiple requests to analyze:
● Top Keywords of Top URLs
○ e.g. top 5 keywords for each of the top 100 URLs
○ What % of traffic each keyword represent? e.g. top keyword = 65% of
clicks to page
● Cannibalization
○ How many and which pages rank for each keywords?
○ Within same site or across markets (e.g. UK site outranking US site in US)
23. Core Web Vitals
● Leverage the historical data
endpoint in
CrUX API
● Overlay competitors
(programmatically pull from
semrush)
● Warning: Search Console report
vs. CrUX origin data
29. Generative AI
OpenAI’s GPT API - Function calling
- Integrate any data into your
chat/assistant
- Eliminate “hallucinations”
- New: cheaper pricing for GPT-4
API and improved function
calling feature