Time to speed-up your SEO workflows!
In this talk, I will show you how to:
+ Build simple Python Web apps to automate your tasks via the mighty Streamlit framework
+ Deploy them in one click and for free, so you can share them with your teammates (or the word!)
I'll finish the talk with some exciting use cases!
SearchLove London 2016 | Dom Woodman | How to Get Insight From Your LogsDistilled
In the SEO industry, we obsess on everything Google says, from John Mueller dropping a hint in a Webmaster Hangout, to the ranking data we spend £1000s to gather. Yet we ignore the data Google throws at us every day, the crawling data. For the longest time, site crawls, traffic data, and rankings have been the pillars of SEO data gathering. Log files should join them as something everyone is doing. We'll go through how to get everything set-up, look at some of the tools to make it easy and repeatable and go through the kinds of analysis you can do to get insights from the data.
Migrating a large scale banking app to composeFatih Giris
The Android community adopts Jetpack Compose more and more each day. But what about our existing apps? Adopting existing apps to Jetpack Compose might be an overwhelming task when the migration is not planned well.
In this talk, we will see the Compose migration strategy used in DNB. We will briefly talk about the interoperability APIs and how they are used in this migration process. We will also talk about how we are adopting our internal design system to Jetpack Compose. Finally, we will discuss the challenges we faced during this migration.
Rendering SEO Manifesto - Why we need to go beyond JavaScript SEOOnely
Want to make sure that your content gets properly accessed by search engines and ranks high? Look no further! In this beginner-friendly introduction to batch-optimized rendering, Bartosz will guide you through how Google is rendering websites on a large scale. You’ll gain groundbreaking insights based on Google’s patents and documentation. Join Bartosz to get a new perspective on technical SEO and use it to get more traffic!
Barry Adams - Search in a screenless world - Nottingham Digital SummitHallam
There has been a seismic shift in how people discover content online. Desktop searches are surpassed by mobile searches, and voice search is seeing double-digit growth every month. How can businesses prepare their websites for the future of online search? This talk will explore changes in search behaviour and provide actionable tips for businesses. Which technologies are driving screenless searches through voice search and automatic notifications? How can you prepare your website to integrate with the future of SEO?
Search engines have come a long way in understanding JavaScript, but issues with rendering and load times can still impact your crawl budget and prevent search engines from indexing valuable content!
Finding the optimal solution that provides the best user experience, whilst also satisfying the bots can be a challenge. This talk will cover the differences between these solutions, a number of tools and metrics you can use, and other significant considerations to take into account when proposing a rendering solution to your developers.
SearchLeeds 2018 - Steve Chambers - Stickyeyes - How not to F**K up a Migration Branded3
Whether you’re migrating to HTTPS, domain or to a whole new website platform, there are crucial elements that you need to be considering to avoid substantial loss of organic traffic. Steve's presentation takes you through the most common pitfalls of site migrations and how you can avoid them, ensuring a smooth transition.
SearchLove London 2016 | Dom Woodman | How to Get Insight From Your LogsDistilled
In the SEO industry, we obsess on everything Google says, from John Mueller dropping a hint in a Webmaster Hangout, to the ranking data we spend £1000s to gather. Yet we ignore the data Google throws at us every day, the crawling data. For the longest time, site crawls, traffic data, and rankings have been the pillars of SEO data gathering. Log files should join them as something everyone is doing. We'll go through how to get everything set-up, look at some of the tools to make it easy and repeatable and go through the kinds of analysis you can do to get insights from the data.
Migrating a large scale banking app to composeFatih Giris
The Android community adopts Jetpack Compose more and more each day. But what about our existing apps? Adopting existing apps to Jetpack Compose might be an overwhelming task when the migration is not planned well.
In this talk, we will see the Compose migration strategy used in DNB. We will briefly talk about the interoperability APIs and how they are used in this migration process. We will also talk about how we are adopting our internal design system to Jetpack Compose. Finally, we will discuss the challenges we faced during this migration.
Rendering SEO Manifesto - Why we need to go beyond JavaScript SEOOnely
Want to make sure that your content gets properly accessed by search engines and ranks high? Look no further! In this beginner-friendly introduction to batch-optimized rendering, Bartosz will guide you through how Google is rendering websites on a large scale. You’ll gain groundbreaking insights based on Google’s patents and documentation. Join Bartosz to get a new perspective on technical SEO and use it to get more traffic!
Barry Adams - Search in a screenless world - Nottingham Digital SummitHallam
There has been a seismic shift in how people discover content online. Desktop searches are surpassed by mobile searches, and voice search is seeing double-digit growth every month. How can businesses prepare their websites for the future of online search? This talk will explore changes in search behaviour and provide actionable tips for businesses. Which technologies are driving screenless searches through voice search and automatic notifications? How can you prepare your website to integrate with the future of SEO?
Search engines have come a long way in understanding JavaScript, but issues with rendering and load times can still impact your crawl budget and prevent search engines from indexing valuable content!
Finding the optimal solution that provides the best user experience, whilst also satisfying the bots can be a challenge. This talk will cover the differences between these solutions, a number of tools and metrics you can use, and other significant considerations to take into account when proposing a rendering solution to your developers.
SearchLeeds 2018 - Steve Chambers - Stickyeyes - How not to F**K up a Migration Branded3
Whether you’re migrating to HTTPS, domain or to a whole new website platform, there are crucial elements that you need to be considering to avoid substantial loss of organic traffic. Steve's presentation takes you through the most common pitfalls of site migrations and how you can avoid them, ensuring a smooth transition.
BrightonSEO, July 2021 - To better understand a website's content search engines developed Web Rendering Services and are now able to render pages more or less like a normal user. Those Web Rendering Services are strictly connected to other phases of the crawling-indexing-ranking pipeline - if a rendering fails, it may affect all of them. In this session Giacomo will guide you through the process of understanding why rendering could be a problem also for non-Javascript pages, how to manually debug page rendering, the difference between understanding WRSs' capabilities and debugging problems on a website, and eventually how to test pages at scale.
TechSEO Boost 2018: The Statelessness of Technical SEOCatalyst
Is there a state of technical SEO? Or is it simply a function of what’s happening with tech in broader disciplines like data/analytics, digital marketing, and web development? Join this session to get Mike King’s take on the state, or rather, statelessness of technical SEO. He’ll share the things you need to know to build an authentic picture of your technical SEO program and get on Google’s level.
TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...Catalyst
View the recording here: https://www.catalystdigital.com/techseoboost/#on-demand-recordings
Jamie Indigo, Technical SEO Consultant, DeepCrawl
Everyone's buzzing about Core Web Vitals, but we often overlook critical context-- these metrics are the results of how we render content. Let's deep dive into render mechanics, strategies, performance measurement and what's the best fit for your site. As with most things SEO: it's not about what you do; it's how you do it.
Managing Director of iPullRank, Mike King, talks about how to leverage automated testing to ensure that developers engaged in Continuous Integration don't end up accidentally breaking the optimizations in place for SEO
Hreflang - why and how and why not for International SEOGerry White
Hreflang has been the most challenging, one of the most complicated and almost unpredictable elements of technical SEO, and after doing hundreds of implementations Gerry White and Rise at Seven have learnt a ton about the reasons it can go wrong, why Google really does just treat it as a hint, not a directive and why even when it is all perfect, it can still no achieve the desired results. This will explain how to look at the data, how to evaluate the results and monitor it.
rel canonical audit BrightonSEO September 2018Mark Thomas
Why taking a look at your canonical setup is a smart move... Some data to compare with, an audit checklist, and a number of Google's quotes over the years.
Solving Complex JavaScript Issues and Leveraging Semantic HTML5Hamlet Batista
On this presentation we go deep on Chrome developer tools, JS debugger and breakpoints, technical optimization and capabilities of browser service workers to improve SEO and performance
SEO for Large/Enterprise Websites - Data & Tech SideDominic Woodman
There is a lot to cover about SEO for large websites/enterprise.
In this talk we'll cover primarily the data analysis and the technical SEO side of things. In future presentations we'll look at more.
TechSEO Boost 2018: Internal Link Optimization on SteroidsCatalyst
SEO for large sites is completely different than SEO for smaller sites. Large sites have a strong (yet often overlooked!) lever that can boost rankings: internal linking! However, it can be challenging to understand which pages have the highest PageRank, so that you can tweak them to serve important pages better. That can only be determined when you combine internal and external PageRank. Join Kevin Indig as he presents an innovative approach that merges data from crawls, log files, and backlinks to solve the puzzle! You’ll learn how to:
• Combine crawls, log files, and backlinks to find weaknesses in your internal linking structure
• Analyze the impact of tweaking internal linking before you deploy the changes
• Understand how to tweak internal linking at scale
TechSEO Boost 2017: SEO Best Practices for JavaScript T-Based WebsitesCatalyst
While providing a dynamic and fast user experience, JavaScript-based sites (SPAs/PWAs) are not always “SEO friendly.” Therefore, it is crucial for developers to understand how search engines crawl, parse, eventually render, and index dynamic websites, to make sure bots get the experience they developed and the content of the site.
UK Top 5,000 Websites; Mobile Site Speed Benchmark - BrightonSEOErudite
At Erudite we like to conduct our own R and D so that we truly understand the competitive landscape. We analysed the Lighthouse speed metrics of 5,000 of the UKs top websites, and categorised them by channel, so that we can better understand mobile site speed in the context of competition.
We’ve analysed the SEO of over 100 eCom sites - this is what we’ve learned!DanielCartland
Working exclusively with eCommerce websites across a whole range of CMS, we've seen (almost) everything, from accidental overnight noindexing of entire websites, to broken dynamic rendering stopping only poor Bingbot from crawling a site.
But what we find most fascinating are the trends that arise most frequently, often transcending the boundaries of website size, niche, and brand: - What opportunities do most eCommerce sites miss? - What problems are frequently holding eCom brands from product goldmines? - What have we found to be the most effective ways of selling the ROI of SEO and getting our recommendations implemented above the inevitable development backlogs?
This talk aims to outline what we've learned about the most common issues and opportunities eCommerce websites face, with data-backed examples of what websites can do today to boost SEO and make an immediate difference in organic traffic and revenue.
Getting Started with Python and Machine Learning for SEO | BrightonSEO Octobe...Ruth Everett
Find out how Python and Machine Learning can be used to save you time and help you understand your website better.
This session will cover how advancements to Python and Machine Learning are changing the game for busy SEOs, discuss the positive impact they can have and provide you with plenty of ready-to-use scripts you can use to save time with your technical SEO efforts.
Sara Harkousse - "Web Components: It's all rainbows and unicorns! Is it?"IT Event
Web components are a tale of four w3c specifications. They are a hot topic now. We’ve all seen big headlines, for instance, “The Web Components revolution”, “Web Components are a game changer”, “A Tectonic Shift for Web Development”, … and so many others. They are certainly exciting and promising, nevertheless, there are some factors holding them back such as performance issues and lack of browser support. Some features seems to be more hassle than they’re worth. In this talk you’ll examine web components from a pragmatic stand point. So if you want to start using web components in production, come to learn what features can you use today. Actually, despite the still short browser support, some of web components features seems to be the best choice to start with . The assessment you’ll learn is the reflection of my personal research and work on my spare time and also feedbacks from my co-workers.
BrightonSEO, July 2021 - To better understand a website's content search engines developed Web Rendering Services and are now able to render pages more or less like a normal user. Those Web Rendering Services are strictly connected to other phases of the crawling-indexing-ranking pipeline - if a rendering fails, it may affect all of them. In this session Giacomo will guide you through the process of understanding why rendering could be a problem also for non-Javascript pages, how to manually debug page rendering, the difference between understanding WRSs' capabilities and debugging problems on a website, and eventually how to test pages at scale.
TechSEO Boost 2018: The Statelessness of Technical SEOCatalyst
Is there a state of technical SEO? Or is it simply a function of what’s happening with tech in broader disciplines like data/analytics, digital marketing, and web development? Join this session to get Mike King’s take on the state, or rather, statelessness of technical SEO. He’ll share the things you need to know to build an authentic picture of your technical SEO program and get on Google’s level.
TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...Catalyst
View the recording here: https://www.catalystdigital.com/techseoboost/#on-demand-recordings
Jamie Indigo, Technical SEO Consultant, DeepCrawl
Everyone's buzzing about Core Web Vitals, but we often overlook critical context-- these metrics are the results of how we render content. Let's deep dive into render mechanics, strategies, performance measurement and what's the best fit for your site. As with most things SEO: it's not about what you do; it's how you do it.
Managing Director of iPullRank, Mike King, talks about how to leverage automated testing to ensure that developers engaged in Continuous Integration don't end up accidentally breaking the optimizations in place for SEO
Hreflang - why and how and why not for International SEOGerry White
Hreflang has been the most challenging, one of the most complicated and almost unpredictable elements of technical SEO, and after doing hundreds of implementations Gerry White and Rise at Seven have learnt a ton about the reasons it can go wrong, why Google really does just treat it as a hint, not a directive and why even when it is all perfect, it can still no achieve the desired results. This will explain how to look at the data, how to evaluate the results and monitor it.
rel canonical audit BrightonSEO September 2018Mark Thomas
Why taking a look at your canonical setup is a smart move... Some data to compare with, an audit checklist, and a number of Google's quotes over the years.
Solving Complex JavaScript Issues and Leveraging Semantic HTML5Hamlet Batista
On this presentation we go deep on Chrome developer tools, JS debugger and breakpoints, technical optimization and capabilities of browser service workers to improve SEO and performance
SEO for Large/Enterprise Websites - Data & Tech SideDominic Woodman
There is a lot to cover about SEO for large websites/enterprise.
In this talk we'll cover primarily the data analysis and the technical SEO side of things. In future presentations we'll look at more.
TechSEO Boost 2018: Internal Link Optimization on SteroidsCatalyst
SEO for large sites is completely different than SEO for smaller sites. Large sites have a strong (yet often overlooked!) lever that can boost rankings: internal linking! However, it can be challenging to understand which pages have the highest PageRank, so that you can tweak them to serve important pages better. That can only be determined when you combine internal and external PageRank. Join Kevin Indig as he presents an innovative approach that merges data from crawls, log files, and backlinks to solve the puzzle! You’ll learn how to:
• Combine crawls, log files, and backlinks to find weaknesses in your internal linking structure
• Analyze the impact of tweaking internal linking before you deploy the changes
• Understand how to tweak internal linking at scale
TechSEO Boost 2017: SEO Best Practices for JavaScript T-Based WebsitesCatalyst
While providing a dynamic and fast user experience, JavaScript-based sites (SPAs/PWAs) are not always “SEO friendly.” Therefore, it is crucial for developers to understand how search engines crawl, parse, eventually render, and index dynamic websites, to make sure bots get the experience they developed and the content of the site.
UK Top 5,000 Websites; Mobile Site Speed Benchmark - BrightonSEOErudite
At Erudite we like to conduct our own R and D so that we truly understand the competitive landscape. We analysed the Lighthouse speed metrics of 5,000 of the UKs top websites, and categorised them by channel, so that we can better understand mobile site speed in the context of competition.
We’ve analysed the SEO of over 100 eCom sites - this is what we’ve learned!DanielCartland
Working exclusively with eCommerce websites across a whole range of CMS, we've seen (almost) everything, from accidental overnight noindexing of entire websites, to broken dynamic rendering stopping only poor Bingbot from crawling a site.
But what we find most fascinating are the trends that arise most frequently, often transcending the boundaries of website size, niche, and brand: - What opportunities do most eCommerce sites miss? - What problems are frequently holding eCom brands from product goldmines? - What have we found to be the most effective ways of selling the ROI of SEO and getting our recommendations implemented above the inevitable development backlogs?
This talk aims to outline what we've learned about the most common issues and opportunities eCommerce websites face, with data-backed examples of what websites can do today to boost SEO and make an immediate difference in organic traffic and revenue.
Getting Started with Python and Machine Learning for SEO | BrightonSEO Octobe...Ruth Everett
Find out how Python and Machine Learning can be used to save you time and help you understand your website better.
This session will cover how advancements to Python and Machine Learning are changing the game for busy SEOs, discuss the positive impact they can have and provide you with plenty of ready-to-use scripts you can use to save time with your technical SEO efforts.
Sara Harkousse - "Web Components: It's all rainbows and unicorns! Is it?"IT Event
Web components are a tale of four w3c specifications. They are a hot topic now. We’ve all seen big headlines, for instance, “The Web Components revolution”, “Web Components are a game changer”, “A Tectonic Shift for Web Development”, … and so many others. They are certainly exciting and promising, nevertheless, there are some factors holding them back such as performance issues and lack of browser support. Some features seems to be more hassle than they’re worth. In this talk you’ll examine web components from a pragmatic stand point. So if you want to start using web components in production, come to learn what features can you use today. Actually, despite the still short browser support, some of web components features seems to be the best choice to start with . The assessment you’ll learn is the reflection of my personal research and work on my spare time and also feedbacks from my co-workers.
Streaming Trend Discovery: Real-Time Discovery in a Sea of Events with Scott ...Databricks
Time is the one thing we can never get in front of. It is rooted in everything, and “timeliness” is now more important than ever especially as we see businesses automate more and more of their processes. This presentation will scratch the surface of streaming discovery with a deeper dive into the telecommunications space where it is normal to receive billions of events a day from globally distributed sub-systems and where key decisions “must” be automated.
We’ll start out with a quick primer on telecommunications, an overview of the key components of our architecture, and make a case for the importance of “ringing”. We will then walk through a simplified solution for doing windowed histogram analysis and labeling of data in flight using Spark Structured Streaming and mapGroupsWithState. I will walk through some suggestions for scaling up to billions of events, managing memory when using the spark StateStore as well as how to avoid pitfalls with the serialized data stored there.
What you’ll learn:
1. How to use the new features of Spark 2.2.0 (mapGroupsWithState / StateStore)
2. How to bucket and analyze data in the streaming world
3. How to avoid common Serialization mistakes (eg. how to upgrade application code and retain stored state)
4. More about the telecommunications space than you’ll probably want to know!
5. Learn a new approach to building applications for enterprise and production.
Assumptions:
1. You know Scala – or want to know more about it.
2. You have deployed spark to production at your company or want to
3. You want to learn some neat tricks that may save you tons of time!
Take Aways:
1. Fully functioning spark app – with unit tests!
Taking Jupyter Notebooks and Apache Spark to the Next Level PixieDust with Da...Databricks
PixieDust is a new open source library that helps data scientists and developers working in Jupyter Notebooks and Apache Spark be more efficient. PixieDust speeds up data manipulation and display with features like: auto-visualization of Spark DataFrames, real-time Spark job progress monitoring, automated local install of Python and Scala kernels running with Spark, and much more.
Come along and learn how you can use this tool in your own projects to visualize and explore data effortlessly with no coding. Oh, and if you prefer working with a Scala Notebook, this session is also for you, as PixieDust can also run on a Scala Kernel. Imagine being able to visualize your favorite Python chart engines from a Scala Notebook!
We’ll finish the session with a demo combining Twitter, Watson Tone Analyzer, Spark Streaming, and some fun real-time visualizations–all running within a Notebook.
Technical SEO: Crawl Space Management - SEOZone Istanbul 2014Bastian Grimm
My talk at #SEOZone 2014 in Istanbul covering various aspects of crawl space optimization such as crawler control & indexation strategies as well as site speed.
Codemotion Milan 2018 - AI with a devops mindset: experimentation, sharing an...Thiago de Faria
AI is the buzzword while ML is the underlying component... but when do we use ML? To solve problems that machines can find patterns without explicitly programming them to do so. But do you have a team building an ML model? How far are they from the IT team? Do they know how to deploy and serve that? Testing? And sharing what they have done? That's where a devops mindset comes in: reduce the batch size, continuous-everything and a culture of failure/experimentation are vital for your data team! In the end, I will show how the workflow of a data scientist can be in real life with a live demo!
Thiago de Faria - AI with a devops mindset - experimentation, sharing and eas...Codemotion
AI is the buzzword while ML is the underlying component... but when do we use ML? To solve problems that machines can find patterns without explicitly programming them to do so. But do you have a team building an ML model? How far are they from the IT team? Do they know how to deploy and serve that? Testing? And sharing what they have done? That's where a devops mindset comes in: reduce the batch size, continuous-everything and a culture of failure/experimentation are vital for your data team! In the end, I will show how the workflow of a data scientist can be on the real life with a live demo!
Using Cool New Frameworks in (Mobile) Domino AppsTeamstudio
Did you know there's an abundance of cool CSS and JS frameworks out there? Have you ever wanted to find out how you can use them in your own (mobile) Domino apps? Theo Heselmans shows real world applications from his customers using some of these powerful frameworks inside Domino. Explore with us on how they integrated Bootstrap, Ratchet.js, Knockout.js, Backbone.js, Underscore.js, jQuery.js and more!
Dart on Arm - Flutter Bangalore June 2021Chris Swan
Running Dart on Arm servers, covering the trade offs between JIT and AOT. The dependencies needed for building and running AOT binaries, and how to cross compile Arm binaries.
Developers have spent countless hours working on a plugin that seems to be built just for you, just for this moment. If only it wouldn’t…. This is a problem nearly every developer faces at some point during a project. Knowing how to safely customize functionality is key to building products that will stand the test of time (or major updates.)
This talk will cover the following topics:
Hooks
Filters
Overwriting Functions
Modifying Returned content
Requesting a change to a plugin
This talk will define and provide a use case for each of the possible pathways listed above. This presentation is for beginning to intermediate developers and though I will use as little code as possible, given the nature of this talk I would imagine each item will require 1 code example. The slides will be available ahead of time on slideshare and the code examples will be available on GitHub.
Achieving Technical Excellence in Your Software Teams - from Devternity Peter Gfader
Our industry has a problem: We are not lacking software methodologies, programming languages, tools or frameworks but we need great software engineers.
Great software engineer teams build quality-in and deliver great software on a regular basis. The technical excellence of those engineers will help you escape the "Waterfall sandwich" and make your organization a little more agile, from the inception of an idea till they go live.
I will talk about my experiences from the last 15 years, including small software delivery teams until big financial institutions.
Why would a company like to be "agile"?
How can a company achieve that?
How can you achieve Technical Excellence in your software teams?
What developer skills are more important than languages, methods or frameworks?
This will be an interactive session with a Q&A at the end.
Explore our comprehensive data analysis project presentation on predicting product ad campaign performance. Learn how data-driven insights can optimize your marketing strategies and enhance campaign effectiveness. Perfect for professionals and students looking to understand the power of data analysis in advertising. for more details visit: https://bostoninstituteofanalytics.org/data-science-and-artificial-intelligence/
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Discussion on Vector Databases, Unstructured Data and AI
https://www.meetup.com/unstructured-data-meetup-new-york/
This meetup is for people working in unstructured data. Speakers will come present about related topics such as vector databases, LLMs, and managing data at scale. The intended audience of this group includes roles like machine learning engineers, data scientists, data engineers, software engineers, and PMs.This meetup was formerly Milvus Meetup, and is sponsored by Zilliz maintainers of Milvus.
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...pchutichetpong
M Capital Group (“MCG”) expects to see demand and the changing evolution of supply, facilitated through institutional investment rotation out of offices and into work from home (“WFH”), while the ever-expanding need for data storage as global internet usage expands, with experts predicting 5.3 billion users by 2023. These market factors will be underpinned by technological changes, such as progressing cloud services and edge sites, allowing the industry to see strong expected annual growth of 13% over the next 4 years.
Whilst competitive headwinds remain, represented through the recent second bankruptcy filing of Sungard, which blames “COVID-19 and other macroeconomic trends including delayed customer spending decisions, insourcing and reductions in IT spending, energy inflation and reduction in demand for certain services”, the industry has seen key adjustments, where MCG believes that engineering cost management and technological innovation will be paramount to success.
MCG reports that the more favorable market conditions expected over the next few years, helped by the winding down of pandemic restrictions and a hybrid working environment will be driving market momentum forward. The continuous injection of capital by alternative investment firms, as well as the growing infrastructural investment from cloud service providers and social media companies, whose revenues are expected to grow over 3.6x larger by value in 2026, will likely help propel center provision and innovation. These factors paint a promising picture for the industry players that offset rising input costs and adapt to new technologies.
According to M Capital Group: “Specifically, the long-term cost-saving opportunities available from the rise of remote managing will likely aid value growth for the industry. Through margin optimization and further availability of capital for reinvestment, strong players will maintain their competitive foothold, while weaker players exit the market to balance supply and demand.”
15. 1. A (mini) intro to PYTHON
2. Why Streamlit?
3. A look at Streamlit functions
4. Building our 1st app
5. More advanced examples
6. More tips to help you start
Agenda
@DataChaz #BrightonSEO
47. ML for SEO -> 🔥
● Content generation via GPT-3
● Keyword/Intent classification via Deep Learning
● Anomaly & opportunity detection via clustering
● Forecast data series
● Predict *some* ranking factors
@DataChaz #BrightonSEO
52. Ubiquitous. And for good reasons!
• Instant feedback
• Access to the whole Python eco-system
• Free GPU = Massive horsepower for
machine learning computation!
52
@DataChaz #BrightonSEO
61. Notebooks = bad coding habits!
• Little for exception handling!
• Messed-up hidden state
• Don’t play nice w. CI & source control
• …. I’ll stop there!
@DataChaz #BrightonSEO
62. What Notebooks are good for!
• Scribbling
• Explore your data
• Document your processes
• Prototyping models
@DataChaz #BrightonSEO
74. Cache data = skip redundant compute
• Decorate any function
• Streamlit will cache its results!
• Can be dataframes transformations, CSV load etc.
@DataChaz #BrightonSEO
75. Big data ready! 🙌
• Connect to databases
• Connect to Bigquery, Azure, AWS
• Big data libraries like PySpark
@DataChaz #BrightonSEO
130. • Uses advanced fuzzy matching techniques
• Map keywords to your crawl’s URLs
• Use the code and tweak it for your own projects! 🎉
@DataChaz #BrightonSEO
132. ● Upload a Screaming Frog crawl
● Paste keywords (e.g., internal
search terms)
● Auto-map them to URLs
● Get accuracy scores
● Download results to CSV
133. Install files on local machine
● Create a folder on your local machine
● Drop the downloaded files in that folder
● Head to that folder via cd
● pip install streamlit polyfuzz
@DataChaz #BrightonSEO
https://github.com/CharlyWargnier/KeywordMapperForBrightonSEO
137. Add a CSV uploader
@DataChaz #BrightonSEO
https://github.com/CharlyWargnier/KeywordMapperForBrightonSEO
138. Add a text area for keywords
https://github.com/CharlyWargnier/KeywordMapperForBrightonSEO
139. Apply the fuzzy matching function
@DataChaz #BrightonSEO
https://github.com/CharlyWargnier/KeywordMapperForBrightonSEO
140. Add a formatted table
@DataChaz #BrightonSEO
https://github.com/CharlyWargnier/KeywordMapperForBrightonSEO
141. Add a Download button
@DataChaz #BrightonSEO
https://github.com/CharlyWargnier/KeywordMapperForBrightonSEO
142. Here and there, I’ve also added…
@DataChaz #BrightonSEO
● Some headers
● Some warning messages
● Some exceptions via ‘Try/Except’
https://github.com/CharlyWargnier/KeywordMapperForBrightonSEO
143. Type ‘streamlit run app.py’ in
your command prompt
Voila! 🙌
@DataChaz #BrightonSEO
https://github.com/CharlyWargnier/KeywordMapperForBrightonSEO
148. Once you have your invite…
@DataChaz #BrightonSEO
● Create a Github repo
● Upload all your files in that repo
● Go to https://share.streamlit.io/deploy
152. • Uses the Google language API
• Extract & analyse entities at scale
• Directly from web pages!
▶️ https://streamea-entity-analyzer.herokuapp.com/
@DataChaz #BrightonSEO
154. • Bulk retrieve auto-complete suggestions
• From Google and Bing
• Choose depth
• Visualise in tree/tabular form
• Export results to csv
▶️ https://streamsuggest.herokuapp.com/
155. • Retrieve entities from
Wikipedia URLs or topics!
• Get interactive graphs of
connected entities
• Get entity types + salience
scores
• Export results to CSV!
• (Original script from
@jroakes)
▶️ https://share.streamlit.io/charlywargnier/s4_wiki_topic_grapher/main/app.py
162. Tip 01: Data blending ideas
• Blend your client’s data…
• … with Google Analytics or GSC
• Create a Streamlit app!
@DataChaz #BrightonSEO
163. Tip 02: Start ditching Excel!
• Think of transfos you’re doing all the time!
• Break in smaller, easy chunks
• Code these in your IDE, or a Colab
• Convert to Streamlit! 🔥
@DataChaz #BrightonSEO
164. Tip 03: Play with APIs!
• Pick an API
• Choose your parameters
• Build a Streamlit GUI for it!
@DataChaz #BrightonSEO