In this data management session, Christopher describes how to build robust and reliable data products in BigQuery and dbt, for PPC and SEO use cases. After an introduction to the modern data stack, six principles of reliable data products are presented, followed by the following use cases:
- Google Ads Conversion upload
- SEO sitemap efficiency report
- Google Shopping product rating sync
- Large-Scale link checker with advertools
- Inventory-based PPC campaigns with dbt
Here is the referenced selection of gists on github: https://gist.github.com/ChrisGutknecht
Your Raw Data is Ready - Introduction to Analytics Engineering | SMX Advanced...Christopher Gutknecht
In this SMX Advanced 2022 session, Christopher talks about the potential of working with raw data and how to properly approach the task of transforming raw data into high-quality reusable tables in your data warehouse. dbt as a transformation framework plays a key role in delivering quality and structure for this process. The chance for search marketers is to acquire data modeling skills and learn to build their own custom data products around Google Ads and Google Analytics. This talk is not just for inhouse-teams, also for agencies seeking to extend their services into data management.
Building a Marketing Data Warehouse from Scratch - SMX Advanced 202Christopher Gutknecht
This deck covers the journey of starting with BigQuery, adding more data sources and building a process around your data warehouse. It covers the three phases greenfield, dashboards and operational analytics and the necessary data components.
The code for uploading your product feed can be found here:
https://gist.github.com/ChrisGutknecht/fde93092e21039299ab76715596eac01
If you have any questions, reach out to me on Linkedin!
Scaling Search Campaigns With Bulk Uploads and Ad Customizers (SMX 2023)Christopher Gutknecht
Bulk uploads are a great choice for large-scale automation, as they can handle larger volumes than Ads scripts but without the complexity of the Google Ads API. In this session, Christopher will show what you can update via MCC-level bulk uploads and what not, including restrictions and workarounds.
Different to Ads scripts, the preparation of data has to happen elsewhere: Christopher will explain the Bergzeit approach of preparing all search campaign data in SQL and BigQuery, leveraging the Google Ads Transfer data and the dbt framework for transformations. The full github repository will be shared that you can build your own search automation platform for highly customisable account structure setups, including extensive use of ad customisers via structured data. The common campaign structure considerations arecovered step by step. This session will start with the basics of bulk uploads and finish off with advanced tactics.
Here is the link to the github repository: https://github.com/ChrisGutknecht/inventory_campaigns
How to build a holistic search strategy that worksHannahIJohnson1
Far too often, SEM and SEO are managed in silos, leaving users with a disjointed search experience and causing brands to lose potential customers. SEO and SEM are complementary channels, which requires both teams to be unified in order to optimize the user experience and drive conversions.
This presentation goes over:
Educate key stakeholders about why the shift to holistic search is necessary
Evaluate your program from search query to conversion to identify areas of opportunity and growth
Understand how those elements translate to day-to-day account management
Create the plan for how your teams will work together moving forward
The Python Cheat Sheet for the Busy MarketerHamlet Batista
What percentage of an Inbound marketer's day doesn't involve working with spreadsheets? How much of this work is time-consuming and repetitive? In this interactive session, you will learn how to manipulate Google Sheets to automate common data analysis workflows using Python, a very easy to use programming language.
In this webinar you'll learn about the best practices for Google BigQuery—and how Matillion ETL makes loading your data faster and easier. Find out from our experts how to leverage one of the largest, fastest, and most capable cloud data warehouses to improve your business and save money.
In this webinar:
- Discover how to work fast and efficiently with Google BigQuery
- Find out the best ways to monitor and control costs
- Learn to leverage Matillion ETL and optimize Google BigQuery
- Get tips and tricks for better performance
Wish you knew the secrets to getting your written content seen and ranked faster?
Want to know what’s slowing it all down?
We can help you understand how your content gets seen and crawled by Google so you can create naturally higher ranking pages, blogs, and more.
Once you see your content how Google sees it, you can easily determine how to get it live on the SERPs – faster.
Real-time log file insights can become your secret ingredient to better content and SEO.
Join our webinar to learn how you can easily use log file insights to improve crawling, indexing, and ranking higher with content.
You'll discover how to:
- Get content crawled, indexed, and ranked faster.
- How to use log file insights for effective content.
- Create far better SEO forecasts.
Watch Conductor's special guest, Steven van Vessum, Director of Organic Marketing at ContentKing, as he answers your most pressing crawl questions and how it relates to content.
Get a better idea of how content impacts the behavior of Google's crawlers and how to use it to rank higher.
Most organizations still produce content each month without reflecting critically on how quickly these assets were crawled and indexed.
Making future content plans is still a guessing game, but this webinar will get you up to speed.
PubCon, Lazarina Stoy. - Machine Learning in Search: Google's ML APIs vs Open...LazarinaStoyanova
In this presentation, I go through the different use cases of machine learning APIs for search marketers and digital marketers. Specifically, I look at APIs by Google Cloud and OpenAI and identify the best to use for your SEO projects, based on the task at hand.
Your Raw Data is Ready - Introduction to Analytics Engineering | SMX Advanced...Christopher Gutknecht
In this SMX Advanced 2022 session, Christopher talks about the potential of working with raw data and how to properly approach the task of transforming raw data into high-quality reusable tables in your data warehouse. dbt as a transformation framework plays a key role in delivering quality and structure for this process. The chance for search marketers is to acquire data modeling skills and learn to build their own custom data products around Google Ads and Google Analytics. This talk is not just for inhouse-teams, also for agencies seeking to extend their services into data management.
Building a Marketing Data Warehouse from Scratch - SMX Advanced 202Christopher Gutknecht
This deck covers the journey of starting with BigQuery, adding more data sources and building a process around your data warehouse. It covers the three phases greenfield, dashboards and operational analytics and the necessary data components.
The code for uploading your product feed can be found here:
https://gist.github.com/ChrisGutknecht/fde93092e21039299ab76715596eac01
If you have any questions, reach out to me on Linkedin!
Scaling Search Campaigns With Bulk Uploads and Ad Customizers (SMX 2023)Christopher Gutknecht
Bulk uploads are a great choice for large-scale automation, as they can handle larger volumes than Ads scripts but without the complexity of the Google Ads API. In this session, Christopher will show what you can update via MCC-level bulk uploads and what not, including restrictions and workarounds.
Different to Ads scripts, the preparation of data has to happen elsewhere: Christopher will explain the Bergzeit approach of preparing all search campaign data in SQL and BigQuery, leveraging the Google Ads Transfer data and the dbt framework for transformations. The full github repository will be shared that you can build your own search automation platform for highly customisable account structure setups, including extensive use of ad customisers via structured data. The common campaign structure considerations arecovered step by step. This session will start with the basics of bulk uploads and finish off with advanced tactics.
Here is the link to the github repository: https://github.com/ChrisGutknecht/inventory_campaigns
How to build a holistic search strategy that worksHannahIJohnson1
Far too often, SEM and SEO are managed in silos, leaving users with a disjointed search experience and causing brands to lose potential customers. SEO and SEM are complementary channels, which requires both teams to be unified in order to optimize the user experience and drive conversions.
This presentation goes over:
Educate key stakeholders about why the shift to holistic search is necessary
Evaluate your program from search query to conversion to identify areas of opportunity and growth
Understand how those elements translate to day-to-day account management
Create the plan for how your teams will work together moving forward
The Python Cheat Sheet for the Busy MarketerHamlet Batista
What percentage of an Inbound marketer's day doesn't involve working with spreadsheets? How much of this work is time-consuming and repetitive? In this interactive session, you will learn how to manipulate Google Sheets to automate common data analysis workflows using Python, a very easy to use programming language.
In this webinar you'll learn about the best practices for Google BigQuery—and how Matillion ETL makes loading your data faster and easier. Find out from our experts how to leverage one of the largest, fastest, and most capable cloud data warehouses to improve your business and save money.
In this webinar:
- Discover how to work fast and efficiently with Google BigQuery
- Find out the best ways to monitor and control costs
- Learn to leverage Matillion ETL and optimize Google BigQuery
- Get tips and tricks for better performance
Wish you knew the secrets to getting your written content seen and ranked faster?
Want to know what’s slowing it all down?
We can help you understand how your content gets seen and crawled by Google so you can create naturally higher ranking pages, blogs, and more.
Once you see your content how Google sees it, you can easily determine how to get it live on the SERPs – faster.
Real-time log file insights can become your secret ingredient to better content and SEO.
Join our webinar to learn how you can easily use log file insights to improve crawling, indexing, and ranking higher with content.
You'll discover how to:
- Get content crawled, indexed, and ranked faster.
- How to use log file insights for effective content.
- Create far better SEO forecasts.
Watch Conductor's special guest, Steven van Vessum, Director of Organic Marketing at ContentKing, as he answers your most pressing crawl questions and how it relates to content.
Get a better idea of how content impacts the behavior of Google's crawlers and how to use it to rank higher.
Most organizations still produce content each month without reflecting critically on how quickly these assets were crawled and indexed.
Making future content plans is still a guessing game, but this webinar will get you up to speed.
PubCon, Lazarina Stoy. - Machine Learning in Search: Google's ML APIs vs Open...LazarinaStoyanova
In this presentation, I go through the different use cases of machine learning APIs for search marketers and digital marketers. Specifically, I look at APIs by Google Cloud and OpenAI and identify the best to use for your SEO projects, based on the task at hand.
Join 20-year SEO veteran Ryan Huser as he explores the transformative intersection of Generative AI and SEO in his talk, ""Generative AI: The new Wild West of SEO"". The presentation will discuss how tech giants Bing and Google have enhanced the search experience using Generative AI. It will also unpack the wealth of options available to marketers, and how they can use these innovations for content creation and search engine optimization. Ryan's talk promises to offer invaluable insights into the emerging landscape of AI-driven SEO, emphasizing its profound implications for digital marketing strategies.
Data Restart 2022: Roman Appeltauer - Aktivace first-party dat pomocí SGTMTaste
Server-side GTM není jen k měření ze serveru. Skvěle slouží i jako real-time integrační nástroj, který šetří kapacity a focus vývojářů, a přitom efektivně a kontrolovaně předává data mezi aplikacemi, které si samy povídat neumí, nebo ne tak, jak potřebujete.
[LondonSEO 2020] BigQuery & SQL for SEOsAreej AbuAli
In this talk, Areej will share her learning process, how SEOs can get acquainted with the world of BigQuery and why SQL is the new and improved Excel. The audience will walk away with a handful of scripts and tips to get their BigQuery journey started!
Semantic Publishing and Entity SEO - Conteference 20-11-2022Massimiliano Geraci
Semantic Publishing is publishing a page on the Internet by adding a semantic layer (i.e., semantic enrichment) in the form of structured data that describes the page itself.
BI Consultancy - Data, Analytics and StrategyShivam Dhawan
The presentation describes my views around the data we encounter in digital businesses like:
- Looking at common Data collection methodologies,
-What are the common issues within the decision support system and optimiztion lifecycle,
- Where are most of failing?
and most importantly, "How to connect the dots and move from Data to Strategy?"
I work with all facets of Web Analytics and Business Strategy and see the structures and governance models of various domains to establish and analyze the key performance indicators that allow you to have a 360º overview of online and offline multi-channel environment.
Apart from my experience with the leading analytic tools in the market like Google Analytics, Omniture and BI tools for Big Data, I am developing new solutions to solve complex digital / business problems.
As a resourceful consultant, I can connect with your team in any modality or in any form that meets your needs and solves any data/strategy problem.
2021-07-16 잔디콘 시즌6 발표자료
- 베이지안 방법론에 Expected Loss를 활용하여 프로덕트 개선 속도를 높이는 방법에 대해 알아봅니다.
- 개발자 및 통계학 전공자가 아닌 분들을 대상으로 한 발표입니다. 다소 엄밀하지 못한 설명이 포함되었을 수 있으니 양해 부탁드립니다. 잘못된 부분은 답글로 달아주시면 감사하겠습니다.
원본 파일은 다음 링크로 다운로드 받으실 수 있습니다 :)
https://www.dropbox.com/s/zo1er99muu2oj5l/leeminho_til6_bayesian_abtest.pdf?dl=0
Retail Analytics and BI with Looker, BigQuery, GCP & Leigha JarettDaniel Zivkovic
Leigha Jarett of GCP explains how to bring Cloud "superpowers" to your Data and modernize your Business Intelligence with Looker, BigQuery and Google Cloud services on an example of Cymbal Direct - one of Google Cloud's demo brands. The meetup recording with TOC for easy navigation is at https://youtu.be/BpzJU_S40ic.
P.S. For more interactive lectures like this, go to http://youtube.serverlesstoronto.org/ or sign up for our upcoming live events at https://www.meetup.com/Serverless-Toronto/events/
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
Developing a Data Strategy for your organization can seem like a daunting task – but it’s worth the effort. Getting your Data Strategy right can provide significant value, as data drives many of the key initiatives in today’s marketplace, from digital transformation to marketing, customer centricity, population health, and more. This webinar will help demystify Data Strategy and its relationship to Data Architecture and will provide concrete, practical ways to get started.
A beginner's guide to machine learning for SEOs - WTSFest 2022LazarinaStoyanova
This is a guide for machine learning for beginners, tailored to the SEO industry, aimed at breaking down the challenges that hold us back from experimenting, the breakdown of machine learning's main characteristics to help us understand how to implement it a bit better, and the ways we can embed advanced technology into our daily practice.
Whether you’re an agency spending days doing keyword research to build an information architecture, a marketplace wanting to A/B test internal linking across 100,000 pages or a classifieds site pruning millions of pages, no-code automation offers a way to do SEO more quickly, scalably, holistically and portably.
This presentations explains what exactly no-code automation for SEO is, what it isn’t, what data sources it can combine and how Similar.ai has used it to drive organic traffic, improve rankings and grow revenue.
Discover, pa’ tipos como tú: Los 13 factores para disparar tu tráficoClara Soteras
Ponencia sobre Google Discover en Sidralytics 2023. San Sebastián.
Google Discover es uno de los canales de captación de tráfico más importantes actualmente para la mayoría de medios de comunicación juntamente con las breaking news y el trabajo en real-time, pero ¿sabías que todos los factores que aplican los periódicos digitales para posicionar en Discover también pueden aplicarse a tu web?
En esta charla, Clara nos contará cuáles son los factores disparadores para que tus contenidos puedan aparecer en Discover y multiplicar tu tráfico con una estrategia más allá del posicionamiento orgánico en la SERP. Aprende a aprovechar las entidades y tendencias del momento para apretar el acelerador de tu (coche).
Supercharge your data analytics with BigQueryMárton Kodok
Powering interactive data analysis require massive architecture, and Know-How to build a fast real-time computing system. BigQuery solves this problem by enabling super-fast, SQL-like queries against petabytes of data using the processing power of Google’s infrastructure. We will cover its core features, creating tables, columns, views, working with partitions, clustering for cost optimizations, streaming inserts, User Defined Functions, and several use cases for everydaay developer: funnel analytics, behavioral analytics, exploring unstructured data.
The other part will be about BigQuery ML, which enables users to create and execute machine learning models in BigQuery using standard SQL queries. BigQuery ML democratizes machine learning by enabling SQL practitioners to build models using existing SQL tools and skills. BigQuery ML increases development speed by eliminating the need to move data.
Applying BigQuery ML on e-commerce data analyticsMárton Kodok
With BigQuery ML, you can build machine learning models without leaving the database environment and training it on massive datasets. We are going to demonstrate common marketing Machine Learning use cases we do at REEA.net to build, train, eval and predict, your own scalable machine learning models using SQL language in Google BigQuery and to address the following use cases:
Customer Segmentation
Customer Lifetime Value (LTV) prediction
Conversion/Purchase prediction
The audience will get first hand experience how to write CREATE MODEL sql syntax to build machine learning models such as:
Multiclass logistic regression for classification
K-means clustering
Import TensorFlow models for prediction in BigQuery
Models are trained and accessed in BigQuery using SQL — a language data analysts know. This enables business decision making through predictive analytics across the organization without leaving the query editor
Join 20-year SEO veteran Ryan Huser as he explores the transformative intersection of Generative AI and SEO in his talk, ""Generative AI: The new Wild West of SEO"". The presentation will discuss how tech giants Bing and Google have enhanced the search experience using Generative AI. It will also unpack the wealth of options available to marketers, and how they can use these innovations for content creation and search engine optimization. Ryan's talk promises to offer invaluable insights into the emerging landscape of AI-driven SEO, emphasizing its profound implications for digital marketing strategies.
Data Restart 2022: Roman Appeltauer - Aktivace first-party dat pomocí SGTMTaste
Server-side GTM není jen k měření ze serveru. Skvěle slouží i jako real-time integrační nástroj, který šetří kapacity a focus vývojářů, a přitom efektivně a kontrolovaně předává data mezi aplikacemi, které si samy povídat neumí, nebo ne tak, jak potřebujete.
[LondonSEO 2020] BigQuery & SQL for SEOsAreej AbuAli
In this talk, Areej will share her learning process, how SEOs can get acquainted with the world of BigQuery and why SQL is the new and improved Excel. The audience will walk away with a handful of scripts and tips to get their BigQuery journey started!
Semantic Publishing and Entity SEO - Conteference 20-11-2022Massimiliano Geraci
Semantic Publishing is publishing a page on the Internet by adding a semantic layer (i.e., semantic enrichment) in the form of structured data that describes the page itself.
BI Consultancy - Data, Analytics and StrategyShivam Dhawan
The presentation describes my views around the data we encounter in digital businesses like:
- Looking at common Data collection methodologies,
-What are the common issues within the decision support system and optimiztion lifecycle,
- Where are most of failing?
and most importantly, "How to connect the dots and move from Data to Strategy?"
I work with all facets of Web Analytics and Business Strategy and see the structures and governance models of various domains to establish and analyze the key performance indicators that allow you to have a 360º overview of online and offline multi-channel environment.
Apart from my experience with the leading analytic tools in the market like Google Analytics, Omniture and BI tools for Big Data, I am developing new solutions to solve complex digital / business problems.
As a resourceful consultant, I can connect with your team in any modality or in any form that meets your needs and solves any data/strategy problem.
2021-07-16 잔디콘 시즌6 발표자료
- 베이지안 방법론에 Expected Loss를 활용하여 프로덕트 개선 속도를 높이는 방법에 대해 알아봅니다.
- 개발자 및 통계학 전공자가 아닌 분들을 대상으로 한 발표입니다. 다소 엄밀하지 못한 설명이 포함되었을 수 있으니 양해 부탁드립니다. 잘못된 부분은 답글로 달아주시면 감사하겠습니다.
원본 파일은 다음 링크로 다운로드 받으실 수 있습니다 :)
https://www.dropbox.com/s/zo1er99muu2oj5l/leeminho_til6_bayesian_abtest.pdf?dl=0
Retail Analytics and BI with Looker, BigQuery, GCP & Leigha JarettDaniel Zivkovic
Leigha Jarett of GCP explains how to bring Cloud "superpowers" to your Data and modernize your Business Intelligence with Looker, BigQuery and Google Cloud services on an example of Cymbal Direct - one of Google Cloud's demo brands. The meetup recording with TOC for easy navigation is at https://youtu.be/BpzJU_S40ic.
P.S. For more interactive lectures like this, go to http://youtube.serverlesstoronto.org/ or sign up for our upcoming live events at https://www.meetup.com/Serverless-Toronto/events/
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
Developing a Data Strategy for your organization can seem like a daunting task – but it’s worth the effort. Getting your Data Strategy right can provide significant value, as data drives many of the key initiatives in today’s marketplace, from digital transformation to marketing, customer centricity, population health, and more. This webinar will help demystify Data Strategy and its relationship to Data Architecture and will provide concrete, practical ways to get started.
A beginner's guide to machine learning for SEOs - WTSFest 2022LazarinaStoyanova
This is a guide for machine learning for beginners, tailored to the SEO industry, aimed at breaking down the challenges that hold us back from experimenting, the breakdown of machine learning's main characteristics to help us understand how to implement it a bit better, and the ways we can embed advanced technology into our daily practice.
Whether you’re an agency spending days doing keyword research to build an information architecture, a marketplace wanting to A/B test internal linking across 100,000 pages or a classifieds site pruning millions of pages, no-code automation offers a way to do SEO more quickly, scalably, holistically and portably.
This presentations explains what exactly no-code automation for SEO is, what it isn’t, what data sources it can combine and how Similar.ai has used it to drive organic traffic, improve rankings and grow revenue.
Discover, pa’ tipos como tú: Los 13 factores para disparar tu tráficoClara Soteras
Ponencia sobre Google Discover en Sidralytics 2023. San Sebastián.
Google Discover es uno de los canales de captación de tráfico más importantes actualmente para la mayoría de medios de comunicación juntamente con las breaking news y el trabajo en real-time, pero ¿sabías que todos los factores que aplican los periódicos digitales para posicionar en Discover también pueden aplicarse a tu web?
En esta charla, Clara nos contará cuáles son los factores disparadores para que tus contenidos puedan aparecer en Discover y multiplicar tu tráfico con una estrategia más allá del posicionamiento orgánico en la SERP. Aprende a aprovechar las entidades y tendencias del momento para apretar el acelerador de tu (coche).
Supercharge your data analytics with BigQueryMárton Kodok
Powering interactive data analysis require massive architecture, and Know-How to build a fast real-time computing system. BigQuery solves this problem by enabling super-fast, SQL-like queries against petabytes of data using the processing power of Google’s infrastructure. We will cover its core features, creating tables, columns, views, working with partitions, clustering for cost optimizations, streaming inserts, User Defined Functions, and several use cases for everydaay developer: funnel analytics, behavioral analytics, exploring unstructured data.
The other part will be about BigQuery ML, which enables users to create and execute machine learning models in BigQuery using standard SQL queries. BigQuery ML democratizes machine learning by enabling SQL practitioners to build models using existing SQL tools and skills. BigQuery ML increases development speed by eliminating the need to move data.
Applying BigQuery ML on e-commerce data analyticsMárton Kodok
With BigQuery ML, you can build machine learning models without leaving the database environment and training it on massive datasets. We are going to demonstrate common marketing Machine Learning use cases we do at REEA.net to build, train, eval and predict, your own scalable machine learning models using SQL language in Google BigQuery and to address the following use cases:
Customer Segmentation
Customer Lifetime Value (LTV) prediction
Conversion/Purchase prediction
The audience will get first hand experience how to write CREATE MODEL sql syntax to build machine learning models such as:
Multiclass logistic regression for classification
K-means clustering
Import TensorFlow models for prediction in BigQuery
Models are trained and accessed in BigQuery using SQL — a language data analysts know. This enables business decision making through predictive analytics across the organization without leaving the query editor
BigdataConference Europe - BigQuery MLMárton Kodok
One of the hottest topics in database land these days is BigQuery ML. A new way to use machine learning on top of tabular data straight on your tables without leaving the query editor.
With BigQuery ML, you can build machine learning models without leaving the database environment and training it on massive datasets.
In this demo session, we are going to demonstrate common marketing Machine Learning use cases how to build, train, eval and predict, your own scalable machine learning models using SQL language.
The audience will get first hand experience how to write CREATE MODEL sql syntax to build machine learning models such as:
– Multiclass logistic regression for classification
– K-means clustering
– Matrix factorization
– ARIMA time series predictions
– Import TensorFlow models for prediction in BigQuery
Models are trained and accessed in BigQuery using SQL — a language data analysts know. This enables business decision making through predictive analytics across the organization without leaving the query editor.
Google Analytics Konferenz 2019_Google Cloud Platform_Carl Fernandes & Ksenia...e-dialog GmbH
Marketing in the Cloud with Google
It's no secret that "data" and "the cloud" presents a huge opportunity for marketers - but often it's difficult to understand how exactly these famous buzzwords can really help step change performance for a business. In this talk you will learn how Google thinks about marketing in the cloud, what the key use cases are and best practices that will help advertisers prepare for the future.
Voxxed Days Cluj - Powering interactive data analysis with Google BigQueryMárton Kodok
Every company,
no matter how far from the tech they are,
is evolving into a software company,
and by extension a data company.
For a small company it’s important
to have access to modern BigData tools
without running a dedicated team for it.
CodeCamp Iasi - Creating serverless data analytics system on GCP using BigQueryMárton Kodok
Teaser: provide developers a new way of understanding advanced analytics and choosing the right cloud architecture
The new buzzword is #serverless, as there are many great services that helps us abstract away the complexity associated with managing servers. In this session we will see how serverless helps on large data analytics backends.
We will see how to architect for Cloud and implement into an existing project components that will take us into the #serverless architecture that will ingest our streaming data, run advanced analytics on petabytes of data using BigQuery on Google Cloud Platform - all this next to an existing stack, without being forced to reengineer our app.
BigQuery enables super-fast, SQL/Javascript queries against petabytes of data using the processing power of Google’s infrastructure. We will cover its core features, SQL 2011 standard, working with streaming inserts, User Defined Functions written in Javascript, reference external JS libraries, and several use cases for everyday backend developer: funnel analytics, email heatmap, custom data processing, building dashboards, extracting data using JS functions, emitting rows based on business logic.
Cherokee nation 2 day AIAD & DIAD - App in a day and Dashboard in dayVishal Pawar
Cherokee nation 2 day AIAD & DIAD - App in a day and Dashboard in day
Power Apps: A software as a service application platform that enables power users in line of business
roles to easily build and deploy custom business apps. You will learn how to build Canvas and Modeldriven
style of apps.
Common Data Service (CDS): Make it easier to bring your data together and quickly create powerful
apps using a compliant and scalable data service and app platform that’s integrated into Power Apps.
Power Automate: A business service for line of business specialists and IT pros to build automated
workflows intuitively.
Power BI: Self-service business intelligence capabilities, where end users can create reports and
dashboards by themselves, without having to depend on information technology staff or database
administrators.
BigQuery ML - Machine learning at scale using SQLMárton Kodok
With BigQuery ML, you can build machine learning models without leaving the data warehouse environment and training it on massive datasets. We are going to demonstrate how to build, train, eval and predict, your own scalable machine learning models using standard SQL language in Google BigQuery.
We will see how can we use CREATE MODEL sql syntax to build different models such as:
-Linear regression
-Multiclass logistic regression for classification
-K-means clustering
-Import TensorFlow models for prediction in BigQuery
We will see how we can apply these models on tabular data in retail and marketing use cases.
Models are trained and accessed in BigQuery using SQL — a language data analysts know. This enables business decision making through predictive analytics across the organization without leaving the query editor.
Discover BigQuery ML, build your own CREATE MODEL statementMárton Kodok
With BigQuery ML, you can build machine learning models without leaving the database environment and training it on massive datasets. In this demo session we are going to demonstrate common marketing Machine Learning use cases of how to build, train, eval, and predict, your own scalable machine learning models using SQL language in Google BigQuery and to address the following use cases: - Customer Segmentation + Product cross sale recommendation - Conversion/Purchase prediction - Inference with other in-built >20 models The audience will get first-hand experience with how to write CREATE MODEL sql syntax to build machine learning models such as: - Multiclass logistic regression for classification - K-means clustering - Matrix factorization - ARIMA time series predictions ... and more Models are trained and accessed in BigQuery using SQL — a language data analysts know. This enables business decision-making through predictive analytics across the organization without leaving the query editor. In the end, the audience will learn how everyday developers can build/train/run their own machine-learning models straight from the database query editor, by issuing CREATE MODEL statements
Lessons learnt and system built while solving the last mile problem in machine learning - taking models to production. Used for the talk at - http://sched.co/BLvf
BigQuery ML - Machine learning at scale using SQLMárton Kodok
With BigQuery ML, you can build machine learning models without leaving the data warehouse environment and training it on massive datasets. We are going to demonstrate how to build, train, eval and predict, your own scalable machine learning models using standard SQL language in Google BigQuery.
We will see how can we use CREATE MODEL sql syntax to build different models such as:
Linear regression
Multiclass logistic regression for classification
K-means clustering
Import TensorFlow models for prediction in BigQuery
We will see how we can apply these models on tabular data in retail and marketing use cases.
Models are trained and accessed in BigQuery using SQL — a language data analysts know. This enables business decision making through predictive analytics across the organization without leaving the query editor.
An short introduction on Big Query. With this presentation you'll quickly discover :
How load data in BigQuery
How to build dashboard using BigQuery
How to work with BigQuery
and, at last but not least, we've added some best practices
We hope you'll enjoy this presentation and that it will help you to start exploring this wonderful solution. Don't hesitate to send us your feedbacks or questions
A Big (Query) Frog in a Small Pond, Jakub Motyl, BuffPanelData Science Club
How does one process 200GB of streaming raw data, daily? Where dedicated servers and home-made solutions fail, BigQuery comes out the victor. We will talk about the big data architecture with over 110 million players total on record, how we managed to implement it, and how is it possible that we keep daily operational costs under $50.
In the beginning we will explain what kinds of data sources a top-selling game has to integrate and analyze and how to pre-process the data to avoid ramping up costs in disaster scenarios. Part of the talk is also dedicated to all the components that are involved in the many transformations the data undergoes and we will show you how the output from the entire pipeline looks.
Data Driven Attribution in BigQuery with Shapley Values and Markov ChainsChristopher Gutknecht
This talk covers the journey of implementing two data driven attribution models in BigQuery and the findings so far. The packages Fractribution (Google) and Channel Attribution were used to model Shapley values and markov chains respectively.
A basic introduction to Big Query, how it works and what it can do. Look into a use case of Big Query, using Google Analytics and CRM data to create a powerful remarketing list.
GDG DevFest Ukraine - Powering Interactive Data Analysis with Google BigQueryMárton Kodok
Every scientist who needs big data analytics to save millions of lives should have that power. Powering Interactive Data Analysis require massive architecture, and know-how to build a fast real-time computing system. You will learn how Google BigQuery solves this problem by enabling super-fast, SQL queries against petabytes of data using the processing power of Google’s infrastructure. After this session you will be able to work with BigQuery, do streaming inserts, write User Defined Functions in Javascript, and several use cases for everyday developer: funnel analytics, behavioral analytics, exploring unstructured data. You will be able to run arbitrary queries on open-data such as historical data about Github commits, Stackoverflow Q&A data, or analysing Reddit comments to find out books the community talks about.
This German session at PPC Camp Rosenheim show different optimization strategies for structuring products in PMAX shopping campaigns. It contains solutions for the following structuring frameworks:
- Heroes or performers
- Villains or budget burners
- Zombies or zero volume products
- how to quantify size selection
- how to combine product attractiveness from a merchant and from a user view
The code shown can be explored at: https://github.com/ChrisGutknecht/ppc_camp_pmax
*****
Diese Session beim PPC Camp Rosenheim zeigt verschiedene Optimierungsstrategien für die Strukturierung von Produkten in PMAX-Shopping-Kampagnen. Sie enthält Lösungen für die folgenden Strukturierungsansätze:
- Heroes oder Performer
- Villians oder Kostenfresser
- Zombies oder Produkte ohne Volumen
- wie man die Größenauswahl quantifiziert
- wie man die Produktattraktivität aus der Sicht eines Händlers und eines Benutzers kombiniert
Der gezeigte Code kann unter folgendem Link erkundet werden: https://github.com/ChrisGutknecht/ppc_camp_pmax
How to recover from an unsuccessful SEO relaunch by activating your data (SMX...Christopher Gutknecht
In this session, Chris and Danny their experience of recovering from an unsuccessful SEO relaunch by using data tactics. The sessions covers three SEO issues structural change, page bloat and Javascript issues and how data tactics can help.
This Github repo contains all the data pipeline code shown at SMX Advanced EU 2023. It contains implementation details for four data products, as shown in the session:
- a 404 live alert on GA4 data, executed via a cloud function
- a sitemap monitoring script, based on advertools
- a custom SEO crawler, based on advertools
- a webvitals monitoring script
https://github.com/ChrisGutknecht/smx_advanced_seo_data/
In this session, Helena and Christopher from Bergzeit describe two possible solutions to rebuild custom session level channel groups with GA4 raw data, using the dbt framework. The goal of the contained code is to bring the GA4 raw data as close as possible to the session level custom channel groups displayed in the GA4 UI.
If you see shortcomings in the custom channel groups in the GA4 UI, or if you are in need of session level channel groups in your GA4 raw data for reporting or other purposes this repository may be for you. If you use dbt you can use the documented code blocks with some minor adjustments. If you do not use dbt the code may still give you some inspiration on the process. The accompanying code repository can be found here: https://github.com/hellste/dbt_ga4_custom_channelgroups
This SMX session covers Bergzeit's approach on integrating product margins, return rates, attribution data and customer effects into Google Ads bidding. Kira and I leverage a BigQuery pipeline in SQL and upload the gross profit data via the conversion import, sharing our specific learnings.
Questioning data quality and troubleshooting tracking gaps (version2 | Smx Su...Christopher Gutknecht
Checklist LINK: https://docs.google.com/spreadsheets/d/1C7Ojteg-EWazi_xDEwlljwALppHxG3PY8lBuodGSmxY/edit?usp=drive_web&ouid=102383257734620162787
This session gives an updated overview of tracking gaps due to measurement settings, cookie consent and browser issues, also offering an outlook on server-side tagging.
Questioning Data Quality and Troubleshooting Tracking Gaps (SMX Munich 2020)Christopher Gutknecht
This session covers a wide range of causes for observed gaps across different data sources in a web tracking context. The three chapters are measurement settings, browser data loss and cookie consent. The session was held by Christopher Gutknecht at SMX Virtual Edition 2020.
The accompanying checklist can be found here: https://docs.google.com/spreadsheets/d/1C7Ojteg-EWazi_xDEwlljwALppHxG3PY8lBuodGSmxY/edit#gid=1216566297
SMX Advanced - When to use Machine Learning for Search CampaignsChristopher Gutknecht
This SMX talk will walk you through how search campaigns can be automated from an inventory and a query perspective and where entry-level machine learning services can improve the automation quality. The accompanying code can be found at: bit.ly/smx_chrisg
The talk was held at SMX Advanded Europe 2019 in Berlin by Christopher Gutknecht from Bergzeit.
As Europe's leading economic powerhouse and the fourth-largest hashtag#economy globally, Germany stands at the forefront of innovation and industrial might. Renowned for its precision engineering and high-tech sectors, Germany's economic structure is heavily supported by a robust service industry, accounting for approximately 68% of its GDP. This economic clout and strategic geopolitical stance position Germany as a focal point in the global cyber threat landscape.
In the face of escalating global tensions, particularly those emanating from geopolitical disputes with nations like hashtag#Russia and hashtag#China, hashtag#Germany has witnessed a significant uptick in targeted cyber operations. Our analysis indicates a marked increase in hashtag#cyberattack sophistication aimed at critical infrastructure and key industrial sectors. These attacks range from ransomware campaigns to hashtag#AdvancedPersistentThreats (hashtag#APTs), threatening national security and business integrity.
🔑 Key findings include:
🔍 Increased frequency and complexity of cyber threats.
🔍 Escalation of state-sponsored and criminally motivated cyber operations.
🔍 Active dark web exchanges of malicious tools and tactics.
Our comprehensive report delves into these challenges, using a blend of open-source and proprietary data collection techniques. By monitoring activity on critical networks and analyzing attack patterns, our team provides a detailed overview of the threats facing German entities.
This report aims to equip stakeholders across public and private sectors with the knowledge to enhance their defensive strategies, reduce exposure to cyber risks, and reinforce Germany's resilience against cyber threats.
Show drafts
volume_up
Empowering the Data Analytics Ecosystem: A Laser Focus on Value
The data analytics ecosystem thrives when every component functions at its peak, unlocking the true potential of data. Here's a laser focus on key areas for an empowered ecosystem:
1. Democratize Access, Not Data:
Granular Access Controls: Provide users with self-service tools tailored to their specific needs, preventing data overload and misuse.
Data Catalogs: Implement robust data catalogs for easy discovery and understanding of available data sources.
2. Foster Collaboration with Clear Roles:
Data Mesh Architecture: Break down data silos by creating a distributed data ownership model with clear ownership and responsibilities.
Collaborative Workspaces: Utilize interactive platforms where data scientists, analysts, and domain experts can work seamlessly together.
3. Leverage Advanced Analytics Strategically:
AI-powered Automation: Automate repetitive tasks like data cleaning and feature engineering, freeing up data talent for higher-level analysis.
Right-Tool Selection: Strategically choose the most effective advanced analytics techniques (e.g., AI, ML) based on specific business problems.
4. Prioritize Data Quality with Automation:
Automated Data Validation: Implement automated data quality checks to identify and rectify errors at the source, minimizing downstream issues.
Data Lineage Tracking: Track the flow of data throughout the ecosystem, ensuring transparency and facilitating root cause analysis for errors.
5. Cultivate a Data-Driven Mindset:
Metrics-Driven Performance Management: Align KPIs and performance metrics with data-driven insights to ensure actionable decision making.
Data Storytelling Workshops: Equip stakeholders with the skills to translate complex data findings into compelling narratives that drive action.
Benefits of a Precise Ecosystem:
Sharpened Focus: Precise access and clear roles ensure everyone works with the most relevant data, maximizing efficiency.
Actionable Insights: Strategic analytics and automated quality checks lead to more reliable and actionable data insights.
Continuous Improvement: Data-driven performance management fosters a culture of learning and continuous improvement.
Sustainable Growth: Empowered by data, organizations can make informed decisions to drive sustainable growth and innovation.
By focusing on these precise actions, organizations can create an empowered data analytics ecosystem that delivers real value by driving data-driven decisions and maximizing the return on their data investment.
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
Opendatabay - Open Data Marketplace.pptxOpendatabay
Opendatabay.com unlocks the power of data for everyone. Open Data Marketplace fosters a collaborative hub for data enthusiasts to explore, share, and contribute to a vast collection of datasets.
First ever open hub for data enthusiasts to collaborate and innovate. A platform to explore, share, and contribute to a vast collection of datasets. Through robust quality control and innovative technologies like blockchain verification, opendatabay ensures the authenticity and reliability of datasets, empowering users to make data-driven decisions with confidence. Leverage cutting-edge AI technologies to enhance the data exploration, analysis, and discovery experience.
From intelligent search and recommendations to automated data productisation and quotation, Opendatabay AI-driven features streamline the data workflow. Finding the data you need shouldn't be a complex. Opendatabay simplifies the data acquisition process with an intuitive interface and robust search tools. Effortlessly explore, discover, and access the data you need, allowing you to focus on extracting valuable insights. Opendatabay breaks new ground with a dedicated, AI-generated, synthetic datasets.
Leverage these privacy-preserving datasets for training and testing AI models without compromising sensitive information. Opendatabay prioritizes transparency by providing detailed metadata, provenance information, and usage guidelines for each dataset, ensuring users have a comprehensive understanding of the data they're working with. By leveraging a powerful combination of distributed ledger technology and rigorous third-party audits Opendatabay ensures the authenticity and reliability of every dataset. Security is at the core of Opendatabay. Marketplace implements stringent security measures, including encryption, access controls, and regular vulnerability assessments, to safeguard your data and protect your privacy.
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
Explore our comprehensive data analysis project presentation on predicting product ad campaign performance. Learn how data-driven insights can optimize your marketing strategies and enhance campaign effectiveness. Perfect for professionals and students looking to understand the power of data analysis in advertising. for more details visit: https://bostoninstituteofanalytics.org/data-science-and-artificial-intelligence/
Building Data Products with BigQuery for PPC and SEO (SMX 2022)
1. How To Build Data Products
in BigQuery for PPC & SEO
Christopher Gutknecht | @chrisgutknecht | Bergzeit
2. 1. Intro
Our Plan: Build Data Products & Activate Data
3. PPC & SEO Use Cases
2. Product Principles
3. About Chris: Acquisition & Analytics at Bergzeit
Digital Marketer
Data Nerd
Climber
1997 2008 2010 2022
Dad of 2
Online Store for Mountain Gear
145 M Revenue in FY 21/22
14 Countries, 5 Languages
World-class, data-driven team 🔥
Hiring a PPC!
4. I’d like to Set Clear Expectations for This Session
No BigQuery Intro
Data Management-Talk
What this session IS What it’s NOT
No BigQuery tactics
No ML Focus
Google Cloud & dbt focus
Data Product mindset
75% PPC, 25% SEO
5. Large-Scale PPC is Becoming the
Science of Managing Data Pipelines
RECAP FROM SMX 2019
Recap of My SMX 2019 Talk
13. Are there Alternatives to dbt? Not really
No extra setup
No control
Saved queries Google Dataprep
No SQL (no-code)
Transformation &
scheduling only
Google Dataform
Free for GCP
Smaller ecosystem
SQL framework
15. Wait: What about Javascript and Python?
Easy to get started
Instantly ready
Javascript Python
Powerful libraries
Leading data tools
SQL
Super scalable
Ideal for production
Centralized code
Only for smaller
data tasks
Harder to centralize
16. 1. Intro
Plan: Build Data Products & Activate Your Data
3. PPC & SEO Use Cases
2. Product Principles
30. Keep Your Documentation Close To Your Code
Documentation in .yml and .md Files
Document all Sources, Models and Exposures
Get documentation as HTML
YAML File for Model Metadata
Markdown File for Description
SQL Files with Model code
58. To This SQL-based Transformation in dbt
better control for complex transformations
easier to extend and reuse parts of logic
lower cost in long run
automated testing and documentation
59. There Are More Data Products in the Lab…
(Join Our Team!)
61. Your Takeaways from this Session
1. What the modern data stack is and why it’s exciting
3. Which principles to apply for data products in production
2. Why dbt is the best tool for data warehouse transformation
4. A few interesting PPC and SEO use cases for you to try
62. Thanks for Your Time.
Looking Forward To Questions!
Chris Gutknecht | Teamlead A&O | Hiring a PPC!