SlideShare a Scribd company logo
1 of 32
Download to read offline
Get more from 
Analytics with 
Google BigQuery 
javier ramirez 
@supercoco9
about me 
19 years working on software: banking, e-commerce, 
government, CMS, start-ups... 
founder of 
https://datawaki.com 
https://teowaki.com 
https://teowaki.com/services 
Google Developer Expert on 
the Cloud Platform 
datawaki 
mail: j@teowaki.com twitter: @supercoco9
BigQuery 
is awes.. 
I use Google 
Analytics 
javier ramirez @supercoco9 https://teowaki.com
Isn't Google Analytics 
good enough? 
javier ramirez @supercoco9 https://teowaki.com
Google Analytics is great but... 
It lets you access aggregated data and sampled 
reports, not individual sessions/visits data. 
Even premium accounts get sampled reports when 
there are too many data (and not all the reports can 
be unsampled). 
javier ramirez @supercoco9 https://teowaki.com
Google Analytics is great but... 
If you need to manage many different segments, and 
if you want to combine segments, it can get tricky. 
Moreover, you can only segment or create reports 
using the pre-defined filters, which might or not be 
enough for you*. 
*even if segments have experienced a huge 
improvement with Universal Analytics 
javier ramirez @supercoco9 https://teowaki.com
Google Analytics is great but... 
It's not easy to cross data in Analytics with data from 
other sources (CRM, invoicing system...) 
Now you can use Import Data from Universal 
Analytics, but there are many constraints to what 
you can do 
javier ramirez @supercoco9 https://teowaki.com
Google Analytics is great but... 
Good for knowing what's happening in your 
application, but difficult for: 
* business intelligence/big data (data mining, 
find patterns...) 
* machine learning (classify information, 
predict future trends...) 
javier ramirez @supercoco9 https://teowaki.com
Google BigQuery 
Designed to run analytics 
over huge volumes of raw 
data, and to integrate 
with other data sources 
javier ramirez @supercoco9 https://teowaki.com
one 
more 
thing
Google BigQuery + GA Premium 
Google Analytics Premium 
users get free daily 
exports from GA to 
BigQuery. 
javier ramirez @supercoco9 https://teowaki.com
Google BigQuery + GA Premium 
All your raw data. 
Unsampled. 
Use it however you want. 
BOOM! 
javier ramirez @supercoco9 https://teowaki.com
o'reilly
khan academy
it's just SQL 
javier ramirez @supercoco9 https://teowaki.com
SQL is not very hard 
Give me the count of visitors from our analytics who 
visited yesterday using a mobile device, by 
country 
SELECT count(fullVisitorId) 
from ga_sessions_20141203 
where device.isMobile = true 
GROUP BY geoNetwork.country
data 
schema 
javier ramirez @supercoco9 https://teowaki.com
basic queries (metric/dimension) 
SELECT trafficSource.source, SUM( totals.transactions ) AS total_transactions 
FROM playground.ga_sessions_20140621 
GROUP BY trafficSource.source 
ORDER BY total_transactions; 
SELECT device.isMobile, SUM ( totals.pageviews ) AS total_pageviews 
FROM playground.ga_sessions_20140621 
GROUP BY device.isMobile 
ORDER BY total_pageviews;
basic queries 
with a twist 
SELECT 
IF(DOMAIN(trafficSource.source) is null, 
trafficSource.source, 
DOMAIN(trafficSource.source)) 
AS normalized_source, 
SUM ( totals.transactions ) AS total_transactions 
FROM playground.ga_sessions_20140621 
GROUP BY normalized_source 
ORDER BY total_transactions;
Average amount spent per visit 
SELECT ( SUM(total_transactionrevenue_per_user) / SUM(total_visits_per_user) ) 
AS avg_revenue_by_user_per_visit 
FROM ( 
SELECT SUM(totals.visits) AS total_visits_per_user, 
SUM( totals.transactionRevenue ) AS total_transactionrevenue_per_user, 
visitorId 
FROM playground.ga_sessions_20140621 
WHERE totals.visits>0 
AND totals.transactions>=1 
AND totals.transactionRevenue IS NOT NULL 
GROUP BY visitorId ) ;
2 segments, combined
Users who bought product A, 
also bought product B 
SELECT hits.item.productName AS other_purchased_products, 
COUNT(hits.item.productName) AS quantity 
FROM playground.ga_sessions_20140621 
WHERE fullVisitorId IN ( 
SELECT fullVisitorId 
FROM playground.ga_sessions_20140621 
WHERE hits.item.productName CONTAINS 'Light Helmet' 
AND totals.transactions>=1 
GROUP BY fullVisitorId ) 
AND hits.item.productName IS NOT NULL 
AND hits.item.productName !='Light Helmet' 
GROUP BY other_purchased_products 
ORDER BY quantity DESC;
SELECT prod_name, count(*) as transactions 
FROM 
( 
SELECT fullVisitorId, min(date) AS date, visitId, 
hits.item.productName as prod_name 
FROM ( 
SELECT fullVisitorId, date, visitId, 
totals.transactions, 
hits.item.productName FROM 
(TABLE_DATE_RANGE([dataset.ga_sessions_], 
TIMESTAMP('2014-06-01'), 
TIMESTAMP('2014-06-14'))) 
) 
WHERE fullVisitorId IN 
( 
SELECT fullVisitorId 
FROM (TABLE_DATE_RANGE([dataset.ga_sessions_], 
TIMESTAMP('2014-06-01'), 
TIMESTAMP('2014-06-14'))) 
GROUP BY fullVisitorId 
HAVING SUM(totals.transactions) > 1 
) 
AND hits.item.productName IS NOT NULL 
GROUP BY fullVisitorId, visitId, prod_name ORDER BY 
fullVisitorId DESC 
) 
GROUP BY prod_name ORDER BY transactions DESC; 
* example query from the lunametrics blog. Check them out for more awesomeness 
Products that 
are purchased 
and lead to 
other products 
being purchased
Identify user path/user actions 
SELECT fullvisitorID, visitID, visitNumber, hits.page.pagePath 
FROM playground.ga_sessions_20140621 
where hits.type='PAGE' 
order by fullvisitorID, visitID, hits.hitnumber asc
individual users data is awesome 
Cross CRM data with individual users actions to see 
how your response to incidents affect your users. 
Use the “frequently bought together” query and find 
users who didn't buy the related products. Send an 
e-mail campaign with an offer for those products.
integrating with external 
data sources 
* Connectors/REST API 
* Export into GCS 
* Import into BigQuery 
javier ramirez @supercoco9 https://teowaki.com
What if I don't have 
a GA Premium 
Account?
just send your own data 
define a data structure that fits your needs 
(or replicate the one GA provides), use a JS 
snippet to send data to your server, then 
to BigQuery** 
..you will miss many of the GA dimensions, but 
you can keep using GA and use BigQuery 
only for your unsampled data 
datawaki ** If you want to do this without managing 
your own servers, we can help you 
javier ramirez @supercoco9 https://teowaki.com
BigQuery pricing 
$20 per stored TB 
A site with 50m pageviews, would pay less 
than $10 a month per every 6 months worth 
of data 
$5 per processed TB 
*the 1st TB every month is free of charge 
** GA premium get $500 free credit monthly 
javier ramirez @supercoco9 https://teowaki.com
for GA premium users 
BigQuery is effectively 
for free 
*unless you upload huge external data or make 
huge queries 
javier ramirez @supercoco9 https://teowaki.com
Want to know more? 
https://cloud.google.com/products/bigquery/ 
https://datawaki.com 
Need help? 
https://teowaki.com/services 
Thanks! 
Javier Ramírez 
@supercoco9

More Related Content

What's hot

Google Analytics Data Mining with R
Google Analytics Data Mining with RGoogle Analytics Data Mining with R
Google Analytics Data Mining with RTatvic Analytics
 
Google Analytics 4 Trial Recommendation
Google Analytics 4 Trial RecommendationGoogle Analytics 4 Trial Recommendation
Google Analytics 4 Trial RecommendationYisrael Segall
 
Benefits of Google Tag Manager
Benefits of Google Tag ManagerBenefits of Google Tag Manager
Benefits of Google Tag ManagerPhil Pearce
 
Space Data Strategy
Space Data StrategySpace Data Strategy
Space Data StrategyDatentreiber
 
Design Thinking for Data Superwomen & Supermen
Design Thinking for Data Superwomen & SupermenDesign Thinking for Data Superwomen & Supermen
Design Thinking for Data Superwomen & SupermenDatentreiber
 
Unlock your Big Data with Analytics and BI on Office 365
Unlock your Big Data with Analytics and BI on Office 365Unlock your Big Data with Analytics and BI on Office 365
Unlock your Big Data with Analytics and BI on Office 365Brian Culver
 
Snowplow at DA Hub emerging technology showcase
Snowplow at DA Hub emerging technology showcaseSnowplow at DA Hub emerging technology showcase
Snowplow at DA Hub emerging technology showcaseyalisassoon
 

What's hot (8)

Google Analytics Data Mining with R
Google Analytics Data Mining with RGoogle Analytics Data Mining with R
Google Analytics Data Mining with R
 
Google Analytics 4 Trial Recommendation
Google Analytics 4 Trial RecommendationGoogle Analytics 4 Trial Recommendation
Google Analytics 4 Trial Recommendation
 
Benefits of Google Tag Manager
Benefits of Google Tag ManagerBenefits of Google Tag Manager
Benefits of Google Tag Manager
 
Space Data Strategy
Space Data StrategySpace Data Strategy
Space Data Strategy
 
Design Thinking for Data Superwomen & Supermen
Design Thinking for Data Superwomen & SupermenDesign Thinking for Data Superwomen & Supermen
Design Thinking for Data Superwomen & Supermen
 
Unlock your Big Data with Analytics and BI on Office 365
Unlock your Big Data with Analytics and BI on Office 365Unlock your Big Data with Analytics and BI on Office 365
Unlock your Big Data with Analytics and BI on Office 365
 
Snowplow at DA Hub emerging technology showcase
Snowplow at DA Hub emerging technology showcaseSnowplow at DA Hub emerging technology showcase
Snowplow at DA Hub emerging technology showcase
 
Clustrix Infographic
Clustrix InfographicClustrix Infographic
Clustrix Infographic
 

Viewers also liked

Web Optimization Process Management
Web Optimization Process ManagementWeb Optimization Process Management
Web Optimization Process ManagementTim Wilson
 
BigQuery for the Big Data win
BigQuery for the Big Data winBigQuery for the Big Data win
BigQuery for the Big Data winKen Taylor
 
A Process for Being Data Driven
A Process for Being Data DrivenA Process for Being Data Driven
A Process for Being Data DrivenTim Wilson
 
Should Digital Analysts Become More Data Science-y?
Should Digital Analysts Become More Data Science-y?Should Digital Analysts Become More Data Science-y?
Should Digital Analysts Become More Data Science-y?Tim Wilson
 
Google BigQuery for Everyday Developer
Google BigQuery for Everyday DeveloperGoogle BigQuery for Everyday Developer
Google BigQuery for Everyday DeveloperMárton Kodok
 
Exploring BigData with Google BigQuery
Exploring BigData with Google BigQueryExploring BigData with Google BigQuery
Exploring BigData with Google BigQueryDharmesh Vaya
 
Google BigQuery 101 & What’s New
Google BigQuery 101 & What’s NewGoogle BigQuery 101 & What’s New
Google BigQuery 101 & What’s NewDoiT International
 
AWS Athena vs. Google BigQuery for interactive SQL Queries
AWS Athena vs. Google BigQuery for interactive SQL QueriesAWS Athena vs. Google BigQuery for interactive SQL Queries
AWS Athena vs. Google BigQuery for interactive SQL QueriesDoiT International
 
Big Query Basics
Big Query BasicsBig Query Basics
Big Query BasicsIdo Green
 

Viewers also liked (11)

Web Optimization Process Management
Web Optimization Process ManagementWeb Optimization Process Management
Web Optimization Process Management
 
BigQuery for the Big Data win
BigQuery for the Big Data winBigQuery for the Big Data win
BigQuery for the Big Data win
 
A Process for Being Data Driven
A Process for Being Data DrivenA Process for Being Data Driven
A Process for Being Data Driven
 
Should Digital Analysts Become More Data Science-y?
Should Digital Analysts Become More Data Science-y?Should Digital Analysts Become More Data Science-y?
Should Digital Analysts Become More Data Science-y?
 
Google BigQuery
Google BigQueryGoogle BigQuery
Google BigQuery
 
Google BigQuery for Everyday Developer
Google BigQuery for Everyday DeveloperGoogle BigQuery for Everyday Developer
Google BigQuery for Everyday Developer
 
Exploring BigData with Google BigQuery
Exploring BigData with Google BigQueryExploring BigData with Google BigQuery
Exploring BigData with Google BigQuery
 
Google BigQuery 101 & What’s New
Google BigQuery 101 & What’s NewGoogle BigQuery 101 & What’s New
Google BigQuery 101 & What’s New
 
AWS Athena vs. Google BigQuery for interactive SQL Queries
AWS Athena vs. Google BigQuery for interactive SQL QueriesAWS Athena vs. Google BigQuery for interactive SQL Queries
AWS Athena vs. Google BigQuery for interactive SQL Queries
 
Big Query Basics
Big Query BasicsBig Query Basics
Big Query Basics
 
Google Cloud Dataflow
Google Cloud DataflowGoogle Cloud Dataflow
Google Cloud Dataflow
 

Similar to Google Analytics and BigQuery, by Javier Ramirez, from datawaki

Google Analytics with an Intro to Google Tag Manager for Austin WordPress Meetup
Google Analytics with an Intro to Google Tag Manager for Austin WordPress MeetupGoogle Analytics with an Intro to Google Tag Manager for Austin WordPress Meetup
Google Analytics with an Intro to Google Tag Manager for Austin WordPress MeetupRich Plakas
 
Ga premium bigquery-integration
Ga premium bigquery-integrationGa premium bigquery-integration
Ga premium bigquery-integrationStefan Xhunga
 
Big query the first step - (MOSG)
Big query the first step - (MOSG)Big query the first step - (MOSG)
Big query the first step - (MOSG)Soshi Nemoto
 
Usando metodologías ágiles en UX
Usando metodologías ágiles en UX Usando metodologías ágiles en UX
Usando metodologías ágiles en UX Paradigma Digital
 
Track Report & Optimize Your Web Creations
Track Report & Optimize Your Web CreationsTrack Report & Optimize Your Web Creations
Track Report & Optimize Your Web CreationsEmpirical Path
 
Top 10 Tips for Google Tag Manager
Top 10 Tips for Google Tag ManagerTop 10 Tips for Google Tag Manager
Top 10 Tips for Google Tag ManagerAnna Lewis
 
Google Analytics 4 and BigQuery: The New Kids On The Block.pptx
Google Analytics 4 and BigQuery: The New Kids On The Block.pptxGoogle Analytics 4 and BigQuery: The New Kids On The Block.pptx
Google Analytics 4 and BigQuery: The New Kids On The Block.pptxOmi Sido
 
Ai based analytics in the cloud
Ai based analytics in the cloudAi based analytics in the cloud
Ai based analytics in the cloudSvetlin Stanchev
 
Florian Pertynski session at Google Partner Summit Review
Florian Pertynski session at Google Partner Summit Review Florian Pertynski session at Google Partner Summit Review
Florian Pertynski session at Google Partner Summit Review IIHEvents
 
Turbocharging Google Analytics
Turbocharging Google AnalyticsTurbocharging Google Analytics
Turbocharging Google AnalyticsDana DiTomaso
 
danmcclary-pspresentation-katieboyle-171030115522.pdf
danmcclary-pspresentation-katieboyle-171030115522.pdfdanmcclary-pspresentation-katieboyle-171030115522.pdf
danmcclary-pspresentation-katieboyle-171030115522.pdfssuser3ee399
 
Why Big and Small Data Is Important by Google's Product Manager
Why Big and Small Data Is Important by Google's Product ManagerWhy Big and Small Data Is Important by Google's Product Manager
Why Big and Small Data Is Important by Google's Product ManagerProduct School
 
Topic 6- SEO and Analytics .ppt
Topic 6- SEO and Analytics  .pptTopic 6- SEO and Analytics  .ppt
Topic 6- SEO and Analytics .pptJaySears2
 
Google Analytics Training Seminar - Vorian Agency
Google Analytics Training Seminar - Vorian AgencyGoogle Analytics Training Seminar - Vorian Agency
Google Analytics Training Seminar - Vorian AgencyVorian Agency
 
Why Big Query is so Powerful - Trusted Conf
Why Big Query is so Powerful - Trusted ConfWhy Big Query is so Powerful - Trusted Conf
Why Big Query is so Powerful - Trusted ConfIn Marketing We Trust
 

Similar to Google Analytics and BigQuery, by Javier Ramirez, from datawaki (20)

Google Analytics with an Intro to Google Tag Manager for Austin WordPress Meetup
Google Analytics with an Intro to Google Tag Manager for Austin WordPress MeetupGoogle Analytics with an Intro to Google Tag Manager for Austin WordPress Meetup
Google Analytics with an Intro to Google Tag Manager for Austin WordPress Meetup
 
Ga premium bigquery-integration
Ga premium bigquery-integrationGa premium bigquery-integration
Ga premium bigquery-integration
 
Big query the first step - (MOSG)
Big query the first step - (MOSG)Big query the first step - (MOSG)
Big query the first step - (MOSG)
 
Usando metodologías ágiles en UX
Usando metodologías ágiles en UX Usando metodologías ágiles en UX
Usando metodologías ágiles en UX
 
Track Report & Optimize Your Web Creations
Track Report & Optimize Your Web CreationsTrack Report & Optimize Your Web Creations
Track Report & Optimize Your Web Creations
 
Top 10 Tips for Google Tag Manager
Top 10 Tips for Google Tag ManagerTop 10 Tips for Google Tag Manager
Top 10 Tips for Google Tag Manager
 
MOOCS.pptx
MOOCS.pptxMOOCS.pptx
MOOCS.pptx
 
Google Analytics 4 and BigQuery: The New Kids On The Block.pptx
Google Analytics 4 and BigQuery: The New Kids On The Block.pptxGoogle Analytics 4 and BigQuery: The New Kids On The Block.pptx
Google Analytics 4 and BigQuery: The New Kids On The Block.pptx
 
Ai based analytics in the cloud
Ai based analytics in the cloudAi based analytics in the cloud
Ai based analytics in the cloud
 
Florian Pertynski session at Google Partner Summit Review
Florian Pertynski session at Google Partner Summit Review Florian Pertynski session at Google Partner Summit Review
Florian Pertynski session at Google Partner Summit Review
 
ga4.pdf
ga4.pdfga4.pdf
ga4.pdf
 
ga4.pdf
ga4.pdfga4.pdf
ga4.pdf
 
Google analytics
Google analyticsGoogle analytics
Google analytics
 
Turbocharging Google Analytics
Turbocharging Google AnalyticsTurbocharging Google Analytics
Turbocharging Google Analytics
 
CEO & Co-Founder - Steve Krull, Be Found Online
CEO & Co-Founder - Steve Krull, Be Found OnlineCEO & Co-Founder - Steve Krull, Be Found Online
CEO & Co-Founder - Steve Krull, Be Found Online
 
danmcclary-pspresentation-katieboyle-171030115522.pdf
danmcclary-pspresentation-katieboyle-171030115522.pdfdanmcclary-pspresentation-katieboyle-171030115522.pdf
danmcclary-pspresentation-katieboyle-171030115522.pdf
 
Why Big and Small Data Is Important by Google's Product Manager
Why Big and Small Data Is Important by Google's Product ManagerWhy Big and Small Data Is Important by Google's Product Manager
Why Big and Small Data Is Important by Google's Product Manager
 
Topic 6- SEO and Analytics .ppt
Topic 6- SEO and Analytics  .pptTopic 6- SEO and Analytics  .ppt
Topic 6- SEO and Analytics .ppt
 
Google Analytics Training Seminar - Vorian Agency
Google Analytics Training Seminar - Vorian AgencyGoogle Analytics Training Seminar - Vorian Agency
Google Analytics Training Seminar - Vorian Agency
 
Why Big Query is so Powerful - Trusted Conf
Why Big Query is so Powerful - Trusted ConfWhy Big Query is so Powerful - Trusted Conf
Why Big Query is so Powerful - Trusted Conf
 

More from javier ramirez

¿Se puede vivir del open source? T3chfest
¿Se puede vivir del open source? T3chfest¿Se puede vivir del open source? T3chfest
¿Se puede vivir del open source? T3chfestjavier ramirez
 
QuestDB: The building blocks of a fast open-source time-series database
QuestDB: The building blocks of a fast open-source time-series databaseQuestDB: The building blocks of a fast open-source time-series database
QuestDB: The building blocks of a fast open-source time-series databasejavier ramirez
 
Como creamos QuestDB Cloud, un SaaS basado en Kubernetes alrededor de QuestDB...
Como creamos QuestDB Cloud, un SaaS basado en Kubernetes alrededor de QuestDB...Como creamos QuestDB Cloud, un SaaS basado en Kubernetes alrededor de QuestDB...
Como creamos QuestDB Cloud, un SaaS basado en Kubernetes alrededor de QuestDB...javier ramirez
 
Ingesting Over Four Million Rows Per Second With QuestDB Timeseries Database ...
Ingesting Over Four Million Rows Per Second With QuestDB Timeseries Database ...Ingesting Over Four Million Rows Per Second With QuestDB Timeseries Database ...
Ingesting Over Four Million Rows Per Second With QuestDB Timeseries Database ...javier ramirez
 
Deduplicating and analysing time-series data with Apache Beam and QuestDB
Deduplicating and analysing time-series data with Apache Beam and QuestDBDeduplicating and analysing time-series data with Apache Beam and QuestDB
Deduplicating and analysing time-series data with Apache Beam and QuestDBjavier ramirez
 
Your Database Cannot Do this (well)
Your Database Cannot Do this (well)Your Database Cannot Do this (well)
Your Database Cannot Do this (well)javier ramirez
 
Your Timestamps Deserve Better than a Generic Database
Your Timestamps Deserve Better than a Generic DatabaseYour Timestamps Deserve Better than a Generic Database
Your Timestamps Deserve Better than a Generic Databasejavier ramirez
 
Cómo se diseña una base de datos que pueda ingerir más de cuatro millones de ...
Cómo se diseña una base de datos que pueda ingerir más de cuatro millones de ...Cómo se diseña una base de datos que pueda ingerir más de cuatro millones de ...
Cómo se diseña una base de datos que pueda ingerir más de cuatro millones de ...javier ramirez
 
QuestDB-Community-Call-20220728
QuestDB-Community-Call-20220728QuestDB-Community-Call-20220728
QuestDB-Community-Call-20220728javier ramirez
 
Processing and analysing streaming data with Python. Pycon Italy 2022
Processing and analysing streaming  data with Python. Pycon Italy 2022Processing and analysing streaming  data with Python. Pycon Italy 2022
Processing and analysing streaming data with Python. Pycon Italy 2022javier ramirez
 
QuestDB: ingesting a million time series per second on a single instance. Big...
QuestDB: ingesting a million time series per second on a single instance. Big...QuestDB: ingesting a million time series per second on a single instance. Big...
QuestDB: ingesting a million time series per second on a single instance. Big...javier ramirez
 
Servicios e infraestructura de AWS y la próxima región en Aragón
Servicios e infraestructura de AWS y la próxima región en AragónServicios e infraestructura de AWS y la próxima región en Aragón
Servicios e infraestructura de AWS y la próxima región en Aragónjavier ramirez
 
Primeros pasos en desarrollo serverless
Primeros pasos en desarrollo serverlessPrimeros pasos en desarrollo serverless
Primeros pasos en desarrollo serverlessjavier ramirez
 
How AWS is reinventing the cloud
How AWS is reinventing the cloudHow AWS is reinventing the cloud
How AWS is reinventing the cloudjavier ramirez
 
Analitica de datos en tiempo real con Apache Flink y Apache BEAM
Analitica de datos en tiempo real con Apache Flink y Apache BEAMAnalitica de datos en tiempo real con Apache Flink y Apache BEAM
Analitica de datos en tiempo real con Apache Flink y Apache BEAMjavier ramirez
 
Getting started with streaming analytics
Getting started with streaming analyticsGetting started with streaming analytics
Getting started with streaming analyticsjavier ramirez
 
Getting started with streaming analytics: Setting up a pipeline
Getting started with streaming analytics: Setting up a pipelineGetting started with streaming analytics: Setting up a pipeline
Getting started with streaming analytics: Setting up a pipelinejavier ramirez
 
Getting started with streaming analytics: Deep Dive
Getting started with streaming analytics: Deep DiveGetting started with streaming analytics: Deep Dive
Getting started with streaming analytics: Deep Divejavier ramirez
 
Getting started with streaming analytics: streaming basics (1 of 3)
Getting started with streaming analytics: streaming basics (1 of 3)Getting started with streaming analytics: streaming basics (1 of 3)
Getting started with streaming analytics: streaming basics (1 of 3)javier ramirez
 
Monitorización de seguridad y detección de amenazas con AWS
Monitorización de seguridad y detección de amenazas con AWSMonitorización de seguridad y detección de amenazas con AWS
Monitorización de seguridad y detección de amenazas con AWSjavier ramirez
 

More from javier ramirez (20)

¿Se puede vivir del open source? T3chfest
¿Se puede vivir del open source? T3chfest¿Se puede vivir del open source? T3chfest
¿Se puede vivir del open source? T3chfest
 
QuestDB: The building blocks of a fast open-source time-series database
QuestDB: The building blocks of a fast open-source time-series databaseQuestDB: The building blocks of a fast open-source time-series database
QuestDB: The building blocks of a fast open-source time-series database
 
Como creamos QuestDB Cloud, un SaaS basado en Kubernetes alrededor de QuestDB...
Como creamos QuestDB Cloud, un SaaS basado en Kubernetes alrededor de QuestDB...Como creamos QuestDB Cloud, un SaaS basado en Kubernetes alrededor de QuestDB...
Como creamos QuestDB Cloud, un SaaS basado en Kubernetes alrededor de QuestDB...
 
Ingesting Over Four Million Rows Per Second With QuestDB Timeseries Database ...
Ingesting Over Four Million Rows Per Second With QuestDB Timeseries Database ...Ingesting Over Four Million Rows Per Second With QuestDB Timeseries Database ...
Ingesting Over Four Million Rows Per Second With QuestDB Timeseries Database ...
 
Deduplicating and analysing time-series data with Apache Beam and QuestDB
Deduplicating and analysing time-series data with Apache Beam and QuestDBDeduplicating and analysing time-series data with Apache Beam and QuestDB
Deduplicating and analysing time-series data with Apache Beam and QuestDB
 
Your Database Cannot Do this (well)
Your Database Cannot Do this (well)Your Database Cannot Do this (well)
Your Database Cannot Do this (well)
 
Your Timestamps Deserve Better than a Generic Database
Your Timestamps Deserve Better than a Generic DatabaseYour Timestamps Deserve Better than a Generic Database
Your Timestamps Deserve Better than a Generic Database
 
Cómo se diseña una base de datos que pueda ingerir más de cuatro millones de ...
Cómo se diseña una base de datos que pueda ingerir más de cuatro millones de ...Cómo se diseña una base de datos que pueda ingerir más de cuatro millones de ...
Cómo se diseña una base de datos que pueda ingerir más de cuatro millones de ...
 
QuestDB-Community-Call-20220728
QuestDB-Community-Call-20220728QuestDB-Community-Call-20220728
QuestDB-Community-Call-20220728
 
Processing and analysing streaming data with Python. Pycon Italy 2022
Processing and analysing streaming  data with Python. Pycon Italy 2022Processing and analysing streaming  data with Python. Pycon Italy 2022
Processing and analysing streaming data with Python. Pycon Italy 2022
 
QuestDB: ingesting a million time series per second on a single instance. Big...
QuestDB: ingesting a million time series per second on a single instance. Big...QuestDB: ingesting a million time series per second on a single instance. Big...
QuestDB: ingesting a million time series per second on a single instance. Big...
 
Servicios e infraestructura de AWS y la próxima región en Aragón
Servicios e infraestructura de AWS y la próxima región en AragónServicios e infraestructura de AWS y la próxima región en Aragón
Servicios e infraestructura de AWS y la próxima región en Aragón
 
Primeros pasos en desarrollo serverless
Primeros pasos en desarrollo serverlessPrimeros pasos en desarrollo serverless
Primeros pasos en desarrollo serverless
 
How AWS is reinventing the cloud
How AWS is reinventing the cloudHow AWS is reinventing the cloud
How AWS is reinventing the cloud
 
Analitica de datos en tiempo real con Apache Flink y Apache BEAM
Analitica de datos en tiempo real con Apache Flink y Apache BEAMAnalitica de datos en tiempo real con Apache Flink y Apache BEAM
Analitica de datos en tiempo real con Apache Flink y Apache BEAM
 
Getting started with streaming analytics
Getting started with streaming analyticsGetting started with streaming analytics
Getting started with streaming analytics
 
Getting started with streaming analytics: Setting up a pipeline
Getting started with streaming analytics: Setting up a pipelineGetting started with streaming analytics: Setting up a pipeline
Getting started with streaming analytics: Setting up a pipeline
 
Getting started with streaming analytics: Deep Dive
Getting started with streaming analytics: Deep DiveGetting started with streaming analytics: Deep Dive
Getting started with streaming analytics: Deep Dive
 
Getting started with streaming analytics: streaming basics (1 of 3)
Getting started with streaming analytics: streaming basics (1 of 3)Getting started with streaming analytics: streaming basics (1 of 3)
Getting started with streaming analytics: streaming basics (1 of 3)
 
Monitorización de seguridad y detección de amenazas con AWS
Monitorización de seguridad y detección de amenazas con AWSMonitorización de seguridad y detección de amenazas con AWS
Monitorización de seguridad y detección de amenazas con AWS
 

Recently uploaded

CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Onlineanilsa9823
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 

Recently uploaded (20)

CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 

Google Analytics and BigQuery, by Javier Ramirez, from datawaki

  • 1. Get more from Analytics with Google BigQuery javier ramirez @supercoco9
  • 2. about me 19 years working on software: banking, e-commerce, government, CMS, start-ups... founder of https://datawaki.com https://teowaki.com https://teowaki.com/services Google Developer Expert on the Cloud Platform datawaki mail: j@teowaki.com twitter: @supercoco9
  • 3. BigQuery is awes.. I use Google Analytics javier ramirez @supercoco9 https://teowaki.com
  • 4. Isn't Google Analytics good enough? javier ramirez @supercoco9 https://teowaki.com
  • 5. Google Analytics is great but... It lets you access aggregated data and sampled reports, not individual sessions/visits data. Even premium accounts get sampled reports when there are too many data (and not all the reports can be unsampled). javier ramirez @supercoco9 https://teowaki.com
  • 6. Google Analytics is great but... If you need to manage many different segments, and if you want to combine segments, it can get tricky. Moreover, you can only segment or create reports using the pre-defined filters, which might or not be enough for you*. *even if segments have experienced a huge improvement with Universal Analytics javier ramirez @supercoco9 https://teowaki.com
  • 7. Google Analytics is great but... It's not easy to cross data in Analytics with data from other sources (CRM, invoicing system...) Now you can use Import Data from Universal Analytics, but there are many constraints to what you can do javier ramirez @supercoco9 https://teowaki.com
  • 8. Google Analytics is great but... Good for knowing what's happening in your application, but difficult for: * business intelligence/big data (data mining, find patterns...) * machine learning (classify information, predict future trends...) javier ramirez @supercoco9 https://teowaki.com
  • 9. Google BigQuery Designed to run analytics over huge volumes of raw data, and to integrate with other data sources javier ramirez @supercoco9 https://teowaki.com
  • 11. Google BigQuery + GA Premium Google Analytics Premium users get free daily exports from GA to BigQuery. javier ramirez @supercoco9 https://teowaki.com
  • 12. Google BigQuery + GA Premium All your raw data. Unsampled. Use it however you want. BOOM! javier ramirez @supercoco9 https://teowaki.com
  • 15.
  • 16. it's just SQL javier ramirez @supercoco9 https://teowaki.com
  • 17. SQL is not very hard Give me the count of visitors from our analytics who visited yesterday using a mobile device, by country SELECT count(fullVisitorId) from ga_sessions_20141203 where device.isMobile = true GROUP BY geoNetwork.country
  • 18. data schema javier ramirez @supercoco9 https://teowaki.com
  • 19. basic queries (metric/dimension) SELECT trafficSource.source, SUM( totals.transactions ) AS total_transactions FROM playground.ga_sessions_20140621 GROUP BY trafficSource.source ORDER BY total_transactions; SELECT device.isMobile, SUM ( totals.pageviews ) AS total_pageviews FROM playground.ga_sessions_20140621 GROUP BY device.isMobile ORDER BY total_pageviews;
  • 20. basic queries with a twist SELECT IF(DOMAIN(trafficSource.source) is null, trafficSource.source, DOMAIN(trafficSource.source)) AS normalized_source, SUM ( totals.transactions ) AS total_transactions FROM playground.ga_sessions_20140621 GROUP BY normalized_source ORDER BY total_transactions;
  • 21. Average amount spent per visit SELECT ( SUM(total_transactionrevenue_per_user) / SUM(total_visits_per_user) ) AS avg_revenue_by_user_per_visit FROM ( SELECT SUM(totals.visits) AS total_visits_per_user, SUM( totals.transactionRevenue ) AS total_transactionrevenue_per_user, visitorId FROM playground.ga_sessions_20140621 WHERE totals.visits>0 AND totals.transactions>=1 AND totals.transactionRevenue IS NOT NULL GROUP BY visitorId ) ;
  • 23. Users who bought product A, also bought product B SELECT hits.item.productName AS other_purchased_products, COUNT(hits.item.productName) AS quantity FROM playground.ga_sessions_20140621 WHERE fullVisitorId IN ( SELECT fullVisitorId FROM playground.ga_sessions_20140621 WHERE hits.item.productName CONTAINS 'Light Helmet' AND totals.transactions>=1 GROUP BY fullVisitorId ) AND hits.item.productName IS NOT NULL AND hits.item.productName !='Light Helmet' GROUP BY other_purchased_products ORDER BY quantity DESC;
  • 24. SELECT prod_name, count(*) as transactions FROM ( SELECT fullVisitorId, min(date) AS date, visitId, hits.item.productName as prod_name FROM ( SELECT fullVisitorId, date, visitId, totals.transactions, hits.item.productName FROM (TABLE_DATE_RANGE([dataset.ga_sessions_], TIMESTAMP('2014-06-01'), TIMESTAMP('2014-06-14'))) ) WHERE fullVisitorId IN ( SELECT fullVisitorId FROM (TABLE_DATE_RANGE([dataset.ga_sessions_], TIMESTAMP('2014-06-01'), TIMESTAMP('2014-06-14'))) GROUP BY fullVisitorId HAVING SUM(totals.transactions) > 1 ) AND hits.item.productName IS NOT NULL GROUP BY fullVisitorId, visitId, prod_name ORDER BY fullVisitorId DESC ) GROUP BY prod_name ORDER BY transactions DESC; * example query from the lunametrics blog. Check them out for more awesomeness Products that are purchased and lead to other products being purchased
  • 25. Identify user path/user actions SELECT fullvisitorID, visitID, visitNumber, hits.page.pagePath FROM playground.ga_sessions_20140621 where hits.type='PAGE' order by fullvisitorID, visitID, hits.hitnumber asc
  • 26. individual users data is awesome Cross CRM data with individual users actions to see how your response to incidents affect your users. Use the “frequently bought together” query and find users who didn't buy the related products. Send an e-mail campaign with an offer for those products.
  • 27. integrating with external data sources * Connectors/REST API * Export into GCS * Import into BigQuery javier ramirez @supercoco9 https://teowaki.com
  • 28. What if I don't have a GA Premium Account?
  • 29. just send your own data define a data structure that fits your needs (or replicate the one GA provides), use a JS snippet to send data to your server, then to BigQuery** ..you will miss many of the GA dimensions, but you can keep using GA and use BigQuery only for your unsampled data datawaki ** If you want to do this without managing your own servers, we can help you javier ramirez @supercoco9 https://teowaki.com
  • 30. BigQuery pricing $20 per stored TB A site with 50m pageviews, would pay less than $10 a month per every 6 months worth of data $5 per processed TB *the 1st TB every month is free of charge ** GA premium get $500 free credit monthly javier ramirez @supercoco9 https://teowaki.com
  • 31. for GA premium users BigQuery is effectively for free *unless you upload huge external data or make huge queries javier ramirez @supercoco9 https://teowaki.com
  • 32. Want to know more? https://cloud.google.com/products/bigquery/ https://datawaki.com Need help? https://teowaki.com/services Thanks! Javier Ramírez @supercoco9