SlideShare a Scribd company logo
1 of 10
BigQuery by Google Cloud
What is BigQuery ?
• BigQuery is a web service that lets you do
interactive analysis of massive datasets—up to
billions of rows. Scalable and easy to use,
BigQuery lets developers and businesses tap into
powerful data analytics on demand.
• BigQuery allows you to execute any business
query like a SQL query against very large
datasets, with potentially billions of rows.
• It is an OLAP (online analytical processing) system
and not an OLTP (online transactional processing)
system like MySql.
History of BigQuery
• Google uses Dremel ( internal code/project )
to do all sort of Analysis/Monitoring on
BigData ( Search, YouTube, AdWords, Gmail ).
BigQuery is externalization of Dremel.
• Hence, BigQuery gives us a stable platform
which has been tested all these years
extensively with credibility of Google.
How fast & Scalable it is !
• It can Scan 35 Billion Rows Without an Index
in Tens of Seconds.
• Dremel, the cloud-powered massively parallel
query service, shares Google’s infrastructure,
so it can parallelize each query and run it on
tens of thousands of servers simultaneously
By the way how it is so fast !
• Basic architecture which makes it so fast
includes following components:
– Columnar Storage: Data is stored in a columnar
storage fashion. All the columns are stored on
different servers ( even thousands of commodity
servers )
– Tree Architecture is used for dispatching queries
and aggregating results across thousands of
machines in a few seconds.
How to access it ?
Very simple to start using it. No need of
extensive programming knowledge. It can be
accessed via:
• Web-browser
• Command line tool
• REST API using client libraries in Java, python
etc.
Pricing
• Free to start with like Google App Engine.
• Only If usage exceeds a threshold; pricing is
based on:
– Storage ( $0.12 per GB/month )
– Query Processing ( only data processed in
columns, not entire tables)
• Batch Queries ( $0.02 per GB processed)
• Interactive Queries ( $0.035 per GB processed)
Comparison with MapReduce
• BigQuery is suitable for OLAP (Online Analytical
Processing) or BI (Business Intelligence) usage,
where most of the queries are simple and done
through a quick aggregation and filtering by a set
of columns (dimensions).
• Best for ad hoc queries or trialand-error data
analysis.
• MapReduce is a better choice when you want to
process unstructured data programmatically or if
you need to output gigabytes of data, as in the
case of merging two big tables.
Demo
• Visit https://bigquery.cloud.google.com/
• Sample Query:
SELECT title, COUNT(title) as count
FROM publicdata:samples.wikipedia
WHERE (REGEXP_MATCH(title,r'ww'ww'))
GROUP BY title
ORDER BY count DESC;
matches "ne'er", "we'll", "speak'st", "you'll" and so on in 313 million rows
within few seconds.
publicdata: ProjectId, samples is dataset, wikipediais table name
Have a good day at
BarCamp Bangalore 2013
Deepak Singhal
deepakagra@gmail.com
in.linkedin.com/in/deepakagra

More Related Content

Recently uploaded

Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...
Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...
Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...
drm1699
 
Team Transformation Tactics for Holistic Testing and Quality (NewCrafts Paris...
Team Transformation Tactics for Holistic Testing and Quality (NewCrafts Paris...Team Transformation Tactics for Holistic Testing and Quality (NewCrafts Paris...
Team Transformation Tactics for Holistic Testing and Quality (NewCrafts Paris...
Lisi Hocke
 

Recently uploaded (20)

Automate your OpenSIPS config tests - OpenSIPS Summit 2024
Automate your OpenSIPS config tests - OpenSIPS Summit 2024Automate your OpenSIPS config tests - OpenSIPS Summit 2024
Automate your OpenSIPS config tests - OpenSIPS Summit 2024
 
Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...
Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...
Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...
 
Microsoft365_Dev_Security_2024_05_16.pdf
Microsoft365_Dev_Security_2024_05_16.pdfMicrosoft365_Dev_Security_2024_05_16.pdf
Microsoft365_Dev_Security_2024_05_16.pdf
 
Test Automation Design Patterns_ A Comprehensive Guide.pdf
Test Automation Design Patterns_ A Comprehensive Guide.pdfTest Automation Design Patterns_ A Comprehensive Guide.pdf
Test Automation Design Patterns_ A Comprehensive Guide.pdf
 
Abortion Pill Prices Germiston ](+27832195400*)[ 🏥 Women's Abortion Clinic in...
Abortion Pill Prices Germiston ](+27832195400*)[ 🏥 Women's Abortion Clinic in...Abortion Pill Prices Germiston ](+27832195400*)[ 🏥 Women's Abortion Clinic in...
Abortion Pill Prices Germiston ](+27832195400*)[ 🏥 Women's Abortion Clinic in...
 
Abortion Pill Prices Mthatha (@](+27832195400*)[ 🏥 Women's Abortion Clinic In...
Abortion Pill Prices Mthatha (@](+27832195400*)[ 🏥 Women's Abortion Clinic In...Abortion Pill Prices Mthatha (@](+27832195400*)[ 🏥 Women's Abortion Clinic In...
Abortion Pill Prices Mthatha (@](+27832195400*)[ 🏥 Women's Abortion Clinic In...
 
Team Transformation Tactics for Holistic Testing and Quality (NewCrafts Paris...
Team Transformation Tactics for Holistic Testing and Quality (NewCrafts Paris...Team Transformation Tactics for Holistic Testing and Quality (NewCrafts Paris...
Team Transformation Tactics for Holistic Testing and Quality (NewCrafts Paris...
 
The mythical technical debt. (Brooke, please, forgive me)
The mythical technical debt. (Brooke, please, forgive me)The mythical technical debt. (Brooke, please, forgive me)
The mythical technical debt. (Brooke, please, forgive me)
 
Transformer Neural Network Use Cases with Links
Transformer Neural Network Use Cases with LinksTransformer Neural Network Use Cases with Links
Transformer Neural Network Use Cases with Links
 
Weeding your micro service landscape.pdf
Weeding your micro service landscape.pdfWeeding your micro service landscape.pdf
Weeding your micro service landscape.pdf
 
Your Ultimate Web Studio for Streaming Anywhere | Evmux
Your Ultimate Web Studio for Streaming Anywhere | EvmuxYour Ultimate Web Studio for Streaming Anywhere | Evmux
Your Ultimate Web Studio for Streaming Anywhere | Evmux
 
Prompt Engineering - an Art, a Science, or your next Job Title?
Prompt Engineering - an Art, a Science, or your next Job Title?Prompt Engineering - an Art, a Science, or your next Job Title?
Prompt Engineering - an Art, a Science, or your next Job Title?
 
Auto Affiliate AI Earns First Commission in 3 Hours..pdf
Auto Affiliate  AI Earns First Commission in 3 Hours..pdfAuto Affiliate  AI Earns First Commission in 3 Hours..pdf
Auto Affiliate AI Earns First Commission in 3 Hours..pdf
 
Effective Strategies for Wix's Scaling challenges - GeeCon
Effective Strategies for Wix's Scaling challenges - GeeConEffective Strategies for Wix's Scaling challenges - GeeCon
Effective Strategies for Wix's Scaling challenges - GeeCon
 
Abortion Pill Prices Jane Furse ](+27832195400*)[ 🏥 Women's Abortion Clinic i...
Abortion Pill Prices Jane Furse ](+27832195400*)[ 🏥 Women's Abortion Clinic i...Abortion Pill Prices Jane Furse ](+27832195400*)[ 🏥 Women's Abortion Clinic i...
Abortion Pill Prices Jane Furse ](+27832195400*)[ 🏥 Women's Abortion Clinic i...
 
Community is Just as Important as Code by Andrea Goulet
Community is Just as Important as Code by Andrea GouletCommunity is Just as Important as Code by Andrea Goulet
Community is Just as Important as Code by Andrea Goulet
 
Anypoint Code Builder - Munich MuleSoft Meetup - 16th May 2024
Anypoint Code Builder - Munich MuleSoft Meetup - 16th May 2024Anypoint Code Builder - Munich MuleSoft Meetup - 16th May 2024
Anypoint Code Builder - Munich MuleSoft Meetup - 16th May 2024
 
OpenChain Webinar: AboutCode and Beyond - End-to-End SCA
OpenChain Webinar: AboutCode and Beyond - End-to-End SCAOpenChain Webinar: AboutCode and Beyond - End-to-End SCA
OpenChain Webinar: AboutCode and Beyond - End-to-End SCA
 
COMPUTER AND ITS COMPONENTS PPT.by naitik sharma Class 9th A mittal internati...
COMPUTER AND ITS COMPONENTS PPT.by naitik sharma Class 9th A mittal internati...COMPUTER AND ITS COMPONENTS PPT.by naitik sharma Class 9th A mittal internati...
COMPUTER AND ITS COMPONENTS PPT.by naitik sharma Class 9th A mittal internati...
 
Lessons Learned from Building a Serverless Notifications System.pdf
Lessons Learned from Building a Serverless Notifications System.pdfLessons Learned from Building a Serverless Notifications System.pdf
Lessons Learned from Building a Serverless Notifications System.pdf
 

Featured

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

Featured (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Basics of BigQuery

  • 2. What is BigQuery ? • BigQuery is a web service that lets you do interactive analysis of massive datasets—up to billions of rows. Scalable and easy to use, BigQuery lets developers and businesses tap into powerful data analytics on demand. • BigQuery allows you to execute any business query like a SQL query against very large datasets, with potentially billions of rows. • It is an OLAP (online analytical processing) system and not an OLTP (online transactional processing) system like MySql.
  • 3. History of BigQuery • Google uses Dremel ( internal code/project ) to do all sort of Analysis/Monitoring on BigData ( Search, YouTube, AdWords, Gmail ). BigQuery is externalization of Dremel. • Hence, BigQuery gives us a stable platform which has been tested all these years extensively with credibility of Google.
  • 4. How fast & Scalable it is ! • It can Scan 35 Billion Rows Without an Index in Tens of Seconds. • Dremel, the cloud-powered massively parallel query service, shares Google’s infrastructure, so it can parallelize each query and run it on tens of thousands of servers simultaneously
  • 5. By the way how it is so fast ! • Basic architecture which makes it so fast includes following components: – Columnar Storage: Data is stored in a columnar storage fashion. All the columns are stored on different servers ( even thousands of commodity servers ) – Tree Architecture is used for dispatching queries and aggregating results across thousands of machines in a few seconds.
  • 6. How to access it ? Very simple to start using it. No need of extensive programming knowledge. It can be accessed via: • Web-browser • Command line tool • REST API using client libraries in Java, python etc.
  • 7. Pricing • Free to start with like Google App Engine. • Only If usage exceeds a threshold; pricing is based on: – Storage ( $0.12 per GB/month ) – Query Processing ( only data processed in columns, not entire tables) • Batch Queries ( $0.02 per GB processed) • Interactive Queries ( $0.035 per GB processed)
  • 8. Comparison with MapReduce • BigQuery is suitable for OLAP (Online Analytical Processing) or BI (Business Intelligence) usage, where most of the queries are simple and done through a quick aggregation and filtering by a set of columns (dimensions). • Best for ad hoc queries or trialand-error data analysis. • MapReduce is a better choice when you want to process unstructured data programmatically or if you need to output gigabytes of data, as in the case of merging two big tables.
  • 9. Demo • Visit https://bigquery.cloud.google.com/ • Sample Query: SELECT title, COUNT(title) as count FROM publicdata:samples.wikipedia WHERE (REGEXP_MATCH(title,r'ww'ww')) GROUP BY title ORDER BY count DESC; matches "ne'er", "we'll", "speak'st", "you'll" and so on in 313 million rows within few seconds. publicdata: ProjectId, samples is dataset, wikipediais table name
  • 10. Have a good day at BarCamp Bangalore 2013 Deepak Singhal deepakagra@gmail.com in.linkedin.com/in/deepakagra