SlideShare a Scribd company logo
Snowflake Data Cloud
Optimization Fun!
How to Optimize your
Snowflake Data Cloud
Best Practices and Tips
from a Snowflake Data
Superhero
• Intro – Speaker and the Snowflake Data Cloud
• Snowflake Evolution and Difference
• Snowflake Consumption Pricing
• Snowflake Optimization
– Cost
– Performance
– Security
• Optimization Techniques
– Hard Way
– Easy Way (Automation Tools)
• New Cost and Resource Monitoring Tools coming in 2022
– Resource Groups
– Budget Assignment
DATACONLA – 2022 - Agenda
Frank Bell
*Main author of the book Snowflake Essentials
*Top Snowflake Data Thought Leader at Accenture
*Started Snowflake LA Users Group 2018
*Ran top Snowflake Data Cloud Consulting Practice
[acquired by Fairway/Accenture in 2019]
*25+ years of data .. “stuff” 
*Created Snowflake Solutions and Snoptimizer
https://snowflakesolutions.net – Snowflake Business User and Developer Community with
Knowledge Repository and Snowflake Tools
https://snoptimizer.com – Automated Snowflake Data Cloud Optimization Service
Snowflake Data Superhero
*Creator of Snowflake Solutions and Snoptimizer
• Snowflake Data Cloud is:
• A cloud based database and
interconnected data system
that can handle multiple
workloads
• Fully connected data cloud
with “no-copy” data sharing,
data cloning, and data usage
enabled within a cloud
provider region.
What is the Snowflake Data Cloud?
KEY POINTS:
*Unified system
*Connects companies and data
providers to data
*Single and seamless experience
*Run across multiple public cloud
*Removes massive amounts of
friction from data access and
processing
• 2014-2018 – Snowflake Database (Focused on Data
Warehouse)
• Structured and Semi-Structured Data
• 2019-2020
• 2021 Snowflake Data Cloud
• Workloads: Data Warehousing, Data Lakes, Data
Applications, Data Science, etc.
• Data Types: Structured,Semi-Structured,Unstructured
Snowflake Evolution
Frank’s Snowflake Differentiators: (from all other data systems)
#1: Architected on cloud providers separated storage
and compute.
#2: Micro-Partition Architecture
#3: ”no-copy” data cloning – enables DataOps and
true Agile Data Systems/Applications
#4: “no-copy” data sharing – “game changer”
Why is the Snowflake Data Cloud
so different? (Part 1)
Frank’s Snowflake Differentiators: (continued)
#5: Capability to process Structured, Semi-Structured
and fully Unstructured data all in one system
#6: End to End data processing and data science
workloads in one fully secure and governed system.
#7: in progress
#8: in progress
Why is the Snowflake Data Cloud
so different? (Part 2)
• Overall Snowflake is Awesome
but it is not continuously optimized
for cost, security, or performance
• Snowflake continues to add excellent features and many
of them improve performance but also adds cost,
security, and performance complexity. -
Snowflake
Consumption Based Pricing
Consumption Based
Pricing is Awesome!
Unless you have:
1. no optimizations
2. resource constraints!
Some Snowflake
Customers who have a
budget of $50,000/year
can blow through it in 3
days without
optimization
Medium to large size
customers who do not
optimize extensively can
miss out on saving
$5,000+/month
Automated Snowflake Data
Cloud Optimization is what
keeps Consumption Pricing
on Snowflake Awesome!
Then it's not …
Resource Monitors, warehouse
Optimization, etc.
Cost
Roles, Warehouses, Privileges, Grants,
Stale Users, Network Policies, and 30
more tests
Security
Queuing, Spilling, etc.
Performance
Snowflake Optimization Areas
Resource Monitors, warehouse
Optimization, etc.
Cost
Roles, Warehouses, Privileges, Grants,
Stale Users, Network Policies, and 30
more tests
Security
Queuing, Spilling, etc.
Performance
Snowflake Cost Optimization
Best Practices
Most costs found here
Query Consumption
Typically should be low
Automatic Clustering
What out for stages & time
travel
Storage Consumption
Need to monitor
Search Optimization
Need to monitor
Materialized Views
Typically low but you still need
to monitor
Cloud Services
Typically low - very efficient
Pipes
Need to monitor if you have
replication set up
Replication
Cost Areas on Snowflake
Before After
Customer A Profile
• Capacity Contract -$100,000/year
• Data Platform Size: 10+ TBs
• Uses Snowpipe for IOT data
• $12,000+/month consumption costs
Optimization Results
• Cost Optimization around Load
Warehouses with non-optimized
suspend and parameters setting
• Exact code to fix above issues
• $10,200/month consumption costs
Cost Optimization
Tip #1
Cost Optimization
Credit Consumption
Optimization - Using
Resource Monitors
(and soon Resource
Groups)
*Put actual code here
on Resource Monitor
Best Practices
Tip #2
Cost Optimization
Warehouse
Optimizations – DO
NOT USE DEFAULTS!
(Auto-Suspend – give
details)
*Place code here
Tip #3
Cost Optimization
Cloud Data Storage
Common Cost Issues
*Anti-Pattern
*Put actual code
here?
Tip #4
Cost Optimization
x Data Loading
optimization tips and
tricks
*Add actual code here
Snowflake Optimization
Techniques
Snowflake Optimization Technique Services Available PROS CONS
*Reporting on Costs. Manual
Optimizations
*link to cost reporting Quick to implement. Not comprehensive at all. Also, very
reactive versus proactive. Major
problems can occur on cost,
performance, and security very quickly.
*Custom Coded automated optimization
system
* More thorough and typically more
proactive than reporting and finding
error culture.
Extremely expensive. Time-consuming.
Easily outdated if large staff of
performance experts is not continuously
used.
*Relying on Human based Consulting or
Snowflake Professional Service Health
Checks
Snowflake Usually engages reasonably detailed
Snowflake Consulting Experts
Expensive and not repeatable.
Prone to human error. Is outdated often
within days of it being finished.
*Fully Automated SaaS services that
automate optimization
Snoptimizer
Nadilytics SaaS
Security [. ]
Automates the tremendous complexity
of optimizing Snowflake for cost,
performance and security continuously.
Medium Cost
?
Resource Monitors, warehouse
Optimization, etc.
Cost
Roles, Warehouses, Privileges, Grants,
Stale Users, Network Policies, and 30
more tests
Security
Queuing, Spilling, etc.
Performance
Snowflake Performance Optimization
Best Practices
Tip #1
Performance
Optimization
Snowflake Queuing
Optimization
*Code
Tip #2
Performance
Optimization
Snowflake Spilling
Optimization
*
Code
Tip #3
Performance
Optimization
Data Loading
Optimization
*Code goes here
Customer C Profile
• Capacity Contract -$200,000/year
• Data Platform Size: 50++ TBs
• 3TB ++ size table with non-optimized
cluster keys - Many queries running
over several minutes
• Slower External Table Queries
Optimization Results
• Improved Cluster Keys and queries
above 1 minute all fell under 1 minute
• Materialized Views for certain External
Tables - Massive Query Performance
Improvement
Before After
Performance
Optimization
Resource Monitors, warehouse
Optimization, etc.
Cost
Roles, Warehouses, Privileges, Grants,
Stale Users, Network Policies, and 30
more tests
Security
Queuing, Spilling, etc.
Performance
Snowflake Security Optimization
Best Practices
Tip #1
Security
Optimization
Security Opt
*Add code
Tip #2 Security opt
*Add code
Security
Optimization
Tip #3 Security opt
*Add code here
Security
Optimization
Tip #4 Security opt
*Add Code
Security
Optimization
Customer B Profile
• Capacity Contract: $50,000/year
• Data Platform Size: 1-2TBs
• Significant Cost Risk due to a large
amount of users having CREATE
WAREHOUSE at any size granted.
• Stale objects with too many users
having access
• Stages not properly secured
Optimization Results
• Improved RBAC, cost risk significantly
reduced
• No stale objects
• Stages properly secured
• Continuous Security Analysis and
Protection
Before After
Security Optimization
Optimization Results
• In our tests, on average we have seen 10-30% cost savings,
1000s of security issues fixed and 100s of performance
problems solved.
• In some implementations we have seen almost 50% cost
savings.
Contact Us
Frank Bell, Snowflake Data Superhero [3rd year]
fbell@itstrategists.com
https://snowflakesolutions.net
https://snoptimizer.com
Snowflake Questions?
Join LA
Snowflake
Users Group
and Meetup
https://www.meetup.com/los-angeles-business-snowflake-users-group/

More Related Content

Similar to Data Con LA 2022 - Supercharge your Snowflake Data Cloud from a Snowflake Data super hero

gkkCloudtechnologyassociate(cta)day 2
gkkCloudtechnologyassociate(cta)day 2gkkCloudtechnologyassociate(cta)day 2
gkkCloudtechnologyassociate(cta)day 2
Anne Starr
 
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
DATAVERSITY
 
IBM Cloud Day January 2021 - A well architected data lake
IBM Cloud Day January 2021 - A well architected data lakeIBM Cloud Day January 2021 - A well architected data lake
IBM Cloud Day January 2021 - A well architected data lake
Torsten Steinbach
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
DATAVERSITY
 
Benefits of Cloud Hosting and SaaS Solutions for IT Solution Providers and th...
Benefits of Cloud Hosting and SaaS Solutions for IT Solution Providers and th...Benefits of Cloud Hosting and SaaS Solutions for IT Solution Providers and th...
Benefits of Cloud Hosting and SaaS Solutions for IT Solution Providers and th...
Janine Soika
 
2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics
DATAVERSITY
 
Data Architecture Best Practices for Advanced Analytics
Data Architecture Best Practices for Advanced AnalyticsData Architecture Best Practices for Advanced Analytics
Data Architecture Best Practices for Advanced Analytics
DATAVERSITY
 
Estimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics PlatformEstimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics Platform
DATAVERSITY
 
The Shifting Landscape of Data Integration
The Shifting Landscape of Data IntegrationThe Shifting Landscape of Data Integration
The Shifting Landscape of Data Integration
DATAVERSITY
 
Denver Big Data Analytics Day
Denver Big Data Analytics DayDenver Big Data Analytics Day
Denver Big Data Analytics Day
Zivaro Inc
 
Curiosity and Lemontree present - Data Breaks DevOps: Why you need automated ...
Curiosity and Lemontree present - Data Breaks DevOps: Why you need automated ...Curiosity and Lemontree present - Data Breaks DevOps: Why you need automated ...
Curiosity and Lemontree present - Data Breaks DevOps: Why you need automated ...
Curiosity Software Ireland
 
Snowflake: The Good, the Bad and the Ugly
Snowflake: The Good, the Bad and the UglySnowflake: The Good, the Bad and the Ugly
Snowflake: The Good, the Bad and the Ugly
SamanthaBerlant
 
Data & Analytics - Session 1 - Big Data Analytics
Data & Analytics - Session 1 -  Big Data AnalyticsData & Analytics - Session 1 -  Big Data Analytics
Data & Analytics - Session 1 - Big Data Analytics
Amazon Web Services
 
Webinar: Ten Ways to Enhance Your Salesforce.com Application in 2013
Webinar: Ten Ways to Enhance Your Salesforce.com Application in 2013Webinar: Ten Ways to Enhance Your Salesforce.com Application in 2013
Webinar: Ten Ways to Enhance Your Salesforce.com Application in 2013
Emtec Inc.
 
FInal Project - USMx CC605x Cloud Computing for Enterprises - Hugo Aquino
FInal Project - USMx CC605x Cloud Computing for Enterprises - Hugo AquinoFInal Project - USMx CC605x Cloud Computing for Enterprises - Hugo Aquino
FInal Project - USMx CC605x Cloud Computing for Enterprises - Hugo Aquino
Hugo Aquino
 
How Greenhouse Software Unlocked the Power of Machine Data Analytics with Sum...
How Greenhouse Software Unlocked the Power of Machine Data Analytics with Sum...How Greenhouse Software Unlocked the Power of Machine Data Analytics with Sum...
How Greenhouse Software Unlocked the Power of Machine Data Analytics with Sum...
Amazon Web Services
 
Gcp dataflow
Gcp dataflowGcp dataflow
Gcp dataflow
Igor Roiter
 
Demystifying Data Virtualization (ASEAN)
Demystifying Data Virtualization (ASEAN)Demystifying Data Virtualization (ASEAN)
Demystifying Data Virtualization (ASEAN)
Denodo
 
cloud-training-pricing-billing.pdf
cloud-training-pricing-billing.pdfcloud-training-pricing-billing.pdf
cloud-training-pricing-billing.pdf
Abhi850745
 
Understanding the Cloud and the Benefits for the Accountancy Sector - Present...
Understanding the Cloud and the Benefits for the Accountancy Sector - Present...Understanding the Cloud and the Benefits for the Accountancy Sector - Present...
Understanding the Cloud and the Benefits for the Accountancy Sector - Present...
LouisaHDUK
 

Similar to Data Con LA 2022 - Supercharge your Snowflake Data Cloud from a Snowflake Data super hero (20)

gkkCloudtechnologyassociate(cta)day 2
gkkCloudtechnologyassociate(cta)day 2gkkCloudtechnologyassociate(cta)day 2
gkkCloudtechnologyassociate(cta)day 2
 
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
 
IBM Cloud Day January 2021 - A well architected data lake
IBM Cloud Day January 2021 - A well architected data lakeIBM Cloud Day January 2021 - A well architected data lake
IBM Cloud Day January 2021 - A well architected data lake
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
 
Benefits of Cloud Hosting and SaaS Solutions for IT Solution Providers and th...
Benefits of Cloud Hosting and SaaS Solutions for IT Solution Providers and th...Benefits of Cloud Hosting and SaaS Solutions for IT Solution Providers and th...
Benefits of Cloud Hosting and SaaS Solutions for IT Solution Providers and th...
 
2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics
 
Data Architecture Best Practices for Advanced Analytics
Data Architecture Best Practices for Advanced AnalyticsData Architecture Best Practices for Advanced Analytics
Data Architecture Best Practices for Advanced Analytics
 
Estimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics PlatformEstimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics Platform
 
The Shifting Landscape of Data Integration
The Shifting Landscape of Data IntegrationThe Shifting Landscape of Data Integration
The Shifting Landscape of Data Integration
 
Denver Big Data Analytics Day
Denver Big Data Analytics DayDenver Big Data Analytics Day
Denver Big Data Analytics Day
 
Curiosity and Lemontree present - Data Breaks DevOps: Why you need automated ...
Curiosity and Lemontree present - Data Breaks DevOps: Why you need automated ...Curiosity and Lemontree present - Data Breaks DevOps: Why you need automated ...
Curiosity and Lemontree present - Data Breaks DevOps: Why you need automated ...
 
Snowflake: The Good, the Bad and the Ugly
Snowflake: The Good, the Bad and the UglySnowflake: The Good, the Bad and the Ugly
Snowflake: The Good, the Bad and the Ugly
 
Data & Analytics - Session 1 - Big Data Analytics
Data & Analytics - Session 1 -  Big Data AnalyticsData & Analytics - Session 1 -  Big Data Analytics
Data & Analytics - Session 1 - Big Data Analytics
 
Webinar: Ten Ways to Enhance Your Salesforce.com Application in 2013
Webinar: Ten Ways to Enhance Your Salesforce.com Application in 2013Webinar: Ten Ways to Enhance Your Salesforce.com Application in 2013
Webinar: Ten Ways to Enhance Your Salesforce.com Application in 2013
 
FInal Project - USMx CC605x Cloud Computing for Enterprises - Hugo Aquino
FInal Project - USMx CC605x Cloud Computing for Enterprises - Hugo AquinoFInal Project - USMx CC605x Cloud Computing for Enterprises - Hugo Aquino
FInal Project - USMx CC605x Cloud Computing for Enterprises - Hugo Aquino
 
How Greenhouse Software Unlocked the Power of Machine Data Analytics with Sum...
How Greenhouse Software Unlocked the Power of Machine Data Analytics with Sum...How Greenhouse Software Unlocked the Power of Machine Data Analytics with Sum...
How Greenhouse Software Unlocked the Power of Machine Data Analytics with Sum...
 
Gcp dataflow
Gcp dataflowGcp dataflow
Gcp dataflow
 
Demystifying Data Virtualization (ASEAN)
Demystifying Data Virtualization (ASEAN)Demystifying Data Virtualization (ASEAN)
Demystifying Data Virtualization (ASEAN)
 
cloud-training-pricing-billing.pdf
cloud-training-pricing-billing.pdfcloud-training-pricing-billing.pdf
cloud-training-pricing-billing.pdf
 
Understanding the Cloud and the Benefits for the Accountancy Sector - Present...
Understanding the Cloud and the Benefits for the Accountancy Sector - Present...Understanding the Cloud and the Benefits for the Accountancy Sector - Present...
Understanding the Cloud and the Benefits for the Accountancy Sector - Present...
 

More from Data Con LA

Data Con LA 2022 Keynotes
Data Con LA 2022 KeynotesData Con LA 2022 Keynotes
Data Con LA 2022 Keynotes
Data Con LA
 
Data Con LA 2022 Keynotes
Data Con LA 2022 KeynotesData Con LA 2022 Keynotes
Data Con LA 2022 Keynotes
Data Con LA
 
Data Con LA 2022 Keynote
Data Con LA 2022 KeynoteData Con LA 2022 Keynote
Data Con LA 2022 Keynote
Data Con LA
 
Data Con LA 2022 - Startup Showcase
Data Con LA 2022 - Startup ShowcaseData Con LA 2022 - Startup Showcase
Data Con LA 2022 - Startup Showcase
Data Con LA
 
Data Con LA 2022 Keynote
Data Con LA 2022 KeynoteData Con LA 2022 Keynote
Data Con LA 2022 Keynote
Data Con LA
 
Data Con LA 2022 - Using Google trends data to build product recommendations
Data Con LA 2022 - Using Google trends data to build product recommendationsData Con LA 2022 - Using Google trends data to build product recommendations
Data Con LA 2022 - Using Google trends data to build product recommendations
Data Con LA
 
Data Con LA 2022 - AI Ethics
Data Con LA 2022 - AI EthicsData Con LA 2022 - AI Ethics
Data Con LA 2022 - AI Ethics
Data Con LA
 
Data Con LA 2022 - Improving disaster response with machine learning
Data Con LA 2022 - Improving disaster response with machine learningData Con LA 2022 - Improving disaster response with machine learning
Data Con LA 2022 - Improving disaster response with machine learning
Data Con LA
 
Data Con LA 2022 - What's new with MongoDB 6.0 and Atlas
Data Con LA 2022 - What's new with MongoDB 6.0 and AtlasData Con LA 2022 - What's new with MongoDB 6.0 and Atlas
Data Con LA 2022 - What's new with MongoDB 6.0 and Atlas
Data Con LA
 
Data Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentationData Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentation
Data Con LA
 
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
Data Con LA
 
Data Con LA 2022 - Moving Data at Scale to AWS
Data Con LA 2022 - Moving Data at Scale to AWSData Con LA 2022 - Moving Data at Scale to AWS
Data Con LA 2022 - Moving Data at Scale to AWS
Data Con LA
 
Data Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA 2022 - Collaborative Data Exploration using Conversational AIData Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA
 
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
Data Con LA
 
Data Con LA 2022 - Intro to Data Science
Data Con LA 2022 - Intro to Data ScienceData Con LA 2022 - Intro to Data Science
Data Con LA 2022 - Intro to Data Science
Data Con LA
 
Data Con LA 2022 - How are NFTs and DeFi Changing Entertainment
Data Con LA 2022 - How are NFTs and DeFi Changing EntertainmentData Con LA 2022 - How are NFTs and DeFi Changing Entertainment
Data Con LA 2022 - How are NFTs and DeFi Changing Entertainment
Data Con LA
 
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA
 
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
Data Con LA
 
Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA 2022- Embedding medical journeys with machine learning to improve...Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA
 
Data Con LA 2022 - Data Streaming with Kafka
Data Con LA 2022 - Data Streaming with KafkaData Con LA 2022 - Data Streaming with Kafka
Data Con LA 2022 - Data Streaming with Kafka
Data Con LA
 

More from Data Con LA (20)

Data Con LA 2022 Keynotes
Data Con LA 2022 KeynotesData Con LA 2022 Keynotes
Data Con LA 2022 Keynotes
 
Data Con LA 2022 Keynotes
Data Con LA 2022 KeynotesData Con LA 2022 Keynotes
Data Con LA 2022 Keynotes
 
Data Con LA 2022 Keynote
Data Con LA 2022 KeynoteData Con LA 2022 Keynote
Data Con LA 2022 Keynote
 
Data Con LA 2022 - Startup Showcase
Data Con LA 2022 - Startup ShowcaseData Con LA 2022 - Startup Showcase
Data Con LA 2022 - Startup Showcase
 
Data Con LA 2022 Keynote
Data Con LA 2022 KeynoteData Con LA 2022 Keynote
Data Con LA 2022 Keynote
 
Data Con LA 2022 - Using Google trends data to build product recommendations
Data Con LA 2022 - Using Google trends data to build product recommendationsData Con LA 2022 - Using Google trends data to build product recommendations
Data Con LA 2022 - Using Google trends data to build product recommendations
 
Data Con LA 2022 - AI Ethics
Data Con LA 2022 - AI EthicsData Con LA 2022 - AI Ethics
Data Con LA 2022 - AI Ethics
 
Data Con LA 2022 - Improving disaster response with machine learning
Data Con LA 2022 - Improving disaster response with machine learningData Con LA 2022 - Improving disaster response with machine learning
Data Con LA 2022 - Improving disaster response with machine learning
 
Data Con LA 2022 - What's new with MongoDB 6.0 and Atlas
Data Con LA 2022 - What's new with MongoDB 6.0 and AtlasData Con LA 2022 - What's new with MongoDB 6.0 and Atlas
Data Con LA 2022 - What's new with MongoDB 6.0 and Atlas
 
Data Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentationData Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentation
 
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
 
Data Con LA 2022 - Moving Data at Scale to AWS
Data Con LA 2022 - Moving Data at Scale to AWSData Con LA 2022 - Moving Data at Scale to AWS
Data Con LA 2022 - Moving Data at Scale to AWS
 
Data Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA 2022 - Collaborative Data Exploration using Conversational AIData Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA 2022 - Collaborative Data Exploration using Conversational AI
 
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
 
Data Con LA 2022 - Intro to Data Science
Data Con LA 2022 - Intro to Data ScienceData Con LA 2022 - Intro to Data Science
Data Con LA 2022 - Intro to Data Science
 
Data Con LA 2022 - How are NFTs and DeFi Changing Entertainment
Data Con LA 2022 - How are NFTs and DeFi Changing EntertainmentData Con LA 2022 - How are NFTs and DeFi Changing Entertainment
Data Con LA 2022 - How are NFTs and DeFi Changing Entertainment
 
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
 
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
 
Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA 2022- Embedding medical journeys with machine learning to improve...Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA 2022- Embedding medical journeys with machine learning to improve...
 
Data Con LA 2022 - Data Streaming with Kafka
Data Con LA 2022 - Data Streaming with KafkaData Con LA 2022 - Data Streaming with Kafka
Data Con LA 2022 - Data Streaming with Kafka
 

Recently uploaded

社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
NABLAS株式会社
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
Oppotus
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
Tiktokethiodaily
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
pchutichetpong
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 

Recently uploaded (20)

社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 

Data Con LA 2022 - Supercharge your Snowflake Data Cloud from a Snowflake Data super hero

  • 1. Snowflake Data Cloud Optimization Fun! How to Optimize your Snowflake Data Cloud Best Practices and Tips from a Snowflake Data Superhero
  • 2. • Intro – Speaker and the Snowflake Data Cloud • Snowflake Evolution and Difference • Snowflake Consumption Pricing • Snowflake Optimization – Cost – Performance – Security • Optimization Techniques – Hard Way – Easy Way (Automation Tools) • New Cost and Resource Monitoring Tools coming in 2022 – Resource Groups – Budget Assignment DATACONLA – 2022 - Agenda
  • 3. Frank Bell *Main author of the book Snowflake Essentials *Top Snowflake Data Thought Leader at Accenture *Started Snowflake LA Users Group 2018 *Ran top Snowflake Data Cloud Consulting Practice [acquired by Fairway/Accenture in 2019] *25+ years of data .. “stuff”  *Created Snowflake Solutions and Snoptimizer https://snowflakesolutions.net – Snowflake Business User and Developer Community with Knowledge Repository and Snowflake Tools https://snoptimizer.com – Automated Snowflake Data Cloud Optimization Service Snowflake Data Superhero *Creator of Snowflake Solutions and Snoptimizer
  • 4. • Snowflake Data Cloud is: • A cloud based database and interconnected data system that can handle multiple workloads • Fully connected data cloud with “no-copy” data sharing, data cloning, and data usage enabled within a cloud provider region. What is the Snowflake Data Cloud? KEY POINTS: *Unified system *Connects companies and data providers to data *Single and seamless experience *Run across multiple public cloud *Removes massive amounts of friction from data access and processing
  • 5. • 2014-2018 – Snowflake Database (Focused on Data Warehouse) • Structured and Semi-Structured Data • 2019-2020 • 2021 Snowflake Data Cloud • Workloads: Data Warehousing, Data Lakes, Data Applications, Data Science, etc. • Data Types: Structured,Semi-Structured,Unstructured Snowflake Evolution
  • 6. Frank’s Snowflake Differentiators: (from all other data systems) #1: Architected on cloud providers separated storage and compute. #2: Micro-Partition Architecture #3: ”no-copy” data cloning – enables DataOps and true Agile Data Systems/Applications #4: “no-copy” data sharing – “game changer” Why is the Snowflake Data Cloud so different? (Part 1)
  • 7. Frank’s Snowflake Differentiators: (continued) #5: Capability to process Structured, Semi-Structured and fully Unstructured data all in one system #6: End to End data processing and data science workloads in one fully secure and governed system. #7: in progress #8: in progress Why is the Snowflake Data Cloud so different? (Part 2)
  • 8. • Overall Snowflake is Awesome but it is not continuously optimized for cost, security, or performance • Snowflake continues to add excellent features and many of them improve performance but also adds cost, security, and performance complexity. - Snowflake Consumption Based Pricing
  • 9. Consumption Based Pricing is Awesome! Unless you have: 1. no optimizations 2. resource constraints! Some Snowflake Customers who have a budget of $50,000/year can blow through it in 3 days without optimization Medium to large size customers who do not optimize extensively can miss out on saving $5,000+/month Automated Snowflake Data Cloud Optimization is what keeps Consumption Pricing on Snowflake Awesome! Then it's not …
  • 10. Resource Monitors, warehouse Optimization, etc. Cost Roles, Warehouses, Privileges, Grants, Stale Users, Network Policies, and 30 more tests Security Queuing, Spilling, etc. Performance Snowflake Optimization Areas
  • 11. Resource Monitors, warehouse Optimization, etc. Cost Roles, Warehouses, Privileges, Grants, Stale Users, Network Policies, and 30 more tests Security Queuing, Spilling, etc. Performance Snowflake Cost Optimization Best Practices
  • 12. Most costs found here Query Consumption Typically should be low Automatic Clustering What out for stages & time travel Storage Consumption Need to monitor Search Optimization Need to monitor Materialized Views Typically low but you still need to monitor Cloud Services Typically low - very efficient Pipes Need to monitor if you have replication set up Replication Cost Areas on Snowflake
  • 13. Before After Customer A Profile • Capacity Contract -$100,000/year • Data Platform Size: 10+ TBs • Uses Snowpipe for IOT data • $12,000+/month consumption costs Optimization Results • Cost Optimization around Load Warehouses with non-optimized suspend and parameters setting • Exact code to fix above issues • $10,200/month consumption costs Cost Optimization
  • 14. Tip #1 Cost Optimization Credit Consumption Optimization - Using Resource Monitors (and soon Resource Groups) *Put actual code here on Resource Monitor Best Practices
  • 15. Tip #2 Cost Optimization Warehouse Optimizations – DO NOT USE DEFAULTS! (Auto-Suspend – give details) *Place code here
  • 16. Tip #3 Cost Optimization Cloud Data Storage Common Cost Issues *Anti-Pattern *Put actual code here?
  • 17. Tip #4 Cost Optimization x Data Loading optimization tips and tricks *Add actual code here
  • 18. Snowflake Optimization Techniques Snowflake Optimization Technique Services Available PROS CONS *Reporting on Costs. Manual Optimizations *link to cost reporting Quick to implement. Not comprehensive at all. Also, very reactive versus proactive. Major problems can occur on cost, performance, and security very quickly. *Custom Coded automated optimization system * More thorough and typically more proactive than reporting and finding error culture. Extremely expensive. Time-consuming. Easily outdated if large staff of performance experts is not continuously used. *Relying on Human based Consulting or Snowflake Professional Service Health Checks Snowflake Usually engages reasonably detailed Snowflake Consulting Experts Expensive and not repeatable. Prone to human error. Is outdated often within days of it being finished. *Fully Automated SaaS services that automate optimization Snoptimizer Nadilytics SaaS Security [. ] Automates the tremendous complexity of optimizing Snowflake for cost, performance and security continuously. Medium Cost ?
  • 19. Resource Monitors, warehouse Optimization, etc. Cost Roles, Warehouses, Privileges, Grants, Stale Users, Network Policies, and 30 more tests Security Queuing, Spilling, etc. Performance Snowflake Performance Optimization Best Practices
  • 23. Customer C Profile • Capacity Contract -$200,000/year • Data Platform Size: 50++ TBs • 3TB ++ size table with non-optimized cluster keys - Many queries running over several minutes • Slower External Table Queries Optimization Results • Improved Cluster Keys and queries above 1 minute all fell under 1 minute • Materialized Views for certain External Tables - Massive Query Performance Improvement Before After Performance Optimization
  • 24. Resource Monitors, warehouse Optimization, etc. Cost Roles, Warehouses, Privileges, Grants, Stale Users, Network Policies, and 30 more tests Security Queuing, Spilling, etc. Performance Snowflake Security Optimization Best Practices
  • 26. Tip #2 Security opt *Add code Security Optimization
  • 27. Tip #3 Security opt *Add code here Security Optimization
  • 28. Tip #4 Security opt *Add Code Security Optimization
  • 29. Customer B Profile • Capacity Contract: $50,000/year • Data Platform Size: 1-2TBs • Significant Cost Risk due to a large amount of users having CREATE WAREHOUSE at any size granted. • Stale objects with too many users having access • Stages not properly secured Optimization Results • Improved RBAC, cost risk significantly reduced • No stale objects • Stages properly secured • Continuous Security Analysis and Protection Before After Security Optimization
  • 30. Optimization Results • In our tests, on average we have seen 10-30% cost savings, 1000s of security issues fixed and 100s of performance problems solved. • In some implementations we have seen almost 50% cost savings.
  • 31. Contact Us Frank Bell, Snowflake Data Superhero [3rd year] fbell@itstrategists.com https://snowflakesolutions.net https://snoptimizer.com Snowflake Questions?
  • 32. Join LA Snowflake Users Group and Meetup https://www.meetup.com/los-angeles-business-snowflake-users-group/