Frank Bell, Data Thought Leader and Snowflake SME at Accenture - CEO at ITS
We will cover all aspects of optimizing your Snowflake Data Cloud including:
*Dive deep into how Snowflake's pay-as-you-go costs work and how, by utilizing our proven optimization tools (Snoptimizer, our SaaS Snowflake optimizer - https://snoptimizer.com/), scripts, and architecture techniques, you can typically save 10-40% or more on your existing Snowflake account costs (a minimal cost-control sketch follows this list).
*Explain how Snowflake compute works and proven techniques for architecting warehouses for both cost and performance efficiency. We cover in depth how Snowflake scales BOTH out and in as well as up and down with compute resources.
*Explain how Snowflake data storage works with replication, Time Travel, and cloning. We explain these awesome features as well as their downsides if they are used and configured incorrectly.
*Cover Snowflake cloud services costs and the features that have costs related to them, including Snowpipe, Search Optimization, materialized views, auto-clustering, and other recent cost-based features that provide value at a cost.
*Finally, we will discuss how you can ensure your Snowflake account(s) are fully optimized not just for cost but also for security and performance. We will show you security and performance best practices as well as pitfalls to avoid.
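As a rough illustration of the kinds of cost controls described above, here is a minimal Python sketch built on the snowflake-connector-python package. The account, credentials, warehouse name, credit quota, and retention values are placeholder assumptions for illustration only; they are not recommendations from the session and are not part of Snoptimizer.

```python
# Minimal Snowflake cost-control sketch. Account, credentials, warehouse
# names, and all thresholds below are illustrative placeholders.
import snowflake.connector

conn = snowflake.connector.connect(
    account="my_account",   # placeholder
    user="my_user",         # placeholder
    password="********",    # placeholder
    role="ACCOUNTADMIN",
)
cur = conn.cursor()

# 1. Find the warehouses that burned the most credits over the last 30 days.
cur.execute("""
    SELECT warehouse_name, SUM(credits_used) AS credits
    FROM snowflake.account_usage.warehouse_metering_history
    WHERE start_time >= DATEADD('day', -30, CURRENT_TIMESTAMP())
    GROUP BY warehouse_name
    ORDER BY credits DESC
""")
for warehouse_name, credits in cur.fetchall():
    print(warehouse_name, credits)

# 2. Suspend an idle warehouse quickly and bound its multi-cluster range
#    (multi-cluster warehouses require Enterprise edition or higher).
cur.execute("""
    ALTER WAREHOUSE reporting_wh SET
      AUTO_SUSPEND = 60
      AUTO_RESUME = TRUE
      MIN_CLUSTER_COUNT = 1
      MAX_CLUSTER_COUNT = 3
""")

# 3. Cap monthly spend with a resource monitor that notifies, then suspends.
cur.execute("""
    CREATE OR REPLACE RESOURCE MONITOR monthly_cap
      WITH CREDIT_QUOTA = 100
      FREQUENCY = MONTHLY
      START_TIMESTAMP = IMMEDIATELY
      TRIGGERS ON 90 PERCENT DO NOTIFY
               ON 100 PERCENT DO SUSPEND
""")
cur.execute("ALTER WAREHOUSE reporting_wh SET RESOURCE_MONITOR = monthly_cap")

# 4. Zero-copy clone for dev work, with a short Time Travel retention window
#    so the clone does not quietly accumulate storage costs.
cur.execute("CREATE DATABASE IF NOT EXISTS analytics_dev CLONE analytics")
cur.execute("ALTER DATABASE analytics_dev SET DATA_RETENTION_TIME_IN_DAYS = 1")

cur.close()
conn.close()
```

The metering query points you at the warehouses worth tuning first; the auto-suspend setting, cluster bounds, resource monitor, and short retention window are the kinds of knobs the session walks through in detail.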
[EN] Building modern data pipeline with Snowflake + DBT + Airflow.pdf - Chris Hoyean Song
I'm posting the slide presented at the Snowflake user group meet up.
NFT Bank has introduced DBT to rebuild and operate the entire data pipeline from scratch.
Data quality control and monitoring are critical as data is at the core of the company.
You can manage numerous data validation tests in an organized way, and you can add a data validation test with just a single line of YAML.
If you implement your data pipeline on top of DBT, you can build the data catalog and data lineage docs with little extra effort.
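To make the "single line of YAML" point concrete, here is a minimal sketch of a dbt schema file and the commands that run the tests and build the docs. The project layout, model name `orders`, and column `order_id` are hypothetical placeholders, not NFT Bank's actual project.

```python
# Minimal dbt test + docs sketch, meant to be run inside an existing dbt
# project. The model and column names are hypothetical placeholders.
import subprocess
from pathlib import Path

schema_yml = """
version: 2
models:
  - name: orders
    columns:
      - name: order_id
        tests: [not_null, unique]   # one short line of YAML per column's tests
"""

Path("models").mkdir(exist_ok=True)
Path("models/schema.yml").write_text(schema_yml)

# Install package dependencies, run the data validation tests, then build
# the data catalog / lineage documentation site.
subprocess.run(["dbt", "deps"], check=True)
subprocess.run(["dbt", "test"], check=True)
subprocess.run(["dbt", "docs", "generate"], check=True)
```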
---
Session 1: Data Quality & Productivity
Data Quality
Data Quality Validation
Data Catalog, Lineage Documentation
DBT Introduction
Session 2: Integrate DBT with Airflow
DBT Cloud or Airflow?
Astronomer Cosmos
dbt deps
Session 3: Cost Optimization
Query Optimization
Cost Monitoring
Mastering Cloud Data Cost Control: A FinOps Approach - Denodo
Watch full webinar here: https://buff.ly/3uu8dEy
With the rise of cloud-first initiatives and pay-per-use systems, forecasting IT costs has become a challenge. It's easy to start small, but it's equally easy to get skyrocketing bills with little warning. FinOps is a discipline that tries to tackle these issues, by providing the framework to understand and optimize cloud costs in a more controlled manner. The Denodo Platform, being a middleware layer in charge of global data delivery, sits in a privileged position not only to help us understand where costs are coming from, but also to take action, manage, and reduce them.
Attend this session to learn:
- The importance of FinOps in a cloud architecture
- How the Denodo Platform can help you collect and visualize key FinOps metrics to understand where your costs are coming from
- What actions and controls the Denodo Platform offers to keep costs at bay
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach - Denodo
Watch full webinar here: https://buff.ly/4bYOOgb
With the rise of cloud-first initiatives and pay-per-use systems, forecasting IT costs has become a challenge. It's easy to start small, but it's equally easy to get skyrocketing bills with little warning. FinOps is a discipline that tries to tackle these issues, by providing the framework to understand and optimize cloud costs in a more controlled manner. The Denodo Platform, being a middleware layer in charge of global data delivery, sits in a privileged position not only to help us understand where costs are coming from, but also to take action, manage, and reduce them.
Attend this session to learn:
- The importance of FinOps in a cloud architecture.
- How the Denodo Platform can help you collect and visualize key FinOps metrics to understand where your costs are coming from.
- What actions and controls the Denodo Platform offers to keep costs at bay.
Lessons Learned Replatforming A Large Machine Learning Application To Apache ... - Databricks
Morningstar’s Risk Model project is created by stitching together statistical and machine learning models to produce risk and performance metrics for millions of financial securities. Previously, we were running a single version of this application, but needed to expand it to allow for customizations based on client demand. With the goal of running hundreds of custom Risk Model runs at once at an output size of around 1TB of data each, we had a challenging technical problem on our hands! In this presentation, we’ll talk about the challenges we faced replatforming this application to Spark, how we solved them, and the benefits we saw.
Some things we’ll touch on include how we created customized models, the architecture of our machine learning application, how we maintain an audit trail of data transformations (for rigorous third party audits), and how we validate the input data our model takes in and output data our model produces. We want the attendees to walk away with some key ideas of what worked for us when productizing a large scale machine learning platform.
AWS December 2015 Webinar Series - Strategies to Quantify TCO & Optimize Cost... - Amazon Web Services
AWS allows customers to save money and optimize costs in multiple ways. By adopting AWS, organizations can reduce capital expenses, shift to an operating model, improve business performance, and drive savings over time. Organizations that adopt AWS have the tools to move from forecast-based capacity planning to an on-demand model with no termination fees or complex agreements. By moving to AWS, customers can reduce total cost of ownership (TCO) and continue to see increased savings over time. In addition to reducing TCO, AWS empowers customers to optimize costs by providing tools and partner solutions that help them identify what they are consuming and right-size the services their business needs, using services only when they are necessary for production. These solutions allow customers to pay only for what they need, for the right capacity and time of consumption, reducing idle time and unnecessary sunk costs.
In this webinar, you will learn strategies directly from an AWS Product Manager and understand how a customer (FINRA) used Splunk to develop a cost optimization model that helps drive value and continued lower costs.
Learning Objectives:
Dive deeper into the economics of the cloud and understand how AWS can positively impact your organization
Learn how a customer gained real-time visibility into instance cost and usage to reduce spending
Who Should Attend:
IT managers, Sr. IT professionals, business decision makers, procurement managers, developers, sys admins, operations
FSI201 FINRA’s Managed Data Lake – Next Gen Analytics in the Cloud - Amazon Web Services
FINRA’s Data Lake unlocks the value in its data to accelerate analytics and machine learning at scale. FINRA's Technology group has changed its customer's relationship with data by creating a Managed Data Lake that enables discovery on Petabytes of capital markets data, while saving time and money over traditional analytics solutions. FINRA’s Managed Data Lake includes a centralized data catalog and separates storage from compute, allowing users to query from petabytes of data in seconds. Learn how FINRA uses Spot instances and services such as Amazon S3, Amazon EMR, Amazon Redshift, and AWS Lambda to provide the 'right tool for the right job' at each step in the data processing pipeline. All of this is done while meeting FINRA’s security and compliance responsibilities as a financial regulator.
ADV Slides: Building and Growing Organizational Analytics with Data Lakes - DATAVERSITY
Data lakes are providing immense value to organizations embracing data science.
In this webinar, William will discuss the value of having broad, detailed, and seemingly obscure data available in cloud storage for purposes of expanding Data Science in the organization.
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha... - DATAVERSITY
Thirty years is a long time for a technology foundation to be as active as relational databases. Are their replacements here? In this webinar, we say no.
Databases have not sat around while Hadoop emerged. The Hadoop era generated a ton of interest and confusion, but is it still relevant as organizations are deploying cloud storage like a kid in a candy store? We’ll discuss what platforms to use for what data. This is a critical decision that can dictate two to five times additional work effort if it’s a bad fit.
Drop the herd mentality. In reality, there is no “one size fits all” right now. We need to make our platform decisions amidst this backdrop.
This webinar will distinguish these analytic deployment options and help you platform 2020 and beyond for success.
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture - DATAVERSITY
Whether to take data ingestion cycles off the ETL tool and the data warehouse or to facilitate competitive Data Science and building algorithms in the organization, the data lake – a place for unmodeled and vast data – will be provisioned widely in 2020.
Though it doesn’t have to be complicated, the data lake has a few key design points that are critical, and it does need to follow some principles for success. Avoid building the data swamp, but not the data lake! The tool ecosystem is building up around the data lake and soon many will have a robust lake and data warehouse. We will discuss policy to keep them straight, send data to its best platform, and keep users’ confidence up in their data platforms.
Data lakes will be built in cloud object storage. We’ll discuss the options there as well.
Get this data point for your data lake journey.
Benefits of Cloud Hosting and SaaS Solutions for IT Solution Providers and th... - Janine Soika
This presentation provides information on what Cloud, Hosting, and SaaS solutions are, and how they can be resold by IT Solution providers to build a profitable monthly recurring business, and the cost savings benefits to end customers for using these solutions.
It is a fascinating, explosive time for enterprise analytics.
It is from the position of analytics leadership that the mission will be executed and company leadership will emerge. The data professional is absolutely sitting on the performance of the company in this information economy and has an obligation to demonstrate the possibilities and originate the architecture, data, and projects that will deliver analytics. After all, no matter what business you’re in, you’re in the business of analytics.
The coming years will be full of big changes in enterprise analytics and Data Architecture. William will kick off the fourth year of the Advanced Analytics series with a discussion of the trends winning organizations should build into their plans, expectations, vision, and awareness now.
Data Architecture Best Practices for Advanced Analytics - DATAVERSITY
Many organizations are immature when it comes to data and analytics use. The answer lies in delivering a greater level of insight from data, straight to the point of need.
There are so many Data Architecture best practices today, accumulated from years of practice. In this webinar, William will look at some Data Architecture best practices that he believes have emerged in the past two years and are not worked into many enterprise data programs yet. These are keepers and will be required to move towards, by one means or another, so it’s best to mindfully work them into the environment.
Estimating the Total Costs of Your Cloud Analytics Platform - DATAVERSITY
Organizations today need a broad set of enterprise data cloud services with key data functionality to modernize applications and utilize machine learning. They need a platform designed to address multi-faceted needs by offering multi-function Data Management and analytics to solve the enterprise’s most pressing data and analytic challenges in a streamlined fashion. They need a worry-free experience with the architecture and its components.
The Shifting Landscape of Data Integration - DATAVERSITY
Enterprises and organizations from every industry and scale are working to leverage data to achieve their strategic objectives — whether they are to be more profitable, effective, risk-tolerant, prepared, sustainable, and/or adaptable in an ever-changing world. Data has exploded in volume during the last decade as humans and machines alike produce data at an exponential pace. Also, exciting technologies have emerged around that data to improve our abilities and capabilities around what we can do with data.
Behind this data revolution, there are forces at work, causing enterprises to shift the way they leverage data and accelerate the demand for leverageable data. Organizations (and the climates in which they operate) are becoming more and more complex. They are also becoming increasingly digital and, thus, dependent on how data informs, transforms, and automates their operations and decisions. With increased digitization comes an increased need for both scale and agility at scale.
In this session, we have undertaken an ambitious goal of evaluating the current vendor landscape and assessing which platforms have made, or are in the process of making, the leap to this new generation of Data Management and integration capabilities.
Many companies have discovered that there is “gold” in their server log files and machine data. Closely monitoring this data can improve security, help prevent costly outages and reduce the time it takes to recover from a problem. In this presentation, GTRI’s Micah Montgomery explains how operational intelligence can be gained from machine data, and how Splunk Enterprise can turn this data into actionable insights. Also presenting was NetApp’s Steve Fritzinger, who discussed how to manage the challenges of capturing and storing a flood of data without breaking the bank.
Presented at "Denver Big Data Analytics Day" on May 18, 2016 at GTRI.
Curiosity and Lemontree present - Data Breaks DevOps: Why you need automated ... - Curiosity Software Ireland
This webinar was co-hosted by Curiosity and Lemontree on April 22nd, 2021. Watch the webinar on demand - https://opentestingplatform.curiositysoftware.ie/data-breaks-devops-webinar
DevOps and continuous delivery are only as fast as their slowest part. For many organisations, testing remains the major sticking point. It’s viewed as a necessary bottleneck, at fault for delaying releases, yet still unable to catch bugs before they hit production. One persistent, yet often overlooked, barrier is commonly at fault: test data. Data is the place to improve release velocity and quality today.
For many test teams today, test data delays remain their greatest bottleneck. Many still rely on a central team for data provisioning, before spending further time finding and making the data they need for a particular test suite. This siloed “request and receive” approach to data provisioning will always be a game of catch-up. Development is constantly getting faster, releasing systems that require increasingly complex data. Manually finding, securing and copying that data will never be able to keep up.
Delivering quality systems at speed instead requires on demand access to rich and interrelated data. With today’s technologies, that means “allocating” data during CI/CD processes and automated testing, making rich and compliant data available to parallel teams and frameworks automatically.
This webinar will present a pragmatic approach for moving from current test data processes to “just in time” data allocation. Veteran test data innovator, Huw Price, will offer cutting edge techniques for allocating rich test data from a range of sources on-the-fly. This “Test Data Automation” ensures that every test and tester has the data they need, exactly when and where they need it.
Learn how to solve the top 3 challenges Snowflake customers face, and what you can do to ensure high-performance, intelligent analytics at any scale. Ideal for those currently using Snowflake and those considering it.
https://www.brighttalk.com/webcast/18317/422499
Learn more about the tools, techniques and technologies for working productively with data at any scale. This presentation introduces the family of data analytics tools on AWS which you can use to collect, compute and collaborate around data, from gigabytes to petabytes. We'll discuss Amazon Elastic MapReduce, Hadoop, structured and unstructured data, and the EC2 instance types which enable high performance analytics.
Jon Einkauf, Senior Product Manager, Elastic MapReduce, AWS
Alan Priestley, Marketing Manager, Intel and Bob Harris, CTO, Channel 4
How Greenhouse Software Unlocked the Power of Machine Data Analytics with Sum... - Amazon Web Services
Sumo Logic offers a powerful cloud-native analytics solution that supports all types of machine data. Our platform integrates easily with your AWS infrastructure supporting fast, accurate and secure analysis and monitoring of enormous amounts of data—giving you clear and direct visibility into its operations.
In this webinar, you’ll learn how organizations such as Greenhouse Software harness cloud-native machine data analytics to optimize internal and external process lifecycles, monitor the health of all AWS applications and services, and deliver a WOW application to their end users.
Watch full webinar here: https://bit.ly/3JlhTnT
In the last few years, Data Virtualization technology has experienced tremendous growth, emerging as a key component for enabling modern data architectures such as the logical data warehouse, data fabric, and data mesh.
Gartner recently named it “a must-have data integration component” and estimated that it results in 45% cost savings in data integration, while Forrester has estimated 65% faster data delivery than ETL processes.
However, there are still misconceptions in the market about data virtualization technology, how it can be leveraged, and the real benefits that it can provide.
Catch this on-demand session where we review these misconceptions and discuss:
- What data virtualization is and what it is not
- Key capabilities of a modern data virtualization platform
- How to leverage data virtualization for faster data delivery
Understanding the Cloud and the Benefits for the Accountancy Sector - Present... - LouisaHDUK
This seminar will provide an overview of the Cloud and its application to the accountancy sector. The session is aimed at accountants in practice and business who wish to understand more about the benefits and risks associated with Cloud Computing and how they might introduce it into their business.
We will define the Cloud; introduce the options available to your practice; highlight the benefits; and address the common concerns.
This will be put into context with real life case studies from some accountancy clients that have implemented our Cloud solutions. This will provide a valuable insight into the practicalities of going down a hosted route.
By the end of the seminar you will understand what is meant by Cloud Computing; appreciate the benefits and issues; and be able to evaluate if this type of solution is applicable to your business.
Similar to Data Con LA 2022 - Supercharge your Snowflake Data Cloud from a Snowflake Data super hero
Data Con LA 2022 - Using Google trends data to build product recommendations - Data Con LA
Mike Limcaco, Analytics Specialist / Customer Engineer at Google
Measure trends in a particular topic or search term on Google Search across the US, down to the city level. Integrate these data signals into analytic pipelines to drive product, retail, and media (video, audio, digital content) recommendations tailored to your audience segment. We'll discuss how Google's unique datasets can be used with Google Cloud's smart analytics services to process, enrich, and surface the most relevant product or content that matches the ever-changing interests of your local customer segment.
Melinda Thielbar, Data Science Practice Lead and Director of Data Science at Fidelity Investments
From corporations to governments to private individuals, most of the AI community has recognized the growing need to incorporate ethics into the development and maintenance of AI models. Much of the current discussion, though, is meant for leaders and managers. This talk is directed to data scientists, data engineers, ML Ops specialists, and anyone else who is responsible for the hands-on, day-to-day work of building, productionalizing, and maintaining AI models. We'll give a short overview of the business case for why technical AI expertise is critical to developing an AI Ethics strategy. Then we'll discuss the technical problems that cause AI models to behave unethically, how to detect problems at all phases of model development, and the tools and techniques that are available to support technical teams in Ethical AI development.
Data Con LA 2022 - Improving disaster response with machine learning - Data Con LA
Antje Barth, Principal Developer Advocate, AI/ML at AWS & Chris Fregly, Principal Engineer, AI & ML at AWS
The frequency and severity of natural disasters are increasing. In response, governments, businesses, nonprofits, and international organizations are placing more emphasis on disaster preparedness and response. Many organizations are accelerating their efforts to make their data publicly available for others to use. Repositories such as the Registry of Open Data on AWS and Humanitarian Data Exchange contain troves of data available for use by developers, data scientists, and machine learning practitioners. In this session, see how a community of developers came together through the AWS Disaster Response hackathon to build models to support natural disaster preparedness and response.
Data Con LA 2022 - What's new with MongoDB 6.0 and Atlas - Data Con LA
Sig Narvaez, Executive Solution Architect at MongoDB
MongoDB is now a Developer Data Platform. Come learn what's new in the 6.0 release and Atlas following all the recent announcements made at MongoDB World 2022. Topics will include:
- Atlas Search which combines 3 systems into one (database, search engine, and sync mechanisms) letting you focus on your product's differentiation.
- Atlas Data Federation to seamlessly query, transform, and aggregate data from one or more MongoDB Atlas databases, Atlas Data Lake and AWS S3 buckets
- Queryable Encryption lets you run expressive queries on fully randomized encrypted data to meet the most stringent security requirements
- Relational Migrator which analyzes your existing relational schemas and helps you design a new MongoDB schema.
- And more!
Data Con LA 2022 - Real world consumer segmentation - Data Con LA
Jaysen Gillespie, Head of Analytics and Data Science at RTB House
1. Shopkick has over 30M downloads, but the userbase is very heterogeneous. Anecdotal evidence indicated a wide variety of users for whom the app holds long-term appeal.
2. Marketing and other teams challenged Analytics to get beyond basic summary statistics and develop a holistic segmentation of the userbase.
3. Shopkick's data science team used SQL and Python to gather data, clean data, and then perform a data-driven segmentation using a k-means algorithm (a generic sketch follows this list).
4. Interpreting the results is more work -- and more fun -- than running the algo itself. We'll discuss how we transform from "segment 1", "segment 2", etc. to something that non-analytics users (Marketing, Operations, etc.) could actually benefit from.
5. So what? How did teams across Shopkick change their approach given what Analytics had discovered?
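To make point 3 concrete, here is a generic sketch of a k-means segmentation with scikit-learn. The behavioral features, the toy values, and the choice of three clusters are invented for illustration; they are not Shopkick's actual data or pipeline.

```python
# Generic k-means user segmentation sketch; features and k are illustrative.
import pandas as pd
from sklearn.preprocessing import StandardScaler
from sklearn.cluster import KMeans

# Hypothetical per-user behavioral features (gathered upstream with SQL).
users = pd.DataFrame({
    "visits_per_week":   [1.0, 7.0, 2.0, 9.0, 0.5, 6.0],
    "offers_redeemed":   [0,   12,  1,   20,  0,   9],
    "days_since_active": [40,  1,   25,  2,   90,  3],
})

# Scale features so no single column dominates the distance metric.
X = StandardScaler().fit_transform(users)

# Fit k-means and attach a segment label to each user.
kmeans = KMeans(n_clusters=3, n_init=10, random_state=42)
users["segment"] = kmeans.fit_predict(X)

# Summarize each segment so it can be interpreted and named for other teams.
print(users.groupby("segment").mean())
```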
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo... - Data Con LA
Ravi Pillala, Chief Data Architect & Distinguished Engineer at Intuit
TurboTax is one of the best-known consumer software brands, serving 385K+ concurrent users at its peak. In this session, we start by looking at how user behavioral data and tax domain events are captured in real time using the event bus and analyzed to drive real-time personalization with various TurboTax data pipelines. We will also look at solutions performing analytics that make use of these events, with the help of Kafka, Apache Flink, Apache Beam, Spark, Amazon S3, Amazon EMR, Redshift, Athena, and AWS Lambda functions. Finally, we look at how SageMaker is used to create the TurboTax model to predict if a customer is at risk or needs help.
Data Con LA 2022 - Moving Data at Scale to AWS - Data Con LA
George Mansoor, Chief Information Systems Officer at California State University
Overview of the CSU Data Architecture on moving on-prem ERP data to the AWS Cloud at scale using Delphix for Data Replication/Virtualization and AWS Data Migration Service (DMS) for data extracts
Data Con LA 2022 - Collaborative Data Exploration using Conversational AI - Data Con LA
Anand Ranganathan, Chief AI Officer at Unscrambl
Conversational AI is getting more and more widely used for customer support and employee support use-cases. In this session, I'm going to talk about how it can be extended for data analysis and data science use-cases ... i.e., how users can interact with a bot to ask analytical questions on data in relational databases.
This allows users to explore complex datasets using a combination of text and voice questions, in natural language, and then get back results in a combination of natural language and visualizations. Furthermore, it allows collaborative exploration of data by a group of users in a channel in platforms like Microsoft Teams, Slack or Google Chat.
For example, a group of users in a channel can ask questions to a bot in plain English like "How many cases of Covid were there in the last 2 months by state and gender" or "Why did the number of deaths from Covid increase in May 2022", and jointly look at the results that come back. This facilitates data awareness, data-driven collaboration, and joint decision making among teams in enterprises and outside.
In this talk, I'll describe how we can bring together various features including natural-language understanding, NL-to-SQL translation, dialog management, data story-telling, semantic modeling of data, and augmented analytics to facilitate collaborative exploration of data using conversational AI.
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ... - Data Con LA
Anil Inamdar, VP & Head of Data Solutions at Instaclustr
The most modernized enterprises utilize polyglot architecture, applying the best-suited database technologies to each of their organization's particular use cases. To successfully implement such an architecture, though, you need a thorough knowledge of the expansive NoSQL data technologies now available.
Attendees of this Data Con LA presentation will come away with:
-- A solid understanding of the decision-making process that should go into vetting NoSQL technologies and how to plan out their data modernization initiatives and migrations.
-- They will learn the types of functionality that best match the strengths of NoSQL key-value stores, graph databases, columnar databases, document-type databases, time-series databases, and more.
-- Attendees will also understand how to navigate database technology licensing concerns, and to recognize the types of vendors they'll encounter across the NoSQL ecosystem. This includes sniffing out open-core vendors that may advertise as "open source," but are driven by a business model that hinges on achieving proprietary lock-in.
-- Attendees will also learn to determine if vendors offer open-code solutions that apply restrictive licensing, or if they support true open source technologies like Hadoop, Cassandra, Kafka, OpenSearch, Redis, Spark, and many more that offer total portability and true freedom of use.
Data Con LA 2022 - Intro to Data Science - Data Con LA
Zia Khan, Computer Systems Analyst and Data Scientist at LearningFuze
This Data Science tutorial is designed for people who are new to data science. It is a beginner-level session, so no prior coding or technical knowledge is required; just bring your laptop with WiFi capability. The session starts with a review of what data science is, the amount of data we generate, and how companies are using that data to get insight. We will pick a business use case and define the data science process, followed by a hands-on lab using Python and a Jupyter notebook. During the hands-on portion we will work with the pandas, numpy, matplotlib, and sklearn modules and use a machine learning algorithm to approach the business use case.
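As a tiny, generic taste of the hands-on flow described above (load data with pandas, then fit and evaluate a model with sklearn), here is a sketch on a made-up churn-style dataset; the columns, values, and model choice are placeholders, not the session's actual lab.

```python
# Toy end-to-end sketch: load, split, train, evaluate.
# The churn use case and all values are invented placeholders.
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

df = pd.DataFrame({
    "monthly_spend": [20, 85, 15, 95, 40, 70, 10, 60],
    "support_calls": [0, 3, 1, 4, 1, 2, 0, 3],
    "churned":       [0, 1, 0, 1, 0, 1, 0, 1],
})

X = df[["monthly_spend", "support_calls"]]
y = df["churned"]
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=0)

model = LogisticRegression().fit(X_train, y_train)
print("accuracy:", accuracy_score(y_test, model.predict(X_test)))
```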
Data Con LA 2022 - How are NFTs and DeFi Changing Entertainment - Data Con LA
Mariana Danilovic, Managing Director at Infiom, LLC
We will address:
(1) Community creation and engagement using tokens and NFTs
(2) Organization of DAO structures and ways to incentivize Web3 communities
(3) DeFi business models applied to Web3 ventures
(4) Why Metaverse matters for new entertainment and community engagement models.
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat... - Data Con LA
Curtis ODell, Global Director Data Integrity at Tricentis
Join me to learn about a new end-to-end data testing approach designed for modern data pipelines that fills dangerous gaps left by traditional data management tools—one designed to handle structured and unstructured data from any source. You'll hear how you can use unique automation technology to reach up to 90 percent test coverage rates and deliver trustworthy analytical and operational data at scale. Several real world use cases from major banks/finance, insurance, health analytics, and Snowflake examples will be presented.
Key Learning Objective
1. Data journeys are complex and you have to ensure integrity of the data end to end across this journey from source to end reporting for compliance
2. Data Management tools do not test data, they profile and monitor at best, and leave serious gaps in your data testing coverage
3. Automation with integration into DevOps and DataOps CI/CD processes is key to solving this.
4. How this approach has impact in your vertical
Data Con LA 2022 - Perfect Viral Ad prediction of Superbowl 2022 using Tease, T... - Data Con LA
Arif Ansari, Professor at University of Southern California
A Super Bowl ad costs $7 million, and each year a few Super Bowl ads go viral. Traditional A/B testing does not predict virality. Some highly shared ads reach over 60 million organic views, which can be more valuable than views on TV. Not only are these views voluntary, but they are typically without distraction and win viewer engagement in the form of likes, comments, or shares. A Super Bowl ad that wins 69 million views on YouTube (e.g., Alexa Mind Reader) costs less than 10 cents per quality view! However, the challenge is triggering virality. We developed a method to predict virality and engineer virality into ads.
1. Prof. Gerard J. Tellis and co-authors recommended that advertisers use YouTube to tease, test, and tweak (TTT) their ads to maximize sharing and viewing. 2022 saw that maxim put into practice.
2. We developed viral Ads prediction using two scientific models:
a. Prof. Gerard Tellis et al.'s model for viral prediction
b. Deep Learning viral prediction using social media effect
3. The model was able to identify all of the top 15 viral ads; it performed better than the traditional agencies.
4. The newly proposed method is Tease, Test, Tweak, Target and Spots Ad.
Data Con LA 2022 - Embedding medical journeys with machine learning to improve... - Data Con LA
Jai Bansal, Senior Manager, Data Science at Aetna
This talk describes an internal data product called Member Embeddings that facilitates modeling of member medical journeys with machine learning.
Medical claims are the key data source we use to understand health journeys at Aetna. Claims are the data artifacts that result from our members' interactions with the healthcare system. Claims contain data like the amount the provider billed, the place of service, and provider specialty. The primary medical information in a claim is represented in codes that indicate the diagnoses, procedures, or drugs for which a member was billed. These codes give us a semi-structured view into the medical reason for each claim and so contain rich information about members' health journeys. However, since the codes themselves are categorical and high-dimensional (10K cardinality), it's challenging to extract insight or predictive power directly from the raw codes on a claim.
To transform claim codes into a more useful format for machine learning, we turned to the concept of embeddings. Word embeddings are widely used in natural language processing to provide numeric vector representations of individual words.
We use a similar approach with our claims data. We treat each claim code as a word or token and use embedding algorithms to learn lower-dimensional vector representations that preserve the original high-dimensional semantic meaning.
This process converts the categorical features into dense numeric representations. In our case, we use sequences of anonymized member claim diagnosis, procedure, and drug codes as training data. We tested a variety of algorithms to learn embeddings for each type of claim code.
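For intuition, here is a minimal sketch of the word2vec-style approach described above, using gensim on made-up sequences of claim codes; the codes, sequence data, and hyperparameters are illustrative placeholders, not Aetna's actual training setup.

```python
# Word2vec-style embedding of claim codes; all data and settings are
# illustrative placeholders, not a production configuration.
from gensim.models import Word2Vec

# Each "sentence" is one member's sequence of diagnosis/procedure/drug codes.
member_sequences = [
    ["E11.9", "83036", "RX_METFORMIN", "E11.9"],
    ["I10", "RX_LISINOPRIL", "93000", "I10"],
    ["E11.9", "RX_METFORMIN", "83036"],
    ["I10", "93000", "RX_LISINOPRIL"],
]

model = Word2Vec(
    sentences=member_sequences,
    vector_size=16,   # dense, low-dimensional representation per code
    window=5,
    min_count=1,
    sg=1,             # skip-gram
    epochs=50,
)

# Dense vector for one code, and its nearest neighbours in embedding space.
print(model.wv["E11.9"][:4])
print(model.wv.most_similar("E11.9", topn=3))
```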
We found that the trained embeddings showed relationships between codes that were reasonable from the point of view of subject matter experts. In addition, using the embeddings to predict future healthcare-related events outperformed other basic features, making this tool an easy way to improve predictive model performance and save data scientist time.
Data Con LA 2022 - Data Streaming with Kafka - Data Con LA
Jie Chen, Manager Advisory, KPMG
Data is the new oil. However, many organizations have fragmented data in siloed lines of business. In this talk, we will focus on identifying the legacy patterns and their limitations and introducing the new patterns backed by Kafka's core design ideas. The goal is to tirelessly pursue better solutions that let organizations overcome bottlenecks in their data pipelines and modernize their digital assets so they are ready to scale their businesses. In summary, we will walk through three use cases and offer dos and don'ts and takeaways for data engineers, data scientists, and data architects developing forefront data-oriented skills.
Opendatabay - Open Data Marketplace.pptx - Opendatabay
Opendatabay.com unlocks the power of data for everyone. Open Data Marketplace fosters a collaborative hub for data enthusiasts to explore, share, and contribute to a vast collection of datasets.
First ever open hub for data enthusiasts to collaborate and innovate. A platform to explore, share, and contribute to a vast collection of datasets. Through robust quality control and innovative technologies like blockchain verification, opendatabay ensures the authenticity and reliability of datasets, empowering users to make data-driven decisions with confidence. Leverage cutting-edge AI technologies to enhance the data exploration, analysis, and discovery experience.
From intelligent search and recommendations to automated data productisation and quotation, Opendatabay's AI-driven features streamline the data workflow. Finding the data you need shouldn't be complex. Opendatabay simplifies the data acquisition process with an intuitive interface and robust search tools. Effortlessly explore, discover, and access the data you need, allowing you to focus on extracting valuable insights. Opendatabay breaks new ground with dedicated, AI-generated synthetic datasets.
Leverage these privacy-preserving datasets for training and testing AI models without compromising sensitive information. Opendatabay prioritizes transparency by providing detailed metadata, provenance information, and usage guidelines for each dataset, ensuring users have a comprehensive understanding of the data they're working with. By leveraging a powerful combination of distributed ledger technology and rigorous third-party audits Opendatabay ensures the authenticity and reliability of every dataset. Security is at the core of Opendatabay. Marketplace implements stringent security measures, including encryption, access controls, and regular vulnerability assessments, to safeguard your data and protect your privacy.
Levelwise PageRank with Loop-Based Dead End Handling Strategy: SHORT REPORT ... - Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2... - pchutichetpong
M Capital Group (“MCG”) expects to see growing demand and an evolving supply picture: supply is being reshaped by institutional investment rotating out of offices and into work from home (“WFH”), while demand is driven by the ever-expanding need for data storage as global internet usage grows, with experts predicting 5.3 billion users by 2023. These market factors will be underpinned by technological changes, such as progressing cloud services and edge sites, allowing the industry to see strong expected annual growth of 13% over the next 4 years.
Whilst competitive headwinds remain, represented through the recent second bankruptcy filing of Sungard, which blames “COVID-19 and other macroeconomic trends including delayed customer spending decisions, insourcing and reductions in IT spending, energy inflation and reduction in demand for certain services”, the industry has seen key adjustments, where MCG believes that engineering cost management and technological innovation will be paramount to success.
MCG reports that the more favorable market conditions expected over the next few years, helped by the winding down of pandemic restrictions and a hybrid working environment will be driving market momentum forward. The continuous injection of capital by alternative investment firms, as well as the growing infrastructural investment from cloud service providers and social media companies, whose revenues are expected to grow over 3.6x larger by value in 2026, will likely help propel center provision and innovation. These factors paint a promising picture for the industry players that offset rising input costs and adapt to new technologies.
According to M Capital Group: “Specifically, the long-term cost-saving opportunities available from the rise of remote managing will likely aid value growth for the industry. Through margin optimization and further availability of capital for reinvestment, strong players will maintain their competitive foothold, while weaker players exit the market to balance supply and demand.”
Techniques to optimize the PageRank algorithm usually fall into two categories. One is to reduce the work per iteration, and the other is to reduce the number of iterations. These goals are often at odds with one another. Skipping computation on vertices which have already converged has the potential to save iteration time. Skipping in-identical vertices, i.e. vertices with the same in-links, helps reduce duplicate computations and thus could also reduce iteration time. Road networks often have chains which can be short-circuited before PageRank computation to improve performance, since the final ranks of chain nodes can be calculated easily; this could reduce both the iteration time and the number of iterations. If a graph has no dangling nodes, the PageRank of each strongly connected component can be computed in topological order, which could reduce the iteration time and the number of iterations, and also enable multi-iteration concurrency in the computation. The combination of all of the above methods is the STICD algorithm [sticd]. For dynamic graphs, unchanged components whose ranks are unaffected can be skipped altogether.
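As a small illustration of the first technique above (skipping vertices that have already converged), here is an illustrative Python sketch. It is not the STICD implementation; it assumes a networkx-style DiGraph with no dead ends and simply keeps an "active" frontier of vertices whose ranks may still change.

def pagerank_skip_converged(G, d=0.85, tol=1e-10, max_iter=100):
    # G: a networkx-style DiGraph with no dead ends.
    N = G.number_of_nodes()
    rank = {v: 1.0 / N for v in G}
    outdeg = dict(G.out_degree())
    active = set(G)  # vertices whose rank may still change
    for _ in range(max_iter):
        if not active:
            break
        next_active = set()
        for v in active:
            new = (1 - d) / N + d * sum(rank[u] / outdeg[u] for u in G.predecessors(v))
            if abs(new - rank[v]) > tol:
                # A change here can move the ranks of v and its out-neighbours next round.
                next_active.add(v)
                next_active.update(G.successors(v))
            rank[v] = new
        active = next_active
    return rank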
Data Con LA 2022 - Supercharge your Snowflake Data Cloud from a Snowflake Data super hero
1. Snowflake Data Cloud Optimization Fun! How to Optimize your Snowflake Data Cloud – Best Practices and Tips from a Snowflake Data Superhero
2. • Intro – Speaker and the Snowflake Data Cloud
• Snowflake Evolution and Difference
• Snowflake Consumption Pricing
• Snowflake Optimization
– Cost
– Performance
– Security
• Optimization Techniques
– Hard Way
– Easy Way (Automation Tools)
• New Cost and Resource Monitoring Tools coming in 2022
– Resource Groups
– Budget Assignment
DATACONLA – 2022 - Agenda
3. Frank Bell
*Main author of the book Snowflake Essentials
*Top Snowflake Data Thought Leader at Accenture
*Started Snowflake LA Users Group 2018
*Ran top Snowflake Data Cloud Consulting Practice
[acquired by Fairway/Accenture in 2019]
*25+ years of data .. “stuff”
*Created Snowflake Solutions and Snoptimizer
https://snowflakesolutions.net – Snowflake Business User and Developer Community with Knowledge Repository and Snowflake Tools
https://snoptimizer.com – Automated Snowflake Data Cloud Optimization Service
Snowflake Data Superhero
*Creator of Snowflake Solutions and Snoptimizer
4. What is the Snowflake Data Cloud?
• A cloud-based database and interconnected data system that can handle multiple workloads
• A fully connected data cloud with “no-copy” data sharing, data cloning, and data usage enabled within a cloud provider region
KEY POINTS:
*Unified system
*Connects companies and data providers to data
*Single and seamless experience
*Runs across multiple public clouds
*Removes massive amounts of friction from data access and processing
5. Snowflake Evolution
• 2014-2018 – Snowflake Database (focused on Data Warehouse)
  • Structured and Semi-Structured Data
• 2019-2020 – Snowflake Cloud Data Platform
• 2021 – Snowflake Data Cloud
  • Workloads: Data Warehousing, Data Lakes, Data Applications, Data Science, etc.
  • Data Types: Structured, Semi-Structured, Unstructured
6. Why is the Snowflake Data Cloud so different? (Part 1)
Frank’s Snowflake Differentiators (from all other data systems):
#1: Architected on cloud providers’ separated storage and compute
#2: Micro-Partition Architecture
#3: “no-copy” data cloning – enables DataOps and true Agile Data Systems/Applications
#4: “no-copy” data sharing – “game changer”
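As a concrete illustration of differentiator #3, a zero-copy clone takes a single statement. This is a small sketch using snowflake-connector-python; the database names and connection parameters are placeholders, not anything from the talk.

import snowflake.connector

conn = snowflake.connector.connect(account="my_account", user="my_user",
                                   password="...", role="SYSADMIN")
cur = conn.cursor()
# The clone initially shares micro-partitions with the source, so it consumes
# extra storage only as the clone and the source diverge.
cur.execute("CREATE DATABASE PROD_DEV_CLONE CLONE PROD")
cur.close()
conn.close()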
7. Why is the Snowflake Data Cloud so different? (Part 2)
Frank’s Snowflake Differentiators (continued):
#5: Capability to process Structured, Semi-Structured, and fully Unstructured data all in one system
#6: End-to-end data processing and data science workloads in one fully secure and governed system
#7: in progress
#8: in progress
8. Snowflake Consumption-Based Pricing
• Overall, Snowflake is awesome, but it is not continuously optimized for cost, security, or performance
• Snowflake continues to add excellent features; many of them improve performance but also add cost, security, and performance complexity
9. Consumption-Based Pricing is Awesome!
Unless you have:
1. no optimizations
2. resource constraints!
Then it's not …
Some Snowflake customers with a budget of $50,000/year can blow through it in 3 days without optimization.
Medium to large customers who do not optimize extensively can miss out on saving $5,000+/month.
Automated Snowflake Data Cloud optimization is what keeps consumption pricing on Snowflake awesome!
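A rough worked example of how quickly consumption can outrun a budget. The per-size credit rates are Snowflake's published hourly rates; the $3-per-credit price is an assumption and varies by edition, cloud, and region.

# Back-of-the-envelope burn-rate check in Python.
CREDITS_PER_HOUR = {"XS": 1, "S": 2, "M": 4, "L": 8,
                    "XL": 16, "2XL": 32, "3XL": 64, "4XL": 128}
PRICE_PER_CREDIT = 3.00  # assumed; check your own contract

def daily_cost(size, hours_per_day=24):
    return CREDITS_PER_HOUR[size] * hours_per_day * PRICE_PER_CREDIT

print(2 * daily_cost("4XL"))              # two 4XLs left running 24x7: ~$18,432/day,
                                          # so a $50,000 budget is gone in under 3 days
print(daily_cost("M", hours_per_day=6))   # a Medium with auto-suspend, ~6 busy hours/day: ~$72/day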
10. Snowflake Optimization Areas
• Cost – Resource Monitors, warehouse optimization, etc.
• Security – Roles, Warehouses, Privileges, Grants, Stale Users, Network Policies, and 30 more tests
• Performance – Queuing, Spilling, etc.
11. Snowflake Cost Optimization Best Practices
• Cost – Resource Monitors, warehouse optimization, etc.
• Security – Roles, Warehouses, Privileges, Grants, Stale Users, Network Policies, and 30 more tests
• Performance – Queuing, Spilling, etc.
12. Cost Areas on Snowflake
• Query Consumption – most costs are found here
• Automatic Clustering – typically should be low
• Storage Consumption – watch out for stages & Time Travel
• Search Optimization – need to monitor
• Materialized Views – need to monitor
• Cloud Services – typically low, but you still need to monitor
• Pipes – typically low; very efficient
• Replication – need to monitor if you have replication set up
(An example query breaking out these cost areas follows this slide.)
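One way to see where these cost areas actually show up in your account is to break out the last 30 days of credits by service type from the SNOWFLAKE.ACCOUNT_USAGE.METERING_HISTORY view. The sketch below assumes a role with access to the shared SNOWFLAKE database; connection parameters are placeholders.

import snowflake.connector

conn = snowflake.connector.connect(account="my_account", user="my_user",
                                   password="...", role="ACCOUNTADMIN")
cur = conn.cursor()
cur.execute("""
    SELECT service_type, ROUND(SUM(credits_used), 1) AS credits
    FROM snowflake.account_usage.metering_history
    WHERE start_time >= DATEADD('day', -30, CURRENT_TIMESTAMP())
    GROUP BY service_type
    ORDER BY credits DESC
""")
for service_type, credits in cur.fetchall():
    print(service_type, credits)   # e.g. warehouse compute vs. pipes vs. auto-clustering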
13. Cost Optimization – Customer A (Before / After)
Before – Customer A Profile:
• Capacity Contract: $100,000/year
• Data Platform Size: 10+ TB
• Uses Snowpipe for IoT data
• $12,000+/month consumption costs
After – Optimization Results:
• Cost optimization around load warehouses with non-optimized suspend and parameter settings
• Exact code to fix the above issues (see the sketch after this slide)
• $10,200/month consumption costs
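The load-warehouse fix described for Customer A usually comes down to a few ALTER WAREHOUSE statements like the sketch below; LOAD_WH is a placeholder name and the exact settings depend on the load pattern.

import snowflake.connector

conn = snowflake.connector.connect(account="my_account", user="my_user",
                                   password="...", role="SYSADMIN")
cur = conn.cursor()
cur.execute("ALTER WAREHOUSE LOAD_WH SET AUTO_SUSPEND = 60")         # suspend after 60s idle instead of minutes
cur.execute("ALTER WAREHOUSE LOAD_WH SET AUTO_RESUME = TRUE")        # wake only when work arrives
cur.execute("ALTER WAREHOUSE LOAD_WH SET WAREHOUSE_SIZE = 'SMALL'")  # right-size for the actual load volume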
14. Cost Optimization Tip #1
Credit consumption optimization using Resource Monitors (and soon, Resource Groups)
*Put actual code here on Resource Monitor best practices (an example sketch follows below)
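A minimal sketch of the resource-monitor pattern this tip refers to, assuming a monthly credit quota of 100 and a warehouse named LOAD_WH (both placeholders):

import snowflake.connector

conn = snowflake.connector.connect(account="my_account", user="my_user",
                                   password="...", role="ACCOUNTADMIN")
cur = conn.cursor()
cur.execute("""
    CREATE OR REPLACE RESOURCE MONITOR monthly_monitor
      WITH CREDIT_QUOTA = 100
           FREQUENCY = MONTHLY
           START_TIMESTAMP = IMMEDIATELY
      TRIGGERS ON 80  PERCENT DO NOTIFY
               ON 100 PERCENT DO SUSPEND
               ON 110 PERCENT DO SUSPEND_IMMEDIATE
""")
# Attach the monitor so the warehouse suspends once the quota is exhausted.
cur.execute("ALTER WAREHOUSE LOAD_WH SET RESOURCE_MONITOR = monthly_monitor")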
18. Snowflake Optimization Techniques
Technique: Reporting on costs / manual optimizations
  Services available: *link to cost reporting
  Pros: Quick to implement.
  Cons: Not comprehensive at all; very reactive versus proactive. Major problems can occur on cost, performance, and security very quickly.
Technique: Custom-coded automated optimization system
  Services available: (build your own)
  Pros: More thorough and typically more proactive than a report-and-find-errors culture.
  Cons: Extremely expensive and time-consuming. Easily outdated unless a large staff of performance experts is continuously engaged.
Technique: Human-based consulting or Snowflake Professional Services health checks
  Services available: Snowflake
  Pros: Usually engages reasonably detailed Snowflake consulting experts.
  Cons: Expensive and not repeatable. Prone to human error. Often outdated within days of being finished.
Technique: Fully automated SaaS services that automate optimization
  Services available: Snoptimizer; Nadilytics SaaS; Security [. ]
  Pros: Automates the tremendous complexity of optimizing Snowflake for cost, performance, and security continuously.
  Cons: Medium cost (?)
19. Snowflake Performance Optimization Best Practices
• Cost – Resource Monitors, warehouse optimization, etc.
• Security – Roles, Warehouses, Privileges, Grants, Stale Users, Network Policies, and 30 more tests
• Performance – Queuing, Spilling, etc.
23. Performance Optimization – Customer C (Before / After)
Before – Customer C Profile:
• Capacity Contract: $200,000/year
• Data Platform Size: 50++ TB
• 3TB++ table with non-optimized cluster keys; many queries running over several minutes
• Slower external table queries
After – Optimization Results:
• Improved cluster keys; queries previously above 1 minute all fell under 1 minute
• Materialized Views for certain External Tables – massive query performance improvement
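A sketch of the two Customer C fixes described above. Table, column, and view names are placeholders, the clustering columns should match the predicates your slow queries actually filter on, and materialized views over external tables assume an edition that supports them.

import snowflake.connector

conn = snowflake.connector.connect(account="my_account", user="my_user",
                                   password="...", role="SYSADMIN")
cur = conn.cursor()
# 1. Re-cluster the large table on the columns most queries filter and join on.
cur.execute("ALTER TABLE big_events CLUSTER BY (event_date, customer_id)")
cur.execute("SELECT SYSTEM$CLUSTERING_INFORMATION('big_events', '(event_date, customer_id)')")
print(cur.fetchone()[0])   # review clustering depth/overlap once auto-clustering catches up
# 2. Materialize the hot columns of an external table so repeat queries avoid re-scanning files.
#    Assumes raw.events_ext exposes these columns.
cur.execute("""
    CREATE MATERIALIZED VIEW events_ext_mv AS
    SELECT event_id, event_ts, payload
    FROM raw.events_ext
""")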
24. Snowflake Security Optimization Best Practices
• Cost – Resource Monitors, warehouse optimization, etc.
• Security – Roles, Warehouses, Privileges, Grants, Stale Users, Network Policies, and 30 more tests
• Performance – Queuing, Spilling, etc.
29. Security Optimization – Customer B (Before / After)
Before – Customer B Profile:
• Capacity Contract: $50,000/year
• Data Platform Size: 1-2 TB
• Significant cost risk due to a large number of users having CREATE WAREHOUSE (at any size) granted
• Stale objects with too many users having access
• Stages not properly secured
After – Optimization Results:
• Improved RBAC; cost risk significantly reduced
• No stale objects
• Stages properly secured
• Continuous security analysis and protection
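A sketch of checks and fixes in the spirit of the Customer B findings. The role name and the 90-day staleness threshold are placeholders, and the ACCOUNT_USAGE views require a suitably privileged role.

import snowflake.connector

conn = snowflake.connector.connect(account="my_account", user="my_user",
                                   password="...", role="ACCOUNTADMIN")
cur = conn.cursor()
# 1. Review which roles hold account-level privileges, then pull back CREATE WAREHOUSE
#    from roles that should not be able to spin up compute of any size.
cur.execute("SHOW GRANTS ON ACCOUNT")
print(cur.fetchall())
cur.execute("REVOKE CREATE WAREHOUSE ON ACCOUNT FROM ROLE ANALYST_ROLE")  # placeholder role
# 2. Flag stale users who have never logged in or have not logged in for 90+ days.
cur.execute("""
    SELECT name, last_success_login
    FROM snowflake.account_usage.users
    WHERE deleted_on IS NULL
      AND (last_success_login IS NULL
           OR last_success_login < DATEADD('day', -90, CURRENT_TIMESTAMP()))
""")
for name, last_login in cur.fetchall():
    print(name, last_login)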
30. Optimization Results
• In our tests, on average we have seen 10-30% cost savings, 1000s of security issues fixed, and 100s of performance problems solved.
• In some implementations we have seen almost 50% cost savings.
31. Contact Us
Frank Bell, Snowflake Data Superhero [3rd year]
fbell@itstrategists.com
https://snowflakesolutions.net
https://snoptimizer.com
Snowflake Questions?