SlideShare a Scribd company logo
1© 2017 Snowflake Computing Inc. All Rights Reserved.
Y O U R D A T A , N O L I M I T S
A 30 Day Plan to End Your Struggle for Data
Common data struggles
© 2016 Snowflake Computing Inc. All Rights Reserved. 2
AnalyticsData IntegrationData Loading Collaboration
© 2016 Snowflake Computing Inc. All Rights Reserved. 3
Data Loading
© 2017 Snowflake Computing Inc. All Rights Reserved. 4
Struggle to Load Data
Resource Contention
Capacity Planning
Preparing disparate data to load
“Where can I connect to that
new JSON web log data?”
-BI Team
– Have to flatten to store semi-structured (or use noSQL)
– Storage and compute are limited
– Architecture forces linear compute capacity
© 2017 Snowflake Computing Inc. All Rights Reserved. 5
Tackle loading challenges with Snowflake
Contention
Capacity
Disparate data
– Variant column type supports semi-structured
– No more flattening (unless you want to)
– Built on the cloud (S3, EC2)
– Scale data and compute to load any data
– Unlimited virtual warehouses allow independent compute
– Isolate loading and other tasks
© 2016 Snowflake Computing Inc. All Rights Reserved. 6
Data Integration
© 2017 Snowflake Computing Inc. All Rights Reserved. 7
Struggle to Integrate Data
Making sense of data in silos
Editing and transforming data
Support evolving business logic and disparate use cases
– Hard to transform different datasets while in different
silos/formats
– noSQL tools complex, not all data stores ACID complaint
– Contention an issue while transforming
– No way to easily experiment with and add business logic
– Different people have different use cases
“Are the updated KPI’s in
the sensor data tables?”
- Data scientist
© 2017 Snowflake Computing Inc. All Rights Reserved. 8
Improve data integration with Snowflake
Silos
Editing and transforming
Business logic
– Native storage for semi-structured, ANSI standard SQL and dot notation to use it
– Combine all of your data fluidly
– ACID compliant with virtual data warehouses
– Edit, transform, insert, delete, however or whenever you want
– Zero-copy cloning
– Rapidly iterate, test and promote business logic for multiple
people
© 2016 Snowflake Computing Inc. All Rights Reserved. 9
Data analytics
© 2017 Snowflake Computing Inc. All Rights Reserved. 10
Struggle to Analyze Data
Queues
Delays
– Analysts are always the end of the resource priority
queue
– Even with unlimited access, database is non-performant
“How come the dashboard isn’t working?”
- Sales director
© 2017 Snowflake Computing Inc. All Rights Reserved. 11
Analyzing Efficiently with Snowflake
Queues
Delays
– Independent virtual warehouses
– Scale up, down or out to serve analytics use cases
– Autoscaling and multi-cluster warehouses
– Automatically match compute to even massive demand
© 2016 Snowflake Computing Inc. All Rights Reserved. 12
Collaboration
© 2017 Snowflake Computing Inc. All Rights Reserved. 13
Struggle to Collaborate
Incessant fixing
Siloed teams
– Fixing loading, integration and analytics struggles burns time
– Conflicts from those struggles reduce morale
– Technical and business teams often not working together (physically or
otherwise)
“I’m so buried under this queue I
can’t make the BI standup”
- IT team member
“I could ask IT for an updated table,
but I’m not sure who was working on it.”
- BI team member
© 2017 Snowflake Computing Inc. All Rights Reserved. 14
Start Collaborating with Snowflake
Fixing
Siloed teams
– Address the other struggles as referenced
– Free more time for collaboration and discussion
– With new time, start new discussions around data
– Build updates and additions into a scheduled meet-up
© 2016 Snowflake Computing Inc. All Rights Reserved. 15
A 30-day Plan to Start Ending
Your Struggle with Snowflake
© 2016 Snowflake Computing Inc. All Rights Reserved. 16
Start from the beginning – what’s the analytics goal?
1. Define the team
2. Discuss blocking issues and a place to start
3. Define the scope
4. Define success criteria
5. Try Snowflake On-Demand
6. Plan status updates going forward
Start Ending Your Data Struggle – Week 1
© 2017 Snowflake Computing Inc. All Rights Reserved. 17
Start Ending Your Data Struggle – Week 2
1. Find data to load
2. Create a Warehouse
3. Load data
– Work within defined scope, agree as a team
– Use data that’s new, challenging, or semi-structured
– Will need this to load data
– Create a database and a table
– Stage your data
– Load from stage to database
© 2017 Snowflake Computing Inc. All Rights Reserved. 18
Start Ending Your Data Struggle – Week 3
1. Test and deploy business logic
2. Optional: Create Integration WH
3. Optional: Plan ongoing loading and transform
– Discuss metrics, KPIs, transformations to add
– Use zero-copy cloning to test and then promote
– Isolate integration and transformation
– Use zero-copy cloning to test iterations safely and
promote
© 2017 Snowflake Computing Inc. All Rights Reserved. 19
Start Ending Your Data Struggle – Week 4
1. Create Warehouses for BI
2. Create analytics users
3. Connect your BI to Snowflake
– Avoid queues with isolated compute resources
– Optionally, set up auto-scaling
– Spread the value of the data
– Use this as an opportunity to share and discuss
– Use Tableau, Looker, etc. to query your data live
– Consider publishing dashboards with live connect
© 2017 Snowflake Computing Inc. All Rights Reserved. 20
After 30 days you should see improvements
1. Your team should be talking and collaborating more
2. You should be able to easily load and combine data
3. You should have accurate business logic in your data
4. You should be finding more insight
TRY SNOWFLAKE FOR FREE

More Related Content

What's hot

Snowflake for Data Engineering
Snowflake for Data EngineeringSnowflake for Data Engineering
Snowflake for Data Engineering
Harald Erb
 
Zero to Snowflake Presentation
Zero to Snowflake Presentation Zero to Snowflake Presentation
Zero to Snowflake Presentation
Brett VanderPlaats
 
Snowflake + Power BI: Cloud Analytics for Everyone
Snowflake + Power BI: Cloud Analytics for EveryoneSnowflake + Power BI: Cloud Analytics for Everyone
Snowflake + Power BI: Cloud Analytics for Everyone
Angel Abundez
 
Snowflake: The Good, the Bad, and the Ugly
Snowflake: The Good, the Bad, and the UglySnowflake: The Good, the Bad, and the Ugly
Snowflake: The Good, the Bad, and the Ugly
Tyler Wishnoff
 
Get Savvy with Snowflake
Get Savvy with SnowflakeGet Savvy with Snowflake
Get Savvy with Snowflake
Matillion
 
Changing the game with cloud dw
Changing the game with cloud dwChanging the game with cloud dw
Changing the game with cloud dw
elephantscale
 
Snowflake essentials
Snowflake essentialsSnowflake essentials
Snowflake essentials
qureshihamid
 
Snowflake Architecture.pptx
Snowflake Architecture.pptxSnowflake Architecture.pptx
Snowflake Architecture.pptx
chennakesava44
 
How to Take Advantage of an Enterprise Data Warehouse in the Cloud
How to Take Advantage of an Enterprise Data Warehouse in the CloudHow to Take Advantage of an Enterprise Data Warehouse in the Cloud
How to Take Advantage of an Enterprise Data Warehouse in the Cloud
Denodo
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data Architecture
Databricks
 
Data Catalog & ETL - Glue & Athena
Data Catalog & ETL - Glue & AthenaData Catalog & ETL - Glue & Athena
Data Catalog & ETL - Glue & Athena
Amazon Web Services
 
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
DataScienceConferenc1
 
Snowflake Data Science and AI/ML at Scale
Snowflake Data Science and AI/ML at ScaleSnowflake Data Science and AI/ML at Scale
Snowflake Data Science and AI/ML at Scale
Adam Doyle
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
Jeffrey T. Pollock
 
Free Training: How to Build a Lakehouse
Free Training: How to Build a LakehouseFree Training: How to Build a Lakehouse
Free Training: How to Build a Lakehouse
Databricks
 
Snowflake Data Loading.pptx
Snowflake Data Loading.pptxSnowflake Data Loading.pptx
Snowflake Data Loading.pptx
Parag860410
 
Snowflake Company Presentation
Snowflake Company PresentationSnowflake Company Presentation
Snowflake Company Presentation
AndrewJiang18
 
Snowflake Overview
Snowflake OverviewSnowflake Overview
Snowflake Overview
Snowflake Computing
 
Intro to Delta Lake
Intro to Delta LakeIntro to Delta Lake
Intro to Delta Lake
Databricks
 
Let’s get to know Snowflake
Let’s get to know SnowflakeLet’s get to know Snowflake
Let’s get to know Snowflake
Knoldus Inc.
 

What's hot (20)

Snowflake for Data Engineering
Snowflake for Data EngineeringSnowflake for Data Engineering
Snowflake for Data Engineering
 
Zero to Snowflake Presentation
Zero to Snowflake Presentation Zero to Snowflake Presentation
Zero to Snowflake Presentation
 
Snowflake + Power BI: Cloud Analytics for Everyone
Snowflake + Power BI: Cloud Analytics for EveryoneSnowflake + Power BI: Cloud Analytics for Everyone
Snowflake + Power BI: Cloud Analytics for Everyone
 
Snowflake: The Good, the Bad, and the Ugly
Snowflake: The Good, the Bad, and the UglySnowflake: The Good, the Bad, and the Ugly
Snowflake: The Good, the Bad, and the Ugly
 
Get Savvy with Snowflake
Get Savvy with SnowflakeGet Savvy with Snowflake
Get Savvy with Snowflake
 
Changing the game with cloud dw
Changing the game with cloud dwChanging the game with cloud dw
Changing the game with cloud dw
 
Snowflake essentials
Snowflake essentialsSnowflake essentials
Snowflake essentials
 
Snowflake Architecture.pptx
Snowflake Architecture.pptxSnowflake Architecture.pptx
Snowflake Architecture.pptx
 
How to Take Advantage of an Enterprise Data Warehouse in the Cloud
How to Take Advantage of an Enterprise Data Warehouse in the CloudHow to Take Advantage of an Enterprise Data Warehouse in the Cloud
How to Take Advantage of an Enterprise Data Warehouse in the Cloud
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data Architecture
 
Data Catalog & ETL - Glue & Athena
Data Catalog & ETL - Glue & AthenaData Catalog & ETL - Glue & Athena
Data Catalog & ETL - Glue & Athena
 
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
 
Snowflake Data Science and AI/ML at Scale
Snowflake Data Science and AI/ML at ScaleSnowflake Data Science and AI/ML at Scale
Snowflake Data Science and AI/ML at Scale
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
 
Free Training: How to Build a Lakehouse
Free Training: How to Build a LakehouseFree Training: How to Build a Lakehouse
Free Training: How to Build a Lakehouse
 
Snowflake Data Loading.pptx
Snowflake Data Loading.pptxSnowflake Data Loading.pptx
Snowflake Data Loading.pptx
 
Snowflake Company Presentation
Snowflake Company PresentationSnowflake Company Presentation
Snowflake Company Presentation
 
Snowflake Overview
Snowflake OverviewSnowflake Overview
Snowflake Overview
 
Intro to Delta Lake
Intro to Delta LakeIntro to Delta Lake
Intro to Delta Lake
 
Let’s get to know Snowflake
Let’s get to know SnowflakeLet’s get to know Snowflake
Let’s get to know Snowflake
 

Similar to A 30 day plan to start ending your data struggle with Snowflake

Business Intelligence is more than just pretty visuals
Business Intelligence is more than just pretty visualsBusiness Intelligence is more than just pretty visuals
Business Intelligence is more than just pretty visuals
Vincent Woon
 
Nw2008 tips tricks_edw_v10
Nw2008 tips tricks_edw_v10Nw2008 tips tricks_edw_v10
Nw2008 tips tricks_edw_v10
Harsha Gowda B R
 
The Key to Big Data Modeling: Collaboration
The Key to Big Data Modeling: CollaborationThe Key to Big Data Modeling: Collaboration
The Key to Big Data Modeling: Collaboration
Embarcadero Technologies
 
10 Reasons Snowflake Is Great for Analytics
10 Reasons Snowflake Is Great for Analytics10 Reasons Snowflake Is Great for Analytics
10 Reasons Snowflake Is Great for Analytics
Senturus
 
IT + Line of Business - Driving Faster, Deeper Insights Together
IT + Line of Business - Driving Faster, Deeper Insights TogetherIT + Line of Business - Driving Faster, Deeper Insights Together
IT + Line of Business - Driving Faster, Deeper Insights Together
DATAVERSITY
 
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
Pentaho
 
18. Madhur Hemnani - Result Orientated Innovation with Oracle HR Analytics
18. Madhur Hemnani - Result Orientated Innovation with Oracle HR Analytics18. Madhur Hemnani - Result Orientated Innovation with Oracle HR Analytics
18. Madhur Hemnani - Result Orientated Innovation with Oracle HR Analytics
Cedar Consulting
 
Q1 and q2 2020 role overview
Q1 and q2 2020 role overviewQ1 and q2 2020 role overview
Q1 and q2 2020 role overview
DaveMeckler
 
Azure + DataStax Enterprise Powers Office 365 Per User Store
Azure + DataStax Enterprise Powers Office 365 Per User StoreAzure + DataStax Enterprise Powers Office 365 Per User Store
Azure + DataStax Enterprise Powers Office 365 Per User Store
DataStax Academy
 
Managing Large Amounts of Data with Salesforce
Managing Large Amounts of Data with SalesforceManaging Large Amounts of Data with Salesforce
Managing Large Amounts of Data with Salesforce
Sense Corp
 
Transforming Data Management and Time to Insight with Anzo Smart Data Lake®
Transforming Data Management and Time to Insight with Anzo Smart Data Lake®Transforming Data Management and Time to Insight with Anzo Smart Data Lake®
Transforming Data Management and Time to Insight with Anzo Smart Data Lake®
Cambridge Semantics
 
Top 10 Tips for an Effective Postgres Deployment
Top 10 Tips for an Effective Postgres DeploymentTop 10 Tips for an Effective Postgres Deployment
Top 10 Tips for an Effective Postgres Deployment
EDB
 
IBM Cognos Analytics Reporting vs. Dashboarding: Matching Tools to Business R...
IBM Cognos Analytics Reporting vs. Dashboarding: Matching Tools to Business R...IBM Cognos Analytics Reporting vs. Dashboarding: Matching Tools to Business R...
IBM Cognos Analytics Reporting vs. Dashboarding: Matching Tools to Business R...
Senturus
 
Does it only have to be ML + AI?
Does it only have to be ML + AI?Does it only have to be ML + AI?
Does it only have to be ML + AI?
Harald Erb
 
Unify Data at Memory Speed
Unify Data at Memory SpeedUnify Data at Memory Speed
Unify Data at Memory Speed
Alluxio, Inc.
 
Big Data & Information Management Channel Manager
Big Data & Information Management Channel ManagerBig Data & Information Management Channel Manager
Big Data & Information Management Channel Manager
Arrow ECS UK
 
Building MuleSoft Applications with Google BigQuery Meetup 4
Building MuleSoft Applications with Google BigQuery Meetup 4Building MuleSoft Applications with Google BigQuery Meetup 4
Building MuleSoft Applications with Google BigQuery Meetup 4
MannaAkpan
 
IBM Cognos Analytics Release 7+ Authoring Improvements: Demos of New and Rein...
IBM Cognos Analytics Release 7+ Authoring Improvements: Demos of New and Rein...IBM Cognos Analytics Release 7+ Authoring Improvements: Demos of New and Rein...
IBM Cognos Analytics Release 7+ Authoring Improvements: Demos of New and Rein...
Senturus
 
How to grow to a modern workplace in 16 steps with microsoft 365
How to grow to a modern workplace in 16 steps with microsoft 365How to grow to a modern workplace in 16 steps with microsoft 365
How to grow to a modern workplace in 16 steps with microsoft 365
Tim Hermie ☁️
 
Postgres Integrates Effectively in the "Enterprise Sandbox"
Postgres Integrates Effectively in the "Enterprise Sandbox"Postgres Integrates Effectively in the "Enterprise Sandbox"
Postgres Integrates Effectively in the "Enterprise Sandbox"
EDB
 

Similar to A 30 day plan to start ending your data struggle with Snowflake (20)

Business Intelligence is more than just pretty visuals
Business Intelligence is more than just pretty visualsBusiness Intelligence is more than just pretty visuals
Business Intelligence is more than just pretty visuals
 
Nw2008 tips tricks_edw_v10
Nw2008 tips tricks_edw_v10Nw2008 tips tricks_edw_v10
Nw2008 tips tricks_edw_v10
 
The Key to Big Data Modeling: Collaboration
The Key to Big Data Modeling: CollaborationThe Key to Big Data Modeling: Collaboration
The Key to Big Data Modeling: Collaboration
 
10 Reasons Snowflake Is Great for Analytics
10 Reasons Snowflake Is Great for Analytics10 Reasons Snowflake Is Great for Analytics
10 Reasons Snowflake Is Great for Analytics
 
IT + Line of Business - Driving Faster, Deeper Insights Together
IT + Line of Business - Driving Faster, Deeper Insights TogetherIT + Line of Business - Driving Faster, Deeper Insights Together
IT + Line of Business - Driving Faster, Deeper Insights Together
 
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
 
18. Madhur Hemnani - Result Orientated Innovation with Oracle HR Analytics
18. Madhur Hemnani - Result Orientated Innovation with Oracle HR Analytics18. Madhur Hemnani - Result Orientated Innovation with Oracle HR Analytics
18. Madhur Hemnani - Result Orientated Innovation with Oracle HR Analytics
 
Q1 and q2 2020 role overview
Q1 and q2 2020 role overviewQ1 and q2 2020 role overview
Q1 and q2 2020 role overview
 
Azure + DataStax Enterprise Powers Office 365 Per User Store
Azure + DataStax Enterprise Powers Office 365 Per User StoreAzure + DataStax Enterprise Powers Office 365 Per User Store
Azure + DataStax Enterprise Powers Office 365 Per User Store
 
Managing Large Amounts of Data with Salesforce
Managing Large Amounts of Data with SalesforceManaging Large Amounts of Data with Salesforce
Managing Large Amounts of Data with Salesforce
 
Transforming Data Management and Time to Insight with Anzo Smart Data Lake®
Transforming Data Management and Time to Insight with Anzo Smart Data Lake®Transforming Data Management and Time to Insight with Anzo Smart Data Lake®
Transforming Data Management and Time to Insight with Anzo Smart Data Lake®
 
Top 10 Tips for an Effective Postgres Deployment
Top 10 Tips for an Effective Postgres DeploymentTop 10 Tips for an Effective Postgres Deployment
Top 10 Tips for an Effective Postgres Deployment
 
IBM Cognos Analytics Reporting vs. Dashboarding: Matching Tools to Business R...
IBM Cognos Analytics Reporting vs. Dashboarding: Matching Tools to Business R...IBM Cognos Analytics Reporting vs. Dashboarding: Matching Tools to Business R...
IBM Cognos Analytics Reporting vs. Dashboarding: Matching Tools to Business R...
 
Does it only have to be ML + AI?
Does it only have to be ML + AI?Does it only have to be ML + AI?
Does it only have to be ML + AI?
 
Unify Data at Memory Speed
Unify Data at Memory SpeedUnify Data at Memory Speed
Unify Data at Memory Speed
 
Big Data & Information Management Channel Manager
Big Data & Information Management Channel ManagerBig Data & Information Management Channel Manager
Big Data & Information Management Channel Manager
 
Building MuleSoft Applications with Google BigQuery Meetup 4
Building MuleSoft Applications with Google BigQuery Meetup 4Building MuleSoft Applications with Google BigQuery Meetup 4
Building MuleSoft Applications with Google BigQuery Meetup 4
 
IBM Cognos Analytics Release 7+ Authoring Improvements: Demos of New and Rein...
IBM Cognos Analytics Release 7+ Authoring Improvements: Demos of New and Rein...IBM Cognos Analytics Release 7+ Authoring Improvements: Demos of New and Rein...
IBM Cognos Analytics Release 7+ Authoring Improvements: Demos of New and Rein...
 
How to grow to a modern workplace in 16 steps with microsoft 365
How to grow to a modern workplace in 16 steps with microsoft 365How to grow to a modern workplace in 16 steps with microsoft 365
How to grow to a modern workplace in 16 steps with microsoft 365
 
Postgres Integrates Effectively in the "Enterprise Sandbox"
Postgres Integrates Effectively in the "Enterprise Sandbox"Postgres Integrates Effectively in the "Enterprise Sandbox"
Postgres Integrates Effectively in the "Enterprise Sandbox"
 

Recently uploaded

What’s New in VictoriaLogs - Q2 2024 Update
What’s New in VictoriaLogs - Q2 2024 UpdateWhat’s New in VictoriaLogs - Q2 2024 Update
What’s New in VictoriaLogs - Q2 2024 Update
VictoriaMetrics
 
Hyperledger Besu 빨리 따라하기 (Private Networks)
Hyperledger Besu 빨리 따라하기 (Private Networks)Hyperledger Besu 빨리 따라하기 (Private Networks)
Hyperledger Besu 빨리 따라하기 (Private Networks)
wonyong hwang
 
Baha Majid WCA4Z IBM Z Customer Council Boston June 2024.pdf
Baha Majid WCA4Z IBM Z Customer Council Boston June 2024.pdfBaha Majid WCA4Z IBM Z Customer Council Boston June 2024.pdf
Baha Majid WCA4Z IBM Z Customer Council Boston June 2024.pdf
Baha Majid
 
Software Test Automation - A Comprehensive Guide on Automated Testing.pdf
Software Test Automation - A Comprehensive Guide on Automated Testing.pdfSoftware Test Automation - A Comprehensive Guide on Automated Testing.pdf
Software Test Automation - A Comprehensive Guide on Automated Testing.pdf
kalichargn70th171
 
Beginner's Guide to Observability@Devoxx PL 2024
Beginner's  Guide to Observability@Devoxx PL 2024Beginner's  Guide to Observability@Devoxx PL 2024
Beginner's Guide to Observability@Devoxx PL 2024
michniczscribd
 
Unlock the Secrets to Effortless Video Creation with Invideo: Your Ultimate G...
Unlock the Secrets to Effortless Video Creation with Invideo: Your Ultimate G...Unlock the Secrets to Effortless Video Creation with Invideo: Your Ultimate G...
Unlock the Secrets to Effortless Video Creation with Invideo: Your Ultimate G...
The Third Creative Media
 
14 th Edition of International conference on computer vision
14 th Edition of International conference on computer vision14 th Edition of International conference on computer vision
14 th Edition of International conference on computer vision
ShulagnaSarkar2
 
Orca: Nocode Graphical Editor for Container Orchestration
Orca: Nocode Graphical Editor for Container OrchestrationOrca: Nocode Graphical Editor for Container Orchestration
Orca: Nocode Graphical Editor for Container Orchestration
Pedro J. Molina
 
Boost Your Savings with These Money Management Apps
Boost Your Savings with These Money Management AppsBoost Your Savings with These Money Management Apps
Boost Your Savings with These Money Management Apps
Jhone kinadey
 
Safelyio Toolbox Talk Softwate & App (How To Digitize Safety Meetings)
Safelyio Toolbox Talk Softwate & App (How To Digitize Safety Meetings)Safelyio Toolbox Talk Softwate & App (How To Digitize Safety Meetings)
Safelyio Toolbox Talk Softwate & App (How To Digitize Safety Meetings)
safelyiotech
 
Hands-on with Apache Druid: Installation & Data Ingestion Steps
Hands-on with Apache Druid: Installation & Data Ingestion StepsHands-on with Apache Druid: Installation & Data Ingestion Steps
Hands-on with Apache Druid: Installation & Data Ingestion Steps
servicesNitor
 
Streamlining End-to-End Testing Automation
Streamlining End-to-End Testing AutomationStreamlining End-to-End Testing Automation
Streamlining End-to-End Testing Automation
Anand Bagmar
 
Refactoring legacy systems using events commands and bubble contexts
Refactoring legacy systems using events commands and bubble contextsRefactoring legacy systems using events commands and bubble contexts
Refactoring legacy systems using events commands and bubble contexts
Michał Kurzeja
 
Why Apache Kafka Clusters Are Like Galaxies (And Other Cosmic Kafka Quandarie...
Why Apache Kafka Clusters Are Like Galaxies (And Other Cosmic Kafka Quandarie...Why Apache Kafka Clusters Are Like Galaxies (And Other Cosmic Kafka Quandarie...
Why Apache Kafka Clusters Are Like Galaxies (And Other Cosmic Kafka Quandarie...
Paul Brebner
 
A Comprehensive Guide on Implementing Real-World Mobile Testing Strategies fo...
A Comprehensive Guide on Implementing Real-World Mobile Testing Strategies fo...A Comprehensive Guide on Implementing Real-World Mobile Testing Strategies fo...
A Comprehensive Guide on Implementing Real-World Mobile Testing Strategies fo...
kalichargn70th171
 
Going AOT: Everything you need to know about GraalVM for Java applications
Going AOT: Everything you need to know about GraalVM for Java applicationsGoing AOT: Everything you need to know about GraalVM for Java applications
Going AOT: Everything you need to know about GraalVM for Java applications
Alina Yurenko
 
一比一原版(sdsu毕业证书)圣地亚哥州立大学毕业证如何办理
一比一原版(sdsu毕业证书)圣地亚哥州立大学毕业证如何办理一比一原版(sdsu毕业证书)圣地亚哥州立大学毕业证如何办理
一比一原版(sdsu毕业证书)圣地亚哥州立大学毕业证如何办理
kgyxske
 
The Comprehensive Guide to Validating Audio-Visual Performances.pdf
The Comprehensive Guide to Validating Audio-Visual Performances.pdfThe Comprehensive Guide to Validating Audio-Visual Performances.pdf
The Comprehensive Guide to Validating Audio-Visual Performances.pdf
kalichargn70th171
 
如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样
如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样
如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样
gapen1
 
The Role of DevOps in Digital Transformation.pdf
The Role of DevOps in Digital Transformation.pdfThe Role of DevOps in Digital Transformation.pdf
The Role of DevOps in Digital Transformation.pdf
mohitd6
 

Recently uploaded (20)

What’s New in VictoriaLogs - Q2 2024 Update
What’s New in VictoriaLogs - Q2 2024 UpdateWhat’s New in VictoriaLogs - Q2 2024 Update
What’s New in VictoriaLogs - Q2 2024 Update
 
Hyperledger Besu 빨리 따라하기 (Private Networks)
Hyperledger Besu 빨리 따라하기 (Private Networks)Hyperledger Besu 빨리 따라하기 (Private Networks)
Hyperledger Besu 빨리 따라하기 (Private Networks)
 
Baha Majid WCA4Z IBM Z Customer Council Boston June 2024.pdf
Baha Majid WCA4Z IBM Z Customer Council Boston June 2024.pdfBaha Majid WCA4Z IBM Z Customer Council Boston June 2024.pdf
Baha Majid WCA4Z IBM Z Customer Council Boston June 2024.pdf
 
Software Test Automation - A Comprehensive Guide on Automated Testing.pdf
Software Test Automation - A Comprehensive Guide on Automated Testing.pdfSoftware Test Automation - A Comprehensive Guide on Automated Testing.pdf
Software Test Automation - A Comprehensive Guide on Automated Testing.pdf
 
Beginner's Guide to Observability@Devoxx PL 2024
Beginner's  Guide to Observability@Devoxx PL 2024Beginner's  Guide to Observability@Devoxx PL 2024
Beginner's Guide to Observability@Devoxx PL 2024
 
Unlock the Secrets to Effortless Video Creation with Invideo: Your Ultimate G...
Unlock the Secrets to Effortless Video Creation with Invideo: Your Ultimate G...Unlock the Secrets to Effortless Video Creation with Invideo: Your Ultimate G...
Unlock the Secrets to Effortless Video Creation with Invideo: Your Ultimate G...
 
14 th Edition of International conference on computer vision
14 th Edition of International conference on computer vision14 th Edition of International conference on computer vision
14 th Edition of International conference on computer vision
 
Orca: Nocode Graphical Editor for Container Orchestration
Orca: Nocode Graphical Editor for Container OrchestrationOrca: Nocode Graphical Editor for Container Orchestration
Orca: Nocode Graphical Editor for Container Orchestration
 
Boost Your Savings with These Money Management Apps
Boost Your Savings with These Money Management AppsBoost Your Savings with These Money Management Apps
Boost Your Savings with These Money Management Apps
 
Safelyio Toolbox Talk Softwate & App (How To Digitize Safety Meetings)
Safelyio Toolbox Talk Softwate & App (How To Digitize Safety Meetings)Safelyio Toolbox Talk Softwate & App (How To Digitize Safety Meetings)
Safelyio Toolbox Talk Softwate & App (How To Digitize Safety Meetings)
 
Hands-on with Apache Druid: Installation & Data Ingestion Steps
Hands-on with Apache Druid: Installation & Data Ingestion StepsHands-on with Apache Druid: Installation & Data Ingestion Steps
Hands-on with Apache Druid: Installation & Data Ingestion Steps
 
Streamlining End-to-End Testing Automation
Streamlining End-to-End Testing AutomationStreamlining End-to-End Testing Automation
Streamlining End-to-End Testing Automation
 
Refactoring legacy systems using events commands and bubble contexts
Refactoring legacy systems using events commands and bubble contextsRefactoring legacy systems using events commands and bubble contexts
Refactoring legacy systems using events commands and bubble contexts
 
Why Apache Kafka Clusters Are Like Galaxies (And Other Cosmic Kafka Quandarie...
Why Apache Kafka Clusters Are Like Galaxies (And Other Cosmic Kafka Quandarie...Why Apache Kafka Clusters Are Like Galaxies (And Other Cosmic Kafka Quandarie...
Why Apache Kafka Clusters Are Like Galaxies (And Other Cosmic Kafka Quandarie...
 
A Comprehensive Guide on Implementing Real-World Mobile Testing Strategies fo...
A Comprehensive Guide on Implementing Real-World Mobile Testing Strategies fo...A Comprehensive Guide on Implementing Real-World Mobile Testing Strategies fo...
A Comprehensive Guide on Implementing Real-World Mobile Testing Strategies fo...
 
Going AOT: Everything you need to know about GraalVM for Java applications
Going AOT: Everything you need to know about GraalVM for Java applicationsGoing AOT: Everything you need to know about GraalVM for Java applications
Going AOT: Everything you need to know about GraalVM for Java applications
 
一比一原版(sdsu毕业证书)圣地亚哥州立大学毕业证如何办理
一比一原版(sdsu毕业证书)圣地亚哥州立大学毕业证如何办理一比一原版(sdsu毕业证书)圣地亚哥州立大学毕业证如何办理
一比一原版(sdsu毕业证书)圣地亚哥州立大学毕业证如何办理
 
The Comprehensive Guide to Validating Audio-Visual Performances.pdf
The Comprehensive Guide to Validating Audio-Visual Performances.pdfThe Comprehensive Guide to Validating Audio-Visual Performances.pdf
The Comprehensive Guide to Validating Audio-Visual Performances.pdf
 
如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样
如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样
如何办理(hull学位证书)英国赫尔大学毕业证硕士文凭原版一模一样
 
The Role of DevOps in Digital Transformation.pdf
The Role of DevOps in Digital Transformation.pdfThe Role of DevOps in Digital Transformation.pdf
The Role of DevOps in Digital Transformation.pdf
 

A 30 day plan to start ending your data struggle with Snowflake

  • 1. 1© 2017 Snowflake Computing Inc. All Rights Reserved. Y O U R D A T A , N O L I M I T S A 30 Day Plan to End Your Struggle for Data
  • 2. Common data struggles © 2016 Snowflake Computing Inc. All Rights Reserved. 2 AnalyticsData IntegrationData Loading Collaboration
  • 3. © 2016 Snowflake Computing Inc. All Rights Reserved. 3 Data Loading
  • 4. © 2017 Snowflake Computing Inc. All Rights Reserved. 4 Struggle to Load Data Resource Contention Capacity Planning Preparing disparate data to load “Where can I connect to that new JSON web log data?” -BI Team – Have to flatten to store semi-structured (or use noSQL) – Storage and compute are limited – Architecture forces linear compute capacity
  • 5. © 2017 Snowflake Computing Inc. All Rights Reserved. 5 Tackle loading challenges with Snowflake Contention Capacity Disparate data – Variant column type supports semi-structured – No more flattening (unless you want to) – Built on the cloud (S3, EC2) – Scale data and compute to load any data – Unlimited virtual warehouses allow independent compute – Isolate loading and other tasks
  • 6. © 2016 Snowflake Computing Inc. All Rights Reserved. 6 Data Integration
  • 7. © 2017 Snowflake Computing Inc. All Rights Reserved. 7 Struggle to Integrate Data Making sense of data in silos Editing and transforming data Support evolving business logic and disparate use cases – Hard to transform different datasets while in different silos/formats – noSQL tools complex, not all data stores ACID complaint – Contention an issue while transforming – No way to easily experiment with and add business logic – Different people have different use cases “Are the updated KPI’s in the sensor data tables?” - Data scientist
  • 8. © 2017 Snowflake Computing Inc. All Rights Reserved. 8 Improve data integration with Snowflake Silos Editing and transforming Business logic – Native storage for semi-structured, ANSI standard SQL and dot notation to use it – Combine all of your data fluidly – ACID compliant with virtual data warehouses – Edit, transform, insert, delete, however or whenever you want – Zero-copy cloning – Rapidly iterate, test and promote business logic for multiple people
  • 9. © 2016 Snowflake Computing Inc. All Rights Reserved. 9 Data analytics
  • 10. © 2017 Snowflake Computing Inc. All Rights Reserved. 10 Struggle to Analyze Data Queues Delays – Analysts are always the end of the resource priority queue – Even with unlimited access, database is non-performant “How come the dashboard isn’t working?” - Sales director
  • 11. © 2017 Snowflake Computing Inc. All Rights Reserved. 11 Analyzing Efficiently with Snowflake Queues Delays – Independent virtual warehouses – Scale up, down or out to serve analytics use cases – Autoscaling and multi-cluster warehouses – Automatically match compute to even massive demand
  • 12. © 2016 Snowflake Computing Inc. All Rights Reserved. 12 Collaboration
  • 13. © 2017 Snowflake Computing Inc. All Rights Reserved. 13 Struggle to Collaborate Incessant fixing Siloed teams – Fixing loading, integration and analytics struggles burns time – Conflicts from those struggles reduce morale – Technical and business teams often not working together (physically or otherwise) “I’m so buried under this queue I can’t make the BI standup” - IT team member “I could ask IT for an updated table, but I’m not sure who was working on it.” - BI team member
  • 14. © 2017 Snowflake Computing Inc. All Rights Reserved. 14 Start Collaborating with Snowflake Fixing Siloed teams – Address the other struggles as referenced – Free more time for collaboration and discussion – With new time, start new discussions around data – Build updates and additions into a scheduled meet-up
  • 15. © 2016 Snowflake Computing Inc. All Rights Reserved. 15 A 30-day Plan to Start Ending Your Struggle with Snowflake
  • 16. © 2016 Snowflake Computing Inc. All Rights Reserved. 16 Start from the beginning – what’s the analytics goal? 1. Define the team 2. Discuss blocking issues and a place to start 3. Define the scope 4. Define success criteria 5. Try Snowflake On-Demand 6. Plan status updates going forward Start Ending Your Data Struggle – Week 1
  • 17. © 2017 Snowflake Computing Inc. All Rights Reserved. 17 Start Ending Your Data Struggle – Week 2 1. Find data to load 2. Create a Warehouse 3. Load data – Work within defined scope, agree as a team – Use data that’s new, challenging, or semi-structured – Will need this to load data – Create a database and a table – Stage your data – Load from stage to database
  • 18. © 2017 Snowflake Computing Inc. All Rights Reserved. 18 Start Ending Your Data Struggle – Week 3 1. Test and deploy business logic 2. Optional: Create Integration WH 3. Optional: Plan ongoing loading and transform – Discuss metrics, KPIs, transformations to add – Use zero-copy cloning to test and then promote – Isolate integration and transformation – Use zero-copy cloning to test iterations safely and promote
  • 19. © 2017 Snowflake Computing Inc. All Rights Reserved. 19 Start Ending Your Data Struggle – Week 4 1. Create Warehouses for BI 2. Create analytics users 3. Connect your BI to Snowflake – Avoid queues with isolated compute resources – Optionally, set up auto-scaling – Spread the value of the data – Use this as an opportunity to share and discuss – Use Tableau, Looker, etc. to query your data live – Consider publishing dashboards with live connect
  • 20. © 2017 Snowflake Computing Inc. All Rights Reserved. 20 After 30 days you should see improvements 1. Your team should be talking and collaborating more 2. You should be able to easily load and combine data 3. You should have accurate business logic in your data 4. You should be finding more insight TRY SNOWFLAKE FOR FREE

Editor's Notes

  1. Data loading – struggle to load, store and manage data Data integration – struggle to unify and integrate disparate data sources Analytics – Struggle to analyze data quickly and effectively Collaboration – Because your spending so much time on the other three problems, its difficult to get everyone on the same page, to work together to find insight in your data
  2. Preparing disparate data to load The struggle to load data begins with the need to prepare disparate datasets to load. Many organizations are dealing with a host of new semi-structured data in formats like JSON and Avro that require flattening to load into a relational database. Or, they choose to store semi-structured data separate from relational data in a NoSQL store, creating silos. Capacity planning Finding space for data can be another enormous challenge. Large numbers of complex datasets can quickly snowball into a storage capacity issue on fixed size on-premises or cloud data platforms. Resource contention Loading large datasets also requires significant compute capacity. Many data warehouses are already strained under normal business workloads, and the compute needed for loading forces those other processes to be pushed back in the priority queue. All of these problems lead to difficult conversations about whose data or use case is most important. One project might need funding for an open source, semi-structured data store. Another wants to expand the on-premises data warehouse. One team wants to load clickstream data, and another needs finance data. Prioritizing completely different needs can be a minefield that leads to a host of struggles within and between teams.
  3. Tackle loading challenges with Snowflake Snowflake addresses each loading challenge with simplicity. Semi-structured data can be loaded natively alongside structured data, and queried together in one location. Because Snowflake’s built on the cloud, you can store as much data as you want with no need to prioritize different datasets. Best of all, you can create independent compute resources, called virtual warehouses, for each of your use cases, negating the need for queues.
  4. Making sense of data in silos With data scattered across NoSQL data lakes, cloud applications, and data warehouses (not to mention flat files and CSVs), organizations are struggling to combine and analyze their data in one cohesive picture. Editing and transforming data Every system that stores data has it’s challenges, but many organizations are finding it particularly hard to analyze and understand data in NoSQL systems like Hadoop. Semi-structured open source data stores require a large amount of custom configuration, uncommon skillets, and transformation to successfully combine with other business data. They also rarely support edit, update and insert commands that are essential to data modeling and transformation. Supporting evolving business logic and disparate use cases It’s hard for the business to drive evolutions in business logic within the database when it takes arduous manual process to test and update. Often, entire databases need to be physically copied in order to test a simple change to a table or derived field, which can be extremely expensive and time consuming. Because different people within the organization have different data needs, a “single source of the truth” is often too ungainly and impractical for most organizations to maintain and use. All of these problems make it difficult to generate a refined view of what the data actually says. Differing methods of transforming data arise, with competing factions struggling to promote their own methods of working with, storing and querying data. People from throughout the business wonder where they can find the “right” version of their metrics and KPIs.
  5. Improve data integration with Snowflake Snowflake makes data integration straightforward. You can load all of your data, in almost any structured or semi-structured format, so you can avoid data silos. Transforming is made easier with ANSI standard SQL and dot notation for semi-structured data. Inserts, deletes and other common operations are fully supported. You can even rapidly test and update with zero-copy cloning, driving faster iteration in business logic.
  6. Queues Analytics users are always at the bottom of the resource priority queue. It’s not always designed to be that way, but if ETL, as a simplified example, needs to run for 45 minutes every hour, then there’s little time left over for the analytics team to access and iterate on the database. Delays Through the eyes of an analyst, nothing ever works fast enough. But, often disappointing performance isn’t for lack of trying. Many data warehouses require hours and hours of painstaking optimization, tuning, indexing, sorting, and vacuuming from a dedicated data engineer.  To add to the pain, often one optimization will lead to deoptimization in another area The struggle to analyze data is one of the most visible. Report consumers complain that the BI tool isn’t working fast enough. The BI team points their finger to the data engineers. But, at the end of the day, antiquated database technology is the real culprit.
  7. Analyzing efficiently with Snowflake Snowflake addresses efficient analytics in two ways. As we saw before, independent virtual warehouses can help with concurrent queries, allowing ETL and BI to run side by side at the same time. Large or variable analytics workloads within a single warehouse can be dealt with using mutli-cluster warehouses, and even autoscaling to automatically match your compute resources to need.
  8. The struggle to load, integrate and analyze data leads to a fourth struggle that’s often the worst. Collaboration.
  9. Incessant fixing If the organization spends all its time endlessly solving loading, integration, and analytics struggles, it’s impossible to break away and think at a higher level about what needs to be accomplished. Data is a constant flash point of disagreement, rather than a rallying point for collaboration. Siloed teams Historically, there’s been a dividing line between technical, IT implementers, and less-technical business side consumers. This was partly driven by technology, but reinforced by organizational structures that don’t favor cross team collaboration. The lack of collaboration is the end result of the struggles, and the most frustrating of them all. How can two disparate types of people, on two different teams (or multiple different teams) effectively work together when they are completely buried under the weight of their antiquated data platform.
  10. Analyzing efficiently with Snowflake As we noted previously, Snowflake can help to solve loading, integration and analytics struggles, freeing time for collaboration and higher level planning. Working together with Snowflake, the dividing line between IT and BI becomes less important. IT can lead the business with technology and empower the BI team to analyze data. On the same token, with more accessible technology in the form of Snowflake, BI teams can take an active role in the curation and modeling of data that has historically rested solely on IT’s shoulders.
  11. Week one is all about the team. It’s time to bring everyone around the same table to figure out the best way to move forward with your data. Keep your conversation focused on an achievable goal: trying to get an important dataset into Snowflake for analysis. Discuss blocking issues, but be sure to define them in terms of technology, rather than people. Once you’ve got a plan to get around any blocking issues, set up Snowflake On-Demand for free and make a plan to bring the team together for status updates in the weeks to follow. Pro tip: Think big. Every new Snowflake On-Demand customer gets $400 in free credits to play around with, more than enough to load and store a massive dataset. One Snowflake customer performance tested Snowflake against a $10,000,000 on premise database with only $100. It was 100x faster.
  12. Week 2 is when the practical, real life work begins. Pick up where you left off with your team, and discuss the right data to load into Snowflake. Clearly define the scope within that dataset, so you settle on a dataset that is large enough to be useful but also flexible enough to get out of it’s current location within the week. Once you’ve got your data, it’s time to create a warehouse, database and tables to load your data into. Pro tip: Remember to stay open minded about semi-structured data too - in fact, that might be the best dataset to get started with in Snowflake. Store semi-structured data in nested form within the Variant type column, and transform with dot notation using standard SQL statements.
  13. By week 3, you should have data loaded and perhaps you’ve already started querying and using it. If not, now is a good time to start. Make sure to take note of the business logic (in the form of calculations, derived fields, KPIs, etc) that it would make sense to add. Work with the team to futher define this logic, and experiment with zero-copy cloning to test transformations to your production data from the safety of a cloned database. When you’ve got your business logic added, look to add an additional warehouse for ongoing loading and transformation needs. Pro tip: The value of Snowflake increases exponentially with the number of related data sources you are able to load and integrate. In other words, sales data from Salesforce is more than twice as interesting when people are able to combine it with account based web interaction data from Google Analytics.
  14. As week 4 rolls around, it’s time to spread the value of your data as widely as possible. Add users to Snowflake, along with roles and permissions to match. Create auto-scaling warehouses for the BI, analytics and reporting teams to enable everyone to access data without contention. Connect Snowflake to your BI tool to begin creating the visualizations and dashboards that will power the insight you need. Pro tip: Many organizations that have traditionally relied on extracts or in-memory data are using Snowflake as a live-connection within their BI tool. Experiment and take advantage of the speed and flexibility that Snowflake can give your team.
  15. After 30 days, you should see some significant improvements. Your team should be talking about your data and collaborating more. You should be able to easily load and combine the data that matters to your business. There should be useful business logic within the data you loaded into Snowflake, and plans to test and expand even more. Your BI and analytics should be performing quickly on the data you’ve loaded, generating further interest in your overall plans for your data platform. The most important change you should see after this 30 day plan is within your relationships. The struggles that defined your loading, integration, analytics and collaboration should have given way to a new but promising spirit of mutual ownership. Next steps The next steps are up to you, but they look a lot like the first 30 day plan in elongated form. Continue the discussion. Load more data. Expand the number of users and groups that can access and benefit from the data that you’ve loaded in Snowflake. It’s also important to continually share and elevate the success and experiences you’ve had ending the struggle for data within your organization. Show executives and leaders the value of your data, and the time that you’ve put into perfecting it for analysis. Lastly, make sure to share your experiences outside of your own organization. Speak at conferences and events so you can synthesize what you’ve learned and spread the benefit of your experience to people that are still struggling with their data.