SlideShare a Scribd company logo
1© 2017 Snowflake Computing Inc. All Rights Reserved.
Y O U R D A T A , N O L I M I T S
A 30 Day Plan to End Your Struggle for Data
Common data struggles
© 2016 Snowflake Computing Inc. All Rights Reserved. 2
AnalyticsData IntegrationData Loading Collaboration
© 2016 Snowflake Computing Inc. All Rights Reserved. 3
Data Loading
© 2017 Snowflake Computing Inc. All Rights Reserved. 4
Struggle to Load Data
Resource Contention
Capacity Planning
Preparing disparate data to load
“Where can I connect to that
new JSON web log data?”
-BI Team
– Have to flatten to store semi-structured (or use noSQL)
– Storage and compute are limited
– Architecture forces linear compute capacity
© 2017 Snowflake Computing Inc. All Rights Reserved. 5
Tackle loading challenges with Snowflake
Contention
Capacity
Disparate data
– Variant column type supports semi-structured
– No more flattening (unless you want to)
– Built on the cloud (S3, EC2)
– Scale data and compute to load any data
– Unlimited virtual warehouses allow independent compute
– Isolate loading and other tasks
© 2016 Snowflake Computing Inc. All Rights Reserved. 6
Data Integration
© 2017 Snowflake Computing Inc. All Rights Reserved. 7
Struggle to Integrate Data
Making sense of data in silos
Editing and transforming data
Support evolving business logic and disparate use cases
– Hard to transform different datasets while in different
silos/formats
– noSQL tools complex, not all data stores ACID complaint
– Contention an issue while transforming
– No way to easily experiment with and add business logic
– Different people have different use cases
“Are the updated KPI’s in
the sensor data tables?”
- Data scientist
© 2017 Snowflake Computing Inc. All Rights Reserved. 8
Improve data integration with Snowflake
Silos
Editing and transforming
Business logic
– Native storage for semi-structured, ANSI standard SQL and dot notation to use it
– Combine all of your data fluidly
– ACID compliant with virtual data warehouses
– Edit, transform, insert, delete, however or whenever you want
– Zero-copy cloning
– Rapidly iterate, test and promote business logic for multiple
people
© 2016 Snowflake Computing Inc. All Rights Reserved. 9
Data analytics
© 2017 Snowflake Computing Inc. All Rights Reserved. 10
Struggle to Analyze Data
Queues
Delays
– Analysts are always the end of the resource priority
queue
– Even with unlimited access, database is non-performant
“How come the dashboard isn’t working?”
- Sales director
© 2017 Snowflake Computing Inc. All Rights Reserved. 11
Analyzing Efficiently with Snowflake
Queues
Delays
– Independent virtual warehouses
– Scale up, down or out to serve analytics use cases
– Autoscaling and multi-cluster warehouses
– Automatically match compute to even massive demand
© 2016 Snowflake Computing Inc. All Rights Reserved. 12
Collaboration
© 2017 Snowflake Computing Inc. All Rights Reserved. 13
Struggle to Collaborate
Incessant fixing
Siloed teams
– Fixing loading, integration and analytics struggles burns time
– Conflicts from those struggles reduce morale
– Technical and business teams often not working together (physically or
otherwise)
“I’m so buried under this queue I
can’t make the BI standup”
- IT team member
“I could ask IT for an updated table,
but I’m not sure who was working on it.”
- BI team member
© 2017 Snowflake Computing Inc. All Rights Reserved. 14
Start Collaborating with Snowflake
Fixing
Siloed teams
– Address the other struggles as referenced
– Free more time for collaboration and discussion
– With new time, start new discussions around data
– Build updates and additions into a scheduled meet-up
© 2016 Snowflake Computing Inc. All Rights Reserved. 15
A 30-day Plan to Start Ending
Your Struggle with Snowflake
© 2016 Snowflake Computing Inc. All Rights Reserved. 16
Start from the beginning – what’s the analytics goal?
1. Define the team
2. Discuss blocking issues and a place to start
3. Define the scope
4. Define success criteria
5. Try Snowflake On-Demand
6. Plan status updates going forward
Start Ending Your Data Struggle – Week 1
© 2017 Snowflake Computing Inc. All Rights Reserved. 17
Start Ending Your Data Struggle – Week 2
1. Find data to load
2. Create a Warehouse
3. Load data
– Work within defined scope, agree as a team
– Use data that’s new, challenging, or semi-structured
– Will need this to load data
– Create a database and a table
– Stage your data
– Load from stage to database
© 2017 Snowflake Computing Inc. All Rights Reserved. 18
Start Ending Your Data Struggle – Week 3
1. Test and deploy business logic
2. Optional: Create Integration WH
3. Optional: Plan ongoing loading and transform
– Discuss metrics, KPIs, transformations to add
– Use zero-copy cloning to test and then promote
– Isolate integration and transformation
– Use zero-copy cloning to test iterations safely and
promote
© 2017 Snowflake Computing Inc. All Rights Reserved. 19
Start Ending Your Data Struggle – Week 4
1. Create Warehouses for BI
2. Create analytics users
3. Connect your BI to Snowflake
– Avoid queues with isolated compute resources
– Optionally, set up auto-scaling
– Spread the value of the data
– Use this as an opportunity to share and discuss
– Use Tableau, Looker, etc. to query your data live
– Consider publishing dashboards with live connect
© 2017 Snowflake Computing Inc. All Rights Reserved. 20
After 30 days you should see improvements
1. Your team should be talking and collaborating more
2. You should be able to easily load and combine data
3. You should have accurate business logic in your data
4. You should be finding more insight
TRY SNOWFLAKE FOR FREE

More Related Content

What's hot

Snowflake Best Practices for Elastic Data Warehousing
Snowflake Best Practices for Elastic Data WarehousingSnowflake Best Practices for Elastic Data Warehousing
Snowflake Best Practices for Elastic Data Warehousing
Amazon Web Services
 
Snowflake Datawarehouse Architecturing
Snowflake Datawarehouse ArchitecturingSnowflake Datawarehouse Architecturing
Snowflake Datawarehouse Architecturing
Ishan Bhawantha Hewanayake
 
Snowflake essentials
Snowflake essentialsSnowflake essentials
Snowflake essentials
qureshihamid
 
Demystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFWDemystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFW
Kent Graziano
 
Snowflake Data Loading.pptx
Snowflake Data Loading.pptxSnowflake Data Loading.pptx
Snowflake Data Loading.pptx
Parag860410
 
Snowflake Data Science and AI/ML at Scale
Snowflake Data Science and AI/ML at ScaleSnowflake Data Science and AI/ML at Scale
Snowflake Data Science and AI/ML at Scale
Adam Doyle
 
Data Sharing with Snowflake
Data Sharing with SnowflakeData Sharing with Snowflake
Data Sharing with Snowflake
Snowflake Computing
 
Snowflake Architecture.pptx
Snowflake Architecture.pptxSnowflake Architecture.pptx
Snowflake Architecture.pptx
chennakesava44
 
How to Take Advantage of an Enterprise Data Warehouse in the Cloud
How to Take Advantage of an Enterprise Data Warehouse in the CloudHow to Take Advantage of an Enterprise Data Warehouse in the Cloud
How to Take Advantage of an Enterprise Data Warehouse in the Cloud
Denodo
 
Changing the game with cloud dw
Changing the game with cloud dwChanging the game with cloud dw
Changing the game with cloud dw
elephantscale
 
Snowflake Company Presentation
Snowflake Company PresentationSnowflake Company Presentation
Snowflake Company Presentation
AndrewJiang18
 
Apache Iceberg Presentation for the St. Louis Big Data IDEA
Apache Iceberg Presentation for the St. Louis Big Data IDEAApache Iceberg Presentation for the St. Louis Big Data IDEA
Apache Iceberg Presentation for the St. Louis Big Data IDEA
Adam Doyle
 
Demystifying Data Warehouse as a Service
Demystifying Data Warehouse as a ServiceDemystifying Data Warehouse as a Service
Demystifying Data Warehouse as a Service
Snowflake Computing
 
Snowflake: The Good, the Bad, and the Ugly
Snowflake: The Good, the Bad, and the UglySnowflake: The Good, the Bad, and the Ugly
Snowflake: The Good, the Bad, and the Ugly
Tyler Wishnoff
 
Time to Talk about Data Mesh
Time to Talk about Data MeshTime to Talk about Data Mesh
Time to Talk about Data Mesh
LibbySchulze
 
Elastic Data Warehousing
Elastic Data WarehousingElastic Data Warehousing
Elastic Data Warehousing
Snowflake Computing
 
Introduction to snowflake
Introduction to snowflakeIntroduction to snowflake
Introduction to snowflake
Sunil Gurav
 
Snowflake free trial_lab_guide
Snowflake free trial_lab_guideSnowflake free trial_lab_guide
Snowflake free trial_lab_guide
slidedown1
 
Data mesh
Data meshData mesh
Data mesh
ManojKumarR41
 
Data Warehouse - Incremental Migration to the Cloud
Data Warehouse - Incremental Migration to the CloudData Warehouse - Incremental Migration to the Cloud
Data Warehouse - Incremental Migration to the Cloud
Michael Rainey
 

What's hot (20)

Snowflake Best Practices for Elastic Data Warehousing
Snowflake Best Practices for Elastic Data WarehousingSnowflake Best Practices for Elastic Data Warehousing
Snowflake Best Practices for Elastic Data Warehousing
 
Snowflake Datawarehouse Architecturing
Snowflake Datawarehouse ArchitecturingSnowflake Datawarehouse Architecturing
Snowflake Datawarehouse Architecturing
 
Snowflake essentials
Snowflake essentialsSnowflake essentials
Snowflake essentials
 
Demystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFWDemystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFW
 
Snowflake Data Loading.pptx
Snowflake Data Loading.pptxSnowflake Data Loading.pptx
Snowflake Data Loading.pptx
 
Snowflake Data Science and AI/ML at Scale
Snowflake Data Science and AI/ML at ScaleSnowflake Data Science and AI/ML at Scale
Snowflake Data Science and AI/ML at Scale
 
Data Sharing with Snowflake
Data Sharing with SnowflakeData Sharing with Snowflake
Data Sharing with Snowflake
 
Snowflake Architecture.pptx
Snowflake Architecture.pptxSnowflake Architecture.pptx
Snowflake Architecture.pptx
 
How to Take Advantage of an Enterprise Data Warehouse in the Cloud
How to Take Advantage of an Enterprise Data Warehouse in the CloudHow to Take Advantage of an Enterprise Data Warehouse in the Cloud
How to Take Advantage of an Enterprise Data Warehouse in the Cloud
 
Changing the game with cloud dw
Changing the game with cloud dwChanging the game with cloud dw
Changing the game with cloud dw
 
Snowflake Company Presentation
Snowflake Company PresentationSnowflake Company Presentation
Snowflake Company Presentation
 
Apache Iceberg Presentation for the St. Louis Big Data IDEA
Apache Iceberg Presentation for the St. Louis Big Data IDEAApache Iceberg Presentation for the St. Louis Big Data IDEA
Apache Iceberg Presentation for the St. Louis Big Data IDEA
 
Demystifying Data Warehouse as a Service
Demystifying Data Warehouse as a ServiceDemystifying Data Warehouse as a Service
Demystifying Data Warehouse as a Service
 
Snowflake: The Good, the Bad, and the Ugly
Snowflake: The Good, the Bad, and the UglySnowflake: The Good, the Bad, and the Ugly
Snowflake: The Good, the Bad, and the Ugly
 
Time to Talk about Data Mesh
Time to Talk about Data MeshTime to Talk about Data Mesh
Time to Talk about Data Mesh
 
Elastic Data Warehousing
Elastic Data WarehousingElastic Data Warehousing
Elastic Data Warehousing
 
Introduction to snowflake
Introduction to snowflakeIntroduction to snowflake
Introduction to snowflake
 
Snowflake free trial_lab_guide
Snowflake free trial_lab_guideSnowflake free trial_lab_guide
Snowflake free trial_lab_guide
 
Data mesh
Data meshData mesh
Data mesh
 
Data Warehouse - Incremental Migration to the Cloud
Data Warehouse - Incremental Migration to the CloudData Warehouse - Incremental Migration to the Cloud
Data Warehouse - Incremental Migration to the Cloud
 

Similar to A 30 day plan to start ending your data struggle with Snowflake

Business Intelligence is more than just pretty visuals
Business Intelligence is more than just pretty visualsBusiness Intelligence is more than just pretty visuals
Business Intelligence is more than just pretty visuals
Vincent Woon
 
Nw2008 tips tricks_edw_v10
Nw2008 tips tricks_edw_v10Nw2008 tips tricks_edw_v10
Nw2008 tips tricks_edw_v10
Harsha Gowda B R
 
The Key to Big Data Modeling: Collaboration
The Key to Big Data Modeling: CollaborationThe Key to Big Data Modeling: Collaboration
The Key to Big Data Modeling: Collaboration
Embarcadero Technologies
 
10 Reasons Snowflake Is Great for Analytics
10 Reasons Snowflake Is Great for Analytics10 Reasons Snowflake Is Great for Analytics
10 Reasons Snowflake Is Great for Analytics
Senturus
 
IT + Line of Business - Driving Faster, Deeper Insights Together
IT + Line of Business - Driving Faster, Deeper Insights TogetherIT + Line of Business - Driving Faster, Deeper Insights Together
IT + Line of Business - Driving Faster, Deeper Insights Together
DATAVERSITY
 
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...Pentaho
 
18. Madhur Hemnani - Result Orientated Innovation with Oracle HR Analytics
18. Madhur Hemnani - Result Orientated Innovation with Oracle HR Analytics18. Madhur Hemnani - Result Orientated Innovation with Oracle HR Analytics
18. Madhur Hemnani - Result Orientated Innovation with Oracle HR Analytics
Cedar Consulting
 
Q1 and q2 2020 role overview
Q1 and q2 2020 role overviewQ1 and q2 2020 role overview
Q1 and q2 2020 role overview
DaveMeckler
 
Azure + DataStax Enterprise Powers Office 365 Per User Store
Azure + DataStax Enterprise Powers Office 365 Per User StoreAzure + DataStax Enterprise Powers Office 365 Per User Store
Azure + DataStax Enterprise Powers Office 365 Per User Store
DataStax Academy
 
Managing Large Amounts of Data with Salesforce
Managing Large Amounts of Data with SalesforceManaging Large Amounts of Data with Salesforce
Managing Large Amounts of Data with Salesforce
Sense Corp
 
Transforming Data Management and Time to Insight with Anzo Smart Data Lake®
Transforming Data Management and Time to Insight with Anzo Smart Data Lake®Transforming Data Management and Time to Insight with Anzo Smart Data Lake®
Transforming Data Management and Time to Insight with Anzo Smart Data Lake®
Cambridge Semantics
 
Top 10 Tips for an Effective Postgres Deployment
Top 10 Tips for an Effective Postgres DeploymentTop 10 Tips for an Effective Postgres Deployment
Top 10 Tips for an Effective Postgres Deployment
EDB
 
IBM Cognos Analytics Reporting vs. Dashboarding: Matching Tools to Business R...
IBM Cognos Analytics Reporting vs. Dashboarding: Matching Tools to Business R...IBM Cognos Analytics Reporting vs. Dashboarding: Matching Tools to Business R...
IBM Cognos Analytics Reporting vs. Dashboarding: Matching Tools to Business R...
Senturus
 
Does it only have to be ML + AI?
Does it only have to be ML + AI?Does it only have to be ML + AI?
Does it only have to be ML + AI?
Harald Erb
 
Unify Data at Memory Speed
Unify Data at Memory SpeedUnify Data at Memory Speed
Unify Data at Memory Speed
Alluxio, Inc.
 
Big Data & Information Management Channel Manager
Big Data & Information Management Channel ManagerBig Data & Information Management Channel Manager
Big Data & Information Management Channel Manager
Arrow ECS UK
 
Building MuleSoft Applications with Google BigQuery Meetup 4
Building MuleSoft Applications with Google BigQuery Meetup 4Building MuleSoft Applications with Google BigQuery Meetup 4
Building MuleSoft Applications with Google BigQuery Meetup 4
MannaAkpan
 
IBM Cognos Analytics Release 7+ Authoring Improvements: Demos of New and Rein...
IBM Cognos Analytics Release 7+ Authoring Improvements: Demos of New and Rein...IBM Cognos Analytics Release 7+ Authoring Improvements: Demos of New and Rein...
IBM Cognos Analytics Release 7+ Authoring Improvements: Demos of New and Rein...
Senturus
 
How to grow to a modern workplace in 16 steps with microsoft 365
How to grow to a modern workplace in 16 steps with microsoft 365How to grow to a modern workplace in 16 steps with microsoft 365
How to grow to a modern workplace in 16 steps with microsoft 365
Tim Hermie ☁️
 
Postgres Integrates Effectively in the "Enterprise Sandbox"
Postgres Integrates Effectively in the "Enterprise Sandbox"Postgres Integrates Effectively in the "Enterprise Sandbox"
Postgres Integrates Effectively in the "Enterprise Sandbox"
EDB
 

Similar to A 30 day plan to start ending your data struggle with Snowflake (20)

Business Intelligence is more than just pretty visuals
Business Intelligence is more than just pretty visualsBusiness Intelligence is more than just pretty visuals
Business Intelligence is more than just pretty visuals
 
Nw2008 tips tricks_edw_v10
Nw2008 tips tricks_edw_v10Nw2008 tips tricks_edw_v10
Nw2008 tips tricks_edw_v10
 
The Key to Big Data Modeling: Collaboration
The Key to Big Data Modeling: CollaborationThe Key to Big Data Modeling: Collaboration
The Key to Big Data Modeling: Collaboration
 
10 Reasons Snowflake Is Great for Analytics
10 Reasons Snowflake Is Great for Analytics10 Reasons Snowflake Is Great for Analytics
10 Reasons Snowflake Is Great for Analytics
 
IT + Line of Business - Driving Faster, Deeper Insights Together
IT + Line of Business - Driving Faster, Deeper Insights TogetherIT + Line of Business - Driving Faster, Deeper Insights Together
IT + Line of Business - Driving Faster, Deeper Insights Together
 
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
 
18. Madhur Hemnani - Result Orientated Innovation with Oracle HR Analytics
18. Madhur Hemnani - Result Orientated Innovation with Oracle HR Analytics18. Madhur Hemnani - Result Orientated Innovation with Oracle HR Analytics
18. Madhur Hemnani - Result Orientated Innovation with Oracle HR Analytics
 
Q1 and q2 2020 role overview
Q1 and q2 2020 role overviewQ1 and q2 2020 role overview
Q1 and q2 2020 role overview
 
Azure + DataStax Enterprise Powers Office 365 Per User Store
Azure + DataStax Enterprise Powers Office 365 Per User StoreAzure + DataStax Enterprise Powers Office 365 Per User Store
Azure + DataStax Enterprise Powers Office 365 Per User Store
 
Managing Large Amounts of Data with Salesforce
Managing Large Amounts of Data with SalesforceManaging Large Amounts of Data with Salesforce
Managing Large Amounts of Data with Salesforce
 
Transforming Data Management and Time to Insight with Anzo Smart Data Lake®
Transforming Data Management and Time to Insight with Anzo Smart Data Lake®Transforming Data Management and Time to Insight with Anzo Smart Data Lake®
Transforming Data Management and Time to Insight with Anzo Smart Data Lake®
 
Top 10 Tips for an Effective Postgres Deployment
Top 10 Tips for an Effective Postgres DeploymentTop 10 Tips for an Effective Postgres Deployment
Top 10 Tips for an Effective Postgres Deployment
 
IBM Cognos Analytics Reporting vs. Dashboarding: Matching Tools to Business R...
IBM Cognos Analytics Reporting vs. Dashboarding: Matching Tools to Business R...IBM Cognos Analytics Reporting vs. Dashboarding: Matching Tools to Business R...
IBM Cognos Analytics Reporting vs. Dashboarding: Matching Tools to Business R...
 
Does it only have to be ML + AI?
Does it only have to be ML + AI?Does it only have to be ML + AI?
Does it only have to be ML + AI?
 
Unify Data at Memory Speed
Unify Data at Memory SpeedUnify Data at Memory Speed
Unify Data at Memory Speed
 
Big Data & Information Management Channel Manager
Big Data & Information Management Channel ManagerBig Data & Information Management Channel Manager
Big Data & Information Management Channel Manager
 
Building MuleSoft Applications with Google BigQuery Meetup 4
Building MuleSoft Applications with Google BigQuery Meetup 4Building MuleSoft Applications with Google BigQuery Meetup 4
Building MuleSoft Applications with Google BigQuery Meetup 4
 
IBM Cognos Analytics Release 7+ Authoring Improvements: Demos of New and Rein...
IBM Cognos Analytics Release 7+ Authoring Improvements: Demos of New and Rein...IBM Cognos Analytics Release 7+ Authoring Improvements: Demos of New and Rein...
IBM Cognos Analytics Release 7+ Authoring Improvements: Demos of New and Rein...
 
How to grow to a modern workplace in 16 steps with microsoft 365
How to grow to a modern workplace in 16 steps with microsoft 365How to grow to a modern workplace in 16 steps with microsoft 365
How to grow to a modern workplace in 16 steps with microsoft 365
 
Postgres Integrates Effectively in the "Enterprise Sandbox"
Postgres Integrates Effectively in the "Enterprise Sandbox"Postgres Integrates Effectively in the "Enterprise Sandbox"
Postgres Integrates Effectively in the "Enterprise Sandbox"
 

Recently uploaded

Top Features to Include in Your Winzo Clone App for Business Growth (4).pptx
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptxTop Features to Include in Your Winzo Clone App for Business Growth (4).pptx
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptx
rickgrimesss22
 
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Mind IT Systems
 
Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"
Donna Lenk
 
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdfDominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
AMB-Review
 
Enhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdfEnhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdf
Globus
 
Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024
Globus
 
Globus Compute Introduction - GlobusWorld 2024
Globus Compute Introduction - GlobusWorld 2024Globus Compute Introduction - GlobusWorld 2024
Globus Compute Introduction - GlobusWorld 2024
Globus
 
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
informapgpstrackings
 
Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...
Globus
 
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Globus
 
Introduction to Pygame (Lecture 7 Python Game Development)
Introduction to Pygame (Lecture 7 Python Game Development)Introduction to Pygame (Lecture 7 Python Game Development)
Introduction to Pygame (Lecture 7 Python Game Development)
abdulrafaychaudhry
 
Webinar: Salesforce Document Management 2.0 - Smarter, Faster, Better
Webinar: Salesforce Document Management 2.0 - Smarter, Faster, BetterWebinar: Salesforce Document Management 2.0 - Smarter, Faster, Better
Webinar: Salesforce Document Management 2.0 - Smarter, Faster, Better
XfilesPro
 
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.ILBeyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
Natan Silnitsky
 
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing SuiteAI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
Google
 
Lecture 1 Introduction to games development
Lecture 1 Introduction to games developmentLecture 1 Introduction to games development
Lecture 1 Introduction to games development
abdulrafaychaudhry
 
Launch Your Streaming Platforms in Minutes
Launch Your Streaming Platforms in MinutesLaunch Your Streaming Platforms in Minutes
Launch Your Streaming Platforms in Minutes
Roshan Dwivedi
 
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Globus
 
Graphic Design Crash Course for beginners
Graphic Design Crash Course for beginnersGraphic Design Crash Course for beginners
Graphic Design Crash Course for beginners
e20449
 
Top 7 Unique WhatsApp API Benefits | Saudi Arabia
Top 7 Unique WhatsApp API Benefits | Saudi ArabiaTop 7 Unique WhatsApp API Benefits | Saudi Arabia
Top 7 Unique WhatsApp API Benefits | Saudi Arabia
Yara Milbes
 
APIs for Browser Automation (MoT Meetup 2024)
APIs for Browser Automation (MoT Meetup 2024)APIs for Browser Automation (MoT Meetup 2024)
APIs for Browser Automation (MoT Meetup 2024)
Boni García
 

Recently uploaded (20)

Top Features to Include in Your Winzo Clone App for Business Growth (4).pptx
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptxTop Features to Include in Your Winzo Clone App for Business Growth (4).pptx
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptx
 
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...
 
Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"
 
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdfDominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
Dominate Social Media with TubeTrivia AI’s Addictive Quiz Videos.pdf
 
Enhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdfEnhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdf
 
Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024
 
Globus Compute Introduction - GlobusWorld 2024
Globus Compute Introduction - GlobusWorld 2024Globus Compute Introduction - GlobusWorld 2024
Globus Compute Introduction - GlobusWorld 2024
 
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
 
Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...
 
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
 
Introduction to Pygame (Lecture 7 Python Game Development)
Introduction to Pygame (Lecture 7 Python Game Development)Introduction to Pygame (Lecture 7 Python Game Development)
Introduction to Pygame (Lecture 7 Python Game Development)
 
Webinar: Salesforce Document Management 2.0 - Smarter, Faster, Better
Webinar: Salesforce Document Management 2.0 - Smarter, Faster, BetterWebinar: Salesforce Document Management 2.0 - Smarter, Faster, Better
Webinar: Salesforce Document Management 2.0 - Smarter, Faster, Better
 
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.ILBeyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
Beyond Event Sourcing - Embracing CRUD for Wix Platform - Java.IL
 
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing SuiteAI Pilot Review: The World’s First Virtual Assistant Marketing Suite
AI Pilot Review: The World’s First Virtual Assistant Marketing Suite
 
Lecture 1 Introduction to games development
Lecture 1 Introduction to games developmentLecture 1 Introduction to games development
Lecture 1 Introduction to games development
 
Launch Your Streaming Platforms in Minutes
Launch Your Streaming Platforms in MinutesLaunch Your Streaming Platforms in Minutes
Launch Your Streaming Platforms in Minutes
 
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
 
Graphic Design Crash Course for beginners
Graphic Design Crash Course for beginnersGraphic Design Crash Course for beginners
Graphic Design Crash Course for beginners
 
Top 7 Unique WhatsApp API Benefits | Saudi Arabia
Top 7 Unique WhatsApp API Benefits | Saudi ArabiaTop 7 Unique WhatsApp API Benefits | Saudi Arabia
Top 7 Unique WhatsApp API Benefits | Saudi Arabia
 
APIs for Browser Automation (MoT Meetup 2024)
APIs for Browser Automation (MoT Meetup 2024)APIs for Browser Automation (MoT Meetup 2024)
APIs for Browser Automation (MoT Meetup 2024)
 

A 30 day plan to start ending your data struggle with Snowflake

  • 1. 1© 2017 Snowflake Computing Inc. All Rights Reserved. Y O U R D A T A , N O L I M I T S A 30 Day Plan to End Your Struggle for Data
  • 2. Common data struggles © 2016 Snowflake Computing Inc. All Rights Reserved. 2 AnalyticsData IntegrationData Loading Collaboration
  • 3. © 2016 Snowflake Computing Inc. All Rights Reserved. 3 Data Loading
  • 4. © 2017 Snowflake Computing Inc. All Rights Reserved. 4 Struggle to Load Data Resource Contention Capacity Planning Preparing disparate data to load “Where can I connect to that new JSON web log data?” -BI Team – Have to flatten to store semi-structured (or use noSQL) – Storage and compute are limited – Architecture forces linear compute capacity
  • 5. © 2017 Snowflake Computing Inc. All Rights Reserved. 5 Tackle loading challenges with Snowflake Contention Capacity Disparate data – Variant column type supports semi-structured – No more flattening (unless you want to) – Built on the cloud (S3, EC2) – Scale data and compute to load any data – Unlimited virtual warehouses allow independent compute – Isolate loading and other tasks
  • 6. © 2016 Snowflake Computing Inc. All Rights Reserved. 6 Data Integration
  • 7. © 2017 Snowflake Computing Inc. All Rights Reserved. 7 Struggle to Integrate Data Making sense of data in silos Editing and transforming data Support evolving business logic and disparate use cases – Hard to transform different datasets while in different silos/formats – noSQL tools complex, not all data stores ACID complaint – Contention an issue while transforming – No way to easily experiment with and add business logic – Different people have different use cases “Are the updated KPI’s in the sensor data tables?” - Data scientist
  • 8. © 2017 Snowflake Computing Inc. All Rights Reserved. 8 Improve data integration with Snowflake Silos Editing and transforming Business logic – Native storage for semi-structured, ANSI standard SQL and dot notation to use it – Combine all of your data fluidly – ACID compliant with virtual data warehouses – Edit, transform, insert, delete, however or whenever you want – Zero-copy cloning – Rapidly iterate, test and promote business logic for multiple people
  • 9. © 2016 Snowflake Computing Inc. All Rights Reserved. 9 Data analytics
  • 10. © 2017 Snowflake Computing Inc. All Rights Reserved. 10 Struggle to Analyze Data Queues Delays – Analysts are always the end of the resource priority queue – Even with unlimited access, database is non-performant “How come the dashboard isn’t working?” - Sales director
  • 11. © 2017 Snowflake Computing Inc. All Rights Reserved. 11 Analyzing Efficiently with Snowflake Queues Delays – Independent virtual warehouses – Scale up, down or out to serve analytics use cases – Autoscaling and multi-cluster warehouses – Automatically match compute to even massive demand
  • 12. © 2016 Snowflake Computing Inc. All Rights Reserved. 12 Collaboration
  • 13. © 2017 Snowflake Computing Inc. All Rights Reserved. 13 Struggle to Collaborate Incessant fixing Siloed teams – Fixing loading, integration and analytics struggles burns time – Conflicts from those struggles reduce morale – Technical and business teams often not working together (physically or otherwise) “I’m so buried under this queue I can’t make the BI standup” - IT team member “I could ask IT for an updated table, but I’m not sure who was working on it.” - BI team member
  • 14. © 2017 Snowflake Computing Inc. All Rights Reserved. 14 Start Collaborating with Snowflake Fixing Siloed teams – Address the other struggles as referenced – Free more time for collaboration and discussion – With new time, start new discussions around data – Build updates and additions into a scheduled meet-up
  • 15. © 2016 Snowflake Computing Inc. All Rights Reserved. 15 A 30-day Plan to Start Ending Your Struggle with Snowflake
  • 16. © 2016 Snowflake Computing Inc. All Rights Reserved. 16 Start from the beginning – what’s the analytics goal? 1. Define the team 2. Discuss blocking issues and a place to start 3. Define the scope 4. Define success criteria 5. Try Snowflake On-Demand 6. Plan status updates going forward Start Ending Your Data Struggle – Week 1
  • 17. © 2017 Snowflake Computing Inc. All Rights Reserved. 17 Start Ending Your Data Struggle – Week 2 1. Find data to load 2. Create a Warehouse 3. Load data – Work within defined scope, agree as a team – Use data that’s new, challenging, or semi-structured – Will need this to load data – Create a database and a table – Stage your data – Load from stage to database
  • 18. © 2017 Snowflake Computing Inc. All Rights Reserved. 18 Start Ending Your Data Struggle – Week 3 1. Test and deploy business logic 2. Optional: Create Integration WH 3. Optional: Plan ongoing loading and transform – Discuss metrics, KPIs, transformations to add – Use zero-copy cloning to test and then promote – Isolate integration and transformation – Use zero-copy cloning to test iterations safely and promote
  • 19. © 2017 Snowflake Computing Inc. All Rights Reserved. 19 Start Ending Your Data Struggle – Week 4 1. Create Warehouses for BI 2. Create analytics users 3. Connect your BI to Snowflake – Avoid queues with isolated compute resources – Optionally, set up auto-scaling – Spread the value of the data – Use this as an opportunity to share and discuss – Use Tableau, Looker, etc. to query your data live – Consider publishing dashboards with live connect
  • 20. © 2017 Snowflake Computing Inc. All Rights Reserved. 20 After 30 days you should see improvements 1. Your team should be talking and collaborating more 2. You should be able to easily load and combine data 3. You should have accurate business logic in your data 4. You should be finding more insight TRY SNOWFLAKE FOR FREE

Editor's Notes

  1. Data loading – struggle to load, store and manage data Data integration – struggle to unify and integrate disparate data sources Analytics – Struggle to analyze data quickly and effectively Collaboration – Because your spending so much time on the other three problems, its difficult to get everyone on the same page, to work together to find insight in your data
  2. Preparing disparate data to load The struggle to load data begins with the need to prepare disparate datasets to load. Many organizations are dealing with a host of new semi-structured data in formats like JSON and Avro that require flattening to load into a relational database. Or, they choose to store semi-structured data separate from relational data in a NoSQL store, creating silos. Capacity planning Finding space for data can be another enormous challenge. Large numbers of complex datasets can quickly snowball into a storage capacity issue on fixed size on-premises or cloud data platforms. Resource contention Loading large datasets also requires significant compute capacity. Many data warehouses are already strained under normal business workloads, and the compute needed for loading forces those other processes to be pushed back in the priority queue. All of these problems lead to difficult conversations about whose data or use case is most important. One project might need funding for an open source, semi-structured data store. Another wants to expand the on-premises data warehouse. One team wants to load clickstream data, and another needs finance data. Prioritizing completely different needs can be a minefield that leads to a host of struggles within and between teams.
  3. Tackle loading challenges with Snowflake Snowflake addresses each loading challenge with simplicity. Semi-structured data can be loaded natively alongside structured data, and queried together in one location. Because Snowflake’s built on the cloud, you can store as much data as you want with no need to prioritize different datasets. Best of all, you can create independent compute resources, called virtual warehouses, for each of your use cases, negating the need for queues.
  4. Making sense of data in silos With data scattered across NoSQL data lakes, cloud applications, and data warehouses (not to mention flat files and CSVs), organizations are struggling to combine and analyze their data in one cohesive picture. Editing and transforming data Every system that stores data has it’s challenges, but many organizations are finding it particularly hard to analyze and understand data in NoSQL systems like Hadoop. Semi-structured open source data stores require a large amount of custom configuration, uncommon skillets, and transformation to successfully combine with other business data. They also rarely support edit, update and insert commands that are essential to data modeling and transformation. Supporting evolving business logic and disparate use cases It’s hard for the business to drive evolutions in business logic within the database when it takes arduous manual process to test and update. Often, entire databases need to be physically copied in order to test a simple change to a table or derived field, which can be extremely expensive and time consuming. Because different people within the organization have different data needs, a “single source of the truth” is often too ungainly and impractical for most organizations to maintain and use. All of these problems make it difficult to generate a refined view of what the data actually says. Differing methods of transforming data arise, with competing factions struggling to promote their own methods of working with, storing and querying data. People from throughout the business wonder where they can find the “right” version of their metrics and KPIs.
  5. Improve data integration with Snowflake Snowflake makes data integration straightforward. You can load all of your data, in almost any structured or semi-structured format, so you can avoid data silos. Transforming is made easier with ANSI standard SQL and dot notation for semi-structured data. Inserts, deletes and other common operations are fully supported. You can even rapidly test and update with zero-copy cloning, driving faster iteration in business logic.
  6. Queues Analytics users are always at the bottom of the resource priority queue. It’s not always designed to be that way, but if ETL, as a simplified example, needs to run for 45 minutes every hour, then there’s little time left over for the analytics team to access and iterate on the database. Delays Through the eyes of an analyst, nothing ever works fast enough. But, often disappointing performance isn’t for lack of trying. Many data warehouses require hours and hours of painstaking optimization, tuning, indexing, sorting, and vacuuming from a dedicated data engineer.  To add to the pain, often one optimization will lead to deoptimization in another area The struggle to analyze data is one of the most visible. Report consumers complain that the BI tool isn’t working fast enough. The BI team points their finger to the data engineers. But, at the end of the day, antiquated database technology is the real culprit.
  7. Analyzing efficiently with Snowflake Snowflake addresses efficient analytics in two ways. As we saw before, independent virtual warehouses can help with concurrent queries, allowing ETL and BI to run side by side at the same time. Large or variable analytics workloads within a single warehouse can be dealt with using mutli-cluster warehouses, and even autoscaling to automatically match your compute resources to need.
  8. The struggle to load, integrate and analyze data leads to a fourth struggle that’s often the worst. Collaboration.
  9. Incessant fixing If the organization spends all its time endlessly solving loading, integration, and analytics struggles, it’s impossible to break away and think at a higher level about what needs to be accomplished. Data is a constant flash point of disagreement, rather than a rallying point for collaboration. Siloed teams Historically, there’s been a dividing line between technical, IT implementers, and less-technical business side consumers. This was partly driven by technology, but reinforced by organizational structures that don’t favor cross team collaboration. The lack of collaboration is the end result of the struggles, and the most frustrating of them all. How can two disparate types of people, on two different teams (or multiple different teams) effectively work together when they are completely buried under the weight of their antiquated data platform.
  10. Analyzing efficiently with Snowflake As we noted previously, Snowflake can help to solve loading, integration and analytics struggles, freeing time for collaboration and higher level planning. Working together with Snowflake, the dividing line between IT and BI becomes less important. IT can lead the business with technology and empower the BI team to analyze data. On the same token, with more accessible technology in the form of Snowflake, BI teams can take an active role in the curation and modeling of data that has historically rested solely on IT’s shoulders.
  11. Week one is all about the team. It’s time to bring everyone around the same table to figure out the best way to move forward with your data. Keep your conversation focused on an achievable goal: trying to get an important dataset into Snowflake for analysis. Discuss blocking issues, but be sure to define them in terms of technology, rather than people. Once you’ve got a plan to get around any blocking issues, set up Snowflake On-Demand for free and make a plan to bring the team together for status updates in the weeks to follow. Pro tip: Think big. Every new Snowflake On-Demand customer gets $400 in free credits to play around with, more than enough to load and store a massive dataset. One Snowflake customer performance tested Snowflake against a $10,000,000 on premise database with only $100. It was 100x faster.
  12. Week 2 is when the practical, real life work begins. Pick up where you left off with your team, and discuss the right data to load into Snowflake. Clearly define the scope within that dataset, so you settle on a dataset that is large enough to be useful but also flexible enough to get out of it’s current location within the week. Once you’ve got your data, it’s time to create a warehouse, database and tables to load your data into. Pro tip: Remember to stay open minded about semi-structured data too - in fact, that might be the best dataset to get started with in Snowflake. Store semi-structured data in nested form within the Variant type column, and transform with dot notation using standard SQL statements.
  13. By week 3, you should have data loaded and perhaps you’ve already started querying and using it. If not, now is a good time to start. Make sure to take note of the business logic (in the form of calculations, derived fields, KPIs, etc) that it would make sense to add. Work with the team to futher define this logic, and experiment with zero-copy cloning to test transformations to your production data from the safety of a cloned database. When you’ve got your business logic added, look to add an additional warehouse for ongoing loading and transformation needs. Pro tip: The value of Snowflake increases exponentially with the number of related data sources you are able to load and integrate. In other words, sales data from Salesforce is more than twice as interesting when people are able to combine it with account based web interaction data from Google Analytics.
  14. As week 4 rolls around, it’s time to spread the value of your data as widely as possible. Add users to Snowflake, along with roles and permissions to match. Create auto-scaling warehouses for the BI, analytics and reporting teams to enable everyone to access data without contention. Connect Snowflake to your BI tool to begin creating the visualizations and dashboards that will power the insight you need. Pro tip: Many organizations that have traditionally relied on extracts or in-memory data are using Snowflake as a live-connection within their BI tool. Experiment and take advantage of the speed and flexibility that Snowflake can give your team.
  15. After 30 days, you should see some significant improvements. Your team should be talking about your data and collaborating more. You should be able to easily load and combine the data that matters to your business. There should be useful business logic within the data you loaded into Snowflake, and plans to test and expand even more. Your BI and analytics should be performing quickly on the data you’ve loaded, generating further interest in your overall plans for your data platform. The most important change you should see after this 30 day plan is within your relationships. The struggles that defined your loading, integration, analytics and collaboration should have given way to a new but promising spirit of mutual ownership. Next steps The next steps are up to you, but they look a lot like the first 30 day plan in elongated form. Continue the discussion. Load more data. Expand the number of users and groups that can access and benefit from the data that you’ve loaded in Snowflake. It’s also important to continually share and elevate the success and experiences you’ve had ending the struggle for data within your organization. Show executives and leaders the value of your data, and the time that you’ve put into perfecting it for analysis. Lastly, make sure to share your experiences outside of your own organization. Speak at conferences and events so you can synthesize what you’ve learned and spread the benefit of your experience to people that are still struggling with their data.