© 2018 Snowflake Computing Inc. All Rights
Reserved.
August 24, 2018© Snowflake Computing Inc. All Rights Reserved
ZERO TO
SNOWFLAKE IN
90 MINUTES
in partnership with
© 2018 Snowflake Computing Inc. All Rights
Reserved.
• Download Materials @ https://tinyurl.com/yy9frfuw (Hotmail / outlook / o365)
• Download Worksheets and unzip
• Partner Introduction
• Snowflake Introduction
• Data Warehousing Today
• What Users do with Snowflake
• Hands on Snowflake
• Data loading- sample data set is shared
• Integrations
• Multi-clustering
• And more!
• Scaling/Workload Isolation
• Snowflake in the Real World- Data Sharing and Customer Example
• Conclusion
AGENDA
© 2018DMICONFIDENTIAL&PROPRIETARY
3
DATA & ANALYTICS
Snowflake
Solution Integration
Partner
Creating an intelligent existence by linking physical
& digital worlds to unleash the power of connectivity
〉 DATA PLATFORM SOLUTIONS
〉 MODERN DATA SOLUTIONS
〉 VISUAL SOLUTIONS
〉 ADVANCED ANALYTICS
〉 ENTERPRISE DATA STRATEGY
〉 EXECUTIVE ADVISORY
〉 ANALYTICS AS A SERVICE
〉 AGILE ANALYTICS
© 2018DMICONFIDENTIAL&PROPRIETARY
4
DATA & ANALYTICS
PARTNERSHIPS
***8 MOST IMPORTANT AI AND ANALYTIC TRENDS OF 2019 EBOOK AND MORE INFO ABOUT DMI @ WWW.DMINC.COM/EBOOK M
Today’s Moderators
〉 Brett VanderPlaats
Sr. Data Architect, Data Platform & Analytics
〉 David Mellinger
Practice Lead, Data Platform & Analytics
1,500+ CUSTOMERS
Building new analytic applications
Delivered new analytic application to
pharmacies using Snowflake
Moving to the cloud
Using Snowflake to move data analytics
to the cloud
Modernizing data platforms
Replaced data warehouse appliance +
Hadoop with Snowflake
Accelerating enterprise BI and analytics
Moved from legacy data warehouse
systems (appliance & cloud) to
Snowflake
© 2018 Snowflake Computing Inc. All Rights Reserved. 5
© 2018 Snowflake Computing Inc. All Rights
Reserved.
What Are Users Doing Today?
© 2018 Snowflake Computing Inc. All Rights
Reserved.
© 2018 Snowflake Computing Inc. All Rights
Reserved.
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
What would you like in your Data Warehouse?
8
Complete
SQL Database
Zero
Management
All of
Your
Data
All of
Your
Users
Pay Only for
What You
Use
Live Data
Sharing
© 2018 Snowflake Computing Inc. All Rights
Reserved.
What Are Users Doing with Snowflake?
© 2018 Snowflake Computing Inc. All Rights
Reserved.
What Are User Doing with Snowflake?
© 2018 Snowflake Computing Inc. All Rights
Reserved.
What Are User Doing with Snowflake?
© 2018 Snowflake Computing Inc. All Rights
Reserved.
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Snowflake’s Differentiating Architecture
© 2018 Snowflake Computing Inc. All Rights
Reserved.
SNOWFLAKE’S MULTI-CLUSTER, SHARED DATA ARCHITECTURE
Centralized storage
Instant, automatic scalability & elasticity
Service
Compute
Storage
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Why Scaling Compute Saves $$$$$$
© 2018 Snowflake Computing Inc. All Rights
Reserved.
How Does Snowflake Fit?
ADVANCED
ANALYTICS
INTEGRATION
TOOLS
ELT
Stream
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Scale compute and concurrency
ADF
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Scale Up (Response Time)
Queries that are:
--Large, Complex, many Calculations
Scale Out
(Throuhput)
Many users
or processes
concurrently
querying
S M L XL 2XXS 3X 4X
XS
XS
XS
XS
XS
XS
XS
XS
XS
L L
2X
2X2X2X
4X
4X
4X
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Differences from other EDW vendors
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Business Implications for Organizations
• Native ANSI-SQL database for leveraging existing skills
• Reducing expensive retraining
• Interoperability with existing tools (Power BI, Tableau, and others have SF
connectors)
• Simplified Migration
• Support/Documentation on Snowflake – Take a look at
© 2018 Snowflake Computing Inc. All Rights
Reserved.
But Wait….
…There’s
© 2018 Snowflake Computing Inc. All Rights
Reserved.
DEV OPS?
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Data Protection w/o Restore?
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Data Sharing (Monetization)
Disaster Recovery
***Platform (AZURE AWS) in 2019
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Native Support for
structured and semi-
structured data
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Cross Region Data Replication
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Comprehensive data protection
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
BIO-BREAK
THEN
LET’S DIVE INTO
SNOWFLAKE!
https://tinyurl.com/yy9frfuw
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
Module 1
Snowflake DB/WH/Object
Configuration
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
Module 2
Queries and Performance
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Azure Storage
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Performance through Caching
Azure Storage
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Query
Profiler•Processing — time spent on data processing by the CPU.
•Local Disk IO — time when the processing was blocked by local disk access.
•Remote Disk IO — time when the processing was blocked by remote disk access.
•Network Communication — time when the processing was waiting for the network data
transfer.
•Synchronization — various synchronization activities between participating processes.
•Initialization — time spent setting up the query processing.
•IO — information about the input-output operations performed during the query:
• Scan progress — the percentage of data scanned for a given table so far.
• Bytes scanned — the number of bytes scanned so far.
• Percentage scanned from cache — the percentage of data scanned from the local disk
cache.
• Bytes written — bytes written (e.g. when loading into a table).
• Bytes written to result — bytes written to a result object.
• Bytes read from result — bytes read from a result object.
• External bytes scanned — bytes read from an external object, e.g. a stage.
•DML — statistics for Data Manipulation Language (DML) queries:
• Number of rows inserted — number of rows inserted into a table (or tables).
• Number of rows updated — number of rows updated in a table.
• Number of rows deleted — number of rows deleted from a table.
• Number of rows unloaded — number of rows unloaded during data export.
• Number of bytes deleted — number of bytes deleted from a table.
•Pruning — information on the effects of table pruning:
• Partitions scanned — number of partitions scanned so far.
• Partitions total — total number of partitions in a given table.
•Spilling — information about disk usage for operations where intermediate results do not fit in memory:
• Bytes spilled to local storage — volume of data spilled to local disk.
• Bytes spilled to remote storage — volume of data spilled to remote disk.
•Network — network communication:
• Bytes sent over the network — amount of data sent over the network.
•EXECUTION TIME
•STATISTICS
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Multi-cluster Warehouse
© 2018 Snowflake Computing Inc. All Rights
Reserved.
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Micropartion
s
&
Pruning
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Enterprise Grade Security
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
Module 3
Unstructured Data
© 2018 Snowflake Computing Inc. All Rights
Reserved.
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
Module 4
Loading Data
© 2018 Snowflake Computing Inc. All Rights
Reserved.
• 4 objects for data loading
1) Source
2) Warehouse
3) Database
4) File Format (default CSV)
• 100 mb file is optimum size
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
Module 5
Cloning / Time Travel
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Multi-cluster Warehouse
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
THANK YOU
© 2018 Snowflake Computing Inc. All Rights
Reserved.
© 2018 Snowflake Computing Inc. All Rights
Reserved.
USE CASES FOR SNOWFLAKE DATA SHARING
• Nielsen is a global information, data, and
measurement company
• Nielsen knows “What People Watch,
Listen To, and Buy”
• Nielsen Marketing Cloud includes eXelate
DMP which provides unified consumer
profiles
• Nielsen sells selective slices of their DMP
data available to advertisers for particular
marketing campaigns
• Nielsen plans to use data sharing for
making detailed datasets available to
subscribers
• Lower friction, lower cost solution
• Scalable operations
61© 2016 Snowflake Computing Inc. All Rights
Reserved.
CUSTOMER EXAMPLE: BLACKBOARD
Jay White
Director, Software Engineering
Scenario
Provide and perfect over 14 different data
products that help universities facilitate
learning online
Pain Points
Disparate data
Challenges integrating data
Semi-structured data
Solution
Replace existing Hadoop and RDBMS
system with Snowflake
Everything that we did left our jaw on the
table. ‘Wait – we’ve never done anything like
that.’ Or, ‘How did that just run so fast.’ We
are getting 16x performance from
Snowflake.
62© 2016 Snowflake Computing Inc. All Rights
Reserved.
A NEW DATA PIPELINE FOR BLACKBOARD
Snowflake
S
3
Student data
Mobile data
Collaborative data
Intelsuite data
Kafka
Airflow for data orchestration Looker for internal dashboards
Learning Management System
Blackboard Predict
R Prediction Engine
Unified data
Simplified data transformation
Existing tools integrate seamlessly
• 16x performance improvement over SQL
• 1 PB by the end of 2017
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Built-in disaster recover and high availabity

Zero to Snowflake Presentation

  • 1.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. August 24, 2018© Snowflake Computing Inc. All Rights Reserved ZERO TO SNOWFLAKE IN 90 MINUTES in partnership with
  • 2.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. • Download Materials @ https://tinyurl.com/yy9frfuw (Hotmail / outlook / o365) • Download Worksheets and unzip • Partner Introduction • Snowflake Introduction • Data Warehousing Today • What Users do with Snowflake • Hands on Snowflake • Data loading- sample data set is shared • Integrations • Multi-clustering • And more! • Scaling/Workload Isolation • Snowflake in the Real World- Data Sharing and Customer Example • Conclusion AGENDA
  • 3.
    © 2018DMICONFIDENTIAL&PROPRIETARY 3 DATA & ANALYTICS Snowflake SolutionIntegration Partner Creating an intelligent existence by linking physical & digital worlds to unleash the power of connectivity 〉 DATA PLATFORM SOLUTIONS 〉 MODERN DATA SOLUTIONS 〉 VISUAL SOLUTIONS 〉 ADVANCED ANALYTICS 〉 ENTERPRISE DATA STRATEGY 〉 EXECUTIVE ADVISORY 〉 ANALYTICS AS A SERVICE 〉 AGILE ANALYTICS
  • 4.
    © 2018DMICONFIDENTIAL&PROPRIETARY 4 DATA & ANALYTICS PARTNERSHIPS ***8MOST IMPORTANT AI AND ANALYTIC TRENDS OF 2019 EBOOK AND MORE INFO ABOUT DMI @ WWW.DMINC.COM/EBOOK M Today’s Moderators 〉 Brett VanderPlaats Sr. Data Architect, Data Platform & Analytics 〉 David Mellinger Practice Lead, Data Platform & Analytics
  • 5.
    1,500+ CUSTOMERS Building newanalytic applications Delivered new analytic application to pharmacies using Snowflake Moving to the cloud Using Snowflake to move data analytics to the cloud Modernizing data platforms Replaced data warehouse appliance + Hadoop with Snowflake Accelerating enterprise BI and analytics Moved from legacy data warehouse systems (appliance & cloud) to Snowflake © 2018 Snowflake Computing Inc. All Rights Reserved. 5
  • 6.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. What Are Users Doing Today?
  • 7.
    © 2018 SnowflakeComputing Inc. All Rights Reserved.
  • 8.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved What would you like in your Data Warehouse? 8 Complete SQL Database Zero Management All of Your Data All of Your Users Pay Only for What You Use Live Data Sharing
  • 9.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. What Are Users Doing with Snowflake?
  • 10.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. What Are User Doing with Snowflake?
  • 11.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. What Are User Doing with Snowflake?
  • 12.
    © 2018 SnowflakeComputing Inc. All Rights Reserved.
  • 13.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. Snowflake’s Differentiating Architecture
  • 14.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. SNOWFLAKE’S MULTI-CLUSTER, SHARED DATA ARCHITECTURE Centralized storage Instant, automatic scalability & elasticity Service Compute Storage
  • 15.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. Why Scaling Compute Saves $$$$$$
  • 16.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. How Does Snowflake Fit? ADVANCED ANALYTICS INTEGRATION TOOLS ELT Stream
  • 17.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. Scale compute and concurrency ADF
  • 18.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. Scale Up (Response Time) Queries that are: --Large, Complex, many Calculations Scale Out (Throuhput) Many users or processes concurrently querying S M L XL 2XXS 3X 4X XS XS XS XS XS XS XS XS XS L L 2X 2X2X2X 4X 4X 4X
  • 19.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. Differences from other EDW vendors
  • 20.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. Business Implications for Organizations • Native ANSI-SQL database for leveraging existing skills • Reducing expensive retraining • Interoperability with existing tools (Power BI, Tableau, and others have SF connectors) • Simplified Migration • Support/Documentation on Snowflake – Take a look at
  • 21.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. But Wait…. …There’s
  • 22.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. DEV OPS?
  • 23.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. Data Protection w/o Restore?
  • 24.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. Data Sharing (Monetization) Disaster Recovery ***Platform (AZURE AWS) in 2019
  • 25.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. Native Support for structured and semi- structured data
  • 26.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. Cross Region Data Replication
  • 27.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. Comprehensive data protection
  • 28.
    August 24, 2018©Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved BIO-BREAK THEN LET’S DIVE INTO SNOWFLAKE! https://tinyurl.com/yy9frfuw
  • 29.
    August 24, 2018©Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved Module 1 Snowflake DB/WH/Object Configuration
  • 30.
    August 24, 2018©Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved Module 2 Queries and Performance
  • 31.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. Azure Storage
  • 32.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. Performance through Caching Azure Storage
  • 33.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. Query Profiler•Processing — time spent on data processing by the CPU. •Local Disk IO — time when the processing was blocked by local disk access. •Remote Disk IO — time when the processing was blocked by remote disk access. •Network Communication — time when the processing was waiting for the network data transfer. •Synchronization — various synchronization activities between participating processes. •Initialization — time spent setting up the query processing. •IO — information about the input-output operations performed during the query: • Scan progress — the percentage of data scanned for a given table so far. • Bytes scanned — the number of bytes scanned so far. • Percentage scanned from cache — the percentage of data scanned from the local disk cache. • Bytes written — bytes written (e.g. when loading into a table). • Bytes written to result — bytes written to a result object. • Bytes read from result — bytes read from a result object. • External bytes scanned — bytes read from an external object, e.g. a stage. •DML — statistics for Data Manipulation Language (DML) queries: • Number of rows inserted — number of rows inserted into a table (or tables). • Number of rows updated — number of rows updated in a table. • Number of rows deleted — number of rows deleted from a table. • Number of rows unloaded — number of rows unloaded during data export. • Number of bytes deleted — number of bytes deleted from a table. •Pruning — information on the effects of table pruning: • Partitions scanned — number of partitions scanned so far. • Partitions total — total number of partitions in a given table. •Spilling — information about disk usage for operations where intermediate results do not fit in memory: • Bytes spilled to local storage — volume of data spilled to local disk. • Bytes spilled to remote storage — volume of data spilled to remote disk. •Network — network communication: • Bytes sent over the network — amount of data sent over the network. •EXECUTION TIME •STATISTICS
  • 34.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. Multi-cluster Warehouse
  • 35.
    © 2018 SnowflakeComputing Inc. All Rights Reserved.
  • 36.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. Micropartion s & Pruning
  • 37.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. Enterprise Grade Security
  • 38.
    August 24, 2018©Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved Module 3 Unstructured Data
  • 39.
    © 2018 SnowflakeComputing Inc. All Rights Reserved.
  • 40.
    August 24, 2018©Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved Module 4 Loading Data
  • 41.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. • 4 objects for data loading 1) Source 2) Warehouse 3) Database 4) File Format (default CSV) • 100 mb file is optimum size
  • 42.
    August 24, 2018©Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved Module 5 Cloning / Time Travel
  • 43.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. Multi-cluster Warehouse
  • 44.
    August 24, 2018©Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved THANK YOU
  • 45.
    © 2018 SnowflakeComputing Inc. All Rights Reserved.
  • 46.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. USE CASES FOR SNOWFLAKE DATA SHARING • Nielsen is a global information, data, and measurement company • Nielsen knows “What People Watch, Listen To, and Buy” • Nielsen Marketing Cloud includes eXelate DMP which provides unified consumer profiles • Nielsen sells selective slices of their DMP data available to advertisers for particular marketing campaigns • Nielsen plans to use data sharing for making detailed datasets available to subscribers • Lower friction, lower cost solution • Scalable operations
  • 47.
    61© 2016 SnowflakeComputing Inc. All Rights Reserved. CUSTOMER EXAMPLE: BLACKBOARD Jay White Director, Software Engineering Scenario Provide and perfect over 14 different data products that help universities facilitate learning online Pain Points Disparate data Challenges integrating data Semi-structured data Solution Replace existing Hadoop and RDBMS system with Snowflake Everything that we did left our jaw on the table. ‘Wait – we’ve never done anything like that.’ Or, ‘How did that just run so fast.’ We are getting 16x performance from Snowflake.
  • 48.
    62© 2016 SnowflakeComputing Inc. All Rights Reserved. A NEW DATA PIPELINE FOR BLACKBOARD Snowflake S 3 Student data Mobile data Collaborative data Intelsuite data Kafka Airflow for data orchestration Looker for internal dashboards Learning Management System Blackboard Predict R Prediction Engine Unified data Simplified data transformation Existing tools integrate seamlessly • 16x performance improvement over SQL • 1 PB by the end of 2017
  • 49.
    © 2018 SnowflakeComputing Inc. All Rights Reserved. Built-in disaster recover and high availabity