SlideShare a Scribd company logo
1 of 22
1
www.matillion.com
© 2017 Matillion. All rights reserved.
Presented by:
Copyright © 2017. All rights reserved. Matillion, trademarks, registered trademarks
or service marks are property of their respective owners. 10/27/2017
Spectrum Webinar Series
Part 3: Accessing your Data Lake assets from Amazon Redshift Spectrum
James Johnson, Ian Funnell and Greg Khairallah
2
www.matillion.com
© 2017 Matillion. All rights reserved.
• Launch Redshift on demand
• Presentation from Greg Khairallah of AWS
• Recap on Amazon Redshift Spectrum
• Recap of Matillion's Spectrum features
• Demonstration, including Redshift on demand
Introduction
3
www.matillion.com
© 2017 Matillion. All rights reserved.
Launch On Demand Redshift
4
www.matillion.com
© 2017 Matillion. All rights reserved.
State of Data Warehousing
Data Warehousing Challenges Today
Exponential Data Growth Varying Data Types Need Data Analyzed Faster
5
www.matillion.com
© 2017 Matillion. All rights reserved.
Amazon Redshift is used for mission-critical workloads
Financial and
management reporting
Payments to suppliers
and billing workflows
Web/Mobile clickstream
and event analysis
Recommendation and
predictive analytics
6
www.matillion.com
© 2017 Matillion. All rights reserved.
The Forrester Wave™: Big Data Warehouse, Q2 2017
The Forrester Wave™ is copyrighted
by Forrester Research, Inc. Forrester
and Forrester Wave™ are trademarks
of Forrester Research, Inc. The
Forrester Wave™ is a graphical
representation of Forrester's call on a
market and is plotted using a detailed
spreadsheet with exposed scores,
weightings, and comments. Forrester
does not endorse any vendor, product,
or service depicted in the Forrester
Wave. Information is based on best
available resources. Opinions reflect
judgment at the time and are subject to
change.
7
www.matillion.com
© 2017 Matillion. All rights reserved.
Benefits of Using Amazon Redshift
Amazon Redshift is a fast, fully managed, petabyte-scale data warehouse that makes it
simple and cost-effective to analyze all your data using your existing business intelligence
tools. Amazon Redshift also includes Redshift Spectrum, allowing you to directly query
exabytes of unstructured data in Amazon S3.
Amazon Redshift is:
Fast Simple Elastic Secure Compatible Low Cost
8
www.matillion.com
© 2017 Matillion. All rights reserved.
Amazon Redshift
• Fully managed, petabyte-scale data warehousing
service
• High performance SQL, Massively Parallel Processing
• Fully managed with SSD and HDD platforms. Built-in
end to end security, including customer-managed keys
• Data automatically backed up to Amazon S3 with
cross region backup capability for global disaster
recovery
• $1,000/TB/Year; start at $0.25/hour
Query tool
Amazon
Redshift
9
www.matillion.com
© 2017 Matillion. All rights reserved.
Paradigm Shift Enabled by Redshift Spectrum
10
www.matillion.com
© 2017 Matillion. All rights reserved.
Paradigm Shift Enabled by Redshift Spectrum
• Extends Redshift to query Exabyte-scale
data lakes in S3
• Intelligent query optimizer
• Elastic scale-out fleet to process S3 data
• Directly query data in open formats
• Multiple Redshift clusters against same data
• Pay for just the queries you run ($5/TB scanned)
Query tool
Amazon
Redshift
...
1 2 3 4
Spectrum
n
S3
11
www.matillion.com
© 2017 Matillion. All rights reserved.
Let's build an analytic query - #1
• An author is releasing the 8th book in her
popular series. How many should we order
for Seattle? What were prior first few day
sales?
• Let’s get the prior books she’s written
• 1 Table
• 2 Filters
• SELECT
• P.ASIN,
• P.TITLE
• FROM
• products P
• WHERE
• P.TITLE LIKE ‘%POTTER%’ AND
• P.AUTHOR = ‘J. K. Rowling’
12
www.matillion.com
© 2017 Matillion. All rights reserved.
Let's build an analytic query - #2
• An author is releasing the 8th book in her
popular series. How many should we order
for Seattle? What were prior first few day
sales?
• Let's compute the sales of the prior books
she’s written in this series and return the top
20 values
• SELECT
• P.ASIN,
• P.TITLE,
• SUM(D.QUANTITY *
D.OUR_PRICE) AS SALES_sum
• FROM
• s3.d_customer_order_item_det
ails D,
• products P
• WHERE
• D.ASIN = P.ASIN AND
• P.TITLE LIKE '%Potter%' AND
• P.AUTHOR = 'J. K. Rowling'
AND
• GROUP BY P.ASIN, P.TITLE
• ORDER BY SALES_sum DESC
• LIMIT 20;
• 2 Tables (1 S3, 1 local)
• 2 Filters
• 1 Join
• 2 Group By columns
• 1 Order By
• 1 Limit
• 1 Aggregation
13
www.matillion.com
© 2017 Matillion. All rights reserved.
Let's build an analytic query - #3
• An author is releasing the 8th book in her
popular series. How many should we order
for Seattle? What were prior first few day
sales?
• Let's compute the sales of the prior books
she’s written in this series and return the top
20 values, just for the first three days of sales
of first editions
• SELECT
• P.ASIN,
• P.TITLE,
• P.RELEASE_DATE,
• SUM(D.QUANTITY * D.OUR_PRICE) AS
SALES_sum
• FROM
• s3.d_customer_order_item_details D,
• asin_attributes A,
• products P
• WHERE
• D.ASIN = P.ASIN AND
• P.ASIN = A.ASIN AND
• A.EDITION LIKE '%FIRST%' AND
• P.TITLE LIKE '%Potter%' AND
• P.AUTHOR = 'J. K. Rowling' AND
• D.ORDER_DAY :: DATE >=
P.RELEASE_DATE AND
• D.ORDER_DAY :: DATE < dateadd(day,
3, P.RELEASE_DATE)
• GROUP BY P.ASIN, P.TITLE, P.RELEASE_DATE
• ORDER BY SALES_sum DESC
• LIMIT 20;
• 3 Tables (1 S3, 2 local)
• 5 Filters
• 2 Joins
• 3 Group By columns
• 1 Order By
• 1 Limit
• 1 Aggregation
• 1 Function
• 2 Casts
14
www.matillion.com
© 2017 Matillion. All rights reserved.
Let's build an analytic query - #4
• An author is releasing the 8th book in her
popular series. How many should we
order for Seattle? What were prior first
few day sales?
• Let's compute the sales of the prior
books she’s written in this series and
return the top 20 values, just for the first
three days of sales of first editions in the
city of Seattle, WA, USA
• 4 Tables (1 S3, 3 local)
• 8 Filters
• 3 Joins
• 4 Group By columns
• 1 Order By
• 1 Limit
• 1 Aggregation
• 1 Function
• 2 Casts
• SELECT
• P.ASIN,
• P.TITLE,
• R.POSTAL_CODE,
• P.RELEASE_DATE,
• SUM(D.QUANTITY * D.OUR_PRICE) AS SALES_sum
• FROM
• s3.d_customer_order_item_details D,
• asin_attributes A,
• products P,
• regions R
• WHERE
• D.ASIN = P.ASIN AND
• P.ASIN = A.ASIN AND
• D.REGION_ID = R.REGION_ID AND
• A.EDITION LIKE '%FIRST%' AND
• P.TITLE LIKE '%Potter%' AND
• P.AUTHOR = 'J. K. Rowling' AND
• R.COUNTRY_CODE = ‘US’ AND
• R.CITY = ‘Seattle’ AND
• R.STATE = ‘WA’ AND
• D.ORDER_DAY :: DATE >= P.RELEASE_DATE AND
• D.ORDER_DAY :: DATE < dateadd(day, 3,
P.RELEASE_DATE)
• GROUP BY P.ASIN, P.TITLE, R.POSTAL_CODE, P.RELEASE_DATE
• ORDER BY SALES_sum DESC
• LIMIT 20;
15
www.matillion.com
© 2017 Matillion. All rights reserved.
Now let’s run that query over an exabyte of data in S3
Roughly 140 TB of customer item order detail records for
each day over past 20 years.
190 million files across 15,000 partitions in S3. One partition
per day for USA and rest of world.
Need a billion-fold reduction in data processed.
Running this query using a 1000 node Hive cluster would
take over 5 years.*
• Compression ……………..….……..5X
• Columnar file format……….......…10X
• Scanning with 2500 nodes…....2500X
• Static partition elimination…............2X
• Dynamic partition elimination..….350X
• Redshift’s query optimizer……......40X
---------------------------------------------------
Total reduction……….…………3.5B X
* Estimated using 20 node Hive cluster & 1.4TB, assume linear
* Query used a 20 node DC1.8XLarge Amazon Redshift cluster
* Not actual sales data - generated for this demo based on data format
used by Amazon Retail.
16
www.matillion.com
© 2017 Matillion. All rights reserved.
Accelerate Migrations from Legacy Systems
“AWS Database Migration Service is the most
impressive migration service we’ve seen.” – Gartner
Amazon Redshift
Migrate
Over 1,000 unique
migrations to Amazon
Redshift using DMS
17
www.matillion.com
© 2017 Matillion. All rights reserved.
Recap on Amazon Redshift Spectrum
Catalog
JDBC/ODBC Client Application
Leader Node
Node 1 Node nNode 1
Spectrum Spectrum Spectrum Spectrum
S3
Independent Scaling
Redshift Scaling
18
www.matillion.com
© 2017 Matillion. All rights reserved.
• No “load” into Redshift
• Independently-managed infrastructure
• Many formats
• Ordinary Redshift SQL
• Predicates and Aggregates are pushed down into Spectrum (but not Joins)
• You can visualize external data using e.g. Tableau
• Partitions are S3 prefixes
Recap on Amazon Redshift Spectrum
19
www.matillion.com
© 2017 Matillion. All rights reserved.
• Other tools  S3 + Athena  Spectrum/Redshift
• Redshift on demand
• Athena
• Predicates, Aggregates, Joins
• Unload
Demonstration, including Redshift on demand
20
www.matillion.com
© 2017 Matillion. All rights reserved.
Missed parts 1 and 2 of our Spectrum Series?
• September 20, 11AM EST: Getting started with Amazon Redshift Spectrum
 View an on-demand recording now (link in resource list)
• October 4, 11AM EST: Using Amazon Redshift Spectrum from Matillion ETL
 View an on-demand recording now (link in resource list)
 EST: Using Amazon Redshift Spectrum from Matillion ETL
• October 18, 11AM EST: Accessing your Data Lake assets from Amazon Redshift Spectrum
21
www.matillion.com
© 2017 Matillion. All rights reserved.
22
www.matillion.com
© 2017 Matillion. All rights reserved.
Presented by:
Copyright © 2017. All rights reserved. Matillion, trademarks, registered trademarks
or service marks are property of their respective owners. 10/27/2017
Thank You
James Johnson, Ian Funnell and Greg Khairallah

More Related Content

What's hot

Analytics with R in SQL Server 2016
Analytics with R in SQL Server 2016Analytics with R in SQL Server 2016
Analytics with R in SQL Server 2016HARIHARAN R
 
SQL to NoSQL Best Practices with Amazon DynamoDB - AWS July 2016 Webinar Se...
SQL to NoSQL   Best Practices with Amazon DynamoDB - AWS July 2016 Webinar Se...SQL to NoSQL   Best Practices with Amazon DynamoDB - AWS July 2016 Webinar Se...
SQL to NoSQL Best Practices with Amazon DynamoDB - AWS July 2016 Webinar Se...Amazon Web Services
 
Designing Data-Intensive Applications
Designing Data-Intensive ApplicationsDesigning Data-Intensive Applications
Designing Data-Intensive ApplicationsOleg Mürk
 
How to Manage Open Police Data - Tips for Data QA/QC and Automation
How to Manage Open Police Data - Tips for Data QA/QC and AutomationHow to Manage Open Police Data - Tips for Data QA/QC and Automation
How to Manage Open Police Data - Tips for Data QA/QC and AutomationSafe Software
 
Amazon Dynamo DB for Developers (김일호) - AWS DB Day
Amazon Dynamo DB for Developers (김일호) - AWS DB DayAmazon Dynamo DB for Developers (김일호) - AWS DB Day
Amazon Dynamo DB for Developers (김일호) - AWS DB DayAmazon Web Services Korea
 
Data Warehousing in the Era of Big Data
Data Warehousing in the Era of Big DataData Warehousing in the Era of Big Data
Data Warehousing in the Era of Big DataAmazon Web Services
 
The Beauty of Mapping Big Data
The Beauty of Mapping Big DataThe Beauty of Mapping Big Data
The Beauty of Mapping Big DataStoimen Popov
 
Best Practices for Migrating Your Data Warehouse to Amazon Redshift
Best Practices for Migrating Your Data Warehouse to Amazon RedshiftBest Practices for Migrating Your Data Warehouse to Amazon Redshift
Best Practices for Migrating Your Data Warehouse to Amazon RedshiftAmazon Web Services
 
Data Warehousing with Amazon Redshift
Data Warehousing with Amazon RedshiftData Warehousing with Amazon Redshift
Data Warehousing with Amazon RedshiftAmazon Web Services
 

What's hot (20)

Analytics with R in SQL Server 2016
Analytics with R in SQL Server 2016Analytics with R in SQL Server 2016
Analytics with R in SQL Server 2016
 
SQL to NoSQL Best Practices with Amazon DynamoDB - AWS July 2016 Webinar Se...
SQL to NoSQL   Best Practices with Amazon DynamoDB - AWS July 2016 Webinar Se...SQL to NoSQL   Best Practices with Amazon DynamoDB - AWS July 2016 Webinar Se...
SQL to NoSQL Best Practices with Amazon DynamoDB - AWS July 2016 Webinar Se...
 
Designing Data-Intensive Applications
Designing Data-Intensive ApplicationsDesigning Data-Intensive Applications
Designing Data-Intensive Applications
 
Deep Dive on Amazon DynamoDB
Deep Dive on Amazon DynamoDBDeep Dive on Amazon DynamoDB
Deep Dive on Amazon DynamoDB
 
Deep Dive on Amazon DynamoDB
Deep Dive on Amazon DynamoDBDeep Dive on Amazon DynamoDB
Deep Dive on Amazon DynamoDB
 
How to Manage Open Police Data - Tips for Data QA/QC and Automation
How to Manage Open Police Data - Tips for Data QA/QC and AutomationHow to Manage Open Police Data - Tips for Data QA/QC and Automation
How to Manage Open Police Data - Tips for Data QA/QC and Automation
 
AWS DynamoDB
AWS DynamoDBAWS DynamoDB
AWS DynamoDB
 
Introduction to GraphQL
Introduction to GraphQLIntroduction to GraphQL
Introduction to GraphQL
 
DynamoDB Design Workshop
DynamoDB Design WorkshopDynamoDB Design Workshop
DynamoDB Design Workshop
 
Amazon Dynamo DB for Developers (김일호) - AWS DB Day
Amazon Dynamo DB for Developers (김일호) - AWS DB DayAmazon Dynamo DB for Developers (김일호) - AWS DB Day
Amazon Dynamo DB for Developers (김일호) - AWS DB Day
 
Loading Data into Redshift
Loading Data into RedshiftLoading Data into Redshift
Loading Data into Redshift
 
Introduction to GraphQL
Introduction to GraphQLIntroduction to GraphQL
Introduction to GraphQL
 
Amazon DynamoDB 深入探討
Amazon DynamoDB 深入探討Amazon DynamoDB 深入探討
Amazon DynamoDB 深入探討
 
ETL to RDF with Talend and AllegroGraph
ETL to RDF with Talend and AllegroGraphETL to RDF with Talend and AllegroGraph
ETL to RDF with Talend and AllegroGraph
 
Data Warehousing in the Era of Big Data
Data Warehousing in the Era of Big DataData Warehousing in the Era of Big Data
Data Warehousing in the Era of Big Data
 
The Beauty of Mapping Big Data
The Beauty of Mapping Big DataThe Beauty of Mapping Big Data
The Beauty of Mapping Big Data
 
Best Practices for Migrating Your Data Warehouse to Amazon Redshift
Best Practices for Migrating Your Data Warehouse to Amazon RedshiftBest Practices for Migrating Your Data Warehouse to Amazon Redshift
Best Practices for Migrating Your Data Warehouse to Amazon Redshift
 
Amazon Redshift Masterclass
Amazon Redshift MasterclassAmazon Redshift Masterclass
Amazon Redshift Masterclass
 
Data Warehousing with Amazon Redshift
Data Warehousing with Amazon RedshiftData Warehousing with Amazon Redshift
Data Warehousing with Amazon Redshift
 
Talend big data online training
Talend big data online trainingTalend big data online training
Talend big data online training
 

Similar to Webinar | Accessing Your Data Lake Assets from Amazon Redshift Spectrum

The Power of Big Data - AWS Summit Bahrain 2017
The Power of Big Data - AWS Summit Bahrain 2017The Power of Big Data - AWS Summit Bahrain 2017
The Power of Big Data - AWS Summit Bahrain 2017Amazon Web Services
 
BDA305 NEW LAUNCH! Intro to Amazon Redshift Spectrum: Now query exabytes of d...
BDA305 NEW LAUNCH! Intro to Amazon Redshift Spectrum: Now query exabytes of d...BDA305 NEW LAUNCH! Intro to Amazon Redshift Spectrum: Now query exabytes of d...
BDA305 NEW LAUNCH! Intro to Amazon Redshift Spectrum: Now query exabytes of d...Amazon Web Services
 
SRV405 Deep Dive Amazon Redshift & Redshift Spectrum at Cardinal Health
SRV405 Deep Dive Amazon Redshift & Redshift Spectrum at Cardinal HealthSRV405 Deep Dive Amazon Redshift & Redshift Spectrum at Cardinal Health
SRV405 Deep Dive Amazon Redshift & Redshift Spectrum at Cardinal HealthAmazon Web Services
 
Deploying your Data Warehouse on AWS
Deploying your Data Warehouse on AWSDeploying your Data Warehouse on AWS
Deploying your Data Warehouse on AWSAmazon Web Services
 
Intro to Amazon Redshift Spectrum: Quickly Query Exabytes of Data in S3 - Jun...
Intro to Amazon Redshift Spectrum: Quickly Query Exabytes of Data in S3 - Jun...Intro to Amazon Redshift Spectrum: Quickly Query Exabytes of Data in S3 - Jun...
Intro to Amazon Redshift Spectrum: Quickly Query Exabytes of Data in S3 - Jun...Amazon Web Services
 
Modernise your Data Warehouse - AWS Summit Sydney 2018
Modernise your Data Warehouse - AWS Summit Sydney 2018Modernise your Data Warehouse - AWS Summit Sydney 2018
Modernise your Data Warehouse - AWS Summit Sydney 2018Amazon Web Services
 
Success has Many Query Engines- Tel Aviv Summit 2018
Success has Many Query Engines- Tel Aviv Summit 2018Success has Many Query Engines- Tel Aviv Summit 2018
Success has Many Query Engines- Tel Aviv Summit 2018Amazon Web Services
 
Modernise your Data Warehouse with Amazon Redshift and Amazon Redshift Spectrum
Modernise your Data Warehouse with Amazon Redshift and Amazon Redshift SpectrumModernise your Data Warehouse with Amazon Redshift and Amazon Redshift Spectrum
Modernise your Data Warehouse with Amazon Redshift and Amazon Redshift SpectrumAmazon Web Services
 
Extending Analytics Beyond the Data Warehouse, ft. Warner Bros. Analytics (AN...
Extending Analytics Beyond the Data Warehouse, ft. Warner Bros. Analytics (AN...Extending Analytics Beyond the Data Warehouse, ft. Warner Bros. Analytics (AN...
Extending Analytics Beyond the Data Warehouse, ft. Warner Bros. Analytics (AN...Amazon Web Services
 
Migrating your traditional Data Warehouse to a Modern Data Lake
Migrating your traditional Data Warehouse to a Modern Data LakeMigrating your traditional Data Warehouse to a Modern Data Lake
Migrating your traditional Data Warehouse to a Modern Data LakeAmazon Web Services
 
Building a Modern Data Warehouse: Deep Dive on Amazon Redshift - SRV337 - Chi...
Building a Modern Data Warehouse: Deep Dive on Amazon Redshift - SRV337 - Chi...Building a Modern Data Warehouse: Deep Dive on Amazon Redshift - SRV337 - Chi...
Building a Modern Data Warehouse: Deep Dive on Amazon Redshift - SRV337 - Chi...Amazon Web Services
 
Building with AWS Databases: Match Your Workload to the Right Database (DAT30...
Building with AWS Databases: Match Your Workload to the Right Database (DAT30...Building with AWS Databases: Match Your Workload to the Right Database (DAT30...
Building with AWS Databases: Match Your Workload to the Right Database (DAT30...Amazon Web Services
 
How to Migrate from Cassandra to Amazon DynamoDB - AWS Online Tech Talks
How to Migrate from Cassandra to Amazon DynamoDB - AWS Online Tech TalksHow to Migrate from Cassandra to Amazon DynamoDB - AWS Online Tech Talks
How to Migrate from Cassandra to Amazon DynamoDB - AWS Online Tech TalksAmazon Web Services
 
SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
 SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ... SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...Amazon Web Services
 
Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS SummitApplying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS SummitAmazon Web Services
 
ABD327_Migrating Your Traditional Data Warehouse to a Modern Data Lake
ABD327_Migrating Your Traditional Data Warehouse to a Modern Data LakeABD327_Migrating Your Traditional Data Warehouse to a Modern Data Lake
ABD327_Migrating Your Traditional Data Warehouse to a Modern Data LakeAmazon Web Services
 
Applying AWS Purpose-Built Database Strategy - SRV307 - Toronto AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Toronto AWS SummitApplying AWS Purpose-Built Database Strategy - SRV307 - Toronto AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Toronto AWS SummitAmazon Web Services
 
Data Warehousing with Amazon Redshift
Data Warehousing with Amazon RedshiftData Warehousing with Amazon Redshift
Data Warehousing with Amazon RedshiftAmazon Web Services
 
Data Warehousing with Amazon Redshift: Data Analytics Week SF
Data Warehousing with Amazon Redshift: Data Analytics Week SFData Warehousing with Amazon Redshift: Data Analytics Week SF
Data Warehousing with Amazon Redshift: Data Analytics Week SFAmazon Web Services
 
Leadership Session: AWS Database and Analytics (DAT206-L) - AWS re:Invent 2018
Leadership Session: AWS Database and Analytics (DAT206-L) - AWS re:Invent 2018Leadership Session: AWS Database and Analytics (DAT206-L) - AWS re:Invent 2018
Leadership Session: AWS Database and Analytics (DAT206-L) - AWS re:Invent 2018Amazon Web Services
 

Similar to Webinar | Accessing Your Data Lake Assets from Amazon Redshift Spectrum (20)

The Power of Big Data - AWS Summit Bahrain 2017
The Power of Big Data - AWS Summit Bahrain 2017The Power of Big Data - AWS Summit Bahrain 2017
The Power of Big Data - AWS Summit Bahrain 2017
 
BDA305 NEW LAUNCH! Intro to Amazon Redshift Spectrum: Now query exabytes of d...
BDA305 NEW LAUNCH! Intro to Amazon Redshift Spectrum: Now query exabytes of d...BDA305 NEW LAUNCH! Intro to Amazon Redshift Spectrum: Now query exabytes of d...
BDA305 NEW LAUNCH! Intro to Amazon Redshift Spectrum: Now query exabytes of d...
 
SRV405 Deep Dive Amazon Redshift & Redshift Spectrum at Cardinal Health
SRV405 Deep Dive Amazon Redshift & Redshift Spectrum at Cardinal HealthSRV405 Deep Dive Amazon Redshift & Redshift Spectrum at Cardinal Health
SRV405 Deep Dive Amazon Redshift & Redshift Spectrum at Cardinal Health
 
Deploying your Data Warehouse on AWS
Deploying your Data Warehouse on AWSDeploying your Data Warehouse on AWS
Deploying your Data Warehouse on AWS
 
Intro to Amazon Redshift Spectrum: Quickly Query Exabytes of Data in S3 - Jun...
Intro to Amazon Redshift Spectrum: Quickly Query Exabytes of Data in S3 - Jun...Intro to Amazon Redshift Spectrum: Quickly Query Exabytes of Data in S3 - Jun...
Intro to Amazon Redshift Spectrum: Quickly Query Exabytes of Data in S3 - Jun...
 
Modernise your Data Warehouse - AWS Summit Sydney 2018
Modernise your Data Warehouse - AWS Summit Sydney 2018Modernise your Data Warehouse - AWS Summit Sydney 2018
Modernise your Data Warehouse - AWS Summit Sydney 2018
 
Success has Many Query Engines- Tel Aviv Summit 2018
Success has Many Query Engines- Tel Aviv Summit 2018Success has Many Query Engines- Tel Aviv Summit 2018
Success has Many Query Engines- Tel Aviv Summit 2018
 
Modernise your Data Warehouse with Amazon Redshift and Amazon Redshift Spectrum
Modernise your Data Warehouse with Amazon Redshift and Amazon Redshift SpectrumModernise your Data Warehouse with Amazon Redshift and Amazon Redshift Spectrum
Modernise your Data Warehouse with Amazon Redshift and Amazon Redshift Spectrum
 
Extending Analytics Beyond the Data Warehouse, ft. Warner Bros. Analytics (AN...
Extending Analytics Beyond the Data Warehouse, ft. Warner Bros. Analytics (AN...Extending Analytics Beyond the Data Warehouse, ft. Warner Bros. Analytics (AN...
Extending Analytics Beyond the Data Warehouse, ft. Warner Bros. Analytics (AN...
 
Migrating your traditional Data Warehouse to a Modern Data Lake
Migrating your traditional Data Warehouse to a Modern Data LakeMigrating your traditional Data Warehouse to a Modern Data Lake
Migrating your traditional Data Warehouse to a Modern Data Lake
 
Building a Modern Data Warehouse: Deep Dive on Amazon Redshift - SRV337 - Chi...
Building a Modern Data Warehouse: Deep Dive on Amazon Redshift - SRV337 - Chi...Building a Modern Data Warehouse: Deep Dive on Amazon Redshift - SRV337 - Chi...
Building a Modern Data Warehouse: Deep Dive on Amazon Redshift - SRV337 - Chi...
 
Building with AWS Databases: Match Your Workload to the Right Database (DAT30...
Building with AWS Databases: Match Your Workload to the Right Database (DAT30...Building with AWS Databases: Match Your Workload to the Right Database (DAT30...
Building with AWS Databases: Match Your Workload to the Right Database (DAT30...
 
How to Migrate from Cassandra to Amazon DynamoDB - AWS Online Tech Talks
How to Migrate from Cassandra to Amazon DynamoDB - AWS Online Tech TalksHow to Migrate from Cassandra to Amazon DynamoDB - AWS Online Tech Talks
How to Migrate from Cassandra to Amazon DynamoDB - AWS Online Tech Talks
 
SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
 SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ... SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
 
Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS SummitApplying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS Summit
 
ABD327_Migrating Your Traditional Data Warehouse to a Modern Data Lake
ABD327_Migrating Your Traditional Data Warehouse to a Modern Data LakeABD327_Migrating Your Traditional Data Warehouse to a Modern Data Lake
ABD327_Migrating Your Traditional Data Warehouse to a Modern Data Lake
 
Applying AWS Purpose-Built Database Strategy - SRV307 - Toronto AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Toronto AWS SummitApplying AWS Purpose-Built Database Strategy - SRV307 - Toronto AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Toronto AWS Summit
 
Data Warehousing with Amazon Redshift
Data Warehousing with Amazon RedshiftData Warehousing with Amazon Redshift
Data Warehousing with Amazon Redshift
 
Data Warehousing with Amazon Redshift: Data Analytics Week SF
Data Warehousing with Amazon Redshift: Data Analytics Week SFData Warehousing with Amazon Redshift: Data Analytics Week SF
Data Warehousing with Amazon Redshift: Data Analytics Week SF
 
Leadership Session: AWS Database and Analytics (DAT206-L) - AWS re:Invent 2018
Leadership Session: AWS Database and Analytics (DAT206-L) - AWS re:Invent 2018Leadership Session: AWS Database and Analytics (DAT206-L) - AWS re:Invent 2018
Leadership Session: AWS Database and Analytics (DAT206-L) - AWS re:Invent 2018
 

More from Matillion

Lets Talk Google BigQuery
Lets Talk Google BigQueryLets Talk Google BigQuery
Lets Talk Google BigQueryMatillion
 
Master the Multi-Clustered Data Warehouse - Snowflake
Master the Multi-Clustered Data Warehouse - SnowflakeMaster the Multi-Clustered Data Warehouse - Snowflake
Master the Multi-Clustered Data Warehouse - SnowflakeMatillion
 
ELT is Better. Here's Why.
ELT is Better. Here's Why. ELT is Better. Here's Why.
ELT is Better. Here's Why. Matillion
 
Pick a Winner: How to Choose a Data Warehouse
Pick a Winner: How to Choose a Data WarehousePick a Winner: How to Choose a Data Warehouse
Pick a Winner: How to Choose a Data WarehouseMatillion
 
Dive Into Data Lakes
Dive Into Data LakesDive Into Data Lakes
Dive Into Data LakesMatillion
 
Using ELT to load 1 Billion Rows of Data in 15 Minutes
Using ELT to load 1 Billion Rows of Data in 15 MinutesUsing ELT to load 1 Billion Rows of Data in 15 Minutes
Using ELT to load 1 Billion Rows of Data in 15 MinutesMatillion
 
Reach New Heights with Amazon Redshift
Reach New Heights with Amazon RedshiftReach New Heights with Amazon Redshift
Reach New Heights with Amazon RedshiftMatillion
 
Get Savvy with Snowflake
Get Savvy with SnowflakeGet Savvy with Snowflake
Get Savvy with SnowflakeMatillion
 
Google BigQuery Best Practices
Google BigQuery Best PracticesGoogle BigQuery Best Practices
Google BigQuery Best PracticesMatillion
 
ELT vs. ETL - How they’re different and why it matters
ELT vs. ETL - How they’re different and why it mattersELT vs. ETL - How they’re different and why it matters
ELT vs. ETL - How they’re different and why it mattersMatillion
 
How to Choose a Data Warehouse
How to Choose a Data WarehouseHow to Choose a Data Warehouse
How to Choose a Data WarehouseMatillion
 
Kickstart your data strategy for 2018: Getting started with Amazon Redshift
Kickstart your data strategy for 2018: Getting started with Amazon RedshiftKickstart your data strategy for 2018: Getting started with Amazon Redshift
Kickstart your data strategy for 2018: Getting started with Amazon RedshiftMatillion
 
Simplifying Your Journey to the Cloud: The Benefits of a Cloud-Based Data War...
Simplifying Your Journey to the Cloud: The Benefits of a Cloud-Based Data War...Simplifying Your Journey to the Cloud: The Benefits of a Cloud-Based Data War...
Simplifying Your Journey to the Cloud: The Benefits of a Cloud-Based Data War...Matillion
 
Using Google Cloud for Marketing Analytics: How the7stars, the UK’s largest i...
Using Google Cloud for Marketing Analytics: How the7stars, the UK’s largest i...Using Google Cloud for Marketing Analytics: How the7stars, the UK’s largest i...
Using Google Cloud for Marketing Analytics: How the7stars, the UK’s largest i...Matillion
 
Webinar | Getting Started With Amazon Redshift Spectrum
Webinar | Getting Started With Amazon Redshift SpectrumWebinar | Getting Started With Amazon Redshift Spectrum
Webinar | Getting Started With Amazon Redshift SpectrumMatillion
 
Getting Started With Amazon Redshift
Getting Started With Amazon Redshift Getting Started With Amazon Redshift
Getting Started With Amazon Redshift Matillion
 

More from Matillion (16)

Lets Talk Google BigQuery
Lets Talk Google BigQueryLets Talk Google BigQuery
Lets Talk Google BigQuery
 
Master the Multi-Clustered Data Warehouse - Snowflake
Master the Multi-Clustered Data Warehouse - SnowflakeMaster the Multi-Clustered Data Warehouse - Snowflake
Master the Multi-Clustered Data Warehouse - Snowflake
 
ELT is Better. Here's Why.
ELT is Better. Here's Why. ELT is Better. Here's Why.
ELT is Better. Here's Why.
 
Pick a Winner: How to Choose a Data Warehouse
Pick a Winner: How to Choose a Data WarehousePick a Winner: How to Choose a Data Warehouse
Pick a Winner: How to Choose a Data Warehouse
 
Dive Into Data Lakes
Dive Into Data LakesDive Into Data Lakes
Dive Into Data Lakes
 
Using ELT to load 1 Billion Rows of Data in 15 Minutes
Using ELT to load 1 Billion Rows of Data in 15 MinutesUsing ELT to load 1 Billion Rows of Data in 15 Minutes
Using ELT to load 1 Billion Rows of Data in 15 Minutes
 
Reach New Heights with Amazon Redshift
Reach New Heights with Amazon RedshiftReach New Heights with Amazon Redshift
Reach New Heights with Amazon Redshift
 
Get Savvy with Snowflake
Get Savvy with SnowflakeGet Savvy with Snowflake
Get Savvy with Snowflake
 
Google BigQuery Best Practices
Google BigQuery Best PracticesGoogle BigQuery Best Practices
Google BigQuery Best Practices
 
ELT vs. ETL - How they’re different and why it matters
ELT vs. ETL - How they’re different and why it mattersELT vs. ETL - How they’re different and why it matters
ELT vs. ETL - How they’re different and why it matters
 
How to Choose a Data Warehouse
How to Choose a Data WarehouseHow to Choose a Data Warehouse
How to Choose a Data Warehouse
 
Kickstart your data strategy for 2018: Getting started with Amazon Redshift
Kickstart your data strategy for 2018: Getting started with Amazon RedshiftKickstart your data strategy for 2018: Getting started with Amazon Redshift
Kickstart your data strategy for 2018: Getting started with Amazon Redshift
 
Simplifying Your Journey to the Cloud: The Benefits of a Cloud-Based Data War...
Simplifying Your Journey to the Cloud: The Benefits of a Cloud-Based Data War...Simplifying Your Journey to the Cloud: The Benefits of a Cloud-Based Data War...
Simplifying Your Journey to the Cloud: The Benefits of a Cloud-Based Data War...
 
Using Google Cloud for Marketing Analytics: How the7stars, the UK’s largest i...
Using Google Cloud for Marketing Analytics: How the7stars, the UK’s largest i...Using Google Cloud for Marketing Analytics: How the7stars, the UK’s largest i...
Using Google Cloud for Marketing Analytics: How the7stars, the UK’s largest i...
 
Webinar | Getting Started With Amazon Redshift Spectrum
Webinar | Getting Started With Amazon Redshift SpectrumWebinar | Getting Started With Amazon Redshift Spectrum
Webinar | Getting Started With Amazon Redshift Spectrum
 
Getting Started With Amazon Redshift
Getting Started With Amazon Redshift Getting Started With Amazon Redshift
Getting Started With Amazon Redshift
 

Recently uploaded

Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Andreas Granig
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideChristina Lin
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfkalichargn70th171
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...OnePlan Solutions
 
XpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsXpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsMehedi Hasan Shohan
 
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfPower Karaoke
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...Christina Lin
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
Engage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyEngage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyFrank van der Linden
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackVICTOR MAESTRE RAMIREZ
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationkaushalgiri8080
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmSujith Sukumaran
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)OPEN KNOWLEDGE GmbH
 
buds n tech IT solutions
buds n  tech IT                solutionsbuds n  tech IT                solutions
buds n tech IT solutionsmonugehlot87
 

Recently uploaded (20)

Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
 
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdfThe Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
The Essentials of Digital Experience Monitoring_ A Comprehensive Guide.pdf
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...
 
XpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsXpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software Solutions
 
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Naraina Delhi 💯Call Us 🔝8264348440🔝
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdf
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
Engage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyEngage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The Ugly
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanation
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalm
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)
 
buds n tech IT solutions
buds n  tech IT                solutionsbuds n  tech IT                solutions
buds n tech IT solutions
 

Webinar | Accessing Your Data Lake Assets from Amazon Redshift Spectrum

  • 1. 1 www.matillion.com © 2017 Matillion. All rights reserved. Presented by: Copyright © 2017. All rights reserved. Matillion, trademarks, registered trademarks or service marks are property of their respective owners. 10/27/2017 Spectrum Webinar Series Part 3: Accessing your Data Lake assets from Amazon Redshift Spectrum James Johnson, Ian Funnell and Greg Khairallah
  • 2. 2 www.matillion.com © 2017 Matillion. All rights reserved. • Launch Redshift on demand • Presentation from Greg Khairallah of AWS • Recap on Amazon Redshift Spectrum • Recap of Matillion's Spectrum features • Demonstration, including Redshift on demand Introduction
  • 3. 3 www.matillion.com © 2017 Matillion. All rights reserved. Launch On Demand Redshift
  • 4. 4 www.matillion.com © 2017 Matillion. All rights reserved. State of Data Warehousing Data Warehousing Challenges Today Exponential Data Growth Varying Data Types Need Data Analyzed Faster
  • 5. 5 www.matillion.com © 2017 Matillion. All rights reserved. Amazon Redshift is used for mission-critical workloads Financial and management reporting Payments to suppliers and billing workflows Web/Mobile clickstream and event analysis Recommendation and predictive analytics
  • 6. 6 www.matillion.com © 2017 Matillion. All rights reserved. The Forrester Wave™: Big Data Warehouse, Q2 2017 The Forrester Wave™ is copyrighted by Forrester Research, Inc. Forrester and Forrester Wave™ are trademarks of Forrester Research, Inc. The Forrester Wave™ is a graphical representation of Forrester's call on a market and is plotted using a detailed spreadsheet with exposed scores, weightings, and comments. Forrester does not endorse any vendor, product, or service depicted in the Forrester Wave. Information is based on best available resources. Opinions reflect judgment at the time and are subject to change.
  • 7. 7 www.matillion.com © 2017 Matillion. All rights reserved. Benefits of Using Amazon Redshift Amazon Redshift is a fast, fully managed, petabyte-scale data warehouse that makes it simple and cost-effective to analyze all your data using your existing business intelligence tools. Amazon Redshift also includes Redshift Spectrum, allowing you to directly query exabytes of unstructured data in Amazon S3. Amazon Redshift is: Fast Simple Elastic Secure Compatible Low Cost
  • 8. 8 www.matillion.com © 2017 Matillion. All rights reserved. Amazon Redshift • Fully managed, petabyte-scale data warehousing service • High performance SQL, Massively Parallel Processing • Fully managed with SSD and HDD platforms. Built-in end to end security, including customer-managed keys • Data automatically backed up to Amazon S3 with cross region backup capability for global disaster recovery • $1,000/TB/Year; start at $0.25/hour Query tool Amazon Redshift
  • 9. 9 www.matillion.com © 2017 Matillion. All rights reserved. Paradigm Shift Enabled by Redshift Spectrum
  • 10. 10 www.matillion.com © 2017 Matillion. All rights reserved. Paradigm Shift Enabled by Redshift Spectrum • Extends Redshift to query Exabyte-scale data lakes in S3 • Intelligent query optimizer • Elastic scale-out fleet to process S3 data • Directly query data in open formats • Multiple Redshift clusters against same data • Pay for just the queries you run ($5/TB scanned) Query tool Amazon Redshift ... 1 2 3 4 Spectrum n S3
  • 11. 11 www.matillion.com © 2017 Matillion. All rights reserved. Let's build an analytic query - #1 • An author is releasing the 8th book in her popular series. How many should we order for Seattle? What were prior first few day sales? • Let’s get the prior books she’s written • 1 Table • 2 Filters • SELECT • P.ASIN, • P.TITLE • FROM • products P • WHERE • P.TITLE LIKE ‘%POTTER%’ AND • P.AUTHOR = ‘J. K. Rowling’
  • 12. 12 www.matillion.com © 2017 Matillion. All rights reserved. Let's build an analytic query - #2 • An author is releasing the 8th book in her popular series. How many should we order for Seattle? What were prior first few day sales? • Let's compute the sales of the prior books she’s written in this series and return the top 20 values • SELECT • P.ASIN, • P.TITLE, • SUM(D.QUANTITY * D.OUR_PRICE) AS SALES_sum • FROM • s3.d_customer_order_item_det ails D, • products P • WHERE • D.ASIN = P.ASIN AND • P.TITLE LIKE '%Potter%' AND • P.AUTHOR = 'J. K. Rowling' AND • GROUP BY P.ASIN, P.TITLE • ORDER BY SALES_sum DESC • LIMIT 20; • 2 Tables (1 S3, 1 local) • 2 Filters • 1 Join • 2 Group By columns • 1 Order By • 1 Limit • 1 Aggregation
  • 13. 13 www.matillion.com © 2017 Matillion. All rights reserved. Let's build an analytic query - #3 • An author is releasing the 8th book in her popular series. How many should we order for Seattle? What were prior first few day sales? • Let's compute the sales of the prior books she’s written in this series and return the top 20 values, just for the first three days of sales of first editions • SELECT • P.ASIN, • P.TITLE, • P.RELEASE_DATE, • SUM(D.QUANTITY * D.OUR_PRICE) AS SALES_sum • FROM • s3.d_customer_order_item_details D, • asin_attributes A, • products P • WHERE • D.ASIN = P.ASIN AND • P.ASIN = A.ASIN AND • A.EDITION LIKE '%FIRST%' AND • P.TITLE LIKE '%Potter%' AND • P.AUTHOR = 'J. K. Rowling' AND • D.ORDER_DAY :: DATE >= P.RELEASE_DATE AND • D.ORDER_DAY :: DATE < dateadd(day, 3, P.RELEASE_DATE) • GROUP BY P.ASIN, P.TITLE, P.RELEASE_DATE • ORDER BY SALES_sum DESC • LIMIT 20; • 3 Tables (1 S3, 2 local) • 5 Filters • 2 Joins • 3 Group By columns • 1 Order By • 1 Limit • 1 Aggregation • 1 Function • 2 Casts
  • 14. 14 www.matillion.com © 2017 Matillion. All rights reserved. Let's build an analytic query - #4 • An author is releasing the 8th book in her popular series. How many should we order for Seattle? What were prior first few day sales? • Let's compute the sales of the prior books she’s written in this series and return the top 20 values, just for the first three days of sales of first editions in the city of Seattle, WA, USA • 4 Tables (1 S3, 3 local) • 8 Filters • 3 Joins • 4 Group By columns • 1 Order By • 1 Limit • 1 Aggregation • 1 Function • 2 Casts • SELECT • P.ASIN, • P.TITLE, • R.POSTAL_CODE, • P.RELEASE_DATE, • SUM(D.QUANTITY * D.OUR_PRICE) AS SALES_sum • FROM • s3.d_customer_order_item_details D, • asin_attributes A, • products P, • regions R • WHERE • D.ASIN = P.ASIN AND • P.ASIN = A.ASIN AND • D.REGION_ID = R.REGION_ID AND • A.EDITION LIKE '%FIRST%' AND • P.TITLE LIKE '%Potter%' AND • P.AUTHOR = 'J. K. Rowling' AND • R.COUNTRY_CODE = ‘US’ AND • R.CITY = ‘Seattle’ AND • R.STATE = ‘WA’ AND • D.ORDER_DAY :: DATE >= P.RELEASE_DATE AND • D.ORDER_DAY :: DATE < dateadd(day, 3, P.RELEASE_DATE) • GROUP BY P.ASIN, P.TITLE, R.POSTAL_CODE, P.RELEASE_DATE • ORDER BY SALES_sum DESC • LIMIT 20;
  • 15. 15 www.matillion.com © 2017 Matillion. All rights reserved. Now let’s run that query over an exabyte of data in S3 Roughly 140 TB of customer item order detail records for each day over past 20 years. 190 million files across 15,000 partitions in S3. One partition per day for USA and rest of world. Need a billion-fold reduction in data processed. Running this query using a 1000 node Hive cluster would take over 5 years.* • Compression ……………..….……..5X • Columnar file format……….......…10X • Scanning with 2500 nodes…....2500X • Static partition elimination…............2X • Dynamic partition elimination..….350X • Redshift’s query optimizer……......40X --------------------------------------------------- Total reduction……….…………3.5B X * Estimated using 20 node Hive cluster & 1.4TB, assume linear * Query used a 20 node DC1.8XLarge Amazon Redshift cluster * Not actual sales data - generated for this demo based on data format used by Amazon Retail.
  • 16. 16 www.matillion.com © 2017 Matillion. All rights reserved. Accelerate Migrations from Legacy Systems “AWS Database Migration Service is the most impressive migration service we’ve seen.” – Gartner Amazon Redshift Migrate Over 1,000 unique migrations to Amazon Redshift using DMS
  • 17. 17 www.matillion.com © 2017 Matillion. All rights reserved. Recap on Amazon Redshift Spectrum Catalog JDBC/ODBC Client Application Leader Node Node 1 Node nNode 1 Spectrum Spectrum Spectrum Spectrum S3 Independent Scaling Redshift Scaling
  • 18. 18 www.matillion.com © 2017 Matillion. All rights reserved. • No “load” into Redshift • Independently-managed infrastructure • Many formats • Ordinary Redshift SQL • Predicates and Aggregates are pushed down into Spectrum (but not Joins) • You can visualize external data using e.g. Tableau • Partitions are S3 prefixes Recap on Amazon Redshift Spectrum
  • 19. 19 www.matillion.com © 2017 Matillion. All rights reserved. • Other tools  S3 + Athena  Spectrum/Redshift • Redshift on demand • Athena • Predicates, Aggregates, Joins • Unload Demonstration, including Redshift on demand
  • 20. 20 www.matillion.com © 2017 Matillion. All rights reserved. Missed parts 1 and 2 of our Spectrum Series? • September 20, 11AM EST: Getting started with Amazon Redshift Spectrum  View an on-demand recording now (link in resource list) • October 4, 11AM EST: Using Amazon Redshift Spectrum from Matillion ETL  View an on-demand recording now (link in resource list)  EST: Using Amazon Redshift Spectrum from Matillion ETL • October 18, 11AM EST: Accessing your Data Lake assets from Amazon Redshift Spectrum
  • 22. 22 www.matillion.com © 2017 Matillion. All rights reserved. Presented by: Copyright © 2017. All rights reserved. Matillion, trademarks, registered trademarks or service marks are property of their respective owners. 10/27/2017 Thank You James Johnson, Ian Funnell and Greg Khairallah