SlideShare a Scribd company logo
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Paul Macey
AWS Specialist Solutions Architect
Big data and Insights
3rd July 2019
Multi Source, Multi Speed
Data Consumption & Analytics on AWS
Agenda
How did it come to this?
Unintended data pipelines
Well architected data pipelines
Accelerated Data Lake
Multisource, multispeed patterns
Wrap up
How did it come to this?
Unintended data pipelines
Geospatial Girl
The Oracle
The Data Scientist
External request
Sales
Analyst
Operations
Security Guy
The Boss
Well architected data pipelines
Operational
Excellence Security Reliability
Performance
Efficiency
Cost
Optimization
Security
Day 0
Data governance
& metadata
Data centralised
& scalable
SQL & BI
ready
Analytical &
Data Science
foundation
Repeatable &
Extensible
Accelerated Data Lake
Customer example
Health Organisation
In New Zealand
Challenge :
Needed to bring disparate datasets together
Too many external files
Current State:
SQL Server 2008
Large text files
ETL (SSIS) https://www.linkedin.com/pulse/how-we-built-data-lake-less-than-4-weeks-alex-poor/
Solution :
Uses the ADL solution as a base
Extended the solution to access FTP server data
Entirely serverless
Multisource, multispeed
data and analytics patterns
Streaming
Streaming Pattern
Smart City
Camera
camera_stream
datetime
sensorId
locationId
currentTemperature
battery
objectDetected
status
camera_stream
Create SQL Schema Write SQL to query stream
Amazon Kinesis
Data Firehose
Amazon Kinesis
Data Streams
Amazon Kinesis
Data Analytics
Amazon Kinesis
Data Analytics
API Pattern - Inbound
WeatherTraffic
Amazon
CloudWatch
AWS Lambda Amazon S3
API Pattern - Outbound
Camera, weather, and traffic Insights
AWS Lambda Amazon S3Amazon
API Gateway
SFTP Pattern
+ No end-user disruption
+ Fully managed servers
+ Simple to use
+ Pay as you use
+ Native cloud integrations
AWS SFTP
Amazon S3
AWS Transfer
for SFTP
Data Stores
Data stores
CDC
AWS Database
Migration Service
* Target only
Amazon
DynamoDB*
Amazon
Kinesis
Amazon S3
Amazon RDS
Data Stores
Who is saying what about DMS and SCT?
"We migrated hundreds of our clients from our in-house data-center to Amazon RDS
Oracle 12c using the AWS Data Migration Service (DMS). Due to this service, we
could live-replicate the databases between our data-center and RDS before the
migration. That kept the migration down-time to the very minimum. We are very
happy with DMS and are planning to use it for Oracle to MySQL migration next”.
”The SCT Assessment Report was the key enabler to allow us to understand the
scope of effort required to complete an Oracle to PostgreSQL migration. What
was originally thought to be a largely manual task that no one was particularly
excited about having to do became a very straight-forward quick and easy
process."
“We are in the process of migrating some databases to Amazon Aurora. The ease
by which we can do this using the AWS Database Migration Service has
simplified this process for us and enabled us to accelerate our migration
efforts. The ability to closely monitor the process, the detailed logging feature, and
the support we received from AWS have given us a great deal of confidence in a
successful migration.”
Shared
file systems
NFS TLS
AWS storage resources
AWS DataSync
agent
AWS DataSync
Amazon S3
Amazon Elastic
File System
Region
Batch Processing Pattern
Corporate data center
Transfer data at speeds up to 10 times faster than open-source tools
AWS has services and patterns to create
YOUR organisations data goldmine
csv
json
xls
Batch
FTPStreaming
IoT
Geospatial Girl
The Oracle
Operations
External
Requests
The Data Scientist
Analyst
Sales
References
AWS Accelerated Data Lake (Git)
https://github.com/aws-samples/accelerated-data-lake
AWS Accelerated Data Lake Blog (part 1 & 2)
https://aws.amazon.com/blogs/publicsector/from-data-silos-to-data-domains-bringing-common-data-
together
https://aws.amazon.com/blogs/publicsector/securing-your-data-by-knowing-your-data
Our data lake story: How Woot.com built a serverless data lake on AWS
https://aws.amazon.com/blogs/big-data/our-data-lake-story-how-woot-com-built-a-serverless-data-lake-
on-aws
Kinesis Data Generator
https://awslabs.github.io/amazon-kinesis-data-generator/
References
IoT analytics
https://aws.amazon.com/iot-
analytics/
AWS Database Migration Service
https://aws.amazon.com/dms/
AWS Transfer for SFTP
https://aws.amazon.com/sftp/
AWS Datasync
https://aws.amazon.com/datasync/
Amazon Kinesis
https://aws.amazon.com/kinesis/
Amazon Kinesis Analytics
https://aws.amazon.com/kinesis/data-
analytics/
Available today @ GitHub
https://github.com/aws-samples/accelerated-data-lake
Includes
Data lake pipeline (CloudFormation)
Instructions
Data configuration, security and metadata templates
Delivery
Professional services
AWS partners
Accelerated Data Lake

More Related Content

What's hot

Infographic: AWS vs Azure vs GCP: What's the best cloud platform for enterprise?
Infographic: AWS vs Azure vs GCP: What's the best cloud platform for enterprise?Infographic: AWS vs Azure vs GCP: What's the best cloud platform for enterprise?
Infographic: AWS vs Azure vs GCP: What's the best cloud platform for enterprise?
Veritis Group, Inc
 
React Fast by Processing Streaming Data - AWS Summit Tel Aviv 2017
React Fast by Processing Streaming Data - AWS Summit Tel Aviv 2017React Fast by Processing Streaming Data - AWS Summit Tel Aviv 2017
React Fast by Processing Streaming Data - AWS Summit Tel Aviv 2017
Amazon Web Services
 
Cloud First: New Architecture for New Infrastructure
Cloud First: New Architecture for New InfrastructureCloud First: New Architecture for New Infrastructure
Cloud First: New Architecture for New Infrastructure
Amazon Web Services
 
Windows and .NET on AWS
Windows and .NET on AWSWindows and .NET on AWS
Windows and .NET on AWS
Amazon Web Services
 
AWS vs AZURE : Public Cloud Comparison
AWS vs AZURE : Public Cloud ComparisonAWS vs AZURE : Public Cloud Comparison
AWS vs AZURE : Public Cloud Comparison
InApp
 
Amazon Lightsail: Jumpstart Your Cloud Project for a Low, Predictable Price.
Amazon Lightsail: Jumpstart Your Cloud Project for a Low, Predictable Price. Amazon Lightsail: Jumpstart Your Cloud Project for a Low, Predictable Price.
Amazon Lightsail: Jumpstart Your Cloud Project for a Low, Predictable Price.
Amazon Web Services
 
Werner Vogels
Werner Vogels Werner Vogels
Werner Vogels
Amazon Web Services
 
DevOps für mittlere Unternehmen und Großunternehmen - AWS Cloud Web Day für M...
DevOps für mittlere Unternehmen und Großunternehmen - AWS Cloud Web Day für M...DevOps für mittlere Unternehmen und Großunternehmen - AWS Cloud Web Day für M...
DevOps für mittlere Unternehmen und Großunternehmen - AWS Cloud Web Day für M...
AWS Germany
 
AWS Innovate Montreal Keynote - by Chris Munns
AWS Innovate Montreal Keynote - by Chris MunnsAWS Innovate Montreal Keynote - by Chris Munns
AWS Innovate Montreal Keynote - by Chris Munns
Amazon Web Services
 
Architecting for Scale using Microservices - AWS Summit Tel Aviv 2017
Architecting for Scale using Microservices - AWS Summit Tel Aviv 2017Architecting for Scale using Microservices - AWS Summit Tel Aviv 2017
Architecting for Scale using Microservices - AWS Summit Tel Aviv 2017
Amazon Web Services
 
Optimize Cost Efficiency on AWS
Optimize Cost Efficiency on AWSOptimize Cost Efficiency on AWS
Optimize Cost Efficiency on AWS
Amazon Web Services
 
Cloud comparison - AWS vs Azure vs Google
Cloud comparison - AWS vs Azure vs GoogleCloud comparison - AWS vs Azure vs Google
Cloud comparison - AWS vs Azure vs Google
Patrick Pierson
 
Amazon.com migrating internal it apps to AWS - AWS Enterprise Tour - SF - 2010
Amazon.com migrating internal it apps to AWS - AWS Enterprise Tour - SF - 2010Amazon.com migrating internal it apps to AWS - AWS Enterprise Tour - SF - 2010
Amazon.com migrating internal it apps to AWS - AWS Enterprise Tour - SF - 2010
Amazon Web Services
 
Casi reali di Mass Migration nel Cloud: benefici tangibili ed intangibili
Casi reali di Mass Migration nel Cloud: benefici tangibili ed intangibiliCasi reali di Mass Migration nel Cloud: benefici tangibili ed intangibili
Casi reali di Mass Migration nel Cloud: benefici tangibili ed intangibili
Amazon Web Services
 
How to Secure your Hybrid Enviroment - Pop-up Loft Tel Aviv
How to Secure your Hybrid Enviroment - Pop-up Loft Tel AvivHow to Secure your Hybrid Enviroment - Pop-up Loft Tel Aviv
How to Secure your Hybrid Enviroment - Pop-up Loft Tel Aviv
Amazon Web Services
 
Connecting the Unconnected using AWS IoT - AWS Summit Tel Aviv 2017
Connecting the Unconnected using AWS IoT - AWS Summit Tel Aviv 2017Connecting the Unconnected using AWS IoT - AWS Summit Tel Aviv 2017
Connecting the Unconnected using AWS IoT - AWS Summit Tel Aviv 2017
Amazon Web Services
 
From Data Analysis to Smarter Apps - AWS Summit Tel Aviv 2017
From Data Analysis to Smarter Apps - AWS Summit Tel Aviv 2017From Data Analysis to Smarter Apps - AWS Summit Tel Aviv 2017
From Data Analysis to Smarter Apps - AWS Summit Tel Aviv 2017
Amazon Web Services
 
Various Cloud offerings AWS/AZURE/GCP
Various Cloud offerings AWS/AZURE/GCPVarious Cloud offerings AWS/AZURE/GCP
Various Cloud offerings AWS/AZURE/GCP
Mohammad Imran Ansari
 
IoT at the Edge: Greengrass and More!
IoT at the Edge: Greengrass and More!IoT at the Edge: Greengrass and More!
IoT at the Edge: Greengrass and More!
Amazon Web Services
 
Comparison of AWS, GCP & Azure web solutions
Comparison of AWS, GCP & Azure web solutionsComparison of AWS, GCP & Azure web solutions
Comparison of AWS, GCP & Azure web solutions
basit raza
 

What's hot (20)

Infographic: AWS vs Azure vs GCP: What's the best cloud platform for enterprise?
Infographic: AWS vs Azure vs GCP: What's the best cloud platform for enterprise?Infographic: AWS vs Azure vs GCP: What's the best cloud platform for enterprise?
Infographic: AWS vs Azure vs GCP: What's the best cloud platform for enterprise?
 
React Fast by Processing Streaming Data - AWS Summit Tel Aviv 2017
React Fast by Processing Streaming Data - AWS Summit Tel Aviv 2017React Fast by Processing Streaming Data - AWS Summit Tel Aviv 2017
React Fast by Processing Streaming Data - AWS Summit Tel Aviv 2017
 
Cloud First: New Architecture for New Infrastructure
Cloud First: New Architecture for New InfrastructureCloud First: New Architecture for New Infrastructure
Cloud First: New Architecture for New Infrastructure
 
Windows and .NET on AWS
Windows and .NET on AWSWindows and .NET on AWS
Windows and .NET on AWS
 
AWS vs AZURE : Public Cloud Comparison
AWS vs AZURE : Public Cloud ComparisonAWS vs AZURE : Public Cloud Comparison
AWS vs AZURE : Public Cloud Comparison
 
Amazon Lightsail: Jumpstart Your Cloud Project for a Low, Predictable Price.
Amazon Lightsail: Jumpstart Your Cloud Project for a Low, Predictable Price. Amazon Lightsail: Jumpstart Your Cloud Project for a Low, Predictable Price.
Amazon Lightsail: Jumpstart Your Cloud Project for a Low, Predictable Price.
 
Werner Vogels
Werner Vogels Werner Vogels
Werner Vogels
 
DevOps für mittlere Unternehmen und Großunternehmen - AWS Cloud Web Day für M...
DevOps für mittlere Unternehmen und Großunternehmen - AWS Cloud Web Day für M...DevOps für mittlere Unternehmen und Großunternehmen - AWS Cloud Web Day für M...
DevOps für mittlere Unternehmen und Großunternehmen - AWS Cloud Web Day für M...
 
AWS Innovate Montreal Keynote - by Chris Munns
AWS Innovate Montreal Keynote - by Chris MunnsAWS Innovate Montreal Keynote - by Chris Munns
AWS Innovate Montreal Keynote - by Chris Munns
 
Architecting for Scale using Microservices - AWS Summit Tel Aviv 2017
Architecting for Scale using Microservices - AWS Summit Tel Aviv 2017Architecting for Scale using Microservices - AWS Summit Tel Aviv 2017
Architecting for Scale using Microservices - AWS Summit Tel Aviv 2017
 
Optimize Cost Efficiency on AWS
Optimize Cost Efficiency on AWSOptimize Cost Efficiency on AWS
Optimize Cost Efficiency on AWS
 
Cloud comparison - AWS vs Azure vs Google
Cloud comparison - AWS vs Azure vs GoogleCloud comparison - AWS vs Azure vs Google
Cloud comparison - AWS vs Azure vs Google
 
Amazon.com migrating internal it apps to AWS - AWS Enterprise Tour - SF - 2010
Amazon.com migrating internal it apps to AWS - AWS Enterprise Tour - SF - 2010Amazon.com migrating internal it apps to AWS - AWS Enterprise Tour - SF - 2010
Amazon.com migrating internal it apps to AWS - AWS Enterprise Tour - SF - 2010
 
Casi reali di Mass Migration nel Cloud: benefici tangibili ed intangibili
Casi reali di Mass Migration nel Cloud: benefici tangibili ed intangibiliCasi reali di Mass Migration nel Cloud: benefici tangibili ed intangibili
Casi reali di Mass Migration nel Cloud: benefici tangibili ed intangibili
 
How to Secure your Hybrid Enviroment - Pop-up Loft Tel Aviv
How to Secure your Hybrid Enviroment - Pop-up Loft Tel AvivHow to Secure your Hybrid Enviroment - Pop-up Loft Tel Aviv
How to Secure your Hybrid Enviroment - Pop-up Loft Tel Aviv
 
Connecting the Unconnected using AWS IoT - AWS Summit Tel Aviv 2017
Connecting the Unconnected using AWS IoT - AWS Summit Tel Aviv 2017Connecting the Unconnected using AWS IoT - AWS Summit Tel Aviv 2017
Connecting the Unconnected using AWS IoT - AWS Summit Tel Aviv 2017
 
From Data Analysis to Smarter Apps - AWS Summit Tel Aviv 2017
From Data Analysis to Smarter Apps - AWS Summit Tel Aviv 2017From Data Analysis to Smarter Apps - AWS Summit Tel Aviv 2017
From Data Analysis to Smarter Apps - AWS Summit Tel Aviv 2017
 
Various Cloud offerings AWS/AZURE/GCP
Various Cloud offerings AWS/AZURE/GCPVarious Cloud offerings AWS/AZURE/GCP
Various Cloud offerings AWS/AZURE/GCP
 
IoT at the Edge: Greengrass and More!
IoT at the Edge: Greengrass and More!IoT at the Edge: Greengrass and More!
IoT at the Edge: Greengrass and More!
 
Comparison of AWS, GCP & Azure web solutions
Comparison of AWS, GCP & Azure web solutionsComparison of AWS, GCP & Azure web solutions
Comparison of AWS, GCP & Azure web solutions
 

Similar to Multi-Source, Multi-Speed Data Consumption & Analytics on AWS

Building your First Big Data Application on AWS
Building your First Big Data Application on AWSBuilding your First Big Data Application on AWS
Building your First Big Data Application on AWS
Amazon Web Services
 
AWS Big Data Solution Days
AWS Big Data Solution DaysAWS Big Data Solution Days
AWS Big Data Solution Days
Amazon Web Services
 
Innovation Track AWS Cloud Experience Argentina - Data Lakes & Analytics en AWS
Innovation Track AWS Cloud Experience Argentina - Data Lakes & Analytics en AWS Innovation Track AWS Cloud Experience Argentina - Data Lakes & Analytics en AWS
Innovation Track AWS Cloud Experience Argentina - Data Lakes & Analytics en AWS
Amazon Web Services LATAM
 
Delivering business insights and automation utilizing aws data services
Delivering business insights and automation utilizing aws data servicesDelivering business insights and automation utilizing aws data services
Delivering business insights and automation utilizing aws data services
Bhuvaneshwaran R
 
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Amazon Web Services
 
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Amazon Web Services
 
ABD202_Best Practices for Building Serverless Big Data Applications
ABD202_Best Practices for Building Serverless Big Data ApplicationsABD202_Best Practices for Building Serverless Big Data Applications
ABD202_Best Practices for Building Serverless Big Data Applications
Amazon Web Services
 
Building your Datalake on AWS
Building your Datalake on AWSBuilding your Datalake on AWS
Building your Datalake on AWS
Amazon Web Services
 
AWS Big Data Platform
AWS Big Data PlatformAWS Big Data Platform
AWS Big Data Platform
Amazon Web Services
 
Using Data Lakes: Data Analytics Week SF
Using Data Lakes: Data Analytics Week SFUsing Data Lakes: Data Analytics Week SF
Using Data Lakes: Data Analytics Week SF
Amazon Web Services
 
AWS Storage State of the Union
AWS Storage State of the UnionAWS Storage State of the Union
AWS Storage State of the Union
Amazon Web Services
 
Track 6 Session 1_進入 AI 領域的第一步驟_資料平台的建置.pptx
Track 6 Session 1_進入 AI 領域的第一步驟_資料平台的建置.pptxTrack 6 Session 1_進入 AI 領域的第一步驟_資料平台的建置.pptx
Track 6 Session 1_進入 AI 領域的第一步驟_資料平台的建置.pptx
Amazon Web Services
 
Fast Track to Your Data Lake on AWS
Fast Track to Your Data Lake on AWSFast Track to Your Data Lake on AWS
Fast Track to Your Data Lake on AWS
Amazon Web Services
 
Using Data Lakes
Using Data Lakes Using Data Lakes
Using Data Lakes
Amazon Web Services
 
Azure Data.pptx
Azure Data.pptxAzure Data.pptx
Azure Data.pptx
FedoRam1
 
Understanding AWS Managed Databases and Analytic Services - AWS Innovate Otta...
Understanding AWS Managed Databases and Analytic Services - AWS Innovate Otta...Understanding AWS Managed Databases and Analytic Services - AWS Innovate Otta...
Understanding AWS Managed Databases and Analytic Services - AWS Innovate Otta...
Amazon Web Services
 
Building Your Data Lake on AWS - Level 200
Building Your Data Lake on AWS - Level 200Building Your Data Lake on AWS - Level 200
Building Your Data Lake on AWS - Level 200
Amazon Web Services
 
Finding Meaning in the Noise: Understanding Big Data with AWS Analytics
Finding Meaning in the Noise: Understanding Big Data with AWS AnalyticsFinding Meaning in the Noise: Understanding Big Data with AWS Analytics
Finding Meaning in the Noise: Understanding Big Data with AWS Analytics
Amazon Web Services
 
BDA306 Building a Modern Data Warehouse: Deep Dive on Amazon Redshift
BDA306 Building a Modern Data Warehouse: Deep Dive on Amazon RedshiftBDA306 Building a Modern Data Warehouse: Deep Dive on Amazon Redshift
BDA306 Building a Modern Data Warehouse: Deep Dive on Amazon Redshift
Amazon Web Services
 
Building a Data Lake on AWS
Building a Data Lake on AWSBuilding a Data Lake on AWS
Building a Data Lake on AWS
Amazon Web Services
 

Similar to Multi-Source, Multi-Speed Data Consumption & Analytics on AWS (20)

Building your First Big Data Application on AWS
Building your First Big Data Application on AWSBuilding your First Big Data Application on AWS
Building your First Big Data Application on AWS
 
AWS Big Data Solution Days
AWS Big Data Solution DaysAWS Big Data Solution Days
AWS Big Data Solution Days
 
Innovation Track AWS Cloud Experience Argentina - Data Lakes & Analytics en AWS
Innovation Track AWS Cloud Experience Argentina - Data Lakes & Analytics en AWS Innovation Track AWS Cloud Experience Argentina - Data Lakes & Analytics en AWS
Innovation Track AWS Cloud Experience Argentina - Data Lakes & Analytics en AWS
 
Delivering business insights and automation utilizing aws data services
Delivering business insights and automation utilizing aws data servicesDelivering business insights and automation utilizing aws data services
Delivering business insights and automation utilizing aws data services
 
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
 
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
 
ABD202_Best Practices for Building Serverless Big Data Applications
ABD202_Best Practices for Building Serverless Big Data ApplicationsABD202_Best Practices for Building Serverless Big Data Applications
ABD202_Best Practices for Building Serverless Big Data Applications
 
Building your Datalake on AWS
Building your Datalake on AWSBuilding your Datalake on AWS
Building your Datalake on AWS
 
AWS Big Data Platform
AWS Big Data PlatformAWS Big Data Platform
AWS Big Data Platform
 
Using Data Lakes: Data Analytics Week SF
Using Data Lakes: Data Analytics Week SFUsing Data Lakes: Data Analytics Week SF
Using Data Lakes: Data Analytics Week SF
 
AWS Storage State of the Union
AWS Storage State of the UnionAWS Storage State of the Union
AWS Storage State of the Union
 
Track 6 Session 1_進入 AI 領域的第一步驟_資料平台的建置.pptx
Track 6 Session 1_進入 AI 領域的第一步驟_資料平台的建置.pptxTrack 6 Session 1_進入 AI 領域的第一步驟_資料平台的建置.pptx
Track 6 Session 1_進入 AI 領域的第一步驟_資料平台的建置.pptx
 
Fast Track to Your Data Lake on AWS
Fast Track to Your Data Lake on AWSFast Track to Your Data Lake on AWS
Fast Track to Your Data Lake on AWS
 
Using Data Lakes
Using Data Lakes Using Data Lakes
Using Data Lakes
 
Azure Data.pptx
Azure Data.pptxAzure Data.pptx
Azure Data.pptx
 
Understanding AWS Managed Databases and Analytic Services - AWS Innovate Otta...
Understanding AWS Managed Databases and Analytic Services - AWS Innovate Otta...Understanding AWS Managed Databases and Analytic Services - AWS Innovate Otta...
Understanding AWS Managed Databases and Analytic Services - AWS Innovate Otta...
 
Building Your Data Lake on AWS - Level 200
Building Your Data Lake on AWS - Level 200Building Your Data Lake on AWS - Level 200
Building Your Data Lake on AWS - Level 200
 
Finding Meaning in the Noise: Understanding Big Data with AWS Analytics
Finding Meaning in the Noise: Understanding Big Data with AWS AnalyticsFinding Meaning in the Noise: Understanding Big Data with AWS Analytics
Finding Meaning in the Noise: Understanding Big Data with AWS Analytics
 
BDA306 Building a Modern Data Warehouse: Deep Dive on Amazon Redshift
BDA306 Building a Modern Data Warehouse: Deep Dive on Amazon RedshiftBDA306 Building a Modern Data Warehouse: Deep Dive on Amazon Redshift
BDA306 Building a Modern Data Warehouse: Deep Dive on Amazon Redshift
 
Building a Data Lake on AWS
Building a Data Lake on AWSBuilding a Data Lake on AWS
Building a Data Lake on AWS
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
Amazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
Amazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
Amazon Web Services
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Amazon Web Services
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
Amazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
Amazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Amazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
Amazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Amazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
Amazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
Amazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
Amazon Web Services
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
Amazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
Amazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Multi-Source, Multi-Speed Data Consumption & Analytics on AWS

  • 1. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Paul Macey AWS Specialist Solutions Architect Big data and Insights 3rd July 2019 Multi Source, Multi Speed Data Consumption & Analytics on AWS
  • 2. Agenda How did it come to this? Unintended data pipelines Well architected data pipelines Accelerated Data Lake Multisource, multispeed patterns Wrap up
  • 3. How did it come to this?
  • 4. Unintended data pipelines Geospatial Girl The Oracle The Data Scientist External request Sales Analyst Operations Security Guy The Boss
  • 5. Well architected data pipelines Operational Excellence Security Reliability Performance Efficiency Cost Optimization
  • 6. Security Day 0 Data governance & metadata Data centralised & scalable SQL & BI ready Analytical & Data Science foundation Repeatable & Extensible Accelerated Data Lake
  • 7. Customer example Health Organisation In New Zealand Challenge : Needed to bring disparate datasets together Too many external files Current State: SQL Server 2008 Large text files ETL (SSIS) https://www.linkedin.com/pulse/how-we-built-data-lake-less-than-4-weeks-alex-poor/ Solution : Uses the ADL solution as a base Extended the solution to access FTP server data Entirely serverless
  • 8. Multisource, multispeed data and analytics patterns
  • 10. Streaming Pattern Smart City Camera camera_stream datetime sensorId locationId currentTemperature battery objectDetected status camera_stream Create SQL Schema Write SQL to query stream Amazon Kinesis Data Firehose Amazon Kinesis Data Streams Amazon Kinesis Data Analytics Amazon Kinesis Data Analytics
  • 11. API Pattern - Inbound WeatherTraffic Amazon CloudWatch AWS Lambda Amazon S3
  • 12. API Pattern - Outbound Camera, weather, and traffic Insights AWS Lambda Amazon S3Amazon API Gateway
  • 13. SFTP Pattern + No end-user disruption + Fully managed servers + Simple to use + Pay as you use + Native cloud integrations AWS SFTP Amazon S3 AWS Transfer for SFTP
  • 14. Data Stores Data stores CDC AWS Database Migration Service * Target only Amazon DynamoDB* Amazon Kinesis Amazon S3 Amazon RDS
  • 16. Who is saying what about DMS and SCT? "We migrated hundreds of our clients from our in-house data-center to Amazon RDS Oracle 12c using the AWS Data Migration Service (DMS). Due to this service, we could live-replicate the databases between our data-center and RDS before the migration. That kept the migration down-time to the very minimum. We are very happy with DMS and are planning to use it for Oracle to MySQL migration next”. ”The SCT Assessment Report was the key enabler to allow us to understand the scope of effort required to complete an Oracle to PostgreSQL migration. What was originally thought to be a largely manual task that no one was particularly excited about having to do became a very straight-forward quick and easy process." “We are in the process of migrating some databases to Amazon Aurora. The ease by which we can do this using the AWS Database Migration Service has simplified this process for us and enabled us to accelerate our migration efforts. The ability to closely monitor the process, the detailed logging feature, and the support we received from AWS have given us a great deal of confidence in a successful migration.”
  • 17. Shared file systems NFS TLS AWS storage resources AWS DataSync agent AWS DataSync Amazon S3 Amazon Elastic File System Region Batch Processing Pattern Corporate data center Transfer data at speeds up to 10 times faster than open-source tools
  • 18. AWS has services and patterns to create YOUR organisations data goldmine csv json xls Batch FTPStreaming IoT Geospatial Girl The Oracle Operations External Requests The Data Scientist Analyst Sales
  • 19. References AWS Accelerated Data Lake (Git) https://github.com/aws-samples/accelerated-data-lake AWS Accelerated Data Lake Blog (part 1 & 2) https://aws.amazon.com/blogs/publicsector/from-data-silos-to-data-domains-bringing-common-data- together https://aws.amazon.com/blogs/publicsector/securing-your-data-by-knowing-your-data Our data lake story: How Woot.com built a serverless data lake on AWS https://aws.amazon.com/blogs/big-data/our-data-lake-story-how-woot-com-built-a-serverless-data-lake- on-aws Kinesis Data Generator https://awslabs.github.io/amazon-kinesis-data-generator/
  • 20. References IoT analytics https://aws.amazon.com/iot- analytics/ AWS Database Migration Service https://aws.amazon.com/dms/ AWS Transfer for SFTP https://aws.amazon.com/sftp/ AWS Datasync https://aws.amazon.com/datasync/ Amazon Kinesis https://aws.amazon.com/kinesis/ Amazon Kinesis Analytics https://aws.amazon.com/kinesis/data- analytics/
  • 21. Available today @ GitHub https://github.com/aws-samples/accelerated-data-lake Includes Data lake pipeline (CloudFormation) Instructions Data configuration, security and metadata templates Delivery Professional services AWS partners Accelerated Data Lake