SlideShare a Scribd company logo
1 of 20
Download to read offline
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Comprehensive Big Data Architecture Made Easy:
The AWS Marketplace Intelligent Analytical System
Luis Daniel Soto, AWS.
@luisdans
AWS Re:INVENT HANDS-ON WORKSHOP
Kim Schmidt, President & CIO/Dataleader.io
@dataleader
GPSWKS301
November 28, 2017
AWS re:INVENT
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Big Data Workshop Audience and Goals
Workshop Audience:
• APN Consulting Partners
• Organizations of all sizes
Workshop Goals:
• Show you the benefits of integrating AWS services and solutions from AWS
Marketplace
• Exercise 1: Build a data pipeline that will stream live data into Amazon
Redshift
• Exercise 2: Leverage machine learning to generate predictive analytics
• Learn how to avoid common errors when building a Big Data Architecture
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
What we’ll be doing today
1. Introduction (5 minutes)
2. Collect and Store Data (60 minutes)
• DEMO #1: Extending on-premise data load to AWS
• COLLABORATIVE TEAM EXERCISE: Building a data pipeline
3. Transform & Analyze (55 minutes)
• DEMO #2: Orchestrate, transform and aggregate data on Amazon Redshift
• COLLABORATIVE TEAM EXERCISE: Predictive Analytics with Machine Learning
• DEMO #3: Visualization of prediction output for real-time
4. Operations Management (25 minutes)
• PRESENTATION: AWS Marketplace Intelligent Analytical System
• DISCUSSION: Other challenges on building a end-to-end Big Data architecture
5. Workshop Wrap-up (5 minutes)
Collect & Store Transform
& Analyze
Operations
Management
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
GENERATE COLLECT STORE ANALYZE/PROCESS CONSUME
Increasing variety of
sources
Little visibility into
new data
Myopic view of
existing data
Difficulty
consolidating data
across different
sources and locations
Challenges
normalizing/
transforming/
aggregating data into
a standardized
format
Unable to capture
and/or process data
as quickly as it is
being generated
Unfamiliarity with
modern data
management
techniques
Lack of necessary skills
to implement and
maintain new
technologies
Scaling IT
infrastructure
Inability to process
data in a timely
manner once its
needed
Limited resources
and capabilities to
experiment and
iterate
Processing all data in
various formats
Predicting future
required capacity
Make more intelligent
business decisions
Limited adoption due
to rigidity and
inflexibility of legacy
BI tools
Be able to run queries
quickly
Get to data-driven
results faster
What we hear from our customers
Big Data Architecture
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Immediate availability
Broad & deep capabilities
Trusted & secure
Large partner ecosystem
http://aws.amazon.com/mp
Big Data on AWS iAS
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
DEMO 1: EXTEND YOUR ON-PREMISES
DATA MANAGEMENT TO AWS
Transform
& Analyze
Operations
Management
AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM
Collect & Store
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
SoftNAS Cloud High-Performance Cloud NAS
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
• Software-only NAS virtual appliance
built for the cloud
• Full protocol support: CIFS, NFS, AFP,
iSCSI
• High Availability
• Snapshots / Rollbacks
• Replication
• Deduplication
• Compression
• Price and Performance Tunable
• Active Directory / LDAP integration
• Scales from Gigabytes to Petabytes
SoftNAS Cloud Virtual NAS Overview
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Transform
& Analyze
Operations
Management
Collect & Store
WORKSHOP ACTIVITY 1: CREATING A
REAL-TIME STREAMING DATA PIPELINE
AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
1. See how to use serverless AWS Lambda to transform streaming data from a
sample weblog generator
2. Define an Amazon Kinesis Firehose Delivery Stream
3. Set up Amazon Kinesis Streams
4. Monitor the live stream using Amazon CloudWatch
5. Query the data in Amazon Redshift via a customized command line
Collaborative Team Exercise #1
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Operations
Management
Collect & Store
DEMO 2: MPP DATA TRANSFORMATION
& ORCHESTRATION
AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM
Transform
& Analyze
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Operations
Management
Collect & Store
WORKSHOP ACTIVITY 2:
PREDICTIVE ANALYTICS
AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM
Transform
& Analyze
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
1. Query data in Amazon S3
2. Design a predictive scoring formula using PredicSis.ai
3. Compute predictive models to apply to real-time customer data that can
be used to discover outliers who are ready right now to buy a Tesla
Collaborative Team Exercise #2
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Operations
Management
Collect & Store
DEMO 3: REAL-TIME DATA
VISUALIZATIONS AND ALERTS
AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM
Transform
& Analyze
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Collect & Store
OPERATIONS MANAGEMENT
AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM
Transform
& Analyze
Operations
Management
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
iAS: System Architecture and Design
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
GROUP DISCUSSION
AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM
• What challenges is my customer/organization facing
in building a Big Data Architecture?
• What capabilities is my organization missing?
• What parts would I want to further customize?
Collect & Store Transform
& Analyze
Operations
Management
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Key Takeaways
There is an increase of importance on new Big Data capabilities including:
• Securely combining data residing on-premises with Cloud applications
• Abandon historical-only business analysis to include predictive and other modern analytics to
immediately act upon key business insights
• Break down data silos and harness data in real time from all over the globe
• The importance of building a modular Big Data architecture
AWS Marketplace enables agility and experimentation
• Combining AWS services with solutions in AWS Marketplace are “pieces of the puzzle” that
can be replaced whenever newer products and services are released
• Easily evaluate innovative software solutions, pay only for what you use
Download the step-by-step guide for the Intelligent Analytical System (iAS)
• https://aws.amazon.com/mp/mp_solution
• https://aws-kimsshmidt.com
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Thank you!
@ l u i s d a n s
@ d a t a l e a d e r
http://aws.amazon.com/mp

More Related Content

What's hot

NET309_Best Practices for Securing an Amazon Virtual Private Cloud
NET309_Best Practices for Securing an Amazon Virtual Private CloudNET309_Best Practices for Securing an Amazon Virtual Private Cloud
NET309_Best Practices for Securing an Amazon Virtual Private CloudAmazon Web Services
 
DAT322_The Nanoservices Architecture That Powers BBC Online
DAT322_The Nanoservices Architecture That Powers BBC OnlineDAT322_The Nanoservices Architecture That Powers BBC Online
DAT322_The Nanoservices Architecture That Powers BBC OnlineAmazon Web Services
 
ARC303_Running Lean Architectures How to Optimize for Cost Efficiency
ARC303_Running Lean Architectures How to Optimize for Cost EfficiencyARC303_Running Lean Architectures How to Optimize for Cost Efficiency
ARC303_Running Lean Architectures How to Optimize for Cost EfficiencyAmazon Web Services
 
GPSBUS220-Refactor and Replatform .NET Apps to Use the Latest Microsoft SQL S...
GPSBUS220-Refactor and Replatform .NET Apps to Use the Latest Microsoft SQL S...GPSBUS220-Refactor and Replatform .NET Apps to Use the Latest Microsoft SQL S...
GPSBUS220-Refactor and Replatform .NET Apps to Use the Latest Microsoft SQL S...Amazon Web Services
 
CMP316_Hedge Your Own Funds Run Monte Carlo Simulations on EC2 Spot Fleet
CMP316_Hedge Your Own Funds Run Monte Carlo Simulations on EC2 Spot FleetCMP316_Hedge Your Own Funds Run Monte Carlo Simulations on EC2 Spot Fleet
CMP316_Hedge Your Own Funds Run Monte Carlo Simulations on EC2 Spot FleetAmazon Web Services
 
DAT203_Running MySQL Databases on AWS
DAT203_Running MySQL Databases on AWSDAT203_Running MySQL Databases on AWS
DAT203_Running MySQL Databases on AWSAmazon Web Services
 
ARC319_Multi-Region Active-Active Architecture
ARC319_Multi-Region Active-Active ArchitectureARC319_Multi-Region Active-Active Architecture
ARC319_Multi-Region Active-Active ArchitectureAmazon Web Services
 
STG206_Big Data Data Lakes and Data Oceans
STG206_Big Data Data Lakes and Data OceansSTG206_Big Data Data Lakes and Data Oceans
STG206_Big Data Data Lakes and Data OceansAmazon Web Services
 
ABD215_Serverless Data Prep with AWS Glue
ABD215_Serverless Data Prep with AWS GlueABD215_Serverless Data Prep with AWS Glue
ABD215_Serverless Data Prep with AWS GlueAmazon Web Services
 
MBL209_Learn How MicroStrategy on AWS is Helping Vivint Solar Deliver Clean E...
MBL209_Learn How MicroStrategy on AWS is Helping Vivint Solar Deliver Clean E...MBL209_Learn How MicroStrategy on AWS is Helping Vivint Solar Deliver Clean E...
MBL209_Learn How MicroStrategy on AWS is Helping Vivint Solar Deliver Clean E...Amazon Web Services
 
GPSTEC322-GPS Creating Your Virtual Data Center VPC Fundamentals Connectivity...
GPSTEC322-GPS Creating Your Virtual Data Center VPC Fundamentals Connectivity...GPSTEC322-GPS Creating Your Virtual Data Center VPC Fundamentals Connectivity...
GPSTEC322-GPS Creating Your Virtual Data Center VPC Fundamentals Connectivity...Amazon Web Services
 
規劃大規模遷移到 AWS 的最佳實踐
規劃大規模遷移到 AWS 的最佳實踐規劃大規模遷移到 AWS 的最佳實踐
規劃大規模遷移到 AWS 的最佳實踐Amazon Web Services
 
WIN301-Migrating Microsoft SQL Server Databases to AWS-Best Practices and Pat...
WIN301-Migrating Microsoft SQL Server Databases to AWS-Best Practices and Pat...WIN301-Migrating Microsoft SQL Server Databases to AWS-Best Practices and Pat...
WIN301-Migrating Microsoft SQL Server Databases to AWS-Best Practices and Pat...Amazon Web Services
 
Reinforcement Learning – The Ultimate AI - ARC320 - re:Invent 2017
Reinforcement Learning – The Ultimate AI - ARC320 - re:Invent 2017Reinforcement Learning – The Ultimate AI - ARC320 - re:Invent 2017
Reinforcement Learning – The Ultimate AI - ARC320 - re:Invent 2017Amazon Web Services
 
Scaling Up to Your First 10 Million Users
Scaling Up to Your First 10 Million UsersScaling Up to Your First 10 Million Users
Scaling Up to Your First 10 Million UsersAmazon Web Services
 
MSC203_How Citrix Uses AWS Marketplace Solutions To Accelerate Analytic Workl...
MSC203_How Citrix Uses AWS Marketplace Solutions To Accelerate Analytic Workl...MSC203_How Citrix Uses AWS Marketplace Solutions To Accelerate Analytic Workl...
MSC203_How Citrix Uses AWS Marketplace Solutions To Accelerate Analytic Workl...Amazon Web Services
 
DAT339_Replicate, Analyze, and Visualize Datasets Using AWS Database Migratio...
DAT339_Replicate, Analyze, and Visualize Datasets Using AWS Database Migratio...DAT339_Replicate, Analyze, and Visualize Datasets Using AWS Database Migratio...
DAT339_Replicate, Analyze, and Visualize Datasets Using AWS Database Migratio...Amazon Web Services
 
DEV326_DevOps Essentials An Introductory Workshop on CICD Practices
DEV326_DevOps Essentials An Introductory Workshop on CICD PracticesDEV326_DevOps Essentials An Introductory Workshop on CICD Practices
DEV326_DevOps Essentials An Introductory Workshop on CICD PracticesAmazon Web Services
 
ARC201_Scaling Up to Your First 10 Million Users
ARC201_Scaling Up to Your First 10 Million UsersARC201_Scaling Up to Your First 10 Million Users
ARC201_Scaling Up to Your First 10 Million UsersAmazon Web Services
 

What's hot (20)

NET309_Best Practices for Securing an Amazon Virtual Private Cloud
NET309_Best Practices for Securing an Amazon Virtual Private CloudNET309_Best Practices for Securing an Amazon Virtual Private Cloud
NET309_Best Practices for Securing an Amazon Virtual Private Cloud
 
GPSTEC325-Enterprise Storage
GPSTEC325-Enterprise StorageGPSTEC325-Enterprise Storage
GPSTEC325-Enterprise Storage
 
DAT322_The Nanoservices Architecture That Powers BBC Online
DAT322_The Nanoservices Architecture That Powers BBC OnlineDAT322_The Nanoservices Architecture That Powers BBC Online
DAT322_The Nanoservices Architecture That Powers BBC Online
 
ARC303_Running Lean Architectures How to Optimize for Cost Efficiency
ARC303_Running Lean Architectures How to Optimize for Cost EfficiencyARC303_Running Lean Architectures How to Optimize for Cost Efficiency
ARC303_Running Lean Architectures How to Optimize for Cost Efficiency
 
GPSBUS220-Refactor and Replatform .NET Apps to Use the Latest Microsoft SQL S...
GPSBUS220-Refactor and Replatform .NET Apps to Use the Latest Microsoft SQL S...GPSBUS220-Refactor and Replatform .NET Apps to Use the Latest Microsoft SQL S...
GPSBUS220-Refactor and Replatform .NET Apps to Use the Latest Microsoft SQL S...
 
CMP316_Hedge Your Own Funds Run Monte Carlo Simulations on EC2 Spot Fleet
CMP316_Hedge Your Own Funds Run Monte Carlo Simulations on EC2 Spot FleetCMP316_Hedge Your Own Funds Run Monte Carlo Simulations on EC2 Spot Fleet
CMP316_Hedge Your Own Funds Run Monte Carlo Simulations on EC2 Spot Fleet
 
DAT203_Running MySQL Databases on AWS
DAT203_Running MySQL Databases on AWSDAT203_Running MySQL Databases on AWS
DAT203_Running MySQL Databases on AWS
 
ARC319_Multi-Region Active-Active Architecture
ARC319_Multi-Region Active-Active ArchitectureARC319_Multi-Region Active-Active Architecture
ARC319_Multi-Region Active-Active Architecture
 
STG206_Big Data Data Lakes and Data Oceans
STG206_Big Data Data Lakes and Data OceansSTG206_Big Data Data Lakes and Data Oceans
STG206_Big Data Data Lakes and Data Oceans
 
ABD215_Serverless Data Prep with AWS Glue
ABD215_Serverless Data Prep with AWS GlueABD215_Serverless Data Prep with AWS Glue
ABD215_Serverless Data Prep with AWS Glue
 
MBL209_Learn How MicroStrategy on AWS is Helping Vivint Solar Deliver Clean E...
MBL209_Learn How MicroStrategy on AWS is Helping Vivint Solar Deliver Clean E...MBL209_Learn How MicroStrategy on AWS is Helping Vivint Solar Deliver Clean E...
MBL209_Learn How MicroStrategy on AWS is Helping Vivint Solar Deliver Clean E...
 
GPSTEC322-GPS Creating Your Virtual Data Center VPC Fundamentals Connectivity...
GPSTEC322-GPS Creating Your Virtual Data Center VPC Fundamentals Connectivity...GPSTEC322-GPS Creating Your Virtual Data Center VPC Fundamentals Connectivity...
GPSTEC322-GPS Creating Your Virtual Data Center VPC Fundamentals Connectivity...
 
規劃大規模遷移到 AWS 的最佳實踐
規劃大規模遷移到 AWS 的最佳實踐規劃大規模遷移到 AWS 的最佳實踐
規劃大規模遷移到 AWS 的最佳實踐
 
WIN301-Migrating Microsoft SQL Server Databases to AWS-Best Practices and Pat...
WIN301-Migrating Microsoft SQL Server Databases to AWS-Best Practices and Pat...WIN301-Migrating Microsoft SQL Server Databases to AWS-Best Practices and Pat...
WIN301-Migrating Microsoft SQL Server Databases to AWS-Best Practices and Pat...
 
Reinforcement Learning – The Ultimate AI - ARC320 - re:Invent 2017
Reinforcement Learning – The Ultimate AI - ARC320 - re:Invent 2017Reinforcement Learning – The Ultimate AI - ARC320 - re:Invent 2017
Reinforcement Learning – The Ultimate AI - ARC320 - re:Invent 2017
 
Scaling Up to Your First 10 Million Users
Scaling Up to Your First 10 Million UsersScaling Up to Your First 10 Million Users
Scaling Up to Your First 10 Million Users
 
MSC203_How Citrix Uses AWS Marketplace Solutions To Accelerate Analytic Workl...
MSC203_How Citrix Uses AWS Marketplace Solutions To Accelerate Analytic Workl...MSC203_How Citrix Uses AWS Marketplace Solutions To Accelerate Analytic Workl...
MSC203_How Citrix Uses AWS Marketplace Solutions To Accelerate Analytic Workl...
 
DAT339_Replicate, Analyze, and Visualize Datasets Using AWS Database Migratio...
DAT339_Replicate, Analyze, and Visualize Datasets Using AWS Database Migratio...DAT339_Replicate, Analyze, and Visualize Datasets Using AWS Database Migratio...
DAT339_Replicate, Analyze, and Visualize Datasets Using AWS Database Migratio...
 
DEV326_DevOps Essentials An Introductory Workshop on CICD Practices
DEV326_DevOps Essentials An Introductory Workshop on CICD PracticesDEV326_DevOps Essentials An Introductory Workshop on CICD Practices
DEV326_DevOps Essentials An Introductory Workshop on CICD Practices
 
ARC201_Scaling Up to Your First 10 Million Users
ARC201_Scaling Up to Your First 10 Million UsersARC201_Scaling Up to Your First 10 Million Users
ARC201_Scaling Up to Your First 10 Million Users
 

Similar to GPSWKS301_Comprehensive Big Data Architecture Made Easy

Architecting an Open Data Lake for the Enterprise
Architecting an Open Data Lake for the EnterpriseArchitecting an Open Data Lake for the Enterprise
Architecting an Open Data Lake for the EnterpriseAmazon Web Services
 
How Citrix Uses AWS Marketplace Solutions to Accelerate Analytic Workloads on...
How Citrix Uses AWS Marketplace Solutions to Accelerate Analytic Workloads on...How Citrix Uses AWS Marketplace Solutions to Accelerate Analytic Workloads on...
How Citrix Uses AWS Marketplace Solutions to Accelerate Analytic Workloads on...Amazon Web Services
 
Automating Big Data Technologies for Faster Time-to-Value
 Automating Big Data Technologies for Faster Time-to-Value Automating Big Data Technologies for Faster Time-to-Value
Automating Big Data Technologies for Faster Time-to-ValueAmazon Web Services
 
Architecting an Open Data Lake for the Enterprise
 Architecting an Open Data Lake for the Enterprise  Architecting an Open Data Lake for the Enterprise
Architecting an Open Data Lake for the Enterprise Amazon Web Services
 
TiVo: How to Scale New Products with a Data Lake on AWS and Qubole
 TiVo: How to Scale New Products with a Data Lake on AWS and Qubole TiVo: How to Scale New Products with a Data Lake on AWS and Qubole
TiVo: How to Scale New Products with a Data Lake on AWS and QuboleAmazon Web Services
 
TiVo: How to Scale New Products with a Data Lake on AWS and Qubole
 TiVo: How to Scale New Products with a Data Lake on AWS and Qubole TiVo: How to Scale New Products with a Data Lake on AWS and Qubole
TiVo: How to Scale New Products with a Data Lake on AWS and QuboleAmazon Web Services
 
Leveraging Data Analytics in the Cloud to Support Data-Driven Decisions
Leveraging Data Analytics in the Cloud to Support Data-Driven DecisionsLeveraging Data Analytics in the Cloud to Support Data-Driven Decisions
Leveraging Data Analytics in the Cloud to Support Data-Driven DecisionsAmazon Web Services
 
Fanatics Ingests Streaming Data to a Data Lake on AWS
Fanatics Ingests Streaming Data to a Data Lake on AWSFanatics Ingests Streaming Data to a Data Lake on AWS
Fanatics Ingests Streaming Data to a Data Lake on AWSAmazon Web Services
 
McGraw-Hill Optimizes Analytics Workloads with Databricks
 McGraw-Hill Optimizes Analytics Workloads with Databricks McGraw-Hill Optimizes Analytics Workloads with Databricks
McGraw-Hill Optimizes Analytics Workloads with DatabricksAmazon Web Services
 
利用 Amazon QuickSight 視覺化分析服務剖析資料
利用 Amazon QuickSight 視覺化分析服務剖析資料利用 Amazon QuickSight 視覺化分析服務剖析資料
利用 Amazon QuickSight 視覺化分析服務剖析資料Amazon Web Services
 
GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...
GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...
GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...Amazon Web Services
 
Amazon Web Services
Amazon Web ServicesAmazon Web Services
Amazon Web ServicesJisc
 
Citrix Moves Data to Amazon Redshift Fast with Matillion ETL
 Citrix Moves Data to Amazon Redshift Fast with Matillion ETL Citrix Moves Data to Amazon Redshift Fast with Matillion ETL
Citrix Moves Data to Amazon Redshift Fast with Matillion ETLAmazon Web Services
 
How Amazon.com Uses AWS Analytics: Data Analytics Week SF
How Amazon.com Uses AWS Analytics: Data Analytics Week SFHow Amazon.com Uses AWS Analytics: Data Analytics Week SF
How Amazon.com Uses AWS Analytics: Data Analytics Week SFAmazon Web Services
 
規劃大規模遷移到 AWS 的最佳實踐
規劃大規模遷移到 AWS 的最佳實踐規劃大規模遷移到 AWS 的最佳實踐
規劃大規模遷移到 AWS 的最佳實踐Amazon Web Services
 
How a Global Healthcare Company Built a Migration Factory to Quickly Move Tho...
How a Global Healthcare Company Built a Migration Factory to Quickly Move Tho...How a Global Healthcare Company Built a Migration Factory to Quickly Move Tho...
How a Global Healthcare Company Built a Migration Factory to Quickly Move Tho...Amazon Web Services
 
Visualizing Big Data Insights with Amazon QuickSight
Visualizing Big Data Insights with Amazon QuickSightVisualizing Big Data Insights with Amazon QuickSight
Visualizing Big Data Insights with Amazon QuickSightAmazon Web Services
 
How Amazon.com Uses AWS Analytics
How Amazon.com Uses AWS AnalyticsHow Amazon.com Uses AWS Analytics
How Amazon.com Uses AWS AnalyticsAmazon Web Services
 
Modern Data Architectures for Business Outcomes
Modern Data Architectures for Business OutcomesModern Data Architectures for Business Outcomes
Modern Data Architectures for Business OutcomesAmazon Web Services
 

Similar to GPSWKS301_Comprehensive Big Data Architecture Made Easy (20)

Architecting an Open Data Lake for the Enterprise
Architecting an Open Data Lake for the EnterpriseArchitecting an Open Data Lake for the Enterprise
Architecting an Open Data Lake for the Enterprise
 
How Citrix Uses AWS Marketplace Solutions to Accelerate Analytic Workloads on...
How Citrix Uses AWS Marketplace Solutions to Accelerate Analytic Workloads on...How Citrix Uses AWS Marketplace Solutions to Accelerate Analytic Workloads on...
How Citrix Uses AWS Marketplace Solutions to Accelerate Analytic Workloads on...
 
Automating Big Data Technologies for Faster Time-to-Value
 Automating Big Data Technologies for Faster Time-to-Value Automating Big Data Technologies for Faster Time-to-Value
Automating Big Data Technologies for Faster Time-to-Value
 
Architecting an Open Data Lake for the Enterprise
 Architecting an Open Data Lake for the Enterprise  Architecting an Open Data Lake for the Enterprise
Architecting an Open Data Lake for the Enterprise
 
TiVo: How to Scale New Products with a Data Lake on AWS and Qubole
 TiVo: How to Scale New Products with a Data Lake on AWS and Qubole TiVo: How to Scale New Products with a Data Lake on AWS and Qubole
TiVo: How to Scale New Products with a Data Lake on AWS and Qubole
 
TiVo: How to Scale New Products with a Data Lake on AWS and Qubole
 TiVo: How to Scale New Products with a Data Lake on AWS and Qubole TiVo: How to Scale New Products with a Data Lake on AWS and Qubole
TiVo: How to Scale New Products with a Data Lake on AWS and Qubole
 
Leveraging Data Analytics in the Cloud to Support Data-Driven Decisions
Leveraging Data Analytics in the Cloud to Support Data-Driven DecisionsLeveraging Data Analytics in the Cloud to Support Data-Driven Decisions
Leveraging Data Analytics in the Cloud to Support Data-Driven Decisions
 
Fanatics Ingests Streaming Data to a Data Lake on AWS
Fanatics Ingests Streaming Data to a Data Lake on AWSFanatics Ingests Streaming Data to a Data Lake on AWS
Fanatics Ingests Streaming Data to a Data Lake on AWS
 
McGraw-Hill Optimizes Analytics Workloads with Databricks
 McGraw-Hill Optimizes Analytics Workloads with Databricks McGraw-Hill Optimizes Analytics Workloads with Databricks
McGraw-Hill Optimizes Analytics Workloads with Databricks
 
利用 Amazon QuickSight 視覺化分析服務剖析資料
利用 Amazon QuickSight 視覺化分析服務剖析資料利用 Amazon QuickSight 視覺化分析服務剖析資料
利用 Amazon QuickSight 視覺化分析服務剖析資料
 
GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...
GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...
GPS: Industry 4.0: AI and the Future of Manufacturing - GPSTEC326 - re:Invent...
 
Amazon Web Services
Amazon Web ServicesAmazon Web Services
Amazon Web Services
 
Citrix Moves Data to Amazon Redshift Fast with Matillion ETL
 Citrix Moves Data to Amazon Redshift Fast with Matillion ETL Citrix Moves Data to Amazon Redshift Fast with Matillion ETL
Citrix Moves Data to Amazon Redshift Fast with Matillion ETL
 
How Amazon.com Uses AWS Analytics: Data Analytics Week SF
How Amazon.com Uses AWS Analytics: Data Analytics Week SFHow Amazon.com Uses AWS Analytics: Data Analytics Week SF
How Amazon.com Uses AWS Analytics: Data Analytics Week SF
 
How Amazon uses AWS Analytics
How Amazon uses AWS AnalyticsHow Amazon uses AWS Analytics
How Amazon uses AWS Analytics
 
規劃大規模遷移到 AWS 的最佳實踐
規劃大規模遷移到 AWS 的最佳實踐規劃大規模遷移到 AWS 的最佳實踐
規劃大規模遷移到 AWS 的最佳實踐
 
How a Global Healthcare Company Built a Migration Factory to Quickly Move Tho...
How a Global Healthcare Company Built a Migration Factory to Quickly Move Tho...How a Global Healthcare Company Built a Migration Factory to Quickly Move Tho...
How a Global Healthcare Company Built a Migration Factory to Quickly Move Tho...
 
Visualizing Big Data Insights with Amazon QuickSight
Visualizing Big Data Insights with Amazon QuickSightVisualizing Big Data Insights with Amazon QuickSight
Visualizing Big Data Insights with Amazon QuickSight
 
How Amazon.com Uses AWS Analytics
How Amazon.com Uses AWS AnalyticsHow Amazon.com Uses AWS Analytics
How Amazon.com Uses AWS Analytics
 
Modern Data Architectures for Business Outcomes
Modern Data Architectures for Business OutcomesModern Data Architectures for Business Outcomes
Modern Data Architectures for Business Outcomes
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

GPSWKS301_Comprehensive Big Data Architecture Made Easy

  • 1. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Comprehensive Big Data Architecture Made Easy: The AWS Marketplace Intelligent Analytical System Luis Daniel Soto, AWS. @luisdans AWS Re:INVENT HANDS-ON WORKSHOP Kim Schmidt, President & CIO/Dataleader.io @dataleader GPSWKS301 November 28, 2017 AWS re:INVENT
  • 2. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Big Data Workshop Audience and Goals Workshop Audience: • APN Consulting Partners • Organizations of all sizes Workshop Goals: • Show you the benefits of integrating AWS services and solutions from AWS Marketplace • Exercise 1: Build a data pipeline that will stream live data into Amazon Redshift • Exercise 2: Leverage machine learning to generate predictive analytics • Learn how to avoid common errors when building a Big Data Architecture
  • 3. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. What we’ll be doing today 1. Introduction (5 minutes) 2. Collect and Store Data (60 minutes) • DEMO #1: Extending on-premise data load to AWS • COLLABORATIVE TEAM EXERCISE: Building a data pipeline 3. Transform & Analyze (55 minutes) • DEMO #2: Orchestrate, transform and aggregate data on Amazon Redshift • COLLABORATIVE TEAM EXERCISE: Predictive Analytics with Machine Learning • DEMO #3: Visualization of prediction output for real-time 4. Operations Management (25 minutes) • PRESENTATION: AWS Marketplace Intelligent Analytical System • DISCUSSION: Other challenges on building a end-to-end Big Data architecture 5. Workshop Wrap-up (5 minutes) Collect & Store Transform & Analyze Operations Management
  • 4. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. GENERATE COLLECT STORE ANALYZE/PROCESS CONSUME Increasing variety of sources Little visibility into new data Myopic view of existing data Difficulty consolidating data across different sources and locations Challenges normalizing/ transforming/ aggregating data into a standardized format Unable to capture and/or process data as quickly as it is being generated Unfamiliarity with modern data management techniques Lack of necessary skills to implement and maintain new technologies Scaling IT infrastructure Inability to process data in a timely manner once its needed Limited resources and capabilities to experiment and iterate Processing all data in various formats Predicting future required capacity Make more intelligent business decisions Limited adoption due to rigidity and inflexibility of legacy BI tools Be able to run queries quickly Get to data-driven results faster What we hear from our customers Big Data Architecture
  • 5. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Immediate availability Broad & deep capabilities Trusted & secure Large partner ecosystem http://aws.amazon.com/mp Big Data on AWS iAS
  • 6. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. DEMO 1: EXTEND YOUR ON-PREMISES DATA MANAGEMENT TO AWS Transform & Analyze Operations Management AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM Collect & Store
  • 7. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. SoftNAS Cloud High-Performance Cloud NAS
  • 8. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. • Software-only NAS virtual appliance built for the cloud • Full protocol support: CIFS, NFS, AFP, iSCSI • High Availability • Snapshots / Rollbacks • Replication • Deduplication • Compression • Price and Performance Tunable • Active Directory / LDAP integration • Scales from Gigabytes to Petabytes SoftNAS Cloud Virtual NAS Overview
  • 9. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Transform & Analyze Operations Management Collect & Store WORKSHOP ACTIVITY 1: CREATING A REAL-TIME STREAMING DATA PIPELINE AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM
  • 10. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. 1. See how to use serverless AWS Lambda to transform streaming data from a sample weblog generator 2. Define an Amazon Kinesis Firehose Delivery Stream 3. Set up Amazon Kinesis Streams 4. Monitor the live stream using Amazon CloudWatch 5. Query the data in Amazon Redshift via a customized command line Collaborative Team Exercise #1
  • 11. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Operations Management Collect & Store DEMO 2: MPP DATA TRANSFORMATION & ORCHESTRATION AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM Transform & Analyze
  • 12. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
  • 13. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Operations Management Collect & Store WORKSHOP ACTIVITY 2: PREDICTIVE ANALYTICS AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM Transform & Analyze
  • 14. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. 1. Query data in Amazon S3 2. Design a predictive scoring formula using PredicSis.ai 3. Compute predictive models to apply to real-time customer data that can be used to discover outliers who are ready right now to buy a Tesla Collaborative Team Exercise #2
  • 15. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Operations Management Collect & Store DEMO 3: REAL-TIME DATA VISUALIZATIONS AND ALERTS AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM Transform & Analyze
  • 16. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Collect & Store OPERATIONS MANAGEMENT AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM Transform & Analyze Operations Management
  • 17. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. iAS: System Architecture and Design
  • 18. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. GROUP DISCUSSION AWS MARKETPLACE: INTELLIGENT ANALYTICAL SYSTEM • What challenges is my customer/organization facing in building a Big Data Architecture? • What capabilities is my organization missing? • What parts would I want to further customize? Collect & Store Transform & Analyze Operations Management
  • 19. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Key Takeaways There is an increase of importance on new Big Data capabilities including: • Securely combining data residing on-premises with Cloud applications • Abandon historical-only business analysis to include predictive and other modern analytics to immediately act upon key business insights • Break down data silos and harness data in real time from all over the globe • The importance of building a modular Big Data architecture AWS Marketplace enables agility and experimentation • Combining AWS services with solutions in AWS Marketplace are “pieces of the puzzle” that can be replaced whenever newer products and services are released • Easily evaluate innovative software solutions, pay only for what you use Download the step-by-step guide for the Intelligent Analytical System (iAS) • https://aws.amazon.com/mp/mp_solution • https://aws-kimsshmidt.com
  • 20. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Thank you! @ l u i s d a n s @ d a t a l e a d e r http://aws.amazon.com/mp