© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Rick Houlihan
Senior Practice Manager, DBS NoSQL
SRV307
Applying AWS Purpose-Built Database Strategy
Agenda
• Database workload classifications
• Traditional approaches to scaling RDBMS
• How NoSQL databases compare
• The flavors of NoSQL on AWS
• What database to use when
Why did you choose this database for your app?
“Because we heard X is the best new thing.”
“Because we have a site license for X.”
“Because X is what we know how to use.”
Why did you choose this database for your app?
“Because this database is purpose built to support
what my application is designed to do.”
Types of database workloads
Operations
• Online transaction processing (OLTP)
• Most common type of app
Analytics
• Online analytical processing (OLAP)
• BI and ad hoc data projections
• Decision support systems (DSS)
• Long-running query aggregations and projections
Sizing the workload
Unbounded problems are harder to solve
“I need a root cause analysis engine to correlate transaction level events
to trading patterns across global markets.”
Problems with limited scope are easier to solve
“I need a system to manage inventory in my store.”
Sizing the database
Scaling relational DBs
Sharded relational DBs?
A B C D
?
NoSQL databases
• Denormalize and shard to provide horizontal scale
• Near unbounded throughput and storage
Collection 1: 1 TB
Shard A: 500 GB
Shard B: 500 GB
Key space partitions (00 to FF): 00-54 | 55-A9 | AA-FF
Partition keys in NoSQL
Id = 1, Name = Jim: Hash (1) = 7B
Id = 2, Name = Andy, Dept = Eng: Hash (2) = 48
Id = 3, Name = Kim, Dept = Ops: Hash (3) = CD
Key Space
Partition key uniquely identifies an item
Partition key is used for building an unordered hash index
Allows table to be partitioned for scale
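The mapping from partition key to partition can be sketched in a few lines of Python. This is only an illustration. It assumes a one-byte key space (00 to FF) split into three ranges starting at 00, 55 and AA, and it uses MD5 as a stand-in hash. The partition names and function names are hypothetical, and the hash function behind the slide's example values (7B, 48, CD) is not stated, so MD5 will not reproduce them.

```python
import hashlib
from bisect import bisect_right

# Assumed layout: a one-byte key space (0x00..0xFF) split into three ranges.
# Each entry is the first hash value that belongs to the partition.
PARTITION_STARTS = [0x00, 0x55, 0xAA]
PARTITION_NAMES = ["A", "B", "C"]  # hypothetical partition names


def key_hash(partition_key) -> int:
    """Map a partition key to one byte of key space (MD5 is a stand-in hash)."""
    digest = hashlib.md5(str(partition_key).encode("utf-8")).digest()
    return digest[0]


def partition_for_hash(hash_value: int) -> str:
    """Find the partition whose range contains the given hash value."""
    index = bisect_right(PARTITION_STARTS, hash_value) - 1
    return PARTITION_NAMES[index]


def partition_for_key(partition_key) -> str:
    """Hash the key, then look up the owning partition."""
    return partition_for_hash(key_hash(partition_key))


if __name__ == "__main__":
    # The slide's example hash values land in different partitions.
    for item_id, slide_hash in [(1, 0x7B), (2, 0x48), (3, 0xCD)]:
        print(item_id, format(slide_hash, "02X"), partition_for_hash(slide_hash))
```

Because the hash scatters keys across the key space, items with adjacent Ids usually end up on different partitions, which is why the index is unordered and why the table can be spread across nodes.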
The iron triangle of data - All about CAP
Consistency (C):
all clients always have the same view of data
Availability (A):
all clients can always read and write
Partition tolerance (P):
the system works well despite physical network partitions
Pick two
CA: MS SQL, Oracle, DB2, MySQL, Aster Data, Greenplum, Postgres
CP: Big Table, Hypertable, HBase, MongoDB, Terastore, Couchbase, Scalaris, DynamoDB, BerkeleyDB, Memcached, Redis
AP: Voldemort, Tokyo Cabinet, KAI, DynamoDB, Cassandra, SimpleDB, CouchDB, Riak
Data Models:
Relational
Wide Column
Document
Key/Value
Technology adoption and the hype curve
DW | Big data processing | Ad hoc
AWS databases and analytics
Broadest and deepest portfolio purpose-built for builders
Business Intelligence & Machine Learning
Data movement
AWS Database Migration Service | AWS Snowball | AWS Snowmobile | Amazon Kinesis Data Firehose | Amazon Kinesis Data Streams
Amazon QuickSight
Relational databases
Amazon RDS
Amazon Aurora
Data lake (Batch/ETL)
Amazon S3/Amazon Glacier
(Storage)
AWS Glue
(ETL & Data Catalog)
Machine Learning
Amazon Macie
(Data Protection)
Denormalized databases
Analytics (OLAP/DSS)
DynamoDB
(Wide Column/Document)
ElastiCache
(Indexed Key Value)
Amazon Redshift
EMR
Amazon Athena
Amazon Kinesis Data Analytics
Amazon Elasticsearch Service
Real time
Operational (OLTP)
Neptune
(Graph)
Amazon RDS
Managed relational database service with a choice of six popular database engines
• Easy to administer: No need for infrastructure provisioning, installing and maintaining database software.
• Highly flexible: Scale database compute and storage with a few mouse clicks and zero downtime.
• Available & durable: Multi-AZ automatically replicates data. Automated backup, snapshots, failover.
• Fast: Choose between two SSD-backed storage options for high-performance OLTP.
Amazon DynamoDB
Fast and flexible NoSQL database service for any scale
Key-value NoSQL database that supports both document and wide column structures
• Fast, consistent performance: Consistent single-digit millisecond latencies at any scale. DAX brings response times down to microseconds.
• Highly scalable: Auto-scaling tables serving millions of requests per second, storing hundreds of terabytes of data.
• Fully managed: Automatic provisioning and infrastructure management.
• Business-critical reliability: Data replicated across multiple AZs and accessed with regionally available APIs.
DynamoDB schema
Table
Items
Attributes
Partition key
• Mandatory
• Key-value access pattern
• Determines data distribution
Sort key
• Optional
• Model 1:N relationships
• Enables rich query capabilities
All items for key
==, <, >, >=, <=
“begins with”
“between”
“contains”
“in”
sorted results
counts
top/bottom N values
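To make the partition key and sort key roles concrete, here is a minimal in-memory sketch in Python. It is not the DynamoDB API. The `KeyedTable` class, the `begins_with` and `between` helpers, and the sample customer and order keys are all hypothetical names chosen for illustration. The idea it shows is that the partition key selects one item collection, and the sort key orders that collection so that range conditions, sorted results, counts and top N lookups become simple operations.

```python
from collections import defaultdict


class KeyedTable:
    """Toy table: partition key -> {sort key -> attributes}. Not DynamoDB."""

    def __init__(self):
        self._partitions = defaultdict(dict)

    def put(self, pk, sk, **attrs):
        # An item is addressed by (partition key, sort key).
        self._partitions[pk][sk] = attrs

    def query(self, pk, condition=lambda sk: True, limit=None, descending=False):
        # Only one partition is read; results come back ordered by sort key.
        keys = sorted(self._partitions.get(pk, {}), reverse=descending)
        matches = [k for k in keys if condition(k)]
        if limit is not None:
            matches = matches[:limit]
        return [(k, self._partitions[pk][k]) for k in matches]


def begins_with(prefix):
    """Sort key condition: key starts with the given prefix."""
    return lambda sk: sk.startswith(prefix)


def between(low, high):
    """Sort key condition: low <= key <= high."""
    return lambda sk: low <= sk <= high


# Hypothetical sample data: one customer with a profile and several orders (1:N).
table = KeyedTable()
table.put("customer#1", "profile", name="Jim")
table.put("customer#1", "order#2018-01-05", total=20)
table.put("customer#1", "order#2018-02-11", total=35)
table.put("customer#1", "order#2018-02-20", total=15)
table.put("customer#2", "order#2018-02-01", total=50)

if __name__ == "__main__":
    print(table.query("customer#1", begins_with("order#2018-02")))
    print(table.query("customer#1", between("order#2018-01-01", "order#2018-01-31")))
    print(table.query("customer#1", begins_with("order#"), limit=1, descending=True))
    print(len(table.query("customer#1")))  # count of all items for the key
```

The design point is that every query names exactly one partition key, so the work stays inside one item collection no matter how large the table grows.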
SQL vs. NoSQL design pattern
Amazon Neptune
Fully managed graph database
• Fast: Query billions of relationships with millisecond latency
• Reliable: Six replicas of your data across three AZs with full backup and restore
• Easy: Build powerful queries easily with Gremlin and SPARQL
• Open: Supports Apache TinkerPop & W3C RDF graph models
Amazon Neptune high-level architecture
Bulk load
from S3
Database
Mgmt.
Graph workloads
Normalized graph design pattern
De-normalized graph design pattern
Node | Relationship | Endpoint
Bill | is a | Person
Bill | visited | Eiffel Tower
Alice | is a | Person
Alice | visited | Eiffel Tower
Alice | friend of | Bob
Bob | is a | Person
Bob | born on | 7/14/90
Bob | friend of | Alice
Bob | interested in | Mona Lisa
Leonardo da Vinci | is a | Person
La Jaconde a Washington | is a | Video
La Jaconde a Washington | is about | Mona Lisa
Eiffel Tower | is a | Place
Eiffel Tower | located in | Paris
7/14/90 | is a | Date
Paris | is a | City
The Louvre | is a | Museum
The Louvre | located in | Paris
Mona Lisa | is a | Painting
Mona Lisa | created by | Leonardo da Vinci
Mona Lisa | exhibited at | The Louvre
Nodes are the
vertices of a graph
Relationships are the
edges of a graph
Select nodes to get
edges for an entity
Index Relationship and
Endpoint for edge type and
target aggregations
Follow the edges to
traverse the graph
Bob wants to see the Mona
Lisa. While he is in Paris, he
would like to see other things
his friends have enjoyed.
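Bob's question can be answered by following edges in an adjacency list like the one on this slide. The Python sketch below is an assumption-laden illustration, not how Neptune or DynamoDB stores a graph. It loads a subset of the slide's rows as (node, relationship, endpoint) tuples, builds one lookup by node and one by (relationship, endpoint), and then walks friend of, visited and located in. The names `by_node`, `by_edge`, `endpoints` and `friends_places_in` are hypothetical.

```python
from collections import defaultdict

# A subset of the slide's rows as (node, relationship, endpoint) tuples.
EDGES = [
    ("Bill", "is a", "Person"),
    ("Bill", "visited", "Eiffel Tower"),
    ("Alice", "is a", "Person"),
    ("Alice", "visited", "Eiffel Tower"),
    ("Alice", "friend of", "Bob"),
    ("Bob", "is a", "Person"),
    ("Bob", "born on", "7/14/90"),
    ("Bob", "friend of", "Alice"),
    ("Bob", "interested in", "Mona Lisa"),
    ("Eiffel Tower", "is a", "Place"),
    ("Eiffel Tower", "located in", "Paris"),
    ("The Louvre", "is a", "Museum"),
    ("The Louvre", "located in", "Paris"),
    ("Mona Lisa", "is a", "Painting"),
    ("Mona Lisa", "exhibited at", "The Louvre"),
]

# Primary lookup: select a node to get its edges.
by_node = defaultdict(list)
# Index lookup: select (relationship, endpoint) to get the nodes pointing at it.
by_edge = defaultdict(list)
for node, relationship, endpoint in EDGES:
    by_node[node].append((relationship, endpoint))
    by_edge[(relationship, endpoint)].append(node)


def endpoints(node, relationship):
    """Node query: endpoints reached from a node over one relationship type."""
    return [e for r, e in by_node.get(node, []) if r == relationship]


def friends_places_in(person, city):
    """Traversal: places in a city that the person's friends have visited."""
    places = set()
    for friend in endpoints(person, "friend of"):
        for place in endpoints(friend, "visited"):
            if city in endpoints(place, "located in"):
                places.add(place)
    return sorted(places)


if __name__ == "__main__":
    print(endpoints("Mona Lisa", "exhibited at"))   # where Bob must go
    print(by_edge[("visited", "Eiffel Tower")])     # edge (index) query
    print(friends_places_in("Bob", "Paris"))        # hybrid traversal
```

Each hop in the traversal is another node lookup, which is why multi-hop questions are the ones that favor a graph database over repeated key-value reads.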
Graph query types
Node Query (Primary)
• What entities are in the graph?
Edge Query (Index)
• What relationships do graph entities have?
Node and edge queries: RDBMS, NoSQL, GraphDB
Hybrid Query (Traversal)
• How are entities related through each other?
Traversal queries: GraphDB
Amazon Redshift – Data warehousing
Fast, powerful, and simple data warehousing at 1/10 the cost
Massively parallel, petabyte scale
• Fast: Columnar storage technology to improve I/O efficiency and parallelize queries. Data load scales linearly.
• Inexpensive: As low as $1,000 per terabyte per year, 1/10th the cost of traditional data warehouse solutions.
• Scalable: Resize your cluster up and down as your performance and capacity needs change.
• Secure: Data encrypted at rest and in transit. Isolate clusters with VPC. Manage your own keys with AWS KMS.
Amazon Athena – Interactive analysis
Interactive query service to analyze data in Amazon S3 using standard SQL
No infrastructure to set up or manage and no data to load
Ability to run SQL queries on data archived in Amazon Glacier (coming soon)
• Serverless: Zero setup cost. Just point to Amazon S3, and start querying.
• Pay per query: Pay only for queries run. Save 30–90% on per-query costs through compression.
• Open: ANSI SQL interface, JDBC/ODBC drivers, multiple formats, compression types, and complex joins and data types.
• Easy: Serverless. Zero infrastructure. Zero administration.
Categories of database
SQL | NoSQL | Graph
Optimized for storage | Optimized for compute | Optimized for relationships
Normalized relational or dimensional DW | Denormalized document, wide column or key value | Denormalized entity relationship
Ad hoc queries and aggregations | Instantiated views and computed aggregations | Ad hoc entity/relationship aggregations
Scale vertically | Scale horizontally | Hybrid
Great for OLAP and DSS | Built for OLTP or DSS at scale | Designed for graph traversals
The iron triangle of purpose (the PIE theorem)
Pattern flexibility (P):
The database supports random access patterns and ad hoc queries
Infinite scale (I):
The database can gracefully increase size and throughput without practical limits
Efficiency (E):
The database will deliver required query latency for the workload at all times
PI
Amazon RDS
Elasticsearch
Aurora Serverless
Neptune
IE
Pick two
PE
Data models:
Relational
Wide Column
Document
Graph
Columnar
Unstructured
Amazon DynamoDB
Amazon Redshift
Athena
Hundreds of thousands of customers use DynamoDB
Hundreds of thousands more use Amazon RDS
Purpose-built database solutions from AWS
Provisioning
Capacity planning
Monitoring
OS patching
Hardware upgrades
Database upgrades
Security patches
Scaling
Monitoring
Performance tuning
Replication across data centers
Re-replicate on server failure
Provision new regions
Infrastructure Software
With zero unplanned downtime
Submit session feedback
1. Tap the Schedule icon.
2. Select the session you attended.
3. Tap Session Evaluation to submit your
feedback.
Thank you!

More Related Content

What's hot

Amazon DynamoDB Deep Dive Advanced Design Patterns for DynamoDB (DAT401) - AW...
Amazon DynamoDB Deep Dive Advanced Design Patterns for DynamoDB (DAT401) - AW...Amazon DynamoDB Deep Dive Advanced Design Patterns for DynamoDB (DAT401) - AW...
Amazon DynamoDB Deep Dive Advanced Design Patterns for DynamoDB (DAT401) - AW...
Amazon Web Services
 
Data Warehousing and Data Lake Analytics, Together - AWS Online Tech Talks
Data Warehousing and Data Lake Analytics, Together - AWS Online Tech TalksData Warehousing and Data Lake Analytics, Together - AWS Online Tech Talks
Data Warehousing and Data Lake Analytics, Together - AWS Online Tech Talks
Amazon Web Services
 
On-Ramp to Graph Databases and Amazon Neptune (DAT335) - AWS re:Invent 2018
On-Ramp to Graph Databases and Amazon Neptune (DAT335) - AWS re:Invent 2018On-Ramp to Graph Databases and Amazon Neptune (DAT335) - AWS re:Invent 2018
On-Ramp to Graph Databases and Amazon Neptune (DAT335) - AWS re:Invent 2018
Amazon Web Services
 
ABD312_Deep Dive Migrating Big Data Workloads to AWS
ABD312_Deep Dive Migrating Big Data Workloads to AWSABD312_Deep Dive Migrating Big Data Workloads to AWS
ABD312_Deep Dive Migrating Big Data Workloads to AWS
Amazon Web Services
 
Data Warehousing with Amazon Redshift
Data Warehousing with Amazon RedshiftData Warehousing with Amazon Redshift
Data Warehousing with Amazon Redshift
Amazon Web Services
 
Data Transformation Patterns in AWS - AWS Online Tech Talks
Data Transformation Patterns in AWS - AWS Online Tech TalksData Transformation Patterns in AWS - AWS Online Tech Talks
Data Transformation Patterns in AWS - AWS Online Tech Talks
Amazon Web Services
 
Migrate Workloads with Large Storage and I/O Demands (GPSTEC311) - AWS re:Inv...
Migrate Workloads with Large Storage and I/O Demands (GPSTEC311) - AWS re:Inv...Migrate Workloads with Large Storage and I/O Demands (GPSTEC311) - AWS re:Inv...
Migrate Workloads with Large Storage and I/O Demands (GPSTEC311) - AWS re:Inv...
Amazon Web Services
 
Building a Data Lake for Your Enterprise, ft. Sysco (STG309) - AWS re:Invent ...
Building a Data Lake for Your Enterprise, ft. Sysco (STG309) - AWS re:Invent ...Building a Data Lake for Your Enterprise, ft. Sysco (STG309) - AWS re:Invent ...
Building a Data Lake for Your Enterprise, ft. Sysco (STG309) - AWS re:Invent ...
Amazon Web Services
 
Using Data Lakes
Using Data LakesUsing Data Lakes
Using Data Lakes
Amazon Web Services
 
Deep Dive on Amazon Neptune (DAT403) - AWS re:Invent 2018
Deep Dive on Amazon Neptune (DAT403) - AWS re:Invent 2018Deep Dive on Amazon Neptune (DAT403) - AWS re:Invent 2018
Deep Dive on Amazon Neptune (DAT403) - AWS re:Invent 2018
Amazon Web Services
 
Build Data Lakes & Analytics on AWS: Patterns & Best Practices
Build Data Lakes & Analytics on AWS: Patterns & Best PracticesBuild Data Lakes & Analytics on AWS: Patterns & Best Practices
Build Data Lakes & Analytics on AWS: Patterns & Best Practices
Amazon Web Services
 
What's New with Amazon Redshift ft. Dow Jones (ANT350-R) - AWS re:Invent 2018
What's New with Amazon Redshift ft. Dow Jones (ANT350-R) - AWS re:Invent 2018What's New with Amazon Redshift ft. Dow Jones (ANT350-R) - AWS re:Invent 2018
What's New with Amazon Redshift ft. Dow Jones (ANT350-R) - AWS re:Invent 2018
Amazon Web Services
 
Building Serverless ETL Pipelines
Building Serverless ETL PipelinesBuilding Serverless ETL Pipelines
Building Serverless ETL Pipelines
Amazon Web Services
 
Building High Performance Apps with In-memory Data
Building High Performance Apps with In-memory DataBuilding High Performance Apps with In-memory Data
Building High Performance Apps with In-memory Data
Amazon Web Services
 
Loading Data into Redshift with Lab
Loading Data into Redshift with LabLoading Data into Redshift with Lab
Loading Data into Redshift with Lab
Amazon Web Services
 
reModernize - Updating & Consolidating MySQL
reModernize - Updating & Consolidating MySQLreModernize - Updating & Consolidating MySQL
reModernize - Updating & Consolidating MySQL
Amazon Web Services
 
Integrating Amazon Elasticsearch with your DevOps Tooling - AWS Online Tech T...
Integrating Amazon Elasticsearch with your DevOps Tooling - AWS Online Tech T...Integrating Amazon Elasticsearch with your DevOps Tooling - AWS Online Tech T...
Integrating Amazon Elasticsearch with your DevOps Tooling - AWS Online Tech T...
Amazon Web Services
 
Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...
Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...
Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...
Amazon Web Services
 
Migrating Your NoSQL Database to Amazon DynamoDB (DAT314) - AWS re:Invent 2018
Migrating Your NoSQL Database to Amazon DynamoDB (DAT314) - AWS re:Invent 2018Migrating Your NoSQL Database to Amazon DynamoDB (DAT314) - AWS re:Invent 2018
Migrating Your NoSQL Database to Amazon DynamoDB (DAT314) - AWS re:Invent 2018
Amazon Web Services
 
Building Serverless Analytics Pipelines with AWS Glue (ANT308) - AWS re:Inven...
Building Serverless Analytics Pipelines with AWS Glue (ANT308) - AWS re:Inven...Building Serverless Analytics Pipelines with AWS Glue (ANT308) - AWS re:Inven...
Building Serverless Analytics Pipelines with AWS Glue (ANT308) - AWS re:Inven...
Amazon Web Services
 

What's hot (20)

Amazon DynamoDB Deep Dive Advanced Design Patterns for DynamoDB (DAT401) - AW...
Amazon DynamoDB Deep Dive Advanced Design Patterns for DynamoDB (DAT401) - AW...Amazon DynamoDB Deep Dive Advanced Design Patterns for DynamoDB (DAT401) - AW...
Amazon DynamoDB Deep Dive Advanced Design Patterns for DynamoDB (DAT401) - AW...
 
Data Warehousing and Data Lake Analytics, Together - AWS Online Tech Talks
Data Warehousing and Data Lake Analytics, Together - AWS Online Tech TalksData Warehousing and Data Lake Analytics, Together - AWS Online Tech Talks
Data Warehousing and Data Lake Analytics, Together - AWS Online Tech Talks
 
On-Ramp to Graph Databases and Amazon Neptune (DAT335) - AWS re:Invent 2018
On-Ramp to Graph Databases and Amazon Neptune (DAT335) - AWS re:Invent 2018On-Ramp to Graph Databases and Amazon Neptune (DAT335) - AWS re:Invent 2018
On-Ramp to Graph Databases and Amazon Neptune (DAT335) - AWS re:Invent 2018
 
ABD312_Deep Dive Migrating Big Data Workloads to AWS
ABD312_Deep Dive Migrating Big Data Workloads to AWSABD312_Deep Dive Migrating Big Data Workloads to AWS
ABD312_Deep Dive Migrating Big Data Workloads to AWS
 
Data Warehousing with Amazon Redshift
Data Warehousing with Amazon RedshiftData Warehousing with Amazon Redshift
Data Warehousing with Amazon Redshift
 
Data Transformation Patterns in AWS - AWS Online Tech Talks
Data Transformation Patterns in AWS - AWS Online Tech TalksData Transformation Patterns in AWS - AWS Online Tech Talks
Data Transformation Patterns in AWS - AWS Online Tech Talks
 
Migrate Workloads with Large Storage and I/O Demands (GPSTEC311) - AWS re:Inv...
Migrate Workloads with Large Storage and I/O Demands (GPSTEC311) - AWS re:Inv...Migrate Workloads with Large Storage and I/O Demands (GPSTEC311) - AWS re:Inv...
Migrate Workloads with Large Storage and I/O Demands (GPSTEC311) - AWS re:Inv...
 
Building a Data Lake for Your Enterprise, ft. Sysco (STG309) - AWS re:Invent ...
Building a Data Lake for Your Enterprise, ft. Sysco (STG309) - AWS re:Invent ...Building a Data Lake for Your Enterprise, ft. Sysco (STG309) - AWS re:Invent ...
Building a Data Lake for Your Enterprise, ft. Sysco (STG309) - AWS re:Invent ...
 
Using Data Lakes
Using Data LakesUsing Data Lakes
Using Data Lakes
 
Deep Dive on Amazon Neptune (DAT403) - AWS re:Invent 2018
Deep Dive on Amazon Neptune (DAT403) - AWS re:Invent 2018Deep Dive on Amazon Neptune (DAT403) - AWS re:Invent 2018
Deep Dive on Amazon Neptune (DAT403) - AWS re:Invent 2018
 
Build Data Lakes & Analytics on AWS: Patterns & Best Practices
Build Data Lakes & Analytics on AWS: Patterns & Best PracticesBuild Data Lakes & Analytics on AWS: Patterns & Best Practices
Build Data Lakes & Analytics on AWS: Patterns & Best Practices
 
What's New with Amazon Redshift ft. Dow Jones (ANT350-R) - AWS re:Invent 2018
What's New with Amazon Redshift ft. Dow Jones (ANT350-R) - AWS re:Invent 2018What's New with Amazon Redshift ft. Dow Jones (ANT350-R) - AWS re:Invent 2018
What's New with Amazon Redshift ft. Dow Jones (ANT350-R) - AWS re:Invent 2018
 
Building Serverless ETL Pipelines
Building Serverless ETL PipelinesBuilding Serverless ETL Pipelines
Building Serverless ETL Pipelines
 
Building High Performance Apps with In-memory Data
Building High Performance Apps with In-memory DataBuilding High Performance Apps with In-memory Data
Building High Performance Apps with In-memory Data
 
Loading Data into Redshift with Lab
Loading Data into Redshift with LabLoading Data into Redshift with Lab
Loading Data into Redshift with Lab
 
reModernize - Updating & Consolidating MySQL
reModernize - Updating & Consolidating MySQLreModernize - Updating & Consolidating MySQL
reModernize - Updating & Consolidating MySQL
 
Integrating Amazon Elasticsearch with your DevOps Tooling - AWS Online Tech T...
Integrating Amazon Elasticsearch with your DevOps Tooling - AWS Online Tech T...Integrating Amazon Elasticsearch with your DevOps Tooling - AWS Online Tech T...
Integrating Amazon Elasticsearch with your DevOps Tooling - AWS Online Tech T...
 
Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...
Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...
Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...
 
Migrating Your NoSQL Database to Amazon DynamoDB (DAT314) - AWS re:Invent 2018
Migrating Your NoSQL Database to Amazon DynamoDB (DAT314) - AWS re:Invent 2018Migrating Your NoSQL Database to Amazon DynamoDB (DAT314) - AWS re:Invent 2018
Migrating Your NoSQL Database to Amazon DynamoDB (DAT314) - AWS re:Invent 2018
 
Building Serverless Analytics Pipelines with AWS Glue (ANT308) - AWS re:Inven...
Building Serverless Analytics Pipelines with AWS Glue (ANT308) - AWS re:Inven...Building Serverless Analytics Pipelines with AWS Glue (ANT308) - AWS re:Inven...
Building Serverless Analytics Pipelines with AWS Glue (ANT308) - AWS re:Inven...
 

Similar to Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS Summit

SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
 SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ... SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
Amazon Web Services
 
Building with AWS Databases: Match Your Workload to the Right Database (DAT30...
Building with AWS Databases: Match Your Workload to the Right Database (DAT30...Building with AWS Databases: Match Your Workload to the Right Database (DAT30...
Building with AWS Databases: Match Your Workload to the Right Database (DAT30...
Amazon Web Services
 
AWS Data Lake: data analysis @ scale
AWS Data Lake: data analysis @ scaleAWS Data Lake: data analysis @ scale
AWS Data Lake: data analysis @ scale
Amazon Web Services
 
Analyze your Data Lake, Fast @ Any Scale - AWS Online Tech Talks
Analyze your Data Lake, Fast @ Any Scale - AWS Online Tech TalksAnalyze your Data Lake, Fast @ Any Scale - AWS Online Tech Talks
Analyze your Data Lake, Fast @ Any Scale - AWS Online Tech Talks
Amazon Web Services
 
Data Warehouses and Data Lakes
Data Warehouses and Data LakesData Warehouses and Data Lakes
Data Warehouses and Data Lakes
Amazon Web Services
 
Data Warehouses and Data Lakes
Data Warehouses and Data LakesData Warehouses and Data Lakes
Data Warehouses and Data Lakes
Amazon Web Services
 
Database Freedom. Database migration approaches to get to the Cloud - Marcus ...
Database Freedom. Database migration approaches to get to the Cloud - Marcus ...Database Freedom. Database migration approaches to get to the Cloud - Marcus ...
Database Freedom. Database migration approaches to get to the Cloud - Marcus ...
Amazon Web Services
 
Build Data Lakes and Analytics on AWS: Patterns & Best Practices
Build Data Lakes and Analytics on AWS: Patterns & Best PracticesBuild Data Lakes and Analytics on AWS: Patterns & Best Practices
Build Data Lakes and Analytics on AWS: Patterns & Best Practices
Amazon Web Services
 
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Amazon Web Services
 
21st Century Analytics with Zopa
21st Century Analytics with Zopa21st Century Analytics with Zopa
21st Century Analytics with Zopa
Amazon Web Services
 
Builders' Day - Building Data Lakes for Analytics On AWS LC
Builders' Day - Building Data Lakes for Analytics On AWS LCBuilders' Day - Building Data Lakes for Analytics On AWS LC
Builders' Day - Building Data Lakes for Analytics On AWS LC
Amazon Web Services LATAM
 
Immersion Day - Como simplificar o acesso ao seu ambiente analítico
Immersion Day - Como simplificar o acesso ao seu ambiente analíticoImmersion Day - Como simplificar o acesso ao seu ambiente analítico
Immersion Day - Como simplificar o acesso ao seu ambiente analítico
Amazon Web Services LATAM
 
Implementazione di una soluzione Data Lake.pdf
Implementazione di una soluzione Data Lake.pdfImplementazione di una soluzione Data Lake.pdf
Implementazione di una soluzione Data Lake.pdf
Amazon Web Services
 
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018
Amazon Web Services
 
Cutting to the chase for Machine Learning Analytics Ecosystem & AWS Lake Form...
Cutting to the chase for Machine Learning Analytics Ecosystem & AWS Lake Form...Cutting to the chase for Machine Learning Analytics Ecosystem & AWS Lake Form...
Cutting to the chase for Machine Learning Analytics Ecosystem & AWS Lake Form...
AWS Riyadh User Group
 
Construindo data lakes e analytics com AWS
Construindo data lakes e analytics com AWSConstruindo data lakes e analytics com AWS
Construindo data lakes e analytics com AWS
Amazon Web Services LATAM
 
Modernise your Data Warehouse with Amazon Redshift and Amazon Redshift Spectrum
Modernise your Data Warehouse with Amazon Redshift and Amazon Redshift SpectrumModernise your Data Warehouse with Amazon Redshift and Amazon Redshift Spectrum
Modernise your Data Warehouse with Amazon Redshift and Amazon Redshift Spectrum
Amazon Web Services
 
ABD201-Big Data Architectural Patterns and Best Practices on AWS
ABD201-Big Data Architectural Patterns and Best Practices on AWSABD201-Big Data Architectural Patterns and Best Practices on AWS
ABD201-Big Data Architectural Patterns and Best Practices on AWS
Amazon Web Services
 
Building Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWSBuilding Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWS
Amazon Web Services
 
Building Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWSBuilding Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWS
Amazon Web Services
 

Similar to Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS Summit (20)

SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
 SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ... SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
 
Building with AWS Databases: Match Your Workload to the Right Database (DAT30...
Building with AWS Databases: Match Your Workload to the Right Database (DAT30...Building with AWS Databases: Match Your Workload to the Right Database (DAT30...
Building with AWS Databases: Match Your Workload to the Right Database (DAT30...
 
AWS Data Lake: data analysis @ scale
AWS Data Lake: data analysis @ scaleAWS Data Lake: data analysis @ scale
AWS Data Lake: data analysis @ scale
 
Analyze your Data Lake, Fast @ Any Scale - AWS Online Tech Talks
Analyze your Data Lake, Fast @ Any Scale - AWS Online Tech TalksAnalyze your Data Lake, Fast @ Any Scale - AWS Online Tech Talks
Analyze your Data Lake, Fast @ Any Scale - AWS Online Tech Talks
 
Data Warehouses and Data Lakes
Data Warehouses and Data LakesData Warehouses and Data Lakes
Data Warehouses and Data Lakes
 
Data Warehouses and Data Lakes
Data Warehouses and Data LakesData Warehouses and Data Lakes
Data Warehouses and Data Lakes
 
Database Freedom. Database migration approaches to get to the Cloud - Marcus ...
Database Freedom. Database migration approaches to get to the Cloud - Marcus ...Database Freedom. Database migration approaches to get to the Cloud - Marcus ...
Database Freedom. Database migration approaches to get to the Cloud - Marcus ...
 
Build Data Lakes and Analytics on AWS: Patterns & Best Practices
Build Data Lakes and Analytics on AWS: Patterns & Best PracticesBuild Data Lakes and Analytics on AWS: Patterns & Best Practices
Build Data Lakes and Analytics on AWS: Patterns & Best Practices
 
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
 
21st Century Analytics with Zopa
21st Century Analytics with Zopa21st Century Analytics with Zopa
21st Century Analytics with Zopa
 
Builders' Day - Building Data Lakes for Analytics On AWS LC
Builders' Day - Building Data Lakes for Analytics On AWS LCBuilders' Day - Building Data Lakes for Analytics On AWS LC
Builders' Day - Building Data Lakes for Analytics On AWS LC
 
Immersion Day - Como simplificar o acesso ao seu ambiente analítico
Immersion Day - Como simplificar o acesso ao seu ambiente analíticoImmersion Day - Como simplificar o acesso ao seu ambiente analítico
Immersion Day - Como simplificar o acesso ao seu ambiente analítico
 
Implementazione di una soluzione Data Lake.pdf
Implementazione di una soluzione Data Lake.pdfImplementazione di una soluzione Data Lake.pdf
Implementazione di una soluzione Data Lake.pdf
 
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018
 
Cutting to the chase for Machine Learning Analytics Ecosystem & AWS Lake Form...
Cutting to the chase for Machine Learning Analytics Ecosystem & AWS Lake Form...Cutting to the chase for Machine Learning Analytics Ecosystem & AWS Lake Form...
Cutting to the chase for Machine Learning Analytics Ecosystem & AWS Lake Form...
 
Construindo data lakes e analytics com AWS
Construindo data lakes e analytics com AWSConstruindo data lakes e analytics com AWS
Construindo data lakes e analytics com AWS
 
Modernise your Data Warehouse with Amazon Redshift and Amazon Redshift Spectrum
Modernise your Data Warehouse with Amazon Redshift and Amazon Redshift SpectrumModernise your Data Warehouse with Amazon Redshift and Amazon Redshift Spectrum
Modernise your Data Warehouse with Amazon Redshift and Amazon Redshift Spectrum
 
ABD201-Big Data Architectural Patterns and Best Practices on AWS
ABD201-Big Data Architectural Patterns and Best Practices on AWSABD201-Big Data Architectural Patterns and Best Practices on AWS
ABD201-Big Data Architectural Patterns and Best Practices on AWS
 
Building Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWSBuilding Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWS
 
Building Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWSBuilding Data Lakes and Analytics on AWS
Building Data Lakes and Analytics on AWS
 

Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS Summit

  • 1. Applying AWS Purpose-Built Database Strategy (SRV307). Rick Houlihan, Senior Practice Manager, DBS NoSQL. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 2. Agenda: database workload classifications; traditional approaches to scaling RDBMS; how NoSQL databases compare; the flavors of NoSQL on AWS; what database to use when.
  • 3. Why did you choose this database for your app? “Because we heard X is the best new thing.” “Because we have a site license for X.” “Because X is what we know how to use.”
  • 4. Why did you choose this database for your app? “Because this database is purpose built to support what my application is designed to do.”
  • 5. Types of database workloads. Operations: online transaction processing (OLTP), the most common type of app. Analytics: online analytics processing (OLAP), for BI and ad hoc data projections; decision support systems (DSS), for long-running query aggregations and projections.
  • 6. Sizing the workload. Unbounded problems are harder to solve: “I need a root cause analysis engine to correlate transaction-level events to trading patterns across global markets.” Problems with limited scope are easier to solve: “I need a system to manage inventory in my store.”
  • 7. Sizing the database
  • 8. Scaling relational DBs
  • 9. Sharded relational DBs? (Diagram labels: A, B, C, D, ?)
  • 10. NoSQL databases: denormalize and shard to provide horizontal scale; near-unbounded throughput and storage. (Diagram: Collection 1, 1 TB, split into Shard A, 500 GB, and Shard B, 500 GB.)
  • 11. Partition keys in NoSQL. The partition key uniquely identifies an item, is used for building an unordered hash index, and allows the table to be partitioned for scale. (Diagram: a key space running from 00 to FF with range labels 00, 54, 55, A9, AA, FF. Items: Id = 1, Name = Jim, Hash(1) = 7B; Id = 2, Name = Andy, Dept = Eng, Hash(2) = 48; Id = 3, Name = Kim, Dept = Ops, Hash(3) = CD.)
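The partitioning idea on slide 11 can be sketched in a few lines of Python. This is only an illustration: the three ranges below are one reading of the slide's key-space labels (00-54, 55-A9, AA-FF), MD5 is a stand-in hash, and the function names are hypothetical. The slide does not say which hash function or range boundaries any real service uses.

```python
import hashlib

# Assumed partition boundaries, read from the slide's key-space labels:
# 00-54, 55-A9, AA-FF. Real services choose and split ranges themselves.
PARTITION_RANGES = [(0x00, 0x54), (0x55, 0xA9), (0xAA, 0xFF)]


def key_hash(partition_key) -> int:
    """Hash a partition key to one byte (0x00-0xFF). MD5 is a stand-in hash."""
    digest = hashlib.md5(str(partition_key).encode("utf-8")).digest()
    return digest[0]


def partition_for_hash(hash_value: int) -> int:
    """Return the index of the partition whose range contains hash_value."""
    for index, (low, high) in enumerate(PARTITION_RANGES):
        if low <= hash_value <= high:
            return index
    raise ValueError(f"hash {hash_value:#04x} is outside the key space")


def partition_for_key(partition_key) -> int:
    """Route an item to a partition using only its partition key."""
    return partition_for_hash(key_hash(partition_key))


# The slide's example hashes (7B, 48, CD) fall into three different ranges.
for item_id, slide_hash in [(1, 0x7B), (2, 0x48), (3, 0xCD)]:
    partition = partition_for_hash(slide_hash)
    print(f"Id = {item_id}: hash {slide_hash:02X} -> partition {partition}")
```

Because the hash index is unordered, neighboring ids such as 1, 2, and 3 scatter across the key space, which is what spreads load across partitions.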
  • 12. The iron triangle of data: all about CAP (pick two). Consistency (C): all clients always have the same view of data. Availability (A): all clients can always read and write. Partition tolerance (P): the system works well despite physical network partitions. CA: MS SQL, Oracle, DB2, MySQL, Aster Data, Greenplum, Postgres. CP: Big Table, Hypertable, HBase, MongoDB, Terastore, Couchbase, Scalaris, DynamoDB, BerkeleyDB, Memcached, Redis. AP: Voldemort, Tokyo Cabinet, KAI, DynamoDB, Cassandra, SimpleDB, CouchDB, Riak. Data models: relational, wide column, document, key/value.
  • 13. Technology adoption and the hype curve
  • 14. AWS databases and analytics: broadest and deepest portfolio, purpose-built for builders. Business intelligence & machine learning: Amazon QuickSight, Machine Learning. Relational databases: Amazon RDS, Amazon Aurora. Denormalized databases: DynamoDB (wide column/document), ElastiCache (indexed key value), Neptune (graph). Analytics (OLAP/DSS): Amazon Redshift, EMR, Amazon Athena, Amazon Kinesis Data Analytics, Amazon Elasticsearch Service. Data lake (batch/ETL): Amazon S3/Amazon Glacier (storage), AWS Glue (ETL & Data Catalog), Amazon Macie (data protection). Data movement: AWS Database Migration Service | AWS Snowball | AWS Snowmobile | Amazon Kinesis Data Firehose | Amazon Kinesis Data Streams. (Other diagram labels: Operational (OLTP); DW | Big data processing | Ad hoc; Real time.)
  • 15. Amazon RDS: managed relational database service with a choice of six popular database engines. Easy to administer: no need for infrastructure provisioning, installing and maintaining database software. Highly flexible: scale database compute and storage with a few mouse clicks and zero downtime. Available & durable: Multi-AZ automatically replicates data; automated backup, snapshots, failover. Fast: choose between dual SSD-backed storage for high-performance OLTP.
  • 16. Amazon DynamoDB: fast and flexible NoSQL database service for any scale. Key-value NoSQL database that supports both document and wide column structures. Fast, consistent performance: consistent single-digit millisecond latencies at any scale; DAX (DynamoDB Accelerator) cuts response times to microseconds. Highly scalable: auto-scaling tables serving millions of requests per second, storing hundreds of terabytes of data. Fully managed: automatic provisioning and infrastructure management. Business-critical reliability: data replicated across multiple AZs and accessed with regionally available APIs.
  • 17. DynamoDB schema: table, items, attributes. Partition key (mandatory): key-value access pattern; determines data distribution. Sort key (optional): models 1:N relationships; enables rich query capabilities. Query capabilities: all items for a key; ==, <, >, >=, <=; “begins with”; “between”; “contains”; “in”; sorted results; counts; top/bottom N values.
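To make the partition key / sort key split on slide 17 concrete, here is a minimal in-memory sketch in Python. It is not the DynamoDB API: the `Table` class, its `put` and `query` methods, and the `begins_with` and `between` helpers are hypothetical names chosen for illustration, and the customer and order keys are made-up sample data. It only shows how one partition key can hold many sort-keyed items (a 1:N relationship) that are queried by range, by prefix, or for the top N.

```python
from collections import defaultdict


class Table:
    """Toy table: partition key -> {sort key: attributes}. Illustration only."""

    def __init__(self):
        self._partitions = defaultdict(dict)

    def put(self, pk, sk, attributes):
        """Store one item under (partition key, sort key)."""
        self._partitions[pk][sk] = dict(attributes)

    def query(self, pk, condition=lambda sk: True, limit=None, descending=False):
        """Return (sort key, attributes) pairs for one partition key, sorted by sort key."""
        items = self._partitions.get(pk, {})
        keys = sorted((sk for sk in items if condition(sk)), reverse=descending)
        if limit is not None:
            keys = keys[:limit]
        return [(sk, items[sk]) for sk in keys]


def begins_with(prefix):
    """Sort-key condition: key starts with the given prefix."""
    return lambda sk: sk.startswith(prefix)


def between(low, high):
    """Sort-key condition: low <= key <= high (inclusive)."""
    return lambda sk: low <= sk <= high


# Hypothetical sample data: one customer (partition key) has many orders (sort keys).
orders = Table()
orders.put("customer#1", "2018-03-02#order-17", {"total": 40})
orders.put("customer#1", "2018-04-11#order-21", {"total": 15})
orders.put("customer#1", "2018-04-30#order-25", {"total": 99})
orders.put("customer#2", "2018-04-12#order-22", {"total": 7})

april = orders.query("customer#1", begins_with("2018-04"))
early = orders.query("customer#1", between("2018-03", "2018-04-15"))
latest = orders.query("customer#1", limit=1, descending=True)
print([sk for sk, _ in april], [sk for sk, _ in latest])
```

Putting the date at the front of the sort key is the design choice that makes “begins with”, “between”, and top-N queries cheap: items for one partition key are already ordered by that value.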
  • 18. SQL vs. NoSQL design pattern
  • 19. Amazon Neptune: fully managed graph database. Fast: query billions of relationships with millisecond latency. Reliable: six replicas of your data across three AZs with full backup and restore. Easy: build powerful queries easily with Gremlin and SPARQL. Open: supports Apache TinkerPop & W3C RDF graph models.
  • 20. Amazon Neptune high-level architecture. (Diagram labels: bulk load from S3; database management.)
  • 21. Graph workloads
  • 22. Normalized graph design pattern
  • 23. De-normalized graph design pattern. Nodes are the vertices of a graph; relationships are the edges of a graph. Select nodes to get edges for an entity. Index Relationship and Endpoint for edge type and target aggregations. Follow the edges to traverse the graph. Example: Bob wants to see the Mona Lisa. While he is in Paris, he would like to see other things his friends have enjoyed. Table rows (Node: Relationship Endpoint). Bill: is a Person; visited Eiffel Tower. Alice: is a Person; visited Eiffel Tower; friend of Bob. Bob: is a Person; born on 7/14/90; friend of Alice; interested in Mona Lisa. Leonardo da Vinci: is a Person. La Joconde a Washington: is a Video; is about Mona Lisa. Eiffel Tower: is a Place; located in Paris. 7/14/90: is a Date. Paris: is a City. The Louvre: is a Museum; located in Paris. Mona Lisa: is a Painting; created by Leonardo da Vinci; exhibited at The Louvre.
  • 24. Graph query types. Node query (primary): what entities are in the graph? Edge query (index): what relationships do graph entities have? Hybrid query (traversal): how are entities related through each other? (Diagram labels: RDBMS, NoSQL, GraphDB; GraphDB.)
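The three query types can be tried against the Node / Relationship / Endpoint rows listed on slide 23. The Python sketch below is an assumption-laden illustration, not Neptune or DynamoDB code: `by_node` stands in for the primary key lookup, `by_edge` for the index on Relationship and Endpoint, and `friend_recommendations` is a hypothetical traversal that answers the slide's question about Bob.

```python
from collections import defaultdict

# Rows from the slide's table: (node, relationship, endpoint).
EDGES = [
    ("Bill", "is a", "Person"), ("Bill", "visited", "Eiffel Tower"),
    ("Alice", "is a", "Person"), ("Alice", "visited", "Eiffel Tower"),
    ("Alice", "friend of", "Bob"),
    ("Bob", "is a", "Person"), ("Bob", "born on", "7/14/90"),
    ("Bob", "friend of", "Alice"), ("Bob", "interested in", "Mona Lisa"),
    ("Leonardo da Vinci", "is a", "Person"),
    ("La Joconde a Washington", "is a", "Video"),
    ("La Joconde a Washington", "is about", "Mona Lisa"),
    ("Eiffel Tower", "is a", "Place"), ("Eiffel Tower", "located in", "Paris"),
    ("7/14/90", "is a", "Date"), ("Paris", "is a", "City"),
    ("The Louvre", "is a", "Museum"), ("The Louvre", "located in", "Paris"),
    ("Mona Lisa", "is a", "Painting"),
    ("Mona Lisa", "created by", "Leonardo da Vinci"),
    ("Mona Lisa", "exhibited at", "The Louvre"),
]

by_node = defaultdict(list)  # node query (primary): node -> its edges
by_edge = defaultdict(list)  # edge query (index): (relationship, endpoint) -> nodes
for node, relationship, endpoint in EDGES:
    by_node[node].append((relationship, endpoint))
    by_edge[(relationship, endpoint)].append(node)


def endpoints(node, relationship):
    """Follow one edge type outward from a node."""
    return [e for r, e in by_node[node] if r == relationship]


def friend_recommendations(person):
    """Hybrid query: places friends visited in the city of the person's interest."""
    cities = set()
    for interest in endpoints(person, "interested in"):
        for venue in endpoints(interest, "exhibited at"):
            cities.update(endpoints(venue, "located in"))
    places = set()
    for friend in endpoints(person, "friend of"):
        for place in endpoints(friend, "visited"):
            if cities & set(endpoints(place, "located in")):
                places.add(place)
    return sorted(places)


print(by_node["Bob"])                        # node query
print(by_edge[("visited", "Eiffel Tower")])  # edge query
print(friend_recommendations("Bob"))         # hybrid query (traversal)
```

The node and edge queries are single lookups, which a key-value table with one index can serve; the traversal chains several lookups together, which is the access pattern a graph database is built to handle.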
  • 25. Amazon Redshift: data warehousing. Fast, powerful, and simple data warehousing at 1/10 the cost. Massively parallel, petabyte scale. Fast: columnar storage technology to improve I/O efficiency and parallelize queries; data load scales linearly. Inexpensive: as low as $1,000 per terabyte per year, 1/10th the cost of traditional data warehouse solutions. Scalable: resize your cluster up and down as your performance and capacity needs change. Secure: data encrypted at rest and in transit; isolate clusters with VPC; manage your own keys with AWS KMS.
  • 26. Amazon Athena: interactive analysis. Interactive query service to analyze data in Amazon S3 using standard SQL. No infrastructure to set up or manage and no data to load. Ability to run SQL queries on data archived in Amazon Glacier (coming soon). Serverless: zero setup cost; just point to Amazon S3 and start querying. Pay per query: pay only for queries run; save 30–90% on per-query costs through compression. Open: ANSI SQL interface, JDBC/ODBC drivers, multiple formats, compression types, and complex joins and data types. Easy: serverless, zero infrastructure, zero administration.
  • 27. Categories of database. SQL: optimized for storage; normalized relational or dimensional DW; ad hoc queries and aggregations; scale vertically; great for OLAP and DSS. NoSQL: optimized for compute; denormalized document, wide column or key value; instantiated views and computed aggregations; scale horizontally; built for OLTP or DSS at scale. Graph: optimized for relationships; denormalized entity relationship; ad hoc entity/relationship aggregations; hybrid scaling; designed for graph traversals.
  • 28. The iron triangle of purpose (the PIE theorem): pick two. Pattern flexibility (P): the database supports random access patterns and ad hoc queries. Infinite scale (I): the database can gracefully increase size and throughput without practical limits. Efficiency (E): the database will deliver required query latency for the workload at all times. Services placed on the triangle (edges labeled PI, IE, PE): Amazon RDS, Elasticsearch, Aurora Serverless, Neptune, Amazon DynamoDB, Amazon Redshift, Athena. Data models: relational, wide column, document, graph, columnar, unstructured.
  • 29. Hundreds of thousands of customers use DynamoDB
  • 30. Hundreds of thousands more use Amazon RDS
  • 31. Purpose-built database solutions from AWS, with zero unplanned downtime. Infrastructure and software tasks listed: provisioning; capacity planning; monitoring; OS patching; hardware upgrades; database upgrades; security patches; scaling; monitoring; performance tuning; replication across data centers; re-replicate on server failure; provision new regions.
  • 32. Submit session feedback: 1. Tap the Schedule icon. 2. Select the session you attended. 3. Tap Session Evaluation to submit your feedback.
  • 33. Thank you!