© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Which Database to Use When?
Tony Petrossian, Director of Engineering, DynamoDB
Ian Meyers, Head of DBS Customer Advisory Team
DAT310
November 28, 2017
A Short Break from Generalities
Relational | Non-Relational
NoSQL | SQL
Schema | Schema-free
Unstructured | Structured
Looking at the Specifics
Purpose of a database | Your application needs
Database Workloads
Data Considerations
Shape | Size | Compute
Shape
Purpose-built | Optimized for when you need to | Example workload
Row Store | Operate on a record or group of records | Payroll
Column Store | Aggregations, scans and joins | Analytics
Key-Value Store | Query by key with high throughput & fast ingestion | Tracking devices
Document Store | Index & store documents for query on any property | Patient data
Graph Store | Persist and retrieve relationships | Recommendations
Time-Series Store | Store and process data in sequence | Process engine telemetry
Unstructured Store | Get and put of objects | Store user reviews
Size
Considerations | Example workload
Size at limit – bounded or unbounded | Number of employees – bounded. Number of sensors – unbounded
Working set size & caching | 10 years of sales data but only the last 12 months are queried. Session data for users of a streaming service
Retrieval size | Get one row. Get one thousand rows
Partitionable or monolithic | Storage and processing of car location data is partitionable. Company payroll data has no natural partition boundary
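To make the partitionable case concrete, here is a minimal sketch of hash partitioning for car location data. The key name `car_id`, the partition count of 4, and the function name `partition_for` are illustrative assumptions, not anything taken from the talk.

```python
import hashlib


def partition_for(car_id: str, num_partitions: int) -> int:
    """Map a car ID to a partition with a stable hash (hypothetical scheme)."""
    if num_partitions <= 0:
        raise ValueError("num_partitions must be positive")
    digest = hashlib.sha256(car_id.encode("utf-8")).digest()
    # Treat the first 8 bytes as an unsigned integer, then reduce modulo the partition count.
    return int.from_bytes(digest[:8], "big") % num_partitions


# Every reading for the same car lands on the same partition,
# so each partition can be stored and processed independently.
readings = [
    ("car-17", 47.61, -122.33),
    ("car-42", 40.71, -74.00),
    ("car-17", 47.62, -122.35),
]
partitions = {}
for car_id, lat, lon in readings:
    partitions.setdefault(partition_for(car_id, 4), []).append((car_id, lat, lon))
```

Payroll data, by contrast, offers no key like `car_id` whose values can be processed in isolation, which is what the slide means by no natural partition boundary.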
Compute
Considerations | Example workload
Compute functions | Sum of sales for the last 12 months. Get & put data
Throughput | A million users browsing a product catalogue every second. 50 doctors looking at 300 patient records per day
Latency | Get the location of a car in 5 milliseconds. Get the min, max & average deal size for the last 12 months in 5 seconds
Change rate | Inventory counts are frequently updated. Sales records are never updated
Rate of ingestion | Location telemetry from cars added to the database every minute. New employee records being added to the database
My [insert your favorite DB] works for everything
General purpose | Special purpose
One size fits all | Efficiency at scale
But Which Database to Use When?
Decision points and considerations
Managed database services
DevOps
Build: code, integrate, test
Deploy: provision, configure, rollout
Operate: secure, monitor, scale, HA
All conveniently located at the end of an API call
But Which Databases to Use When?
Why pick just one?
Our Strategy
Back to Generalities
Operational | Analytics
transactional | retrospective
system of record | streaming
content management | predictive
Operational workloads
General characteristics
• Usually a good fit for caching
• Small compute size – few rows, items, documents per request
• Low-latency
• High-throughput
• High-concurrency
• Mission critical HA, DR and data protection
Primary dimensions to consider
• Size at limit – bounded or unbounded
• Rows, key-values or documents
• Need relational capabilities or not
• Partitioned or monolithic
• Push-down compute requirements
• Change velocity
• Ingestion requirements
Amazon RDS
Managed relational database service with a choice of six popular database engines
Easy to administer: No need for infrastructure provisioning, installing and maintaining database software
Highly scalable: Scale database compute and storage with a few mouse clicks with no downtime
Available & Durable: Multi-AZ automatically replicates data to a different AZ. Automated backup, snapshots, failover
Fast: Choose between two SSD-backed storage options for high-performance OLTP
Amazon ElastiCache
Managed, in-memory data store service.
Redis or Memcached to power real-time apps with sub-millisecond latency.
Extreme Performance: In-memory data store and cache using optimized stack to deliver sub-millisecond response times
Secure & hardened: VPC for cluster isolation, encryption at rest/transit, and HIPAA compliance
Easily scalable: Read scaling with replicas. Write and memory scaling with sharding. Non-disruptive scaling
Highly available & reliable: Multi-AZ with automatic failover
Caching
ElastiCache can be added to most operational databases to improve read latency and, if required, reduce provisioned read IOPS, provided your working set fits in the cache and you can get a good hit rate.
Your application needs to be aware of the cache that fronts the database.
It is important to understand the ‘cache aside’ pattern and the impact of stale reads on your application.
Figure: cache-aside read path between Application, ElastiCache, and Amazon RDS (labels: Read, Miss, Read, Write, Respond, Read Value).
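The cache-aside read path can be sketched in a few lines. This is a minimal illustration, not ElastiCache or RDS client code: a plain dictionary stands in for the cache, `dict.get` stands in for a database query, and the class name `CacheAsideReader` is hypothetical.

```python
from typing import Callable, Dict, Optional


class CacheAsideReader:
    """Cache-aside read path: check the cache, fall back to the database on a miss."""

    def __init__(self, load_from_db: Callable[[str], Optional[str]]):
        self._cache: Dict[str, str] = {}   # stand-in for ElastiCache
        self._load_from_db = load_from_db  # stand-in for a query against the database
        self.hits = 0
        self.misses = 0

    def get(self, key: str) -> Optional[str]:
        if key in self._cache:
            self.hits += 1
            return self._cache[key]
        self.misses += 1
        value = self._load_from_db(key)
        if value is not None:
            self._cache[key] = value  # populate the cache for later reads
        return value

    def hit_rate(self) -> float:
        total = self.hits + self.misses
        return self.hits / total if total else 0.0


database = {"user:1": "Ada", "user:2": "Grace"}
reader = CacheAsideReader(database.get)
first = reader.get("user:1")   # miss: read from the database, then cache the value
second = reader.get("user:1")  # hit: served from the cache
```

The application owns both the lookup and the population step, which is why it has to be aware of the cache; `hit_rate()` is the number to watch when judging whether the working set fits.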
Caching: Stale Cache Reads
Figure: write path between Application, ElastiCache, and Amazon RDS (labels: Write, Respond, Delete, Write).
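A small sketch shows how a stale cache read arises and how deleting the cached entry on write avoids it. Dictionaries stand in for the cache and the database, and the function names and the `stock:widget` key are hypothetical.

```python
cache = {}                        # stand-in for ElastiCache
database = {"stock:widget": 10}   # stand-in for the database


def read(key):
    """Cache-aside read: serve from the cache, else load from the database and cache it."""
    if key not in cache:
        cache[key] = database[key]
    return cache[key]


def write_without_invalidation(key, value):
    """Update only the database; any cached copy is now stale."""
    database[key] = value


def write_with_invalidation(key, value):
    """Update the database, then delete the cached copy so the next read reloads it."""
    database[key] = value
    cache.pop(key, None)


read("stock:widget")                           # caches the value 10
write_without_invalidation("stock:widget", 9)
stale = read("stock:widget")                   # served from the cache: a stale read
write_with_invalidation("stock:widget", 8)
fresh = read("stock:widget")                   # cache entry was deleted, so the read reloads
```

Deleting on write narrows the window for stale reads but does not close it: a concurrent reader can still repopulate an old value, so an expiry time on cached entries is a common backstop.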
Amazon DynamoDB
Fast and flexible NoSQL database service for any scale
NoSQL database that supports both document and key-value structures
Fast, consistent performance: Consistently single-digit millisecond latencies at any scale. DAX speeds up response times to microseconds.
Highly scalable: Auto-scaling tables serving millions of requests per second, storing hundreds of terabytes of data.
Fully managed: Automatic provisioning and infrastructure management.
Business-critical reliability: Data is replicated across fault-tolerant Availability Zones, with fine-grained access control.
Amazon DynamoDB Accelerator (DAX)
Fully managed write-through cache for DynamoDB
Fully managed, in-memory cache for DynamoDB. Reduces DynamoDB response times from milliseconds to microseconds.
Caching
DAX is fully integrated caching for DynamoDB at the API level, so no additional application considerations are needed to use DAX.
If your working set fits in cache and you need the lower latency, DAX is a great option.
Figure: write path between Application, DAX, and Amazon DynamoDB (labels: Item, Write, Respond, Write, Write; caption: No Stale Cache Reads).
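For contrast with cache-aside, here is a minimal sketch of the general write-through idea. It is not DAX's implementation or client API: a dictionary stands in for the table, and the class name `WriteThroughCache` is hypothetical.

```python
class WriteThroughCache:
    """Write-through: every write goes through the cache to the backing store."""

    def __init__(self, store: dict):
        self._store = store   # stand-in for a DynamoDB table
        self._items = {}      # stand-in for the cached items

    def put(self, key, item):
        self._store[key] = item   # write to the backing store first
        self._items[key] = item   # then update the cached copy

    def get(self, key):
        if key in self._items:
            return self._items[key]
        item = self._store.get(key)
        if item is not None:
            self._items[key] = item
        return item


table = {}
dax = WriteThroughCache(table)
dax.put("car:17", {"lat": 47.61, "lon": -122.33})
dax.get("car:17")
dax.put("car:17", {"lat": 47.62, "lon": -122.35})
latest = dax.get("car:17")   # reflects the second write, not the first
```

Because the cache sits in the write path, the application calls one interface for reads and writes. The sketch only models writes that go through the cache; writes made directly to the table are outside what it covers.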
High Availability and Durability
Amazon DynamoDB
DynamoDB is always Multi-AZ durable
Writes are synchronous to two Availability Zones
Reads are Multi-AZ consistent if requested in the API call
Consumers can read an item from any of the 3 nodes hosting the partition
Amazon Aurora
Amazon Aurora is always Multi-AZ durable
Writes are synchronous to 4 nodes of the 6-node storage clique
Reads are transactionally consistent from only 3 nodes
Consumers must read from the primary node
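The Aurora figures above (6 storage nodes, writes to 4, consistent reads from 3) can be checked against the general quorum-overlap conditions. The slide does not state these rules; they are the standard conditions for quorum systems, and the function names are hypothetical.

```python
def quorums_overlap(nodes: int, write_quorum: int, read_quorum: int) -> bool:
    """True when every read quorum shares at least one node with every write quorum."""
    return write_quorum + read_quorum > nodes


def write_quorums_intersect(nodes: int, write_quorum: int) -> bool:
    """True when any two write quorums share a node (the write quorum is a majority)."""
    return 2 * write_quorum > nodes


# Figures from the Aurora bullets above: 6 storage nodes, 4-node writes, 3-node reads.
aurora_reads_see_writes = quorums_overlap(nodes=6, write_quorum=4, read_quorum=3)
aurora_writes_intersect = write_quorums_intersect(nodes=6, write_quorum=4)
```

With 4 + 3 > 6, any 3 nodes read must include at least one node that acknowledged the latest write. A 3-node write quorum would not give that guarantee with 3-node reads.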
Relational capabilities
An RDBMS provides multi-table, multi-record transactions, referential integrity and locking.
DynamoDB provides Atomicity, Consistency (at the Item level), Durability, and automatic partitioning at any scale in exchange for relational capabilities.
You must consider the scale of your requirement, the skills of your team, and data model complexity to make a good choice.
Figure labels: NoSQL Skills, Data Model, Massive Scale
Operational database dimensions
Size at limit – bounded ✔
Size at limit – unbounded ✔
key-values or documents ✔
Rows ✔
Need relational capabilities ✔
Partitioned ✔
Push-down compute requirements ✔
Change Velocity ✔ ✔
Ingestion requirements ✔
Amazon DynamoDB | Amazon RDS
A few Examples…
Analytic workloads
General characteristics
• Almost always columnar
• Large and usually partitioned
• Large compute size – millions of items involved in a query
• Heavy compute push-down
• Batch writes or trickle inserts
• Little to no updates
• Needs a lot of memory and often in-memory compute capabilities
Primary dimensions to consider
• Streaming or not
• Latency requirements
• ETL or no ETL
• Serverless or dedicated compute
• Always active or occasionally active
• Data formats
Amazon Redshift – Data Warehousing
Fast, powerful, and simple data warehousing at 1/10 the cost
Massively parallel, petabyte scale
Fast: Columnar storage technology to improve I/O efficiency and parallelize queries. Data load scales linearly.
Inexpensive: As low as $1,000 per terabyte per year, 1/10th the cost of traditional data warehouse solutions.
Scalable: Resize your cluster up and down as your performance and capacity needs change.
Secure: Data encrypted at rest and in transit. Isolate clusters with VPC. Manage your own keys with AWS KMS.
Amazon Athena – Interactive analysis
Interactive query service to analyze data in Amazon S3 using standard SQL
No infrastructure to set up or manage and no data to load
Query Instantly: Zero setup cost. Just point to S3 and start querying
Pay per query: Pay only for queries run. Save 30-90% on per-query costs through compression
Open: ANSI SQL interface, JDBC/ODBC drivers, multiple formats, compression types, and complex joins and data types
Easy: Serverless. Zero infrastructure. Zero administration
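The link between compression and per-query cost is simple arithmetic, sketched below under the assumption that a query is billed in proportion to the bytes it scans. The price of 5.0 per terabyte, the 2 TB query, and the 4:1 compression ratio are placeholder values chosen for illustration, not figures from the talk.

```python
def query_cost(bytes_scanned: int, price_per_tb: float) -> float:
    """Cost of one query when billing is proportional to the data scanned."""
    return bytes_scanned / 1024**4 * price_per_tb


PRICE_PER_TB = 5.0  # placeholder value for illustration; check current pricing

raw_bytes = 2 * 1024**4            # a query that scans 2 TB of uncompressed data
compressed_bytes = raw_bytes // 4  # assume compression shrinks the data to 25%

raw_cost = query_cost(raw_bytes, PRICE_PER_TB)
compressed_cost = query_cost(compressed_bytes, PRICE_PER_TB)
saving = 1 - compressed_cost / raw_cost  # fraction of the per-query cost saved
```

Under these assumptions the saving equals the fraction of bytes no longer scanned, so the achievable range depends on how well the data compresses.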
Amazon Kinesis Analytics
Process and Analyze Streaming Data in Real-time with SQL
Amazon Elasticsearch Service
Easy to deploy, secure, operate, and scale Elasticsearch
Customers use Elasticsearch for log analytics, full text search, & application monitoring
Easy to Use: Fully managed. Deploy production-ready clusters in minutes.
Open: Direct access to Elasticsearch open-source APIs. Supports Logstash and Kibana.
Secure: Secure access with VPC to keep all traffic within the AWS network.
Available: Zone awareness replicates data between two AZs. Automatically monitors & replaces failed nodes.
Analytics database dimensions
Streaming analytics ✔
Serverless ad-hoc query ✔
Process, prepare and index in-place ✔
Low-latency for reporting and BI dashboards ✔
Pay per query ✔
Data warehouse with multiple enterprise data sources ✔
Query data directly in S3 without format conversions ✔
Directly query CSV, JSON, TSV or text files ✔
Amazon Redshift | Athena | Kinesis Analytics
Figure labels: Well Modelled Data, Data Exploration, Non-SQL Analytics, Real-Time Analytics, Managed Storage Delivery
Services shown: Amazon Elasticsearch Service, Kinesis Analytics, Athena, Amazon Redshift, Amazon S3
Why pick one when you can use all three?
AWS Databases and Analytics
Broadest and deepest portfolio, purpose-built for builders
Business Intelligence & Machine Learning: QuickSight, Machine Learning
Relational Databases: RDS, Aurora
Non-Relational Databases: DynamoDB (Key value/Document), ElastiCache (Redis, Memcached)
Analytics: Redshift, EMR, Athena, Kinesis Analytics, Elasticsearch Service
DW | Big Data Processing | Ad hoc | Real-time | Operational Databases
Data Lake: S3/Glacier (Storage), Glue (ETL & Data Catalog), Macie (Data Protection)
Data Movement: Database Migration Service | Snowball | Snowmobile | Kinesis Data Firehose | Kinesis Data Streams
In Closing
AWS offers a myriad of services designed to help you solve your toughest problems at scale – no need to just pick one service
When selecting a data service, consider the dimensions and pick the best match for each component of your application
Thank you!
Please fill in the session survey
We hope you enjoyed the discussion!
DAT310: Which Database to Use When?

More Related Content

What's hot

ARC329_Optimizing Performance and Efficiency for Amazon EC2 and More with Tur...
ARC329_Optimizing Performance and Efficiency for Amazon EC2 and More with Tur...ARC329_Optimizing Performance and Efficiency for Amazon EC2 and More with Tur...
ARC329_Optimizing Performance and Efficiency for Amazon EC2 and More with Tur...
Amazon Web Services
 
GPSTEC315_GPS Optimizing Tips Amazon Redshift for Cloud Data
GPSTEC315_GPS Optimizing Tips Amazon Redshift for Cloud DataGPSTEC315_GPS Optimizing Tips Amazon Redshift for Cloud Data
GPSTEC315_GPS Optimizing Tips Amazon Redshift for Cloud Data
Amazon Web Services
 
NET309_Best Practices for Securing an Amazon Virtual Private Cloud
NET309_Best Practices for Securing an Amazon Virtual Private CloudNET309_Best Practices for Securing an Amazon Virtual Private Cloud
NET309_Best Practices for Securing an Amazon Virtual Private Cloud
Amazon Web Services
 
ABD201-Big Data Architectural Patterns and Best Practices on AWS
ABD201-Big Data Architectural Patterns and Best Practices on AWSABD201-Big Data Architectural Patterns and Best Practices on AWS
ABD201-Big Data Architectural Patterns and Best Practices on AWS
Amazon Web Services
 
ARC319_Multi-Region Active-Active Architecture
ARC319_Multi-Region Active-Active ArchitectureARC319_Multi-Region Active-Active Architecture
ARC319_Multi-Region Active-Active Architecture
Amazon Web Services
 
AWS Commercial Management and Cost Optimisation - Dec 2017
AWS Commercial Management and Cost Optimisation - Dec 2017AWS Commercial Management and Cost Optimisation - Dec 2017
AWS Commercial Management and Cost Optimisation - Dec 2017
Amazon Web Services
 
Hybrid Cloud Data Management: Using Data for Business Outcomes - STG308 - re:...
Hybrid Cloud Data Management: Using Data for Business Outcomes - STG308 - re:...Hybrid Cloud Data Management: Using Data for Business Outcomes - STG308 - re:...
Hybrid Cloud Data Management: Using Data for Business Outcomes - STG308 - re:...
Amazon Web Services
 
Deep Dive on Amazon Glacier - STG303 - re:Invent 2017
Deep Dive on Amazon Glacier - STG303 - re:Invent 2017Deep Dive on Amazon Glacier - STG303 - re:Invent 2017
Deep Dive on Amazon Glacier - STG303 - re:Invent 2017
Amazon Web Services
 
STG206_Big Data Data Lakes and Data Oceans
STG206_Big Data Data Lakes and Data OceansSTG206_Big Data Data Lakes and Data Oceans
STG206_Big Data Data Lakes and Data Oceans
Amazon Web Services
 
GPSWKS301_Comprehensive Big Data Architecture Made Easy
GPSWKS301_Comprehensive Big Data Architecture Made EasyGPSWKS301_Comprehensive Big Data Architecture Made Easy
GPSWKS301_Comprehensive Big Data Architecture Made Easy
Amazon Web Services
 
Design patterns and best practices for data analytics with amazon emr (ABD305)
Design patterns and best practices for data analytics with amazon emr (ABD305)Design patterns and best practices for data analytics with amazon emr (ABD305)
Design patterns and best practices for data analytics with amazon emr (ABD305)
Amazon Web Services
 
ARC306_High Resiliency & Availability Of Online Entertainment Communities Usi...
ARC306_High Resiliency & Availability Of Online Entertainment Communities Usi...ARC306_High Resiliency & Availability Of Online Entertainment Communities Usi...
ARC306_High Resiliency & Availability Of Online Entertainment Communities Usi...
Amazon Web Services
 
STG311_Deep Dive on Amazon S3 & Amazon Glacier Storage Management
STG311_Deep Dive on Amazon S3 & Amazon Glacier Storage ManagementSTG311_Deep Dive on Amazon S3 & Amazon Glacier Storage Management
STG311_Deep Dive on Amazon S3 & Amazon Glacier Storage Management
Amazon Web Services
 
STG306_Deep Dive on Amazon EBS
STG306_Deep Dive on Amazon EBSSTG306_Deep Dive on Amazon EBS
STG306_Deep Dive on Amazon EBS
Amazon Web Services
 
ARC303_Running Lean Architectures How to Optimize for Cost Efficiency
ARC303_Running Lean Architectures How to Optimize for Cost EfficiencyARC303_Running Lean Architectures How to Optimize for Cost Efficiency
ARC303_Running Lean Architectures How to Optimize for Cost Efficiency
Amazon Web Services
 
DAT339_Replicate, Analyze, and Visualize Datasets Using AWS Database Migratio...
DAT339_Replicate, Analyze, and Visualize Datasets Using AWS Database Migratio...DAT339_Replicate, Analyze, and Visualize Datasets Using AWS Database Migratio...
DAT339_Replicate, Analyze, and Visualize Datasets Using AWS Database Migratio...
Amazon Web Services
 
DAT332_How Verizon is Adopting Amazon Aurora PostgreSQL for Enterprise Workloads
DAT332_How Verizon is Adopting Amazon Aurora PostgreSQL for Enterprise WorkloadsDAT332_How Verizon is Adopting Amazon Aurora PostgreSQL for Enterprise Workloads
DAT332_How Verizon is Adopting Amazon Aurora PostgreSQL for Enterprise Workloads
Amazon Web Services
 
ABD310 big data aws and security no notes
ABD310 big data aws and security no notesABD310 big data aws and security no notes
ABD310 big data aws and security no notes
Amazon Web Services
 
ATC303-Cache Me If You Can Minimizing Latency While Optimizing Cost Through A...
ATC303-Cache Me If You Can Minimizing Latency While Optimizing Cost Through A...ATC303-Cache Me If You Can Minimizing Latency While Optimizing Cost Through A...
ATC303-Cache Me If You Can Minimizing Latency While Optimizing Cost Through A...
Amazon Web Services
 
CMP310_Deep Dive on Amazon EBS
CMP310_Deep Dive on Amazon EBSCMP310_Deep Dive on Amazon EBS
CMP310_Deep Dive on Amazon EBS
Amazon Web Services
 

What's hot (20)

ARC329_Optimizing Performance and Efficiency for Amazon EC2 and More with Tur...
ARC329_Optimizing Performance and Efficiency for Amazon EC2 and More with Tur...ARC329_Optimizing Performance and Efficiency for Amazon EC2 and More with Tur...
ARC329_Optimizing Performance and Efficiency for Amazon EC2 and More with Tur...
 
GPSTEC315_GPS Optimizing Tips Amazon Redshift for Cloud Data
GPSTEC315_GPS Optimizing Tips Amazon Redshift for Cloud DataGPSTEC315_GPS Optimizing Tips Amazon Redshift for Cloud Data
GPSTEC315_GPS Optimizing Tips Amazon Redshift for Cloud Data
 
NET309_Best Practices for Securing an Amazon Virtual Private Cloud
NET309_Best Practices for Securing an Amazon Virtual Private CloudNET309_Best Practices for Securing an Amazon Virtual Private Cloud
NET309_Best Practices for Securing an Amazon Virtual Private Cloud
 
ABD201-Big Data Architectural Patterns and Best Practices on AWS
ABD201-Big Data Architectural Patterns and Best Practices on AWSABD201-Big Data Architectural Patterns and Best Practices on AWS
ABD201-Big Data Architectural Patterns and Best Practices on AWS
 
ARC319_Multi-Region Active-Active Architecture
ARC319_Multi-Region Active-Active ArchitectureARC319_Multi-Region Active-Active Architecture
ARC319_Multi-Region Active-Active Architecture
 
AWS Commercial Management and Cost Optimisation - Dec 2017
AWS Commercial Management and Cost Optimisation - Dec 2017AWS Commercial Management and Cost Optimisation - Dec 2017
AWS Commercial Management and Cost Optimisation - Dec 2017
 
Hybrid Cloud Data Management: Using Data for Business Outcomes - STG308 - re:...
Hybrid Cloud Data Management: Using Data for Business Outcomes - STG308 - re:...Hybrid Cloud Data Management: Using Data for Business Outcomes - STG308 - re:...
Hybrid Cloud Data Management: Using Data for Business Outcomes - STG308 - re:...
 
Deep Dive on Amazon Glacier - STG303 - re:Invent 2017
Deep Dive on Amazon Glacier - STG303 - re:Invent 2017Deep Dive on Amazon Glacier - STG303 - re:Invent 2017
Deep Dive on Amazon Glacier - STG303 - re:Invent 2017
 
STG206_Big Data Data Lakes and Data Oceans
STG206_Big Data Data Lakes and Data OceansSTG206_Big Data Data Lakes and Data Oceans
STG206_Big Data Data Lakes and Data Oceans
 
GPSWKS301_Comprehensive Big Data Architecture Made Easy
GPSWKS301_Comprehensive Big Data Architecture Made EasyGPSWKS301_Comprehensive Big Data Architecture Made Easy
GPSWKS301_Comprehensive Big Data Architecture Made Easy
 
Design patterns and best practices for data analytics with amazon emr (ABD305)
Design patterns and best practices for data analytics with amazon emr (ABD305)Design patterns and best practices for data analytics with amazon emr (ABD305)
Design patterns and best practices for data analytics with amazon emr (ABD305)
 
ARC306_High Resiliency & Availability Of Online Entertainment Communities Usi...
ARC306_High Resiliency & Availability Of Online Entertainment Communities Usi...ARC306_High Resiliency & Availability Of Online Entertainment Communities Usi...
ARC306_High Resiliency & Availability Of Online Entertainment Communities Usi...
 
STG311_Deep Dive on Amazon S3 & Amazon Glacier Storage Management
STG311_Deep Dive on Amazon S3 & Amazon Glacier Storage ManagementSTG311_Deep Dive on Amazon S3 & Amazon Glacier Storage Management
STG311_Deep Dive on Amazon S3 & Amazon Glacier Storage Management
 
STG306_Deep Dive on Amazon EBS
STG306_Deep Dive on Amazon EBSSTG306_Deep Dive on Amazon EBS
STG306_Deep Dive on Amazon EBS
 
ARC303_Running Lean Architectures How to Optimize for Cost Efficiency
ARC303_Running Lean Architectures How to Optimize for Cost EfficiencyARC303_Running Lean Architectures How to Optimize for Cost Efficiency
ARC303_Running Lean Architectures How to Optimize for Cost Efficiency
 
DAT339_Replicate, Analyze, and Visualize Datasets Using AWS Database Migratio...
DAT339_Replicate, Analyze, and Visualize Datasets Using AWS Database Migratio...DAT339_Replicate, Analyze, and Visualize Datasets Using AWS Database Migratio...
DAT339_Replicate, Analyze, and Visualize Datasets Using AWS Database Migratio...
 
DAT332_How Verizon is Adopting Amazon Aurora PostgreSQL for Enterprise Workloads
DAT332_How Verizon is Adopting Amazon Aurora PostgreSQL for Enterprise WorkloadsDAT332_How Verizon is Adopting Amazon Aurora PostgreSQL for Enterprise Workloads
DAT332_How Verizon is Adopting Amazon Aurora PostgreSQL for Enterprise Workloads
 
ABD310 big data aws and security no notes
ABD310 big data aws and security no notesABD310 big data aws and security no notes
ABD310 big data aws and security no notes
 
ATC303-Cache Me If You Can Minimizing Latency While Optimizing Cost Through A...
ATC303-Cache Me If You Can Minimizing Latency While Optimizing Cost Through A...ATC303-Cache Me If You Can Minimizing Latency While Optimizing Cost Through A...
ATC303-Cache Me If You Can Minimizing Latency While Optimizing Cost Through A...
 
CMP310_Deep Dive on Amazon EBS
CMP310_Deep Dive on Amazon EBSCMP310_Deep Dive on Amazon EBS
CMP310_Deep Dive on Amazon EBS
 

Similar to DAT310_Which Database to Use When

Choosing the Right Database for My Workload: Purpose-Built Databases
Choosing the Right Database for My Workload: Purpose-Built Databases Choosing the Right Database for My Workload: Purpose-Built Databases
Choosing the Right Database for My Workload: Purpose-Built Databases
AWS Germany
 
AWS Database and Analytics State of the Union
AWS Database and Analytics State of the UnionAWS Database and Analytics State of the Union
AWS Database and Analytics State of the Union
Amazon Web Services
 
Choose the right DB for the Job - Builders Day Israel
Choose the right DB for the Job - Builders Day IsraelChoose the right DB for the Job - Builders Day Israel
Choose the right DB for the Job - Builders Day Israel
Amazon Web Services
 
Architecting an Open Data Lake for the Enterprise
Architecting an Open Data Lake for the EnterpriseArchitecting an Open Data Lake for the Enterprise
Architecting an Open Data Lake for the Enterprise
Amazon Web Services
 
AWS Database and Analytics State of the Union
AWS Database and Analytics State of the UnionAWS Database and Analytics State of the Union
AWS Database and Analytics State of the Union
Amazon Web Services
 
Managed NoSQL databases
Managed NoSQL databasesManaged NoSQL databases
Managed NoSQL databases
Amazon Web Services
 
ABD312_Deep Dive Migrating Big Data Workloads to AWS
ABD312_Deep Dive Migrating Big Data Workloads to AWSABD312_Deep Dive Migrating Big Data Workloads to AWS
ABD312_Deep Dive Migrating Big Data Workloads to AWS
Amazon Web Services
 
Using AWS Purpose-Built Databases to Modernize your Applications
Using AWS Purpose-Built Databases to Modernize your ApplicationsUsing AWS Purpose-Built Databases to Modernize your Applications
Using AWS Purpose-Built Databases to Modernize your Applications
Amazon Web Services
 
Building low latency apps with a serverless architecture and in-memory data I...
Building low latency apps with a serverless architecture and in-memory data I...Building low latency apps with a serverless architecture and in-memory data I...
Building low latency apps with a serverless architecture and in-memory data I...
AWS Germany
 
DynamoDB - What's new - DAT304 - re:Invent 2017
DynamoDB - What's new - DAT304 - re:Invent 2017DynamoDB - What's new - DAT304 - re:Invent 2017
DynamoDB - What's new - DAT304 - re:Invent 2017
Amazon Web Services
 
Applying AWS Purpose-Built Database Strategy - SRV307 - Toronto AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Toronto AWS SummitApplying AWS Purpose-Built Database Strategy - SRV307 - Toronto AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Toronto AWS Summit
Amazon Web Services
 
AWS Purpose-Built Database Strategy: The Right Tool For The Right Job
AWS Purpose-Built Database Strategy: The Right Tool For The Right JobAWS Purpose-Built Database Strategy: The Right Tool For The Right Job
AWS Purpose-Built Database Strategy: The Right Tool For The Right Job
Amazon Web Services
 
Migrating your traditional Data Warehouse to a Modern Data Lake
Migrating your traditional Data Warehouse to a Modern Data LakeMigrating your traditional Data Warehouse to a Modern Data Lake
Migrating your traditional Data Warehouse to a Modern Data Lake
Amazon Web Services
 
Technology Trends in Data Processing - DAT311 - re:Invent 2017
Technology Trends in Data Processing - DAT311 - re:Invent 2017Technology Trends in Data Processing - DAT311 - re:Invent 2017
Technology Trends in Data Processing - DAT311 - re:Invent 2017
Amazon Web Services
 
SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
 SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ... SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
Amazon Web Services
 
21st Century Analytics with Zopa
21st Century Analytics with Zopa21st Century Analytics with Zopa
21st Century Analytics with Zopa
Amazon Web Services
 
Immersion Day - Como simplificar o acesso ao seu ambiente analítico
Immersion Day - Como simplificar o acesso ao seu ambiente analíticoImmersion Day - Como simplificar o acesso ao seu ambiente analítico
Immersion Day - Como simplificar o acesso ao seu ambiente analítico
Amazon Web Services LATAM
 
FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...
FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...
FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...
Amazon Web Services
 
How TrueCar Gains Actionable Insights with Splunk Cloud PPT
How TrueCar Gains Actionable Insights with Splunk Cloud PPTHow TrueCar Gains Actionable Insights with Splunk Cloud PPT
How TrueCar Gains Actionable Insights with Splunk Cloud PPT
Amazon Web Services
 
AWSome Day MODULE 3 - Databases
AWSome Day MODULE 3 - DatabasesAWSome Day MODULE 3 - Databases
AWSome Day MODULE 3 - Databases
Amazon Web Services
 

Similar to DAT310_Which Database to Use When (20)

Choosing the Right Database for My Workload: Purpose-Built Databases
Choosing the Right Database for My Workload: Purpose-Built Databases Choosing the Right Database for My Workload: Purpose-Built Databases
Choosing the Right Database for My Workload: Purpose-Built Databases
 
AWS Database and Analytics State of the Union
AWS Database and Analytics State of the UnionAWS Database and Analytics State of the Union
AWS Database and Analytics State of the Union
 
Choose the right DB for the Job - Builders Day Israel
Choose the right DB for the Job - Builders Day IsraelChoose the right DB for the Job - Builders Day Israel
Choose the right DB for the Job - Builders Day Israel
 
Architecting an Open Data Lake for the Enterprise

DAT310_Which Database to Use When

  • 1. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Which Database to Use When? Tony Petrossian, Director of Engineering, DynamoDB. Ian Meyers, Head of DBS Customer Advisory Team. DAT310. November 28, 2017
  • 2. A Short Break from Generalities: Relational / Non-Relational, NoSQL / SQL, Schema / Schema-free, Unstructured / Structured
  • 3. Looking at the Specifics: Purpose of a database; Your application needs
  • 4. Database Workloads. Data Considerations: Shape, Size, Compute
  • 5. Shape. Purpose-built store, what it is optimized for when you need to, and an example workload:
       Row Store: Operate on a record or group of records (example: Payroll)
       Column Store: Aggregations, scans and joins (example: Analytics)
       Key-Value Store: Query by key with high throughput & fast ingestion (example: Tracking devices)
       Document Store: Index & store documents for query on any property (example: Patient data)
       Graph Store: Persist and retrieve relationships (example: Recommendations)
       Time-Series Store: Store and process data sequence (example: Process Engine telemetry)
       Unstructured Store: Get and put of objects (example: Store user reviews)
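The shape table on slide 5 can be read as a lookup from the dominant access pattern to a store type. The following is a minimal sketch of that reading, not a selection tool: the store types and example workloads come from the slide, while the short pattern keys and the function name `suggest_store` are hypothetical labels invented for illustration.

```python
# Store types and example workloads are taken from slide 5.
# The access-pattern keys are hypothetical labels made up for this sketch.
STORE_FOR_PATTERN = {
    "record_operations": ("Row Store", "Payroll"),
    "aggregations_scans_joins": ("Column Store", "Analytics"),
    "query_by_key": ("Key-Value Store", "Tracking devices"),
    "query_any_property": ("Document Store", "Patient data"),
    "relationships": ("Graph Store", "Recommendations"),
    "data_sequence": ("Time-Series Store", "Process Engine telemetry"),
    "get_put_objects": ("Unstructured Store", "Store user reviews"),
}


def suggest_store(pattern: str) -> str:
    """Return the store type the slide lists for the given access pattern."""
    if pattern not in STORE_FOR_PATTERN:
        raise ValueError(f"unknown access pattern: {pattern!r}")
    store, _example_workload = STORE_FOR_PATTERN[pattern]
    return store
```

A real workload usually has more than one access pattern, which is why the deck later argues for picking a store per application component rather than one store overall.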
  • 6. Size. Considerations and example workloads:
       Size at limit – bounded or unbounded: Number of employees – bounded; Number of sensors – unbounded
       Working set size & caching: 10 years of sales data but only the last 12 months is queried; Session data for users of a streaming service
       Retrieval size: Get one row; Get one thousand rows
       Partitionable or monolithic: Storage and processing of car location data is partitionable; Company payroll data has no natural partition boundary
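The working-set example on slide 6 (10 years of sales data, only the last 12 months queried) can be turned into a rough sizing check. This sketch assumes data volume is spread evenly over time, which the slide does not state; the 2,000 GB data set and 256 GB cache in the usage lines are hypothetical figures.

```python
def working_set_fraction(total_months: int, queried_months: int) -> float:
    """Fraction of the data that is actively queried.

    Assumes data volume is uniform over time (an assumption of this sketch).
    """
    if total_months <= 0 or not 0 <= queried_months <= total_months:
        raise ValueError("need 0 <= queried_months <= total_months and total_months > 0")
    return queried_months / total_months


def fits_in_cache(total_gb: float, fraction: float, cache_gb: float) -> bool:
    """True if the estimated working set is no larger than the cache."""
    return total_gb * fraction <= cache_gb


if __name__ == "__main__":
    fraction = working_set_fraction(total_months=120, queried_months=12)
    # Hypothetical sizes: 2,000 GB of sales data, 256 GB of cache memory.
    print(fraction, fits_in_cache(2000.0, fraction, 256.0))
```

Under these assumptions only about a tenth of the data needs to be hot, which is the kind of workload where a cache in front of the database pays off.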
  • 7. Compute. Considerations and example workloads:
       Compute functions: Sum of sales for the last 12 months; Get & Put data
       Throughput: Million users browsing a product catalogue every second; 50 doctors looking at 300 patient records per day
       Latency: Get the location of a car in 5 milliseconds; Get the min, max & average deal size for the last 12 months in 5 seconds
       Change rate: Inventory counts are frequently updated; Sales records are never updated
       Rate of ingestion: Location telemetry from cars added to the database every minute; New employee records being added to the database
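The two throughput examples on slide 7 differ by several orders of magnitude, which is easier to see as requests per second. This sketch assumes each of the 50 doctors views 300 records per day and that the reads are spread evenly over 24 hours; both are assumptions, since the slide does not say.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def average_requests_per_second(users: int, requests_per_user_per_day: int) -> float:
    """Average request rate, assuming requests are spread evenly over a day."""
    return users * requests_per_user_per_day / SECONDS_PER_DAY


if __name__ == "__main__":
    clinic_rate = average_requests_per_second(users=50, requests_per_user_per_day=300)
    catalogue_rate = 1_000_000  # "million users ... every second", read as 1M requests/s
    print(f"clinic: {clinic_rate:.2f} req/s, catalogue: {catalogue_rate} req/s")
    print(f"ratio: {catalogue_rate / clinic_rate:,.0f}x")
```

The clinic workload averages well under one request per second, so throughput is unlikely to drive its database choice, whereas it dominates the catalogue workload.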
  • 8. My [insert your favorite DB] works for everything. General purpose; Special purpose; One size fits all; Efficiency at scale
  • 9. But Which Database to Use When? Decision points and considerations
  • 10. Managed database services. DevOps: Build (code, integrate, test); Deploy (provision, configure, rollout); Operate (secure, monitor, scale, HA). All conveniently located at the end of an API call
  • 11. But Which Databases to Use When? Why pick just one?
  • 12. Our Strategy
  • 13. Back to Generalities. Operational: transactional, system of record, content management. Analytics: retrospective, streaming, predictive
  • 14. Operational: transactional, system of record, content management. Analytics: retrospective, predictive, streaming
  • 15. Operational workloads
       General characteristics:
         • Usually a good fit for caching
         • Small compute size – few rows, items, documents per request
         • Low-latency
         • High-throughput
         • High-concurrency
         • Mission critical HA, DR and data protection
       Primary dimensions to consider:
         • Size at limit – bounded or unbounded
         • Rows, key-values or documents
         • Need relational capabilities or not
         • Partitioned or monolithic
         • Push-down compute requirements
         • Change velocity
         • Ingestion requirements
  • 16. Amazon RDS. Managed relational database service with a choice of six popular database engines
       Easy to administer: No need for infrastructure provisioning, installing and maintaining database software
       Highly scalable: Scale database compute and storage with a few mouse clicks with no downtime
       Available & Durable: Multi-AZ automatically replicates data to a different AZ. Automated backup, snapshots, failover
       Fast: Choose between two SSD-backed storage options for high-performance OLTP
  • 17. Amazon ElastiCache. Managed, in-memory data store service. Redis or Memcached to power real-time apps with sub-millisecond latency.
       Extreme Performance: In-memory data store and cache using optimized stack to deliver sub-millisecond response times
       Secure & hardened: VPC for cluster isolation, encryption at rest/transit, and HIPAA compliance
       Easily scalable: Read scaling with replicas. Write and memory scaling with sharding. Non-disruptive scaling
       Highly available & reliable: Multi-AZ with automatic failover
  • 18. Caching. ElastiCache can be added to most operational databases to improve read latency and reduce provisioned read IOPS if required, provided your working set fits in cache and you can get a good hit rate. Your application needs to be aware of the cache that fronts the database. It is important to understand the ‘cache aside’ pattern and the impact of stale reads on your application.
       [Diagram: read path between Application, ElastiCache and Amazon RDS; arrows labelled Read, Miss, Read, Write, Respond, Read Value]
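The cache-aside read path named on slide 18 can be sketched with two dictionaries standing in for ElastiCache and Amazon RDS. This is an illustrative in-memory model, not client code for either service; the class name `CacheAsideStore` and its counters are hypothetical.

```python
from typing import Dict, Optional


class CacheAsideStore:
    """Cache-aside reads: the application, not the cache, talks to the database."""

    def __init__(self, database: Dict[str, str]) -> None:
        self.database = database            # stand-in for Amazon RDS
        self.cache: Dict[str, str] = {}     # stand-in for ElastiCache
        self.hits = 0
        self.misses = 0

    def read(self, key: str) -> Optional[str]:
        if key in self.cache:
            # Cache hit: respond without touching the database.
            self.hits += 1
            return self.cache[key]
        # Cache miss: read from the database instead.
        self.misses += 1
        value = self.database.get(key)
        if value is not None:
            # Write the value into the cache so later reads are hits.
            self.cache[key] = value
        return value

    def hit_rate(self) -> float:
        total = self.hits + self.misses
        return self.hits / total if total else 0.0
```

Because the application owns both lookups, it is also the application that has to decide when a cached value is too old to serve, which is the stale-read concern the slide raises.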
  • 19. Caching. ElastiCache can be added to most operational databases to improve read latency and reduce provisioned read IOPS if required, provided your working set fits in cache and you can get a good hit rate. Your application needs to be aware of the cache that fronts the database. It is important to understand the ‘cache aside’ pattern and the impact of stale reads on your application.
       [Diagram: write path between Application, ElastiCache and Amazon RDS; arrows labelled Delete, Write, Respond, Write; annotation: Stale Cache Reads]
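The stale-read risk on slide 19 appears when a write reaches the database but the cached copy is left in place. The sketch below contrasts a write that skips invalidation with one that deletes the cache entry; dictionaries stand in for the cache and the database, and all function names are hypothetical.

```python
from typing import Dict


def cache_aside_read(database: Dict[str, int], cache: Dict[str, int], key: str) -> int:
    """Serve from the cache if possible, otherwise load from the database."""
    if key in cache:
        return cache[key]
    value = database[key]
    cache[key] = value
    return value


def write_without_invalidation(database: Dict[str, int], cache: Dict[str, int],
                               key: str, value: int) -> None:
    """Update only the database; any cached copy is now stale."""
    database[key] = value


def write_with_invalidation(database: Dict[str, int], cache: Dict[str, int],
                            key: str, value: int) -> None:
    """Update the database, then delete the cached copy so the next read reloads it."""
    database[key] = value
    cache.pop(key, None)


if __name__ == "__main__":
    database, cache = {"stock": 10}, {}
    cache_aside_read(database, cache, "stock")            # warms the cache
    write_without_invalidation(database, cache, "stock", 7)
    print(cache_aside_read(database, cache, "stock"))     # served from the stale cache entry
    write_with_invalidation(database, cache, "stock", 5)
    print(cache_aside_read(database, cache, "stock"))     # reloaded from the database
```

Even with invalidation, a reader that loaded the old value just before the write can repopulate the cache after the delete, so applications that cannot tolerate stale reads typically also bound entry lifetime with a TTL.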
  • 20. Amazon DynamoDB. Fast and flexible NoSQL database service for any scale. NoSQL database that supports both document and key-value structures
       Fast, consistent performance: Consistently single-digit millisecond latencies at any scale. DAX speeds up response times to microseconds.
       Highly scalable: Auto-scaling tables serving millions of requests per second, storing hundreds of terabytes of data.
       Fully managed: Automatic provisioning and infrastructure management.
       Business Critical Reliability: Data is replicated across fault-tolerant Availability Zones, with fine-grained access control.
  • 21. Amazon DynamoDB Accelerator (DAX). Fully managed write-through cache for DynamoDB. Fully managed, in-memory cache for DynamoDB. Reduces DynamoDB response times from milliseconds to microseconds.
  • 22. Caching. DAX is fully integrated caching for DynamoDB at the API level, so no additional application considerations are needed to use DAX. If your working set size fits in cache and you need the lower latency, DAX is a great option.
       [Diagram: write path between Application, DAX and Amazon DynamoDB; arrows labelled Item, Write, Respond, Write, Write; annotation: No Stale Cache Reads]
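A write-through cache, the pattern slide 21 attributes to DAX, updates the backing store and the cache in the same write call, so reads that go through the cache see the new item. The sketch below illustrates the general pattern with dictionaries; it does not describe how DAX is implemented, and the class and method names are hypothetical.

```python
from typing import Dict, Optional


class WriteThroughCache:
    """Generic write-through cache in front of a key-item store."""

    def __init__(self, backing_store: Dict[str, dict]) -> None:
        self.backing_store = backing_store   # stand-in for a DynamoDB table
        self.items: Dict[str, dict] = {}     # cached items

    def put_item(self, key: str, item: dict) -> None:
        # Write to the backing store first, then refresh the cached copy.
        self.backing_store[key] = item
        self.items[key] = item

    def get_item(self, key: str) -> Optional[dict]:
        if key in self.items:
            return self.items[key]
        item = self.backing_store.get(key)
        if item is not None:
            self.items[key] = item
        return item
```

In this sketch the guarantee only holds for writes that go through `put_item`; a write made directly to `backing_store` would leave the cached item unchanged.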
  • 23. High Availability and Durability
       Amazon DynamoDB: DynamoDB is always Multi-AZ durable. Writes are synchronous to two Availability Zones. Reads are Multi-AZ consistent if requested by the API request. Consumers can read an item from any of the 3 nodes hosting the partition.
       Amazon Aurora: Amazon Aurora is always Multi-AZ durable. Writes are synchronous to 4 nodes of the 6-node storage clique. Reads are transactionally consistent from only 3 nodes. Consumers must read from the primary node.
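The node counts on slide 23 can be checked against the standard quorum rule: a read is guaranteed to overlap the latest acknowledged write when the write quorum plus the read quorum exceeds the number of copies. The sketch below applies that rule to the figures from the slide. It is a simplified model and does not describe how either service actually routes consistent reads; the function name is hypothetical.

```python
def quorums_overlap(copies: int, write_quorum: int, read_quorum: int) -> bool:
    """True if every read quorum must share at least one copy with every write quorum."""
    if not (1 <= write_quorum <= copies and 1 <= read_quorum <= copies):
        raise ValueError("quorums must be between 1 and the number of copies")
    return write_quorum + read_quorum > copies


if __name__ == "__main__":
    # Figures from the slide: Aurora writes to 4 of 6 storage nodes, reads from 3.
    print(quorums_overlap(copies=6, write_quorum=4, read_quorum=3))
    # Figures from the slide: DynamoDB writes to 2 of 3, a plain read may hit any 1 node.
    print(quorums_overlap(copies=3, write_quorum=2, read_quorum=1))
```

Under this model the Aurora figures overlap, while a DynamoDB read served by any single node may miss the latest write, which matches the slide's point that consistent reads have to be requested explicitly.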
  • 24. Relational capabilities. RDBMS provide multi-table, multi-record transactions, referential integrity and locking. DynamoDB provides Atomicity, Consistency (at the item level), Durability, and automatic partitioning at any scale in exchange for relational capabilities. You must consider the scale of your requirement, the skills of your team, and data model complexity to make a good choice.
       [Diagram labels: NoSQL, Skills, Data Model, Massive Scale]
  • 25. Operational database dimensions. A few Examples… (columns: Amazon DynamoDB, Amazon RDS)
       Size at limit – bounded ✔
       Size at limit – unbounded ✔
       Key-values or documents ✔
       Rows ✔
       Need relational capabilities ✔
       Partitioned ✔
       Push-down compute requirements ✔
       Change Velocity ✔ ✔
       Ingestion requirements ✔
  • 26. Operational: transactional, system of record, content management. Analytics: retrospective, predictive, streaming
  • 27. Analytic workloads
       General characteristics:
         • Almost always columnar
         • Large and usually partitioned
         • Large compute size – millions of items involved in query
         • Heavy compute push down
         • Batch writes or trickle inserts
         • Little to no updates
         • Needs a lot of memory and often in-memory compute capabilities
       Primary dimensions to consider:
         • Streaming or not
         • Latency requirements
         • ETL or no ETL
         • Serverless or dedicated compute
         • Always active or occasionally active
         • Data formats
  • 28. Amazon Redshift – Data Warehousing. Fast, powerful, and simple data warehousing at 1/10 the cost. Massively parallel, petabyte scale
       Fast: Columnar storage technology to improve I/O efficiency and parallelize queries. Data load scales linearly.
       Inexpensive: As low as $1,000 per terabyte per year, 1/10th the cost of traditional data warehouse solutions.
       Scalable: Resize your cluster up and down as your performance and capacity needs change.
       Secure: Data encrypted at rest and transit. Isolate clusters with VPC. Manage your own keys with AWS KMS.
  • 29. Amazon Athena – Interactive analysis. Interactive query service to analyze data in Amazon S3 using standard SQL. No infrastructure to set up or manage and no data to load
       Query Instantly: Zero setup cost. Just point to S3 and start querying
       Pay per query: Pay only for queries run. Save 30-90% on per-query costs through compression
       Open: ANSI SQL interface, JDBC/ODBC drivers, multiple formats, compression types, and complex joins and data types
       Easy: Serverless. Zero Infrastructure. Zero Administration
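The 30-90% saving quoted on slide 29 follows from Athena billing by the amount of data a query scans: compressed data means fewer bytes scanned for the same query. The sketch below works through that arithmetic. The price per terabyte and the data sizes in the usage lines are hypothetical placeholders, not figures from the deck.

```python
def query_cost(scanned_tb: float, price_per_tb: float) -> float:
    """Cost of one query when billing is proportional to data scanned."""
    if scanned_tb < 0 or price_per_tb < 0:
        raise ValueError("scanned_tb and price_per_tb must be non-negative")
    return scanned_tb * price_per_tb


def savings_fraction(uncompressed_tb: float, compressed_tb: float) -> float:
    """Fraction of per-query cost saved by scanning the compressed data instead."""
    if uncompressed_tb <= 0 or not 0 <= compressed_tb <= uncompressed_tb:
        raise ValueError("need 0 <= compressed_tb <= uncompressed_tb and uncompressed_tb > 0")
    return 1 - compressed_tb / uncompressed_tb


if __name__ == "__main__":
    price = 5.0  # hypothetical price per TB scanned; check current pricing
    # Hypothetical data set: 1 TB as raw text, 0.25 TB after compression.
    print(query_cost(1.0, price), query_cost(0.25, price))
    print(f"{savings_fraction(1.0, 0.25):.0%} saved")
```

A 4:1 compression ratio lands inside the range the slide quotes; the actual ratio depends on the data and the compression format chosen.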
  • 30. Amazon Kinesis Analytics Process and Analyze Streaming Data in Real-time with SQL
  • 31. Amazon Elasticsearch Service. Easy to deploy, secure, operate, and scale Elasticsearch
       Easy to Use: Fully managed. Deploy production-ready clusters in minutes.
       Open: Direct access to Elasticsearch open-source APIs. Supports Logstash and Kibana.
       Secure: Secure access with VPC to keep all traffic within AWS network.
       Available: Zone awareness replicates data between two AZs. Automatically monitors & replaces failed nodes.
       Customers use Elasticsearch for log analytics, full text search, & application monitoring
  • 32. Analytics database dimensions (columns: Amazon Redshift, Athena, Kinesis Analytics)
       Streaming analytics ✔
       Serverless ad-hoc query ✔
       Process, prepare and index in-place ✔
       Low-latency for reporting and BI dashboards ✔
       Pay per query ✔
       Data warehouse with multiple enterprise data sources ✔
       Query data directly in S3 without format conversions ✔
       Directly query CSV, JSON, TSV or text files ✔
  • 33. Why pick one when you can use all three?
       [Diagram labels: Well Modelled Data, Data Exploration, Non-SQL Analytics, Real-Time analytics, Managed Storage, Delivery; services: Amazon Elasticsearch Service, Kinesis Analytics, Athena, Amazon Redshift, Amazon S3]
  • 34. AWS Databases and Analytics. Broadest and deepest portfolio, purpose-built for builders
       Business Intelligence & Machine Learning: QuickSight, Machine Learning
       Databases, Relational: RDS, Aurora
       Databases, Non-Relational: DynamoDB (Key value/Document), ElastiCache (Redis, Memcached)
       Analytics (DW | Big Data Processing | Ad hoc | Real-time | Operational): Redshift, EMR, Athena, Kinesis Analytics, Elasticsearch Service
       Data Lake: S3/Glacier (Storage), Glue (ETL & Data Catalog), Macie (Data Protection)
       Data Movement: Database Migration Service | Snowball | Snowmobile | Kinesis Data Firehose | Kinesis Data Streams
  • 35. In Closing. AWS offers a myriad of services designed to help you solve your toughest problems at scale – no need to just pick one service. When selecting a data service, consider the dimensions and pick the best match for each component of your application
  • 36. Thank you! Please fill in the session survey. We hope you enjoyed the discussion! DAT310: Which Database to Use When?