In this session, Darin Briskman dives deep into what databases to use for which components of your application. Learn how to evaluate a new workload for the best managed database option based on specific application needs related to data shape, data size at limit, computational requirements, programmability, throughput and latency needs, and more. This session explains the ideal use cases for relational and non-relational database services, including Amazon Aurora, Amazon DynamoDB, Amazon ElastiCache for Redis, Amazon Neptune, and Amazon Redshift.
Darin Briskman, Chief Evangelist, Database, Analytics, & Machine Learning, Amazon Web Services
In this session, we show you how to understand what data you have, how to drive insights, and how to make predictions using purpose-built AWS services. Learn about the common pitfalls of building data lakes and discover how to successfully drive analytics and insights from your data. Also learn how services such as Amazon S3, AWS Glue, Amazon Redshift, Amazon Athena, Amazon EMR, Amazon Kinesis, and Amazon ML services work together to build a successful data lake for various roles, including data scientists and business users.
Data preparation and transformation - Spin your straw into gold - Tel Aviv Su... | Amazon Web Services
Data preparation is always a challenge. Why care about infrastructure?
Come learn how to deploy your Spark jobs in minutes using our managed services, Amazon EMR and AWS Glue, and focus on your business needs.
AWS Summit Singapore - Architecting a Serverless Data Lake on AWS | Amazon Web Services
Unni Pillai, Specialist Solution Architect, ASEAN, AWS.
Daniel Muller, Head of Cloud Infrastructure, Spuul.
As the volume and types of data continue to grow, customers often have valuable data that is not easily discoverable or available for analytics. A common challenge for data engineering teams is architecting a data lake that can cater to the needs of diverse users - from developers to business analysts to data scientists.
In this session, we will dive deep into building a data lake using Amazon S3, Amazon Kinesis, Amazon Athena and AWS Glue. We will also see how AWS Glue crawlers can automatically discover your data, extracting and cataloguing relevant metadata to reduce operations in preparing your data for downstream consumers.
Furthermore, learn from our customer Spuul how they moved from data warehouse-based analytics to a serverless data lake. Why and how did Spuul undertake this journey? Hear about the benefits and challenges they encountered.
Leadership Session: AWS Database and Analytics (DAT206-L) - AWS re:Invent 2018 | Amazon Web Services
We’re witnessing an unprecedented growth in the amount of data collected and stored in the cloud. Getting insights from this data requires database and analytics services that scale and perform in ways not possible before. AWS offers the broadest set of database and analytics services to process, store, manage, and analyze all your data. In this session, we provide an overview of the database and analytics services at AWS, new services and features we launched this year, how customers are using these services, and our vision for continued innovation in this space.
Your data has value for multiple business functions in your organization. Shorten your time to analytics and make faster, better decisions based on data.
In this session, you will learn how you can access your data from a myriad of tools, such as multiple EMR clusters, Athena, and Redshift.
The Open Data Lake Platform Brief - Data Sheets | Whitepaper | Vasu S
An open data lake platform provides a robust and future-proof data management paradigm to support a wide range of data processing needs, including data exploration, ad-hoc analytics, streaming analytics, and machine learning.
AWS-powered services for analytics can handle the scale, agility, and flexibility required to combine different types of data and analytics approaches that will allow you to transform your data into a valuable corporate asset. In this session, AWS will provide an overview of the different AWS services available for your data analytics needs. You can combine these blocks to build data flows that will extend your organization’s agility, ability to derive more insights and value from its data, and capability to adopt more sophisticated analytics tools and processes as your needs evolve. In the second part of the session, Paddy Power Betfair’s Data team will discuss the adoption and large-scale operation of a broad range of AWS services that make up PPB’s scalable, mixed-workload, multi-brand data platform. The data capabilities developed by PPB and powered by AWS were implemented to enable low-latency, high-volume, and near real-time advanced analytics use cases in the highly regulated and fast-paced betting industry. This was only possible through a focus on automation, innovation, and continuous improvement.
Modern data is massive, quickly evolving, unstructured, and increasingly hard to catalog and understand for multiple consumers and applications. This session will guide you through the best practices for designing a robust data architecture, highlighting the benefits and typical challenges of data lakes and data warehouses. We will build a scalable solution based on managed services such as Amazon Athena, AWS Glue, and AWS Lake Formation.
Come along and learn about the enhancements that we have made to Amazon Redshift, including features around performance, scalability and cluster management. We will also explore why a cloud native Data Warehouse solution allows AWS to innovate faster and deliver customer outcomes that are not possible in a more traditional on premises solution.
by Mamoon Chowdry, Solutions Architect
AWS Data & Analytics Week is an opportunity to learn about Amazon’s family of managed analytics services. These services provide easy, scalable, reliable, and cost-effective ways to manage your data in the cloud. We explain the fundamentals and take a technical deep dive into the Amazon Redshift data warehouse; data lake services, including Amazon EMR, Amazon Athena, and Amazon Redshift Spectrum; log analytics with Amazon Elasticsearch Service; and data preparation and placement services with AWS Glue and Amazon Kinesis. You'll learn how to get started, how to support applications, and how to scale.
A data lake is an architectural approach that allows you to store massive amounts of data in a central location, so it's readily available to be categorized, processed, analyzed, and consumed by diverse groups within an organization. In this session, we will introduce the data lake concept and its implementation on AWS. We will explain the different roles our services play and how they fit into the data lake picture.
Today’s organisations require a data storage and analytics solution that offers more agility and flexibility than traditional data management systems. Data Lake is a new and increasingly popular way to store all of your data, structured and unstructured, in one, centralised repository. Since data can be stored as-is, there is no need to convert it to a predefined schema and you no longer need to know what questions you want to ask of your data beforehand.
In this webinar, you will discover how AWS gives you fast access to flexible and low-cost IT resources, so you can rapidly build and scale a data lake that can power any kind of analytics, such as data warehousing, clickstream analytics, fraud detection, recommendation engines, event-driven ETL, serverless computing, and internet-of-things processing, regardless of the volume, velocity, and variety of your data.
Learning Objectives:
• Discover how you can rapidly scale and build your data lake with AWS.
• Explore the key pillars behind a successful data lake implementation.
• Learn how to use the Amazon Simple Storage Service (S3) as the basis for your data lake.
• Learn about the recently launched AWS services, Amazon Athena and Amazon Redshift Spectrum, that help customers query the data lake directly (see the sketch below).
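As a concrete illustration of the objectives above, here is a minimal boto3 sketch that runs one Athena query over a cataloged data-lake table. The database, table, and result-bucket names are hypothetical placeholders.

```python
import time
import boto3

athena = boto3.client("athena", region_name="us-east-1")

# Database, table, and result bucket below are hypothetical placeholders.
query_id = athena.start_query_execution(
    QueryString="SELECT page, COUNT(*) AS hits FROM clickstream GROUP BY page LIMIT 10",
    QueryExecutionContext={"Database": "my_datalake_db"},
    ResultConfiguration={"OutputLocation": "s3://my-athena-results-bucket/"},
)["QueryExecutionId"]

# Poll until the query finishes, then print the result rows.
while True:
    state = athena.get_query_execution(QueryExecutionId=query_id)[
        "QueryExecution"]["Status"]["State"]
    if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
        break
    time.sleep(1)

if state == "SUCCEEDED":
    rows = athena.get_query_results(QueryExecutionId=query_id)["ResultSet"]["Rows"]
    for row in rows:  # the first row holds the column headers
        print([col.get("VarCharValue") for col in row["Data"]])
```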
Visualize your data in Data Lake with AWS Athena and AWS Quicksight Hands-on ... | Amazon Web Services
Level 200: Visualize Your Data in Data Lake with AWS Athena and AWS Quicksight
Nowadays, enterprises are building data lakes that store large volumes of structured and unstructured data for analysis. But building the required data models and infrastructure takes a lot of time. How to run fast data queries without servers and databases is the next big question for every enterprise.
In this workshop, eCloudvalley, the first and only Premier Consulting Partner in GCR, will demonstrate how to use serverless architecture to visualize your data using Amazon Athena and Amazon QuickSight.
You can easily query and visualize the data in your S3 buckets and get business insights by combining these two services. You can also build business reports with other tools such as AWS IoT and Amazon Kinesis Firehose.
Reason to Attend:
Learn how to quickly query thousands of data objects on S3 with the serverless Amazon Athena
Learn how to use Amazon QuickSight to retrieve information from your database quickly and create detailed reports
The AWS Big Data services are inherently built to run at scale. In this session, you will learn how to develop an enterprise-scale big data application using AWS services such as Amazon EMR, Amazon Redshift & Redshift Spectrum, Amazon Athena, Amazon Elasticsearch Service, Amazon Kinesis, Amazon QuickSight, and AWS Glue. This session will also cover different architectural patterns and customer use cases.
Modern Cloud Data Warehousing ft. Intuit: Optimize Analytics Practices (ANT20... | Amazon Web Services
Most companies are overrun with data, yet they lack critical insights to make timely and accurate business decisions. They are missing the opportunity to combine large amounts of new, unstructured big data that resides outside their data warehouse with trusted, structured data inside their data warehouse. In this session, we discuss the most common use cases with Amazon Redshift, and we take an in-depth look at how modern data warehousing blends and analyzes all your data to give you deeper insights to run your business. Intuit joins us to share their experience modernizing their analytics pipeline.
What's New with Amazon Redshift ft. Dow Jones (ANT350-R) - AWS re:Invent 2018 | Amazon Web Services
Learn about the latest and hottest features of Amazon Redshift. We’ll dive deep into the architecture and inner workings of Amazon Redshift and discuss how the recent availability, performance, and manageability improvements we’ve made can significantly enhance your user experience. We’ll also share a glimpse of what we are working on and our plans for the future. Dow Jones will join us to share how they leverage a data lake powered by Redshift, Redshift Spectrum, and Athena to get fast time to insights.
by Rajeev Srinivasan, Sr. Solutions Architect and Gautam Srinivasan, Solutions Architect, AWS
While a data lake can support completely unstructured data, getting performant analytics at scale requires some data preparation. We'll look at how to use Amazon Kinesis, AWS Glue, and Amazon EMR to make raw data ready for high-performance analytics.
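To make the ingestion step concrete, here is a minimal boto3 sketch that pushes one raw record into a Kinesis stream; the stream name and event shape are hypothetical, and downstream Glue or EMR jobs would pick these records up for preparation.

```python
import json
import boto3

kinesis = boto3.client("kinesis", region_name="us-east-1")

# Stream name and event shape are hypothetical placeholders.
event = {"device_id": "sensor-42", "temperature": 21.7}
kinesis.put_record(
    StreamName="raw-events",
    Data=json.dumps(event).encode("utf-8"),
    PartitionKey=event["device_id"],  # same key -> same shard, preserving per-device order
)
```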
Build a High-Performance, Cloud-Native, Open-Source Platform on AWS & Save Mi... | Amazon Web Services
As a leader in agriculture technologies and services, Bayer is using technologies such as unmanned aerial vehicles (UAV), satellite imagery, and sensor data from multiple sources to generate real time insights. Over 300 data sources are ingested into their open source HPC geospatial platform to generate on average 100M API calls per day. The platform is used to provide real-time visualization and computational analysis to Bayer’s internal research community, partners, and is licensed to third-party applications to provide insights relevant to high-yield production of crops. In this session, Mendez-Costabel discusses how Bayer transitioned from on-premises packaged software architecture to open-source software and cloud services from AWS to build a modern, scalable, high-performance, open-source app on AWS. Learn about the open-source application architecture and AWS services used. Learn how the computing environment has changed the way that Bayer is performing R&D projects, and how the move to a modern architecture has enabled Bayer’s customers to gain insights that are transforming their businesses.
The introductory morning session will discuss big data challenges and provide an overview of the AWS Big Data Platform. We will also cover:
• How AWS customers leverage the platform to manage massive volumes of data from a variety of sources while containing costs.
• Reference architectures for popular use cases, including: connected devices (IoT), log streaming, real-time intelligence, and analytics.
• The AWS big data portfolio of services, including Amazon S3, Kinesis, DynamoDB, Elastic MapReduce (EMR) and Redshift.
• The latest relational database engine, Amazon Aurora - a MySQL-compatible, highly available relational database engine that provides up to five times better performance than MySQL at one-tenth the price of a commercial database.
• Amazon Machine Learning – the latest big data service from AWS provides visualization tools and wizards that guide you through the process of creating machine learning (ML) models without having to learn complex ML algorithms and technology.
ABD206 - Building Visualizations and Dashboards with Amazon QuickSight | Amazon Web Services
Just as a picture is worth a thousand words, a visual is worth a thousand data points. A key aspect of our ability to gain insights from our data is to look for patterns, and these patterns are often not evident when we simply look at data in tables. The right visualization will help you gain a deeper understanding in a much quicker timeframe. In this session, we will show you how to quickly and easily visualize your data using Amazon QuickSight. We will show you how you can connect to data sources, generate custom metrics and calculations, create comprehensive business dashboards with various chart types, and set up filters and drill-downs to slice and dice the data.
A data lake allows an organisation to store all of its data, structured and unstructured, in one centralised repository. Since data can be stored as-is, there is no need to convert it to a predefined schema, and you no longer need to know what questions you want to ask of your data beforehand. In this session we will explore the architecture of a data lake on AWS and cover topics such as storage, processing, and security.
Speakers:
Tom McMeekin, Associate Solutions Architect, Amazon Web Services
Building Data Lakes and Analytics on AWS; Patterns and Best Practices - BDA30... | Amazon Web Services
In this session, we show you how to understand what data you have, how to drive insights, and how to make predictions using purpose-built AWS services. Learn about the common pitfalls of building data lakes and discover how to successfully drive analytics and insights from your data. Also learn how services such as Amazon S3, AWS Glue, Amazon Redshift, Amazon Athena, Amazon EMR, Amazon Kinesis, and Amazon ML services work together to build a successful data lake for various roles, including data scientists and business users.
AWS delivers an integrated suite of services that provide everything needed to quickly and easily build and manage a data lake for analytics. AWS-powered data lakes can handle the scale, agility, and flexibility required to combine different types of data and analytics approaches to gain deeper insights, in ways that traditional data silos and data warehouses cannot. In this session, we will show you how you can quickly build a data lake on AWS that ingests, catalogs, and processes incoming data and makes it ready for analysis. Using a live demo, we demonstrate the capabilities of AWS-provided analytics services such as AWS Glue, Amazon Athena, and Amazon EMR, and how to build a data lake on AWS step by step.
Build Data Engineering Platforms with Amazon EMR (ANT204) - AWS re:Invent 2018Amazon Web Services
Amazon EMR provides a flexible range of service customization options, enabling customers to use it as a building block for their data platforms. In this session, AWS customers Salesforce.com and Vanguard discuss in detail how they use Amazon EMR to build a self-service, secure, and auditable data engineering platform. Customers who want to optimize their design and configurations should attend this session to learn best practices from customer experts. Topics include achieving cost-efficient scale, using notebooks, processing streaming data, rapid prototyping of applications and data pipelines, architecting for both transient and persistent clusters, setting up advanced security and authorization controls, and enabling easy self service for users.
Using AWS Purpose-Built Databases to Modernize your Applications | Amazon Web Services
As you look to modernize your applications, you will need to consider your database options to meet the new application requirements. AWS offers a series of purpose-built databases that cover relational, key-value, document, graph, and cache use cases to help you deliver new and enhanced functionality. In this webinar session, we share the different modern application architectures and how to combine different database services to meet your requirements. Understand how to modernize your relational databases through easy upgrades with Amazon Relational Database Service, and learn how to migrate from one database to another with AWS Database Migration Service and AWS Schema Conversion Tool.
Speaker:
Blair Layton, Business Development Manager, Amazon Web Services
In this session, we discuss the evolution of database and analytics services in AWS, the new database and analytics services and features we launched this year, and our vision for continued innovation in this space. We are witnessing an unprecedented growth in the amount of data collected, in many different forms. Storage, management, and analysis of this data require database services that scale and perform in ways not possible before. AWS offers a collection of database and other data services—including Amazon Aurora, Amazon DynamoDB, Amazon RDS, Amazon Redshift, Amazon ElastiCache, Amazon Kinesis, and Amazon EMR—to process, store, manage, and analyze data. In this session, we provide an overview of AWS database and analytics services and discuss how customers are using these services today.
As the volume and types of data continue to grow, customers often have valuable data that is not easily discoverable or available for analytics. A common challenge for data engineering teams is architecting a data lake that can cater to the needs of diverse users - from developers to business analysts to data scientists. In this session, dive deep into building a data lake using Amazon S3, Amazon Kinesis, Amazon Athena, and AWS Glue. Learn how AWS Glue crawlers can automatically discover your data, extracting and cataloguing relevant metadata to reduce the work of preparing your data for downstream consumers.
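A sketch of the crawler flow just described, using boto3; the crawler name, IAM role, database, and S3 path are all hypothetical placeholders.

```python
import boto3

glue = boto3.client("glue", region_name="us-east-1")

# Create a crawler that scans an S3 prefix and writes table definitions
# into the Glue Data Catalog. Names, role, and path are hypothetical.
glue.create_crawler(
    Name="datalake-crawler",
    Role="arn:aws:iam::123456789012:role/GlueCrawlerRole",
    DatabaseName="my_datalake_db",
    Targets={"S3Targets": [{"Path": "s3://my-datalake-bucket/raw/"}]},
)

# Run it on demand; discovered tables become queryable from Athena,
# EMR, or Redshift Spectrum.
glue.start_crawler(Name="datalake-crawler")
```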
Technology Trends in Data Processing - DAT311 - re:Invent 2017 | Amazon Web Services
In this talk, Anurag Gupta, VP for AWS Analytic and Transactional Database Services, talks about some of the key trends we see in data processing and how they shape the services we offer at AWS. Specific trends include the rise of machine-generated logs as the dominant source of data, the move toward serverless, API-centric computing, and the growing need for local access to data from users around the world.
Understanding AWS Managed Databases and Analytic Services - AWS Innovate Otta... | Amazon Web Services
• Overview of database services to elevate your applications, analytic services to engage your data, and migration services to help you reach database freedom.
• Survey of how Canadian and other organizations are using the cloud to make data scalable, reliable, and secure.
AWS re:Invent is Amazon Web Services' annual global conference. Every year we present more than 1,000 technical sessions, workshops, and hackathons covering key AWS topics and showcasing the technologies AWS develops and introduces. In this webinar we recap the announcements presented in Las Vegas and walk through use cases for the main services introduced.
Building with Purpose-Built Databases: Match Your Workload to the Right Database | AWS Summits
Learn how to evaluate a new workload for the best managed database option based on specific application needs related to data shape, data size at limit, computational requirements, programmability, throughput and latency needs, and more. This session explains the ideal use cases for relational and non-relational database services, including Amazon Aurora, Amazon DynamoDB, Amazon ElastiCache for Redis, Amazon Neptune, and Amazon Redshift.
Laura Caicedo, Solutions Architect, Amazon Web Services
A growing number of organizations today need to deploy and operate Internet-scale applications, which requires Internet-scale database services. Join us to learn about the broad and deep AWS portfolio of database services, with solutions that provide the scalability, flexibility, resilience, security, and regulatory compliance to help enable you to achieve your mission, no matter how small or how large your needs might be. You’ll learn about how to manage data to meet a wide range of needs – from different data sizes, to varieties of data types, to differing requirements for speed and complexity. In this session you will also learn how you can achieve both cost savings and increase agility through AWS innovation that helps you move beyond legacy commercial databases.
Understanding AWS Managed Database and Analytics Services | AWS Public Sector... | Amazon Web Services
The world is creating more data in more ways than ever before. The average internet user in 2017 generates 1.5GB of data per day, with the rate doubling every 18 months. A single autonomous vehicle can generate 4TB per day. Each smart manufacturing plant generates 1PB per day. Storing, managing, and analyzing this data requires integrated database and analytic services that provide reliability and security at scale. AWS offers a range of managed data services that let customers focus on making data useful, including Amazon Aurora, RDS, DynamoDB, Redshift, Spectrum, ElastiCache, Kinesis, EMR, Elasticsearch Service, and Glue. In this session, we discuss these services, share our vision for innovation, and show how our customers use these services today. Learn More: https://aws.amazon.com/government-education/
Amazon Relational Database Service (RDS) makes it easy to set up, operate, and scale a relational database in the cloud. It provides cost-efficient, resizable capacity while automating time-consuming tasks such as hardware provisioning, database setup, patching, and backups. There are multiple database engines to choose from, including Amazon Aurora, PostgreSQL, MySQL, MariaDB, Oracle, and Microsoft SQL Server. Amazon Aurora is a relational database engine that combines the speed and reliability of high-end commercial databases with the simplicity and cost-effectiveness of open source databases. It is designed to be compatible with MySQL and PostgreSQL so that existing applications and tools can run without modification.
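A minimal boto3 sketch of what "easy to set up" looks like in practice, provisioning a small MySQL instance with automated backups; the identifier, instance class, and credentials are hypothetical placeholders.

```python
import boto3

rds = boto3.client("rds", region_name="us-east-1")

# Provision a small MySQL instance; identifier and credentials are placeholders.
rds.create_db_instance(
    DBInstanceIdentifier="demo-mysql",
    DBInstanceClass="db.t3.micro",
    Engine="mysql",
    AllocatedStorage=20,          # GiB
    MasterUsername="admin",
    MasterUserPassword="change-me-now",
    BackupRetentionPeriod=7,      # automated backups, one of the tasks RDS handles
)

# Wait until the instance is available, then read its endpoint.
waiter = rds.get_waiter("db_instance_available")
waiter.wait(DBInstanceIdentifier="demo-mysql")
desc = rds.describe_db_instances(DBInstanceIdentifier="demo-mysql")
print(desc["DBInstances"][0]["Endpoint"]["Address"])
```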
Build Data Lakes and Analytics on AWS: Patterns & Best Practices | Amazon Web Services
With over 90% of today’s data generated in the last two years, the rate of data growth shows no sign of slowing down. In this session, we step through the challenges and best practices for capturing data, understanding what data you own, driving insights, and predicting the future using AWS services. We frame the session and demonstrations around common pitfalls of building data lakes and how to successfully drive analytics and insights from data. We also discuss the architecture patterns that bring together key AWS services, including Amazon S3, AWS Glue, Amazon Athena, Amazon Kinesis, and Amazon Machine Learning. Discover the real-world application of data lakes for roles including data scientists and business users.
Stephen Moon, Sr. Solutions Architect, Amazon Web Services
James Juniper, Solution Architect for the Geo-Community Cloud, Natural Resources Canada
Building low latency apps with a serverless architecture and in-memory data I... | AWS Germany
In-memory data stores such as Amazon ElastiCache for Redis enable applications with response times in microseconds. Using Aurora, DynamoDB, DAX, Lambda, and ElastiCache, we explored how to design and deploy high-performance applications. Learn more here: https://aws.amazon.com/products/databases/
Similar to Building with Purpose-Built Databases: Match Your Workload to the Right Database (20)
How to build Forecasting services using ML and deep learning algorithms... | Amazon Web Services
Forecasting is an important process for many companies and is used in many areas to try to accurately predict the growth and distribution of a product, the resources needed on production lines, financial presentations, and much more. Amazon uses advanced forecasting techniques, and some of these services have been made available to all AWS customers.
In this session we show how to pre-process data that contains a temporal component and then use an algorithm that, based on the type of data analyzed, produces an accurate forecast.
Big Data for Startups: how to build Big Data applications in serverless mode... | Amazon Web Services
The variety and amount of data created every day is accelerating ever faster and represents a unique opportunity to innovate and create new startups.
However, managing large amounts of data can seem complex: building large-scale Big Data clusters looks like an investment accessible only to established companies. But the elasticity of the cloud and, in particular, serverless services let us break through these limits.
We will see how to develop Big Data applications quickly, without worrying about infrastructure, dedicating all our resources to developing the ideas behind innovative products.
You can now use Amazon Elastic Kubernetes Service (EKS) to run Kubernetes pods on AWS Fargate, the serverless compute engine built for containers on AWS. This makes it easier than ever to build and run your Kubernetes applications in the AWS cloud. In this session we present the main features of the service and how to deploy your application in a few steps.
Twenty years ago, Amazon went through a radical transformation aimed at increasing the pace of innovation. Over this period we learned how changing our approach to application development allowed us to greatly increase agility and release velocity and, ultimately, to build more reliable and scalable applications. In this session we explain how we define modern applications and how building modern apps affects not only application architecture but also organizational structure, development release pipelines, and even the operating model. We also describe common approaches to modernization, including the approach used by Amazon.com itself.
How to spend up to 90% less with containers and Spot Instances | Amazon Web Services
The use of containers keeps growing.
When designed correctly, container-based applications are very often stateless and flexible.
AWS ECS, EKS, and Kubernetes on EC2 can take advantage of Spot Instances, leading to average savings of 70% compared to On-Demand Instances. In this session we explore the characteristics of Spot Instances and how they can easily be used on AWS. We also learn how Spreaker uses Spot Instances to run applications of different kinds, in production, at a fraction of the on-demand cost!
In recent months, many customers have been asking us how to monetise Open APIs, simplify Fintech integrations, and accelerate the adoption of various Open Banking business models. Therefore, AWS and FinConecta would like to invite you to the Open Finance marketplace presentation on October 20th.
Event Agenda:
Open banking so far (short recap)
• PSD2, OB UK, OB Australia, OB LATAM, OB Israel
Intro to Open Finance marketplace
• Scope
• Features
• Tech overview and Demo
The role of the Cloud
The Future of APIs
• Complying with regulation
• Monetizing data / APIs
• Business models
• Time to market
One platform for all: a Strategic approach
Q&A
Make your startup's market offering unique with Machine Learning services... | Amazon Web Services
To create value and build a differentiated, recognizable offering, successful startups know how to combine established technologies with innovative components built ad hoc.
AWS provides ready-to-use services and, at the same time, lets you customize and build the differentiating elements of your own offering.
Focusing on Machine Learning technologies, we will see how to select the artificial intelligence services offered by AWS and, including through a demo, how to build custom Machine Learning models using SageMaker Studio.
OpsWorks Configuration Management: automate the management and deployment of... | Amazon Web Services
With the traditional approach to IT, implementing DevOps techniques was difficult for many years; they often involved manual activities, occasionally leading to application downtime and interrupting user operations. With the advent of the cloud, DevOps techniques are now within everyone's reach at low cost for any kind of workload, guaranteeing greater system reliability and yielding significant improvements in business continuity.
AWS provides AWS OpsWorks as a Configuration Management tool that aims to automate and simplify the management and deployment of EC2 instances by means of Chef and Puppet workloads.
Learn how to use AWS OpsWorks to guarantee the reliability of your application running on EC2 instances.
Microsoft Active Directory on AWS to support your Windows Workloads | Amazon Web Services
Want to know your options for running Microsoft Active Directory on AWS? When moving Microsoft workloads to AWS, it is important to consider how to deploy Microsoft Active Directory to support group policy management, authentication, and authorization. In this session, we discuss options for deploying Microsoft Active Directory on AWS, including AWS Directory Service for Microsoft Active Directory and deploying Active Directory on Windows on Amazon Elastic Compute Cloud (Amazon EC2). We cover topics such as integrating your on-premises Microsoft Active Directory environment into the cloud and using SaaS applications, such as Office 365, with AWS Single Sign-On.
From facial recognition to detecting fraud or manufacturing defects, image and video analysis powered by artificial intelligence techniques is evolving and being refined at a rapid pace. In this webinar we explore what AWS services make possible when applying state-of-the-art computer vision techniques to real-world scenarios.
Amazon Web Services and VMware are holding a free virtual event next Wednesday, October 14th, from 12:00 to 13:00 dedicated to VMware Cloud™ on AWS, the on-demand service that lets you run applications in cloud environments based on VMware vSphere® and access a wide range of AWS services, taking full advantage of the AWS cloud while protecting existing VMware investments.
Build your first serverless ledger-based app with QLDB and NodeJS | Amazon Web Services
Many companies today build applications with ledger-style functionality, for example to verify the history of credits and debits in banking transactions or to track the supply chain flow of their products.
At the heart of these solutions are ledger databases, which provide a transparent, immutable, and cryptographically verifiable transaction log, but they are complex and costly tools to manage.
Amazon QLDB removes the need to build complex custom systems by providing a fully managed, serverless ledger database.
In this session we will see how to build a complete serverless application that uses QLDB's capabilities.
With the rise of microservice architectures and rich mobile and web applications, APIs are more important than ever for delivering a great experience to end users. In this session we learn how to tackle modern API design challenges with GraphQL, an open-source API query language used by Facebook, Amazon, and others, and how to use AWS AppSync, a managed serverless GraphQL service on AWS. We will dig into several scenarios, understanding how AppSync can help solve these use cases by building modern APIs with real-time and offline data update capabilities.
We will also learn how Sky Italia uses AWS AppSync to deliver real-time sports updates to users of its web portal.
Oracle Databases and VMware Cloud™ on AWS: myths to debunk | Amazon Web Services
Many organizations take advantage of the cloud by migrating their Oracle workloads, securing significant gains in agility and cost efficiency.
Migrating these workloads can create complexity during application modernization and refactoring, and performance risks can be introduced when moving applications out of on-premises data centers.
In these slides, AWS and VMware experts present simple, practical tips to ease and simplify the migration of Oracle workloads while accelerating the transformation to the cloud; they dig into the architecture and show how to take full advantage of VMware Cloud™ on AWS.
Amazon Elastic Container Service (Amazon ECS) is a highly scalable container management service that simplifies running Docker containers through an orchestration layer controlling deployment and lifecycle. In this session we present the main characteristics of the service, reference architectures for different workloads, and the simple steps needed to quickly migrate one or more of your containers.
2. Managed services transform operations
Operating databases in the old world, you own the whole stack: power, HVAC, and networking; rack and stack; server maintenance; OS installation and patches; DB software installs and patches; database backups; high availability; scaling; and app optimization.
Operating databases in AWS, the managed service takes care of power, HVAC, and networking; rack and stack; server maintenance; OS installation and patches; DB software installs and patches; database backups; high availability; and scaling, leaving you to focus on app optimization.
3. A one-size-fits-all database doesn’t fit anyone
Modern Applications Need Purpose-Built Databases
Users: 1M+
Data volume: TB–PB–EB
Locality: Global
Performance: Milliseconds–microseconds
Request Rate: Millions
Access: Mobile, IoT, devices
Scale: Up-out-in
Economics: Pay as you go
Developer Access: Instant API access
Data models: Relational | Key-value | Document | In-memory | Graph | Search
4. AWS purpose-built strategy
The right tool for the right job
Relational: Amazon Aurora | Amazon RDS
Non-relational:
• Key-value & document: Amazon DynamoDB
• In-memory: Amazon ElastiCache
• Graph: Amazon Neptune
5. Data models and common use cases
Relational: referential integrity, ACID transactions, schema-on-write. Use cases: ERP, medical records, CRM, finance. Services: Amazon Aurora, Amazon RDS, Amazon Redshift.
Key-value: low-latency key look-ups with high throughput and fast ingestion of data. Use cases: real-time bidding, shopping cart, IoT device tracking. Service: Amazon DynamoDB.
Document: indexing and storing documents with support for query on any attribute. Use cases: content management, personalization, mobile. Service: Amazon DynamoDB.
In-memory: microsecond latency, key-based queries, and specialized data structures. Use cases: leaderboards, real-time analytics, caching. Service: Amazon ElastiCache for Redis & Memcached.
Graph: creating and navigating relations between data easily and quickly. Use cases: fraud detection, social networking, recommendation engine. Service: Amazon Neptune.
Search: indexing and searching semistructured logs and data. Use cases: product catalog, help/FAQs, full-text search. Service: Amazon Elasticsearch Service.
6. AWS databases and analytics
Broad and deep portfolio, purpose-built for builders
Relational Databases: RDS, Aurora
Non-Relational Databases: DynamoDB, ElastiCache (Redis, Memcached), Neptune (Graph)
Data Lake: S3/Glacier, Glue (ETL & Data Catalog)
Data Movement: Database Migration Service | Snowball | Snowmobile | Kinesis Data Firehose | Kinesis Data Streams
Analytics (DW | Big Data Processing | Interactive | Real-time): Redshift, EMR, Athena, Kinesis Analytics, Elasticsearch Service
Business Intelligence & Machine Learning: QuickSight, SageMaker, Comprehend
10. Large relational databases with Amazon Aurora
Scale-out, distributed, multi-tenant architecture
Fully compatible with PostgreSQL and MySQL, with 3x–5x the throughput
Storage volume striped across hundreds of storage nodes distributed over 3 different Availability Zones
Six copies of data on SSD, two copies in each Availability Zone, to protect against AZ+1 failures
Continuous backup to Amazon S3 (built for 99.999999999% durability)
Diagram: one master with replicas spread across Availability Zones 1, 2, and 3
12. Amazon DynamoDB
Fully managed nonrelational database for any scale
Secure: encryption at rest and in transit; fine-grained access control; PCI, HIPAA, and FIPS 140-2 eligible
High performance: fast, consistent performance; virtually unlimited throughput; virtually unlimited storage
Fully managed: maintenance-free, serverless, auto scaling, backup and restore
Global Tables: high-performance, globally distributed applications with multi-region redundancy and resiliency; easy to set up, with no application rewrites required
13. Managed services for open source software
Redis, Memcached, Elasticsearch, Apache Hadoop, etc.
Amazon ElastiCache: fully managed (AWS manages all hardware and software setup, configuration, and monitoring); extreme performance (in-memory data store and cache for sub-millisecond response times); easily scalable (non-disruptive scaling up and down to meet changing demands)
Amazon Elasticsearch Service: open and secure, with direct access to open-source APIs and secure access with VPC
Amazon EMR: Apache Hadoop ecosystem with 19 open-source frameworks and low costs with S3 storage and Spot
14. Highly connected data best represented in a graph
Relational model
Foreign keys used to represent relationships
Queries can involve nesting & complex joins
Performance can degrade as datasets grow
Graph model
Relationships are first-class citizens
Write queries that navigate the graph
Results returned quickly, even on large datasets
15. Amazon Neptune
Fully managed graph database
Fast & scalable: store billions of relationships; query with millisecond latency
Reliable: six replicas of your data across three AZs, with full backup and restore
Flexible: build powerful queries with Gremlin and SPARQL
Open standards: supports Apache TinkerPop & W3C RDF graph models
18. Amazon Redshift Spectrum
Extend the data warehouse to exabytes of data in the S3 data lake
• Exabyte-scale Redshift SQL queries against Amazon S3
• Join data across Redshift and S3
• Scale compute and storage separately
• Stable query performance and unlimited concurrency
• CSV, ORC, Grok, Avro, & Parquet data formats
• Pay only for the amount of data scanned
Diagram: the Redshift Spectrum query engine sits between Redshift data and the S3 data lake
19. Amazon Elasticsearch Service
Managed service to deploy, secure, operate, and scale Elasticsearch
Fully managed: deploy production-ready clusters in minutes
Open: direct access to Amazon ES open-source APIs; supports Logstash and Kibana
Secure: secure access with VPC to keep all traffic within the AWS network
Available: zone awareness replicates data between two AZs; automatically monitors & replaces failed nodes
Customers use Amazon ES for log analytics, full-text search & application monitoring
22. Put machine learning in the hands of every developer
ML frameworks
ML services: Amazon SageMaker
AI services: Rekognition, Rekognition Video, Polly, Transcribe, Translate, Comprehend, Lex
(Vision | Speech | Language | Chatbots & Contact Centers)
The Cloud is a fully managed environment
Using managed services frees you to focus on your mission instead of the minutiae of operational details
AWS customers use a broad and deep selection of fully managed services to support work at any scale
Using a single database for every purpose doesn’t work in today’s world of large scale
Developers choose relational databases, nonrelational databases, analytical databases, machine learning, visualization and other tools to do the job
AWS customers need flexibility, scale, and performance
Application requirements are changing, and a one-size-fits-all approach of using a relational database as the only data store for your application no longer works. An increasing number of developers now choose relational and nonrelational databases that are purpose-built to meet their application’s specific needs, like storing key-value pairs and documents.
If you are building an online retail site, you might choose a relational database to help ensure financial transactions related to an order are 100% correct.
If you want a shopping cart that can provide consistent single-digit-millisecond latency with virtually limitless scale to handle the likes of Amazon Prime Day, you can choose a key-value database.
If you want to show more personalized recommendations like accessories that friends of your users purchased, you can choose a graph database.
The characteristics of cloud applications are driving why different database services exist today. Developers are always looking for the right tool for the job, and because these services are so easy to access, developers enjoy rich development flexibility without sacrificing scale and performance.
No one truck is right for every job, which is why there are tractor-trailers, pickup trucks, earth movers, and delivery trucks
No one data tool is right for every job, which is why there are AWS services for both relational and nonrelational data
This is one way to think about the different database choices developers have, as they think about using the right tool for the job often considering speed, scale, & programmability.
If I were standing here today saying we have one database that can literally do everything, it would be like saying you can use one vehicle as a utility vehicle, earth mover, delivery truck, and long-haul cargo mover that is equally efficient in every aspect of the job it's being used for.
Relational data is important for many customers
A lot of new development uses nonrelational data
The key is using the right tool for the job
AWS offers services across the full range of data tools
Business Intelligence and Machine Learning tools help make sense of data
Database services are the right tools for relational, nonrelational, and analytic jobs
The data lake combines data tools with scalable storage and data governance
Data movement tools let you get data between different formats and places
Many customers are still trapped using old-guard databases such as Oracle, Microsoft SQL Server, or IBM Db2
These databases are expensive, with proprietary lock-in and punitive licensing
Old-guard vendors will conduct audits (“you’ve got mail – audit coming!”) whenever they want to force extra payments
AWS helps customers escape from all of these limitations
The relational database world has been an unpleasant place for most customers. These customers have had to deal with old-guard database providers that are expensive, proprietary, have high lock-in, and impose punitive licensing terms. And, you occasionally get an email that says you’re being audited!
Open Source relational databases are widely-used and well supported
AWS customers want the low cost and community support of Open Source and the high performance and reliability of commercial databases
You can get fully managed Open Source with performance and reliability with Amazon RDS and Amazon Aurora
However, getting the same performance on open source databases as you get on commercial-grade databases is difficult. We have done this at Amazon.com, but it has required a lot of tuning. Customers that are moving to open source databases have asked us for the performance of commercial-grade databases with the pricing, freedom, and flexibility of open source databases. That's why we spent a few years building Amazon Aurora.
RDS is fully managed, automating patching, backup, high availability, encryption, and security.
With up to 16 TB per database instance, you can run hundreds or thousands of DB instances without large staff commitments.
You can use both Open Source (MySQL, MariaDB, PostgreSQL) and Commercial (Oracle, Microsoft SQL Server) databases
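As a rough sketch of what "fully managed" looks like in practice, here is how you might provision a managed PostgreSQL instance with boto3; the identifier, instance class, and sizes are illustrative placeholders, not prescriptions:

```python
import boto3

rds = boto3.client("rds")

# Provision a managed PostgreSQL instance; names and sizes are illustrative.
rds.create_db_instance(
    DBInstanceIdentifier="orders-db",
    Engine="postgres",
    DBInstanceClass="db.m5.large",
    AllocatedStorage=100,           # GiB
    MasterUsername="dbadmin",
    MasterUserPassword="replace-me",
    MultiAZ=True,                   # managed high availability
    StorageEncrypted=True,          # managed encryption at rest
)
```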
Aurora combines Open Source interfaces of MySQL and PostgreSQL with enterprise-class scalability, performance, and reliability
All data is stored in six copies across three independent physical facilities (we call these Availability Zones)
Aurora is high performance, with 3x – 5x the throughput of MySQL or PostgreSQL
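Because Aurora keeps the MySQL and PostgreSQL wire protocols, existing client libraries work unchanged. A minimal sketch with PyMySQL, assuming a hypothetical cluster endpoint, credentials, and an orders table:

```python
import pymysql

# Aurora is wire-compatible with MySQL, so the stock client just works;
# the cluster endpoint and schema below are placeholders.
conn = pymysql.connect(
    host="my-aurora.cluster-xxxx.us-east-1.rds.amazonaws.com",
    user="dbadmin", password="replace-me", database="orders", port=3306)

with conn.cursor() as cur:
    # Parameterized query against an assumed orders table.
    cur.execute("SELECT id, total FROM orders WHERE status = %s", ("shipped",))
    for row in cur.fetchall():
        print(row)
conn.close()
```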
DMS enables customers to copy and move databases without downtime
No lock-in at AWS: you can move data to the Cloud, off of the Cloud, and between Clouds
AWS customers have used DMS to migrate over 90,000 databases
In addition to offering a broad portfolio of purpose-built database services, AWS makes it easy for you to migrate your database to the cloud. The AWS Database Migration Service (DMS) helps customers securely migrate their databases to AWS with minimal or no downtime. The source database remains fully operational during the migration, causing no interruption to applications that rely on that database. DMS can migrate your data from most widely used commercial and open-source databases. DMS supports migrations such as Oracle to Oracle migrations, as well as migrations between different database platforms, such as Oracle to Amazon Aurora. DMS offers six months of free usage for migrations to Amazon Aurora, Amazon DynamoDB, and Amazon Redshift. For large databases, where terabytes of data need to be migrated, you can use AWS Snowball, a petabyte-scale data transport service that uses secure appliances to transfer data into and out of AWS.
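A hedged sketch of kicking off a DMS task with boto3; the ARNs are placeholders you would get from previously created endpoints and a replication instance, and the table mapping simply includes every table:

```python
import json

import boto3

dms = boto3.client("dms")

# Selection rule that includes every schema and table.
mappings = {"rules": [{
    "rule-type": "selection", "rule-id": "1", "rule-name": "include-all",
    "object-locator": {"schema-name": "%", "table-name": "%"},
    "rule-action": "include",
}]}

# ARNs below are placeholders for resources created beforehand.
dms.create_replication_task(
    ReplicationTaskIdentifier="oracle-to-aurora",
    SourceEndpointArn="arn:aws:dms:...:endpoint:SOURCE",
    TargetEndpointArn="arn:aws:dms:...:endpoint:TARGET",
    ReplicationInstanceArn="arn:aws:dms:...:rep:INSTANCE",
    MigrationType="full-load-and-cdc",  # initial copy plus ongoing replication
    TableMappings=json.dumps(mappings),
)
```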
Amazon.com runs our own business largely with DynamoDB
DynamoDB is fully managed, with consistent high performance at any scale, with some customers storing over 1 PB in a single DynamoDB table
Global Tables enable true active-active databases across the world
Amazon DynamoDB is a fully managed NoSQL database service running in the AWS Cloud. The complexity of running a massively scalable, distributed NoSQL database is managed by the service itself, allowing software developers to focus on building applications rather than managing infrastructure. NoSQL databases are designed for scale, but their architectures are sophisticated, and there can be significant operational overhead in running a large NoSQL cluster. Instead of having to become experts in advanced distributed computing concepts, developers need only learn DynamoDB's straightforward API using the SDK for the programming language of their choice. In addition to being easy to use, DynamoDB is also cost effective: you pay for the storage you are consuming and the I/O throughput you have provisioned. It is designed to scale elastically while maintaining high performance. When the storage and throughput requirements of an application are low, only a small amount of capacity needs to be provisioned in the DynamoDB service. As the number of users of an application grows and the required I/O throughput increases, additional capacity can be provisioned on the fly. This enables an application to seamlessly grow to support millions of users making thousands of concurrent requests to the database every second. Finally, DynamoDB is secure, with support for end-to-end encryption and fine-grained access control.
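To make the shopping-cart use case concrete, a minimal boto3 sketch; the Carts table, its key schema, and the item attributes are assumptions for illustration:

```python
import boto3

# Assumed table: "Carts" with partition key customer_id and sort key sku.
dynamodb = boto3.resource("dynamodb")
table = dynamodb.Table("Carts")

# Write a cart line item; DynamoDB serves such writes at any scale.
table.put_item(Item={
    "customer_id": "c-123",
    "sku": "B000123",
    "quantity": 2,
})

# Read it back with a key-based GetItem.
resp = table.get_item(Key={"customer_id": "c-123", "sku": "B000123"})
print(resp.get("Item"))
```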
AWS also has fully managed solutions for other popular Open Source packages
ElastiCache provides managed Redis and Memcached for sub-millisecond in-memory data
Amazon Elasticsearch Service provides a managed search engine, both for text search and for log analytics
Nineteen Apache Hadoop ecosystem packages are managed with Amazon EMR, including HBase, Spark, Presto, Hive, and others
While millisecond latency works for many applications, microsecond latency is required by real-time, data-intensive applications. For example, gaming leaderboards capture the scores of millions of online players every time their scores change and re-rank the players in real-time. A common solution for this is an in-memory data store where millions of data records can be written and accessed in microseconds. In-memory data stores can also function as stand-alone databases for transient data such as website user authentication tokens that expire at the end of the user session. Redis and Memcached are two popular choices for in-memory data stores. Redis is an open-source, in-memory, key-value store that offers a variety of built-in data structures such as sorted sets, lists, and geospatial data, making it faster to develop applications. Memcached is an open-source in-memory caching system that is easy to use. However, Redis and Memcached lack enterprise features such as scalability and reliability, and that's why we built Amazon ElastiCache.
Amazon ElastiCache offers Redis and Memcached as fully managed services. It automates management tasks such as hardware provisioning, software patching, setup, configuration, monitoring, and backups, making it easy to run Redis and Memcached on AWS. ElastiCache can scale out, scale in, and scale up to meet changing application demands. ElastiCache for Redis allows you to add up to five read replicas across multiple Availability Zones, enabling you to easily scale read capacity. And, if the primary read/write node fails, ElastiCache for Redis automatically promotes one of the read replicas to be the primary node, making your application more reliable. For scaling write capacity, ElastiCache for Redis lets you partition your data across multiple primary nodes and distributes write requests across these nodes. ElastiCache for Redis provides encryption at rest and encryption in transit, helping you secure your data.
Key benefits of Amazon ElastiCache include:
Redis and Memcached Compatible
With Amazon ElastiCache, you get native access to Redis or Memcached in-memory environments. This enables compatibility with your existing tools and applications.
Extreme Performance
Amazon ElastiCache works as an in-memory data store and cache to support the most demanding applications requiring sub-millisecond response times. By utilizing an end-to-end optimized stack running on customer-dedicated nodes, Amazon ElastiCache provides secure, blazing-fast performance.
Fully Managed
You no longer need to perform management tasks such as hardware provisioning, software patching, setup, configuration, monitoring, failure recovery, and backups. ElastiCache continuously monitors your clusters to keep your workloads up and running so that you can focus on higher value application development.
Easily Scalable
Amazon ElastiCache can scale-out, scale-in, and scale-up to meet fluctuating application demands. Write and memory scaling is supported with sharding. Replicas provide read scaling.
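For example, the leaderboard use case maps directly onto a Redis sorted set. A minimal sketch with the redis-py client, assuming a placeholder ElastiCache endpoint and made-up player names:

```python
import redis

# Connect to the ElastiCache for Redis primary endpoint (placeholder host).
r = redis.Redis(host="my-cluster.xxxxxx.use1.cache.amazonaws.com", port=6379)

# Record scores in a sorted set; updates and re-ranking are O(log N).
r.zadd("leaderboard", {"alice": 4200, "bob": 3100, "carol": 5300})

# Top 10 players, highest score first, with their scores.
top10 = r.zrevrange("leaderboard", 0, 9, withscores=True)
print(top10)
```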
Relational models, ironically, are not great for representing the relationships between data
Graph models are great for highly connected data, such as recommendation engines and social networks
The mathematics behind graph databases go back to the 1700s, but actually implementing them has been difficult and expensive
Now consider an app for recommendations, where someone wants to recommend organizations, entities, or sites of a certain type, in a particular city, that, for example, some of their connections also liked.
To do this, you need to put together a lot of connected datasets: you need to know the users, their connections, and their likes.
You also need to know the organizations, entities, and their attributes, such as museums, or schools, or places to eat.
In a relational model, you end up with multiple tables and multiple foreign keys; soon queries slow down and maintenance becomes difficult.
Alternatively, you can use an open-source graph database, which is hard to scale and lacks enterprise capabilities such as high availability.
Or commercial graph databases, which are expensive, often proprietary, and force you to choose between graph models.
What we want is a graph database compatible with the leading graph models and open APIs that is also fast, reliable, scalable, and cost-effective.
Amazon Neptune enables very large graph databases at low cost, with high performance and reliability
Just like the other databases, Neptune is fully managed, with AWS providing patching, backup, high availability, and high performance
Amazon Neptune is a fast, reliable, fully managed graph database. It makes it easy to build and run applications that work with highly connected datasets.
It has a purpose-built, high-performance graph database engine optimized for storing billions of relationships and querying the graph with millisecond latency.
Neptune supports the popular graph models, Property Graph and W3C's RDF
And their respective query languages, Apache TinkerPop Gremlin and SPARQL.
Neptune is fully managed, with high availability, read replicas, point-in-time recovery, continuous backup to Amazon S3, and replication across Availability Zones.
Neptune is secure, with support for encryption at rest and in transit.
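As an illustration of a recommendation-style Gremlin traversal against Neptune, a sketch using the gremlinpython client; the endpoint, vertex labels, and edge names are all assumptions:

```python
from gremlin_python.driver.driver_remote_connection import DriverRemoteConnection
from gremlin_python.process.anonymous_traversal import traversal

# Connect to the Neptune cluster endpoint (placeholder host).
conn = DriverRemoteConnection(
    "wss://my-neptune.cluster-xxxx.us-east-1.neptune.amazonaws.com:8182/gremlin",
    "g")
g = traversal().withRemote(conn)

# "Places my friends liked in Seattle" -- labels and properties are assumed.
places = (g.V().has("person", "name", "alice")
           .out("friend").out("liked")
           .has("place", "city", "Seattle")
           .values("name").dedup().toList())
print(places)
conn.close()
```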
As data sizes grow, so does the need for analytics
AWS provides services for the full range of analytics, from visualization to queries to storage to security
Customers like NETFLIX, Zillow, NASDAQ, Yelp, iRobot, and FINRA trust AWS to run their analytics workloads.
AWS Big Data and Analytics services enable customers to easily run any analytic workload (batch, ad hoc, real-time, IoT, and predictive analytics) at any scale (GB to TB to PB to EB), in a secure fashion, at the lowest possible cost. AWS provides a highly scalable, available, secure, and cost-effective data store that lets you store data in its native format and easily extract value from your data (what people call a Data Lake). This is particularly true now that many customers see much of their new data created directly in the cloud, with Amazon S3 being home to the vast majority of it. With much more operating experience and scale, and a broader set of analytics services available than anywhere else, S3 and our portfolio of Big Data & Analytics services are the clear number one choice for building your data lake and analytics solutions.
Data Lakes extend analytics to any scale, from Gigabytes to Exabytes
With a Data Lake, you can use any analytical approach, from dashboards to reporting to predictive analytics powered by machine learning
These enable customers to build cloud data lakes to analyze all their data with the broadest set of analytical approaches, including machine learning.
As a result, there are more organizations running their data lakes and analytics on AWS than anywhere else.
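For instance, querying data in place in an S3 data lake might look like the following with Athena and boto3; the database, table, and results bucket are placeholders:

```python
import time

import boto3

athena = boto3.client("athena")

# Run SQL directly against files cataloged in Glue; names are placeholders.
qid = athena.start_query_execution(
    QueryString="SELECT page, COUNT(*) AS hits FROM web_logs "
                "GROUP BY page ORDER BY hits DESC LIMIT 10",
    QueryExecutionContext={"Database": "my_data_lake"},
    ResultConfiguration={"OutputLocation": "s3://my-athena-results/"},
)["QueryExecutionId"]

# Poll until the query completes, then fetch the result rows.
while True:
    state = athena.get_query_execution(
        QueryExecutionId=qid)["QueryExecution"]["Status"]["State"]
    if state not in ("QUEUED", "RUNNING"):
        break
    time.sleep(1)

rows = athena.get_query_results(QueryExecutionId=qid)["ResultSet"]["Rows"]
print(rows)
```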
Redshift includes Spectrum, a feature that enables queries against data stored in files
Spectrum allows queries of very large data sets (Petabytes and Exabytes) at a low cost
Amazon Redshift Spectrum lets you run Amazon Redshift SQL queries against exabytes of data in Amazon S3,
extending the analytic power of Amazon Redshift beyond data stored on local disks in your data warehouse.
You can query vast amounts of unstructured data in your Amazon S3 "Data Lake" without having to load or transform any data.
It uses sophisticated query optimization across thousands of nodes so results are fast, even with large data sets and complex queries.
It directly queries data in S3 using open data formats such as CSV, Grok, ORC, Parquet, RCFile, SequenceFile, TextFile, TSV, and others.
It supports the SQL syntax of Amazon Redshift so you can run sophisticated queries using the same BI tools you use today.
You can also run queries that span data stored locally in Amazon Redshift and your full data sets in Amazon S3.
You pay only for the queries you run, with S3 rates for data storage and Amazon Redshift instance rates for the clusters used.
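A sketch of such a spanning query from Python via psycopg2, assuming a placeholder cluster endpoint and an external schema named spectrum already defined over files in S3:

```python
import psycopg2

# Connect to the Redshift cluster endpoint (placeholder host and credentials).
conn = psycopg2.connect(
    host="my-cluster.xxxx.us-east-1.redshift.amazonaws.com",
    port=5439, dbname="dev", user="awsuser", password="replace-me")
cur = conn.cursor()

# Join hot data stored locally in Redshift with cold data in S3;
# "spectrum.clickstream" is assumed to be an external table over S3 files.
cur.execute("""
    SELECT c.customer_id, COUNT(*) AS clicks
    FROM spectrum.clickstream s
    JOIN customers c ON c.customer_id = s.customer_id
    GROUP BY c.customer_id
    ORDER BY clicks DESC
    LIMIT 10;
""")
print(cur.fetchall())
conn.close()
```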
Elasticsearch is an Open Source search engine that is popular, but hard to install and maintain
Amazon Elasticsearch service is fully managed, making it easy to deploy, secure, operate and scale Elasticsearch
Customers use Elasticsearch both for text search and for log analytics
Amazon Elasticsearch Service makes it easy to deploy, secure, operate, and scale Elasticsearch
This is for log analytics, full text search, application monitoring, and more.
It is a fully managed service that delivers Elasticsearch's easy-to-use APIs and real-time analytics capabilities,
but with the availability, scalability, and security that production workloads require.
It has built-in integrations with Kibana, Logstash, and AWS so you can go from raw data to actionable insights quickly and securely.
These AWS integrations include Amazon Virtual Private Cloud (VPC), Amazon Kinesis Firehose, AWS Lambda, and Amazon CloudWatch
You get direct access to the Elasticsearch open-source API so existing Elasticsearch environments work seamlessly.
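A minimal sketch of hitting that open-source search API with the requests library; the domain endpoint and index name are placeholders, and a locked-down domain would additionally require request signing (e.g. SigV4):

```python
import requests

# Amazon ES domain endpoint and index are placeholders.
endpoint = "https://search-my-domain.us-east-1.es.amazonaws.com"

# Standard Elasticsearch query DSL: find log lines mentioning "timeout".
resp = requests.post(
    f"{endpoint}/app-logs/_search",
    json={"query": {"match": {"message": "timeout"}}, "size": 5},
)
for hit in resp.json()["hits"]["hits"]:
    print(hit["_source"])
```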
Humans use data better with pictures. QuickSight makes it easy to make data understandable for everyone
QuickSight can connect to data from almost any source, from AWS services to traditional BI services off-the-Cloud, to Excel spreadsheets
QuickSight is low cost and serverless, so you only pay for what you use, as you use it
Amazon QuickSight is a fast, cloud-powered business analytics service that makes it easy to build visualizations, perform ad-hoc analysis, and quickly get business insights from your data. Using our cloud-based service you can easily connect to your data, perform advanced analysis, and create stunning visualizations and rich dashboards that can be accessed from any browser or mobile device.
Insights for everyone: QuickSight enables self-serve, decentralized analytics in your organization better than any other system out there. A business analyst can take an analysis from concept to reality without depending on data engineers. You can create and prepare datasets, build your analysis, and share and collaborate with little to no intervention from IT or data engineers.
Seamless connectivity: Connecting to a data source (especially an AWS data source) doesn't involve back-end coding or complex setups. You simply click on the data source and enter your credentials, and QuickSight will auto-discover tables that you can select from, taking the guesswork out of selecting the right table to create your datasets. Then there is Schedule Refresh: if you set up your datasets to refresh every day or every week, you are assured of the latest data when you look at your analysis.
Fast analysis: Fast interactions with your charts and graphs. Charts and graphs built on a SPICE dataset are highly responsive: you can zoom in and out, drill through, and add filters on the fly with little to no delay.
Serverless: QuickSight is completely serverless. Not only is there nothing to install or deploy for QuickSight, but in combination with S3 and Athena, you can have an end-to-end analytics solution without ever starting or managing servers.
Over 400,000 customers use AWS database and analytics services.
While the database and analytics markets have been around for a while, with many mature offerings for customers to choose from, we continue to see customers move to the cloud for a number of reasons, and our recent growth in the database market is evidence of how rapidly the landscape is changing.
Customers move to the cloud to minimize time spent managing infrastructure
Customers are choosing the cloud and migrating more and more of their workloads to it. In the next 10 to 15 years, the majority of computing is going to be done in the cloud. In the fullness of time, very few companies will want to own their own data centers, manage infrastructure whether it is compute, storage, databases or analytics.
Customers move to the cloud for performance, scale, reliability, and cost. Increasingly, new applications need to be globally distributed, support millions of users and devices, work with petabytes of data, run 24/7, and be responsive.
As customers move to the cloud and to microservice architectures, developers are increasingly the ones making technology decisions. As customers move from monolithic apps to microservice architectures with loosely coupled components and DevOps cultures, developers increasingly decide, as part of their application development lifecycle, which frameworks and components to use.
For Developers, AWS offers a number of AI services
AI services don’t require any knowledge of AI or ML
Now, at the next level up, we have a set of AI services. These are really designed for application developers who don't want to get into the weeds of how machine learning operates. They don't want to have to become deep learning experts. They don't want to have to go and label a whole bunch of data. They just want to get stuff done. And you can see right off the bat you get this broad set of capabilities available to you.
For computer vision, using services such as Amazon Rekognition, which provides image analysis and facial detection; Rekognition Video, which provides video analysis, [pathing], and face identification; speech, using Polly, which is a text-to-speech service, the same service that we use to generate the voice of Alexa; or Transcribe, which does it the other way around: it takes speech and turns it into text.
Or in terms of languages, Translate, which takes text in one language and translates it into another; or Comprehend, which looks inside the document at all of the text, takes all of the context, and then allows you to derive better insights and understanding of what that natural language text looks like, for completely unstructured information. All the way through to Amazon Lex, which is a natural language understanding and speech recognition system that a lot of customers are using to build chatbots or use in contact centers as IVR systems.
And Lex is the exact same beating heart that we use to run Alexa for the Echo and other Alexa-enabled devices. So whilst all of these services are designed to work independently and have a broad set of capabilities themselves, the real magic comes as customers start to pull them together. And they can be joined together for some common use cases.
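As a sketch of pulling a couple of these services together with boto3 (the review text is made up), you might extract sentiment and key phrases with Comprehend and then hand the same text to Translate:

```python
import boto3

comprehend = boto3.client("comprehend")
translate = boto3.client("translate")

review = "The checkout flow was slow, but support resolved my issue quickly."

# Sentiment and key phrases from unstructured text, no ML expertise needed.
sentiment = comprehend.detect_sentiment(
    Text=review, LanguageCode="en")["Sentiment"]
phrases = [p["Text"] for p in comprehend.detect_key_phrases(
    Text=review, LanguageCode="en")["KeyPhrases"]]

# Chain services together: translate the same review into Spanish.
spanish = translate.translate_text(
    Text=review, SourceLanguageCode="en",
    TargetLanguageCode="es")["TranslatedText"]

print(sentiment, phrases, spanish)
```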