The event, held on 27th April 2019, was part of the Global Azure Bootcamp and covered Microsoft's Cosmos DB, more specifically:
- Introduction to Cosmos DB, its features, internals, resource models, and request units.
- DEMO: Create a SQL API account, download the sample .NET app, run simple queries.
- Covered the Change Feed and showcased various use-case scenarios.
- Detailed Global Distribution and the implications of the Consistency Models.
- DEMO: Mongo lift-and-shift. Run simple .NET code against MongoDB (in a Docker container) and against Cosmos DB.
- Introduction to TinkerPop graphs.
- DEMO: Graph API. Download the sample .NET app and run simple queries.
https://techspark.mt/global-azure-bootcamp-27th-april-2019/
DAT102 Introduction to Amazon DynamoDB - AWS re:Invent 2012 | Amazon Web Services
Learn why Amazon DynamoDB is the fastest-growing service in AWS history. DynamoDB is a NoSQL database service that lets you scale from one to hundreds of thousands of I/Os per second (and beyond) with the push of a button. It's designed to give you scalability and high performance with minimal administration and enables you to scale your app while keeping costs down. You also learn about the service’s design principles, its history, and about how some of our customers are using DynamoDB in their applications.
AWS re:Invent 2016: Case Study: Librato's Experience Running Cassandra Using ... | Amazon Web Services
At Librato, a SolarWinds company, we run hundreds of Cassandra instances across multiple rings and use it as our primary data store. In the past year, we embarked on a process to upgrade our fleet of Cassandra Amazon EC2 instances from instance store to instances using Amazon EBS and attached elastic network interfaces (ENIs). We find running Cassandra on EBS gives us the flexibility to choose the best instances for the best performance of our workload while saving us significant costs on infrastructure. In this session, we discuss how Librato operates Cassandra on EBS. Topics include how we chose the right instance for our workload, how we use detached EBS volumes and ENI mobility to reduce MTTR, how we use mixed EBS storage types for the best cost/performance tradeoff, how we debug performance issues, and how we continuously monitor Cassandra to get the most from AWS. We also look at performance tradeoffs made in the implementation of storage engines of large data systems like Cassandra.
MongoDB Ops Manager and Kubernetes - James Broadhead | MongoDB
Review the core technologies, such as containers, Kubernetes, and MongoDB Ops Manager. You'll also have a chance to see live demos of MongoDB running on Kubernetes and managed with MongoDB Ops Manager via the MongoDB Enterprise Kubernetes Operator.
SmugMug: From MySQL to Amazon DynamoDB (DAT204) | AWS re:Invent 2013 | Amazon Web Services
SmugMug.com is a popular hosting and commerce platform for photo enthusiasts with hundreds of thousands of subscribers and millions of viewers. Learn how SmugMug uses Amazon DynamoDB to provide customers detailed information about millions of daily image and video views. SmugMug shares code and information about their stats stack, which includes an HTTP interface to Amazon DynamoDB and also interfaces with their internal PHP stack and other tools such as Memcached. Get a detailed picture of lessons learned and the methods SmugMug uses to create a system that is easy to use, reliable, and high performing.
(SDD424) Simplifying Scalable Distributed Applications Using DynamoDB Streams... | Amazon Web Services
DynamoDB Streams provides a stream of all the updates made to your DynamoDB table. It is a simple but extremely powerful primitive that enables developers to easily build solutions like cross-region replication and to host additional materialized views, for instance an Elasticsearch index, on top of DynamoDB tables. In this session we dive deep into the details of DynamoDB Streams and how customers can leverage them to build custom solutions and extend the functionality of DynamoDB. We also give a demo of an example application built on top of DynamoDB Streams to demonstrate their power and simplicity.
Meet Up - Spark Stream Processing + Kafka | Knoldus Inc.
Stream processing is the real-time processing of data continuously, concurrently, and in a record-by-record fashion.
It treats data not as static tables or files, but as a continuous infinite stream of data integrated from both live and historical sources.
In these slides we'll be looking into Spark Stream Processing with Kafka.
Amazon DynamoDB is a fully managed, highly scalable NoSQL database service. We will deep dive into how DynamoDB scaling and partitioning works, how to do data modeling based on access patterns using primitives such as hash/range keys, secondary indexes, conditional writes and query filters. We will also discuss how to use DynamoDB Streams to build cross-region replication and integrate with other services (such as Amazon S3, Amazon CloudSearch, Amazon ElastiCache, Amazon Redshift) to enable logging, search, analytics and caching. You will learn design patterns and best practices on how to use DynamoDB to build highly scalable applications, with the right performance characteristics at the right cost.
Amazon DynamoDB is a fully managed NoSQL database service for applications that need consistent, single-digit millisecond latency at any scale. This talk explores DynamoDB capabilities and benefits in detail and discusses how to get the most out of your DynamoDB database. We go over schema design best practices with DynamoDB across multiple use cases, including gaming, AdTech, IoT, and others. We also explore designing efficient indexes, scanning, and querying, and go into detail on a number of recently released features, including JSON document support, Streams, and more.
(ARC311) Extreme Availability for Mission-Critical Applications | AWS re:Inve... | Amazon Web Services
More and more businesses are deploying their mission-critical applications on AWS, and one of their concerns is how to improve the availability of their services, going beyond traditional availability concepts. In this session, you will learn how to architect different layers of your application, beginning with an extremely available front-end layer with Amazon EC2, Elastic Load Balancing, and Auto Scaling, and going all the way to a protected multitiered information layer, including cross-region replicas for relational and NoSQL databases. The concepts that we will share, using services like Amazon RDS, Amazon DynamoDB, and Amazon Route 53, will provide a framework you can use to keep your application running even with multiple failures. Additionally, you will hear from Magazine Luiza, in an interactive session, on how they run a large e-commerce application with a multiregion architecture using a combination of features and services from AWS to achieve extreme availability.
An overview of Apache Spark and AWS Glue.
Apache Spark is a fast and general-purpose cluster computing system. It provides high-level APIs in Java, Scala, Python and R, and an optimized engine that supports general execution graphs. It also supports a rich set of higher-level tools including Spark SQL for SQL and structured data processing, MLlib for machine learning, GraphX for graph processing, and Spark Streaming.
AWS Glue is a fully managed extract, transform, and load (ETL) service that makes it easy for customers to prepare and load their data for analytics. You can create and run an ETL job with a few clicks in the AWS Management Console. You simply point AWS Glue to your data stored on AWS, and AWS Glue discovers your data and stores the associated metadata (e.g. table definition and schema) in the AWS Glue Data Catalog. Once cataloged, your data is immediately searchable, queryable, and available for ETL.
The Data Pipeline team at Demonware (Activision) routes large amounts of data from various sources to many destinations every day.
Our team always wanted to be able to query processed data for debugging and analytical purposes, but creating large data warehouses was never our priority, since it usually happens downstream.
Amazon Athena is a completely serverless query service that requires no infrastructure setup or complex provisioning. We just needed to save some of our data streams to AWS S3 and define a schema. Just a few simple steps, but in the end we were able to write complex SQL queries against gigabytes of data and get results in seconds.
In this presentation I want to show multiple ways to stream your data to AWS S3, explain some underlying tech, show how to define a schema and finally share some of the best practices we applied.
Analyzing big data quickly and efficiently requires a data warehouse optimized to handle and scale for large datasets. Amazon Redshift is a fast, petabyte-scale data warehouse that makes it simple and cost-effective to analyze all of your data for a fraction of the cost of traditional data warehouses. In this webinar, we take an in-depth look at data warehousing with Amazon Redshift for big data analytics. We cover best practices to take advantage of Amazon Redshift's columnar technology and parallel processing capabilities to deliver high throughput and query performance.
Learning Objectives:
• Get an inside look at Amazon Redshift's columnar technology and parallel processing capabilities
• Learn how to design schemas and load data efficiently
• Learn best practices for workload management, distribution and sort keys, and optimizing queries
Modeling data and best practices for Azure Cosmos DB | Mohammad Asif
Azure Cosmos DB is Microsoft's globally distributed, multi-model database service. In this session we covered data modeling with the NoSQL Cosmos database and how it helps distributed applications maintain high availability, scale across multiple regions, and manage throughput.
Azure Cosmos DB - The Swiss Army NoSQL Cloud Database | BizTalk360
Microsoft Cosmos DB is the Swiss army NoSQL database in the cloud. It is a multi-model, multi-API, globally distributed, highly available, and secure NoSQL database in Azure. In this session, we will explore its capabilities and features through several demos.
Here is a brief introduction to Azure Data Explorer, with many examples using the Kusto dialect and the C# client.
With a particular focus on IIoT contexts and process control data, let's discover how to implement time series analysis in terms of pattern recognition and trend correlation.
Traditional data warehouses become expensive and slow down as the volume of your data grows. Amazon Redshift is a fast, petabyte-scale data warehouse that makes it easy to analyze all of your data using existing business intelligence tools for 1/10th the traditional cost. This session will provide an introduction to Amazon Redshift and cover the essentials you need to deploy your data warehouse in the cloud so that you can achieve faster analytics and save costs. We’ll also cover the recently announced Redshift Spectrum, which allows you to query unstructured data directly from Amazon S3.
Amazon Aurora is a MySQL-compatible relational database engine that combines the speed and availability of high-end commercial databases with the simplicity and cost-effectiveness of open source databases. Amazon Aurora is disruptive technology in the database space, bringing a new architectural model and distributed systems techniques to provide far higher performance, availability, and durability than was previously available using conventional monolithic database techniques. In this session, we dive deep into some of the key innovations behind Amazon Aurora, discuss best practices and migration from other databases to Amazon Aurora, and share early customer experiences from the field.
Traditional data warehouses become expensive and slow down as the volume of your data grows. Amazon Redshift is a fast, petabyte-scale data warehouse that makes it easy to analyze all of your data using existing business intelligence tools for 1/10th the traditional cost. This session will provide an introduction to Amazon Redshift and cover the essentials you need to deploy your data warehouse in the cloud so that you can achieve faster analytics and save costs.
Connector Corner: Automate dynamic content and events by pushing a button | DianaGray10
Here is something new! In our next Connector Corner webinar, we will demonstrate how you can use a single workflow to:
Create a campaign using Mailchimp with merge tags/fields
Send an interactive Slack channel message (using buttons)
Deliver the message to managers and peers, along with a test email for review
But there’s more:
In a second workflow supporting the same use case, you’ll see:
Your campaign sent to target colleagues for approval
If the “Approve” button is clicked, a Jira/Zendesk ticket is created for the marketing design team
But if the "Reject" button is pushed, colleagues will be alerted via Slack message
Join us to learn more about this new, human-in-the-loop capability, brought to you by Integration Service connectors.
Speakers:
Akshay Agnihotri, Product Manager
Charlie Greenberg, Host
Transcript: Selling digital books in 2024: Insights from industry leaders - T... | BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
UiPath Test Automation using UiPath Test Suite series, part 3 | DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 3. In this session, we will cover desktop automation along with UI automation.
Topics covered:
- UI automation introduction
- UI automation sample
- Desktop automation flow
Pradeep Chinnala, Senior Consultant Automation Developer @WonderBotz and UiPath MVP
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Epistemic Interaction - tuning interfaces to provide information for AI support | Alan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
Key Trends Shaping the Future of Infrastructure.pdf | Cheryl Hung
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
This talk covers the key trends across hardware, cloud, and open source, exploring how these areas are likely to mature and develop over the short and long term, and considering how organisations can position themselves to adapt and thrive.
Let's dive deeper into the world of ODC! Ricardo Alves (OutSystems) will join us to tell all about the new Data Fabric. After that, Sezen de Bruijn (OutSystems) will get into the details on how to best design a sturdy architecture within ODC.
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do... | UiPathCommunity
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf | 91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti... | Jeffrey Haguewood
Sidekick Solutions uses Bonterra Impact Management (fka Social Solutions Apricot) and automation solutions to integrate data for business workflows.
We believe integration and automation are essential to user experience and the promise of efficient work through technology. Automation is the critical ingredient to realizing that full vision. We develop integration products and services for Bonterra Case Management software to support the deployment of automations for a variety of use cases.
This video focuses on the notifications, alerts, and approval requests using Slack for Bonterra Impact Management. The solutions covered in this webinar can also be deployed for Microsoft Teams.
Interested in deploying notification automations for Bonterra Impact Management? Contact us at sales@sidekicksolutionsllc.com to discuss next steps.
JMeter webinar - integration with InfluxDB and Grafana | RTTS
Watch this recorded webinar about real-time monitoring of application performance. See how to integrate Apache JMeter, the open-source leader in performance testing, with InfluxDB, the open-source time-series database, and Grafana, the open-source analytics and visualization application.
In this webinar, we will review the benefits of leveraging InfluxDB and Grafana when executing load tests and demonstrate how these tools are used to visualize performance metrics.
Length: 30 minutes
Session Overview
-------------------------------------------
During this webinar, we will cover the following topics while demonstrating the integrations of JMeter, InfluxDB and Grafana:
- What out-of-the-box solutions are available for real-time monitoring of JMeter tests?
- What are the benefits of integrating InfluxDB and Grafana into the load testing stack?
- Which features are provided by Grafana?
- Demonstration of InfluxDB and Grafana using a practice web application
To view the webinar recording, go to:
https://www.rttsweb.com/jmeter-integration-webinar
3. Agenda: Exploring Cosmos DB
What is it?
Internals
Resource Model
Try it out!
DEMO: Create a SQL API account & download the sample .NET app
Change Feed
Global Distribution
Use Cases
Consistency Models
Request Units
DEMO: Mongo - Lift and shift
TinkerPop graphs
DEMO: Graphs
5. History: Project Florence (2010) → DocumentDB (2014/2015) → Azure Cosmos DB (2017)
• Originally started to address the problems faced by large-scale apps inside Microsoft
• Built from the ground up for the cloud
• Used extensively inside Microsoft
• One of the fastest growing services on Azure
6. Azure Cosmos DB: a globally distributed, massively scalable, multi-model database service
• Turnkey global distribution
• Elastic scale-out of storage & throughput
• Guaranteed low latency at the 99th percentile
• Comprehensive SLAs
• Five well-defined consistency models
7. (Build of the previous slide.) Adds the supported data models: key-value, column-family, document, and graph.
8. (Further build.) Data models: key-value, column-family, document, graph. APIs surfaced over them include the Table API and Cosmos DB's API for MongoDB.
9. Your application and Cosmos DB
• Your app consists of your app logic plus a database client library
• It talks to Cosmos DB through the Graph API, the MongoDB API, or any other supported API
• You keep the open-source driver of your choice* and switch backends with a change of connection string*
* Depending on feature supportability
12. Resource Hierarchy
Tenants → Containers → Resource Partitions
CONTAINERS: logical resources "surfaced" to APIs as tables, collections, or graphs, which are made up of one or more physical partitions or servers.
RESOURCE PARTITIONS:
• Consistent, highly available, and resource-governed coordination primitives
• Consist of replica sets, with each replica hosting an instance of the database engine
• Each replica set has a Leader, Followers, and a Forwarder that replicates to remote resource partition(s)
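As a concrete illustration of containers and their provisioned throughput, here is a minimal sketch of creating a partitioned collection with the v2 .NET DocumentDB SDK (the database name "demo", collection id "telemetry", partition key path "/deviceId", and throughput figure are assumptions for this example; client is an already-constructed DocumentClient):

using System;
using System.Collections.ObjectModel;
using Microsoft.Azure.Documents;
using Microsoft.Azure.Documents.Client;

// Define a container ("collection" in the SQL API) partitioned on /deviceId.
DocumentCollection collection = new DocumentCollection
{
    Id = "telemetry",                              // assumed container id
    PartitionKey = new PartitionKeyDefinition
    {
        Paths = new Collection<string> { "/deviceId" }  // assumed partition key
    }
};

// Create it with 1,000 RU/s of provisioned throughput.
ResourceResponse<DocumentCollection> created = client.CreateDocumentCollectionAsync(
    UriFactory.CreateDatabaseUri("demo"),          // assumed database name
    collection,
    new RequestOptions { OfferThroughput = 1000 }
).Result;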
29. Multi-Master: Read/Write in Any Region
Benefits:
• Write scalability around the world
• Low-latency writes (<10 ms at P99 for a 1 KB document) around the world
• 99.999% high availability around the world
• Well-defined consistency models
• Automatic conflict management
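A minimal sketch of opting a client into multi-region writes with the v2 .NET SDK (the endpoint, key, and region preferences are placeholders; the UseMultipleWriteLocations property assumes SDK 2.x):

using System;
using Microsoft.Azure.Documents;
using Microsoft.Azure.Documents.Client;

// Enable multi-region writes and prefer the closest regions.
ConnectionPolicy policy = new ConnectionPolicy
{
    UseMultipleWriteLocations = true
};
policy.PreferredLocations.Add(LocationNames.WestEurope);   // first preference
policy.PreferredLocations.Add(LocationNames.NorthEurope);  // fallback

DocumentClient client = new DocumentClient(
    new Uri("https://<account>.documents.azure.com:443/"),  // placeholder endpoint
    "<primary-key>",                                        // placeholder key
    policy);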
34. Internet of Things: Telemetry & Sensor Data
Azure IoT Hub ingests device events, Apache Storm on Azure HDInsight processes them, and Azure Cosmos DB stores the telemetry and device state. Azure Web Jobs (a change feed processor) fan changes out to an Azure Function (latest state) and to Azure Data Lake (archival).
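For the "change feed processor" box above, a minimal sketch of reading the change feed with the v2 .NET SDK (the collection link and partition key range id are placeholders; production apps usually use the Change Feed Processor library rather than polling a single range):

using System;
using Microsoft.Azure.Documents;
using Microsoft.Azure.Documents.Client;
using Microsoft.Azure.Documents.Linq;

// Read all changes for one partition key range, starting from the beginning.
IDocumentQuery<Document> query = client.CreateDocumentChangeFeedQuery(
    "dbs/iot/colls/telemetry",           // placeholder collection link
    new ChangeFeedOptions
    {
        PartitionKeyRangeId = "0",       // real apps enumerate all ranges
        StartFromBeginning = true,
        MaxItemCount = 100
    });

while (query.HasMoreResults)
{
    FeedResponse<Document> changes = query.ExecuteNextAsync<Document>().Result;
    foreach (Document changed in changes)
    {
        Console.WriteLine(changed.Id);   // hand each change to downstream consumers
    }
}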
35. Retail Product Catalogs
An Azure Web App (the e-commerce app) backed by Azure Cosmos DB for the product catalog and for session state, Azure Search for the full-text index, and Azure Storage for logs and static catalog content.
37. Real-Time Personalization / Recommendations
Components: Azure API Apps, Azure Cosmos DB as the event store, Azure Web Jobs (change feed processor), Azure Data Lake Storage as an archive of events, Azure Machine Learning, and Azure Cosmos DB as a low-latency user profile store.
39. Well-Defined Consistency Models
Strong: Linearizability (once an operation completes, it is visible to all).
Bounded Staleness: Consistent prefix; reads lag behind writes by at most k prefixes or a time interval t. Similar properties to strong consistency (except within the staleness window), while preserving 99.99% availability and low latency.
Session: Consistent prefix; within a session you get monotonic reads, monotonic writes, read-your-writes, and write-follows-reads. Predictable consistency for a session, with high read throughput and low latency.
Consistent Prefix: Reads never see out-of-order writes (no gaps).
Eventual: Potential for out-of-order reads; the lowest read cost of all consistency levels.
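A minimal sketch of selecting the consistency level when constructing a client; the .NET SDK lets you relax (never strengthen) the account's default level. The endpoint and key are placeholders:

using System;
using Microsoft.Azure.Documents;
using Microsoft.Azure.Documents.Client;

// Request Session consistency for everything this client does.
DocumentClient client = new DocumentClient(
    new Uri("https://<account>.documents.azure.com:443/"),  // placeholder endpoint
    "<primary-key>",                                        // placeholder key
    connectionPolicy: null,
    desiredConsistencyLevel: ConsistencyLevel.Session);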
41. Well-Defined Consistency Models
Session consistency: the session is controlled using a "session token".
• Session tokens are automatically cached by the client SDK
• A token can be pulled out and used to override other requests (to preserve the session across multiple clients)

using System;
using Microsoft.Azure.Documents;
using Microsoft.Azure.Documents.Client;

string sessionToken;

// Write with one client and capture the session token from the response.
using (DocumentClient client = new DocumentClient(new Uri(""), ""))
{
    ResourceResponse<Document> response = client.CreateDocumentAsync(
        collectionLink,
        new { id = "an id", value = "some value" }
    ).Result;
    sessionToken = response.SessionToken;
}

// Read with a second client, passing the captured token so the read
// observes the earlier write (read-your-writes across clients).
using (DocumentClient client = new DocumentClient(new Uri(""), ""))
{
    ResourceResponse<Document> read = client.ReadDocumentAsync(
        documentLink,
        new RequestOptions { SessionToken = sessionToken }
    ).Result;
}
44. Billing Model
Two components: storage + throughput
You are billed on consumed storage and provisioned throughput
Collections in a database can share throughput
Unit prices (for most Azure regions)*:
• SSD storage: $0.25 per GB per month
• Provisioned throughput (single-region writes): $0.008/hour per 100 RU/s
• Provisioned throughput (multi-region writes): $0.016/hour per 100 multi-master RU/s
* Pricing may vary by region; for up-to-date pricing, see: https://azure.microsoft.com/pricing/details/cosmos-db/
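A rough worked example from the unit prices above (illustrative only; assumes a 730-hour month and single-region writes):
1,000 RU/s provisioned = 10 x $0.008/hour = $0.08/hour ≈ $58.40/month
50 GB of SSD storage = 50 x $0.25 = $12.50/month
Total ≈ $70.90/month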
45. Request Units
Request Units (RUs) are a rate-based currency (e.g. 1,000 RU/s) that abstracts the physical resources (% CPU, % memory, % IOPS) consumed when serving requests.
46. Request Units
Each request consumes a number of RUs:
• GET (read): approx. 1 RU for a 1 KB document
• POST / PUT (write): approx. 5 RU for a 1 KB document
• Query: depends on the query and the documents involved
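A quick sizing sketch using the approximations above (illustrative only): an app performing 500 reads/s and 100 writes/s of 1 KB documents needs roughly 500 x 1 RU + 100 x 5 RU = 1,000 RU/s of provisioned throughput.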
47. Request Units: Provisioned Throughput
• Provisioned in terms of RU/s, e.g. 1,000 RU/s
• Billed for the highest RU/s provisioned in each hour
• Easy to increase and decrease on demand
• Rate limiting is based on the amount of throughput provisioned
• Background processes such as TTL expiration and index transformations are scheduled when the system is quiescent
• Minimum throughput scales with storage: 40 RU/s per 1 GB of data
[Chart: incoming requests plotted against the provisioned min/max RU/s; below the provisioned rate there is no rate limiting and background operations run, above it requests are rate limited and the SDK retries.]
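When requests exceed the provisioned rate, the service rejects them with HTTP 429 ("Request rate too large") and the client SDK retries automatically. A minimal sketch of tuning that retry behavior in the v2 .NET SDK (the endpoint and key are placeholders):

using System;
using Microsoft.Azure.Documents.Client;

// Retry throttled (HTTP 429) requests up to 9 times, waiting at most
// 30 seconds in total across retries before surfacing the error.
ConnectionPolicy policy = new ConnectionPolicy();
policy.RetryOptions.MaxRetryAttemptsOnThrottledRequests = 9;
policy.RetryOptions.MaxRetryWaitTimeInSeconds = 30;

DocumentClient client = new DocumentClient(
    new Uri("https://<account>.documents.azure.com:443/"),  // placeholder endpoint
    "<primary-key>",                                        // placeholder key
    policy);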
54. Example property graph:
• Vertex "Kobe Bryant" (label: person; properties: age 39, height 6'6")
• Vertex "Los Angeles Lakers" (label: team; properties: state CA)
• Award vertices "NBA Champion" for 2000, 2001, 2002, 2009, and 2010 (label: award; property: obtained <year>)
• Edge labeled isPartOf from Kobe Bryant to the Los Angeles Lakers
• Edges labeled hasNbaChampionship from Kobe Bryant to each award vertex
55. The same graph, extended:
• New vertex "Oscar 2018" (label: award; properties: obtained 2018, category "Best Animated Short Film")
• New edge labeled hasAcademyAward from Kobe Bryant to Oscar 2018
56. Extended again:
• New vertex "Tom Cruise" (label: person; properties: awards null)
57. Extended once more:
• New vertex "Hollywood Celebrity" (label: status)
• New edges labeled status connecting both Kobe Bryant and Tom Cruise to Hollywood Celebrity
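The Graphs demo runs simple queries against a graph like this one. A sketch of what those Gremlin (TinkerPop) traversals could look like; the vertex ids used here are assumptions, since the slides only show display names:

// Count Kobe Bryant's NBA championships.
g.V('kobe-bryant').out('hasNbaChampionship').count()

// Which team is he part of?
g.V('kobe-bryant').out('isPartOf')

// In which years did he obtain awards (championships and the Oscar)?
g.V('kobe-bryant').out().hasLabel('award').values('obtained')

// Who shares the "Hollywood Celebrity" status?
g.V('hollywood-celebrity').in('status')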
Editor's Notes
Azure Cosmos DB offers the first globally distributed, multi-model database service for building planet scale apps. It’s been powering Microsoft’s internet-scale services for years, and now it’s ready to launch yours.
Only Azure Cosmos DB makes global distribution turn-key.
You can add Azure locations to your database anywhere across the world, at any time, with a single click. Cosmos DB will seamlessly replicate your data and make it highly available.
Cosmos DB allows you to scale throughput and storage elastically, and globally! You only pay for the throughput and storage you need – anywhere in the world, at any time.
The number of RU’s each operation consumes depends on many factors which include:
Document size
Number of indexed fields
Type of indexes
Consistency model choice
Not all queries will consume equal numbers of RU’s. Some operations are more computationally complex or require scans through more documents and therefore use more RU’s.