[第2回 Azure Cosmos DB 勉強会] Data modelling and partitioning in Azure Cosmos DB (Azure Cosmos DB でのデータモデリングとパーティション分割)

Data modelling and partitioning in Azure
Cosmos DB
(Azure Cosmos DB でのデータモデリングとパーティション分割)

What is Azure Cosmos DB?
Non-relational and horizontally scalable

horizontally scalable

non-relational

non-relational
and
horizontally scalable

So is Azure Cosmos DB suitable for relational
workloads?

Let's look at a concrete example

Identifying the operations we have to serve

Now let's implement this model on Azure Cosmos DB!

Starting with the Customer entity

To embed or to reference?
-
-
-
-
-
-

What is partitioning?
logical partitions

Andrew
Theo
Mark
TimDeborah Luis

Max size: 20 GB
Max size: 2 MB

Andrew TheoMarkTimDeborah Luis
SELECT * FROM c WHERE c.username = 'Mark'
our partition key

Andrew TheoMarkTimDeborah Luis
SELECT * FROM c WHERE c.favoriteColor = 'orange'
?

Choosing a partition key for customers
customers
PK: ?

Choosing a partition key for customers
customers
PK: id

Product categories
productCategories
PK: ?

Product categories
productCategories
PK: ?
SELECT * FROM c

Product categories
productCategories
PK: type

Product tags
productTags
PK: ?

Product tags
productTags
PK: type

Products
products
PK: ?
CategoryA CategoryCCategoryB
SELECT * FROM c WHERE c.categoryId = 'CategoryA'

Products
products
PK: categoryId
category name?
tag names?

Products: how to return category and tag names?
products
SELECT * FROM c WHERE c.categoryId = 'CategoryA'
productCategories
SELECT c.name FROM c WHERE c.id = 'CategoryA'
productTags
SELECT * FROM c
WHERE c.id IN ('<tagId1>', '<tagId2>', '<tagId3>')

Products: denormalizing category and tag names
products
PK: categoryId

Products: keeping everything in sync
productCategories
productTags
products

Sales orders
salesOrders
PK: ?

Sales orders
salesOrders
PK: ?
CustomerA CustomerCCustomerB
SELECT * FROM c WHERE c.customerId = 'CustomerA'

Sales orders
salesOrders
PK: customerId

Sales orders
salesOrders
PK: customerId
customers
PK: id

Mixing entities in the same container?

Sales orders: mixing with customers
customers
PK: id

customers
PK: customerId

CustomerA
CustomerC
CustomerB
customer sales orders
customers
PK: customerId

Sales orders
customers
PK: customerId
SELECT * FROM c WHERE c.customerId = 'CustomerA'
AND c.type = 'salesOrder'

Sales orders
customers
PK: customerId

Denormalizing the count of sales orders per customer

CustomerA
CustomerC
CustomerB
customer sales orders
customers
PK: customerId

CustomerA
CustomerC
CustomerB
update the customer add a sales order
customers
PK: customerId

CustomerA
CustomerC
CustomerB
update the customer add a sales order

Sales orders
customers
PK: customerId
SELECT * FROM c WHERE c.type = 'customer'
ORDER BY c.salesOrderCount DESC

Our final design
customers
PK: customerId
productCategories
PK: type
productTags
PK: type
products
PK: categoryId

Our final design, optimized!
customers
PK: customerId
productMeta
PK: type
products
PK: categoryId

Going further
https://docs.microsoft.com/azure/cosmos-db/modeling-data
https://docs.microsoft.com/azure/cosmos-db/how-to-model-partition-example
https://devblogs.microsoft.com/cosmosdb/data-modeling-and-partitioning-for-relational-workloads/
https://github.com/AzureCosmosDB/labs/blob/master/readme.md
https://github.com/AzureCosmosDB/labs/blob/master/decks/Data-Modeling.pptx

[第2回 Azure Cosmos DB 勉強会] Data modelling and partitioning in Azure Cosmos DB (Azure Cosmos DB でのデータモデリングとパーティション分割)

More Related Content

What's hot

Similar to [第2回 Azure Cosmos DB 勉強会] Data modelling and partitioning in Azure Cosmos DB (Azure Cosmos DB でのデータモデリングとパーティション分割)

More from Naoki (Neo) SATO

Recently uploaded

[第2回 Azure Cosmos DB 勉強会] Data modelling and partitioning in Azure Cosmos DB (Azure Cosmos DB でのデータモデリングとパーティション分割)