SlideShare a Scribd company logo
1 of 41
A sneak peek into scalable DB setups
for various applications
Dr. Parinaz Ameri
Agenda
● A brief history
● Basic requirements of Modern DBs
● Industrial usage of NoSQL main models
● What to expect?
Navigational DBs
1980
Rise of relational
DBs
1990 2000 2010 2020
3
General Characteristics
Relational Schema SQL Language
4
Orders
Customers
Order Lines
Credit Cards
5
Impedance Mismatch
Id Name zip_code
1 Rick 30062
2 Lisa 30074
3 Sam 30006
6
Contacts:
Normalization and Joins
Id Phone_Number
1 555-111-1234
2 555-222-1234
2 555-345-1234
3 555-333-1234
Numbers:
SELECT Name, Phone_Number FROM contacts LEFT JOIN Numbers On
Contacts.contact_id=Numbers.contact_id WHERE contacts.contact_id=3
7
Computation and Joins
Source: xinuos
Navigational DBs
1980
Rise of relational
DBs
1990
Object-oriented
DBs
Dominance of
relationalen DBs
2000 2010 2020
Rise of
Internet
Web2.0
IoT
Social Media
8
9
10
BigTable
DynamoDB
NoSQL Term
11
NoSQL-Definition?
12
NoSQL refers to a series of database management concepts that process data in a
performant and reliable manner [1].
Common Characteristics of NoSQL DBs
13
Non-Relational
Schemaless
Cluster-Friendly
Elimination of
Join Operations
Weaker
Concurrency Model
Big Data: Volume
14
Scale up
Scale up vs. Scale out
15
Horizontal Scalability
16
Source: dzone
Linear Scalability
17
Source: DTBD
Shared-Nothing Architecture
18
Source: DTBD
Sharding
19
Source: DTBD
Replication
20
Source: DTBD
Common Characteristics of NoSQL DBs
21
Non-Relational
Schemaless
Cluster-Friendly
Elimination of
Join Operations
Weaker
Concurrency Model
ACID Transactions
● Atomicity: “alle or nothing”
○ No interruption during the transaction
● Consistency: “valid data”
○ Bring the DB from one consistent state
to another consistent state
● Isolation: “one after another”
○ Concurrent transactions would be serialized
● Durability: “no lost”
○ Ensures that no committed transaction
will be lost
22
Orders
Customers
Order Lines
Credit Cards
23
Browser Server
Database
Get Get
Post
Post
Get Get
Post
Post
Get Get
Post
Post
Get Get
Post
Post
Get Get
Post
Post
Offline Lock
V001
V001 V001
V001
V002 V001
31
Consistency
Replication
Logical
CAP Theorem (Brewer, Lynch)
● Consistency:
○ Do all applications see the same data?
● Availability:
○ Can I interact with the system in case of a failure?
● Partitioning:
○ When two segments of the system cannot communicate with each other, could it still continue working?
■ If yes, consistency is sacrificed.
■ If no, availability is sacrificed.
32
CAP Theorem
33
34
35
Sam Kam
ACID vs. BASE
BASE:
● Basic Availability
● Soft-state
● Eventual Consistency
36
37
Source: DTBD
Main Data Models
39
Graph
Columnar
Document
Key-Value
40
Columnar
Document
Key-Value
Graph
Aggregation-Oriented
DBs
ACID
Source References
[xinuos] http://osr507doc.xinuos.com/en/PERFORM/disk_IO_mech.html
[dzone] https://dzone.com/articles/vertical-scaling-and-horizontal-scaling-in-aws
[DTBD] Big Data Principles and Paradigms/Chapter 6: Database Techniques for Big Data/
Parinaz Ameri
41

More Related Content

Similar to Introduction to NoSQL databases

3170722_BDA_GTU_Study_Material_Presentations_Unit-3_29092021094744AM.pdf
3170722_BDA_GTU_Study_Material_Presentations_Unit-3_29092021094744AM.pdf3170722_BDA_GTU_Study_Material_Presentations_Unit-3_29092021094744AM.pdf
3170722_BDA_GTU_Study_Material_Presentations_Unit-3_29092021094744AM.pdf
KrishnaShah908060
 
How to Place Data at the Center of Digital Transformation in BFSI
How to Place Data at the Center of Digital Transformation in BFSIHow to Place Data at the Center of Digital Transformation in BFSI
How to Place Data at the Center of Digital Transformation in BFSI
Denodo
 
I.J. Information Technology and Computer Science, 2016, 12, 59.docx
I.J. Information Technology and Computer Science, 2016, 12, 59.docxI.J. Information Technology and Computer Science, 2016, 12, 59.docx
I.J. Information Technology and Computer Science, 2016, 12, 59.docx
wilcockiris
 
365 main overview
365 main overview365 main overview
365 main overview
Tom Guyton
 
Mastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkMastering Customer Data on Apache Spark
Mastering Customer Data on Apache Spark
Caserta
 

Similar to Introduction to NoSQL databases (20)

GraphTalks Rome - Introducing Neo4j
GraphTalks Rome - Introducing Neo4jGraphTalks Rome - Introducing Neo4j
GraphTalks Rome - Introducing Neo4j
 
The Agile Analyst: Solving the Data Problem with Virtualization
The Agile Analyst: Solving the Data Problem with VirtualizationThe Agile Analyst: Solving the Data Problem with Virtualization
The Agile Analyst: Solving the Data Problem with Virtualization
 
Structure 2014 - Disrupting the data center - Intel sponsor workshop
Structure 2014 - Disrupting the data center - Intel sponsor workshopStructure 2014 - Disrupting the data center - Intel sponsor workshop
Structure 2014 - Disrupting the data center - Intel sponsor workshop
 
Your big data audience insight big data show 24 apr 2013
Your big data audience insight big data show 24 apr 2013Your big data audience insight big data show 24 apr 2013
Your big data audience insight big data show 24 apr 2013
 
3170722_BDA_GTU_Study_Material_Presentations_Unit-3_29092021094744AM.pdf
3170722_BDA_GTU_Study_Material_Presentations_Unit-3_29092021094744AM.pdf3170722_BDA_GTU_Study_Material_Presentations_Unit-3_29092021094744AM.pdf
3170722_BDA_GTU_Study_Material_Presentations_Unit-3_29092021094744AM.pdf
 
Findability Day 2014 Neo4j how graph data boost your insights
Findability Day 2014 Neo4j how graph data boost your insightsFindability Day 2014 Neo4j how graph data boost your insights
Findability Day 2014 Neo4j how graph data boost your insights
 
DataStax
DataStaxDataStax
DataStax
 
The Connected Data Imperative: Why Graphs? at Neo4j GraphDay New York City
The Connected Data Imperative: Why Graphs? at Neo4j GraphDay New York CityThe Connected Data Imperative: Why Graphs? at Neo4j GraphDay New York City
The Connected Data Imperative: Why Graphs? at Neo4j GraphDay New York City
 
Graph all the things - PRathle
Graph all the things - PRathleGraph all the things - PRathle
Graph all the things - PRathle
 
Beyond the Basics 3: Introduction to the MongoDB BI Connector
Beyond the Basics 3: Introduction to the MongoDB BI ConnectorBeyond the Basics 3: Introduction to the MongoDB BI Connector
Beyond the Basics 3: Introduction to the MongoDB BI Connector
 
How to Place Data at the Center of Digital Transformation in BFSI
How to Place Data at the Center of Digital Transformation in BFSIHow to Place Data at the Center of Digital Transformation in BFSI
How to Place Data at the Center of Digital Transformation in BFSI
 
Webinar: How Financial Services Organizations Use MongoDB
Webinar: How Financial Services Organizations Use MongoDBWebinar: How Financial Services Organizations Use MongoDB
Webinar: How Financial Services Organizations Use MongoDB
 
RDBMS to Graphs
RDBMS to GraphsRDBMS to Graphs
RDBMS to Graphs
 
I.J. Information Technology and Computer Science, 2016, 12, 59.docx
I.J. Information Technology and Computer Science, 2016, 12, 59.docxI.J. Information Technology and Computer Science, 2016, 12, 59.docx
I.J. Information Technology and Computer Science, 2016, 12, 59.docx
 
Integration intervention: Get your apps and data up to speed
Integration intervention: Get your apps and data up to speedIntegration intervention: Get your apps and data up to speed
Integration intervention: Get your apps and data up to speed
 
365 main overview
365 main overview365 main overview
365 main overview
 
SQL Training Institute in Ambala ! Batra Computer Centre
SQL Training Institute in Ambala ! Batra Computer CentreSQL Training Institute in Ambala ! Batra Computer Centre
SQL Training Institute in Ambala ! Batra Computer Centre
 
Mastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkMastering Customer Data on Apache Spark
Mastering Customer Data on Apache Spark
 
Risk Analytics Using Knowledge Graphs / FIBO with Deep Learning
Risk Analytics Using Knowledge Graphs / FIBO with Deep LearningRisk Analytics Using Knowledge Graphs / FIBO with Deep Learning
Risk Analytics Using Knowledge Graphs / FIBO with Deep Learning
 
Data Virtualization for Data Architects (Australia)
Data Virtualization for Data Architects (Australia)Data Virtualization for Data Architects (Australia)
Data Virtualization for Data Architects (Australia)
 

Recently uploaded

CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)
Wonjun Hwang
 
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
Muhammad Subhan
 
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
panagenda
 

Recently uploaded (20)

Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
How we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfHow we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdf
 
Easier, Faster, and More Powerful – Notes Document Properties Reimagined
Easier, Faster, and More Powerful – Notes Document Properties ReimaginedEasier, Faster, and More Powerful – Notes Document Properties Reimagined
Easier, Faster, and More Powerful – Notes Document Properties Reimagined
 
CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)CORS (Kitworks Team Study 양다윗 발표자료 240510)
CORS (Kitworks Team Study 양다윗 발표자료 240510)
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
 
Microsoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - QuestionnaireMicrosoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - Questionnaire
 
Vector Search @ sw2con for slideshare.pptx
Vector Search @ sw2con for slideshare.pptxVector Search @ sw2con for slideshare.pptx
Vector Search @ sw2con for slideshare.pptx
 
JavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuideJavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate Guide
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 
2024 May Patch Tuesday
2024 May Patch Tuesday2024 May Patch Tuesday
2024 May Patch Tuesday
 
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cf
 
Oauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoftOauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoft
 
Introduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptxIntroduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptx
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024
 
UiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overviewUiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overview
 
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
 

Introduction to NoSQL databases