SlideShare a Scribd company logo
NoSQL Evaluation
By:
Karthik Kamath G
karthikkmth50@gmail.com
Bens John
bensjohn723@gmail.com
Contents
O Introduction
O Features
O RDBMS
O Data Models.
O Query Possibilities.
O Concurrency control.
O Partioning
O Replication and consistency.
Introduction
O SQL= Traditional Relational Database.
O NoSQL = No traditional Relational
Database.
O No SQL != Do not use Structured Query
Language.
O NoSQL = Not only SQL.
O Not every data management/ analysis
problem is solved using traditional
RDBMS.
O BIG Data.
Features
O Convenient
O Multi-User
O Safe
O Persistent
O Reliable
O Massive
O Efficient
Relational DB
O Adding or removing a feature to a blog is
not possible without system unavailability.
O Due to their normalized data model and
ACID support RDBMS is not useful in
Web2.0 domains, because joins and locks
influence performance in distributed
systems negatively.
O These databases are typically based on
consistency instead of availability.
O Replication techniques are limited .
NoSQL providers
Key-Value Stores
O Data is addressed by a unique key.
O Values are isolated from each other.
O Schema Free – New values can be added
O Data/values are opaque to the system.
O Example :
Voldemart, Redis, Membase.
O Pros :
Simple Data Model.
Scalable.
O Cons:
Create your own “Foreign Keys”.
Poor for complex data.
Column Family
O Based on BIGTABLE : Google’s
distributed storage system for structured
data.
O Arbitrary no. of key value pairs can be
stored within rows.
O Relationship to be implemented by App
Logic.
O Columns can be grouped to form column
families
O Examples :
HBase, Cassandra, Hyper table
O Hbase & Hypertext – Open source
implementations.
O Cassandra – Additional super column.
O Multiple versions of the same data are
stored in chronological order.
O Pros :
Scalable.
O Cons :
Poor for interconnected data.
Document Databases
O Data Model :
collection of documents.
document is a key value collection.
within a document keys should be
unique.
JSON: Java Script Object Notation
O Example :
CouchDB, MongoDB.
O Pros:
Simple, Powerful Data Model.
Scalable.
O Cons:
Poor for interconnected Data.
Query model limits to keys and
values.
Graph Databases
O Data Model :
Nodes and Relationships.
O Examples :
neo4j, Orient DB.
O FlockDB –Twitter-One way relationship.
O Location Based system, Navigation
systems which uses complex relations.
O Pros :
Powerful Data Model.
Easy to query.
Friend of a friend Problem is solved.
Query Possibilities
For Key Value pairs:
O Key based put, get and delete operations.
O Membase offers REST API.
For Document Databases
O Document stores offer much richer APIs.
O Operations like and,or and between can be
used
O MongoDB supports additional operations like
count and distinct.
O Riak offers functionalities to traverse links
between documents easily.
O UnQL Project
For Column Family
O Provide range queries and some
operations like "in", "and/or" and regular
expression.
O Even if every column family store offers a
SQL like query language in order to
provide a more convenient user
interaction, only row keys and indexed
values can be considered in where-
clauses, as well.
For graph database
O SPARQL is a popular, declarative query
language with a very simple syntax
providing graph pattern matching.
O Gremlin is an imperative programming
language used to perform graph
traversals based on XPA TH.
Concurrency Control
O Traditional databases use pessimistic
consistency strategies with exclusive
access on a dataset.
O Multiversion concurrency control
(MVCC) relaxes strict consistency in
favor of performance.
O In order to cope with two or more
conflicting write operations, every process
stores, additional to the new value, a link
to the version the process read before.
Partitioning
O The first strategy distributes datasets by the
range of their keys.
O In order to find a certain key, clients have to
contact the routing server for getting the
partition table.
O The second one is by Consistent Hashing.
O Neighbored keys are distributed randomly
across the cluster.
O Graph algorithms can help identifying
hotspots of strongly connected nodes in the
graph schema.
Replication and Consistency
O BASE systems.
O Availability at the cost of consistency.
O Can be inconsistent, but the system
must be high available and high
performant at all time.
O NoSQL systems are not only full ACID or
full BASE systems.
O Big Data is the only store, which supports
full consistency and replication natively.
Conclusion
O Choose the proper data model.
O Queries.
O Key value should be used for simple and
fast operations.
O Column DB for large data.
O Graph DB for entities and relationship.
O Document DB offers flexible Data model.
Thank
You.

More Related Content

What's hot

Find your way in Graph labyrinths
Find your way in Graph labyrinthsFind your way in Graph labyrinths
Find your way in Graph labyrinths
Daniel Camarda
 
Semantic Media Management with Apache Marmotta
Semantic Media Management with Apache MarmottaSemantic Media Management with Apache Marmotta
Semantic Media Management with Apache Marmotta
Thomas Kurz
 
RDF and the Semantic Web -- Joanna Pszenicyn
RDF and the Semantic Web -- Joanna PszenicynRDF and the Semantic Web -- Joanna Pszenicyn
RDF and the Semantic Web -- Joanna Pszenicyn
Richard.Sapon-White
 

What's hot (20)

Multi-model databases and node.js
Multi-model databases and node.jsMulti-model databases and node.js
Multi-model databases and node.js
 
Find your way in Graph labyrinths
Find your way in Graph labyrinthsFind your way in Graph labyrinths
Find your way in Graph labyrinths
 
Are Linked Datasets fit for Open-domain Question Answering? A Quality Assessment
Are Linked Datasets fit for Open-domain Question Answering? A Quality AssessmentAre Linked Datasets fit for Open-domain Question Answering? A Quality Assessment
Are Linked Datasets fit for Open-domain Question Answering? A Quality Assessment
 
An lsh based blocking approach with a homomorphic matching technique for priv...
An lsh based blocking approach with a homomorphic matching technique for priv...An lsh based blocking approach with a homomorphic matching technique for priv...
An lsh based blocking approach with a homomorphic matching technique for priv...
 
Facilitating Busines Interoperability from the Semantic Web
Facilitating Busines Interoperability from the Semantic WebFacilitating Busines Interoperability from the Semantic Web
Facilitating Busines Interoperability from the Semantic Web
 
Analytical data processing
Analytical data processingAnalytical data processing
Analytical data processing
 
Getting Started with the Alma API
Getting Started with the Alma APIGetting Started with the Alma API
Getting Started with the Alma API
 
Four NoSQL Databases You Should Know
Four NoSQL Databases You Should KnowFour NoSQL Databases You Should Know
Four NoSQL Databases You Should Know
 
Multi model-databases
Multi model-databasesMulti model-databases
Multi model-databases
 
XMl
XMlXMl
XMl
 
Introduction to NoSQL Database
Introduction to NoSQL DatabaseIntroduction to NoSQL Database
Introduction to NoSQL Database
 
[Paper Review] DRAGNN
[Paper Review] DRAGNN[Paper Review] DRAGNN
[Paper Review] DRAGNN
 
guacamole: an Object Document Mapper for ArangoDB
guacamole: an Object Document Mapper for ArangoDBguacamole: an Object Document Mapper for ArangoDB
guacamole: an Object Document Mapper for ArangoDB
 
Introduction to LDP in Apache Marmotta
Introduction to LDP in Apache MarmottaIntroduction to LDP in Apache Marmotta
Introduction to LDP in Apache Marmotta
 
Data exchange over internet (XML vs JSON)
Data exchange over internet (XML vs JSON)Data exchange over internet (XML vs JSON)
Data exchange over internet (XML vs JSON)
 
Semantic Media Management with Apache Marmotta
Semantic Media Management with Apache MarmottaSemantic Media Management with Apache Marmotta
Semantic Media Management with Apache Marmotta
 
Ajax
AjaxAjax
Ajax
 
FOXX - a Javascript application framework on top of ArangoDB
FOXX - a Javascript application framework on top of ArangoDBFOXX - a Javascript application framework on top of ArangoDB
FOXX - a Javascript application framework on top of ArangoDB
 
Linked Media and Data Using Apache Marmotta
Linked Media and Data Using Apache MarmottaLinked Media and Data Using Apache Marmotta
Linked Media and Data Using Apache Marmotta
 
RDF and the Semantic Web -- Joanna Pszenicyn
RDF and the Semantic Web -- Joanna PszenicynRDF and the Semantic Web -- Joanna Pszenicyn
RDF and the Semantic Web -- Joanna Pszenicyn
 

Viewers also liked

Evaluating the Impact of Android Best Practices on Energy Consumption
Evaluating the Impact of Android Best Practices on Energy ConsumptionEvaluating the Impact of Android Best Practices on Energy Consumption
Evaluating the Impact of Android Best Practices on Energy Consumption
Karthik Mohan
 
Interplanetary internet_sufi
Interplanetary internet_sufiInterplanetary internet_sufi
Interplanetary internet_sufi
Karthik Mohan
 
Gigabit wireless fidelity (gi fi)_sjec
Gigabit wireless fidelity (gi fi)_sjecGigabit wireless fidelity (gi fi)_sjec
Gigabit wireless fidelity (gi fi)_sjec
Karthik Mohan
 
Family Practice Guide to Oral Renin Inhibitors (
Family Practice Guide to Oral Renin Inhibitors (Family Practice Guide to Oral Renin Inhibitors (
Family Practice Guide to Oral Renin Inhibitors (
Russell N Hatcher, Pharm.D.
 
A Survey on Localization of Wireless Sensors
A Survey on Localization of Wireless SensorsA Survey on Localization of Wireless Sensors
A Survey on Localization of Wireless Sensors
Karthik Mohan
 

Viewers also liked (17)

Evaluating the Impact of Android Best Practices on Energy Consumption
Evaluating the Impact of Android Best Practices on Energy ConsumptionEvaluating the Impact of Android Best Practices on Energy Consumption
Evaluating the Impact of Android Best Practices on Energy Consumption
 
Programma lista Salatiello
Programma lista SalatielloProgramma lista Salatiello
Programma lista Salatiello
 
Monitoren groep 3
Monitoren groep 3Monitoren groep 3
Monitoren groep 3
 
Transaction processing hub
Transaction processing hubTransaction processing hub
Transaction processing hub
 
Personal shopping assistant
Personal shopping assistantPersonal shopping assistant
Personal shopping assistant
 
Interplanetary internet_sufi
Interplanetary internet_sufiInterplanetary internet_sufi
Interplanetary internet_sufi
 
MOSAICING IMAGES_sjec
MOSAICING IMAGES_sjecMOSAICING IMAGES_sjec
MOSAICING IMAGES_sjec
 
02. componentes y armado de pc
02. componentes y armado de pc02. componentes y armado de pc
02. componentes y armado de pc
 
Gigabit wireless fidelity (gi fi)_sjec
Gigabit wireless fidelity (gi fi)_sjecGigabit wireless fidelity (gi fi)_sjec
Gigabit wireless fidelity (gi fi)_sjec
 
Family Practice Guide to Oral Renin Inhibitors (
Family Practice Guide to Oral Renin Inhibitors (Family Practice Guide to Oral Renin Inhibitors (
Family Practice Guide to Oral Renin Inhibitors (
 
11. monitores crt
11. monitores crt11. monitores crt
11. monitores crt
 
Quantum cryptography data
Quantum cryptography dataQuantum cryptography data
Quantum cryptography data
 
SEA
SEASEA
SEA
 
towards 3d internet
towards 3d internettowards 3d internet
towards 3d internet
 
A Survey on Localization of Wireless Sensors
A Survey on Localization of Wireless SensorsA Survey on Localization of Wireless Sensors
A Survey on Localization of Wireless Sensors
 
Animatronics_sjec
Animatronics_sjecAnimatronics_sjec
Animatronics_sjec
 
Optical coherence tomography
Optical coherence tomographyOptical coherence tomography
Optical coherence tomography
 

Similar to NoSql evaluation

Similar to NoSql evaluation (20)

Why no sql ? Why Couchbase ?
Why no sql ? Why Couchbase ?Why no sql ? Why Couchbase ?
Why no sql ? Why Couchbase ?
 
No sq lv2
No sq lv2No sq lv2
No sq lv2
 
NOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQLNOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQL
 
NOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdfNOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdf
 
Big data technology unit 3
Big data technology unit 3Big data technology unit 3
Big data technology unit 3
 
Nosql
NosqlNosql
Nosql
 
Nosql
NosqlNosql
Nosql
 
Modern databases and its challenges (SQL ,NoSQL, NewSQL)
Modern databases and its challenges (SQL ,NoSQL, NewSQL)Modern databases and its challenges (SQL ,NoSQL, NewSQL)
Modern databases and its challenges (SQL ,NoSQL, NewSQL)
 
Database
DatabaseDatabase
Database
 
No sql database
No sql databaseNo sql database
No sql database
 
NoSQL Basics and MongDB
NoSQL Basics and  MongDBNoSQL Basics and  MongDB
NoSQL Basics and MongDB
 
Presentation on NoSQL Database related RDBMS
Presentation on NoSQL Database related RDBMSPresentation on NoSQL Database related RDBMS
Presentation on NoSQL Database related RDBMS
 
2.Introduction to NOSQL (Core concepts).pptx
2.Introduction to NOSQL (Core concepts).pptx2.Introduction to NOSQL (Core concepts).pptx
2.Introduction to NOSQL (Core concepts).pptx
 
NoSql Databases
NoSql DatabasesNoSql Databases
NoSql Databases
 
Unit-10.pptx
Unit-10.pptxUnit-10.pptx
Unit-10.pptx
 
nosql.pptx
nosql.pptxnosql.pptx
nosql.pptx
 
No sqlpresentation
No sqlpresentationNo sqlpresentation
No sqlpresentation
 
Oslo bekk2014
Oslo bekk2014Oslo bekk2014
Oslo bekk2014
 
Introduction to mongodb
Introduction to mongodbIntroduction to mongodb
Introduction to mongodb
 
the rising no sql technology
the rising no sql technologythe rising no sql technology
the rising no sql technology
 

Recently uploaded

一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
ukgaet
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
ewymefz
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
StarCompliance.io
 
Introduction-to-Cybersecurit57hhfcbbcxxx
Introduction-to-Cybersecurit57hhfcbbcxxxIntroduction-to-Cybersecurit57hhfcbbcxxx
Introduction-to-Cybersecurit57hhfcbbcxxx
zahraomer517
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
nscud
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
ocavb
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
ewymefz
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage s
MAQIB18
 

Recently uploaded (20)

一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
 
Introduction-to-Cybersecurit57hhfcbbcxxx
Introduction-to-Cybersecurit57hhfcbbcxxxIntroduction-to-Cybersecurit57hhfcbbcxxx
Introduction-to-Cybersecurit57hhfcbbcxxx
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsWebinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage s
 

NoSql evaluation

  • 1. NoSQL Evaluation By: Karthik Kamath G karthikkmth50@gmail.com Bens John bensjohn723@gmail.com
  • 2. Contents O Introduction O Features O RDBMS O Data Models. O Query Possibilities. O Concurrency control. O Partioning O Replication and consistency.
  • 3. Introduction O SQL= Traditional Relational Database. O NoSQL = No traditional Relational Database. O No SQL != Do not use Structured Query Language. O NoSQL = Not only SQL. O Not every data management/ analysis problem is solved using traditional RDBMS. O BIG Data.
  • 4. Features O Convenient O Multi-User O Safe O Persistent O Reliable O Massive O Efficient
  • 5. Relational DB O Adding or removing a feature to a blog is not possible without system unavailability. O Due to their normalized data model and ACID support RDBMS is not useful in Web2.0 domains, because joins and locks influence performance in distributed systems negatively. O These databases are typically based on consistency instead of availability. O Replication techniques are limited .
  • 7. Key-Value Stores O Data is addressed by a unique key. O Values are isolated from each other. O Schema Free – New values can be added O Data/values are opaque to the system. O Example : Voldemart, Redis, Membase.
  • 8. O Pros : Simple Data Model. Scalable. O Cons: Create your own “Foreign Keys”. Poor for complex data.
  • 9. Column Family O Based on BIGTABLE : Google’s distributed storage system for structured data. O Arbitrary no. of key value pairs can be stored within rows. O Relationship to be implemented by App Logic. O Columns can be grouped to form column families O Examples : HBase, Cassandra, Hyper table
  • 10. O Hbase & Hypertext – Open source implementations. O Cassandra – Additional super column. O Multiple versions of the same data are stored in chronological order.
  • 11. O Pros : Scalable. O Cons : Poor for interconnected data.
  • 12. Document Databases O Data Model : collection of documents. document is a key value collection. within a document keys should be unique. JSON: Java Script Object Notation O Example : CouchDB, MongoDB.
  • 13. O Pros: Simple, Powerful Data Model. Scalable. O Cons: Poor for interconnected Data. Query model limits to keys and values.
  • 14. Graph Databases O Data Model : Nodes and Relationships. O Examples : neo4j, Orient DB. O FlockDB –Twitter-One way relationship. O Location Based system, Navigation systems which uses complex relations.
  • 15. O Pros : Powerful Data Model. Easy to query. Friend of a friend Problem is solved.
  • 16. Query Possibilities For Key Value pairs: O Key based put, get and delete operations. O Membase offers REST API. For Document Databases O Document stores offer much richer APIs. O Operations like and,or and between can be used O MongoDB supports additional operations like count and distinct. O Riak offers functionalities to traverse links between documents easily. O UnQL Project
  • 17. For Column Family O Provide range queries and some operations like "in", "and/or" and regular expression. O Even if every column family store offers a SQL like query language in order to provide a more convenient user interaction, only row keys and indexed values can be considered in where- clauses, as well.
  • 18. For graph database O SPARQL is a popular, declarative query language with a very simple syntax providing graph pattern matching. O Gremlin is an imperative programming language used to perform graph traversals based on XPA TH.
  • 19. Concurrency Control O Traditional databases use pessimistic consistency strategies with exclusive access on a dataset. O Multiversion concurrency control (MVCC) relaxes strict consistency in favor of performance. O In order to cope with two or more conflicting write operations, every process stores, additional to the new value, a link to the version the process read before.
  • 20. Partitioning O The first strategy distributes datasets by the range of their keys. O In order to find a certain key, clients have to contact the routing server for getting the partition table. O The second one is by Consistent Hashing. O Neighbored keys are distributed randomly across the cluster. O Graph algorithms can help identifying hotspots of strongly connected nodes in the graph schema.
  • 21. Replication and Consistency O BASE systems. O Availability at the cost of consistency. O Can be inconsistent, but the system must be high available and high performant at all time. O NoSQL systems are not only full ACID or full BASE systems. O Big Data is the only store, which supports full consistency and replication natively.
  • 22. Conclusion O Choose the proper data model. O Queries. O Key value should be used for simple and fast operations. O Column DB for large data. O Graph DB for entities and relationship. O Document DB offers flexible Data model.