SlideShare a Scribd company logo
1 of 18
Download to read offline
Presentation
on
NoSQL
“Towards the end of RDBMS ?”
By: Ramakant Soni
Asst. Professor, Dept. of Computer
Science, BKBIET, Pilani
What is RDBMS
 RDBMS: the relational database
management system.
 Relation: a relation is a 2D table
which has the following features:
 Name
 Attributes
 Tuples
Name
2
Issues with RDBMS- Scalability
 Issues with scaling up when the dataset is
just too big e.g. Big Data.
 Not designed to be distributed.
 Looking at multi-node database solutions.
Known as ‘horizontal scaling’.
 Different approaches include:
 Master-slave
 Sharding
3
Scaling RDBMS
Master-Slave
 All writes are written to the master.
All reads are performed against
the replicated slave databases.
 Critical reads may be incorrect as
writes may not have been
propagated down.
 Large data sets can pose problems
as master needs to duplicate data
to slaves.
Sharding
 Scales well for both reads and
writes.
 Not transparent, application needs
to be partition-aware.
 Can no longer have relationships or
joins across partitions.
 Loss of referential integrity across
shards.
4
What is NoSQL
 Stands for Not Only SQL. Term was redefined by Eric Evans after Carlo
Strozzi.
 Class of non-relational data storage systems.
 Do not require a fixed table schema nor do they use the concept of joins.
 Relaxation for one or more of the ACID properties (Atomicity, Consistency,
Isolation, Durability) using CAP theorem.
5
Need of NoSQL
 Explosion of social media sites (Facebook, Twitter, Google etc.) with large
data needs. (Sharding is a problem)
 Rise of cloud-based solutions such as Amazon S3 (simple storage solution).
 Just as moving to dynamically-typed languages (Ruby/Groovy), a shift to
dynamically-typed data with frequent schema changes.
 Expansion of Open-source community.
 NoSQL solution is more acceptable to a client now than a year ago.
6
NoSQL Types
NoSQL database are classified into four types:
• Key Value pair based
• Column based
• Document based
• Graph based
7
Key Value Pair Based
• Designed for processing dictionary. Dictionaries contain a
collection of records having fields containing data.
• Records are stored and retrieved using a key that uniquely
identifies the record, and is used to quickly find the data
within the database.
Example: CouchDB, Oracle NoSQL Database, Riak etc.
We use it for storing session information, user profiles, preferences,
shopping cart data.
We would avoid it when we need to query data having relationships
between entities.
8
Column based
It store data as Column families containing rows that have
many columns associated with a row key. Each row can have
different columns.
Column families are groups of related data that is accessed
together.
Example: Cassandra, HBase, Hypertable, and Amazon
DynamoDB.
We use it for content management systems, blogging platforms, log aggregation.
We would avoid it for systems that are in early development, changing query patterns.
9
Document Based
The database stores and retrieves documents. It stores documents in
the value part of the key-value store.
Self- describing, hierarchical tree data structures consisting of maps,
collections, and scalar values.
Example: Lotus Notes, MongoDB, Couch DB, Orient DB, Raven DB.
We use it for content management systems, blogging platforms, web analytics, real-time analytics,
e- commerce applications.
We would avoid it for systems that need complex transactions spanning multiple operations or
queries against varying aggregate structures.
10
Graph Based
Store entities and relationships between these entities as nodes
and edges of a graph respectively. Entities have properties.
Traversing the relationships is very fast as relationship between
nodes is not calculated at query time but is actually persisted
as a relationship.
Example: Neo4J, Infinite Graph, OrientDB, FlockDB.
It is well suited for connected data, such as social networks,
spatial data, routing information for goods and supply.
11
CAP Theorem
 According to Eric Brewer a distributed system has 3 properties :
 Consistency
 Availability
 Partitions
 We can have at most two of these three properties for any shared-data system
 To scale out, we have to partition. It leaves a choice between consistency and
availability. ( In almost all cases, we would choose availability over consistency)
 Everyone who builds big applications builds them on CAP : Google, Yahoo,
Facebook, Amazon, eBay, etc.
12
Advantages of NoSQL
 Cheap and easy to implement (open source)
 Data are replicated to multiple nodes (therefore identical and fault-
tolerant) and can be partitioned
 When data is written, the latest version is on at least one node and then
replicated to other nodes
 No single point of failure
 Easy to distribute
 Don't require a schema
13
What is not provided by NoSQL
 Joins
 Group by
 ACID transactions
 SQL
 Integration with applications that are based on SQL
14
Where to use NoSQL
 NoSQL Data storage systems makes sense for applications that process very large
semi-structured data –like Log Analysis, Social Networking Feeds, Time-based
data.
 To improve programmer productivity by using a database that better matches an
application's needs.
 To improve data access performance via some combination of handling larger data
volumes, reducing latency, and improving throughput.
15
Conclusion
 All the choices provided by the rise of NoSQL databases does not mean the demise
of RDBMS databases as Relational databases are a powerful tool.
 We are entering an era of Polyglot persistence, a technique that uses different data
storage technologies to handle varying data storage needs. It can apply across an
enterprise or within an individual application.
16
References
1. “NoSQL Databases: An Overview”. Pramod Sadalage, thoughtworks.com(2014)
2. “Data management in cloud environments: NoSQL and NewSQL data stores”.
Grolinger, K.; Higashino, W. A.; Tiwari, A.; Capretz, M. A. M. (2013). JoCCASA,
Springer.
3. “Making the Shift from Relational to NoSQL”. Couchbase.com(2014).
4. “NoSQL - Death to Relational Databases”. Scofield, Ben (2010).
17
Thank
You
Thank
You

More Related Content

Similar to NOSQL in big data is the not only structure langua.pdf

data base system to new data science lerne
data base system to new data science lernedata base system to new data science lerne
data base system to new data science lerne
tarunprajapati0t
 

Similar to NOSQL in big data is the not only structure langua.pdf (20)

No sq lv2
No sq lv2No sq lv2
No sq lv2
 
NOSQL Databases types and Uses
NOSQL Databases types and UsesNOSQL Databases types and Uses
NOSQL Databases types and Uses
 
Unit-10.pptx
Unit-10.pptxUnit-10.pptx
Unit-10.pptx
 
Non relational databases-no sql
Non relational databases-no sqlNon relational databases-no sql
Non relational databases-no sql
 
Report 1.0.docx
Report 1.0.docxReport 1.0.docx
Report 1.0.docx
 
Know what is NOSQL
Know what is NOSQL Know what is NOSQL
Know what is NOSQL
 
Report 2.0.docx
Report 2.0.docxReport 2.0.docx
Report 2.0.docx
 
Comparative study of no sql document, column store databases and evaluation o...
Comparative study of no sql document, column store databases and evaluation o...Comparative study of no sql document, column store databases and evaluation o...
Comparative study of no sql document, column store databases and evaluation o...
 
Unit 3 MongDB
Unit 3 MongDBUnit 3 MongDB
Unit 3 MongDB
 
NoSQL
NoSQLNoSQL
NoSQL
 
data base system to new data science lerne
data base system to new data science lernedata base system to new data science lerne
data base system to new data science lerne
 
Selecting best NoSQL
Selecting best NoSQL Selecting best NoSQL
Selecting best NoSQL
 
Artigo no sql x relational
Artigo no sql x relationalArtigo no sql x relational
Artigo no sql x relational
 
All About Database v1.1
All About Database  v1.1All About Database  v1.1
All About Database v1.1
 
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
 
EVALUATING CASSANDRA, MONGO DB LIKE NOSQL DATASETS USING HADOOP STREAMING
EVALUATING CASSANDRA, MONGO DB LIKE NOSQL DATASETS USING HADOOP STREAMINGEVALUATING CASSANDRA, MONGO DB LIKE NOSQL DATASETS USING HADOOP STREAMING
EVALUATING CASSANDRA, MONGO DB LIKE NOSQL DATASETS USING HADOOP STREAMING
 
nosql.pptx
nosql.pptxnosql.pptx
nosql.pptx
 
Big Data technology Landscape
Big Data technology LandscapeBig Data technology Landscape
Big Data technology Landscape
 
No sql
No sqlNo sql
No sql
 
NOSQL
NOSQLNOSQL
NOSQL
 

More from ajajkhan16

More from ajajkhan16 (15)

Unit-II DBMS presentation for students.pdf
Unit-II DBMS presentation for students.pdfUnit-II DBMS presentation for students.pdf
Unit-II DBMS presentation for students.pdf
 
data stream processing.and its applications pdf
data stream processing.and its applications pdfdata stream processing.and its applications pdf
data stream processing.and its applications pdf
 
data streammining and its applications.ppt
data streammining and its applications.pptdata streammining and its applications.ppt
data streammining and its applications.ppt
 
Big-Data 5V of big data engineering.pptx
Big-Data 5V of big data engineering.pptxBig-Data 5V of big data engineering.pptx
Big-Data 5V of big data engineering.pptx
 
binarysearchtreeindatastructures-200604055006 (1).pdf
binarysearchtreeindatastructures-200604055006 (1).pdfbinarysearchtreeindatastructures-200604055006 (1).pdf
binarysearchtreeindatastructures-200604055006 (1).pdf
 
Array and its types and it's implemented programming Final.pdf
Array and its types and it's implemented programming Final.pdfArray and its types and it's implemented programming Final.pdf
Array and its types and it's implemented programming Final.pdf
 
sparkbigdataanlyticspoweerpointpptt.pptx
sparkbigdataanlyticspoweerpointpptt.pptxsparkbigdataanlyticspoweerpointpptt.pptx
sparkbigdataanlyticspoweerpointpptt.pptx
 
20-security.ppt
20-security.ppt20-security.ppt
20-security.ppt
 
Linked list.pptx
Linked list.pptxLinked list.pptx
Linked list.pptx
 
key1.pdf
key1.pdfkey1.pdf
key1.pdf
 
08 MK-PPT Advanced Topic 2.ppt
08 MK-PPT Advanced Topic 2.ppt08 MK-PPT Advanced Topic 2.ppt
08 MK-PPT Advanced Topic 2.ppt
 
Unit-I Recursion.pptx
Unit-I Recursion.pptxUnit-I Recursion.pptx
Unit-I Recursion.pptx
 
Unit-I Pointer Data structure.pptx
Unit-I Pointer Data structure.pptxUnit-I Pointer Data structure.pptx
Unit-I Pointer Data structure.pptx
 
Day2.pptx
Day2.pptxDay2.pptx
Day2.pptx
 
Unit-1 DataStructure Intro.pptx
Unit-1 DataStructure Intro.pptxUnit-1 DataStructure Intro.pptx
Unit-1 DataStructure Intro.pptx
 

Recently uploaded

Call Girls in Uttam Nagar (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in  Uttam Nagar (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in  Uttam Nagar (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in Uttam Nagar (delhi) call me [🔝9953056974🔝] escort service 24X7
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 

Recently uploaded (20)

Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
latest AZ-104 Exam Questions and Answers
latest AZ-104 Exam Questions and Answerslatest AZ-104 Exam Questions and Answers
latest AZ-104 Exam Questions and Answers
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
 
OSCM Unit 2_Operations Processes & Systems
OSCM Unit 2_Operations Processes & SystemsOSCM Unit 2_Operations Processes & Systems
OSCM Unit 2_Operations Processes & Systems
 
AIM of Education-Teachers Training-2024.ppt
AIM of Education-Teachers Training-2024.pptAIM of Education-Teachers Training-2024.ppt
AIM of Education-Teachers Training-2024.ppt
 
Call Girls in Uttam Nagar (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in  Uttam Nagar (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in  Uttam Nagar (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in Uttam Nagar (delhi) call me [🔝9953056974🔝] escort service 24X7
 
Details on CBSE Compartment Exam.pptx1111
Details on CBSE Compartment Exam.pptx1111Details on CBSE Compartment Exam.pptx1111
Details on CBSE Compartment Exam.pptx1111
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdfFICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
FICTIONAL SALESMAN/SALESMAN SNSW 2024.pdf
 
dusjagr & nano talk on open tools for agriculture research and learning
dusjagr & nano talk on open tools for agriculture research and learningdusjagr & nano talk on open tools for agriculture research and learning
dusjagr & nano talk on open tools for agriculture research and learning
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
How to Manage Call for Tendor in Odoo 17
How to Manage Call for Tendor in Odoo 17How to Manage Call for Tendor in Odoo 17
How to Manage Call for Tendor in Odoo 17
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 

NOSQL in big data is the not only structure langua.pdf

  • 1. Presentation on NoSQL “Towards the end of RDBMS ?” By: Ramakant Soni Asst. Professor, Dept. of Computer Science, BKBIET, Pilani
  • 2. What is RDBMS  RDBMS: the relational database management system.  Relation: a relation is a 2D table which has the following features:  Name  Attributes  Tuples Name 2
  • 3. Issues with RDBMS- Scalability  Issues with scaling up when the dataset is just too big e.g. Big Data.  Not designed to be distributed.  Looking at multi-node database solutions. Known as ‘horizontal scaling’.  Different approaches include:  Master-slave  Sharding 3
  • 4. Scaling RDBMS Master-Slave  All writes are written to the master. All reads are performed against the replicated slave databases.  Critical reads may be incorrect as writes may not have been propagated down.  Large data sets can pose problems as master needs to duplicate data to slaves. Sharding  Scales well for both reads and writes.  Not transparent, application needs to be partition-aware.  Can no longer have relationships or joins across partitions.  Loss of referential integrity across shards. 4
  • 5. What is NoSQL  Stands for Not Only SQL. Term was redefined by Eric Evans after Carlo Strozzi.  Class of non-relational data storage systems.  Do not require a fixed table schema nor do they use the concept of joins.  Relaxation for one or more of the ACID properties (Atomicity, Consistency, Isolation, Durability) using CAP theorem. 5
  • 6. Need of NoSQL  Explosion of social media sites (Facebook, Twitter, Google etc.) with large data needs. (Sharding is a problem)  Rise of cloud-based solutions such as Amazon S3 (simple storage solution).  Just as moving to dynamically-typed languages (Ruby/Groovy), a shift to dynamically-typed data with frequent schema changes.  Expansion of Open-source community.  NoSQL solution is more acceptable to a client now than a year ago. 6
  • 7. NoSQL Types NoSQL database are classified into four types: • Key Value pair based • Column based • Document based • Graph based 7
  • 8. Key Value Pair Based • Designed for processing dictionary. Dictionaries contain a collection of records having fields containing data. • Records are stored and retrieved using a key that uniquely identifies the record, and is used to quickly find the data within the database. Example: CouchDB, Oracle NoSQL Database, Riak etc. We use it for storing session information, user profiles, preferences, shopping cart data. We would avoid it when we need to query data having relationships between entities. 8
  • 9. Column based It store data as Column families containing rows that have many columns associated with a row key. Each row can have different columns. Column families are groups of related data that is accessed together. Example: Cassandra, HBase, Hypertable, and Amazon DynamoDB. We use it for content management systems, blogging platforms, log aggregation. We would avoid it for systems that are in early development, changing query patterns. 9
  • 10. Document Based The database stores and retrieves documents. It stores documents in the value part of the key-value store. Self- describing, hierarchical tree data structures consisting of maps, collections, and scalar values. Example: Lotus Notes, MongoDB, Couch DB, Orient DB, Raven DB. We use it for content management systems, blogging platforms, web analytics, real-time analytics, e- commerce applications. We would avoid it for systems that need complex transactions spanning multiple operations or queries against varying aggregate structures. 10
  • 11. Graph Based Store entities and relationships between these entities as nodes and edges of a graph respectively. Entities have properties. Traversing the relationships is very fast as relationship between nodes is not calculated at query time but is actually persisted as a relationship. Example: Neo4J, Infinite Graph, OrientDB, FlockDB. It is well suited for connected data, such as social networks, spatial data, routing information for goods and supply. 11
  • 12. CAP Theorem  According to Eric Brewer a distributed system has 3 properties :  Consistency  Availability  Partitions  We can have at most two of these three properties for any shared-data system  To scale out, we have to partition. It leaves a choice between consistency and availability. ( In almost all cases, we would choose availability over consistency)  Everyone who builds big applications builds them on CAP : Google, Yahoo, Facebook, Amazon, eBay, etc. 12
  • 13. Advantages of NoSQL  Cheap and easy to implement (open source)  Data are replicated to multiple nodes (therefore identical and fault- tolerant) and can be partitioned  When data is written, the latest version is on at least one node and then replicated to other nodes  No single point of failure  Easy to distribute  Don't require a schema 13
  • 14. What is not provided by NoSQL  Joins  Group by  ACID transactions  SQL  Integration with applications that are based on SQL 14
  • 15. Where to use NoSQL  NoSQL Data storage systems makes sense for applications that process very large semi-structured data –like Log Analysis, Social Networking Feeds, Time-based data.  To improve programmer productivity by using a database that better matches an application's needs.  To improve data access performance via some combination of handling larger data volumes, reducing latency, and improving throughput. 15
  • 16. Conclusion  All the choices provided by the rise of NoSQL databases does not mean the demise of RDBMS databases as Relational databases are a powerful tool.  We are entering an era of Polyglot persistence, a technique that uses different data storage technologies to handle varying data storage needs. It can apply across an enterprise or within an individual application. 16
  • 17. References 1. “NoSQL Databases: An Overview”. Pramod Sadalage, thoughtworks.com(2014) 2. “Data management in cloud environments: NoSQL and NewSQL data stores”. Grolinger, K.; Higashino, W. A.; Tiwari, A.; Capretz, M. A. M. (2013). JoCCASA, Springer. 3. “Making the Shift from Relational to NoSQL”. Couchbase.com(2014). 4. “NoSQL - Death to Relational Databases”. Scofield, Ben (2010). 17