SlideShare a Scribd company logo
Preparing Your Data for
Cloud



Narinder Kumar
11/11/2010
Agenda

    Relational DBMS's : Pros & Cons

    Non-Relational DBMS's : Pros & Cons

    Types of Non-Relational DBMS's

    Current Market State

    Applicability of Different Data-Bases in
    different environments
                                               2
Relational DBMS - Pros

    Data Integrity

    ACID Capabilities

    High Level Query Model

    Data Normalization

    Data Independence

                                   3
Relational DBMS - Cons

    Scaling Issues
     
       By Duplication (Master-Slave)
     
       By Sharding/Division (Not transparent)

    Fixed Schema

    Mostly disk-oriented (Performance)

    May fair poorly with large data


                                                4
Non-Relational DBMS - Pros

    Scalability

    Replication / Availability

    Performance

    Deployment Flexibility

    Modelling Flexibility

    Faster Development (?)
                                   5
Non-Relational DBMS - Cons

    Lack of Transactional Support

    Data Integrity is Application's responsibility

    Data Duplication / Application Dependent

    Eventually Consistent (mostly)

    No Standardization

    New Technology


                                                     6
RDBMS's and Cloud



                    7
Cloud Capable RDBMS




       s




Almost every RDBMS can run in a IAAS Cloud Platform

                                                      8
Cloud Native RDBMS's




                       9
Types of Non-Relational DBMS

    Key Value Stores


    Document Stores


    Column Stores


    Graph Stores

                                   10
Key Value Data-Bases

    Object is completely Opaque to DB

    Mostly GET, PUT & DELETE operations are
    supported

    There may be limits on size of Objects

        Inspired by Amazon Dynamo Paper


                                              11
Key Value DataBases & Cloud
                Project Voldemort




MemCachedDB       Tokyo Tyrant


                                    12
Document Data-Bases

    Object is not completely opaque to DB

    Every Object has it's own schema
      FirstName="Bob", Address="5 Oak St.", Hobby="sailing".
      FirstName="Jonathan", Children=("Michael,10", "Jennifer,8")

    Can perform queries based on Object's attributes

    Possible to describe relationships between Objects

    Joins and Transactions are not supported

    Good for XML or JSON objects

                                                                    13
Document DataBases & Cloud




                             14
Column-Store Data-Bases

    Richer than Document Stores

    Multi-Dimensional Map
    
        Tables
         
             Row
         
             Column
         
             Time-Stamp

    Supports Multiple Data Types

    Usually use an Underlying DFS
              Inspired by Google Big Table Paper
                                                   15
Column-Store Data-Bases & Cloud




                              16
Key Factors while Making a Choice

        Application Architecture Requirements

        Platform choices

        Non-Functional Requirements
    
          Consistency
    
          Availability
    
          Partition
    
          Security
    
          Data Redemption

                                                17
Different Requirements = Different Solutions




                                           18
Scenario 1

         Feature First
     
           Corporate Data
     
           Consistency Requirements
     
           Business Intelligence
     
           Legacy Application

    RDBMS on Amazon Cloud, RackSpace (IaaS) or
       Microsoft Azure/Amazon RDS (PaaS)
                                                 19
Scenario 2

        Consumer Facing Application
    
          Big Files (Images, BLOB's, Files)
    
          Geographically Distributed
    
          Mostly writes
    
          Not heavy requirement on Rich Queries


Key-Value Data Stores (Amazon S3, Project
             Voldemort, Redis)
                                                  20
Scenario 3

        Hundreds Of Government Documents
        with different schemas
    
         Need to serve on Web
    
         Data Mining

    Document Data-Stores (Amazon SimpleDB,
          Apache Couche DB, MongoDB)

                                             21
Scenario 4

        Scale First
    
          Huge Data-Set
    
          Analytical Requirements
    
          Consumer Facing
    
          High Availability over Consistency

Column Data-Stores (Google App Engine, Hbase,
                   Cassandra)
                                               22
Mix & Match of Earlier Scenarios
               
                   Polyglot Persistence
                    
                        RDBMS for low-
                        volume and high value
                    
                        Key-Value DB for large
                        files with little queries
                    
                        Memcached DB for
                        short-lived Data
                    
                        Column DB for
                        Analytics




                                               23
CAP Theorem




              24
Conclusions

    One Size does not Fit all


    Many choices


    No-SQL DB's providing Alternatives


    RDBMS serve useful purpose

                                         25
nkumar@inphina.com

     www.inphina.com
http://thoughts.inphina.com
References

    http://nosql-database.org/

    http://www.drdobbs.com/database/218900502

    http://perspectives.mvdirona.com/2009/11/03/OneSizeDoesNotFitAll.aspx

    http://blog.nahurst.com/visual-guide-to-nosql-systems

    http://blog.heroku.com/archives/2010/7/20/nosql/

    http://www.vineetgupta.com/2010/01/nosql-databases-part-1-landscape.html

    http://project-voldemort.com/

    http://code.google.com/p/redis/

    http://memcachedb.org/

    http://aws.amazon.com/simpledb/

    http://couchdb.apache.org/

    http://www.mongodb.org

    http://riak.basho.com

More Related Content

What's hot

What's hot (20)

SQLBits X SQL Server 2012 Rich Unstructured Data
SQLBits X SQL Server 2012 Rich Unstructured DataSQLBits X SQL Server 2012 Rich Unstructured Data
SQLBits X SQL Server 2012 Rich Unstructured Data
 
SQL or NoSQL, that is the question!
SQL or NoSQL, that is the question!SQL or NoSQL, that is the question!
SQL or NoSQL, that is the question!
 
[db tech showcase Tokyo 2017] C34: Replacing Oracle Database at DBS Bank ~Ora...
[db tech showcase Tokyo 2017] C34: Replacing Oracle Database at DBS Bank ~Ora...[db tech showcase Tokyo 2017] C34: Replacing Oracle Database at DBS Bank ~Ora...
[db tech showcase Tokyo 2017] C34: Replacing Oracle Database at DBS Bank ~Ora...
 
Databases in the Cloud
Databases in the CloudDatabases in the Cloud
Databases in the Cloud
 
TCO Comparison MongoDB & Oracle
TCO Comparison MongoDB & OracleTCO Comparison MongoDB & Oracle
TCO Comparison MongoDB & Oracle
 
Nonrelational Databases
Nonrelational DatabasesNonrelational Databases
Nonrelational Databases
 
MongoDB
MongoDBMongoDB
MongoDB
 
Building Big Data Applications
Building Big Data ApplicationsBuilding Big Data Applications
Building Big Data Applications
 
In memory big data management and processing a survey
In memory big data management and processing a surveyIn memory big data management and processing a survey
In memory big data management and processing a survey
 
A unified data modeler in the world of big data
A unified data modeler in the world of big dataA unified data modeler in the world of big data
A unified data modeler in the world of big data
 
Using hadoop to expand data warehousing
Using hadoop to expand data warehousingUsing hadoop to expand data warehousing
Using hadoop to expand data warehousing
 
MongoDB: The Operational Big Data by NORBERTO LEITE at Big Data Spain 2014
 MongoDB: The Operational Big Data by NORBERTO LEITE at Big Data Spain 2014 MongoDB: The Operational Big Data by NORBERTO LEITE at Big Data Spain 2014
MongoDB: The Operational Big Data by NORBERTO LEITE at Big Data Spain 2014
 
Big data vahidamiri-datastack.ir
Big data vahidamiri-datastack.irBig data vahidamiri-datastack.ir
Big data vahidamiri-datastack.ir
 
Emergent Distributed Data Storage
Emergent Distributed Data StorageEmergent Distributed Data Storage
Emergent Distributed Data Storage
 
Hdfs Dhruba
Hdfs DhrubaHdfs Dhruba
Hdfs Dhruba
 
NoSQL databases and managing big data
NoSQL databases and managing big dataNoSQL databases and managing big data
NoSQL databases and managing big data
 
No sql
No sqlNo sql
No sql
 
Non relational databases-no sql
Non relational databases-no sqlNon relational databases-no sql
Non relational databases-no sql
 
SQLBits X SQL Server 2012 Beyond Relational
SQLBits X SQL Server 2012 Beyond RelationalSQLBits X SQL Server 2012 Beyond Relational
SQLBits X SQL Server 2012 Beyond Relational
 
Introduction to Big Data Analytics on Apache Hadoop
Introduction to Big Data Analytics on Apache HadoopIntroduction to Big Data Analytics on Apache Hadoop
Introduction to Big Data Analytics on Apache Hadoop
 

Viewers also liked

Google appenginemigrationcasestudy
Google appenginemigrationcasestudyGoogle appenginemigrationcasestudy
Google appenginemigrationcasestudy
Inphina Technologies
 
Urbanising India and health issues
Urbanising India and health issuesUrbanising India and health issues
Urbanising India and health issues
AmitSamarth
 
22nd june Run Against Tobacco project report
22nd june Run Against Tobacco project report 22nd june Run Against Tobacco project report
22nd june Run Against Tobacco project report
AmitSamarth
 

Viewers also liked (8)

Google appenginemigrationcasestudy
Google appenginemigrationcasestudyGoogle appenginemigrationcasestudy
Google appenginemigrationcasestudy
 
Testing your application on Google App Engine
Testing your application on Google App EngineTesting your application on Google App Engine
Testing your application on Google App Engine
 
God Bless America Presentation
God Bless America PresentationGod Bless America Presentation
God Bless America Presentation
 
Inphina cloud
Inphina cloudInphina cloud
Inphina cloud
 
Inphina at a glance
Inphina at a glanceInphina at a glance
Inphina at a glance
 
Urbanising India and health issues
Urbanising India and health issuesUrbanising India and health issues
Urbanising India and health issues
 
Urban Planning to address Non-Communicable diseases
Urban Planning to address Non-Communicable diseasesUrban Planning to address Non-Communicable diseases
Urban Planning to address Non-Communicable diseases
 
22nd june Run Against Tobacco project report
22nd june Run Against Tobacco project report 22nd june Run Against Tobacco project report
22nd june Run Against Tobacco project report
 

Similar to Preparing yourdataforcloud

DynamoDB Gluecon 2012
DynamoDB Gluecon 2012DynamoDB Gluecon 2012
DynamoDB Gluecon 2012
Appirio
 
NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...
NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...
NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...
Felix Gessert
 
NoSQL Options Compared
NoSQL Options ComparedNoSQL Options Compared
NoSQL Options Compared
Sergey Bushik
 
The World of Structured Storage System
The World of Structured Storage SystemThe World of Structured Storage System
The World of Structured Storage System
Schubert Zhang
 
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
Qian Lin
 
NO SQL: What, Why, How
NO SQL: What, Why, HowNO SQL: What, Why, How
NO SQL: What, Why, How
Igor Moochnick
 

Similar to Preparing yourdataforcloud (20)

Preparing your data for the cloud
Preparing your data for the cloudPreparing your data for the cloud
Preparing your data for the cloud
 
SQL? NoSQL? NewSQL?!? What’s a Java developer to do? - JDC2012 Cairo, Egypt
SQL? NoSQL? NewSQL?!? What’s a Java developer to do? - JDC2012 Cairo, EgyptSQL? NoSQL? NewSQL?!? What’s a Java developer to do? - JDC2012 Cairo, Egypt
SQL? NoSQL? NewSQL?!? What’s a Java developer to do? - JDC2012 Cairo, Egypt
 
Presentation on Databases in the Cloud
Presentation on Databases in the CloudPresentation on Databases in the Cloud
Presentation on Databases in the Cloud
 
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
 
NOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQLNOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQL
 
NOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdfNOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdf
 
DynamoDB Gluecon 2012
DynamoDB Gluecon 2012DynamoDB Gluecon 2012
DynamoDB Gluecon 2012
 
Gluecon 2012 - DynamoDB
Gluecon 2012 - DynamoDBGluecon 2012 - DynamoDB
Gluecon 2012 - DynamoDB
 
NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...
NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...
NoSQL Data Stores in Research and Practice - ICDE 2016 Tutorial - Extended Ve...
 
NoSQL Options Compared
NoSQL Options ComparedNoSQL Options Compared
NoSQL Options Compared
 
The World of Structured Storage System
The World of Structured Storage SystemThe World of Structured Storage System
The World of Structured Storage System
 
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
 
Rdbms vs. no sql
Rdbms vs. no sqlRdbms vs. no sql
Rdbms vs. no sql
 
Nosql Presentation.pdf for DBMS understanding
Nosql Presentation.pdf for DBMS understandingNosql Presentation.pdf for DBMS understanding
Nosql Presentation.pdf for DBMS understanding
 
عصر کلان داده، چرا و چگونه؟
عصر کلان داده، چرا و چگونه؟عصر کلان داده، چرا و چگونه؟
عصر کلان داده، چرا و چگونه؟
 
Sql no sql
Sql no sqlSql no sql
Sql no sql
 
Big Data Basic Concepts | Presented in 2014
Big Data Basic Concepts  | Presented in 2014Big Data Basic Concepts  | Presented in 2014
Big Data Basic Concepts | Presented in 2014
 
Information processing architectures
Information processing architecturesInformation processing architectures
Information processing architectures
 
Nosql seminar
Nosql seminarNosql seminar
Nosql seminar
 
NO SQL: What, Why, How
NO SQL: What, Why, HowNO SQL: What, Why, How
NO SQL: What, Why, How
 

More from Inphina Technologies

More from Inphina Technologies (8)

Scala collections
Scala collectionsScala collections
Scala collections
 
Scala test
Scala testScala test
Scala test
 
Easy ORMness with Objectify-Appengine
Easy ORMness with Objectify-AppengineEasy ORMness with Objectify-Appengine
Easy ORMness with Objectify-Appengine
 
Cloud Foundry Impressions
Cloud Foundry Impressions Cloud Foundry Impressions
Cloud Foundry Impressions
 
Cloud slam2011 multi-tenancy
Cloud slam2011 multi-tenancyCloud slam2011 multi-tenancy
Cloud slam2011 multi-tenancy
 
Multi-Tenancy in the Cloud
Multi-Tenancy in the CloudMulti-Tenancy in the Cloud
Multi-Tenancy in the Cloud
 
Multi-tenancy in the cloud
Multi-tenancy in the cloudMulti-tenancy in the cloud
Multi-tenancy in the cloud
 
Getting started with jClouds
Getting started with jCloudsGetting started with jClouds
Getting started with jClouds
 

Recently uploaded

Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Peter Udo Diehl
 

Recently uploaded (20)

Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
НАДІЯ ФЕДЮШКО БАЦ «Професійне зростання QA спеціаліста»
НАДІЯ ФЕДЮШКО БАЦ  «Професійне зростання QA спеціаліста»НАДІЯ ФЕДЮШКО БАЦ  «Професійне зростання QA спеціаліста»
НАДІЯ ФЕДЮШКО БАЦ «Професійне зростання QA спеціаліста»
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxUnpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2
 
In-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT ProfessionalsIn-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT Professionals
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
 
Speed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in MinutesSpeed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in Minutes
 

Preparing yourdataforcloud

  • 1. Preparing Your Data for Cloud Narinder Kumar 11/11/2010
  • 2. Agenda  Relational DBMS's : Pros & Cons  Non-Relational DBMS's : Pros & Cons  Types of Non-Relational DBMS's  Current Market State  Applicability of Different Data-Bases in different environments 2
  • 3. Relational DBMS - Pros  Data Integrity  ACID Capabilities  High Level Query Model  Data Normalization  Data Independence 3
  • 4. Relational DBMS - Cons  Scaling Issues  By Duplication (Master-Slave)  By Sharding/Division (Not transparent)  Fixed Schema  Mostly disk-oriented (Performance)  May fair poorly with large data 4
  • 5. Non-Relational DBMS - Pros  Scalability  Replication / Availability  Performance  Deployment Flexibility  Modelling Flexibility  Faster Development (?) 5
  • 6. Non-Relational DBMS - Cons  Lack of Transactional Support  Data Integrity is Application's responsibility  Data Duplication / Application Dependent  Eventually Consistent (mostly)  No Standardization  New Technology 6
  • 8. Cloud Capable RDBMS s Almost every RDBMS can run in a IAAS Cloud Platform 8
  • 10. Types of Non-Relational DBMS  Key Value Stores  Document Stores  Column Stores  Graph Stores 10
  • 11. Key Value Data-Bases  Object is completely Opaque to DB  Mostly GET, PUT & DELETE operations are supported  There may be limits on size of Objects Inspired by Amazon Dynamo Paper 11
  • 12. Key Value DataBases & Cloud Project Voldemort MemCachedDB Tokyo Tyrant 12
  • 13. Document Data-Bases  Object is not completely opaque to DB  Every Object has it's own schema FirstName="Bob", Address="5 Oak St.", Hobby="sailing". FirstName="Jonathan", Children=("Michael,10", "Jennifer,8")  Can perform queries based on Object's attributes  Possible to describe relationships between Objects  Joins and Transactions are not supported  Good for XML or JSON objects 13
  • 15. Column-Store Data-Bases  Richer than Document Stores  Multi-Dimensional Map  Tables  Row  Column  Time-Stamp  Supports Multiple Data Types  Usually use an Underlying DFS Inspired by Google Big Table Paper 15
  • 17. Key Factors while Making a Choice  Application Architecture Requirements  Platform choices  Non-Functional Requirements  Consistency  Availability  Partition  Security  Data Redemption 17
  • 18. Different Requirements = Different Solutions 18
  • 19. Scenario 1  Feature First  Corporate Data  Consistency Requirements  Business Intelligence  Legacy Application RDBMS on Amazon Cloud, RackSpace (IaaS) or Microsoft Azure/Amazon RDS (PaaS) 19
  • 20. Scenario 2  Consumer Facing Application  Big Files (Images, BLOB's, Files)  Geographically Distributed  Mostly writes  Not heavy requirement on Rich Queries Key-Value Data Stores (Amazon S3, Project Voldemort, Redis) 20
  • 21. Scenario 3  Hundreds Of Government Documents with different schemas  Need to serve on Web  Data Mining Document Data-Stores (Amazon SimpleDB, Apache Couche DB, MongoDB) 21
  • 22. Scenario 4  Scale First  Huge Data-Set  Analytical Requirements  Consumer Facing  High Availability over Consistency Column Data-Stores (Google App Engine, Hbase, Cassandra) 22
  • 23. Mix & Match of Earlier Scenarios  Polyglot Persistence  RDBMS for low- volume and high value  Key-Value DB for large files with little queries  Memcached DB for short-lived Data  Column DB for Analytics 23
  • 25. Conclusions  One Size does not Fit all  Many choices  No-SQL DB's providing Alternatives  RDBMS serve useful purpose 25
  • 26. nkumar@inphina.com www.inphina.com http://thoughts.inphina.com
  • 27. References  http://nosql-database.org/  http://www.drdobbs.com/database/218900502  http://perspectives.mvdirona.com/2009/11/03/OneSizeDoesNotFitAll.aspx  http://blog.nahurst.com/visual-guide-to-nosql-systems  http://blog.heroku.com/archives/2010/7/20/nosql/  http://www.vineetgupta.com/2010/01/nosql-databases-part-1-landscape.html  http://project-voldemort.com/  http://code.google.com/p/redis/  http://memcachedb.org/  http://aws.amazon.com/simpledb/  http://couchdb.apache.org/  http://www.mongodb.org  http://riak.basho.com