SlideShare a Scribd company logo
1 of 38
RDBMS ?



          Prateek Jain
          12-Jul-2012
Solutions…
Common Architecture

Web server          Web server         Web server


       App server                App server


                    Cache server




                      RDBMS


 CMS                                          Data Feeds
SQL - Story till now…

 Stable environment.
 No more discussions on Data stores.
 Easy to train and employ people.
 SQL running effectively at core.
SQL - Story till now…

 For dealing with lists (as tables) it’s a great
 language,dynamic and relatively fast
  • Sure it has a few problems but give me a language that
 doesn’t
What Next…?



     We need to fast, scale
     and be part of web
ORM - OMG!

   The effort of trying to convert something inherently
    hierarchical into something relational
   Probably the biggest waste of programming time,
    lines of code and source of bugs and latency is ORM
Challenges

 Data grows exponentially.
 Data is unstructured.
 Data is huge and spread across 100’s/1000’s
  of nodes.
 SQL is useful - when things are flat
Lots of data

       In the banking world we have a lot of data
       Today 50-100,000 quotes a second isn’t
        unusual
       It gets more complex...
    •     10,000 portfolios, each with 1,000 buy/sell orders at specific
          prices
    •     We now have 100,000 prices coming in every second and 10
          million orders to watch
Time is critical

 Inthe world of trading only the first one gets
  the deal, there is no second place.
 While being first to have the order is what
  makes the money banks now have a “new”
  problem
       “RISK”
Lots of data, lots of calculations




        There are two main flavors of distributed computing
     •      Data
     •      Computation


        Often they are closely related but not always.
        To achieve either we usually need lots of memory and CPUs
        We don’t stack them or put them in clusters these days, we
         distribute them
Why not RDBMS?

 Not designed to scale out.
 Strongly ACID complaint.
 Slower running queries (specially in joins).
 Schema based.
 Not suited for changing data structure.
CAP Theorem

C  – consistency
 A – availability
 P – partition tolerance



** You must make trade-offs and sacrifice at least one in favor of
   the other two.
NoSql




  Not Just Sql
Categories
 Document Based
                                      Graph Based

                  Column Based



    Key/Value Based              Data Structure Based
Example Products
Eventual Consistency
Eventual Consistency

 Given   a sufficiently long period of time, over
  which no updates are sent, one can expect
  that all updates will, eventually, propagate
  through the system and all the replicas will
  be consistent.
 In the presence of continuing updates, an
  accepted update eventually either reaches a
  replica or the replica retires from service.
Scalability
Scalability

 Scalability is the ability of a system to
  increase throughput with addition of
  resources to address load increases.
 Scalability can be achieved by:
   –   Provisioning a large and powerful resource to meet the additional
       demands.
   –   It can be achieved by relying on a cluster of ordinary machines to
       work as a unit.
How to choose ?

 Scalability
 Transactionalintegrity and consistency
 Data modeling
 Query support
 Access and interface availability
Scalability

 column-family-centric  NoSQL databases are
  a good choice if extreme scalability is a
  requirement.
 Not well suited for real-time transaction
  processing. (RDBMS is best)
 Eventually consistent NoSQL options, like
  Cassandra or Riak, may be workable.
Transactional Integrity and Consistency


 Batch-centric    analytics on warehoused data
  is also not subject to transactional
  requirements.
 Data sets that are written once for e.g., web
  traffic log files, social networking status
  updates, advt. click-through imprints, road-
  traffic data, stock market tick data, game
  scores etc.
Transactional Integrity and Consistency


 If range operations are common and integrity
  of updates is required, an RDBMS is the best
  choice.
 If atomicity at an individual item level is
  sufficient, then column-family databases,
  document databases.
Data Modeling

 RDBMS    offers a consistent way of modeling
  data. Relational algebra underlies the data
  model.
 In the NoSQL world there is no such
  standardized and well-defined data model.
Data Modeling

 Ifrelaxed schema is your primary reason for
  using NoSQL, then MongoDB is a great
  option for getting started with NoSQL.
 MongoDB is used by many web-centric
  businesses.
Querying Support

 An RDBMS thrives on SQL support, which
  makes accessing and querying data easy.
 Among document databases, MongoDB
  provides the best querying capabilities.
 For key/value pairs and in-memory stores,
  nothing is more feature-rich than Redis as far
  as querying capabilities go.
Querying Support

 Column-family    stores like HBase have little to
  offer as far as rich querying capabilities go.
 Project called Hive makes it possible to
  query HBase using SQL-like syntax and
  semantics.
Access and Interface Availability

 MongoDB    has the notion of drivers.
 CouchDB always has the RESTful HTTP
  interface available.
 Redis, Membase, Riak, HBase, Hypertable,
  Cassandra, and Voldemort have support for
  language bindings to connect from most
  mainstream languages.
Performance
50/50 Read and Update

 Resultsshowthat under this test case
 Apache Cassandra outperforms the
 competition on both read and update
 latencies.
 HBase comes close but stays behind
 Cassandra.
95/5 Read and Update

 The sorted ordered column-family stores
  perform best for contiguous range reads.
 HBase seems to deliver consistent
  performance for reads, irrespective of the
  number of operations per second.
 MySQL delivers the best performance for
  read-only cases.
Future?




   Coexistence
Future

 Getting  ready for polyglot persistence.
 Understanding the database technologies
  suitable for immutable data sets.
 Choosing the right database to facilitate ease
  of application development.
Examples

   Linked In uses Hadoop for many large-scale
    analytics jobs like probabilistically predicting people
    you may know.
   Facebook (mysql + HBase, cassandra, ZooKeeper)
   Twitter (mysql + Cassandra + FlockDB)
Questions?
Feedback




    trainer.prateek@gmail.com

More Related Content

What's hot

Big iron 2 (published)
Big iron 2 (published)Big iron 2 (published)
Big iron 2 (published)Ben Stopford
 
Sql vs NO-SQL database differences explained
Sql vs NO-SQL database differences explainedSql vs NO-SQL database differences explained
Sql vs NO-SQL database differences explainedSatya Pal
 
Massive parallel processing database systems mpp
Massive parallel processing database systems mppMassive parallel processing database systems mpp
Massive parallel processing database systems mppDiana Patricia Rey Cabra
 
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...DataStax
 
SQL, NoSQL, BigData in Data Architecture
SQL, NoSQL, BigData in Data ArchitectureSQL, NoSQL, BigData in Data Architecture
SQL, NoSQL, BigData in Data ArchitectureVenu Anuganti
 
Don't Let Your Shoppers Drop; 5 Rules for Today’s eCommerce
Don't Let Your Shoppers Drop; 5 Rules for Today’s eCommerceDon't Let Your Shoppers Drop; 5 Rules for Today’s eCommerce
Don't Let Your Shoppers Drop; 5 Rules for Today’s eCommerceDataStax
 
Prepare Your Data For The Cloud
Prepare Your Data For The CloudPrepare Your Data For The Cloud
Prepare Your Data For The CloudIndicThreads
 
Welcome | MariaDB today and our vision for the future
Welcome | MariaDB today and our vision for the futureWelcome | MariaDB today and our vision for the future
Welcome | MariaDB today and our vision for the futureMariaDB plc
 
http://www.hfadeel.com/Blog/?p=151
http://www.hfadeel.com/Blog/?p=151http://www.hfadeel.com/Blog/?p=151
http://www.hfadeel.com/Blog/?p=151xlight
 
In memory big data management and processing a survey
In memory big data management and processing a surveyIn memory big data management and processing a survey
In memory big data management and processing a surveyredpel dot com
 
Webinar: Dyn + DataStax - helping companies deliver exceptional end-user expe...
Webinar: Dyn + DataStax - helping companies deliver exceptional end-user expe...Webinar: Dyn + DataStax - helping companies deliver exceptional end-user expe...
Webinar: Dyn + DataStax - helping companies deliver exceptional end-user expe...DataStax
 
Azure Data services
Azure Data servicesAzure Data services
Azure Data servicesRajesh Kolla
 
Queues, Pools and Caches - Paper
Queues, Pools and Caches - PaperQueues, Pools and Caches - Paper
Queues, Pools and Caches - PaperGwen (Chen) Shapira
 
Hadoop vs. RDBMS for Advanced Analytics
Hadoop vs. RDBMS for Advanced AnalyticsHadoop vs. RDBMS for Advanced Analytics
Hadoop vs. RDBMS for Advanced Analyticsjoshwills
 

What's hot (20)

Big iron 2 (published)
Big iron 2 (published)Big iron 2 (published)
Big iron 2 (published)
 
Sql vs NO-SQL database differences explained
Sql vs NO-SQL database differences explainedSql vs NO-SQL database differences explained
Sql vs NO-SQL database differences explained
 
Massive parallel processing database systems mpp
Massive parallel processing database systems mppMassive parallel processing database systems mpp
Massive parallel processing database systems mpp
 
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
 
SQL, NoSQL, BigData in Data Architecture
SQL, NoSQL, BigData in Data ArchitectureSQL, NoSQL, BigData in Data Architecture
SQL, NoSQL, BigData in Data Architecture
 
Big data rmoug
Big data rmougBig data rmoug
Big data rmoug
 
Rdbms vs. no sql
Rdbms vs. no sqlRdbms vs. no sql
Rdbms vs. no sql
 
NoSQL
NoSQLNoSQL
NoSQL
 
Don't Let Your Shoppers Drop; 5 Rules for Today’s eCommerce
Don't Let Your Shoppers Drop; 5 Rules for Today’s eCommerceDon't Let Your Shoppers Drop; 5 Rules for Today’s eCommerce
Don't Let Your Shoppers Drop; 5 Rules for Today’s eCommerce
 
No sql
No sqlNo sql
No sql
 
Prepare Your Data For The Cloud
Prepare Your Data For The CloudPrepare Your Data For The Cloud
Prepare Your Data For The Cloud
 
Welcome | MariaDB today and our vision for the future
Welcome | MariaDB today and our vision for the futureWelcome | MariaDB today and our vision for the future
Welcome | MariaDB today and our vision for the future
 
http://www.hfadeel.com/Blog/?p=151
http://www.hfadeel.com/Blog/?p=151http://www.hfadeel.com/Blog/?p=151
http://www.hfadeel.com/Blog/?p=151
 
In memory big data management and processing a survey
In memory big data management and processing a surveyIn memory big data management and processing a survey
In memory big data management and processing a survey
 
Webinar: Dyn + DataStax - helping companies deliver exceptional end-user expe...
Webinar: Dyn + DataStax - helping companies deliver exceptional end-user expe...Webinar: Dyn + DataStax - helping companies deliver exceptional end-user expe...
Webinar: Dyn + DataStax - helping companies deliver exceptional end-user expe...
 
Azure Data services
Azure Data servicesAzure Data services
Azure Data services
 
Queues, Pools and Caches - Paper
Queues, Pools and Caches - PaperQueues, Pools and Caches - Paper
Queues, Pools and Caches - Paper
 
Hadoop vs. RDBMS for Advanced Analytics
Hadoop vs. RDBMS for Advanced AnalyticsHadoop vs. RDBMS for Advanced Analytics
Hadoop vs. RDBMS for Advanced Analytics
 
Polyglot Persistence
Polyglot Persistence Polyglot Persistence
Polyglot Persistence
 
Preparing yourdataforcloud
Preparing yourdataforcloudPreparing yourdataforcloud
Preparing yourdataforcloud
 

Viewers also liked

140608 eng got fruit series (part 4 - faithfulness) by kelvin lim
140608 eng  got fruit series (part 4 - faithfulness) by kelvin lim140608 eng  got fruit series (part 4 - faithfulness) by kelvin lim
140608 eng got fruit series (part 4 - faithfulness) by kelvin limeaglepointcf
 
Loyalty with the self
Loyalty with the selfLoyalty with the self
Loyalty with the selfPrateek Jain
 
God's faithfulness
God's faithfulnessGod's faithfulness
God's faithfulnessElmer05
 
Hs 08 Faithfulness
Hs 08 FaithfulnessHs 08 Faithfulness
Hs 08 Faithfulnesschucho1943
 
The Fruit of the Spirit is Faithfulness
The Fruit of the Spirit is FaithfulnessThe Fruit of the Spirit is Faithfulness
The Fruit of the Spirit is FaithfulnessDave Stewart
 

Viewers also liked (7)

140608 eng got fruit series (part 4 - faithfulness) by kelvin lim
140608 eng  got fruit series (part 4 - faithfulness) by kelvin lim140608 eng  got fruit series (part 4 - faithfulness) by kelvin lim
140608 eng got fruit series (part 4 - faithfulness) by kelvin lim
 
07 faithfulness
07 faithfulness07 faithfulness
07 faithfulness
 
Loyalty with the self
Loyalty with the selfLoyalty with the self
Loyalty with the self
 
God's faithfulness
God's faithfulnessGod's faithfulness
God's faithfulness
 
Hs 08 Faithfulness
Hs 08 FaithfulnessHs 08 Faithfulness
Hs 08 Faithfulness
 
Developing Faithfulness
Developing FaithfulnessDeveloping Faithfulness
Developing Faithfulness
 
The Fruit of the Spirit is Faithfulness
The Fruit of the Spirit is FaithfulnessThe Fruit of the Spirit is Faithfulness
The Fruit of the Spirit is Faithfulness
 

Similar to No sql

If NoSQL is your answer, you are probably asking the wrong question.
If NoSQL is your answer, you are probably asking the wrong question.If NoSQL is your answer, you are probably asking the wrong question.
If NoSQL is your answer, you are probably asking the wrong question.Lukas Smith
 
Navigating NoSQL in cloudy skies
Navigating NoSQL in cloudy skiesNavigating NoSQL in cloudy skies
Navigating NoSQL in cloudy skiesshnkr_rmchndrn
 
NoSQLDatabases
NoSQLDatabasesNoSQLDatabases
NoSQLDatabasesAdi Challa
 
Relational databases vs Non-relational databases
Relational databases vs Non-relational databasesRelational databases vs Non-relational databases
Relational databases vs Non-relational databasesJames Serra
 
Database revolution opening webcast 01 18-12
Database revolution opening webcast 01 18-12Database revolution opening webcast 01 18-12
Database revolution opening webcast 01 18-12mark madsen
 
Database Revolution - Exploratory Webcast
Database Revolution - Exploratory WebcastDatabase Revolution - Exploratory Webcast
Database Revolution - Exploratory WebcastInside Analysis
 
NO SQL: What, Why, How
NO SQL: What, Why, HowNO SQL: What, Why, How
NO SQL: What, Why, HowIgor Moochnick
 
MinneBar 2013 - Scaling with Cassandra
MinneBar 2013 - Scaling with CassandraMinneBar 2013 - Scaling with Cassandra
MinneBar 2013 - Scaling with CassandraJeff Smoley
 
Webcast Q&A- Big Data Architectures Beyond Hadoop
Webcast Q&A- Big Data Architectures Beyond HadoopWebcast Q&A- Big Data Architectures Beyond Hadoop
Webcast Q&A- Big Data Architectures Beyond HadoopImpetus Technologies
 
NoSQL A brief look at Apache Cassandra Distributed Database
NoSQL A brief look at Apache Cassandra Distributed DatabaseNoSQL A brief look at Apache Cassandra Distributed Database
NoSQL A brief look at Apache Cassandra Distributed DatabaseJoe Alex
 
NoSQL Options Compared
NoSQL Options ComparedNoSQL Options Compared
NoSQL Options ComparedSergey Bushik
 
NOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQLNOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQLRamakant Soni
 
NOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdfNOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdfajajkhan16
 
Cache and consistency in nosql
Cache and consistency in nosqlCache and consistency in nosql
Cache and consistency in nosqlJoão Gabriel Lima
 

Similar to No sql (20)

If NoSQL is your answer, you are probably asking the wrong question.
If NoSQL is your answer, you are probably asking the wrong question.If NoSQL is your answer, you are probably asking the wrong question.
If NoSQL is your answer, you are probably asking the wrong question.
 
Navigating NoSQL in cloudy skies
Navigating NoSQL in cloudy skiesNavigating NoSQL in cloudy skies
Navigating NoSQL in cloudy skies
 
NoSQLDatabases
NoSQLDatabasesNoSQLDatabases
NoSQLDatabases
 
Relational databases vs Non-relational databases
Relational databases vs Non-relational databasesRelational databases vs Non-relational databases
Relational databases vs Non-relational databases
 
Nosql seminar
Nosql seminarNosql seminar
Nosql seminar
 
Database revolution opening webcast 01 18-12
Database revolution opening webcast 01 18-12Database revolution opening webcast 01 18-12
Database revolution opening webcast 01 18-12
 
Database Revolution - Exploratory Webcast
Database Revolution - Exploratory WebcastDatabase Revolution - Exploratory Webcast
Database Revolution - Exploratory Webcast
 
NO SQL: What, Why, How
NO SQL: What, Why, HowNO SQL: What, Why, How
NO SQL: What, Why, How
 
MinneBar 2013 - Scaling with Cassandra
MinneBar 2013 - Scaling with CassandraMinneBar 2013 - Scaling with Cassandra
MinneBar 2013 - Scaling with Cassandra
 
Webcast Q&A- Big Data Architectures Beyond Hadoop
Webcast Q&A- Big Data Architectures Beyond HadoopWebcast Q&A- Big Data Architectures Beyond Hadoop
Webcast Q&A- Big Data Architectures Beyond Hadoop
 
Introduction to NoSQL
Introduction to NoSQLIntroduction to NoSQL
Introduction to NoSQL
 
NoSQL A brief look at Apache Cassandra Distributed Database
NoSQL A brief look at Apache Cassandra Distributed DatabaseNoSQL A brief look at Apache Cassandra Distributed Database
NoSQL A brief look at Apache Cassandra Distributed Database
 
NoSQL
NoSQLNoSQL
NoSQL
 
NoSQL Options Compared
NoSQL Options ComparedNoSQL Options Compared
NoSQL Options Compared
 
NOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQLNOSQL- Presentation on NoSQL
NOSQL- Presentation on NoSQL
 
NOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdfNOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdf
 
No sql
No sqlNo sql
No sql
 
Report 2.0.docx
Report 2.0.docxReport 2.0.docx
Report 2.0.docx
 
NOSQL
NOSQLNOSQL
NOSQL
 
Cache and consistency in nosql
Cache and consistency in nosqlCache and consistency in nosql
Cache and consistency in nosql
 

Recently uploaded

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 

Recently uploaded (20)

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 

No sql

  • 1. RDBMS ? Prateek Jain 12-Jul-2012
  • 3. Common Architecture Web server Web server Web server App server App server Cache server RDBMS CMS Data Feeds
  • 4. SQL - Story till now…  Stable environment.  No more discussions on Data stores.  Easy to train and employ people.  SQL running effectively at core.
  • 5. SQL - Story till now…  For dealing with lists (as tables) it’s a great language,dynamic and relatively fast • Sure it has a few problems but give me a language that doesn’t
  • 6. What Next…? We need to fast, scale and be part of web
  • 7. ORM - OMG!  The effort of trying to convert something inherently hierarchical into something relational  Probably the biggest waste of programming time, lines of code and source of bugs and latency is ORM
  • 8. Challenges  Data grows exponentially.  Data is unstructured.  Data is huge and spread across 100’s/1000’s of nodes.  SQL is useful - when things are flat
  • 9. Lots of data  In the banking world we have a lot of data  Today 50-100,000 quotes a second isn’t unusual  It gets more complex... • 10,000 portfolios, each with 1,000 buy/sell orders at specific prices • We now have 100,000 prices coming in every second and 10 million orders to watch
  • 10. Time is critical  Inthe world of trading only the first one gets the deal, there is no second place.  While being first to have the order is what makes the money banks now have a “new” problem “RISK”
  • 11. Lots of data, lots of calculations  There are two main flavors of distributed computing • Data • Computation  Often they are closely related but not always.  To achieve either we usually need lots of memory and CPUs  We don’t stack them or put them in clusters these days, we distribute them
  • 12. Why not RDBMS?  Not designed to scale out.  Strongly ACID complaint.  Slower running queries (specially in joins).  Schema based.  Not suited for changing data structure.
  • 13.
  • 14. CAP Theorem C – consistency  A – availability  P – partition tolerance ** You must make trade-offs and sacrifice at least one in favor of the other two.
  • 15. NoSql Not Just Sql
  • 16. Categories Document Based Graph Based Column Based Key/Value Based Data Structure Based
  • 19. Eventual Consistency  Given a sufficiently long period of time, over which no updates are sent, one can expect that all updates will, eventually, propagate through the system and all the replicas will be consistent.  In the presence of continuing updates, an accepted update eventually either reaches a replica or the replica retires from service.
  • 21. Scalability  Scalability is the ability of a system to increase throughput with addition of resources to address load increases.  Scalability can be achieved by: – Provisioning a large and powerful resource to meet the additional demands. – It can be achieved by relying on a cluster of ordinary machines to work as a unit.
  • 22. How to choose ?  Scalability  Transactionalintegrity and consistency  Data modeling  Query support  Access and interface availability
  • 23. Scalability  column-family-centric NoSQL databases are a good choice if extreme scalability is a requirement.  Not well suited for real-time transaction processing. (RDBMS is best)  Eventually consistent NoSQL options, like Cassandra or Riak, may be workable.
  • 24. Transactional Integrity and Consistency  Batch-centric analytics on warehoused data is also not subject to transactional requirements.  Data sets that are written once for e.g., web traffic log files, social networking status updates, advt. click-through imprints, road- traffic data, stock market tick data, game scores etc.
  • 25. Transactional Integrity and Consistency  If range operations are common and integrity of updates is required, an RDBMS is the best choice.  If atomicity at an individual item level is sufficient, then column-family databases, document databases.
  • 26. Data Modeling  RDBMS offers a consistent way of modeling data. Relational algebra underlies the data model.  In the NoSQL world there is no such standardized and well-defined data model.
  • 27. Data Modeling  Ifrelaxed schema is your primary reason for using NoSQL, then MongoDB is a great option for getting started with NoSQL.  MongoDB is used by many web-centric businesses.
  • 28. Querying Support  An RDBMS thrives on SQL support, which makes accessing and querying data easy.  Among document databases, MongoDB provides the best querying capabilities.  For key/value pairs and in-memory stores, nothing is more feature-rich than Redis as far as querying capabilities go.
  • 29. Querying Support  Column-family stores like HBase have little to offer as far as rich querying capabilities go.  Project called Hive makes it possible to query HBase using SQL-like syntax and semantics.
  • 30. Access and Interface Availability  MongoDB has the notion of drivers.  CouchDB always has the RESTful HTTP interface available.  Redis, Membase, Riak, HBase, Hypertable, Cassandra, and Voldemort have support for language bindings to connect from most mainstream languages.
  • 32. 50/50 Read and Update  Resultsshowthat under this test case Apache Cassandra outperforms the competition on both read and update latencies.  HBase comes close but stays behind Cassandra.
  • 33. 95/5 Read and Update  The sorted ordered column-family stores perform best for contiguous range reads.  HBase seems to deliver consistent performance for reads, irrespective of the number of operations per second.  MySQL delivers the best performance for read-only cases.
  • 34. Future? Coexistence
  • 35. Future  Getting ready for polyglot persistence.  Understanding the database technologies suitable for immutable data sets.  Choosing the right database to facilitate ease of application development.
  • 36. Examples  Linked In uses Hadoop for many large-scale analytics jobs like probabilistically predicting people you may know.  Facebook (mysql + HBase, cassandra, ZooKeeper)  Twitter (mysql + Cassandra + FlockDB)
  • 38. Feedback trainer.prateek@gmail.com

Editor's Notes

  1. RDBMS assumes a well- defi ned structure in data. It assumes that the data is dense and is largely uniform. RDBMS builds on a prerequisite that the properties of the data can be defi ned up front and that its interrelationships are well established and systematically referenced. It also assumes that indexes can be consistently defi ned on data sets and that such indexes can be uniformly leveraged for faster querying. in the context of massive sparse data sets with loosely defi ned structures, RDBMS appears a forced fi t. With massive data sets the typical storage mechanisms and access methods also get stretched. Denormalizing tables, dropping constraints, and relaxing transactional guarantee can help an RDBMS scale, but after these modifi cations an RDBMS starts resembling a NoSQL product.