NoSQL, NewSQL, and BeyondThe answer to SPRAINed relational databasesMatthew AslettResearch Manager, Data Management and An...
451 Research Matthew Aslett  • Research manager, data management and analytics  • With The 451 Group since 2007  • www.tw...
The 451 Group                                                               3                © 2012 by The 451 Group. All ...
Relevant reports NoSQL, NewSQL and Beyond  • Assessing the drivers behind the development and adoption    of NoSQL and Ne...
NoSQL, NewSQL and Beyond NoSQL                                               NewSQL  New breed of non-relational         ...
The NoSQL landscape   Key Value Store                                                     Graph  •Citrusleaf              ...
The NewSQL ecosystem-as-a-Service                                                New databases                            ...
SPRAINED RELATIONAL DATABASESPhoto credit: Foxtongue on Flickrhttp://www.flickr.com/photos/foxtongue/4844016087/          ...
SPRAIN   Scalability - Hardware economics    Example project/service/vendor:   • BigTable, HBase, Riak, MongoDB, Couchbas...
SPRAIN   Scalability    Netflix:   • 37X growth in requests Jan 2010-Jan 2011   • “We had to get out of the datacenter bu...
SPRAIN   Performance – RDBMS limitations    Example project/service/vendor:   • Hypertable, Couchbase, Riak, Membrain, Mo...
SPRAIN   Performance    Tellybug:   • Having won the contract to deliver app for Britain’s Got Talent     realised that i...
SPRAIN   Relaxed consistency - CAP Theorem    Example project/service/vendor:   • Dynamo, Voldemort, Cassandra, Riak   • ...
SPRAIN   Relaxed consistency    Netflix:   • “We value availability over consistency. We don’t need full     consistency....
SPRAIN   Agility - polyglot persistence, schema-less    Example project/service/vendor:   • MongoDB, CouchDB, Cassandra, ...
SPRAIN   Agility    Tellybug:   • Had 4-5 weeks to ship a production system to meet contract   • Ability to re-build enti...
SPRAIN   Intricacy – volume, velocity, variety    Example project/service/vendor:   • Neo4j, GraphDB, InfiniteGraph   • A...
SPRAIN   Intricacy    Dachis Group:   • Combining social media data for 2,000 brands with sentiment,     relationship, co...
SPRAIN   Necessity - open source    The failure of existing suppliers to address emerging     requirements    Example pr...
SPRAIN   Necessity    Spotify:   • Created own backup and restore technology    Rackspace:   • Distributed secondary ind...
Relevant reports MySQL, NoSQL and NewSQL  • Assessing the competitive dynamic between the MySQL    ecosystem, NoSQL and N...
Thank you. Questions? Comments?   matt.aslett@451research.com             @maslett       © 2012 by The 451 Group. All righ...
Upcoming SlideShare
Loading in …5
×

Cassandra EU 2012 - Overview of Case Studies and State of the Market by 451 Research

2,424 views
2,280 views

Published on

Slides from Matt Aslett's presentation at Cassandra Europe 2012.

Published in: Technology
0 Comments
2 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
2,424
On SlideShare
0
From Embeds
0
Number of Embeds
51
Actions
Shares
0
Downloads
89
Comments
0
Likes
2
Embeds 0
No embeds

No notes for slide

Cassandra EU 2012 - Overview of Case Studies and State of the Market by 451 Research

  1. 1. NoSQL, NewSQL, and BeyondThe answer to SPRAINed relational databasesMatthew AslettResearch Manager, Data Management and Analytics 1 © 2012 by The 451 Group. All rights reserved
  2. 2. 451 Research Matthew Aslett • Research manager, data management and analytics • With The 451 Group since 2007 • www.twitter.com/maslettInformation Management Commercial Adoption of Open Source Operational databases (CAOS) Data warehousing  Open source projects Data caching  Adoption of open source software Event processing  Vendor strategies © 2012 by The 451 Group. All rights reserved
  3. 3. The 451 Group 3 © 2012 by The 451 Group. All rights reserved
  4. 4. Relevant reports NoSQL, NewSQL and Beyond • Assessing the drivers behind the development and adoption of NoSQL and NewSQL databases, as well as data grid/caching technologies • Released April 2011 • Role of open source in driving innovation • sales@the451group.com © 2012 by The 451 Group. All rights reserved
  5. 5. NoSQL, NewSQL and Beyond NoSQL NewSQL  New breed of non-relational  New breed of relational database products database products  Rejection of fixed table  Retain SQL and ACID schema and join operations  Designed to meet scalability  Designed to meet scalability requirements of distributed requirements of distributed architectures architectures  Or improve performance so  And/or schema-less data horizontal scalability is no management requirements longer a necessity … and Beyond  In-memory data grid/cache products  Potentialprimary platform for distributed data management © 2012 by The 451 Group. All rights reserved
  6. 6. The NoSQL landscape Key Value Store Graph •Citrusleaf Document • InfiniteGraph • HandlerSocket* •RavenDB • Neo4j • Redis • Riak • MongoDB • DEX • Voldemort •CouchBase • CouchDB •OrientDB • Membrain • Mongo Labs • Cloudant •NuvolaBase • Oracle NoSQL • Mongo HQ • Iris Couch • Castle •DynamoDB •RethinkDB • Redis-to-go •LevelDB • SimpleDB • Cassandra • DataStax EE •Acunu • HBase • App Engine Datastore -as-a-Service • Hypertable Big Tables © 2012 by The 451 Group. All rights reserved
  7. 7. The NewSQL ecosystem-as-a-Service New databases •Drizzle •NuoDB• Xeround •Drizzle • VoltDB •SQLFire• Tokutek • Akiban • JustOne DB • Translattice • GenieDB • Clustrix • Schooner SQL •ScaleDB •ParElastic • ScaleBaseStorage engines • MySQL Cluster •Continuent •ScaleArc •Zimory Scale •Galera • CodeFutures Clustering/sharding © 2012 by The 451 Group. All rights reserved
  8. 8. SPRAINED RELATIONAL DATABASESPhoto credit: Foxtongue on Flickrhttp://www.flickr.com/photos/foxtongue/4844016087/ © 2012 by The 451 Group. All rights reserved
  9. 9. SPRAIN Scalability - Hardware economics  Example project/service/vendor: • BigTable, HBase, Riak, MongoDB, Couchbase, Hadoop • Xeround, NuoDB • Data grid/cache  Associated use case: • Large-scale distributed data storage • Analysis of continuously updated data • Multi-tenant PaaS data layer © 2012 by The 451 Group. All rights reserved
  10. 10. SPRAIN Scalability  Netflix: • 37X growth in requests Jan 2010-Jan 2011 • “We had to get out of the datacenter business”  Dachis Group: • Specifically wanted a horizontally scalable data store  Spotify: • Importance (and challenges) of cross-datacenter replication  Tellybug: • Peak scalability – mission critical for two hours per week. Elastic scalability still not solved. © 2012 by The 451 Group. All rights reserved
  11. 11. SPRAIN Performance – RDBMS limitations  Example project/service/vendor: • Hypertable, Couchbase, Riak, Membrain, MongoDB, Redis • Data grid/cache • VoltDB, Clustrix  Associated use case: • Real time data processing of mixed read/write workloads • Data caching • Large-scale data ingestion © 2012 by The 451 Group. All rights reserved
  12. 12. SPRAIN Performance  Tellybug: • Having won the contract to deliver app for Britain’s Got Talent realised that its MySQL/Django/Python stack couldn’t deliver the anticipated load  Spotify: • Major upgrades without service interruptions, not possible with sharded SQL databases  Rackspace: • Ability to monitor a million different things, ability to withstand the failure of 1/3 data centers © 2012 by The 451 Group. All rights reserved
  13. 13. SPRAIN Relaxed consistency - CAP Theorem  Example project/service/vendor: • Dynamo, Voldemort, Cassandra, Riak • Amazon DynamoDB  Associated use case: • Multi-data center replication • Service availability • Non-transactional data off-load © 2012 by The 451 Group. All rights reserved
  14. 14. SPRAIN Relaxed consistency  Netflix: • “We value availability over consistency. We don’t need full consistency.”  Tellybug: • Soft production data – the ability to ignore failures • Spotify: • Flexibility to combine different consistency levels for different column families in a single application © 2012 by The 451 Group. All rights reserved
  15. 15. SPRAIN Agility - polyglot persistence, schema-less  Example project/service/vendor: • MongoDB, CouchDB, Cassandra, Riak • Google App Engine, SimpleDB,  Associated use case: • Mobile/remote device synchronization • Agile development • Data caching © 2012 by The 451 Group. All rights reserved
  16. 16. SPRAIN Agility  Tellybug: • Had 4-5 weeks to ship a production system to meet contract • Ability to re-build entire infrastructure from scratch in 10-15 mins  Netflix: • Single SQL database had to be brought down to change the schema. • Put the logic into the Web services and employ distributed key stores to enable agile development and schema changes  Spotify: • Cassandra is a platform for quickly developing new applications © 2012 by The 451 Group. All rights reserved
  17. 17. SPRAIN Intricacy – volume, velocity, variety  Example project/service/vendor: • Neo4j, GraphDB, InfiniteGraph • Apache Cassandra, Hadoop, Riak • VoltDB, Clustrix  Associated use case: • Social networking applications • Geo-locational applications • Configuration management database © 2012 by The 451 Group. All rights reserved
  18. 18. SPRAIN Intricacy  Dachis Group: • Combining social media data for 2,000 brands with sentiment, relationship, conversations, social graph  Spotify: • More than half a billion playlists, about 10 thousand requests per second at peak  Rackspace:  One row per customer with thousands of columns – potentially millions of columns  Tellybug: • Ability to cope with 10,000 interactions/sec and integrate that with the social graph for thousands of concurrent users © 2012 by The 451 Group. All rights reserved
  19. 19. SPRAIN Necessity - open source  The failure of existing suppliers to address emerging requirements  Example projects: • BigTable: Google • Dynamo: Amazon • Cassandra: Facebook • HBase: Powerset • Voldemort: LinkedIn • Hypertable: Zvents • Neo4j: Windh Technologies © 2012 by The 451 Group. All rights reserved
  20. 20. SPRAIN Necessity  Spotify: • Created own backup and restore technology  Rackspace: • Distributed secondary indexing, blob storage, many other projects  Tellybug: • Created sharded counter in memcached, wrote several tools to figure out closeness to the ‘truth’  Netflix: • Developed multiple tools including multi-datacenter replication, automated configuration and backup, provisioning interface etc © 2012 by The 451 Group. All rights reserved
  21. 21. Relevant reports MySQL, NoSQL and NewSQL • Assessing the competitive dynamic between the MySQL ecosystem, NoSQL and NewSQL technologies • Due May 2012 • Including market sizing of the three database segments • Survey of 200+ database users • sales@the451group.com © 2012 by The 451 Group. All rights reserved
  22. 22. Thank you. Questions? Comments? matt.aslett@451research.com @maslett © 2012 by The 451 Group. All rights reserved

×