DATABASES AND QUERIES
MATCHING PERFORMANCE AND RELIABILITY
Dave Smith 
VP, Engineering 
@dizzyd
SURVEY 
• Who has hit problems scaling RDBMS? 
• Who is using non-relational databases?
QUERIES 
• Relational 
• Key/value (document) 
• Text retrieval (full-text search) 
• Graph 
• Time-series 
• Geospatial
QUERIES (CONT.)
• What questions are you asking of your data?
  • Get a record by a key
  • Find records based on a relationship
  • Find all documents with a given term
  • Apply operation to metrics within a timeframe
It is possible to rewrite most queries in other forms.
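As a rough illustration of rewriting a query in another form, the sketch below answers "find records based on a relationship" twice: once with SQL, and once against a plain key/value map that maintains its own secondary index. The `orders` data, the key naming scheme (`order:<id>`, `customer_orders:<name>`), and the in-memory dict standing in for a key/value store are all assumptions made for the example.

```python
import sqlite3

# Hypothetical sample data: (order id, customer, total).
orders = [(1, "alice", 30), (2, "bob", 12), (3, "alice", 8)]

# Relational form: the database resolves the relationship at query time.
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, customer TEXT, total INTEGER)")
db.executemany("INSERT INTO orders VALUES (?, ?, ?)", orders)
sql_result = [
    row[0]
    for row in db.execute(
        "SELECT id FROM orders WHERE customer = ? ORDER BY id", ("alice",)
    )
]

# Key/value form: the application maintains the relationship at write time
# by updating a secondary index key alongside each record.
kv = {}
for order_id, customer, total in orders:
    kv[f"order:{order_id}"] = {"customer": customer, "total": total}
    kv.setdefault(f"customer_orders:{customer}", []).append(order_id)

# The same question becomes a single get-by-key.
kv_result = sorted(kv["customer_orders:alice"])
```

Both forms return the same order ids. The trade-off is where the work happens: the relational form pays at read time, the key/value form pays on every write and must keep the index consistent itself.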
PERFORMANCE
• Access patterns
  • Read/write mix
  • Sequential vs. Pareto vs. uniformly random
• Throughput - how many requests/sec?
• Latency - how long does it take to service a single request?
  • Always a distribution! Mean is meaningless…
• Data size
  • Total size of dataset
  • Size per item in dataset
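To make the point about latency being a distribution concrete, here is a minimal sketch that summarizes a set of request latencies by percentile as well as by mean. The sample values are invented, and `latency_summary` is a hypothetical helper using the nearest-rank percentile definition, not a function from any particular monitoring tool.

```python
import statistics

def latency_summary(samples_ms):
    """Summarize latencies with the mean and nearest-rank percentiles."""
    ordered = sorted(samples_ms)
    n = len(ordered)

    def percentile(p):
        # Nearest-rank: the smallest sample with at least p% of samples at or below it.
        rank = max(1, (p * n + 99) // 100)  # integer ceil(p * n / 100)
        return ordered[rank - 1]

    return {
        "mean": statistics.fmean(ordered),
        "p50": percentile(50),
        "p99": percentile(99),
        "max": ordered[-1],
    }

# Invented workload: 98 fast requests and 2 very slow ones.
samples = [2.0] * 98 + [500.0] * 2
summary = latency_summary(samples)
```

Here the mean lands near 12 ms, a latency that no request actually experienced: the typical request took 2 ms and the slowest took 500 ms. Reporting p50 and p99 side by side shows both facts.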
RELIABILITY
• How can databases fail?
  • Disks -> integrity checking
  • Nodes -> replication
  • Network -> versioning
  • Software -> (all of above)
  • Overload -> elasticity
• Key questions
  • How well does the system tolerate failure?
  • How well does the system deal with unexpected load?
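For the "Disks -> integrity checking" pairing, one common approach is to store a checksum next to each value and verify it on every read. The sketch below assumes a CRC32 prefix and a made-up record layout (`pack` and `unpack` are hypothetical helpers); real databases differ in which checksum they use and where they store it.

```python
import zlib

def pack(value: bytes) -> bytes:
    """Prefix the value with its 4-byte big-endian CRC32."""
    return zlib.crc32(value).to_bytes(4, "big") + value

def unpack(record: bytes) -> bytes:
    """Return the value, or raise if it no longer matches its checksum."""
    stored = int.from_bytes(record[:4], "big")
    value = record[4:]
    if zlib.crc32(value) != stored:
        raise ValueError("checksum mismatch: record is corrupt")
    return value

record = pack(b"hello world")

# Simulate a single flipped bit in the stored value (past the 4-byte prefix).
corrupted = bytearray(record)
corrupted[6] ^= 0x01
corrupted = bytes(corrupted)
```

Reading `record` returns the original bytes, while reading `corrupted` raises instead of silently returning bad data. Detection is only half of the story: recovering the value still needs another copy, which is where replication comes in.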
It can be impossible to distinguish between a slow node and a failed node.
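A small model of why this is so: a caller that waits for a reply with a timeout learns only whether a reply arrived in time. The `observe` function and the delay values below are hypothetical and simulate the wait rather than performing real network I/O.

```python
from typing import Optional

def observe(reply_after_s: Optional[float], timeout_s: float) -> str:
    """What a caller can conclude after waiting timeout_s for a reply.

    reply_after_s is the delay before the node answers, or None for a
    node that has failed and will never answer.
    """
    if reply_after_s is not None and reply_after_s <= timeout_s:
        return "alive"
    # No reply within the deadline: the caller cannot tell why.
    return "suspect"

healthy_node = observe(reply_after_s=0.02, timeout_s=1.0)
slow_node = observe(reply_after_s=5.0, timeout_s=1.0)
dead_node = observe(reply_after_s=None, timeout_s=1.0)
```

The slow node and the dead node produce the same observation. Raising the timeout only moves the boundary: it reduces false suspicion of slow nodes at the cost of taking longer to notice real failures.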
UGLY TRUTHS 
• All databases require tuning 
• Failure is hard to test — most people don’t bother 
• Networks fail — especially under high load 
• The more your database does, the more ways it can fail
• More code == more bugs
CHOICES, CHOICES… 
• MySQL, Postgres, Oracle 
• CouchDB, MongoDB, RethinkDB 
• Riak, Cassandra 
• HBase, Hypertable 
• MemSQL, CouchBase 
• ElasticSearch, SOLR 
• Neo4J, Titan
