NoSQL for SQL Users

Glynn Bird – Developer Advocate – IBM Cloud Data Services

Introduction
@glynn_bird glynn.bird@uk.ibm.com
Glynn Bird
Developer Advocate
IBM Cloud Data Services
http://www.glynnbird.com

Agenda
• NoSQL vs SQL
• Types of NoSQL
• Scaling
• Querying and Data Modelling
• Replication
• Demo

RDBMS
• Relational Database Management Systems
• SQL language developed by IBM in the 1970s
• RDBMSs power lots of IT systems
• Oracle, IBM DB2, MySQL, PostgreSQL etc.

RDBMS downsides
• Scalability
• Availability
• Price

NoSQL
• NoSQL = "Not only SQL"
• A response to use cases for which an RDBMS is not a good fit
• Easier to scale

Types of NoSQL
• Key-Value
• Document
• BigTable
• Graph

SQL vs NoSQL - Development Cycle
SQL
• Build
• Migrate staging database
• Test
• Migrate production
• Deploy
NoSQL
• Build
• Test
• Deploy

Database migrations are costly
• Adding/updating/deleting columns
• May cause interruption to service
• Often performed "out of hours"
• Have to be carefully planned in multi-server deployments
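
By contrast, a document database does not need a schema change before a new field can be stored; the application has to cope with documents of both shapes instead. A minimal sketch of that idea, in which the `newsletter` field, the sample people and the `wantsNewsletter` helper are all made up for illustration:

```javascript
// Two documents living side by side in the same database. The second was
// written after a hypothetical "newsletter" field was introduced.
// No migration was run on the first.
const olderDoc = { firstname: "Glynn", lastname: "Bird", verified: true };
const newerDoc = { firstname: "Sam", lastname: "Jones", verified: true, newsletter: true };

// Hypothetical helper: the application supplies the default for documents
// that predate the new field, rather than the database enforcing one.
function wantsNewsletter(doc) {
  return doc.newsletter === undefined ? false : doc.newsletter;
}

console.log(wantsNewsletter(olderDoc)); // false
console.log(wantsNewsletter(newerDoc)); // true
```

The trade-off is that the "migration" moves into application code, which has to keep handling older documents for as long as they exist.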

Scaling a Cloudant database
• Database-as-a-Service
• Free/PAYG/Dedicated/Local
• Sign up and start using
• Scale by adding nodes
• More data
• More concurrency

Scaling other NoSQL databases

SQL Tables
users
• userid*
• firstname
• lastname
• registration_date
• dob
• address1
• ...

socialmediaprofiles
• socmedid*
• userid*
• socmed_type
• url
• profile

SQL
SELECT *
FROM users
LEFT JOIN socialmediaprofiles
  ON users.userid = socialmediaprofiles.userid
WHERE registration_date > '2015-01-01'
  AND verified = true
  AND socialmedia = true
ORDER BY registration_date

NoSQL Data model
{
  "firstname": "Glynn",
  "lastname": "Bird",
  "dob": "1986-10-02",
  "registration_date": "2015-02-04",
  "verified": true,
  "address": { "address1": "10", "postcode": "W1A 1AA" },
  "socialmedia": [
    { "type": "twitter", "handle": "glynn_bird" },
    { "type": "github", "username": "glynnbird" }
  ]
}
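
To make the move from two tables to one document concrete, the following is a rough sketch of folding child rows into an array on the parent. The row values and the `toDocument` helper are hypothetical; only the column names come from the tables shown earlier.

```javascript
// Hypothetical rows as they might come back from the two SQL tables.
const userRow = {
  userid: 1, firstname: "Glynn", lastname: "Bird",
  registration_date: "2015-02-04", verified: true
};
const profileRows = [
  { socmedid: 10, userid: 1, socmed_type: "twitter", profile: "glynn_bird" },
  { socmedid: 11, userid: 1, socmed_type: "github", profile: "glynnbird" },
  { socmedid: 12, userid: 2, socmed_type: "twitter", profile: "someone_else" }
];

// Fold the matching child rows into an array on the parent, so a single
// document holds what previously needed a JOIN to assemble.
// userid is dropped here: a document database identifies the document by _id.
function toDocument(user, profiles) {
  const { userid, ...fields } = user;
  return {
    ...fields,
    socialmedia: profiles
      .filter((p) => p.userid === userid)
      .map((p) => ({ type: p.socmed_type, profile: p.profile }))
  };
}

const userDoc = toDocument(userRow, profileRows);
console.log(JSON.stringify(userDoc, null, 2));
```

Reading the user and all of their profiles then becomes a single document fetch, at the cost of rewriting the whole document when one profile changes.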

Cloudant Query
{
  "selector": {
    "$and": [
      { "registration_date": { "$gt": "2015-01-01" } },
      { "verified": true },
      { "socialmedia": true }
    ]
  },
  "sort": [
    "registration_date:string"
  ]
}
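
For readers used to WHERE clauses, it may help to see what a selector means when applied to documents. The sketch below is a toy in-memory evaluator that handles only `$and`, `$gt` and plain equality; it is an illustration, not how Cloudant evaluates queries, and the `matches` function and sample documents are invented for the example.

```javascript
// Toy evaluator for a small subset of selector syntax.
function matches(doc, selector) {
  return Object.entries(selector).every(([key, condition]) => {
    if (key === "$and") {
      // Every sub-selector in the array must match.
      return condition.every((sub) => matches(doc, sub));
    }
    const value = doc[key];
    if (condition !== null && typeof condition === "object") {
      // Operator form, e.g. { "$gt": "2015-01-01" }
      return Object.entries(condition).every(([op, operand]) => {
        if (op === "$gt") return value > operand;
        throw new Error("Unsupported operator in this sketch: " + op);
      });
    }
    // Literal form, e.g. { "verified": true }
    return value === condition;
  });
}

const selector = {
  "$and": [
    { "registration_date": { "$gt": "2015-01-01" } },
    { "verified": true }
  ]
};

const sampleDocs = [
  { firstname: "Glynn", registration_date: "2015-02-04", verified: true },
  { firstname: "Sam", registration_date: "2014-11-30", verified: true },
  { firstname: "Alex", registration_date: "2015-03-10", verified: false }
];

// Filter, then sort by registration_date as the "sort" clause would.
const found = sampleDocs
  .filter((d) => matches(d, selector))
  .sort((a, b) => a.registration_date.localeCompare(b.registration_date));

console.log(found.map((d) => d.firstname)); // [ 'Glynn' ]
```

ISO 8601 date strings compare correctly as plain strings, which is why `$gt` works on `registration_date` without any date parsing.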

MapReduce
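function(doc) {
  // Index verified users that have at least one social media profile,
  // keyed by registration date so the view is ordered by date.
  if (doc.verified && doc.socialmedia && doc.socialmedia.length > 0) {
    emit(doc.registration_date, null);
  }
}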
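
What the database does with a map function can be imitated locally: run it over every document, collect whatever it emits, and keep the rows sorted by key. The sketch below does that in plain JavaScript. The `buildView` helper and the sample documents are hypothetical, and `emit` is passed as a parameter here, whereas the database provides it to the map function itself.

```javascript
// Run mapFn over every document and return the emitted rows sorted by key.
function buildView(docs, mapFn) {
  const rows = [];
  for (const doc of docs) {
    mapFn(doc, (key, value) => rows.push({ id: doc._id, key, value }));
  }
  return rows.sort((a, b) => (a.key < b.key ? -1 : a.key > b.key ? 1 : 0));
}

// Same condition as the map function above, with emit as a parameter.
function mapVerifiedSocial(doc, emit) {
  if (doc.verified && doc.socialmedia && doc.socialmedia.length > 0) {
    emit(doc.registration_date, null);
  }
}

const userDocs = [
  { _id: "a", verified: true, registration_date: "2015-02-04", socialmedia: [{ type: "twitter" }] },
  { _id: "b", verified: false, registration_date: "2015-01-15", socialmedia: [{ type: "github" }] },
  { _id: "c", verified: true, registration_date: "2015-01-20", socialmedia: [{ type: "github" }] },
  { _id: "d", verified: true, registration_date: "2015-03-01", socialmedia: [] }
];

const viewRows = buildView(userDocs, mapVerifiedSocial);
console.log(viewRows.map((r) => r.id)); // [ 'c', 'a' ]
```

Documents "b" and "d" emit nothing, so they never appear in the index; the filtering happens when the index is built, not when it is queried.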


CRUD – Document
• Direct document lookup by _id
• Use when you want a single document and can find it by its _id

Primary Index
• Exists out of the box ("OOTB")
• Stored in a B-tree
• Primary key → doc._id
• Use when you can find documents based on their _id
• Pull back a range of keys

Secondary Index (view)
• Built using MapReduce
• Stored in a B-tree
• Key → user-defined field(s)
• Use when you need to analyze data or get a range of keys
• Ex: count data fields, sum/average numeric results, advanced stats, group by date, etc.

Search Index
• Built using Lucene
• FTI (full-text index): any or all fields can be indexed
• Ad-hoc queries
• Find documents based on their contents
• Can do groups, facets, and basic geo queries (bbox & sort by distance)

GeoSpatial Index
• Stored in R*, TPR, KD tree
• Lat/Long coordinates in GeoJSON
• Complex geometries (polygon, circularstring, etc.)
• Advanced relations (intersect, overlaps, etc.)

Cloudant Query
• "Mongo-style" querying
• Built natively in Erlang
• Ad-hoc queries
• Lots of operators (>, <, IN, OR, AND, etc.)
• Intuitive for people coming from Mongo or SQL backgrounds
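
The "group by date" use of a secondary index listed above is roughly what SQL users would write as GROUP BY with COUNT(*). In a view this is usually done by emitting an array key such as [year, month, day], attaching a counting reduce, and querying with a group level. The sketch below only imitates that behaviour locally; `countByGroup` and the sample rows are hypothetical.

```javascript
// Count rows whose array keys share the same first `groupLevel` elements,
// imitating a counting reduce queried at a given group level.
function countByGroup(rows, groupLevel) {
  const counts = new Map();
  for (const row of rows) {
    const groupKey = JSON.stringify(row.key.slice(0, groupLevel));
    counts.set(groupKey, (counts.get(groupKey) || 0) + 1);
  }
  return [...counts.entries()].map(([k, value]) => ({ key: JSON.parse(k), value }));
}

// Rows as a map function might emit them, keyed by [year, month, day].
const emittedRows = [
  { key: [2015, 1, 20], value: null },
  { key: [2015, 2, 4], value: null },
  { key: [2015, 2, 17], value: null },
  { key: [2015, 3, 1], value: null }
];

// Group level 2 groups by [year, month]; level 1 would group by year only.
const perMonth = countByGroup(emittedRows, 2);
console.log(perMonth);
```

Changing the group level at query time gives per-year, per-month or per-day totals from the same index, without defining a new view.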

Cloudant Replication
• Replicate data from one cluster to another
• Replicate data to browser/mobile and back
• No data loss
• Offline-first apps/websites
• http://www.glynnbird.com/
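
A replication job is described by a small JSON document naming a source and a target. The sketch below only builds that document; the field names follow the CouchDB-style replication API, while the `replicationRequest` helper and the URLs are placeholders invented for illustration. Such a body would typically be sent to the `_replicate` endpoint or saved in the `_replicator` database.

```javascript
// Build the JSON body describing a replication job.
// continuous: true asks for the replication to keep running as new
// changes arrive, rather than stopping once the target has caught up.
function replicationRequest(source, target, continuous = false) {
  return { source, target, continuous };
}

// Placeholder URLs, not real endpoints.
const replicationBody = replicationRequest(
  "https://account-a.example.com/users",
  "https://account-b.example.com/users",
  true
);

console.log(JSON.stringify(replicationBody, null, 2));
```

Swapping the source and target gives the reverse direction; two-way sync is simply two replications, one each way.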

Simple Search Service
• Free, open-source Bluemix App – install with one click
• Upload your .csv or .tsv
– Imports data into Cloudant
– Indexes everything for search
– Presents HTTP Search API
• Demo!
https://developer.ibm.com/clouddataservices/simple-search-service/
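
The import step amounts to turning each row of the uploaded file into a JSON document. The following is a naive sketch of that conversion, not the Simple Search Service's actual importer: the `csvToDocs` function and the sample data are hypothetical, and quoted fields or embedded commas are deliberately not handled.

```javascript
// The first line supplies the field names; every later line becomes
// one JSON document with those field names as keys.
function csvToDocs(csvText) {
  const lines = csvText.trim().split(/\r?\n/);
  const headers = lines[0].split(",").map((h) => h.trim());
  return lines.slice(1).map((line) => {
    const cells = line.split(",");
    const doc = {};
    headers.forEach((h, i) => {
      doc[h] = (cells[i] || "").trim();
    });
    return doc;
  });
}

const sampleCsv = "firstname,lastname,city\nGlynn,Bird,London\nSam,Jones,Leeds";
const importedDocs = csvToDocs(sampleCsv);
console.log(importedDocs);
```

Every value stays a string in this sketch; a real importer would also need to decide which columns are numbers or booleans.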

Simple Search Service Architecture

Simple Search Service – Production Architecture

Cloudant use-cases
• Big Data – Large data sets
• Scalable operational data store
• Search – faceted, full-text search
• Geo-spatial – geographic, GIS systems, GeoJSON
• Offline-first – replicating data to mobile devices

Glynn Bird
Developer Advocate, Cloud Data Services
glynn.bird@uk.ibm.com
@glynn_bird
github.com/glynnbird
www.glynnbird.com
