Bathcamp 2010-riak


Timothy Perrett


What is Riak?
• Document-oriented database
• Written in Erlang
• Based on Dynamo[1] and CAP Theorem[2]
• Highly fault tolerant
• HTTP and Protocol Buffers interfaces
• Write MapReduce in Erlang or JavaScript
1. http://goo.gl/r8Np
2. http://www.julianbrowne.com/article/viewer/brewers-cap-theorem

Same, Same but different
• Riak solves similar problems to MongoDB
• Semi-structured data modeled as “documents”
• Storage of non-document data in the database
• High write-availability
• Riak is intrinsically multi-node scalable
• MongoDB, in comparison, is a single system (+ sharding)
• Riak achieves availability via quorum writes
• Mongo uses performant in-place writes
• Riak uses “masterless” replication

N/R/W – Dynamo
N = Number of replicas to store
R = Number of replicas that must respond to a read
W = Number of replicas that must acknowledge a write
• These principles first appeared in an Amazon
research paper known as Dynamo

Number of replicas
• 160-bit integer key space. Each node that joins is
assigned part of that space for consistent hashing
• Hashing means any node can service any request, making
the cluster masterless and eventually consistent
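The key space described above can be sketched in a few lines of Python. This is a simplified illustration, not Riak's actual implementation: the function names, the bucket/key string format, the 64 equal partitions and the round-robin assignment of partitions to nodes are all assumptions made for the example. Only the 160-bit width (the size of a SHA-1 digest) comes from the slide.

```python
import hashlib
from bisect import bisect_right

RING_SIZE = 2 ** 160  # a SHA-1 digest is 160 bits wide


def key_position(bucket: str, key: str) -> int:
    """Map a bucket/key pair to an integer position on the 160-bit ring."""
    digest = hashlib.sha1(f"{bucket}/{key}".encode("utf-8")).digest()
    return int.from_bytes(digest, "big")


def build_ring(nodes, partitions=64):
    """Split the ring into equal partitions and deal them out round-robin.

    Assumption for this sketch: every node gets an equal share.
    """
    step = RING_SIZE // partitions
    return [(i * step, nodes[i % len(nodes)]) for i in range(partitions)]


def preference_list(ring, position, n):
    """Return the owners of the n partitions that follow the key's position."""
    starts = [start for start, _ in ring]
    first = bisect_right(starts, position) % len(ring)  # wrap around the ring
    return [ring[(first + i) % len(ring)][1] for i in range(n)]


ring = build_ring(["n0", "n1", "n2", "n3"])
position = key_position("users", "tim")
replicas = preference_list(ring, position, n=3)  # three distinct node names
```

Because every node can compute the same hash and look up the same ring, any node can work out where a key lives without asking a master.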

Reads
• R is the number of replica replies Riak waits for
before giving the client a successful reply
• Riak tries to access all N replicas, but a response
is given as soon as R of them have replied

Writes
• Same as reads: W is the number of nodes that must
reply successfully before the write is considered
successful by the client

Extreme example
• Given N=10, R=W=2 we
could have 8 nodes
down and the cluster
would still be fully
available to all clients
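A minimal in-memory sketch of the read and write rules, run against the extreme example above. The `Replica` class and both functions are hypothetical stand-ins for illustration; real Riak nodes talk over the network and reply in parallel, which this sequential loop does not model.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Replica:
    """Hypothetical stand-in for one node holding a copy of the value."""
    name: str
    up: bool = True
    value: Optional[str] = None


def quorum_write(replicas, value, w):
    """Send the write to every replica; succeed if at least w acknowledge."""
    acks = 0
    for replica in replicas:
        if replica.up:
            replica.value = value
            acks += 1
    return acks >= w


def quorum_read(replicas, r):
    """Ask the replicas in turn; return as soon as r of them have replied."""
    replies = []
    for replica in replicas:
        if replica.up:
            replies.append(replica.value)
        if len(replies) >= r:
            return replies
    return None  # fewer than r replicas answered, so the read fails


# N=10 with 8 nodes down: R=W=2 still succeeds.
nodes = [Replica(f"n{i}") for i in range(10)]
for node in nodes[:8]:
    node.up = False

write_ok = quorum_write(nodes, "v1", w=2)
read_result = quorum_read(nodes, r=2)
```

Raising R or W to 3 in the same scenario makes the request fail, which is the trade-off the N/R/W settings expose.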

What does this all mean?
• N/R/W specified at request time, so each
client can specify its own tolerance for
outages dynamically
• Despite any outages within the cluster, the whole
cluster can still appear available based on N/R/W
• Given N=3 and R=W=2, we can have 3-2=1 node
down/unreachable/laggy in the cluster
• Stupidly high availability complete with eventual
consistency controlled by dynamic clients
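The arithmetic in the bullets above generalises to a one-line rule: the number of replicas that can be unavailable while both reads and writes still succeed is N minus the larger of R and W. The function name below is an assumption for this sketch, not part of any Riak client.

```python
def tolerated_failures(n: int, r: int, w: int) -> int:
    """Replicas that can be down while reads and writes both still succeed."""
    if not (1 <= r <= n and 1 <= w <= n):
        raise ValueError("r and w must be between 1 and n")
    return n - max(r, w)


typical = tolerated_failures(3, 2, 2)    # the N=3, R=W=2 case: 3 - 2 = 1
extreme = tolerated_failures(10, 2, 2)   # the N=10, R=W=2 case: 10 - 2 = 8
```

Since the values are chosen per request, one client could read with R=1 for maximum availability while another reads the same key with R=3 for a stricter answer.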

Brewer’s CAP Theorem
• Consistency
• Availability
• Partition Tolerance
• You can't have all things, all the time…
• …but you can have some of each, all the time!
• Riak is about choosing your own levels of
each according to your use case

Consistency
• Start with document
version zero
• Things get redistributed
and n0 and n2 are
sitting in NYC and n1
and n3 are in London
• What if stuff changes??

Consistency
• Uh oh: inconsistency
• Both parts of the cluster
are still fully available
• NYC serves v1 whilst
London serves v0
• The network resumes
and Riak determines
the latest version by
using vector clocks

Consistency
• What if both sides of
the Atlantic changed?
• Riak is unable to
determine which is the
right document, so both
are returned to the
client with an indication
of the inconsistency
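The three consistency slides can be illustrated with a small vector clock comparison. This is a sketch of the general technique, not Riak's internal format: the clocks here are plain dictionaries mapping an actor name to an update counter, and the function names and return strings are assumptions made for the example.

```python
def descends(a: dict, b: dict) -> bool:
    """True if clock a has seen every update recorded in clock b."""
    return all(a.get(actor, 0) >= count for actor, count in b.items())


def compare(a: dict, b: dict) -> str:
    """Classify two versions by comparing their vector clocks."""
    if descends(a, b) and descends(b, a):
        return "equal"
    if descends(a, b):
        return "a is newer"
    if descends(b, a):
        return "b is newer"
    return "conflict"  # neither side has seen the other's update


v0 = {"n0": 1}                       # document version zero
nyc_v1 = {"n0": 2}                   # NYC updated during the partition
london_v0 = dict(v0)                 # London did not change anything
london_v1 = {"n0": 1, "n1": 1}       # London also updated, via n1

one_sided = compare(nyc_v1, london_v0)   # NYC's version wins
two_sided = compare(nyc_v1, london_v1)   # conflict: both go to the client
```

In the one-sided case the newer clock descends from the older one, so the latest version is unambiguous. In the two-sided case neither clock descends from the other, which is the situation where both documents are handed back to the client.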

Riak Search
• Distributed, fault-tolerant full-text searching
• Lucene syntax for queries
• No need for index sharding
• Linear scaling
• Double the number of nodes to get double the
search capacity (awesome!)
• Search via fields, wildcards, fuzzy text or token proximity

Questions?
basho.com/riak.html
github.com/basho/riak
twitter.com/timperrett
github.com/timperrett
blog.getintheloop.eu

1. Timothy Perrett Bath Camp 2010
