An introduction to Cassandra<br />London Web Meetup 20/1/2011<br />Dave Gardner @davegardnerisme<br />
Plug<br />Regular Cassandra meetup at Skills Matter near Old Street.<br />Monday 21st February. Free beer! Free pizza! <br...
What is Cassandra?<br />
Cassandra is a distributed,  fault-tolerant, scalable, column-oriented data store<br />More on this in a minute…<br />
Who is using Cassandra?<br />
Why might you want to use a distributed,  fault-tolerant, scalable, column-oriented data store?<br />
Collecting preferences<br />
Recommendations<br />The Tweeted Times<br />
Social graphs<br />
That’s a lot of nuts<br />http://www.flickr.com/photos/swamibu/3520926105/sizes/o/<br />
Why Cassandra?<br />
Cassandra is a distributed,  fault-tolerant, scalable, column-oriented data store<br />
#1<br />#6<br />#2<br />No master node (no single point of failure)<br />#3<br />#5<br />Client<br />#4<br />
Cassandra is a distributed,  fault tolerant, scalable, column-oriented data store<br />
Replication Factor = 3<br />#1<br />#6<br />#2<br />#3<br />#5<br />Client<br />#4<br />
Key feature!<br />Cassandra hastunable consistency<br />(on a per-request basis)  <br />
Consistency level: how many replicas must respond to declare success?<br />Plus some DC aware levels<br />
Cassandra is a distributed,  fault-tolerant, scalable, column-oriented data store<br />
Key feature!<br />Scales linearly for both reads and writes<br />(horizontally)  <br />
Cassandra is particularly suited to use-cases that have a high write load<br />
Cassandra is a distributed,  fault-tolerant, scalable, column-oriented data store<br />
{‘foo’ => {‘bar’ => ‘baz’}}<br />{key => {col_name => col_value}}<br />
{<br />column_family => {<br />		key => {<br />col_name => col_value<br />		}<br />	}<br />}<br />
That was a very brief overview<br />Learn more at Cassandra London!<br />
Thank you! Any questions?<br />Dave Gardner @davegardnerisme<br />Cassandra London @cassandralondon<br />http://meetup.com...
Further reading for the keen ones…<br />WTF is a SuperColumn? An Intro to the Cassandra Data Modelhttp://arin.me/blog/wtf-...
More reading for the really keen…<br />Introduction to Cassandra: Replication and Consistency by Ben Blackhttp://www.slide...
Upcoming SlideShare
Loading in...5
×

Introduction to Cassandra at London Web Meetup

4,084

Published on

A 15 minute introduction to the Cassandra distributed data store from the February 2011 London Web meetup.

This covers the basics of who is using it, why you might want to use it (due to the large amount of data being collected by Web Apps today) and, most importantly, _what_ it is!

0 Comments
3 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
4,084
On Slideshare
0
From Embeds
0
Number of Embeds
2
Actions
Shares
0
Downloads
86
Comments
0
Likes
3
Embeds 0
No embeds

No notes for slide

Introduction to Cassandra at London Web Meetup

  1. 1. An introduction to Cassandra<br />London Web Meetup 20/1/2011<br />Dave Gardner @davegardnerisme<br />
  2. 2. Plug<br />Regular Cassandra meetup at Skills Matter near Old Street.<br />Monday 21st February. Free beer! Free pizza! <br />http://meetup.com/Cassandra-London<br />
  3. 3. What is Cassandra?<br />
  4. 4. Cassandra is a distributed, fault-tolerant, scalable, column-oriented data store<br />More on this in a minute…<br />
  5. 5. Who is using Cassandra?<br />
  6. 6.
  7. 7. Why might you want to use a distributed, fault-tolerant, scalable, column-oriented data store?<br />
  8. 8. Collecting preferences<br />
  9. 9. Recommendations<br />The Tweeted Times<br />
  10. 10. Social graphs<br />
  11. 11. That’s a lot of nuts<br />http://www.flickr.com/photos/swamibu/3520926105/sizes/o/<br />
  12. 12. Why Cassandra?<br />
  13. 13. Cassandra is a distributed, fault-tolerant, scalable, column-oriented data store<br />
  14. 14. #1<br />#6<br />#2<br />No master node (no single point of failure)<br />#3<br />#5<br />Client<br />#4<br />
  15. 15. Cassandra is a distributed, fault tolerant, scalable, column-oriented data store<br />
  16. 16. Replication Factor = 3<br />#1<br />#6<br />#2<br />#3<br />#5<br />Client<br />#4<br />
  17. 17. Key feature!<br />Cassandra hastunable consistency<br />(on a per-request basis) <br />
  18. 18. Consistency level: how many replicas must respond to declare success?<br />Plus some DC aware levels<br />
  19. 19. Cassandra is a distributed, fault-tolerant, scalable, column-oriented data store<br />
  20. 20. Key feature!<br />Scales linearly for both reads and writes<br />(horizontally) <br />
  21. 21. Cassandra is particularly suited to use-cases that have a high write load<br />
  22. 22. Cassandra is a distributed, fault-tolerant, scalable, column-oriented data store<br />
  23. 23. {‘foo’ => {‘bar’ => ‘baz’}}<br />{key => {col_name => col_value}}<br />
  24. 24. {<br />column_family => {<br /> key => {<br />col_name => col_value<br /> }<br /> }<br />}<br />
  25. 25. That was a very brief overview<br />Learn more at Cassandra London!<br />
  26. 26. Thank you! Any questions?<br />Dave Gardner @davegardnerisme<br />Cassandra London @cassandralondon<br />http://meetup.com/Cassandra-London<br />
  27. 27. Further reading for the keen ones…<br />WTF is a SuperColumn? An Intro to the Cassandra Data Modelhttp://arin.me/blog/wtf-is-a-supercolumn-cassandra-data-model<br />Clients list (Python, Java, .NET, Ruby, PHP)http://wiki.apache.org/cassandra/ClientOptions<br />Cassandra docs on DataStax websitehttp://www.datastax.com/docs/0.7/index<br />Introduction to Cassandra by Aaron Mortonhttp://www.slideshare.net/aaronmorton/well-railedcassandra24112010-5901169<br />Operations and troubleshooting guide by Ben Blackhttp://www.slideshare.net/benjaminblack/cassandra-summit-2010-operations-troubleshooting-intro<br />
  28. 28. More reading for the really keen…<br />Introduction to Cassandra: Replication and Consistency by Ben Blackhttp://www.slideshare.net/benjaminblack/introduction-to-cassandra-replication-and-consistency<br />Running Cassandra on Amazon EC2 by Dave Gardnerhttp://www.slideshare.net/davegardnerisme/running-cassandra-on-amazon-ec2<br />4 Months With Cassandra, a love story by CloudKickhttps://www.cloudkick.com/blog/2010/mar/02/4_months_with_cassandra/<br />Cassandra London meetuphttp://meetup.com/Cassandra-London<br />
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×