Cassandra performance                testing                                    Richard Low                               ...
MenuBig DataCassandra testingQ &A
Big Data?Scale, Performance, Low Cost: Need all 3 :-|
RDBMS ext3,...  LVM RAIDhardware
RDBMS ext3,...  LVM RAIDhardware
Big Data ext3,...  LVM RAIDhardware
Big Data ext3,...  LVM RAIDnew HW!
Big Datanew HW!
Cassandra (vanilla)   Cassandra (Acunu)
Cassandra (vanilla)       Cassandra (Acunu)                                 +                 performance
Cassandra (vanilla)        Cassandra (Acunu)                                  +                 performance               ...
Cassandra (vanilla)        Cassandra (Acunu)                                  +                 performance               ...
Performance testing
Inserts• Single box, 24 GB RAM, 7 disks• 3 billion rows• 50 byte values• 8 byte keys• 1 column per row• Batch insert 100 a...
Random Inserts - Vanilla Cassandra
Insert summary                       Vanilla Cassandra   Acunu Cassandra Insert time / hrs            26                 1...
If you only care about writeperformance, /dev/null is     great (and web-scale)
Range queries• High insert rate no use if reads are slow• High insert rate can impact range query  performance• Test: inse...
Summary...• Strong foundations for Big Data with order of  magnitude performance improvement• v1: Back Cassandra (+others)...
Thanks :)www.acunu.com/blog
Thanks :)www.acunu.com/blog
Acunu Cassandra London Meetup
Acunu Cassandra London Meetup
Acunu Cassandra London Meetup
Upcoming SlideShare
Loading in …5
×

Acunu Cassandra London Meetup

2,120
-1

Published on

Richard Low of Acunu presented some performance results demonstrating how marvelous Acunu's distribution of Cassandra is compared to the vanilla version. There's a related blog post with updated range query results here: http://www.acunu.com/2011/03/cassandra-under-heavy-write-load-part-ii/

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
2,120
On Slideshare
0
From Embeds
0
Number of Embeds
5
Actions
Shares
0
Downloads
23
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide
  • Introduce me\n
  • \n
  • \n
  • Old\nOracle, MySQL, etc; on top of legacy FS, volume mgr, RAID, hardware\nNew\nNoSQL (to get horizontal scalability)\nHardware evolving fast - but not evenly\nStuff in the middle hasn’t changed much\n\n\n
  • Old\nOracle, MySQL, etc; on top of legacy FS, volume mgr, RAID, hardware\nNew\nNoSQL (to get horizontal scalability)\nHardware evolving fast - but not evenly\nStuff in the middle hasn’t changed much\n\n\n
  • Old\nOracle, MySQL, etc; on top of legacy FS, volume mgr, RAID, hardware\nNew\nNoSQL (to get horizontal scalability)\nHardware evolving fast - but not evenly\nStuff in the middle hasn’t changed much\n\n\n
  • Old\nOracle, MySQL, etc; on top of legacy FS, volume mgr, RAID, hardware\nNew\nNoSQL (to get horizontal scalability)\nHardware evolving fast - but not evenly\nStuff in the middle hasn’t changed much\n\n\n
  • Old\nOracle, MySQL, etc; on top of legacy FS, volume mgr, RAID, hardware\nNew\nNoSQL (to get horizontal scalability)\nHardware evolving fast - but not evenly\nStuff in the middle hasn’t changed much\n\n\n
  • Old\nOracle, MySQL, etc; on top of legacy FS, volume mgr, RAID, hardware\nNew\nNoSQL (to get horizontal scalability)\nHardware evolving fast - but not evenly\nStuff in the middle hasn’t changed much\n\n\n
  • Old\nOracle, MySQL, etc; on top of legacy FS, volume mgr, RAID, hardware\nNew\nNoSQL (to get horizontal scalability)\nHardware evolving fast - but not evenly\nStuff in the middle hasn’t changed much\n\n\n
  • Old\nOracle, MySQL, etc; on top of legacy FS, volume mgr, RAID, hardware\nNew\nNoSQL (to get horizontal scalability)\nHardware evolving fast - but not evenly\nStuff in the middle hasn’t changed much\n\n\n
  • Old\nOracle, MySQL, etc; on top of legacy FS, volume mgr, RAID, hardware\nNew\nNoSQL (to get horizontal scalability)\nHardware evolving fast - but not evenly\nStuff in the middle hasn’t changed much\n\n\n
  • Old\nOracle, MySQL, etc; on top of legacy FS, volume mgr, RAID, hardware\nNew\nNoSQL (to get horizontal scalability)\nHardware evolving fast - but not evenly\nStuff in the middle hasn’t changed much\n\n\n
  • * performance is not just about speed, but also reliability + predictability ... \n* note the specific version tree, showing live / dev branches\n* monitoring goes from the cluster down to iostat\n* ... and includes management: easy cluster deployment\n* talk about performance here\n
  • * performance is not just about speed, but also reliability + predictability ... \n* note the specific version tree, showing live / dev branches\n* monitoring goes from the cluster down to iostat\n* ... and includes management: easy cluster deployment\n* talk about performance here\n
  • * performance is not just about speed, but also reliability + predictability ... \n* note the specific version tree, showing live / dev branches\n* monitoring goes from the cluster down to iostat\n* ... and includes management: easy cluster deployment\n* talk about performance here\n
  • * performance is not just about speed, but also reliability + predictability ... \n* note the specific version tree, showing live / dev branches\n* monitoring goes from the cluster down to iostat\n* ... and includes management: easy cluster deployment\n* talk about performance here\n
  • * performance is not just about speed, but also reliability + predictability ... \n* note the specific version tree, showing live / dev branches\n* monitoring goes from the cluster down to iostat\n* ... and includes management: easy cluster deployment\n* talk about performance here\n
  • * performance is not just about speed, but also reliability + predictability ... \n* note the specific version tree, showing live / dev branches\n* monitoring goes from the cluster down to iostat\n* ... and includes management: easy cluster deployment\n* talk about performance here\n
  • * performance is not just about speed, but also reliability + predictability ... \n* note the specific version tree, showing live / dev branches\n* monitoring goes from the cluster down to iostat\n* ... and includes management: easy cluster deployment\n* talk about performance here\n
  • * performance is not just about speed, but also reliability + predictability ... \n* note the specific version tree, showing live / dev branches\n* monitoring goes from the cluster down to iostat\n* ... and includes management: easy cluster deployment\n* talk about performance here\n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • e.g. facebook newsfeed, twitter stream\n
  • \n
  • \n
  • \n
  • Acunu Cassandra London Meetup

    1. 1. Cassandra performance testing Richard Low Software Engineer richard@acunu.comCassandra London Meetup, 21 Mar 2011
    2. 2. MenuBig DataCassandra testingQ &A
    3. 3. Big Data?Scale, Performance, Low Cost: Need all 3 :-|
    4. 4. RDBMS ext3,... LVM RAIDhardware
    5. 5. RDBMS ext3,... LVM RAIDhardware
    6. 6. Big Data ext3,... LVM RAIDhardware
    7. 7. Big Data ext3,... LVM RAIDnew HW!
    8. 8. Big Datanew HW!
    9. 9. Cassandra (vanilla) Cassandra (Acunu)
    10. 10. Cassandra (vanilla) Cassandra (Acunu) + performance
    11. 11. Cassandra (vanilla) Cassandra (Acunu) + performance + snapshots + clones
    12. 12. Cassandra (vanilla) Cassandra (Acunu) + performance + snapshots + clones + monitoring
    13. 13. Performance testing
    14. 14. Inserts• Single box, 24 GB RAM, 7 disks• 3 billion rows• 50 byte values• 8 byte keys• 1 column per row• Batch insert 100 at once
    15. 15. Random Inserts - Vanilla Cassandra
    16. 16. Insert summary Vanilla Cassandra Acunu Cassandra Insert time / hrs 26 18 Max 95thpercentile latency / 37 0.3 s
    17. 17. If you only care about writeperformance, /dev/null is great (and web-scale)
    18. 18. Range queries• High insert rate no use if reads are slow• High insert rate can impact range query performance• Test: insert 100 million rows, with same properties as before• After inserts, immediately perform small range queries (get_range_slices with random start key)
    19. 19. Summary...• Strong foundations for Big Data with order of magnitude performance improvement• v1: Back Cassandra (+others) and tools without modification• Fast, robust, simple platform• Makes test/dev cycle more effective
    20. 20. Thanks :)www.acunu.com/blog
    21. 21. Thanks :)www.acunu.com/blog
    1. A particular slide catching your eye?

      Clipping is a handy way to collect important slides you want to go back to later.

    ×