Bulk Exporting from Cassandra - Carlo Cabanilla

Carlo gives his perspective on the challenges of doing large exports from Cassandra.

Published in: Technology, Business

Transcript

  • 1. Bulk exporting data from Cassandra. Carlo Cabanilla, @clofresh
  • 2. Why export?
  • 3. snapshot
  • 4. sstable2json
  • 5. Killing IO on live cluster
  • 6. sstable2json → sstable2csv, with filters
  • 7. ionice -c 3
  • 8. Need a place to put it
  • 9. EBS to the rescue
  • 10. gzipped
  • 11. S3cmd
  • 12. Need to dedupe
  • 13. Hadoop
  • 14. numpy pickles
  • 15. Haderp → Mortar Data
  • 16. numpy pickles → msgpack lz4
  • 17. gzipped → lzo'd
  • 18. Haderp file naming! 2010-07-27~org-1018~m-48778.csv-1,316.gz
  • 19. S3 copy
  • 20. Bulk exporting data from Cassandra. Carlo Cabanilla, @clofresh
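
The export steps in slides 4 to 10 (run the SSTable exporter, keep it from starving the live cluster of IO with `ionice -c 3`, and gzip the result) might be wired together roughly as follows. This is a minimal sketch, not the speaker's actual tooling: the function names `export_command` and `export_gzipped` are hypothetical, the exporter is assumed to take a single SSTable path and write to stdout, and the idle IO class only has an effect on IO schedulers that honor it.

```python
import gzip
import shutil
import subprocess


def export_command(sstable_path, exporter="sstable2json"):
    """Build the argv for one export run (hypothetical helper).

    `ionice -c 3` puts the process in the idle IO class, so it gets disk
    time only when nothing else on the node is asking for it.
    """
    return ["ionice", "-c", "3", exporter, sstable_path]


def export_gzipped(sstable_path, out_path, exporter="sstable2json"):
    """Run the exporter and stream its stdout into a gzip file.

    Assumption: the exporter prints the dump to stdout, so the output
    never has to exist uncompressed on disk.
    """
    proc = subprocess.Popen(
        export_command(sstable_path, exporter),
        stdout=subprocess.PIPE,
    )
    with gzip.open(out_path, "wb") as out:
        shutil.copyfileobj(proc.stdout, out)
    proc.stdout.close()
    if proc.wait() != 0:
        raise RuntimeError("%s failed on %s" % (exporter, sstable_path))
```

Passing `exporter="sstable2csv"` would swap in the filtered CSV exporter named on slide 6, provided it accepts the same single-path invocation, which the slides do not confirm.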