• Email
  • Like
  • Save
  • Private Content
  • Embed
 

Online Analytics with Hadoop and Cassandra

by

  • 3,124 views

To date, Hadoop usage has focused primarily on offline analysis--making sense of web logs, parsing through loads of unstructured data in HDFS, etc. But what if you want to run map/reduce against your ...

To date, Hadoop usage has focused primarily on offline analysis--making sense of web logs, parsing through loads of unstructured data in HDFS, etc. But what if you want to run map/reduce against your live data set without affecting online performance? Combining Hadoop with Cassandra's multi-datacenter replication capabilities makes this possible. If you're interested in getting value from your data without the hassle and latency of first moving it into Hadoop, this talk is for you. I'll show you how to connect all the parts, enabling you to write map/reduce jobs or run Pig queries against your live data. As a bonus I'll cover writing map/reduce in Scala, which is particularly well-suited for the task.

Accessibility

Categories

Upload Details

Uploaded via SlideShare as Adobe PDF

Usage Rights

© All Rights Reserved

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate. If needed, use the feedback form to let us know more details.

Cancel

1 Embed 5

https://twitter.com 5

Statistics

Likes
4
Downloads
73
Comments
1
Embed Views
5
Views on SlideShare
3,119
Total Views
3,124

11 of 1 previous next

  • adi164 adi164 @rastrick thank you very much for posting such a nice topic. It helped me a lot. 3 weeks ago
    Are you sure you want to
Post Comment
Edit your comment

Online Analytics with Hadoop and Cassandra Online Analytics with Hadoop and Cassandra Presentation Transcript