
Near Real Time Processing of Social Media Data with HBase

Uploaded on Aug 31, 2012



Monitoring social media and news media is, in general, much like building a search engine: a specialized search engine called a news agent. Where classical search engines aim to cover all kinds of content, a news agent focuses on news and social media only. This is where content freshness matters: processing has to be near real time.

To process several million news items, amounting to hundreds of megabytes per day, one has to choose a system architecture that is reliable on the one hand and massively scalable on the other. Those requirements lead to a distributed architecture and a NoSQL approach.

This talk gives a brief overview of the media monitoring use case, the system architecture based on Hadoop and HBase, and the challenges and lessons learned.

Upload Details

Uploaded via SlideShare as Adobe PDF

Usage Rights

© All Rights Reserved


Presentation Transcript