• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
 

HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study

on

  • 2,594 views

Presented by: Murtaza Doctor (RichRelevance) and Giang Nguyen (RichRelevance)

Presented by: Murtaza Doctor (RichRelevance) and Giang Nguyen (RichRelevance)

Statistics

Views

Total Views
2,594
Views on SlideShare
2,587
Embed Views
7

Actions

Likes
7
Downloads
1
Comments
0

2 Embeds 7

https://twitter.com 4
http://localhost 3

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • How we plan to go over stuff
  • The RichRelevance DataMesh Cloud Platform delivers a single view of your customer by:Giving you one single place to house unlimited sets of dataExample use cases:Create your own run-time strategies (predictive models)Create and manage segments via toolAutomatic & real-time segment creationView performance of strategies against KPIs Run adhoc queries using SQL-like toolImport into offline toolsOLAP capabilitiesMarket Basket AnalysisCustomer Lifetime ValueSequential Pattern miningManage APIs, build products & applications
  • Nuggets or Data Points1.5PB not as big as yahoo or facebook – huge from a retail industry perspective
  • Distributed System:: i.e. producers, brokers and consumer entities can all be deployed to different hosts in different colos in a truly distributed fashion and coordination controlled through zookeeperPersistence of Messages: messages need to be persisted on the broker for reliability, replay and temporary storagePush & Pull Mechanism:: i.e. push data to Kafka server and pull data from it using a consumer. This allows for two different rates: rate at which messages are transferred to the kafka server and the rate at which the messages are consumed.: Kafka supports GZIP and version 0.8 will additionally support Snappy compression.

HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study Presentation Transcript