HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study

Cloudera, Inc.
Jul. 8, 2013
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
1 of 29

More Related Content

What's hot

HIPAA Compliance in the CloudHIPAA Compliance in the Cloud
HIPAA Compliance in the CloudDataWorks Summit/Hadoop Summit
How much money do you lose every time your ecommerce site goes down?How much money do you lose every time your ecommerce site goes down?
How much money do you lose every time your ecommerce site goes down?DataStax
Relying on Data for Strategic Decision-Making--Financial Services ExperienceRelying on Data for Strategic Decision-Making--Financial Services Experience
Relying on Data for Strategic Decision-Making--Financial Services ExperienceCloudera, Inc.
Verizon Centralizes Data into a Data Lake in Real Time for AnalyticsVerizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for AnalyticsDataWorks Summit
IMCSummit 2015 - Day 2 Developer Track - The Internet of Analytics – Discover...IMCSummit 2015 - Day 2 Developer Track - The Internet of Analytics – Discover...
IMCSummit 2015 - Day 2 Developer Track - The Internet of Analytics – Discover...In-Memory Computing Summit
How to Build Continuous Ingestion for the Internet of ThingsHow to Build Continuous Ingestion for the Internet of Things
How to Build Continuous Ingestion for the Internet of ThingsCloudera, Inc.

What's hot(20)

Similar to HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study

Deep.bi - Real-time, Deep Data Analytics Platform For EcommerceDeep.bi - Real-time, Deep Data Analytics Platform For Ecommerce
Deep.bi - Real-time, Deep Data Analytics Platform For EcommerceDeep.BI
Datastax - The Architect's guide to customer experience (CX)Datastax - The Architect's guide to customer experience (CX)
Datastax - The Architect's guide to customer experience (CX)DataStax
Hello Streams OverviewHello Streams Overview
Hello Streams Overviewpsanet
Vistara 3.1 - Delivering Unified IT OperationsVistara 3.1 - Delivering Unified IT Operations
Vistara 3.1 - Delivering Unified IT OperationsVistara
Patterns of Distributed Application DesignPatterns of Distributed Application Design
Patterns of Distributed Application DesignOrkhan Gasimov
Implementing Advanced Analytics PlatformImplementing Advanced Analytics Platform
Implementing Advanced Analytics PlatformArvind Sathi

Similar to HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study (20)

More from Cloudera, Inc.

Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxCloudera, Inc.
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera, Inc.
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards FinalistsCloudera, Inc.
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Cloudera, Inc.
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Cloudera, Inc.
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Cloudera, Inc.

More from Cloudera, Inc.(20)

Recently uploaded

Webhook Testing StrategyWebhook Testing Strategy
Webhook Testing StrategyDimpy Adhikary
Safe Community Call #12.pdfSafe Community Call #12.pdf
Safe Community Call #12.pdfLornyPfeifer
h2 meet pdf test.pdfh2 meet pdf test.pdf
h2 meet pdf test.pdfJohnLee971654
Easy Salesforce CI/CD with Open Source Only - Dreamforce 23Easy Salesforce CI/CD with Open Source Only - Dreamforce 23
Easy Salesforce CI/CD with Open Source Only - Dreamforce 23NicolasVuillamy1
How is AI changing journalism? Strategic considerations for publishers and ne...How is AI changing journalism? Strategic considerations for publishers and ne...
How is AI changing journalism? Strategic considerations for publishers and ne...Damian Radcliffe
Experts Live Europe 2023 - Ensure your compliance in Microsoft Teams with Mic...Experts Live Europe 2023 - Ensure your compliance in Microsoft Teams with Mic...
Experts Live Europe 2023 - Ensure your compliance in Microsoft Teams with Mic...Jasper Oosterveld

HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study

Editor's Notes

  1. How we plan to go over stuff
  2. The RichRelevance DataMesh Cloud Platform delivers a single view of your customer by:Giving you one single place to house unlimited sets of dataExample use cases:Create your own run-time strategies (predictive models)Create and manage segments via toolAutomatic & real-time segment creationView performance of strategies against KPIs Run adhoc queries using SQL-like toolImport into offline toolsOLAP capabilitiesMarket Basket AnalysisCustomer Lifetime ValueSequential Pattern miningManage APIs, build products & applications
  3. Nuggets or Data Points1.5PB not as big as yahoo or facebook – huge from a retail industry perspective
  4. Distributed System:: i.e. producers, brokers and consumer entities can all be deployed to different hosts in different colos in a truly distributed fashion and coordination controlled through zookeeperPersistence of Messages: messages need to be persisted on the broker for reliability, replay and temporary storagePush & Pull Mechanism:: i.e. push data to Kafka server and pull data from it using a consumer. This allows for two different rates: rate at which messages are transferred to the kafka server and the rate at which the messages are consumed.: Kafka supports GZIP and version 0.8 will additionally support Snappy compression.