Hw09 Real Time Business Intelligence

Transcript

  • 1. Real-Time BI in Hadoop
    • Bradford Stephens
    • Lead Engineer, Visible Technologies
    • Principal Consultant, Drawn to Scale Consulting
  • 2. Topics
    • Scalability and BI
    • Costs and Abilities
    • Search as BI
  • 3.  
  • 4.  
  • 5.  
  • 6. What Is BI?
  • 7.  
  • 8. What Is “Real-Time”?
    • Understanding Latency
    • We aim for <5 secs.
  • 9.  
  • 10. Scalability in BI
    • Scalability matters now
    • Social Media: Catalyst
    • All data is important
    • Data doesn’t scale with business size any more
  • 11. Search as BI
    • Katta = Distributed Search on Hadoop
    • Bobo = Faceted Lucene
  • 12.  
  • 13.  
  • 14.  
  • 15.  
  • 16.  
  • 17. Doing it Cheap
    • 100 TB, Structured and Unstructured
    • Oracle: $100,000,000
    • “NewSQL”: $4,000,000
    • Hadoop + Katta: $250,000
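
To put the three quoted prices on a common footing, the short sketch below works out the cost per terabyte and the multiple relative to the Hadoop + Katta figure. The dollar amounts and the 100 TB volume come straight from the slide; the variable and function names are illustrative only, and the slide does not say what each price includes.

```python
# Figures quoted on the slide for 100 TB of data (USD).
DATA_TB = 100
costs = {
    "Oracle": 100_000_000,
    "NewSQL": 4_000_000,
    "Hadoop + Katta": 250_000,
}


def cost_per_tb(total_usd, terabytes=DATA_TB):
    """Return the cost in USD per terabyte."""
    return total_usd / terabytes


# Cost per terabyte for each option.
per_tb = {name: cost_per_tb(total) for name, total in costs.items()}

# How many times more expensive each option is than Hadoop + Katta.
baseline = costs["Hadoop + Katta"]
multiple = {name: total / baseline for name, total in costs.items()}

for name in costs:
    print(f"{name}: ${per_tb[name]:,.0f} per TB, {multiple[name]:.0f}x baseline")
```

On these numbers the gap is 400x between Oracle and Hadoop + Katta, and 16x between “NewSQL” and Hadoop + Katta.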
  • 18. Why We Need Hadoop
    • Need to process high-latency data to get the “small stuff” fast
    • Robust Ecosystem
    • Need more than SQL. RDBMS not a Swiss-Army Knife
  • 19. Aggregation is Real-Time
    • Distributed Search w/ Katta + Facets = Aggregation-Based BI
    • Sum, Count, Filter, Avg, Group
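
The idea that facets give you aggregation can be illustrated in a few lines of plain Python. This is only a sketch of the concept: it does not use the Katta or Bobo APIs, and the `facet_aggregate` function, the field names, and the sample documents are all made up for illustration. A filter plays the role of the search query, and each facet value becomes a group with its own count, sum, and average.

```python
from collections import defaultdict


def facet_aggregate(docs, facet, metric, predicate=lambda doc: True):
    """Filter docs, group them by a facet field, and aggregate a metric.

    Returns {facet_value: {"count": ..., "sum": ..., "avg": ...}}.
    """
    buckets = defaultdict(lambda: {"count": 0, "sum": 0.0})
    for doc in docs:
        if not predicate(doc):  # Filter
            continue
        bucket = buckets[doc[facet]]  # Group
        bucket["count"] += 1  # Count
        bucket["sum"] += doc[metric]  # Sum
    return {
        value: {
            "count": b["count"],
            "sum": b["sum"],
            "avg": b["sum"] / b["count"],  # Avg
        }
        for value, b in buckets.items()
    }


# Hypothetical documents, standing in for records in a search index.
docs = [
    {"source": "twitter", "lang": "en", "mentions": 4},
    {"source": "twitter", "lang": "en", "mentions": 6},
    {"source": "blog", "lang": "en", "mentions": 10},
    {"source": "blog", "lang": "de", "mentions": 3},
]

# "Query": English documents only, faceted by source.
result = facet_aggregate(
    docs, "source", "mentions", predicate=lambda d: d["lang"] == "en"
)
print(result)
```

In a distributed setting each index shard would compute its own buckets and the partial counts and sums would be merged, which is why these particular aggregates suit a search cluster.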
  • 20. Protips: Review
    • Understand High vs. Low Latency data
    • Hadoop makes it cheap
    • Pre-aggregate w/ Hadoop, Explore w/ Katta + Faceted Search
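
The “pre-aggregate, then explore” split can be sketched as two stages. The code below imitates a map/reduce rollup in plain Python and then answers a query from the small rollup instead of the raw records. It is an assumption-laden illustration, not Hadoop or Katta code: `map_phase`, `reduce_phase`, `pre_aggregate`, `explore`, and the record fields are all hypothetical names.

```python
from collections import defaultdict


def map_phase(record):
    """Emit one (key, 1) pair per raw record, keyed by (day, source)."""
    yield (record["day"], record["source"]), 1


def reduce_phase(key, values):
    """Sum all values that share a key."""
    return key, sum(values)


def pre_aggregate(records):
    """Batch stage: roll raw records up into per-(day, source) counts."""
    shuffled = defaultdict(list)
    for record in records:
        for key, value in map_phase(record):
            shuffled[key].append(value)
    return dict(reduce_phase(key, values) for key, values in shuffled.items())


def explore(rollup, source):
    """Interactive stage: answer a query from the small rollup only."""
    return {day: n for (day, src), n in rollup.items() if src == source}


# Hypothetical raw events; in practice this would be the high-latency data.
records = [
    {"day": "2009-10-01", "source": "twitter"},
    {"day": "2009-10-01", "source": "twitter"},
    {"day": "2009-10-01", "source": "blog"},
    {"day": "2009-10-02", "source": "twitter"},
]

rollup = pre_aggregate(records)
twitter_by_day = explore(rollup, "twitter")
print(twitter_by_day)
```

The batch stage may take minutes or hours, but the interactive stage touches only the rollup, which is what keeps query latency low.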
  • 21. The Future
    • Search/BI as a Platform: “Google my Data Warehouse”
    • Real-Time MR on HBase