Your SlideShare is downloading. ×
0
Hw09   Real Time Business Intelligence
Hw09   Real Time Business Intelligence
Hw09   Real Time Business Intelligence
Hw09   Real Time Business Intelligence
Hw09   Real Time Business Intelligence
Hw09   Real Time Business Intelligence
Hw09   Real Time Business Intelligence
Hw09   Real Time Business Intelligence
Hw09   Real Time Business Intelligence
Hw09   Real Time Business Intelligence
Hw09   Real Time Business Intelligence
Hw09   Real Time Business Intelligence
Hw09   Real Time Business Intelligence
Hw09   Real Time Business Intelligence
Hw09   Real Time Business Intelligence
Hw09   Real Time Business Intelligence
Hw09   Real Time Business Intelligence
Hw09   Real Time Business Intelligence
Hw09   Real Time Business Intelligence
Hw09   Real Time Business Intelligence
Hw09   Real Time Business Intelligence
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×
Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply

Hw09 Real Time Business Intelligence

1,881

Published on

Published in: Technology, Business
0 Comments
3 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
1,881
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
116
Comments
0
Likes
3
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide

Transcript

  • 1. Real-Time BI in Hadoop <ul><li>Bradford Stephens </li></ul><ul><li>Lead Engineer, Visible Technologies </li></ul><ul><li>Principal Consultant, Drawn to Scale Consulting </li></ul>
  • 2. Topics <ul><li>Scalability and BI </li></ul><ul><li>Costs and Abilities </li></ul><ul><li>Search as BI </li></ul>
  • 3.  
  • 4.  
  • 5.  
  • 6. What Is BI?
  • 7.  
  • 8. What is “Real-Time” <ul><li>Understanding Latency </li></ul><ul><li>We aim for <5 secs. </li></ul>
  • 9.  
  • 10. Scalability in BI <ul><li>Scalbility matters now </li></ul><ul><li>Social Media: Catalyst </li></ul><ul><li>All data is important </li></ul><ul><li>Data doesn’t scale with business size any more </li></ul>
  • 11. Search as BI <ul><li>Katta = Distributed Search on Haddoop </li></ul><ul><li>Bobo = Faceted Lucene </li></ul>
  • 12.  
  • 13.  
  • 14.  
  • 15.  
  • 16.  
  • 17. Doing it Cheap <ul><li>100 TB, Structured and Unstructured </li></ul><ul><li>Oracle- $100,000,000 </li></ul><ul><li>“ NewSQL” - $4,000,000 </li></ul><ul><li>Hadoop + Katta - $250,000 </li></ul>
  • 18. Why We Need Hadoop <ul><li>Need to process high-latency data to get the “small stuff” fast </li></ul><ul><li>Robust Ecosystem </li></ul><ul><li>Need more than SQL. RDBMS not a Swiss-Army Knife </li></ul>
  • 19. Aggregation is Real-Time <ul><li>Distributed Search w/ Katta + Facets = Aggregation-Based BI </li></ul><ul><li>Sum, Count, Filter, Avg, Group </li></ul>
  • 20. Protips: Review <ul><li>Understand High vs. Low Latency data </li></ul><ul><li>Hadoop makes it cheap </li></ul><ul><li>Pre-aggregate w/ Hadoop, Explore w/ Katta + Faceted Search </li></ul>
  • 21. The Future <ul><li>Search/BI as a Platform: “Google my Data Warehouse” </li></ul><ul><li>Real-Time MR on HBase </li></ul>

×