Your SlideShare is downloading. ×

Not about the Big in Big Data

940

Published on

Published in: Technology, Business
0 Comments
2 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
940
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
0
Comments
0
Likes
2
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide

Transcript

  • 1. It’s Not About the “Big” in Big Data
  • 2. $26.5 BILLION Nick Leeson Jerome Kerviel Kweku Adoboli Bruno Iksil Barings Bank Societe Generale UBS JPMC $1.4 billion $6 billion $1.4 billion $17.5 Billion Nasa $17.6BRogue Traders $26.5B
  • 3. Hadoop in 2009
  • 4. Hadoop 2012
  • 5. Hype Cycle We are here
  • 6. Crossing the Chasm We are here
  • 7. Hadoop - Enabler
  • 8. Database Harddrive Unstructured ( 61.7% growths ) time to find one record = logb N * 10ms log100(100,000,000) * 10ms = 40ms time to read record = 10ms 10,000,000 * 50ms = 5.8 days
  • 9. Hadoop Harddrive throughput = 10MB/s time to transfer record = 10ms 10,000,000 * 10ms = 1.5 days random reads = (5.8 days)
  • 10. Laws of Physics Random Sequential Values/Sec. 316 Disk 53,200,000 1,924 SSD 42,200,000 36,700,000Memory 358,200,000 1 10 100 1,000 10,000 100,000 1,000,000 10,000,000 100,000,000 1,000,000,000 Adam Jacobs The Pathologies of Big Data
  • 11. People costHardware Cost
  • 12. Time to insightTime for decision
  • 13. Volume Data Size Data Complexity Sp ang es So ta Ch ee e urc Da do etyVe f ri loc Va y it
  • 14. Impossible: 360 View
  • 15. Game Changer Slow Static Barrier Business ETL Data Warehouse Intelligence Fast Dynamic View Raw Load Hadoop Data Pipeline
  • 16. NO - SQL RDBMS Standard SQL Structured Data Response in sec. No SQL Unstructured Data Batch Hadoop
  • 17. Common Applications Asset Management Analytics Security Analytics Product Cohort Analytics Advanced Web Analytics Structured + Many Unstructured Decision Makers Data Sources Data
  • 18. follow us: @datameer @stefanGroschupf

×