120412 oracle big data summit

583 views
541 views

Published on

Published in: Technology, Business
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
583
On SlideShare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
2
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

120412 oracle big data summit

  1. 1. De praktijk vanBig Data Friso van Vollenhoven fvanvollenhoven@xebia.comEn waarom de huidigetechnologie niet (altijd)voldoet
  2. 2. Big Data
  3. 3. Big Data
  4. 4. Big Data
  5. 5. Big Data Requirement: Full table scan, 200GB table
  6. 6. Big Data
  7. 7. Big Data Egypte, 27 januari 2011
  8. 8. Big Data Requirement: 40.000 updates per seconde, 24/7.
  9. 9. Databases = + +
  10. 10. Databases = + +network SAN storage
  11. 11. HDFS en MapReduce bottleneck SELECT SESSION, COUNT(*) FROM WEB_CLICKS GROUP BY SESSION; CLIENTstoragenetwork
  12. 12. HDFS en MapReduce SELECT SESSION, COUNT(*) FROM WEB_CLICKS GROUP BY SESSION; CLIENTstoragenetwork bottleneck
  13. 13. HDFS en MapReduce
  14. 14. HDFS en MapReduceSELECT * FROM WEB_CLICKS;SELECT * FROM SELECT * FROM WEB_CLICKS; WEB_CLICKS;
  15. 15. HDFS en MapReduce GROUP BY SESSION
  16. 16. HDFS en MapReduce COUNT(*) COUNT(*) COUNT(*)
  17. 17. HDFS en MapReduce MAP REDUCESELECT * FROM COUNT(*) WEB_CLICKS; SORT/SHUFFLE GROUP BY SESSION MAP REDUCE MAP REDUCESELECT * FROM SELECT * FROM COUNT(*) COUNT(*) WEB_CLICKS; WEB_CLICKS;
  18. 18. NoSQLindex A B C D E F G H I J K L M N O P Q R S T U V W X Y Z

×