0
De praktijk vanBig Data                            Friso van Vollenhoven                            fvanvollenhoven@xebia....
Big Data
Big Data
Big Data
Big Data                   Requirement:           Full table scan, 200GB table
Big Data
Big Data           Egypte, 27 januari 2011
Big Data                    Requirement:           40.000 updates per seconde, 24/7.
Databases            =   +   +
Databases              =        +    +network                   SAN                  storage
HDFS en MapReduce                 bottleneck                 SELECT SESSION, COUNT(*) FROM                 WEB_CLICKS GROU...
HDFS en MapReduce                 SELECT SESSION, COUNT(*) FROM                 WEB_CLICKS GROUP BY SESSION;              ...
HDFS en MapReduce
HDFS en MapReduceSELECT * FROM WEB_CLICKS;SELECT * FROM           SELECT * FROM WEB_CLICKS;             WEB_CLICKS;
HDFS en MapReduce                    GROUP BY SESSION
HDFS en MapReduce               COUNT(*)               COUNT(*)   COUNT(*)
HDFS en MapReduce        MAP     REDUCESELECT * FROM                    COUNT(*) WEB_CLICKS;                              ...
NoSQLindex     A B C D E F G H I   J K L M N O P Q R S T U V W X Y Z
Upcoming SlideShare
Loading in...5
×

Oracle Big Data Summit

633

Published on

Presentatie @fzk tijdens Oracle #Bigdata Summit.
(12 april 2012)

Meer informatie: bigdataseminar.nl

Published in: Technology, News & Politics
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
633
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
9
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Transcript of "Oracle Big Data Summit"

  1. 1. De praktijk vanBig Data Friso van Vollenhoven fvanvollenhoven@xebia.comEn waarom de huidigetechnologie niet (altijd)voldoet
  2. 2. Big Data
  3. 3. Big Data
  4. 4. Big Data
  5. 5. Big Data Requirement: Full table scan, 200GB table
  6. 6. Big Data
  7. 7. Big Data Egypte, 27 januari 2011
  8. 8. Big Data Requirement: 40.000 updates per seconde, 24/7.
  9. 9. Databases = + +
  10. 10. Databases = + +network SAN storage
  11. 11. HDFS en MapReduce bottleneck SELECT SESSION, COUNT(*) FROM WEB_CLICKS GROUP BY SESSION; CLIENTstoragenetwork
  12. 12. HDFS en MapReduce SELECT SESSION, COUNT(*) FROM WEB_CLICKS GROUP BY SESSION; CLIENTstoragenetwork bottleneck
  13. 13. HDFS en MapReduce
  14. 14. HDFS en MapReduceSELECT * FROM WEB_CLICKS;SELECT * FROM SELECT * FROM WEB_CLICKS; WEB_CLICKS;
  15. 15. HDFS en MapReduce GROUP BY SESSION
  16. 16. HDFS en MapReduce COUNT(*) COUNT(*) COUNT(*)
  17. 17. HDFS en MapReduce MAP REDUCESELECT * FROM COUNT(*) WEB_CLICKS; SORT/SHUFFLE GROUP BY SESSION MAP REDUCE MAP REDUCESELECT * FROM SELECT * FROM COUNT(*) COUNT(*) WEB_CLICKS; WEB_CLICKS;
  18. 18. NoSQLindex A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×