BIG DATA
"datasets that grow solarge that they become  difficult to work with     using relationaldatabases and within atolerable e...
BIG DATA IS BIG
LIKE, REALLY BIG
FACEBOOK:       140 BILLION                PHOTOSHUMAN GENOME:   3 BILLION                BASE PAIRSGOOGLE:         50 BIL...
NOT REALLY
EUROPEANA:        20 MILLION                  (715K / COUNTRY)LIBRARY OF        1.9 MILLIONCONGRESS:CANADIANA:        1 MI...
BIG DATAIS COMPLICATED
1966
1976
≠
≠
NOT REALLY
ಠ_ಠ
SCALABILITY
●   ICA-AtoM (LAMP)●   BENCHMARK 3.5M RECORDS●   100% OPEN SOURCE SOFTWARE●   COMMODITY HARDWARE
CAN WE DO IT?
WRITE SPEED
READ SPEED
WRITE MEMORY
READ MEMORY
NOSQL vs. SQL       (a.k.a. ODM vs. ORM)●   4x - 10x FASTER●   50% - 90% LESS MEMORY
RELATIONAL       IF YOUR DATADATABASES        IS NOTSCALE WELL       HIERARCHICALSOLR             IF YOU HAVESCALES WELL  ...
THE CLOUD IS A LIE
“big data is less about size, and more about        freedom”  open source tools + distributed design = new opportunities
Access2011 van garderen-suhonos-part2
Access2011 van garderen-suhonos-part2
Access2011 van garderen-suhonos-part2
Access2011 van garderen-suhonos-part2
Access2011 van garderen-suhonos-part2
Access2011 van garderen-suhonos-part2
Access2011 van garderen-suhonos-part2
Upcoming SlideShare
Loading in …5
×

Access2011 van garderen-suhonos-part2

889 views

Published on

Published in: Technology, News & Politics
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
889
On SlideShare
0
From Embeds
0
Number of Embeds
2
Actions
Shares
0
Downloads
6
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

Access2011 van garderen-suhonos-part2

  1. 1. BIG DATA
  2. 2. "datasets that grow solarge that they become difficult to work with using relationaldatabases and within atolerable elapsed time"
  3. 3. BIG DATA IS BIG
  4. 4. LIKE, REALLY BIG
  5. 5. FACEBOOK: 140 BILLION PHOTOSHUMAN GENOME: 3 BILLION BASE PAIRSGOOGLE: 50 BILLION WEB PAGESWORLDCAT: 1.5 BILLION ITEM RECORDS
  6. 6. NOT REALLY
  7. 7. EUROPEANA: 20 MILLION (715K / COUNTRY)LIBRARY OF 1.9 MILLIONCONGRESS:CANADIANA: 1 MILLIONLIBRARY AND 3.5 MILLIONARCHIVES CANADA: (ARCHIVAL DESCRIPTIONS)
  8. 8. BIG DATAIS COMPLICATED
  9. 9. 1966
  10. 10. 1976
  11. 11.
  12. 12.
  13. 13. NOT REALLY
  14. 14. ಠ_ಠ
  15. 15. SCALABILITY
  16. 16. ● ICA-AtoM (LAMP)● BENCHMARK 3.5M RECORDS● 100% OPEN SOURCE SOFTWARE● COMMODITY HARDWARE
  17. 17. CAN WE DO IT?
  18. 18. WRITE SPEED
  19. 19. READ SPEED
  20. 20. WRITE MEMORY
  21. 21. READ MEMORY
  22. 22. NOSQL vs. SQL (a.k.a. ODM vs. ORM)● 4x - 10x FASTER● 50% - 90% LESS MEMORY
  23. 23. RELATIONAL IF YOUR DATADATABASES IS NOTSCALE WELL HIERARCHICALSOLR IF YOU HAVESCALES WELL INFINITE RAMBEWARE THE NOSQL IS ADOGMA OF SQL VIABLE OPTIONTHINK SIDEWAYS SCALE OUT →
  24. 24. THE CLOUD IS A LIE
  25. 25. “big data is less about size, and more about freedom” open source tools + distributed design = new opportunities

×