#dido12 presentation

Presentation at #DiDo, 2013-04-04.

Published in: Technology


  1. @fzk, fvanvollenhoven@xebia.com
  2. Big Data?
  3. Big Data?
  4. Big Data?
  5. Big Data?
  6. Hadoop?
  7. Hadoop?
  8. Hadoop?
  9. Hadoop?
  10. Doing Big Data
      Collecting / obtaining data:
      • Forget about retention policy
      • Storage is cheap (it really is!)
      • Keep all of it online (or at least almost all)
      • Make it scalable (no manual processes)
  11. Doing Big Data
      Clean:
      • Data is always a mess
      • Don’t treat it as an exception
      • Make it your problem
  12. Doing Big Data
      Explore:
      • Think about products, not insights per se
      • 90% people / 10% tools
      • Support ad hoc querying
      • Never assume
      • This is where the fancy charts come in
      • Never assume
  13. Doing Big Data
      Model:
      • What do I want to predict
      • Keep it simple (let the data work)
      • Think about scale
  14. Doing Big Data
  15. Doing Big Data
      Build:
      • Build functionality
      • Think about scale, once more
      • A dashboard is not a data driven solution
  16. Doing Big Data
  17. Q&A: @fzk, fvanvollenhoven@xebia.com
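
The advice on the "Clean" slide (data is always a mess; don't treat it as an exception) could be sketched in Python roughly as follows. This is an illustration only, not code from the talk: the record layout (`user_id,timestamp,amount`), the function names `parse_record` and `clean`, and the reject reasons are all assumptions made for the example. The idea it tries to show is that a malformed row is an expected outcome that gets counted and kept, rather than an error that stops the job.

```python
from collections import Counter
from datetime import datetime


# Hypothetical record layout, assumed for illustration: "user_id,timestamp,amount"
def parse_record(line):
    """Return (record, None) for a usable line, or (None, reason) for a messy one."""
    parts = [p.strip() for p in line.split(",")]
    if len(parts) != 3:
        return None, "wrong_field_count"
    user_id, raw_ts, raw_amount = parts
    if not user_id:
        return None, "missing_user_id"
    try:
        ts = datetime.strptime(raw_ts, "%Y-%m-%d")
    except ValueError:
        return None, "bad_timestamp"
    try:
        amount = float(raw_amount)
    except ValueError:
        return None, "bad_amount"
    return {"user_id": user_id, "ts": ts, "amount": amount}, None


def clean(lines):
    """Split input into clean records, rejected raw lines, and a count per reject reason."""
    records, rejects, reasons = [], [], Counter()
    for line in lines:
        record, reason = parse_record(line)
        if record is None:
            rejects.append(line)  # keep the raw line so it can be re-processed later
            reasons[reason] += 1
        else:
            records.append(record)
    return records, rejects, reasons


# Made-up sample input: one clean row and four differently broken ones.
sample = [
    "u1,2013-04-04,12.50",
    "u2,04/04/2013,3.00",
    ",2013-04-04,1.00",
    "u3,2013-04-04",
    "u4,2013-04-05,abc",
]
records, rejects, reasons = clean(sample)
```

Keeping the rejected lines alongside a per-reason count makes the mess itself something that can be explored and queried, which fits the later "Never assume" bullets; whether rejects are stored, repaired, or dropped is a design choice the slides do not specify.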
