Your SlideShare is downloading. ×
0
The Reality of Big Data
#beltech2014
#1 – What problem are you
trying to solve?
Most of SME’s problems aren’t
Big Data, it’s just data.
Without a question you are
wasting your time.
#2 – Data will need cleaning
Roughly 80% of your data project
will be getting the data into shape
before processing.
Btiany Spears
#3 – Hadoop, on it’s own, will
NOT give you the answers.
#3 – Hadoop, on it’s own, will
NOT give you the answers.
(The Big Data version of “putting it in
the cloud”)
If anyone says, “will Hadoop just
give us the answers” or “put it in the
cloud”, do this….
Spit on one, or both, of their feet
and bite your thumb while shouting:
“The fig of Spain!”.
#4 – Do you actually need
Hadoop?
A well crafted algorithm may give
you more benefit.
It’s about knowing the right
questions.
And refining and refining and
refining…..
The first run won't work at all
The second only makes you wonder
The third will have you on your
knees.....
#5 – Data changes
…especially when you don’t own
it.
If you feel your data has value
then retain it.
If your data passes over the “creepy
line” then definitely retain it.
#6 – Skills are in short supply
Work with what you have.
Play with data, it’s the best way
to learn.
Collaborate with others to fill the
skills gaps.
Thank you
http://about.me/jasebell
@hadooping
Upcoming SlideShare
Loading in...5
×

The Reality of Bigdata - #Beltech2014

433

Published on

The Reality of BigData - slides for the keynote at Beltech2014 (4th April 2014) in Belfast.

0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
433
On Slideshare
0
From Embeds
0
Number of Embeds
6
Actions
Shares
0
Downloads
1
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Transcript of "The Reality of Bigdata - #Beltech2014"

  1. 1. The Reality of Big Data
  2. 2. #beltech2014
  3. 3. #1 – What problem are you trying to solve?
  4. 4. Most of SME’s problems aren’t Big Data, it’s just data.
  5. 5. Without a question you are wasting your time.
  6. 6. #2 – Data will need cleaning
  7. 7. Roughly 80% of your data project will be getting the data into shape before processing.
  8. 8. Btiany Spears
  9. 9. #3 – Hadoop, on it’s own, will NOT give you the answers.
  10. 10. #3 – Hadoop, on it’s own, will NOT give you the answers. (The Big Data version of “putting it in the cloud”)
  11. 11. If anyone says, “will Hadoop just give us the answers” or “put it in the cloud”, do this….
  12. 12. Spit on one, or both, of their feet and bite your thumb while shouting: “The fig of Spain!”.
  13. 13. #4 – Do you actually need Hadoop?
  14. 14. A well crafted algorithm may give you more benefit.
  15. 15. It’s about knowing the right questions.
  16. 16. And refining and refining and refining…..
  17. 17. The first run won't work at all The second only makes you wonder The third will have you on your knees.....
  18. 18. #5 – Data changes
  19. 19. …especially when you don’t own it.
  20. 20. If you feel your data has value then retain it.
  21. 21. If your data passes over the “creepy line” then definitely retain it.
  22. 22. #6 – Skills are in short supply
  23. 23. Work with what you have.
  24. 24. Play with data, it’s the best way to learn.
  25. 25. Collaborate with others to fill the skills gaps.
  26. 26. Thank you http://about.me/jasebell @hadooping
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×