This document discusses data driven design with big data. It begins by stating that the presentation will cover data driven design with a big data slant but not technical debates. It then discusses how big data can provide storage, analytics, and access to data. However, it notes big data is still new and complex to manage and needs to integrate with existing systems. It provides an example of how data helped the Oakland A's win games. The rest of the document discusses best practices like not boiling the ocean, making data part of the design process, prioritizing data quality, and driving a data culture. It concludes by listing some open source big data resources.
3. WHY WE’RE HERE
WHAT IS DATA DRIVEN DESIGN”?
HOW DOES THIS WORK WITH BIG
DATA?
4. A LITTLE ON THIS PRESENTATION
This is a HUGE topic.
- Today we DO cover: Data Driven Design with a Big Data
slant.
- Today we DON’T cover: Hadoop (or any part of) vs. NoSQL
(or any part of), Python vs. Scala, or other raging hot
debates of technology.
5. BIG DATA AND PRODUCT
We all know big data can provide:
- Storage
- Analytic Wizardry
- Access to all of your data with less effort
- “Absolute mastery over the Universe, your
Marriage and kids!!”
- - Fast talking Big Incites Sales guy, pictured at right.
What does success look like?
10. MONEYBALL EXAMPLE – “GET ON BASE”
Product?
Goal?
Feature Enhancement?
Release Cycles?
Data Available, Clean and Understood?
Was the culture Data Driven?
Could they get the result from the data they had?
What needed to change?
Was data the only reason they won?
12. THE STUFF THE SALES GUYS DON’T TALK ABOUT
• Big Data still “new” (especially in Canada)
• Big Data can be tricky to manage
• Big Data needs to INTEGRATE with YOUR ecosystem
• Big Data - NOT the magic bullet to understanding your
data
• There’s A LOT to know
• Technology may not fix your problems
• If done wrong, it can be expensive
13.
14. HAVING SAID ALL THAT… IT’S SOOOO WORTH IT
When you do it right:
• Billions of rows can be chewed AND understood
• You can prove yourself right, then prove yourself wrong
• You can integrate speed and volume
• Your kids: won’t be alone with the “New Math”
15. NOW, THE HOW
• Don’t boil the ocean
• Make data part of your design process
• Give and Take, but DON’T BE EVIL!
• Put data quality first
• Be conscious of your Operations, existing IT and tertiary Processes
• Drive a data culture
• Know your limits, play within them, then blow them away
24. SOME TECHY RESOURCES…
Almost all current Big Data stuffs are open source and free (to start with,
anyway). Examples:
• Hadoop: Cloudera and HortonWorks both offer QuickStart VM’s
• NoSQL:
• MongoDB – Training and DB are free to use
• Cassandra – about near the same
• Programming: Python: Coursera, CodeCademy
• Spark: Open source to try, works with Hadoop and Cassandra
25. Thanks for coming out,
man!
@bigincites
@insightsandstuf (one ‘f’ – clever!)
LinkedIn – Steven Ingram
singram@bigincites.com
solutions@bigincites.com