Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
Data Science: A Mindset for Productivity
Daniel Tunkelang
@dtunkelang
tl;dr
The most important part of data science is picking
the right problem and figuring out how to frame it.
We’re all technologists, right?
But nobody knows everything.*
Class HashMap<K,V>
java.lang.Object
java.util.AbstractMap<K,V>
java.util.HashMap<K,V>
Type P...
Math and computer science matter…
But you have to solve the right problem.
Stay friends with your exes.
explain
express
experiment
Data science is a mindset.
Explain
Iterate using explainable models.
Express
Model your utility and inputs.
Experiment
Opt...
Explain
With apologies to the little prince.
Deep learning is the new black.
But accuracy isn’t everything.
The importance of being explainable.
• Algorithms can protect you from overfitting, but they
can’t protect you from the bi...
Linear models? Decision trees?
• Linear regression and decision trees favor explainability over accuracy,
compared to more...
Express
Machine learning for dummies.
• Define objective function.
• Collect training data.
• Build models.
• Profit!
You only improve what you measure.
Clicks?
Actions?
Outcomes?
Sometimes accuracy is complicated.
What’s your error function?
Consider stratified sampling.
Experiment
How to find your prince.
You have to kiss a lot of frogs to find one prince. So
how can you find your prince faster?
By fi...
Think like an economist.
Yesterday
Experiments are expensive,
choose hypotheses wisely.
Today
Experiments are cheap,
do as...
But don’t forget you’re a scientist.
Optimize for the speed of learning.
Test one variable at a time.
• Autocomplete
• Entity Tagging
• Vertical Intent
• # of Suggestions
• Suggestion Order
• Lan...
tl;dr
The most important part of data science is picking
the right problem and figuring out how to frame it.
Daniel Tunkelang
dtunkelang@gmail.com
https://linkedin.com/in/dtunkelang
@dtunkelang
Upcoming SlideShare
Loading in …5
×

Data Science: A Mindset for Productivity

7,400 views

Published on

Data Science: A Mindset for Productivity

Keynote at 2015 Ronin Labs West Coast CTO Summit
https://www.eventjoy.com/e/west-coast-cto-summit-2015

Abstract

Data science isn't just about using a collection of technologies and algorithms. Data science requires a mindset that solves problems at a higher level of abstraction. How do we model utility when we think about optimization? How do we decide which hypotheses to test? How do we allocate our scarce resources to make progress?

There are no silver bullets. But I'll share what I've learned from a variety of contexts over the course of my work at Endeca, Google, and LinkedIn; and I hope you'll leave this talk with some practical wisdom you can apply to your next data science project.

Published in: Data & Analytics
  • Hello! Get Your Professional Job-Winning Resume Here - Check our website! https://vk.cc/818RFv
       Reply 
    Are you sure you want to  Yes  No
    Your message goes here
  • hi
       Reply 
    Are you sure you want to  Yes  No
    Your message goes here

Data Science: A Mindset for Productivity

  1. Data Science: A Mindset for Productivity Daniel Tunkelang @dtunkelang
  2. tl;dr The most important part of data science is picking the right problem and figuring out how to frame it.
  3. We’re all technologists, right?
  4. But nobody knows everything.* Class HashMap<K,V> java.lang.Object java.util.AbstractMap<K,V> java.util.HashMap<K,V> Type Parameters: K - the type of keys maintained by this map V - the type of mapped values All Implemented Interfaces: Serializable, Cloneable, Map<K,V> *Except Jeff Dean.
  5. Math and computer science matter…
  6. But you have to solve the right problem.
  7. Stay friends with your exes. explain express experiment
  8. Data science is a mindset. Explain Iterate using explainable models. Express Model your utility and inputs. Experiment Optimize for speed of learning.
  9. Explain
  10. With apologies to the little prince.
  11. Deep learning is the new black.
  12. But accuracy isn’t everything.
  13. The importance of being explainable. • Algorithms can protect you from overfitting, but they can’t protect you from the biases you introduce. • Introspection into your models and features makes it easier for you and others to debug them. • Especially if you don’t completely trust your objective function or representativeness of your training data.
  14. Linear models? Decision trees? • Linear regression and decision trees favor explainability over accuracy, compared to more sophisticated models. • But size matters. If you have too many features or too deep a decision tree, you lose explainability. • You can always upgrade to a more sophisticated model when you trust your objective function and training data. • Build a machine learning model is an iterative process. Optimize for the speed of your own learning.
  15. Express
  16. Machine learning for dummies. • Define objective function. • Collect training data. • Build models. • Profit!
  17. You only improve what you measure. Clicks? Actions? Outcomes?
  18. Sometimes accuracy is complicated.
  19. What’s your error function?
  20. Consider stratified sampling.
  21. Experiment
  22. How to find your prince. You have to kiss a lot of frogs to find one prince. So how can you find your prince faster? By finding more frogs and kissing them faster and faster. -- Mike Moran
  23. Think like an economist. Yesterday Experiments are expensive, choose hypotheses wisely. Today Experiments are cheap, do as many as you can!
  24. But don’t forget you’re a scientist.
  25. Optimize for the speed of learning.
  26. Test one variable at a time. • Autocomplete • Entity Tagging • Vertical Intent • # of Suggestions • Suggestion Order • Language • Query Construction • Ranking Model
  27. tl;dr The most important part of data science is picking the right problem and figuring out how to frame it.
  28. Daniel Tunkelang dtunkelang@gmail.com https://linkedin.com/in/dtunkelang @dtunkelang

×