How to Interview a Data Scientist

Daniel Tunkelang
Director of Data Science, LinkedIn
     Recruiting Solutions                     1
Drew Conway’s Venn Diagram




                             2
GOAL




       3
Specification for a Data Scientist



      implements algorithms
      analyzes data
      thinks product



                                       4
What about

      Culture
      Communication
      Curiosity?

      Hold that thought…
                                5
What can you learn from an interview?




                                        6
Interviewing is a last resort.




               Alternatives?

                                 7
Only hire people you’ve worked with.




                                       8
Hire interns. Convert to full-time. Profit!




                                              9
Try before you buy: short-term contracts.




                                            10
Alternatives are at best a partial solution.

§  Only hiring people you’ve worked with doesn’t scale.
   –  And traps you in a locally optimal monoculture.


§  Interns are great! But they are a significant investment.
   –  Managing interns well is a productivity gamble.
   –  Most interns have at least a year of school left.
   –  Not all interns will make your bar. You won’t always make theirs.


§  Try before you buy: nice in theory.
   –  Adverse selection bias when other offers are permanent roles.
   –  Creates bureaucracy.


                                                                          11
Can we at least make interviews natural?




                                           12
Spend a day working together.




                                13
Take-home assignment.




                        14
Review candidate’s previous work.




                                    15
High-fructose corn syrup is 100% natural.
§  Working sessions are difficult to set up.
   –  No more natural than a final exam.
   –  High variance, and very difficult to calibrate performance.


§  Take-home assignments are great for the employer.
   –  But they are a significant investment for the candidate.
   –  Adverse selection bias if other companies don’t require them.
   –  Creates incentive to cheat if significant part of hiring process.


§  Previous work is like natural experiments.
   –  Always good to review a candidate’s previous work.
   –  But not always possible to find work with high predictive value.



                                                                          16
So you gotta do interviews. But how?




                                       17
Three Principles

1.  Keep it real.

2.  No gotchas.

3.  Maybe = no.




                    18
Keeping It Real




                  19
Test basic coding with FizzBuzz questions.

        multiple of 3 -> Fizz
        multiple of 5 -> Buzz
        multiple of 15 -> FizzBuzz

   1, 2, Fizz, 4, Buzz, Fizz,
   7, 8, Fizz, Buzz, 11, Fizz,
   13, 14, FizzBuzz, 16, …
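
A minimal sketch of the kind of answer this screen looks for, following the three rules on the slide. The function names are illustrative, not from the talk.

```python
from typing import List


def fizzbuzz(n: int) -> str:
    """Return the FizzBuzz token for a single positive integer."""
    # Check 15 first: a multiple of 15 is also a multiple of 3 and of 5.
    if n % 15 == 0:
        return "FizzBuzz"
    if n % 3 == 0:
        return "Fizz"
    if n % 5 == 0:
        return "Buzz"
    return str(n)


def fizzbuzz_sequence(limit: int) -> List[str]:
    """Return the tokens for 1..limit inclusive."""
    return [fizzbuzz(i) for i in range(1, limit + 1)]


if __name__ == "__main__":
    # Reproduces the first 16 terms listed on the slide.
    print(", ".join(fizzbuzz_sequence(16)))
```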
                                        20
Whiteboards suck for coding.




      http://ericleads.com/2012/10/how-to-conduct-a-better-coding-interview/


                                                                               21
Don’t ask pointless algorithm questions.




             implement




                                           22
Use real-world algorithm questions.



        bigdatascientist


         Did you mean:
         big data scientist
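
One way a candidate might approach the "Did you mean" example above is dictionary-based word segmentation with dynamic programming. This is only a sketch under assumptions: the slide does not say which technique is expected, and the tiny vocabulary and the function names here are hypothetical.

```python
from typing import FrozenSet, List, Optional

# Hypothetical toy vocabulary; a real system would likely use query logs
# or a language model rather than a hand-written word list.
VOCAB: FrozenSet[str] = frozenset({"big", "data", "scientist", "science", "a", "at"})


def segment(query: str, vocab: FrozenSet[str] = VOCAB) -> Optional[List[str]]:
    """Split query into vocabulary words, preferring the fewest words.

    Returns None when no full segmentation exists.
    """
    n = len(query)
    # best[i] is the fewest-words segmentation of query[:i], or None.
    best: List[Optional[List[str]]] = [None] * (n + 1)
    best[0] = []
    for end in range(1, n + 1):
        for start in range(end):
            prefix = best[start]
            word = query[start:end]
            if prefix is not None and word in vocab:
                candidate = prefix + [word]
                current = best[end]
                if current is None or len(candidate) < len(current):
                    best[end] = candidate
    return best[n]


def did_you_mean(query: str) -> Optional[str]:
    """Suggest a spaced version of query, or None if there is nothing to suggest."""
    words = segment(query)
    if words is None or len(words) < 2:
        return None
    return " ".join(words)


if __name__ == "__main__":
    print(did_you_mean("bigdatascientist"))
```

A question like this leaves room for partial credit: brute force first, then the dynamic program, then a discussion of how to rank competing segmentations.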
                                       23
Ask candidates to design your products.




                                          24
Keeping it real is also a great sell.
                                Similar Profiles




                               People You May Know




                                                     25
But no gotchas.




                  26
Gotchas reduce the signal-to-noise ratio.

§  Avoid problems where success hinges on a single insight.
   –  Good interview problems offer lots of room for partial credit.
   –  Making a key insight often reflects experience, not intelligence.


§  Don’t test a candidate’s knowledge of a niche technique.
   –  Unless that niche technique is critical to job performance.
   –  And can’t be learned on the job as part of on-boarding.


§  Be a hard interviewer, but don’t be an asshole.
   –  An interview is not a stress-test to see where candidates break.
   –  Interviews communicate your values to the candidate.


                                                                          27
Maybe = no.




              28
Commit to binary interview outcomes.

§  Forced choice so interviewers don’t take easy way out.
   –  Just like having 4 choices instead of 5 on a rating scale.
   –  Encourages interviewers to take their role seriously.


§  Each team member is a critical filter.
   –  Two no’s or one strong no is a no.
   –  All weak yes’s is a no.


§  Short-circuit candidates early in the process.
   –  Resume and phone screening should be aggressive.
   –  Onsite interviews should have ~50% chance of leading to offers.
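
The filter rules above can be sketched as a small decision function. The four vote labels and the function name are hypothetical. Requiring at least one plain yes covers the slide's "all weak yes's" case; treating a panel with a single no and otherwise weak yes's as a no is an added assumption, as is rejecting when there is no feedback at all.

```python
from enum import Enum
from typing import Iterable


class Vote(Enum):
    # Hypothetical four-point scale with no neutral option.
    STRONG_NO = "strong no"
    NO = "no"
    WEAK_YES = "weak yes"
    YES = "yes"


def hire_decision(votes: Iterable[Vote]) -> bool:
    """Collapse interviewer votes into a binary hire / no-hire outcome."""
    ballots = list(votes)
    if not ballots:
        # Assumption: no feedback means no offer.
        return False
    strong_nos = sum(1 for v in ballots if v is Vote.STRONG_NO)
    nos = strong_nos + sum(1 for v in ballots if v is Vote.NO)
    # Two no's or one strong no is a no.
    if strong_nos >= 1 or nos >= 2:
        return False
    # All weak yes's is a no: require at least one plain yes.
    if not any(v is Vote.YES for v in ballots):
        return False
    return True


if __name__ == "__main__":
    print(hire_decision([Vote.YES, Vote.WEAK_YES, Vote.YES]))
    print(hire_decision([Vote.WEAK_YES, Vote.WEAK_YES, Vote.WEAK_YES]))
```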


                                                                        29
But what about

      Culture
      Communication
      Curiosity?

All are must-haves.
Every interview evaluates all three.
                                        30
Remember Your Goal




                     31
Three Principles

1.  Keep it real.
  –  Avoid whiteboard coding. Filter with FizzBuzz.
  –  Use real-world algorithm questions.
  –  Ask candidates to design your products.
2.  No gotchas.
  –  Gotchas reduce the signal-to-noise ratio.
3.  Maybe = no.
  –  Bad hires suck. Be conservative.
  –  Trust your team.
                                                  32
Thank you!




             33
ShapeBlue128 views

How to Interview a Data Scientist

  • 1. How to Interview a Data Scientist. Daniel Tunkelang, Director of Data Science, LinkedIn. Recruiting Solutions
  • 3. GOAL
  • 4. Specification for a Data Scientist: implements algorithms, analyzes data, thinks product
  • 5. What about Culture, Communication, Curiosity? Hold that thought…
  • 6. What can you learn from an interview?
  • 7. Interviewing is a last resort. Alternatives?
  • 8. Only hire people you’ve worked with.
  • 9. Hire interns. Convert to full-time. Profit!
  • 10. Try before you buy: short-term contracts.
  • 11. Alternatives are at best a partial solution.
      §  Only hiring people you’ve worked with doesn’t scale.
         –  And traps you in a locally optimal monoculture.
      §  Interns are great! But they are a significant investment.
         –  Managing interns well is a productivity gamble.
         –  Most interns have at least a year of school left.
         –  Not all interns will make your bar. You won’t always make theirs.
      §  Try before you buy: nice in theory.
         –  Adverse selection bias when other offers are permanent roles.
         –  Creates bureaucracy.
  • 12. Can we at least make interviews natural?
  • 13. Spend a day working together.
  • 16. High-fructose corn syrup is 100% natural.
      §  Working sessions are difficult to set up.
         –  No more natural than a final exam.
         –  High variance, and very difficult to calibrate performance.
      §  Take-home assignments are great for the employer.
         –  But they are a significant investment for the candidate.
         –  Adverse selection bias if other companies don’t require them.
         –  Creates incentive to cheat if significant part of hiring process.
      §  Previous work is like natural experiments.
         –  Always good to review a candidate’s previous work.
         –  But not always possible to find work with high predictive value.
  • 17. So you gotta do interviews. But how?
  • 18. Three Principles
      1.  Keep it real.
      2.  No gotchas.
      3.  Maybe = no.
  • 20. Test basic coding with FizzBuzz questions.
      multiple of 3 -> Fizz
      multiple of 5 -> Buzz
      multiple of 15 -> FizzBuzz
      1, 2, Fizz, 4, Buzz, Fizz, 7, 8, Fizz, Buzz, 11, Fizz, 13, 14, FizzBuzz, 16, …
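For reference, the FizzBuzz rules on this slide can be written in a few lines. This is a minimal Python sketch, not part of the original deck; the function names `fizzbuzz` and `fizzbuzz_sequence` are chosen here for illustration.

```python
from typing import List


def fizzbuzz(n: int) -> str:
    """Return the FizzBuzz token for a positive integer n."""
    # Check 15 first: a multiple of 15 is also a multiple of 3 and of 5.
    if n % 15 == 0:
        return "FizzBuzz"
    if n % 3 == 0:
        return "Fizz"
    if n % 5 == 0:
        return "Buzz"
    return str(n)


def fizzbuzz_sequence(limit: int) -> List[str]:
    """Return the tokens for 1..limit inclusive."""
    return [fizzbuzz(i) for i in range(1, limit + 1)]


if __name__ == "__main__":
    # Reproduces the sequence shown on the slide, up to 16.
    print(", ".join(fizzbuzz_sequence(16)))
```

The only real trap is the order of the checks: testing for 3 or 5 before 15 would never emit FizzBuzz, which is the kind of basic slip this filter is meant to catch.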
  • 21. Whiteboards suck for coding. http://ericleads.com/2012/10/how-to-conduct-a-better-coding-interview/
  • 22. Don’t ask pointless algorithm questions. implement
  • 23. Use real-world algorithms questions.
      bigdatascientist
      Did you mean: big data scientist
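The "Did you mean" example on this slide is a word-segmentation problem: split a query with missing spaces into known words. One possible answer a candidate might give is the dynamic-programming sketch below. The deck does not say how the real feature works; the tiny `VOCAB` set and the names `segment` and `did_you_mean` are hypothetical, and preferring the fewest words is an assumption made here to keep the example short.

```python
from typing import List, Optional, Set

# Hypothetical toy vocabulary; a real system would use a large dictionary
# or query log, typically with word frequencies.
VOCAB: Set[str] = {"big", "data", "scientist", "science", "a", "at", "as"}


def segment(query: str, vocab: Set[str] = VOCAB) -> Optional[List[str]]:
    """Split query into vocabulary words, preferring the fewest words.

    Returns None when no full segmentation exists.
    """
    n = len(query)
    # best[i] holds the shortest segmentation of query[:i], or None.
    best: List[Optional[List[str]]] = [None] * (n + 1)
    best[0] = []
    for end in range(1, n + 1):
        for start in range(end):
            prefix = best[start]
            word = query[start:end]
            if prefix is not None and word in vocab:
                candidate = prefix + [word]
                current = best[end]
                if current is None or len(candidate) < len(current):
                    best[end] = candidate
    return best[n]


def did_you_mean(query: str) -> Optional[str]:
    """Suggest a spaced version of query, or None if there is nothing to suggest."""
    words = segment(query)
    if words is None or len(words) < 2:
        return None
    return " ".join(words)


if __name__ == "__main__":
    print(did_you_mean("bigdatascientist"))
```

A question like this leaves room for partial credit: a candidate can start with a greedy split, notice where it fails, and move to dynamic programming or a frequency-weighted score.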
  • 24. Ask candidates to design your products.
  • 25. Keeping it real is also a great sell.
      Similar Profiles
      People You May Know
  • 27. Gotchas reduce the signal-to-noise ratio.
      §  Avoid problems where success hinges on a single insight.
         –  Good interview problems offer lots of room for partial credit.
         –  Making a key insight often reflects experience, not intelligence.
      §  Don’t test a candidate’s knowledge of a niche technique.
         –  Unless that niche technique is critical to job performance.
         –  And can’t be learned on the job as part of on-boarding.
      §  Be a hard interviewer, but don’t be an asshole.
         –  An interview is not a stress-test to see where candidates break.
         –  Interviews communicate your values to the candidate.
  • 29. Commit to binary interview outcomes.
      §  Forced choice so interviewers don’t take easy way out.
         –  Just like having 4 choices instead of 5 on a rating scale.
         –  Encourages interviewers to take their role seriously.
      §  Each team member is a critical filter.
         –  Two no’s or one strong no is a no.
         –  All weak yes’s is a no.
      §  Short-circuit candidates early in the process.
         –  Resume and phone screening should be aggressive.
         –  Onsite interviews should have ~50% chance of leading to offers.
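The voting rules on this slide can be made explicit as a small decision function. The sketch below is one reading of them, not the author's process: the four-level `Vote` scale and the name `should_hire` are assumptions, and requiring at least one strong yes is a conservative generalization of "all weak yes's is a no" for panels the slide does not cover.

```python
from enum import Enum
from typing import Sequence


class Vote(Enum):
    """Hypothetical four-level scale with no neutral option."""
    STRONG_NO = "strong no"
    WEAK_NO = "weak no"
    WEAK_YES = "weak yes"
    STRONG_YES = "strong yes"


def should_hire(votes: Sequence[Vote]) -> bool:
    """Combine interviewer votes into a single hire / no-hire decision."""
    if not votes:
        # Assumption: no feedback means no offer.
        return False
    # One strong no is a no.
    if Vote.STRONG_NO in votes:
        return False
    # Two no's of any strength is a no.
    no_count = sum(1 for v in votes if v in (Vote.STRONG_NO, Vote.WEAK_NO))
    if no_count >= 2:
        return False
    # "All weak yes's is a no", generalized here (assumption) to:
    # at least one interviewer must be a strong yes.
    if Vote.STRONG_YES not in votes:
        return False
    return True


if __name__ == "__main__":
    panel = [Vote.STRONG_YES, Vote.WEAK_YES, Vote.WEAK_YES]
    print(should_hire(panel))
```

Writing the rule down this way forces the team to decide the ambiguous cases, such as a single weak no alongside a strong yes, before the debrief rather than during it.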
  • 30. But what about Culture, Communication, Curiosity? All are must-haves. Every interview evaluates all three.
  • 32. Three Principles
      1.  Keep it real.
         –  Avoid whiteboard coding. Filter with FizzBuzz.
         –  Use real-world algorithms questions.
         –  Ask candidates to design your products.
      2.  No gotchas.
         –  Gotchas reduce the signal-to-noise ratio.
      3.  Maybe = no.
         –  Bad hires suck. Be conservative.
         –  Trust your team.