SlideShare a Scribd company logo
1 of 38
Bridging the Gap Between Data Science
& Engineering:
Building High-Performing Teams
How do I hire a data scientist?
Software Engineer Data Engineer Data Scientist Applied Scientist Research Scientist
Continuum of Skills
Software Engineer Data Engineer Data Scientist Applied Scientist Research Scientist
Continuum of Skills
Math &
Stats
Computer
Science
Domain
Expertise
Machine
Learning
Software
Engineering Research
Unicorn
Data Science
Many companies try to ļ¬nd all of these skills in a
single person.
Which leads to job requirements like thisā€¦
ā€¢ MSc/PhD in Computer Science, Electrical Engineering, Math or Statistics
ā€¢ At least 5 years of experience in solving real-world practical problems using Machine Learning
ā€¢ At least 5 years of experience on mining and modeling large-scale data (hundreds of terabytes)
ā€¢ Extensive in-depth knowledge of Data Mining, Machine Learning, Algorithms
ā€¢ Knowledge of at least one high-level programming language (C++, Java)
ā€¢ Knowledge of at least one scripting language (Perl, Python, Ruby)
ā€¢ Knowledge of SQL and experience with large relational databases
ā€¢ Knowledge of at least one ML toolset (R, Weka, KNIME, Octave, Mahout, scikit-learn)
ā€¢ Strong ability to formalize and provide practical solutions to research problems
ā€¢ Strong communication skills and ability to work independently to get an idea from inception to
implementation.
ā€¢ Knowledge of the state of the art in at least one of Bayesian Optimization, Recommendation
Systems, Social Network Analysis, Information Retrieval
ā€¢ At least 5 years of experience with storing, sampling, querying large-scale data (hundreds of
terabytes) and experimentation frameworks
ā€¢ At least 5 years of experience with Hadoop, Spark, Mahout or Giraph
Data Science Unicorn
These people do exist, but they are often already
well-compensated, and only want to work on
interesting problems.
What can you do?
Build a team instead.
Broad-range generalist
Deepexpertise
Look for T-shaped people
Machine Learning,
Statistics, Domain Knowledge
Softw
are
Engineering
Business
Acum
en
Distributed
Com
puting
Com
m
unication
Look for T-shaped people
ā€¢ Compose teams of individuals who
have overlapping skill-sets and
deep expertise in one area
(machine learning, statistics,
engineering, business, etc.)
ā€¢ The overlap allows them to speak
the same language and work
collaboratively on solving problems
How do I structure my data science team within
my organization?
Data Science Team Structures
CentralizedEmbeddedHub & Spoke
Centralized
Data Scientists sit on a team that
acts as internal consultants, ļ¬elding
and answering questions from
multiple teams within the
organization, deļ¬ning tools for the
organization, and acting as highly
powered consultants.
Embedded
ā€¢ Data Scientists are almost wholly
embedded within one particular team
and focus on solving problems for that
team.
ā€¢ Teams are assigned to one particular
product or function within the company
and deļ¬ne and answer questions for
that product or function.
Hub & Spoke
ā€¢ The data science team sits
together physically and works
collaboratively to solve problems.
ā€¢ However, each data scientist (or
a combination of them) gets
deployed to work on problems
within the organization.
ā€¢ Tends to apply to companies
who have a lot of users.
Data Science Team Structure
CentralizedEmbeddedHub & Spoke
> >
How do I get my data scientists to work with
engineering?
Data Science
Python R
modeling & prototyping production
Software Engineering
Java/C++ RoR/Javascript
Data Science Software Engineering
Python R Java/C++ RoR/Javascript
modeling & prototyping production
Data scientists learn
to write prototypes
in production
languages
Engineers learn the
basics of data
science so they can
understand how
the models work
Goal is to have both teams speak
the same language and engender
trust through communication
Data Science Data Engineering
Common Core
Data Science
Curriculum
Data Engineering
Curriculum
Data Science Data Engineering
Projects
Data Science Engineering
Initial Planning
Data Science Engineering
Data Science Engineering
Production
ā€¢ Donā€™t look for unicorns, build collaborative
teams of T-shaped people
ā€¢ Pay attention to how your data science team is
structured within your organization
ā€¢ Get your data science and engineering teams to
speak the same language, allowing them to build
trust and work collaboratively
Summary
We believe an opportunity belongs ā€Ø
to anyone with aptitude and ambition.
29Galvanize 2015
NODES ON THE NETWORK
COLORADO (BOULDER, DENVER, FORT COLLINS)
SEATTLE, WA
SAN FRANCISCO, CA
AUSTIN, TX (OPENING Q1 2016)
Programs: Full Stack Immersive, Data Science Immersive,
Entrepreneurship
Programs: Full Stack Immersive, Data Science Immersive,
Entrepreneurship
Programs: Full Stack Immersive, Data Science Immersive, Data
Engineering Immersive, Masters of Science in Data Science,
Entrepreneurship
Programs: Full Stack Immersive, Data Science Immersive,
Entrepreneurship
[Explanation Text]
30Galvanize 2015
PLACEMENT STATS
FULL STACK IMMERSIVE DATA SCIENCE IMMERSIVE
$43K $77KPre-program Salary
Average Starting Salary
97% Placement
Rate*
*Galvanize is a founder member of NESTA (New Economy Skills Training Association), a trade organization founded to regulate the new ā€œbootcampā€ market.
This place rate is more rigorous than that requested by state licensure agencies. The placement rate is calculated 6 months after graduation.
$72K $114KPre-program Salary
94%Placement
Rate*
Average Starting Salary
31Galvanize 2015
5 PROGRAMS
ā€¢ Full Stack Immersive
ā€¢ Data Science Immersive
ā€¢ Data Engineering Immersive
Project over 500 Student Member Graduates in 2015
Currently over 1500 Members
ā€¢ Master of Science in Data Science ā€Ø
(University of New Haven)
ā€¢ Startup Membership
32Galvanize 2015
FULL STACK IMMERSIVE
ā€¢ 97% Placement Rate ā€Ø
within 6 months
ā€¢ $77K Average Starting Salary
ā€¢ 6 Month Program
33Galvanize 2015
FULL STACK IMMERSIVE
34Galvanize 2015
DATA SCIENCE IMMERSIVE
ā€¢ 94% Placement Rate ā€Ø
within 6 months
ā€¢ $114K Average Starting Salary
ā€¢ 3 Month Program
35Galvanize 2015
DATA SCIENCE IMMERSIVE
Week 1 - Exploratory Data Analysis and Software Engineering Best Practices
Week 2 - Statistical Inference, Bayesian Methods, A/B Testing, Multi-Armed Bandit
Week 3 - Regression, Regularization, Gradient Descent
Week 4 - Supervised Machine Learning: Classiļ¬cation, Validation, Ensemble Methods
Week 5 - Clustering, Topic Modeling (NMF, LDA), NLP
Week 6 - Network Analysis, Matrix Factorization, and Time Series
Week 7 - Hadoop, Hive, and MapReduce
Week 8 - Data Visualization with D3.js, Data Products, and Fraud Detection Case Study
Weeks 9-10 - Capstone Projects
Week 12 - Onsite Interviews
36Galvanize 2015
DATA SCIENCE IMMERSIVE
37Galvanize 2015
DATA ENGINEERING IMMERSIVE
ā€¢ Launched Oct. 2015
ā€¢ Built in partnership with Nvent and
Concurrent
ā€¢ 3 Month Program
THANK YOU
RYAN ORBAN | EVP OF PRODUCT & STRATEGY
ryan.orban@galvanize.com
@ryanorban
www.galvanize.com

More Related Content

What's hot

Lean Startup & Corporate Innovation Strategies - April 2015
Lean Startup & Corporate Innovation Strategies - April 2015Lean Startup & Corporate Innovation Strategies - April 2015
Lean Startup & Corporate Innovation Strategies - April 2015Kevin Shutta
Ā 
SlideShare Experts - 7 Experts Reveal Their Presentation Design Secrets
SlideShare Experts - 7 Experts Reveal Their Presentation Design SecretsSlideShare Experts - 7 Experts Reveal Their Presentation Design Secrets
SlideShare Experts - 7 Experts Reveal Their Presentation Design SecretsEugene Cheng
Ā 
The Key Recruitment Metric You're Not Tracking: Source of Influence
The Key Recruitment Metric You're Not Tracking: Source of InfluenceThe Key Recruitment Metric You're Not Tracking: Source of Influence
The Key Recruitment Metric You're Not Tracking: Source of InfluenceGlassdoor
Ā 
Strategic Relations Sales Deck | 2019 v3
Strategic Relations Sales Deck | 2019 v3Strategic Relations Sales Deck | 2019 v3
Strategic Relations Sales Deck | 2019 v3RICHTER
Ā 
AI & Startups
AI & StartupsAI & Startups
AI & StartupsTomaszTunguz
Ā 
ProdPad Sales Deck - Software for Highly Effective Product Managers
ProdPad Sales Deck - Software for Highly Effective Product Managers ProdPad Sales Deck - Software for Highly Effective Product Managers
ProdPad Sales Deck - Software for Highly Effective Product Managers ProdPad
Ā 
Visual Design with Data
Visual Design with DataVisual Design with Data
Visual Design with DataSeth Familian
Ā 
Talking to Humans at the Lean Startup Conference
Talking to Humans at the Lean Startup ConferenceTalking to Humans at the Lean Startup Conference
Talking to Humans at the Lean Startup ConferenceNew York University
Ā 
29 Revenue Model Options for Industrial enterprises (curated by @arnevbalen -...
29 Revenue Model Options for Industrial enterprises (curated by @arnevbalen -...29 Revenue Model Options for Industrial enterprises (curated by @arnevbalen -...
29 Revenue Model Options for Industrial enterprises (curated by @arnevbalen -...Board of Innovation
Ā 
The Rise of All-In-One SaaS
The Rise of All-In-One SaaSThe Rise of All-In-One SaaS
The Rise of All-In-One SaaSHiten Shah
Ā 
How To Create A Successful Investment Pitch Deck by Piktochart and HighSpark
How To Create A Successful Investment Pitch Deck by Piktochart and HighSparkHow To Create A Successful Investment Pitch Deck by Piktochart and HighSpark
How To Create A Successful Investment Pitch Deck by Piktochart and HighSparkPiktochart
Ā 
Apply Design Thinking (Design Thinking Action Lab - Stanford University)
Apply Design Thinking (Design Thinking Action Lab - Stanford University)Apply Design Thinking (Design Thinking Action Lab - Stanford University)
Apply Design Thinking (Design Thinking Action Lab - Stanford University)Esfandiar Khaleghi
Ā 
Continuous discovery - Caitlin Blackwell
Continuous discovery - Caitlin BlackwellContinuous discovery - Caitlin Blackwell
Continuous discovery - Caitlin BlackwellProduct Anonymous
Ā 
Creative Traction Methodology - For Early Stage Startups
Creative Traction Methodology - For Early Stage StartupsCreative Traction Methodology - For Early Stage Startups
Creative Traction Methodology - For Early Stage StartupsTommaso Di Bartolo
Ā 
WTF - Why the Future Is Up to Us - pptx version
WTF - Why the Future Is Up to Us - pptx versionWTF - Why the Future Is Up to Us - pptx version
WTF - Why the Future Is Up to Us - pptx versionTim O'Reilly
Ā 
Design Thinking 101 Workshop
Design Thinking 101 WorkshopDesign Thinking 101 Workshop
Design Thinking 101 WorkshopNatalie Hollier
Ā 
24 Time Management Hacks to Develop for Increased Productivity
24 Time Management Hacks to Develop for Increased Productivity24 Time Management Hacks to Develop for Increased Productivity
24 Time Management Hacks to Develop for Increased ProductivityIulian Olariu
Ā 

What's hot (20)

Business & Revenue Models - Emad Saif
Business & Revenue Models - Emad SaifBusiness & Revenue Models - Emad Saif
Business & Revenue Models - Emad Saif
Ā 
Workshop MVP
Workshop MVPWorkshop MVP
Workshop MVP
Ā 
Lean Startup & Corporate Innovation Strategies - April 2015
Lean Startup & Corporate Innovation Strategies - April 2015Lean Startup & Corporate Innovation Strategies - April 2015
Lean Startup & Corporate Innovation Strategies - April 2015
Ā 
SlideShare Experts - 7 Experts Reveal Their Presentation Design Secrets
SlideShare Experts - 7 Experts Reveal Their Presentation Design SecretsSlideShare Experts - 7 Experts Reveal Their Presentation Design Secrets
SlideShare Experts - 7 Experts Reveal Their Presentation Design Secrets
Ā 
The Key Recruitment Metric You're Not Tracking: Source of Influence
The Key Recruitment Metric You're Not Tracking: Source of InfluenceThe Key Recruitment Metric You're Not Tracking: Source of Influence
The Key Recruitment Metric You're Not Tracking: Source of Influence
Ā 
Strategic Relations Sales Deck | 2019 v3
Strategic Relations Sales Deck | 2019 v3Strategic Relations Sales Deck | 2019 v3
Strategic Relations Sales Deck | 2019 v3
Ā 
AI & Startups
AI & StartupsAI & Startups
AI & Startups
Ā 
ProdPad Sales Deck - Software for Highly Effective Product Managers
ProdPad Sales Deck - Software for Highly Effective Product Managers ProdPad Sales Deck - Software for Highly Effective Product Managers
ProdPad Sales Deck - Software for Highly Effective Product Managers
Ā 
Visual Design with Data
Visual Design with DataVisual Design with Data
Visual Design with Data
Ā 
Talking to Humans at the Lean Startup Conference
Talking to Humans at the Lean Startup ConferenceTalking to Humans at the Lean Startup Conference
Talking to Humans at the Lean Startup Conference
Ā 
29 Revenue Model Options for Industrial enterprises (curated by @arnevbalen -...
29 Revenue Model Options for Industrial enterprises (curated by @arnevbalen -...29 Revenue Model Options for Industrial enterprises (curated by @arnevbalen -...
29 Revenue Model Options for Industrial enterprises (curated by @arnevbalen -...
Ā 
The Rise of All-In-One SaaS
The Rise of All-In-One SaaSThe Rise of All-In-One SaaS
The Rise of All-In-One SaaS
Ā 
How To Create A Successful Investment Pitch Deck by Piktochart and HighSpark
How To Create A Successful Investment Pitch Deck by Piktochart and HighSparkHow To Create A Successful Investment Pitch Deck by Piktochart and HighSpark
How To Create A Successful Investment Pitch Deck by Piktochart and HighSpark
Ā 
Apply Design Thinking (Design Thinking Action Lab - Stanford University)
Apply Design Thinking (Design Thinking Action Lab - Stanford University)Apply Design Thinking (Design Thinking Action Lab - Stanford University)
Apply Design Thinking (Design Thinking Action Lab - Stanford University)
Ā 
Continuous discovery - Caitlin Blackwell
Continuous discovery - Caitlin BlackwellContinuous discovery - Caitlin Blackwell
Continuous discovery - Caitlin Blackwell
Ā 
Creative Traction Methodology - For Early Stage Startups
Creative Traction Methodology - For Early Stage StartupsCreative Traction Methodology - For Early Stage Startups
Creative Traction Methodology - For Early Stage Startups
Ā 
WTF - Why the Future Is Up to Us - pptx version
WTF - Why the Future Is Up to Us - pptx versionWTF - Why the Future Is Up to Us - pptx version
WTF - Why the Future Is Up to Us - pptx version
Ā 
Design Thinking 101 Workshop
Design Thinking 101 WorkshopDesign Thinking 101 Workshop
Design Thinking 101 Workshop
Ā 
24 Time Management Hacks to Develop for Increased Productivity
24 Time Management Hacks to Develop for Increased Productivity24 Time Management Hacks to Develop for Increased Productivity
24 Time Management Hacks to Develop for Increased Productivity
Ā 
Kickstarting Design Thinking
Kickstarting Design ThinkingKickstarting Design Thinking
Kickstarting Design Thinking
Ā 

Similar to Bridging the Gap Between Data Science & Engineer: Building High-Performance Teams

Data Science Highlights
Data Science Highlights Data Science Highlights
Data Science Highlights Joe Lamantia
Ā 
Which institute is best for data science?
Which institute is best for data science?Which institute is best for data science?
Which institute is best for data science?DIGITALSAI1
Ā 
Best Selenium certification course
Best Selenium certification courseBest Selenium certification course
Best Selenium certification courseKumarNaik21
Ā 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)SayyedYusufali
Ā 
Data science training institute in hyderabad
Data science training institute in hyderabadData science training institute in hyderabad
Data science training institute in hyderabadVamsiNihal
Ā 
Data science training in Hyderabad
Data science  training in HyderabadData science  training in Hyderabad
Data science training in Hyderabadsaitejavella
Ā 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training HyderabadNithinsunil1
Ā 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabadVamsiNihal
Ā 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)SayyedYusufali
Ā 
data science training and placement
data science training and placementdata science training and placement
data science training and placementSaiprasadVella
Ā 
online data science training
online data science trainingonline data science training
online data science trainingDIGITALSAI1
Ā 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabadVamsiNihal
Ā 
data science online training in hyderabad
data science online training in hyderabaddata science online training in hyderabad
data science online training in hyderabadVamsiNihal
Ā 
Best data science training in Hyderabad
Best data science training in HyderabadBest data science training in Hyderabad
Best data science training in HyderabadKumarNaik21
Ā 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training HyderabadNithinsunil1
Ā 
Best Selenium certification course
Best Selenium certification courseBest Selenium certification course
Best Selenium certification courseKumarNaik21
Ā 
Data science training in hyd ppt converted (1)
Data science training in hyd ppt converted (1)Data science training in hyd ppt converted (1)
Data science training in hyd ppt converted (1)SayyedYusufali
Ā 
Data science training in hyd pdf converted (1)
Data science training in hyd pdf converted (1)Data science training in hyd pdf converted (1)
Data science training in hyd pdf converted (1)SayyedYusufali
Ā 
Data science training in hydpdf converted (1)
Data science training in hydpdf  converted (1)Data science training in hydpdf  converted (1)
Data science training in hydpdf converted (1)SayyedYusufali
Ā 
Data Science Training and Placement
Data Science Training and PlacementData Science Training and Placement
Data Science Training and PlacementAkhilGGM
Ā 

Similar to Bridging the Gap Between Data Science & Engineer: Building High-Performance Teams (20)

Data Science Highlights
Data Science Highlights Data Science Highlights
Data Science Highlights
Ā 
Which institute is best for data science?
Which institute is best for data science?Which institute is best for data science?
Which institute is best for data science?
Ā 
Best Selenium certification course
Best Selenium certification courseBest Selenium certification course
Best Selenium certification course
Ā 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)
Ā 
Data science training institute in hyderabad
Data science training institute in hyderabadData science training institute in hyderabad
Data science training institute in hyderabad
Ā 
Data science training in Hyderabad
Data science  training in HyderabadData science  training in Hyderabad
Data science training in Hyderabad
Ā 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training Hyderabad
Ā 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabad
Ā 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)
Ā 
data science training and placement
data science training and placementdata science training and placement
data science training and placement
Ā 
online data science training
online data science trainingonline data science training
online data science training
Ā 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabad
Ā 
data science online training in hyderabad
data science online training in hyderabaddata science online training in hyderabad
data science online training in hyderabad
Ā 
Best data science training in Hyderabad
Best data science training in HyderabadBest data science training in Hyderabad
Best data science training in Hyderabad
Ā 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training Hyderabad
Ā 
Best Selenium certification course
Best Selenium certification courseBest Selenium certification course
Best Selenium certification course
Ā 
Data science training in hyd ppt converted (1)
Data science training in hyd ppt converted (1)Data science training in hyd ppt converted (1)
Data science training in hyd ppt converted (1)
Ā 
Data science training in hyd pdf converted (1)
Data science training in hyd pdf converted (1)Data science training in hyd pdf converted (1)
Data science training in hyd pdf converted (1)
Ā 
Data science training in hydpdf converted (1)
Data science training in hydpdf  converted (1)Data science training in hydpdf  converted (1)
Data science training in hydpdf converted (1)
Ā 
Data Science Training and Placement
Data Science Training and PlacementData Science Training and Placement
Data Science Training and Placement
Ā 

Recently uploaded

Call Girls in Sarai Kale Khan Delhi šŸ’Æ Call Us šŸ”9205541914 šŸ”( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi šŸ’Æ Call Us šŸ”9205541914 šŸ”( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi šŸ’Æ Call Us šŸ”9205541914 šŸ”( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi šŸ’Æ Call Us šŸ”9205541914 šŸ”( Delhi) Escorts S...Delhi Call girls
Ā 
ź§ā¤ Greater Noida Call Girls Delhi ā¤ź§‚ 9711199171 ā˜Žļø Hard And Sexy Vip Call
ź§ā¤ Greater Noida Call Girls Delhi ā¤ź§‚ 9711199171 ā˜Žļø Hard And Sexy Vip Callź§ā¤ Greater Noida Call Girls Delhi ā¤ź§‚ 9711199171 ā˜Žļø Hard And Sexy Vip Call
ź§ā¤ Greater Noida Call Girls Delhi ā¤ź§‚ 9711199171 ā˜Žļø Hard And Sexy Vip Callshivangimorya083
Ā 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
Ā 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
Ā 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
Ā 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
Ā 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
Ā 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
Ā 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
Ā 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
Ā 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
Ā 
Junnasandra Call Girls: šŸ“ 7737669865 šŸ“ High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: šŸ“ 7737669865 šŸ“ High Profile Model Escorts | Bangalore...Junnasandra Call Girls: šŸ“ 7737669865 šŸ“ High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: šŸ“ 7737669865 šŸ“ High Profile Model Escorts | Bangalore...amitlee9823
Ā 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45
Ā 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
Ā 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
Ā 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxolyaivanovalion
Ā 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxolyaivanovalion
Ā 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
Ā 

Recently uploaded (20)

Call Girls in Sarai Kale Khan Delhi šŸ’Æ Call Us šŸ”9205541914 šŸ”( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi šŸ’Æ Call Us šŸ”9205541914 šŸ”( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi šŸ’Æ Call Us šŸ”9205541914 šŸ”( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi šŸ’Æ Call Us šŸ”9205541914 šŸ”( Delhi) Escorts S...
Ā 
ź§ā¤ Greater Noida Call Girls Delhi ā¤ź§‚ 9711199171 ā˜Žļø Hard And Sexy Vip Call
ź§ā¤ Greater Noida Call Girls Delhi ā¤ź§‚ 9711199171 ā˜Žļø Hard And Sexy Vip Callź§ā¤ Greater Noida Call Girls Delhi ā¤ź§‚ 9711199171 ā˜Žļø Hard And Sexy Vip Call
ź§ā¤ Greater Noida Call Girls Delhi ā¤ź§‚ 9711199171 ā˜Žļø Hard And Sexy Vip Call
Ā 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
Ā 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Ā 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
Ā 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
Ā 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
Ā 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
Ā 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Ā 
ź§ā¤ Aerocity Call Girls Service Aerocity Delhi ā¤ź§‚ 9999965857 ā˜Žļø Hard And Sexy ...
ź§ā¤ Aerocity Call Girls Service Aerocity Delhi ā¤ź§‚ 9999965857 ā˜Žļø Hard And Sexy ...ź§ā¤ Aerocity Call Girls Service Aerocity Delhi ā¤ź§‚ 9999965857 ā˜Žļø Hard And Sexy ...
ź§ā¤ Aerocity Call Girls Service Aerocity Delhi ā¤ź§‚ 9999965857 ā˜Žļø Hard And Sexy ...
Ā 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Ā 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
Ā 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
Ā 
Junnasandra Call Girls: šŸ“ 7737669865 šŸ“ High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: šŸ“ 7737669865 šŸ“ High Profile Model Escorts | Bangalore...Junnasandra Call Girls: šŸ“ 7737669865 šŸ“ High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: šŸ“ 7737669865 šŸ“ High Profile Model Escorts | Bangalore...
Ā 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
Ā 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
Ā 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
Ā 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
Ā 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
Ā 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
Ā 

Bridging the Gap Between Data Science & Engineer: Building High-Performance Teams

  • 1. Bridging the Gap Between Data Science & Engineering: Building High-Performing Teams
  • 2. How do I hire a data scientist?
  • 3. Software Engineer Data Engineer Data Scientist Applied Scientist Research Scientist Continuum of Skills
  • 4. Software Engineer Data Engineer Data Scientist Applied Scientist Research Scientist Continuum of Skills
  • 6. Many companies try to ļ¬nd all of these skills in a single person.
  • 7. Which leads to job requirements like thisā€¦ ā€¢ MSc/PhD in Computer Science, Electrical Engineering, Math or Statistics ā€¢ At least 5 years of experience in solving real-world practical problems using Machine Learning ā€¢ At least 5 years of experience on mining and modeling large-scale data (hundreds of terabytes) ā€¢ Extensive in-depth knowledge of Data Mining, Machine Learning, Algorithms ā€¢ Knowledge of at least one high-level programming language (C++, Java) ā€¢ Knowledge of at least one scripting language (Perl, Python, Ruby) ā€¢ Knowledge of SQL and experience with large relational databases ā€¢ Knowledge of at least one ML toolset (R, Weka, KNIME, Octave, Mahout, scikit-learn) ā€¢ Strong ability to formalize and provide practical solutions to research problems ā€¢ Strong communication skills and ability to work independently to get an idea from inception to implementation. ā€¢ Knowledge of the state of the art in at least one of Bayesian Optimization, Recommendation Systems, Social Network Analysis, Information Retrieval ā€¢ At least 5 years of experience with storing, sampling, querying large-scale data (hundreds of terabytes) and experimentation frameworks ā€¢ At least 5 years of experience with Hadoop, Spark, Mahout or Giraph
  • 9. These people do exist, but they are often already well-compensated, and only want to work on interesting problems.
  • 10. What can you do? Build a team instead.
  • 11.
  • 13. Machine Learning, Statistics, Domain Knowledge Softw are Engineering Business Acum en Distributed Com puting Com m unication Look for T-shaped people
  • 14. ā€¢ Compose teams of individuals who have overlapping skill-sets and deep expertise in one area (machine learning, statistics, engineering, business, etc.) ā€¢ The overlap allows them to speak the same language and work collaboratively on solving problems
  • 15. How do I structure my data science team within my organization?
  • 16. Data Science Team Structures CentralizedEmbeddedHub & Spoke
  • 17. Centralized Data Scientists sit on a team that acts as internal consultants, ļ¬elding and answering questions from multiple teams within the organization, deļ¬ning tools for the organization, and acting as highly powered consultants.
  • 18. Embedded ā€¢ Data Scientists are almost wholly embedded within one particular team and focus on solving problems for that team. ā€¢ Teams are assigned to one particular product or function within the company and deļ¬ne and answer questions for that product or function.
  • 19. Hub & Spoke ā€¢ The data science team sits together physically and works collaboratively to solve problems. ā€¢ However, each data scientist (or a combination of them) gets deployed to work on problems within the organization. ā€¢ Tends to apply to companies who have a lot of users.
  • 20. Data Science Team Structure CentralizedEmbeddedHub & Spoke > >
  • 21. How do I get my data scientists to work with engineering?
  • 22. Data Science Python R modeling & prototyping production Software Engineering Java/C++ RoR/Javascript
  • 23. Data Science Software Engineering Python R Java/C++ RoR/Javascript modeling & prototyping production
  • 24. Data scientists learn to write prototypes in production languages Engineers learn the basics of data science so they can understand how the models work Goal is to have both teams speak the same language and engender trust through communication
  • 25. Data Science Data Engineering Common Core Data Science Curriculum Data Engineering Curriculum Data Science Data Engineering Projects
  • 26. Data Science Engineering Initial Planning Data Science Engineering Data Science Engineering Production
  • 27. ā€¢ Donā€™t look for unicorns, build collaborative teams of T-shaped people ā€¢ Pay attention to how your data science team is structured within your organization ā€¢ Get your data science and engineering teams to speak the same language, allowing them to build trust and work collaboratively Summary
  • 28. We believe an opportunity belongs ā€Ø to anyone with aptitude and ambition.
  • 29. 29Galvanize 2015 NODES ON THE NETWORK COLORADO (BOULDER, DENVER, FORT COLLINS) SEATTLE, WA SAN FRANCISCO, CA AUSTIN, TX (OPENING Q1 2016) Programs: Full Stack Immersive, Data Science Immersive, Entrepreneurship Programs: Full Stack Immersive, Data Science Immersive, Entrepreneurship Programs: Full Stack Immersive, Data Science Immersive, Data Engineering Immersive, Masters of Science in Data Science, Entrepreneurship Programs: Full Stack Immersive, Data Science Immersive, Entrepreneurship [Explanation Text]
  • 30. 30Galvanize 2015 PLACEMENT STATS FULL STACK IMMERSIVE DATA SCIENCE IMMERSIVE $43K $77KPre-program Salary Average Starting Salary 97% Placement Rate* *Galvanize is a founder member of NESTA (New Economy Skills Training Association), a trade organization founded to regulate the new ā€œbootcampā€ market. This place rate is more rigorous than that requested by state licensure agencies. The placement rate is calculated 6 months after graduation. $72K $114KPre-program Salary 94%Placement Rate* Average Starting Salary
  • 31. 31Galvanize 2015 5 PROGRAMS ā€¢ Full Stack Immersive ā€¢ Data Science Immersive ā€¢ Data Engineering Immersive Project over 500 Student Member Graduates in 2015 Currently over 1500 Members ā€¢ Master of Science in Data Science ā€Ø (University of New Haven) ā€¢ Startup Membership
  • 32. 32Galvanize 2015 FULL STACK IMMERSIVE ā€¢ 97% Placement Rate ā€Ø within 6 months ā€¢ $77K Average Starting Salary ā€¢ 6 Month Program
  • 34. 34Galvanize 2015 DATA SCIENCE IMMERSIVE ā€¢ 94% Placement Rate ā€Ø within 6 months ā€¢ $114K Average Starting Salary ā€¢ 3 Month Program
  • 35. 35Galvanize 2015 DATA SCIENCE IMMERSIVE Week 1 - Exploratory Data Analysis and Software Engineering Best Practices Week 2 - Statistical Inference, Bayesian Methods, A/B Testing, Multi-Armed Bandit Week 3 - Regression, Regularization, Gradient Descent Week 4 - Supervised Machine Learning: Classiļ¬cation, Validation, Ensemble Methods Week 5 - Clustering, Topic Modeling (NMF, LDA), NLP Week 6 - Network Analysis, Matrix Factorization, and Time Series Week 7 - Hadoop, Hive, and MapReduce Week 8 - Data Visualization with D3.js, Data Products, and Fraud Detection Case Study Weeks 9-10 - Capstone Projects Week 12 - Onsite Interviews
  • 37. 37Galvanize 2015 DATA ENGINEERING IMMERSIVE ā€¢ Launched Oct. 2015 ā€¢ Built in partnership with Nvent and Concurrent ā€¢ 3 Month Program
  • 38. THANK YOU RYAN ORBAN | EVP OF PRODUCT & STRATEGY ryan.orban@galvanize.com @ryanorban www.galvanize.com