The Ambiguity of Data Science Team Roles and the Need for a Data Science Workforce Framework
Authors: Jeffrey S. Saltz, Nancy W. Grady
Presented by: K.K. Tripathi
(Course: CS795, Intro to Data Science)
October 18th, 2018
Introduction
Aim
• Enable organizations to staff their data science teams more accurately with the desired skill sets.
• Provide job titles and job descriptions that more clearly identify the tasks, knowledge, skills, and abilities required, to the benefit of the data science community.
• Remove the overloading of the term “data scientist”.
Objective
• To address this challenge, the paper frames and provides data science workforce definitions with examples.
Background
Issue
• Overgeneralized use of the term “data science”.
Problems
• It is difficult to ascertain which skills are needed to perform the specific tasks required to build and deploy big data analytics (BDA) systems.
• The lack of a shared vocabulary creates many issues (e.g., identifying the appropriate person to hire for a specific role within a data science team).
• There is no agreed-upon process model for data science.
• Skill sets overlap (software development life cycles).
Role-based model by NICE (US DoD CWF)
Given lists of task, knowledge, skill, and ability descriptions, a workforce framework maps them to work roles.
Domain benefits:
Employers
• Track staff skills
• Training
• Qualifications
• Improve position descriptions
• Develop career paths
• Analyze proficiency
Educators
• Develop curriculum and conduct training for programs
• Courses
• Seminars for specific roles
Technology providers
• Identify work roles
• Tasks
• Knowledge
• Skills
• Abilities associated with products
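The mapping a workforce framework performs, from tasks, knowledge, skills, and abilities to work roles, can be pictured as a small data structure. The following is a minimal sketch only: the `WorkRole` class, the `roles_covering` helper, and the sample entries are hypothetical and are not taken from NICE or from the paper.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class WorkRole:
    """A work role described by its tasks, knowledge, skills, and abilities."""
    name: str
    tasks: set[str] = field(default_factory=set)
    knowledge: set[str] = field(default_factory=set)
    skills: set[str] = field(default_factory=set)
    abilities: set[str] = field(default_factory=set)

    def ksa(self) -> set[str]:
        # Union of knowledge, skills, and abilities for this role.
        return self.knowledge | self.skills | self.abilities


def roles_covering(requirement: str, roles: list[WorkRole]) -> list[str]:
    """Return the names of roles whose tasks or KSAs include the requirement."""
    return [r.name for r in roles if requirement in r.tasks or requirement in r.ksa()]


# Hypothetical entries, for illustration only.
roles = [
    WorkRole("Data Engineer", tasks={"build data pipelines"},
             skills={"programming", "data cleaning"}),
    WorkRole("Data Analyst", tasks={"produce reports"},
             skills={"visualization", "data cleaning"}),
]

print(roles_covering("data cleaning", roles))
```

In this toy data, "data cleaning" appears under two roles, which is the kind of overlap between skill sets that makes role vocabulary ambiguous.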
Methodology (case studies)
Goal: to explore the commonality and diversity of the vocabulary used to describe roles within data science teams.
Qualitative case studies based on selected organizations:
• Standards organizations: NIST, EDISON
• Industry organizations: SAIC, Springboard
• Advisory company: Gartner
NIST
Developed a big data reference architecture (RA) that categorizes the components of big data systems.
The RA consists of 5 functional components, plus 2 fabrics (Security and Privacy, Management), and identifies their respective roles.
• System Orchestrator: integrates the data application
• Data Provider: introduces new data into the big data system (BDS)
• Big Data Application Provider
• Big Data Framework Provider
• Data Consumer
• Security and Privacy: interacts with the System Orchestrator
• Management: big data life cycle, e.g., package, software, and backup management
EDISON
A European Union-funded project to build the data science profession.
The EDSF (EDISON Data Science Framework) comprises several documents, including the DS professional profiles and the Model Curriculum.
• Data Scientist: merges, manages, and interprets large data sets
• Data Science Researcher: applies the scientific discovery research process, including hypothesis testing
• Data Science Architect: creates relevant data models and process workflows
• Data Science Programmer: designs, develops, and codes large data (science) analytics applications
• Data/Business Analyst: extracts information about system, service, or organization performance
SAIC
A systems integrator that works primarily for the U.S. federal government, including civilian, defense, and intelligence customers.
- Developed Data Science Edge (an internal process model)
- Extends the CRISP-DM process to align with big data
• Information Architect: develops data models for optimal performance in databases
• Data Scientist: works in cross-functional teams at all stages of the analysis life cycle; follows a scientific approach to generate value from data
• Metrics and Data: develops, inspects, mines, transforms, and models data to improve productivity
• Knowledge and Collaboration Engineer: designs and implements tools
• Big Data Engineer: works with the full open-source Hadoop stack, from cluster management to repository
Springboard
An online data science education startup. Defines the following roles:
• Data Engineer: typically knows a variety of programming languages and focuses on coding and cleaning up data sets; takes the predictive model from the data scientist and implements it in code
• Data Scientist: bridges the gap between programming and the implementation of data science, the theory of data science, and the business implications of data
• Data Analyst: provides visualizations and reports, and explains insights
• Data Architect: focuses on structuring the technology that manages the data models
Gartner
A research and advisory consulting firm that advises upper-level decision makers.
Suggested roles:
• Data Scientists: extract various types of knowledge from data; cover the end-to-end process
• Data Engineers: make the data accessible and available to data scientists
• Business Experts: business domain experts
• Source System Experts: know the data at the business application level
• Software Engineers: handle custom coding requirements
• Quant Geeks: a “nice-to-have” in certain situations, but a “must-have” in rare ones
• Unicorns: well-versed data scientists
Discussion (integrated view of roles used)
Search phrases used on Dice.com
Data Scientist vs. Data Engineer
Most frequent key phrases used in job descriptions:
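One way such key-phrase frequencies could be tallied from a set of job descriptions is sketched below. This is an illustrative assumption about the counting step, not the authors' method: the `phrase_counts` function, the sample postings, and the phrase list are all made up and are not data from the paper or from Dice.com.

```python
from collections import Counter


def phrase_counts(descriptions, phrases):
    """Count how many descriptions mention each phrase (case-insensitive)."""
    counts = Counter()
    for text in descriptions:
        lowered = text.lower()
        for phrase in phrases:
            if phrase.lower() in lowered:
                counts[phrase] += 1
    return counts


# Hypothetical job descriptions, for illustration only.
postings = [
    "Data Scientist: machine learning, Python, statistics",
    "Data Engineer: Hadoop, Spark, Python, ETL pipelines",
    "Data Scientist with statistics and machine learning background",
]
phrases = ["machine learning", "python", "hadoop", "statistics"]

counts = phrase_counts(postings, phrases)
for phrase, n in counts.most_common():
    print(phrase, n)
```

Counting each phrase at most once per posting keeps one long description from dominating the totals; splitting the postings by job title first would give a per-role comparison.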
Future & Conclusion
• Future: further changes are expected in areas such as the blending of data-intensive and compute-intensive applications, e.g., the rise of High Performance Data Analytics (HPDA).
• Conclusion: the analysis of role usage in industry should be rerun periodically (every 6 months) to identify trends over time.
Thank you

Editor's Notes

  1. NIST (National Institute of Standards and Technology) developed a cybersecurity workforce framework under NICE (the National Initiative for Cybersecurity Education).
  2. National Institute of Standards and Technology
  3. The parent company of Science Applications International Corporation (SAIC) changed its name to Leidos.