SlideShare a Scribd company logo
Symposium on Digital Curation in
the Era of Big Data:
Career Opportunities and
Educational Requirements:
A Data Scientist Perspective
Dr. Vicki Lynn Ferrini
Lamont-Doherty Earth Observatory
Background (What I do)
•
•
•
•
•
•
•
•
•
•
•

Data Documentation (Metadata)
Data Management
Data Discovery & Access Tools
Develop/Implement QA/QC
Data Syntheses
Data Compliance Tools
Education Materials
Delivery to National Data Centers, Libraries
Data Publication & Links to Scientific Literature
Data Integration, Visualization & Analysis Tools
Best Practice Guidelines for Optimizing Acquisition

“Support, sustain, and advance the geosciences by providing
data services for observational solid earth data from the
Ocean, Earth, and Polar Sciences.”

rvdata.us
Scientific Data Continuum
Data
Producers

Scientific
Literature

Data
Consumers
THEN

Data
Producers

Scientific
Literature
Data
Providers

Data
Consumers

Varying Goals/Perspectives/Needs

NOW
Perspective of Data Producers

Domain Specialists

• Goal: Scientific Discovery
• Data Acquisition&
Reduction
• Data Assembly
• Visualization, Integration
& Interpretation
• Scientific Standards
• Technical & Operational
Limitations
• Data documentation
• Varies by domain
• Often difficult
• Heterogeneous
Perspective of Data Consumers
•
•
•
•
•
•

Domain Specialists & Public

Goal: Discovery
Data Discoverability & Access
Cross-disciplinary
Scientific Standards
Interpretation
Increased importance of
documentation
• Data not self-generated
• Data Quality/Reliability
• Data Use/Misuse
Perspective of Data Providers
• Goal: Access/Preservation/Re-Use
• Data Formats & Standards
• Data Documentation &
Preservation Techniques
• Scientific & Metadata Standards
• Data Citation
• Data Transfer Mechanisms
• System Usability
• Interoperability/Linked Data
• Needs of Diversity of User
Community
• Knowledge of Content

Human & Digital Bridge between
Producers & Consumers
At the Intersection:
The Data Scientist

Data
Producers

Data
Consumers

Data
Providers
Data Stewardship Continuum
DATA
PRODUCERS

DATA
PROVIDERS

Data Scientist

DATA
CONSUMERS
Key Attributes of Data Scientists
• Knowledge spanning full scientific data
stewardship continuum
• Domain Experience
• Content & applications
• Data acquisition & reduction practices
• Nuances of Data
• Technical knowledge
• Evolving Technologies
• Data Acquisition & Management
• Metadata
Key Attributes of Data Scientists
• Other skills (seldom taught)
• Communication & Organization
• Understand cultural aspects of user
community
• People/Project Management
• Balance between micro- and macroperspectives
Key Attributes Tech Team Members
• Basic knowledge of content OR interest/curiosity
• Experience with Data Production/Consumption
• Technical skills:
– web development & technology
– geospatially enabled data management tools
– experience with data analysis tools
– ability to work in a variety of tech environments

• Complementary skill sets
• Innovation & creativity
• Willingness to ask questions – assumptions can be
dangerous
Challenges & Opportunities
• Difficult to find right balance between technical
skills and interest in content
– Team dynamics, management approaches evolving
– Increasing opportunities to engage/educate computer
scientists in domain science

• Data producers are slow to join the digital era
– Educational opportunities
– Scientific benefits continue to grow
– New generation incorporating data sharing into scientific
workflow

• Difficult to keep pace with evolving technologies
– Educational & Professional Development opportunities
The Future?

Data
Scientists

Data
Producers

Data
Consumers

Data
Providers

More Related Content

What's hot

RDAP 15 Local ICPSR Data Curation Workshop Pilot Project
RDAP 15 Local ICPSR Data Curation Workshop Pilot ProjectRDAP 15 Local ICPSR Data Curation Workshop Pilot Project
RDAP 15 Local ICPSR Data Curation Workshop Pilot Project
ASIS&T
 
Collaborate, Automate, Prepare, Prioritize: Creating Metadata for Legacy Rese...
Collaborate, Automate, Prepare, Prioritize: Creating Metadata for Legacy Rese...Collaborate, Automate, Prepare, Prioritize: Creating Metadata for Legacy Rese...
Collaborate, Automate, Prepare, Prioritize: Creating Metadata for Legacy Rese...
Jennifer Liss
 
Mike Mertens Directions for RDM day one summary
Mike Mertens Directions for RDM day one summaryMike Mertens Directions for RDM day one summary
Mike Mertens Directions for RDM day one summary
Jisc
 
Data Management Planning for researchers
Data Management Planning for researchersData Management Planning for researchers
Data Management Planning for researchers
Sarah Jones
 
ANDS and Data Management
ANDS and Data ManagementANDS and Data Management
ANDS and Data Management
Julia Gross
 
Presentation to the UM Library Emergent Research Series
Presentation to the UM Library Emergent Research SeriesPresentation to the UM Library Emergent Research Series
Presentation to the UM Library Emergent Research Series
SEAD
 
SEAD: Lightweight Data Services for Sustainability Research
SEAD: Lightweight Data Services for Sustainability ResearchSEAD: Lightweight Data Services for Sustainability Research
SEAD: Lightweight Data Services for Sustainability Research
SEAD
 
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
SEAD
 
Data cycle health
Data cycle healthData cycle health
Data cycle health
jyotikhadake
 
Incentivising the uptake of reusable metadata in the survey production process
Incentivising the uptake of reusable metadata in the survey production processIncentivising the uptake of reusable metadata in the survey production process
Incentivising the uptake of reusable metadata in the survey production process
Louise Corti
 
To share or not to share?
To share or not to share?To share or not to share?
To share or not to share?
Research Information Network
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
National Information Standards Organization (NISO)
 
Next generation data services at the Marriott Library
Next generation data services at the Marriott LibraryNext generation data services at the Marriott Library
Next generation data services at the Marriott Library
Rebekah Cummings
 
Data Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersData Management for Undergraduate Researchers
Data Management for Undergraduate Researchers
Rebekah Cummings
 
RDAP14: Building a data management and curation program on a shoestring budget
RDAP14: Building a data management and curation program on a shoestring budgetRDAP14: Building a data management and curation program on a shoestring budget
RDAP14: Building a data management and curation program on a shoestring budget
ASIS&T
 
Who owns the data? Intellectual property considerations for academic research...
Who owns the data? Intellectual property considerations for academic research...Who owns the data? Intellectual property considerations for academic research...
Who owns the data? Intellectual property considerations for academic research...
Rebekah Cummings
 
Llebot "Research Data Support for Researchers: Metadata, Challenges, and Oppo...
Llebot "Research Data Support for Researchers: Metadata, Challenges, and Oppo...Llebot "Research Data Support for Researchers: Metadata, Challenges, and Oppo...
Llebot "Research Data Support for Researchers: Metadata, Challenges, and Oppo...
National Information Standards Organization (NISO)
 
End of COBWEB Co-Design Projects Celebration
End of COBWEB Co-Design Projects Celebration		End of COBWEB Co-Design Projects Celebration
End of COBWEB Co-Design Projects Celebration
EDINA, University of Edinburgh
 
Discovery event stuart lee (the humanities researcher)
Discovery event stuart lee (the humanities researcher)Discovery event stuart lee (the humanities researcher)
Discovery event stuart lee (the humanities researcher)RDTF-Discovery
 
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...
ASIS&T
 

What's hot (20)

RDAP 15 Local ICPSR Data Curation Workshop Pilot Project
RDAP 15 Local ICPSR Data Curation Workshop Pilot ProjectRDAP 15 Local ICPSR Data Curation Workshop Pilot Project
RDAP 15 Local ICPSR Data Curation Workshop Pilot Project
 
Collaborate, Automate, Prepare, Prioritize: Creating Metadata for Legacy Rese...
Collaborate, Automate, Prepare, Prioritize: Creating Metadata for Legacy Rese...Collaborate, Automate, Prepare, Prioritize: Creating Metadata for Legacy Rese...
Collaborate, Automate, Prepare, Prioritize: Creating Metadata for Legacy Rese...
 
Mike Mertens Directions for RDM day one summary
Mike Mertens Directions for RDM day one summaryMike Mertens Directions for RDM day one summary
Mike Mertens Directions for RDM day one summary
 
Data Management Planning for researchers
Data Management Planning for researchersData Management Planning for researchers
Data Management Planning for researchers
 
ANDS and Data Management
ANDS and Data ManagementANDS and Data Management
ANDS and Data Management
 
Presentation to the UM Library Emergent Research Series
Presentation to the UM Library Emergent Research SeriesPresentation to the UM Library Emergent Research Series
Presentation to the UM Library Emergent Research Series
 
SEAD: Lightweight Data Services for Sustainability Research
SEAD: Lightweight Data Services for Sustainability ResearchSEAD: Lightweight Data Services for Sustainability Research
SEAD: Lightweight Data Services for Sustainability Research
 
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
 
Data cycle health
Data cycle healthData cycle health
Data cycle health
 
Incentivising the uptake of reusable metadata in the survey production process
Incentivising the uptake of reusable metadata in the survey production processIncentivising the uptake of reusable metadata in the survey production process
Incentivising the uptake of reusable metadata in the survey production process
 
To share or not to share?
To share or not to share?To share or not to share?
To share or not to share?
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
Next generation data services at the Marriott Library
Next generation data services at the Marriott LibraryNext generation data services at the Marriott Library
Next generation data services at the Marriott Library
 
Data Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersData Management for Undergraduate Researchers
Data Management for Undergraduate Researchers
 
RDAP14: Building a data management and curation program on a shoestring budget
RDAP14: Building a data management and curation program on a shoestring budgetRDAP14: Building a data management and curation program on a shoestring budget
RDAP14: Building a data management and curation program on a shoestring budget
 
Who owns the data? Intellectual property considerations for academic research...
Who owns the data? Intellectual property considerations for academic research...Who owns the data? Intellectual property considerations for academic research...
Who owns the data? Intellectual property considerations for academic research...
 
Llebot "Research Data Support for Researchers: Metadata, Challenges, and Oppo...
Llebot "Research Data Support for Researchers: Metadata, Challenges, and Oppo...Llebot "Research Data Support for Researchers: Metadata, Challenges, and Oppo...
Llebot "Research Data Support for Researchers: Metadata, Challenges, and Oppo...
 
End of COBWEB Co-Design Projects Celebration
End of COBWEB Co-Design Projects Celebration		End of COBWEB Co-Design Projects Celebration
End of COBWEB Co-Design Projects Celebration
 
Discovery event stuart lee (the humanities researcher)
Discovery event stuart lee (the humanities researcher)Discovery event stuart lee (the humanities researcher)
Discovery event stuart lee (the humanities researcher)
 
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...
 

Similar to A Data Scientist Perspective on Data Curation in the Digital Era

Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
SEAD
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Tony Ross-Hellauer
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
OpenAIRE
 
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu | Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
EUDAT
 
Engaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciencesEngaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciences
Louise Corti
 
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
Research Support Team, IT Services, University of Oxford
 
Research Data Mangagement Essentials, 5th July 2017
Research Data Mangagement Essentials, 5th July 2017Research Data Mangagement Essentials, 5th July 2017
Research Data Mangagement Essentials, 5th July 2017
Research Data Leeds
 
ROER4D Open Data Initiative
ROER4D Open Data InitiativeROER4D Open Data Initiative
ROER4D Open Data Initiative
Michelle Willmers
 
Slides | Research data literacy and the library
Slides | Research data literacy and the librarySlides | Research data literacy and the library
Slides | Research data literacy and the library
Colleen DeLory
 
Slides | Research data literacy and the library
Slides | Research data literacy and the librarySlides | Research data literacy and the library
Slides | Research data literacy and the library
Library_Connect
 
Data management: The new frontier for libraries
Data management: The new frontier for librariesData management: The new frontier for libraries
Data management: The new frontier for libraries
LEARN Project
 
Challenges in setting up an RDM Support Service
Challenges in setting up an RDM Support ServiceChallenges in setting up an RDM Support Service
Challenges in setting up an RDM Support Service
GarethKnight
 
Gsa rdm training
Gsa rdm trainingGsa rdm training
Gsa rdm training
JISC funded KAPTUR project
 
Critical infrastructure to promote data synthesis
Critical infrastructure to promote data synthesis Critical infrastructure to promote data synthesis
Critical infrastructure to promote data synthesis
Soil and Water Conservation Society
 
Love Your Data Locally
Love Your Data LocallyLove Your Data Locally
Love Your Data Locally
Erin D. Foster
 
Data Management Planning for Engineers
Data Management Planning for EngineersData Management Planning for Engineers
Data Management Planning for Engineers
Sherry Lake
 
Supporting Data Stewardship in the Solid Earth Sciences
Supporting Data Stewardship in the Solid Earth SciencesSupporting Data Stewardship in the Solid Earth Sciences
Supporting Data Stewardship in the Solid Earth Sciences
Vicki Ferrini
 
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
ICPSR
 
Research Data Management in Academic Libraries: Meeting the Challenge
Research Data Management in Academic Libraries: Meeting the ChallengeResearch Data Management in Academic Libraries: Meeting the Challenge
Research Data Management in Academic Libraries: Meeting the Challenge
Spencer Keralis
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
Projeto RCAAP
 

Similar to A Data Scientist Perspective on Data Curation in the Digital Era (20)

Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
 
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu | Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
Research Data Management Introduction: EUDAT/Open AIRE Webinar| www.eudat.eu |
 
Engaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciencesEngaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciences
 
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
 
Research Data Mangagement Essentials, 5th July 2017
Research Data Mangagement Essentials, 5th July 2017Research Data Mangagement Essentials, 5th July 2017
Research Data Mangagement Essentials, 5th July 2017
 
ROER4D Open Data Initiative
ROER4D Open Data InitiativeROER4D Open Data Initiative
ROER4D Open Data Initiative
 
Slides | Research data literacy and the library
Slides | Research data literacy and the librarySlides | Research data literacy and the library
Slides | Research data literacy and the library
 
Slides | Research data literacy and the library
Slides | Research data literacy and the librarySlides | Research data literacy and the library
Slides | Research data literacy and the library
 
Data management: The new frontier for libraries
Data management: The new frontier for librariesData management: The new frontier for libraries
Data management: The new frontier for libraries
 
Challenges in setting up an RDM Support Service
Challenges in setting up an RDM Support ServiceChallenges in setting up an RDM Support Service
Challenges in setting up an RDM Support Service
 
Gsa rdm training
Gsa rdm trainingGsa rdm training
Gsa rdm training
 
Critical infrastructure to promote data synthesis
Critical infrastructure to promote data synthesis Critical infrastructure to promote data synthesis
Critical infrastructure to promote data synthesis
 
Love Your Data Locally
Love Your Data LocallyLove Your Data Locally
Love Your Data Locally
 
Data Management Planning for Engineers
Data Management Planning for EngineersData Management Planning for Engineers
Data Management Planning for Engineers
 
Supporting Data Stewardship in the Solid Earth Sciences
Supporting Data Stewardship in the Solid Earth SciencesSupporting Data Stewardship in the Solid Earth Sciences
Supporting Data Stewardship in the Solid Earth Sciences
 
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
 
Research Data Management in Academic Libraries: Meeting the Challenge
Research Data Management in Academic Libraries: Meeting the ChallengeResearch Data Management in Academic Libraries: Meeting the Challenge
Research Data Management in Academic Libraries: Meeting the Challenge
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 

More from Vicki Ferrini

Multibeam Advisory Committee - 2016 UNOLS FIC Meeting
Multibeam Advisory Committee - 2016 UNOLS FIC MeetingMultibeam Advisory Committee - 2016 UNOLS FIC Meeting
Multibeam Advisory Committee - 2016 UNOLS FIC Meeting
Vicki Ferrini
 
Underwater Video – Community Perspectives on Needs, Challenges and Opportunit...
Underwater Video – Community Perspectives on Needs, Challenges and Opportunit...Underwater Video – Community Perspectives on Needs, Challenges and Opportunit...
Underwater Video – Community Perspectives on Needs, Challenges and Opportunit...
Vicki Ferrini
 
Ocean Sciences 2016 Presentation
Ocean Sciences 2016 PresentationOcean Sciences 2016 Presentation
Ocean Sciences 2016 Presentation
Vicki Ferrini
 
Navigating the Marine Geophysical Data Life Cycle
Navigating the Marine Geophysical Data Life CycleNavigating the Marine Geophysical Data Life Cycle
Navigating the Marine Geophysical Data Life Cycle
Vicki Ferrini
 
Data Management Planning and Data Compliance Reporting with IEDA
Data Management Planning and Data Compliance Reporting with IEDAData Management Planning and Data Compliance Reporting with IEDA
Data Management Planning and Data Compliance Reporting with IEDA
Vicki Ferrini
 
Global Multi-Resolution Topography (GMRT) - Making bathymetry data openly acc...
Global Multi-Resolution Topography (GMRT) - Making bathymetry data openly acc...Global Multi-Resolution Topography (GMRT) - Making bathymetry data openly acc...
Global Multi-Resolution Topography (GMRT) - Making bathymetry data openly acc...
Vicki Ferrini
 

More from Vicki Ferrini (6)

Multibeam Advisory Committee - 2016 UNOLS FIC Meeting
Multibeam Advisory Committee - 2016 UNOLS FIC MeetingMultibeam Advisory Committee - 2016 UNOLS FIC Meeting
Multibeam Advisory Committee - 2016 UNOLS FIC Meeting
 
Underwater Video – Community Perspectives on Needs, Challenges and Opportunit...
Underwater Video – Community Perspectives on Needs, Challenges and Opportunit...Underwater Video – Community Perspectives on Needs, Challenges and Opportunit...
Underwater Video – Community Perspectives on Needs, Challenges and Opportunit...
 
Ocean Sciences 2016 Presentation
Ocean Sciences 2016 PresentationOcean Sciences 2016 Presentation
Ocean Sciences 2016 Presentation
 
Navigating the Marine Geophysical Data Life Cycle
Navigating the Marine Geophysical Data Life CycleNavigating the Marine Geophysical Data Life Cycle
Navigating the Marine Geophysical Data Life Cycle
 
Data Management Planning and Data Compliance Reporting with IEDA
Data Management Planning and Data Compliance Reporting with IEDAData Management Planning and Data Compliance Reporting with IEDA
Data Management Planning and Data Compliance Reporting with IEDA
 
Global Multi-Resolution Topography (GMRT) - Making bathymetry data openly acc...
Global Multi-Resolution Topography (GMRT) - Making bathymetry data openly acc...Global Multi-Resolution Topography (GMRT) - Making bathymetry data openly acc...
Global Multi-Resolution Topography (GMRT) - Making bathymetry data openly acc...
 

Recently uploaded

Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
MysoreMuleSoftMeetup
 
The Diamond Necklace by Guy De Maupassant.pptx
The Diamond Necklace by Guy De Maupassant.pptxThe Diamond Necklace by Guy De Maupassant.pptx
The Diamond Necklace by Guy De Maupassant.pptx
DhatriParmar
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
DhatriParmar
 
South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)
Academy of Science of South Africa
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
JEE1_This_section_contains_FOUR_ questions
JEE1_This_section_contains_FOUR_ questionsJEE1_This_section_contains_FOUR_ questions
JEE1_This_section_contains_FOUR_ questions
ShivajiThube2
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
Nguyen Thanh Tu Collection
 
Normal Labour/ Stages of Labour/ Mechanism of Labour
Normal Labour/ Stages of Labour/ Mechanism of LabourNormal Labour/ Stages of Labour/ Mechanism of Labour
Normal Labour/ Stages of Labour/ Mechanism of Labour
Wasim Ak
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Thiyagu K
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
tarandeep35
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Dr. Vinod Kumar Kanvaria
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
Peter Windle
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
Balvir Singh
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
Celine George
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
Scholarhat
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
SACHIN R KONDAGURI
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 

Recently uploaded (20)

Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
 
The Diamond Necklace by Guy De Maupassant.pptx
The Diamond Necklace by Guy De Maupassant.pptxThe Diamond Necklace by Guy De Maupassant.pptx
The Diamond Necklace by Guy De Maupassant.pptx
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
 
South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
JEE1_This_section_contains_FOUR_ questions
JEE1_This_section_contains_FOUR_ questionsJEE1_This_section_contains_FOUR_ questions
JEE1_This_section_contains_FOUR_ questions
 
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
 
Normal Labour/ Stages of Labour/ Mechanism of Labour
Normal Labour/ Stages of Labour/ Mechanism of LabourNormal Labour/ Stages of Labour/ Mechanism of Labour
Normal Labour/ Stages of Labour/ Mechanism of Labour
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 

A Data Scientist Perspective on Data Curation in the Digital Era

  • 1. Symposium on Digital Curation in the Era of Big Data: Career Opportunities and Educational Requirements: A Data Scientist Perspective Dr. Vicki Lynn Ferrini Lamont-Doherty Earth Observatory
  • 2. Background (What I do) • • • • • • • • • • • Data Documentation (Metadata) Data Management Data Discovery & Access Tools Develop/Implement QA/QC Data Syntheses Data Compliance Tools Education Materials Delivery to National Data Centers, Libraries Data Publication & Links to Scientific Literature Data Integration, Visualization & Analysis Tools Best Practice Guidelines for Optimizing Acquisition “Support, sustain, and advance the geosciences by providing data services for observational solid earth data from the Ocean, Earth, and Polar Sciences.” rvdata.us
  • 4. Perspective of Data Producers Domain Specialists • Goal: Scientific Discovery • Data Acquisition& Reduction • Data Assembly • Visualization, Integration & Interpretation • Scientific Standards • Technical & Operational Limitations • Data documentation • Varies by domain • Often difficult • Heterogeneous
  • 5. Perspective of Data Consumers • • • • • • Domain Specialists & Public Goal: Discovery Data Discoverability & Access Cross-disciplinary Scientific Standards Interpretation Increased importance of documentation • Data not self-generated • Data Quality/Reliability • Data Use/Misuse
  • 6. Perspective of Data Providers • Goal: Access/Preservation/Re-Use • Data Formats & Standards • Data Documentation & Preservation Techniques • Scientific & Metadata Standards • Data Citation • Data Transfer Mechanisms • System Usability • Interoperability/Linked Data • Needs of Diversity of User Community • Knowledge of Content Human & Digital Bridge between Producers & Consumers
  • 7. At the Intersection: The Data Scientist Data Producers Data Consumers Data Providers
  • 9. Key Attributes of Data Scientists • Knowledge spanning full scientific data stewardship continuum • Domain Experience • Content & applications • Data acquisition & reduction practices • Nuances of Data • Technical knowledge • Evolving Technologies • Data Acquisition & Management • Metadata
  • 10. Key Attributes of Data Scientists • Other skills (seldom taught) • Communication & Organization • Understand cultural aspects of user community • People/Project Management • Balance between micro- and macroperspectives
  • 11. Key Attributes Tech Team Members • Basic knowledge of content OR interest/curiosity • Experience with Data Production/Consumption • Technical skills: – web development & technology – geospatially enabled data management tools – experience with data analysis tools – ability to work in a variety of tech environments • Complementary skill sets • Innovation & creativity • Willingness to ask questions – assumptions can be dangerous
  • 12. Challenges & Opportunities • Difficult to find right balance between technical skills and interest in content – Team dynamics, management approaches evolving – Increasing opportunities to engage/educate computer scientists in domain science • Data producers are slow to join the digital era – Educational opportunities – Scientific benefits continue to grow – New generation incorporating data sharing into scientific workflow • Difficult to keep pace with evolving technologies – Educational & Professional Development opportunities