SlideShare a Scribd company logo
Data science education resources for
everyone
Nicole Vasilevsky, Jackie Wirz, Bjorn Pederson, Ted Laderas, Shannon McWeeney,
William Hersh, David A. Dorr, Melissa Haendel
Oregon Health & Science University
MLA/PNC 2016
The problem
Major challenge: how to manage, analyze
and interpret vast amounts of data being
generated in biomedical research
One goal of NIH Big Data to Knowledge
(BD2K) initiative: provide training for
students and researchers to address this
Research team in the Library and
Department of Medical Informatics and
Clinical Epidemiology (DMICE) is
developing skills courses and open
educational resources (OERs)
http://whelf.ac.uk/activity-data-delivering-benefits-from-the-data-deluge/
Our Approach
Skills courses and OERs connect the dots that help researchers understand how to apply
data science techniques in the context of their whole research life cycle
Skills courses and OER topics are aimed to fill specific gaps
BD2K Skills Courses
Taught by BD2K Faculty,
Post-doc and Staff
In person format Targeted to a variety of
students
Defining The Problem Wrangling Data
Data Identification And Resources
 Problems amenable to analytics
 Importance of question
 Team definitions
 Scope
 When we do this wrong: methods don't match
 Finding the right data
 Search methods
 Use of metadata
 Data management
 Exploratory Data Analysis
 Data Dictionary
 As you touch data, what can go wrong?
Methods, Tools And Analysis Scientific Communication
 Visualization
 Matching algorithms to problems
…
 Reporting Findings and Limitations
 Giving “Elevator Speech” on ideas of how to
approach problem
 Critique of related problem
Course Offerings
Course Length Who WhatWhen
Intro Course
Week long course
(~40 hrs)
July
2015
Interns and
undergraduates
Taught basics of data
science in the context
of the research life
cycle
Data After Dark
2 evening course
(4 hrs/nt)
January
2016
OHSU students,
staff and faculty
Emerging data science
activities/research
impact
Data and Donuts
2 morning course
(3 hrs/day)
June
2016
OHSU Summer
interns
Basics of data science
Advanced Course
4 evening course
(2 hrs/nt)
May
2016
OHSU students,
staff and faculty
Hands on Data viz /
Data wrangling
Data and Donuts
West
4 hour course
July
2016
OHSU summer
interns (West
Campus)
Basics of data science
Think like a data scientis
t
- the Data and Donuts workshop
will provide an introduction to data science for those new
to research. Summer interns encouraged to attend!
Topics covered will include
• What is Big Data?
• Asking the right question and getting
the right
answers from your data
• Finding data resources in the real world
• Data handling 101
• Ethics of data• Communicating your science for maximal
impact
June 2 8 & 2 9 | 9 - 1 2 PM | D onut s!
Fr ee Wor kshop!
DataAndDonuts
Interested?
Register at http
:
// bit.ly/ 1sfDeXz
or email wirzj@ohsu.eduw w w .ohsu.edu/ bd2 k
Hands-on! Learn by Doing!
Join us for a 4 evening workshop:
· Data Wrangling with Python and Pandas
· Interactive visualization with R/ Shiny
· Supervised Learning Algorithms + Kaggle Challenge
Familiarity with R and Git is required. Bring your laptop!
!
May 23-26th
5-7pm
Register at http:/ / bit.ly/ 1pFVvLv
Department of Medical Informatics + Clinical Epidemiology + OHSU Library
Funding: NIH 5R25EB020379
For more information, e-mail bd2k@ohsu.edu
FREE OHSU BD2K ADVANCED
DATA AFTER DARK WORKSHOP
Evaluation of Skills Courses
0% 20% 40% 60% 80% 100%
Evaluation Summary from Beginnner
Students
Beginner Percent 6 & 7 Beginner Percent 3, 4 & 5
Beginner Percent 1 & 2
0% 20% 40% 60% 80% 100%
Evaluation Summary from Advanced
Students
Advanced Percent 6 & 7 Advanced Percent 3, 4 & 5
Advanced Percent 1 & 2
The instructors clearly presented the
skills to be learned
The instructors presented
content in an organized manner
The instructors effectively presented
concepts and techniques
OER Modules
01 | Biomedical Big Data Science
02 | Introduction to Big Data in Biology and Medicine
03 | Ethical Issues in Use of Big Data
04 | Clinical Standards Related to Big Data
05 | Basic Research Data Standards
06 | Public Health and Big Data
07 | Team Science
08 | Secondary Use (Reuse) of Clinical Data
09 | Publication and Peer Review
10 | Information Retrieval
11 | Version Control and Identifiers
12 | Data annotation and curation
13 | Data Tools and Landscape
14 | Ontologies 101
15 | Data metadata and provenance
16 | Semantic data interoperability
17 | Choice of Algorithms and Algorithm Dynamics
18 | Visualization and Interpretation
19 | Replication, Validation and the spectrum of
Reproducibility Semantic data interoperability
20 | Regulatory Issues in Big Data for Genomics and Health
Semantic Web data
21 | Hosting data dissemination and data stewardship
workshops
22 | Hosting data dissemination and data stewardship
workshops
23 | Terminology of Biomedical, Clinical, and Translational
Research
24 | Computing Concepts for Big Data
25 | Data modeling
26 | Semantic Web data
27 | Context-based selection of data
28 | Translating the Question
29 | Implications of Provenance and Pre-processing
30 | Data tells a story
31 | Statistical Significance, P-hacking and Multiple-testing
32 | Displaying Confidence and Uncertainty
https://dmice.ohsu.edu/bd2k/topics.html
What is available in the modules?
Module Overview Online viewing Powerpoint files Audio files
Exercises References Resources
MLA- Professional Competencies For Health Sciences Librarians
https://dmice.ohsu.edu/bd2k/mapping_MLA.html
Competency #1
Understand the health sciences and
health care environment and the policies,
issues, and trends that impact that
environment
BDK02 - Introduction
To Big Data In Biology
And Medicine
BDK03 - Ethical Issues
In Use Of Big Data
BDK07- Team Science
Competency #3
Understand the principles and practices
related to providing information services
to meet users' needs
BDK10 - Information
Retrieval
BDK22 - Guidelines For
Reporting, Publications,
And Data Sharing
Competency #4
Have the ability to manage health
information resources in a broad range of
formats
BDK09 - Publication And Peer
Review
BDK12 - Data Annotation And
Curation
BDK14 - Ontologies 101
BDK15 - Data Metadata And
Provenance
Competency #5
Understand and use technology and
systems to manage all forms of
information
BDK10 - Information Retrieval
BDK12 - Data Annotation And
Curation
BDK13 - Data and tools
landscape
BDK14 - Ontologies 101
BDK26 - Introduction to
Semantic Web data
Competency #6
Understand curricular design and
instruction and have the ability to teach
ways to access, organize, and use
information
BDK21 - Hosting Data
Dissemination And Data
Stewardship Workshops
Competency #7
Understand scientific research methods
and have the ability to critically examine
and filter research literature from many
related disciplines
BDK07- Team Science
BDK18 - Visualization And
Interpretation
BDK19 - Replication,
Validation And The
Spectrum Of Reproducibility
BDK01 - Biomedical Big Data
Science
BDK04 - Clinical Data And Standards
Related To Big Data
BDK05 - Basic Research Data Standards
BDK04 - Clinical Data And Standards
Related To Big Data
BDK05 - Basic Research Data Standards
Challenges
Scope
Images
Style
Dissemination
How to scope generic curricula for different
levels of users
How to translate diverse teaching
styles into general materials
How to maximize dissemination
while protecting intellectual
property
How to incorporate images and other
copyrighted materials into open
resources
Who are these resources for? EVERYONE!
thenounproject.com
Undergraduate
Students
Graduate
Students
Clinicians Post-docs
Librarians Staff Faculty
Help review our modules:
https://dmice.ohsu.edu/bd2k/topics.html
Acknowledgements
Bill Hersh, PI Melissa Haendel, PI Shannon McWeeney, PI David Dorr, PI
Ted Laderas,
Instructor
Jackie Wirz,
Instructor
Nicole Vasilevsky,
Instructor
Bjorn Pederson,
Instructional Designer
This work is supported by NIH Grants 1R25EB020379-01 and 1R25GM114820-01.
You can find me at:
@n_vasilevsky
vasilevs@ohsu.edu
Thanks!

More Related Content

What's hot

Principles, key responsibilities, and their intersection
Principles, key responsibilities, and their intersectionPrinciples, key responsibilities, and their intersection
Principles, key responsibilities, and their intersection
ARDC
 
Research Integrity Advisor and Data Management
Research Integrity Advisor and Data ManagementResearch Integrity Advisor and Data Management
Research Integrity Advisor and Data Management
ARDC
 
Research Data Mgt
Research Data MgtResearch Data Mgt
Research Data Mgt
idola008
 
Scientific information retrieval: Challenges and opportunities
Scientific information retrieval: Challenges and opportunitiesScientific information retrieval: Challenges and opportunities
Scientific information retrieval: Challenges and opportunities
Ludo Waltman
 
Data peer review workshop
Data peer review workshopData peer review workshop
Data peer review workshop
Varsha Khodiyar
 
Research evaluation in iraq from 1996 to 2014
Research evaluation in iraq from 1996 to 2014Research evaluation in iraq from 1996 to 2014
Research evaluation in iraq from 1996 to 2014
Professor Dr. Bassim H. Hameed
 
Gaining credit for sharing research data
Gaining credit for sharing research dataGaining credit for sharing research data
Gaining credit for sharing research data
Varsha Khodiyar
 
Henning Müller et Michael Schumacher pour la journée e-health 2013
Henning Müller et Michael Schumacher pour la journée e-health 2013Henning Müller et Michael Schumacher pour la journée e-health 2013
Henning Müller et Michael Schumacher pour la journée e-health 2013
Thearkvalais
 
Comparing scientific performance across disciplines: Methodological and conce...
Comparing scientific performance across disciplines: Methodological and conce...Comparing scientific performance across disciplines: Methodological and conce...
Comparing scientific performance across disciplines: Methodological and conce...
Ludo Waltman
 
Enhancing Our Capacity for Large Health Dataset Analysis
Enhancing Our Capacity for Large Health Dataset AnalysisEnhancing Our Capacity for Large Health Dataset Analysis
Enhancing Our Capacity for Large Health Dataset Analysis
CTSI at UCSF
 
Midwest Medical Library Association 2015 Big Data Panel
Midwest Medical Library Association 2015 Big Data PanelMidwest Medical Library Association 2015 Big Data Panel
Midwest Medical Library Association 2015 Big Data Panel
IUPUI
 
Research Data Management Planning: problems and solutions
Research Data Management Planning: problems and solutionsResearch Data Management Planning: problems and solutions
Research Data Management Planning: problems and solutions
Arhiv družboslovnih podatkov
 
RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...
RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...
RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...
ASIS&T
 
Embedding ORCID across researcher career paths
Embedding ORCID across researcher career pathsEmbedding ORCID across researcher career paths
Embedding ORCID across researcher career paths
ORCID, Inc
 
eHealth unit HES-SO in Sierre
eHealth unit HES-SO in SierreeHealth unit HES-SO in Sierre
eHealth unit HES-SO in Sierre
Institute of Information Systems (HES-SO)
 
Quality analysis of NSF DMP plans - Wayne State University
Quality analysis of NSF DMP plans - Wayne State UniversityQuality analysis of NSF DMP plans - Wayne State University
Quality analysis of NSF DMP plans - Wayne State University
rds-wayne-edu
 
MSc transneuro & gastro 2013-14
MSc transneuro & gastro 2013-14MSc transneuro & gastro 2013-14
MSc transneuro & gastro 2013-14
PaulaFunnell
 
Case studies for open science
Case studies for open scienceCase studies for open science
Case studies for open science
IUPUI
 
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...
Ludo Waltman
 
The Alzheimer's Disease Research Network and the Uniform Data Set
The Alzheimer's Disease Research Network and the Uniform Data SetThe Alzheimer's Disease Research Network and the Uniform Data Set
The Alzheimer's Disease Research Network and the Uniform Data Set
Sociotechnical Roundtable
 

What's hot (20)

Principles, key responsibilities, and their intersection
Principles, key responsibilities, and their intersectionPrinciples, key responsibilities, and their intersection
Principles, key responsibilities, and their intersection
 
Research Integrity Advisor and Data Management
Research Integrity Advisor and Data ManagementResearch Integrity Advisor and Data Management
Research Integrity Advisor and Data Management
 
Research Data Mgt
Research Data MgtResearch Data Mgt
Research Data Mgt
 
Scientific information retrieval: Challenges and opportunities
Scientific information retrieval: Challenges and opportunitiesScientific information retrieval: Challenges and opportunities
Scientific information retrieval: Challenges and opportunities
 
Data peer review workshop
Data peer review workshopData peer review workshop
Data peer review workshop
 
Research evaluation in iraq from 1996 to 2014
Research evaluation in iraq from 1996 to 2014Research evaluation in iraq from 1996 to 2014
Research evaluation in iraq from 1996 to 2014
 
Gaining credit for sharing research data
Gaining credit for sharing research dataGaining credit for sharing research data
Gaining credit for sharing research data
 
Henning Müller et Michael Schumacher pour la journée e-health 2013
Henning Müller et Michael Schumacher pour la journée e-health 2013Henning Müller et Michael Schumacher pour la journée e-health 2013
Henning Müller et Michael Schumacher pour la journée e-health 2013
 
Comparing scientific performance across disciplines: Methodological and conce...
Comparing scientific performance across disciplines: Methodological and conce...Comparing scientific performance across disciplines: Methodological and conce...
Comparing scientific performance across disciplines: Methodological and conce...
 
Enhancing Our Capacity for Large Health Dataset Analysis
Enhancing Our Capacity for Large Health Dataset AnalysisEnhancing Our Capacity for Large Health Dataset Analysis
Enhancing Our Capacity for Large Health Dataset Analysis
 
Midwest Medical Library Association 2015 Big Data Panel
Midwest Medical Library Association 2015 Big Data PanelMidwest Medical Library Association 2015 Big Data Panel
Midwest Medical Library Association 2015 Big Data Panel
 
Research Data Management Planning: problems and solutions
Research Data Management Planning: problems and solutionsResearch Data Management Planning: problems and solutions
Research Data Management Planning: problems and solutions
 
RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...
RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...
RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...
 
Embedding ORCID across researcher career paths
Embedding ORCID across researcher career pathsEmbedding ORCID across researcher career paths
Embedding ORCID across researcher career paths
 
eHealth unit HES-SO in Sierre
eHealth unit HES-SO in SierreeHealth unit HES-SO in Sierre
eHealth unit HES-SO in Sierre
 
Quality analysis of NSF DMP plans - Wayne State University
Quality analysis of NSF DMP plans - Wayne State UniversityQuality analysis of NSF DMP plans - Wayne State University
Quality analysis of NSF DMP plans - Wayne State University
 
MSc transneuro & gastro 2013-14
MSc transneuro & gastro 2013-14MSc transneuro & gastro 2013-14
MSc transneuro & gastro 2013-14
 
Case studies for open science
Case studies for open scienceCase studies for open science
Case studies for open science
 
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...
 
The Alzheimer's Disease Research Network and the Uniform Data Set
The Alzheimer's Disease Research Network and the Uniform Data SetThe Alzheimer's Disease Research Network and the Uniform Data Set
The Alzheimer's Disease Research Network and the Uniform Data Set
 

Viewers also liked

Enhancing the Human Phenotype Ontology for Use by the Layperson
Enhancing the Human Phenotype Ontology for Use by the LaypersonEnhancing the Human Phenotype Ontology for Use by the Layperson
Enhancing the Human Phenotype Ontology for Use by the Layperson
Nicole Vasilevsky
 
Empowering patients by increasing accessibility to clinical terminology
Empowering patients by increasing accessibility to clinical terminologyEmpowering patients by increasing accessibility to clinical terminology
Empowering patients by increasing accessibility to clinical terminology
Nicole Vasilevsky
 
Acrl march2015 final
Acrl march2015 finalAcrl march2015 final
Acrl march2015 final
Nicole Vasilevsky
 
On the Reproducibility of Science: Unique Identification of Research Resourc...
On the Reproducibility of Science: Unique Identification of  Research Resourc...On the Reproducibility of Science: Unique Identification of  Research Resourc...
On the Reproducibility of Science: Unique Identification of Research Resourc...
Nicole Vasilevsky
 
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
Robert H. McDonald
 
Research resources: curating the new eagle-i discovery system
Research resources: curating the new eagle-i discovery systemResearch resources: curating the new eagle-i discovery system
Research resources: curating the new eagle-i discovery system
Nicole Vasilevsky
 
The Role of Libraries in Data Management and Curation
The Role of Libraries in Data Management and CurationThe Role of Libraries in Data Management and Curation
The Role of Libraries in Data Management and Curation
Nicole Vasilevsky
 
Deep phenotyping for everyone
Deep phenotyping for everyoneDeep phenotyping for everyone
Deep phenotyping for everyone
mhaendel
 
Why the world needs phenopacketeers, and how to be one
Why the world needs phenopacketeers, and how to be oneWhy the world needs phenopacketeers, and how to be one
Why the world needs phenopacketeers, and how to be one
mhaendel
 

Viewers also liked (9)

Enhancing the Human Phenotype Ontology for Use by the Layperson
Enhancing the Human Phenotype Ontology for Use by the LaypersonEnhancing the Human Phenotype Ontology for Use by the Layperson
Enhancing the Human Phenotype Ontology for Use by the Layperson
 
Empowering patients by increasing accessibility to clinical terminology
Empowering patients by increasing accessibility to clinical terminologyEmpowering patients by increasing accessibility to clinical terminology
Empowering patients by increasing accessibility to clinical terminology
 
Acrl march2015 final
Acrl march2015 finalAcrl march2015 final
Acrl march2015 final
 
On the Reproducibility of Science: Unique Identification of Research Resourc...
On the Reproducibility of Science: Unique Identification of  Research Resourc...On the Reproducibility of Science: Unique Identification of  Research Resourc...
On the Reproducibility of Science: Unique Identification of Research Resourc...
 
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
 
Research resources: curating the new eagle-i discovery system
Research resources: curating the new eagle-i discovery systemResearch resources: curating the new eagle-i discovery system
Research resources: curating the new eagle-i discovery system
 
The Role of Libraries in Data Management and Curation
The Role of Libraries in Data Management and CurationThe Role of Libraries in Data Management and Curation
The Role of Libraries in Data Management and Curation
 
Deep phenotyping for everyone
Deep phenotyping for everyoneDeep phenotyping for everyone
Deep phenotyping for everyone
 
Why the world needs phenopacketeers, and how to be one
Why the world needs phenopacketeers, and how to be oneWhy the world needs phenopacketeers, and how to be one
Why the world needs phenopacketeers, and how to be one
 

Similar to Data science education resources for everyone

Open Educational Resources for Big Data Science
Open Educational Resources for Big Data ScienceOpen Educational Resources for Big Data Science
Open Educational Resources for Big Data Science
William Hersh, MD
 
Teaching Data Science to Undergraduate Students
Teaching Data Science to Undergraduate StudentsTeaching Data Science to Undergraduate Students
Teaching Data Science to Undergraduate Students
Nicole Vasilevsky
 
Rachel Bruce UK research and data management where are we now
Rachel Bruce UK research and data management where are we nowRachel Bruce UK research and data management where are we now
Rachel Bruce UK research and data management where are we now
Jisc
 
Critical infrastructure to promote data synthesis
Critical infrastructure to promote data synthesis Critical infrastructure to promote data synthesis
Critical infrastructure to promote data synthesis
Soil and Water Conservation Society
 
Supporting Research Data Management in UK Universities: the Jisc Managing Res...
Supporting Research Data Management in UK Universities: the Jisc Managing Res...Supporting Research Data Management in UK Universities: the Jisc Managing Res...
Supporting Research Data Management in UK Universities: the Jisc Managing Res...
L Molloy
 
INCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLAN
INCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLANINCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLAN
INCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLAN
Arhiv družboslovnih podatkov
 
Read Surkis Facilitating Development of Research Data Services
Read Surkis Facilitating Development of Research Data ServicesRead Surkis Facilitating Development of Research Data Services
Read Surkis Facilitating Development of Research Data Services
National Information Standards Organization (NISO)
 
Data Science for Every Student at RPI
Data Science for Every Student at RPIData Science for Every Student at RPI
Data Science for Every Student at RPI
Steven Miller
 
Institutional Data Management Blueprint
Institutional Data Management BlueprintInstitutional Data Management Blueprint
Institutional Data Management Blueprint
Eduserv
 
Libraries and Research Data Management – What Works? Lessons Learned from the...
Libraries and Research Data Management – What Works? Lessons Learned from the...Libraries and Research Data Management – What Works? Lessons Learned from the...
Libraries and Research Data Management – What Works? Lessons Learned from the...
LIBER Europe
 
Open Access to Research Data: Challenges and Solutions
Open Access to Research Data: Challenges and SolutionsOpen Access to Research Data: Challenges and Solutions
Open Access to Research Data: Challenges and Solutions
Martin Donnelly
 
Briefing on Research Data Management at LSBU December 2015
Briefing on Research Data Management at LSBU December 2015Briefing on Research Data Management at LSBU December 2015
Briefing on Research Data Management at LSBU December 2015
London South Bank University
 
Data Management and Broader Impacts: a holistic approach
Data Management and Broader Impacts: a holistic approachData Management and Broader Impacts: a holistic approach
Data Management and Broader Impacts: a holistic approach
Megan O'Donnell
 
Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)
Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)
Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)
dri_ireland
 
Simon hodson
Simon hodsonSimon hodson
Digital curation for postgraduate students
Digital curation for postgraduate studentsDigital curation for postgraduate students
Digital curation for postgraduate students
Sarah Jones
 
Kotarski2011
Kotarski2011Kotarski2011
Next generation data services at the Marriott Library
Next generation data services at the Marriott LibraryNext generation data services at the Marriott Library
Next generation data services at the Marriott Library
Rebekah Cummings
 
DMP health sciences
DMP health sciencesDMP health sciences
DMP health sciences
Sarah Jones
 
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
University of California Curation Center
 

Similar to Data science education resources for everyone (20)

Open Educational Resources for Big Data Science
Open Educational Resources for Big Data ScienceOpen Educational Resources for Big Data Science
Open Educational Resources for Big Data Science
 
Teaching Data Science to Undergraduate Students
Teaching Data Science to Undergraduate StudentsTeaching Data Science to Undergraduate Students
Teaching Data Science to Undergraduate Students
 
Rachel Bruce UK research and data management where are we now
Rachel Bruce UK research and data management where are we nowRachel Bruce UK research and data management where are we now
Rachel Bruce UK research and data management where are we now
 
Critical infrastructure to promote data synthesis
Critical infrastructure to promote data synthesis Critical infrastructure to promote data synthesis
Critical infrastructure to promote data synthesis
 
Supporting Research Data Management in UK Universities: the Jisc Managing Res...
Supporting Research Data Management in UK Universities: the Jisc Managing Res...Supporting Research Data Management in UK Universities: the Jisc Managing Res...
Supporting Research Data Management in UK Universities: the Jisc Managing Res...
 
INCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLAN
INCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLANINCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLAN
INCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLAN
 
Read Surkis Facilitating Development of Research Data Services
Read Surkis Facilitating Development of Research Data ServicesRead Surkis Facilitating Development of Research Data Services
Read Surkis Facilitating Development of Research Data Services
 
Data Science for Every Student at RPI
Data Science for Every Student at RPIData Science for Every Student at RPI
Data Science for Every Student at RPI
 
Institutional Data Management Blueprint
Institutional Data Management BlueprintInstitutional Data Management Blueprint
Institutional Data Management Blueprint
 
Libraries and Research Data Management – What Works? Lessons Learned from the...
Libraries and Research Data Management – What Works? Lessons Learned from the...Libraries and Research Data Management – What Works? Lessons Learned from the...
Libraries and Research Data Management – What Works? Lessons Learned from the...
 
Open Access to Research Data: Challenges and Solutions
Open Access to Research Data: Challenges and SolutionsOpen Access to Research Data: Challenges and Solutions
Open Access to Research Data: Challenges and Solutions
 
Briefing on Research Data Management at LSBU December 2015
Briefing on Research Data Management at LSBU December 2015Briefing on Research Data Management at LSBU December 2015
Briefing on Research Data Management at LSBU December 2015
 
Data Management and Broader Impacts: a holistic approach
Data Management and Broader Impacts: a holistic approachData Management and Broader Impacts: a holistic approach
Data Management and Broader Impacts: a holistic approach
 
Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)
Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)
Martin Donnelly - Digital Data Curation at the Digital Curation Centre (DH2016)
 
Simon hodson
Simon hodsonSimon hodson
Simon hodson
 
Digital curation for postgraduate students
Digital curation for postgraduate studentsDigital curation for postgraduate students
Digital curation for postgraduate students
 
Kotarski2011
Kotarski2011Kotarski2011
Kotarski2011
 
Next generation data services at the Marriott Library
Next generation data services at the Marriott LibraryNext generation data services at the Marriott Library
Next generation data services at the Marriott Library
 
DMP health sciences
DMP health sciencesDMP health sciences
DMP health sciences
 
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
 

Recently uploaded

BBR 2024 Summer Sessions Interview Training
BBR  2024 Summer Sessions Interview TrainingBBR  2024 Summer Sessions Interview Training
BBR 2024 Summer Sessions Interview Training
Katrina Pritchard
 
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Dr. Vinod Kumar Kanvaria
 
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdfবাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
eBook.com.bd (প্রয়োজনীয় বাংলা বই)
 
MARY JANE WILSON, A “BOA MÃE” .
MARY JANE WILSON, A “BOA MÃE”           .MARY JANE WILSON, A “BOA MÃE”           .
MARY JANE WILSON, A “BOA MÃE” .
Colégio Santa Teresinha
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
Scholarhat
 
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdfANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
Priyankaranawat4
 
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
National Information Standards Organization (NISO)
 
How to Build a Module in Odoo 17 Using the Scaffold Method
How to Build a Module in Odoo 17 Using the Scaffold MethodHow to Build a Module in Odoo 17 Using the Scaffold Method
How to Build a Module in Odoo 17 Using the Scaffold Method
Celine George
 
South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)
Academy of Science of South Africa
 
A Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdfA Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdf
Jean Carlos Nunes Paixão
 
How to Setup Warehouse & Location in Odoo 17 Inventory
How to Setup Warehouse & Location in Odoo 17 InventoryHow to Setup Warehouse & Location in Odoo 17 Inventory
How to Setup Warehouse & Location in Odoo 17 Inventory
Celine George
 
DRUGS AND ITS classification slide share
DRUGS AND ITS classification slide shareDRUGS AND ITS classification slide share
DRUGS AND ITS classification slide share
taiba qazi
 
How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17
Celine George
 
Pengantar Penggunaan Flutter - Dart programming language1.pptx
Pengantar Penggunaan Flutter - Dart programming language1.pptxPengantar Penggunaan Flutter - Dart programming language1.pptx
Pengantar Penggunaan Flutter - Dart programming language1.pptx
Fajar Baskoro
 
The simplified electron and muon model, Oscillating Spacetime: The Foundation...
The simplified electron and muon model, Oscillating Spacetime: The Foundation...The simplified electron and muon model, Oscillating Spacetime: The Foundation...
The simplified electron and muon model, Oscillating Spacetime: The Foundation...
RitikBhardwaj56
 
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptxC1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
mulvey2
 
How to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRMHow to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRM
Celine George
 
The basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptxThe basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptx
heathfieldcps1
 
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama UniversityNatural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
Akanksha trivedi rama nursing college kanpur.
 
PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.
Dr. Shivangi Singh Parihar
 

Recently uploaded (20)

BBR 2024 Summer Sessions Interview Training
BBR  2024 Summer Sessions Interview TrainingBBR  2024 Summer Sessions Interview Training
BBR 2024 Summer Sessions Interview Training
 
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
 
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdfবাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
বাংলাদেশ অর্থনৈতিক সমীক্ষা (Economic Review) ২০২৪ UJS App.pdf
 
MARY JANE WILSON, A “BOA MÃE” .
MARY JANE WILSON, A “BOA MÃE”           .MARY JANE WILSON, A “BOA MÃE”           .
MARY JANE WILSON, A “BOA MÃE” .
 
Azure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHatAzure Interview Questions and Answers PDF By ScholarHat
Azure Interview Questions and Answers PDF By ScholarHat
 
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdfANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
ANATOMY AND BIOMECHANICS OF HIP JOINT.pdf
 
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
 
How to Build a Module in Odoo 17 Using the Scaffold Method
How to Build a Module in Odoo 17 Using the Scaffold MethodHow to Build a Module in Odoo 17 Using the Scaffold Method
How to Build a Module in Odoo 17 Using the Scaffold Method
 
South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)South African Journal of Science: Writing with integrity workshop (2024)
South African Journal of Science: Writing with integrity workshop (2024)
 
A Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdfA Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdf
 
How to Setup Warehouse & Location in Odoo 17 Inventory
How to Setup Warehouse & Location in Odoo 17 InventoryHow to Setup Warehouse & Location in Odoo 17 Inventory
How to Setup Warehouse & Location in Odoo 17 Inventory
 
DRUGS AND ITS classification slide share
DRUGS AND ITS classification slide shareDRUGS AND ITS classification slide share
DRUGS AND ITS classification slide share
 
How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17
 
Pengantar Penggunaan Flutter - Dart programming language1.pptx
Pengantar Penggunaan Flutter - Dart programming language1.pptxPengantar Penggunaan Flutter - Dart programming language1.pptx
Pengantar Penggunaan Flutter - Dart programming language1.pptx
 
The simplified electron and muon model, Oscillating Spacetime: The Foundation...
The simplified electron and muon model, Oscillating Spacetime: The Foundation...The simplified electron and muon model, Oscillating Spacetime: The Foundation...
The simplified electron and muon model, Oscillating Spacetime: The Foundation...
 
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptxC1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
 
How to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRMHow to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRM
 
The basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptxThe basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptx
 
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama UniversityNatural birth techniques - Mrs.Akanksha Trivedi Rama University
Natural birth techniques - Mrs.Akanksha Trivedi Rama University
 
PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.PCOS corelations and management through Ayurveda.
PCOS corelations and management through Ayurveda.
 

Data science education resources for everyone

  • 1. Data science education resources for everyone Nicole Vasilevsky, Jackie Wirz, Bjorn Pederson, Ted Laderas, Shannon McWeeney, William Hersh, David A. Dorr, Melissa Haendel Oregon Health & Science University MLA/PNC 2016
  • 2. The problem Major challenge: how to manage, analyze and interpret vast amounts of data being generated in biomedical research One goal of NIH Big Data to Knowledge (BD2K) initiative: provide training for students and researchers to address this Research team in the Library and Department of Medical Informatics and Clinical Epidemiology (DMICE) is developing skills courses and open educational resources (OERs) http://whelf.ac.uk/activity-data-delivering-benefits-from-the-data-deluge/
  • 3. Our Approach Skills courses and OERs connect the dots that help researchers understand how to apply data science techniques in the context of their whole research life cycle Skills courses and OER topics are aimed to fill specific gaps
  • 4. BD2K Skills Courses Taught by BD2K Faculty, Post-doc and Staff In person format Targeted to a variety of students
  • 5. Defining The Problem Wrangling Data Data Identification And Resources  Problems amenable to analytics  Importance of question  Team definitions  Scope  When we do this wrong: methods don't match  Finding the right data  Search methods  Use of metadata  Data management  Exploratory Data Analysis  Data Dictionary  As you touch data, what can go wrong? Methods, Tools And Analysis Scientific Communication  Visualization  Matching algorithms to problems …  Reporting Findings and Limitations  Giving “Elevator Speech” on ideas of how to approach problem  Critique of related problem
  • 6. Course Offerings Course Length Who WhatWhen Intro Course Week long course (~40 hrs) July 2015 Interns and undergraduates Taught basics of data science in the context of the research life cycle Data After Dark 2 evening course (4 hrs/nt) January 2016 OHSU students, staff and faculty Emerging data science activities/research impact Data and Donuts 2 morning course (3 hrs/day) June 2016 OHSU Summer interns Basics of data science Advanced Course 4 evening course (2 hrs/nt) May 2016 OHSU students, staff and faculty Hands on Data viz / Data wrangling Data and Donuts West 4 hour course July 2016 OHSU summer interns (West Campus) Basics of data science
  • 7. Think like a data scientis t - the Data and Donuts workshop will provide an introduction to data science for those new to research. Summer interns encouraged to attend! Topics covered will include • What is Big Data? • Asking the right question and getting the right answers from your data • Finding data resources in the real world • Data handling 101 • Ethics of data• Communicating your science for maximal impact June 2 8 & 2 9 | 9 - 1 2 PM | D onut s! Fr ee Wor kshop! DataAndDonuts Interested? Register at http : // bit.ly/ 1sfDeXz or email wirzj@ohsu.eduw w w .ohsu.edu/ bd2 k Hands-on! Learn by Doing! Join us for a 4 evening workshop: · Data Wrangling with Python and Pandas · Interactive visualization with R/ Shiny · Supervised Learning Algorithms + Kaggle Challenge Familiarity with R and Git is required. Bring your laptop! ! May 23-26th 5-7pm Register at http:/ / bit.ly/ 1pFVvLv Department of Medical Informatics + Clinical Epidemiology + OHSU Library Funding: NIH 5R25EB020379 For more information, e-mail bd2k@ohsu.edu FREE OHSU BD2K ADVANCED DATA AFTER DARK WORKSHOP
  • 8. Evaluation of Skills Courses 0% 20% 40% 60% 80% 100% Evaluation Summary from Beginnner Students Beginner Percent 6 & 7 Beginner Percent 3, 4 & 5 Beginner Percent 1 & 2 0% 20% 40% 60% 80% 100% Evaluation Summary from Advanced Students Advanced Percent 6 & 7 Advanced Percent 3, 4 & 5 Advanced Percent 1 & 2 The instructors clearly presented the skills to be learned The instructors presented content in an organized manner The instructors effectively presented concepts and techniques
  • 9. OER Modules 01 | Biomedical Big Data Science 02 | Introduction to Big Data in Biology and Medicine 03 | Ethical Issues in Use of Big Data 04 | Clinical Standards Related to Big Data 05 | Basic Research Data Standards 06 | Public Health and Big Data 07 | Team Science 08 | Secondary Use (Reuse) of Clinical Data 09 | Publication and Peer Review 10 | Information Retrieval 11 | Version Control and Identifiers 12 | Data annotation and curation 13 | Data Tools and Landscape 14 | Ontologies 101 15 | Data metadata and provenance 16 | Semantic data interoperability 17 | Choice of Algorithms and Algorithm Dynamics 18 | Visualization and Interpretation 19 | Replication, Validation and the spectrum of Reproducibility Semantic data interoperability 20 | Regulatory Issues in Big Data for Genomics and Health Semantic Web data 21 | Hosting data dissemination and data stewardship workshops 22 | Hosting data dissemination and data stewardship workshops 23 | Terminology of Biomedical, Clinical, and Translational Research 24 | Computing Concepts for Big Data 25 | Data modeling 26 | Semantic Web data 27 | Context-based selection of data 28 | Translating the Question 29 | Implications of Provenance and Pre-processing 30 | Data tells a story 31 | Statistical Significance, P-hacking and Multiple-testing 32 | Displaying Confidence and Uncertainty https://dmice.ohsu.edu/bd2k/topics.html
  • 10. What is available in the modules? Module Overview Online viewing Powerpoint files Audio files Exercises References Resources
  • 11. MLA- Professional Competencies For Health Sciences Librarians https://dmice.ohsu.edu/bd2k/mapping_MLA.html Competency #1 Understand the health sciences and health care environment and the policies, issues, and trends that impact that environment BDK02 - Introduction To Big Data In Biology And Medicine BDK03 - Ethical Issues In Use Of Big Data BDK07- Team Science Competency #3 Understand the principles and practices related to providing information services to meet users' needs BDK10 - Information Retrieval BDK22 - Guidelines For Reporting, Publications, And Data Sharing Competency #4 Have the ability to manage health information resources in a broad range of formats BDK09 - Publication And Peer Review BDK12 - Data Annotation And Curation BDK14 - Ontologies 101 BDK15 - Data Metadata And Provenance Competency #5 Understand and use technology and systems to manage all forms of information BDK10 - Information Retrieval BDK12 - Data Annotation And Curation BDK13 - Data and tools landscape BDK14 - Ontologies 101 BDK26 - Introduction to Semantic Web data Competency #6 Understand curricular design and instruction and have the ability to teach ways to access, organize, and use information BDK21 - Hosting Data Dissemination And Data Stewardship Workshops Competency #7 Understand scientific research methods and have the ability to critically examine and filter research literature from many related disciplines BDK07- Team Science BDK18 - Visualization And Interpretation BDK19 - Replication, Validation And The Spectrum Of Reproducibility BDK01 - Biomedical Big Data Science BDK04 - Clinical Data And Standards Related To Big Data BDK05 - Basic Research Data Standards BDK04 - Clinical Data And Standards Related To Big Data BDK05 - Basic Research Data Standards
  • 12. Challenges Scope Images Style Dissemination How to scope generic curricula for different levels of users How to translate diverse teaching styles into general materials How to maximize dissemination while protecting intellectual property How to incorporate images and other copyrighted materials into open resources
  • 13. Who are these resources for? EVERYONE! thenounproject.com Undergraduate Students Graduate Students Clinicians Post-docs Librarians Staff Faculty
  • 14. Help review our modules: https://dmice.ohsu.edu/bd2k/topics.html
  • 15. Acknowledgements Bill Hersh, PI Melissa Haendel, PI Shannon McWeeney, PI David Dorr, PI Ted Laderas, Instructor Jackie Wirz, Instructor Nicole Vasilevsky, Instructor Bjorn Pederson, Instructional Designer This work is supported by NIH Grants 1R25EB020379-01 and 1R25GM114820-01.
  • 16. You can find me at: @n_vasilevsky vasilevs@ohsu.edu Thanks!

Editor's Notes

  1. I won’t read all the content on this slide, the point will just be that we mapped the MLA professional competencies to the BD2K modules. For 6 of the 7 MLA professional competencies, there are BD2K modules that could help train Librarians in these areas.