SlideShare a Scribd company logo
CSC – Suomalainen tutkimuksen, koulutuksen ja julkishallinnon ICT-osaamiskeskus
Get the most out of your data! Data
publication, tracking and citation
Jessica PvE 28.2.2018
Working with data: amount of work
28.2.20182
Data Science Report 2016 http://visit.crowdflower.com/rs/416-ZBE-
142/images/CrowdFlower_DataScienceReport_2016.pdf
Monya Baker: 1,500 scientists lift the lid on reproducibility.
Survey sheds light on the ‘crisis’ rocking research. Nature 533,
2016. doi:10.1038/533452a
Data citation
• Project by Finnish Committee for Research
• Research data should be FAIR
• Data should have: Creator, title, publisher, publication
time, identifier
• Recommended additional information: Version,
resource type, copyright status
28.2.20184
28.2.20185
FCRD
https://www.fcrd.fi/data-citation/
• Include principles of data as evidence
and data transparency in next version
of Finnish RCR guideline byTENK.
• Recognise data authorship as a
distinct issue and discussion in the
TENK authorship guideline
http://www.tenk.fi/sites/tenk.fi/files/TENK_suositus_tek
ijyys.pdf
• Create a multi-institutional,
multidisciplinary working group to
define principles for defining data
authorship, coordinated f.e. byTENK,
OR assign national representation to
a relevant international activity with
the same goal.
• Organize multidisciplinary
discussion on data management
and citation, with the aim of
creating interoperable practices.
• Promote the use of data reference
model also when referring to
authors own primary source data.
• Define field specific level of
granularity for data citation.
Recommendations
6
• data licensing
• metadata
standards
• open data
formats
• shared
vocabularies
• informed decisions
to share data
• access rights
• descriptive
metadata
• persistent
identifiers
Findable Accessible
ReusableInter-operable
force11.org/group/fairgroup/fairprinciples
FAIR
Findable
Accessible
Interoperable
Reusable
By Martin Grandjean - Own work : http://www.martingrandjean.ch/wp-content/uploads/2013/10/Graphe3.png, CC BY-SA 3.0,
https://commons.wikimedia.org/w/index.php?curid=29364647
1.(Meta)data are assigned a globally
unique and persistent identifier
2. Data are described with rich metadata
3. Metadata clearly and explicitly include
the identifier of the data it describes
4. (Meta)data are registered or indexed in
a searchable resource
URN
1.(Meta)data are retrievable by their
identifier using a standardized
communications protocol
2.Metadata are accessible, even when
the data are no longer available
1.(Meta)data use a formal, accessible,
shared, and broadly applicable language
for knowledge representation.
2.(Meta)data use vocabularies that
follow FAIR principles
3. (Meta)data include qualified
references to other (meta)data
Meta(data) are richly described with a
plurality of accurate and relevant
attributes
Persistent Identifiers (DOI, URN)
Data Catalog
Resolver
PID
Data file
License
Configuration
files
Read me
Landing page
Nano publications, linking data and compact identifiers
• https://www.go-fair.org/
• http://identifiers.org/
28.2.201814
Schofield et al (2015) http://digitalhumanities.org:8081/dhq/vol/9/3/000227/000227.html
Rutgers lib guide
Alan O’Rourke CC-BY
audiencestack.com
Organising your data
• Sort and classify your information
• For instance: don’t mix different types of information in excel
columns: it is usually easier to combine datasets than sort out
ill structured data later
• Think about granularity (file size) and metadata
• Decide on formats, units, codes etc and be consequent
28.2.201817
Organising your data
• Write a code book, document
• Think about intelligibility
• Be careful when rearranging, reformatting, sorting or copy-
pasting data
• Use common file formats, preferably open
• Have processes in place for checking the data quality and
completeness
• Be clear about master copies and other copies
28.2.201818
Organising your data
• Be careful and plan well for sensitive data and anonymization
• Think about security and access rights
• Plan and agree on which versions of a dataset will be archived
and/or published
• Think about reproducibility and citing data
28.2.201819
Files and folders
• Create and agree on a system for naming files and folders and
be consequent
• Avoid very deep folder structures, since they can be difficult to
handle
• Use meaningful, unique file and folder names
• Keep names as short as possible but relevant. 25 characters is
usually considered maximum.
28.2.201820
Files and folders
• Dates inYYYY-MM-DD format allows you to sort and search
your files
• Avoid using special characters such as % & /  : ; * . ? < > ^! “ ()
and Scandinavians
• Use three digits (or 4 if you have a large number of files) i.e.
001, 002…….201, 202 (not 1, 2, 21).
• Use underscores (_) instead of spaces
28.2.201821
Files and folders
• If using a personal name in the name give the surname first
followed by first name.
• Though, be very careful with personal data when naming files
and folders
• Indicate version number by using ‘V’ or “version” and number
(and subversions with more digits if minor changes)
28.2.201822
Types of data in research
Open Data Generic Research Data Fixed Research
Datasets
Description • Public data for any
use, commercial
• May be dynamic
• PSI
• Mature data
products
• Not produced
/monitored by
researchers
• Ex. Weather data,
public transport
• Produced
by/with/for
researchers
• Validated, good
quality
• Well documented
• Raw or processed
• Datasets
stable/version
controlled
• Ex. Corpus, SMEAR
• Produced for
specific research
question
• Underpins
research, for
replication
• Might be very
processed
• Reuse value
might be low
unless mature
field
Format Stable, standardised May vary over time Must be preserved
according to plan
28.2.201823
License and publish your data
• FSDTietoarkisto
• Language Bank of Finland
• Figshare
• Zenodo
• EUDAT
• IDA and Etsin
28.2.201824
pexels
https://www.facebook.com/CSCfi
https://twitter.com/CSCfi
https://www.youtube.com/c/CSCfi
https://www.linkedin.com/company/csc---it-center-for-science
Thank you!
Jessica Parland-von Essen
Parland (at) csc.fi
https://orcid.org/0000-0003-4460-3906
28.2.201826

More Related Content

What's hot

A basic course on Reseach data management, part 2: protecting and organizing ...
A basic course on Reseach data management, part 2: protecting and organizing ...A basic course on Reseach data management, part 2: protecting and organizing ...
A basic course on Reseach data management, part 2: protecting and organizing ...
Leon Osinski
 
A basic course on Research data management, part 3: sharing your data
A basic course on Research data management, part 3: sharing your dataA basic course on Research data management, part 3: sharing your data
A basic course on Research data management, part 3: sharing your data
Leon Osinski
 
EDI Training Module 4: Organizing Data Into Publishable Units
EDI Training Module 4: Organizing Data Into Publishable UnitsEDI Training Module 4: Organizing Data Into Publishable Units
EDI Training Module 4: Organizing Data Into Publishable Units
Environmental Data Initiative
 
DataONE Education Module 01: Why Data Management?
DataONE Education Module 01: Why Data Management?DataONE Education Module 01: Why Data Management?
DataONE Education Module 01: Why Data Management?
DataONE
 
DataONE Education Module 10: Legal and Policy Issues
DataONE Education Module 10: Legal and Policy IssuesDataONE Education Module 10: Legal and Policy Issues
DataONE Education Module 10: Legal and Policy Issues
DataONE
 
Research data management at TU Eindhoven
Research data management at TU EindhovenResearch data management at TU Eindhoven
Research data management at TU Eindhoven
Leon Osinski
 
data citation
data citationdata citation
data citation
Sarah Jones
 
EDI Training Module 12: Learn to Cite and Link Your Data
EDI Training Module 12:  Learn to Cite and Link Your DataEDI Training Module 12:  Learn to Cite and Link Your Data
EDI Training Module 12: Learn to Cite and Link Your Data
Environmental Data Initiative
 
DataONE Education Module 03: Data Management Planning
DataONE Education Module 03: Data Management PlanningDataONE Education Module 03: Data Management Planning
DataONE Education Module 03: Data Management Planning
DataONE
 
What funders want you to do with your data
What funders want you to do with your dataWhat funders want you to do with your data
What funders want you to do with your data
Leon Osinski
 
Metadata lecture(9 17-14)
Metadata lecture(9 17-14)Metadata lecture(9 17-14)
Metadata lecture(9 17-14)
mhb120
 
Advantages of metadata
Advantages of metadataAdvantages of metadata
Advantages of metadata
Azeem Sultan
 
Ag Data Commons: Agricultural research metadata and data
Ag Data Commons: Agricultural research metadata and dataAg Data Commons: Agricultural research metadata and data
Ag Data Commons: Agricultural research metadata and data
Cyndy Parr
 
Dataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* DataDataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* Data
Tom Plasterer
 
Va sla nov 15 final
Va sla nov 15 finalVa sla nov 15 final
Va sla nov 15 final
Margaret Henderson
 
FAIR Data Knowledge Graphs
FAIR Data Knowledge GraphsFAIR Data Knowledge Graphs
FAIR Data Knowledge Graphs
Tom Plasterer
 
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...
dkNET
 
Essentials 4 Data Support: a fine course in FAIR Data Support
Essentials 4 Data Support: a fine course in FAIR Data SupportEssentials 4 Data Support: a fine course in FAIR Data Support
Essentials 4 Data Support: a fine course in FAIR Data Support
Ellen Verbakel
 
You down with dmp yeah you know me!
You down with dmp  yeah you know me!You down with dmp  yeah you know me!
You down with dmp yeah you know me!
Renaine Julian
 
Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...
Historic Environment Scotland
 

What's hot (20)

A basic course on Reseach data management, part 2: protecting and organizing ...
A basic course on Reseach data management, part 2: protecting and organizing ...A basic course on Reseach data management, part 2: protecting and organizing ...
A basic course on Reseach data management, part 2: protecting and organizing ...
 
A basic course on Research data management, part 3: sharing your data
A basic course on Research data management, part 3: sharing your dataA basic course on Research data management, part 3: sharing your data
A basic course on Research data management, part 3: sharing your data
 
EDI Training Module 4: Organizing Data Into Publishable Units
EDI Training Module 4: Organizing Data Into Publishable UnitsEDI Training Module 4: Organizing Data Into Publishable Units
EDI Training Module 4: Organizing Data Into Publishable Units
 
DataONE Education Module 01: Why Data Management?
DataONE Education Module 01: Why Data Management?DataONE Education Module 01: Why Data Management?
DataONE Education Module 01: Why Data Management?
 
DataONE Education Module 10: Legal and Policy Issues
DataONE Education Module 10: Legal and Policy IssuesDataONE Education Module 10: Legal and Policy Issues
DataONE Education Module 10: Legal and Policy Issues
 
Research data management at TU Eindhoven
Research data management at TU EindhovenResearch data management at TU Eindhoven
Research data management at TU Eindhoven
 
data citation
data citationdata citation
data citation
 
EDI Training Module 12: Learn to Cite and Link Your Data
EDI Training Module 12:  Learn to Cite and Link Your DataEDI Training Module 12:  Learn to Cite and Link Your Data
EDI Training Module 12: Learn to Cite and Link Your Data
 
DataONE Education Module 03: Data Management Planning
DataONE Education Module 03: Data Management PlanningDataONE Education Module 03: Data Management Planning
DataONE Education Module 03: Data Management Planning
 
What funders want you to do with your data
What funders want you to do with your dataWhat funders want you to do with your data
What funders want you to do with your data
 
Metadata lecture(9 17-14)
Metadata lecture(9 17-14)Metadata lecture(9 17-14)
Metadata lecture(9 17-14)
 
Advantages of metadata
Advantages of metadataAdvantages of metadata
Advantages of metadata
 
Ag Data Commons: Agricultural research metadata and data
Ag Data Commons: Agricultural research metadata and dataAg Data Commons: Agricultural research metadata and data
Ag Data Commons: Agricultural research metadata and data
 
Dataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* DataDataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* Data
 
Va sla nov 15 final
Va sla nov 15 finalVa sla nov 15 final
Va sla nov 15 final
 
FAIR Data Knowledge Graphs
FAIR Data Knowledge GraphsFAIR Data Knowledge Graphs
FAIR Data Knowledge Graphs
 
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...
 
Essentials 4 Data Support: a fine course in FAIR Data Support
Essentials 4 Data Support: a fine course in FAIR Data SupportEssentials 4 Data Support: a fine course in FAIR Data Support
Essentials 4 Data Support: a fine course in FAIR Data Support
 
You down with dmp yeah you know me!
You down with dmp  yeah you know me!You down with dmp  yeah you know me!
You down with dmp yeah you know me!
 
Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...
 

Similar to Research data management for historians

The challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can helpThe challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can help
Varsha Khodiyar
 
Research data management workshop april12 2016
Research data management workshop april12 2016 Research data management workshop april12 2016
Research data management workshop april12 2016
Rebecca Raworth, MLIS
 
Research data management workshop April 2016
Research data management workshop April 2016Research data management workshop April 2016
Research data management workshop April 2016
Rebecca Raworth, MLIS
 
Research data management: course 0HV90, Behavioral Research Methods
Research data management: course 0HV90, Behavioral Research MethodsResearch data management: course 0HV90, Behavioral Research Methods
Research data management: course 0HV90, Behavioral Research Methods
Leon Osinski
 
Data management (newest version)
Data management (newest version)Data management (newest version)
Data management (newest version)
Graça Gabriel
 
Documentation and Metdata - VA DM Bootcamp
Documentation and Metdata - VA DM BootcampDocumentation and Metdata - VA DM Bootcamp
Documentation and Metdata - VA DM Bootcamp
Sherry Lake
 
Introduction to data management
Introduction to data managementIntroduction to data management
Introduction to data management
Cunera Buys
 
Research data management : [part of] PROOF course Finding and controlling sci...
Research data management : [part of] PROOF course Finding and controlling sci...Research data management : [part of] PROOF course Finding and controlling sci...
Research data management : [part of] PROOF course Finding and controlling sci...
Leon Osinski
 
Data Archiving and Sharing
Data Archiving and SharingData Archiving and Sharing
Data Archiving and Sharing
C. Tobin Magle
 
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
BigData_Europe
 
Make your data great now
Make your data great nowMake your data great now
Make your data great now
Daniel JACOB
 
Managing data throughout the research lifecycle
Managing data throughout the research lifecycleManaging data throughout the research lifecycle
Managing data throughout the research lifecycle
Marieke Guy
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR Data
Anita de Waard
 
Data sharing as part of the research workflow
Data sharing as part of the research workflowData sharing as part of the research workflow
Data sharing as part of the research workflow
Varsha Khodiyar
 
Data peer review workshop
Data peer review workshopData peer review workshop
Data peer review workshop
Varsha Khodiyar
 
Introduction to research data management
Introduction to research data managementIntroduction to research data management
Introduction to research data management
dri_ireland
 
Good Practice in Research Data Management
Good Practice in Research Data ManagementGood Practice in Research Data Management
Good Practice in Research Data Management
Historic Environment Scotland
 
NISO Training Thursday Crafting a Scientific Data Management Plan
NISO Training Thursday Crafting a Scientific Data Management PlanNISO Training Thursday Crafting a Scientific Data Management Plan
NISO Training Thursday Crafting a Scientific Data Management Plan
National Information Standards Organization (NISO)
 
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE
 
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataA Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
Stuart Chalk
 

Similar to Research data management for historians (20)

The challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can helpThe challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can help
 
Research data management workshop april12 2016
Research data management workshop april12 2016 Research data management workshop april12 2016
Research data management workshop april12 2016
 
Research data management workshop April 2016
Research data management workshop April 2016Research data management workshop April 2016
Research data management workshop April 2016
 
Research data management: course 0HV90, Behavioral Research Methods
Research data management: course 0HV90, Behavioral Research MethodsResearch data management: course 0HV90, Behavioral Research Methods
Research data management: course 0HV90, Behavioral Research Methods
 
Data management (newest version)
Data management (newest version)Data management (newest version)
Data management (newest version)
 
Documentation and Metdata - VA DM Bootcamp
Documentation and Metdata - VA DM BootcampDocumentation and Metdata - VA DM Bootcamp
Documentation and Metdata - VA DM Bootcamp
 
Introduction to data management
Introduction to data managementIntroduction to data management
Introduction to data management
 
Research data management : [part of] PROOF course Finding and controlling sci...
Research data management : [part of] PROOF course Finding and controlling sci...Research data management : [part of] PROOF course Finding and controlling sci...
Research data management : [part of] PROOF course Finding and controlling sci...
 
Data Archiving and Sharing
Data Archiving and SharingData Archiving and Sharing
Data Archiving and Sharing
 
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
 
Make your data great now
Make your data great nowMake your data great now
Make your data great now
 
Managing data throughout the research lifecycle
Managing data throughout the research lifecycleManaging data throughout the research lifecycle
Managing data throughout the research lifecycle
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR Data
 
Data sharing as part of the research workflow
Data sharing as part of the research workflowData sharing as part of the research workflow
Data sharing as part of the research workflow
 
Data peer review workshop
Data peer review workshopData peer review workshop
Data peer review workshop
 
Introduction to research data management
Introduction to research data managementIntroduction to research data management
Introduction to research data management
 
Good Practice in Research Data Management
Good Practice in Research Data ManagementGood Practice in Research Data Management
Good Practice in Research Data Management
 
NISO Training Thursday Crafting a Scientific Data Management Plan
NISO Training Thursday Crafting a Scientific Data Management PlanNISO Training Thursday Crafting a Scientific Data Management Plan
NISO Training Thursday Crafting a Scientific Data Management Plan
 
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
 
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataA Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
 

More from Jessica Parland-von Essen

Planning a Finnish PID Roadmap
Planning a Finnish PID Roadmap Planning a Finnish PID Roadmap
Planning a Finnish PID Roadmap
Jessica Parland-von Essen
 
Tutkimusaineistojen kuvailu, metadata ja yhteentoimivuus
Tutkimusaineistojen kuvailu, metadata ja yhteentoimivuusTutkimusaineistojen kuvailu, metadata ja yhteentoimivuus
Tutkimusaineistojen kuvailu, metadata ja yhteentoimivuus
Jessica Parland-von Essen
 
Pid landscape in finland
Pid landscape in finlandPid landscape in finland
Pid landscape in finland
Jessica Parland-von Essen
 
Fairdata-palvelut ja tutkimusaineistojen pitkäaikaissäilytys
Fairdata-palvelut ja tutkimusaineistojen pitkäaikaissäilytysFairdata-palvelut ja tutkimusaineistojen pitkäaikaissäilytys
Fairdata-palvelut ja tutkimusaineistojen pitkäaikaissäilytys
Jessica Parland-von Essen
 
Open Science goes FAIR
Open Science goes FAIROpen Science goes FAIR
Open Science goes FAIR
Jessica Parland-von Essen
 
Metatiedot tunnisteet tutkimisdata
Metatiedot tunnisteet tutkimisdataMetatiedot tunnisteet tutkimisdata
Metatiedot tunnisteet tutkimisdata
Jessica Parland-von Essen
 
Towards a FAIR lifecycle
Towards a FAIR lifecycleTowards a FAIR lifecycle
Towards a FAIR lifecycle
Jessica Parland-von Essen
 
A Finnish perspective on FAIRsFAIR outputs
A Finnish perspective on FAIRsFAIR outputsA Finnish perspective on FAIRsFAIR outputs
A Finnish perspective on FAIRsFAIR outputs
Jessica Parland-von Essen
 
Persistence and Interoperability
Persistence and InteroperabilityPersistence and Interoperability
Persistence and Interoperability
Jessica Parland-von Essen
 
Collections meet the researcher. Digitalization, disintegration and disillusi...
Collections meet the researcher. Digitalization, disintegration and disillusi...Collections meet the researcher. Digitalization, disintegration and disillusi...
Collections meet the researcher. Digitalization, disintegration and disillusi...
Jessica Parland-von Essen
 
Supporting FAIR data principles with data categorization
Supporting FAIR data principles with data categorizationSupporting FAIR data principles with data categorization
Supporting FAIR data principles with data categorization
Jessica Parland-von Essen
 
FAIR data and the Etsin service
FAIR data and the Etsin serviceFAIR data and the Etsin service
FAIR data and the Etsin service
Jessica Parland-von Essen
 
Yhteiskuntatieteen aineistot
Yhteiskuntatieteen aineistotYhteiskuntatieteen aineistot
Yhteiskuntatieteen aineistot
Jessica Parland-von Essen
 
Avoimen suomen historia
Avoimen suomen historiaAvoimen suomen historia
Avoimen suomen historia
Jessica Parland-von Essen
 
Open Science Process
Open Science ProcessOpen Science Process
Open Science Process
Jessica Parland-von Essen
 
Tutkimusaineistoihiin viittaaminen, pysyvät tunnisteet ja linkittäminen
Tutkimusaineistoihiin viittaaminen, pysyvät tunnisteet ja linkittäminenTutkimusaineistoihiin viittaaminen, pysyvät tunnisteet ja linkittäminen
Tutkimusaineistoihiin viittaaminen, pysyvät tunnisteet ja linkittäminen
Jessica Parland-von Essen
 
Avoin tiede Suomessa
Avoin tiede SuomessaAvoin tiede Suomessa
Avoin tiede Suomessa
Jessica Parland-von Essen
 
Forskningsdataforhumanister
ForskningsdataforhumanisterForskningsdataforhumanister
Forskningsdataforhumanister
Jessica Parland-von Essen
 
Data Management in Research
Data Management in ResearchData Management in Research
Data Management in Research
Jessica Parland-von Essen
 

More from Jessica Parland-von Essen (20)

Planning a Finnish PID Roadmap
Planning a Finnish PID Roadmap Planning a Finnish PID Roadmap
Planning a Finnish PID Roadmap
 
Tutkimusaineistojen kuvailu, metadata ja yhteentoimivuus
Tutkimusaineistojen kuvailu, metadata ja yhteentoimivuusTutkimusaineistojen kuvailu, metadata ja yhteentoimivuus
Tutkimusaineistojen kuvailu, metadata ja yhteentoimivuus
 
Pid landscape in finland
Pid landscape in finlandPid landscape in finland
Pid landscape in finland
 
Fairdata-palvelut ja tutkimusaineistojen pitkäaikaissäilytys
Fairdata-palvelut ja tutkimusaineistojen pitkäaikaissäilytysFairdata-palvelut ja tutkimusaineistojen pitkäaikaissäilytys
Fairdata-palvelut ja tutkimusaineistojen pitkäaikaissäilytys
 
Open Science goes FAIR
Open Science goes FAIROpen Science goes FAIR
Open Science goes FAIR
 
Metatiedot tunnisteet tutkimisdata
Metatiedot tunnisteet tutkimisdataMetatiedot tunnisteet tutkimisdata
Metatiedot tunnisteet tutkimisdata
 
Towards a FAIR lifecycle
Towards a FAIR lifecycleTowards a FAIR lifecycle
Towards a FAIR lifecycle
 
A Finnish perspective on FAIRsFAIR outputs
A Finnish perspective on FAIRsFAIR outputsA Finnish perspective on FAIRsFAIR outputs
A Finnish perspective on FAIRsFAIR outputs
 
Persistence and Interoperability
Persistence and InteroperabilityPersistence and Interoperability
Persistence and Interoperability
 
Collections meet the researcher. Digitalization, disintegration and disillusi...
Collections meet the researcher. Digitalization, disintegration and disillusi...Collections meet the researcher. Digitalization, disintegration and disillusi...
Collections meet the researcher. Digitalization, disintegration and disillusi...
 
Supporting FAIR data principles with data categorization
Supporting FAIR data principles with data categorizationSupporting FAIR data principles with data categorization
Supporting FAIR data principles with data categorization
 
FAIR data and the Etsin service
FAIR data and the Etsin serviceFAIR data and the Etsin service
FAIR data and the Etsin service
 
Yhteiskuntatieteen aineistot
Yhteiskuntatieteen aineistotYhteiskuntatieteen aineistot
Yhteiskuntatieteen aineistot
 
Avoimen suomen historia
Avoimen suomen historiaAvoimen suomen historia
Avoimen suomen historia
 
Open Science Process
Open Science ProcessOpen Science Process
Open Science Process
 
Tutkimusaineistoihiin viittaaminen, pysyvät tunnisteet ja linkittäminen
Tutkimusaineistoihiin viittaaminen, pysyvät tunnisteet ja linkittäminenTutkimusaineistoihiin viittaaminen, pysyvät tunnisteet ja linkittäminen
Tutkimusaineistoihiin viittaaminen, pysyvät tunnisteet ja linkittäminen
 
AffarerAllianserAnseende
AffarerAllianserAnseendeAffarerAllianserAnseende
AffarerAllianserAnseende
 
Avoin tiede Suomessa
Avoin tiede SuomessaAvoin tiede Suomessa
Avoin tiede Suomessa
 
Forskningsdataforhumanister
ForskningsdataforhumanisterForskningsdataforhumanister
Forskningsdataforhumanister
 
Data Management in Research
Data Management in ResearchData Management in Research
Data Management in Research
 

Recently uploaded

Girls Call Vadodara 000XX00000 Provide Best And Top Girl Service And No1 in City
Girls Call Vadodara 000XX00000 Provide Best And Top Girl Service And No1 in CityGirls Call Vadodara 000XX00000 Provide Best And Top Girl Service And No1 in City
Girls Call Vadodara 000XX00000 Provide Best And Top Girl Service And No1 in City
gargnatasha985
 
potential usefulness of multi-agent maze-solving in general
potential usefulness of multi-agent maze-solving in generalpotential usefulness of multi-agent maze-solving in general
potential usefulness of multi-agent maze-solving in general
huseindihon
 
Research proposal seminar ,Research Methodology
Research proposal seminar ,Research MethodologyResearch proposal seminar ,Research Methodology
Research proposal seminar ,Research Methodology
doctorzlife786
 
DU degree offer diploma Transcript
DU degree offer diploma TranscriptDU degree offer diploma Transcript
DU degree offer diploma Transcript
uapta
 
ch8_multiplexing cs553 st07 slide share ss
ch8_multiplexing cs553 st07 slide share ssch8_multiplexing cs553 st07 slide share ss
ch8_multiplexing cs553 st07 slide share ss
MinThetLwin1
 
VIP Girls Call Mumbai 9910780858 Provide Best And Top Girl Service And No1 in...
VIP Girls Call Mumbai 9910780858 Provide Best And Top Girl Service And No1 in...VIP Girls Call Mumbai 9910780858 Provide Best And Top Girl Service And No1 in...
VIP Girls Call Mumbai 9910780858 Provide Best And Top Girl Service And No1 in...
44annissa
 
Fine-Tuning of Small/Medium LLMs for Business QA on Structured Data
Fine-Tuning of Small/Medium LLMs for Business QA on Structured DataFine-Tuning of Small/Medium LLMs for Business QA on Structured Data
Fine-Tuning of Small/Medium LLMs for Business QA on Structured Data
kevig
 
the potential of the development of the Ford–Fulkerson algorithm to solve the...
the potential of the development of the Ford–Fulkerson algorithm to solve the...the potential of the development of the Ford–Fulkerson algorithm to solve the...
the potential of the development of the Ford–Fulkerson algorithm to solve the...
huseindihon
 
CHAPTER-1-Introduction-to-Marketing.pptx
CHAPTER-1-Introduction-to-Marketing.pptxCHAPTER-1-Introduction-to-Marketing.pptx
CHAPTER-1-Introduction-to-Marketing.pptx
girewiy968
 
DataScienceConcept_Kanchana_Weerasinghe.pptx
DataScienceConcept_Kanchana_Weerasinghe.pptxDataScienceConcept_Kanchana_Weerasinghe.pptx
DataScienceConcept_Kanchana_Weerasinghe.pptx
Kanchana Weerasinghe
 
BDSM Girls Call Delhi 🎈🔥9711199171 🔥💋🎈 Provide Best And Top Girl Service And ...
BDSM Girls Call Delhi 🎈🔥9711199171 🔥💋🎈 Provide Best And Top Girl Service And ...BDSM Girls Call Delhi 🎈🔥9711199171 🔥💋🎈 Provide Best And Top Girl Service And ...
BDSM Girls Call Delhi 🎈🔥9711199171 🔥💋🎈 Provide Best And Top Girl Service And ...
fatima shekh$A17
 
Mumbai Girls Call Mumbai 🛵🚡9910780858 💃 Choose Best And Top Girl Service And ...
Mumbai Girls Call Mumbai 🛵🚡9910780858 💃 Choose Best And Top Girl Service And ...Mumbai Girls Call Mumbai 🛵🚡9910780858 💃 Choose Best And Top Girl Service And ...
Mumbai Girls Call Mumbai 🛵🚡9910780858 💃 Choose Best And Top Girl Service And ...
norina2645
 
Celebrity Girls Call Andheri 9930245274 Unlimited Short Providing Girls Servi...
Celebrity Girls Call Andheri 9930245274 Unlimited Short Providing Girls Servi...Celebrity Girls Call Andheri 9930245274 Unlimited Short Providing Girls Servi...
Celebrity Girls Call Andheri 9930245274 Unlimited Short Providing Girls Servi...
revolutionary575
 
NPS_Presentation_V3.pptx it is regarding National pension scheme
NPS_Presentation_V3.pptx it is regarding National pension schemeNPS_Presentation_V3.pptx it is regarding National pension scheme
NPS_Presentation_V3.pptx it is regarding National pension scheme
ASISHSABAT3
 
Coimbatore Girls call Service 000XX00000 Provide Best And Top Girl Service An...
Coimbatore Girls call Service 000XX00000 Provide Best And Top Girl Service An...Coimbatore Girls call Service 000XX00000 Provide Best And Top Girl Service An...
Coimbatore Girls call Service 000XX00000 Provide Best And Top Girl Service An...
vrvipin164
 
Nipissing University degree offer Nipissing diploma Transcript
Nipissing University degree offer Nipissing diploma TranscriptNipissing University degree offer Nipissing diploma Transcript
Nipissing University degree offer Nipissing diploma Transcript
zyqedad
 
High Girls Call Nagpur 000XX00000 Provide Best And Top Girl Service And No1 i...
High Girls Call Nagpur 000XX00000 Provide Best And Top Girl Service And No1 i...High Girls Call Nagpur 000XX00000 Provide Best And Top Girl Service And No1 i...
High Girls Call Nagpur 000XX00000 Provide Best And Top Girl Service And No1 i...
saadkhan1485265
 
Female Service Girls Call Navi Mumbai 9930245274 Provide Best And Top Girl Se...
Female Service Girls Call Navi Mumbai 9930245274 Provide Best And Top Girl Se...Female Service Girls Call Navi Mumbai 9930245274 Provide Best And Top Girl Se...
Female Service Girls Call Navi Mumbai 9930245274 Provide Best And Top Girl Se...
dizzycaye
 
Beautiful Girls Call Pune 000XX00000 Provide Best And Top Girl Service And No...
Beautiful Girls Call Pune 000XX00000 Provide Best And Top Girl Service And No...Beautiful Girls Call Pune 000XX00000 Provide Best And Top Girl Service And No...
Beautiful Girls Call Pune 000XX00000 Provide Best And Top Girl Service And No...
birajmohan012
 
Welcome back to Instagram. Sign in to check out what your
Welcome back to Instagram. Sign in to check out what yourWelcome back to Instagram. Sign in to check out what your
Welcome back to Instagram. Sign in to check out what your
Virni Arrora
 

Recently uploaded (20)

Girls Call Vadodara 000XX00000 Provide Best And Top Girl Service And No1 in City
Girls Call Vadodara 000XX00000 Provide Best And Top Girl Service And No1 in CityGirls Call Vadodara 000XX00000 Provide Best And Top Girl Service And No1 in City
Girls Call Vadodara 000XX00000 Provide Best And Top Girl Service And No1 in City
 
potential usefulness of multi-agent maze-solving in general
potential usefulness of multi-agent maze-solving in generalpotential usefulness of multi-agent maze-solving in general
potential usefulness of multi-agent maze-solving in general
 
Research proposal seminar ,Research Methodology
Research proposal seminar ,Research MethodologyResearch proposal seminar ,Research Methodology
Research proposal seminar ,Research Methodology
 
DU degree offer diploma Transcript
DU degree offer diploma TranscriptDU degree offer diploma Transcript
DU degree offer diploma Transcript
 
ch8_multiplexing cs553 st07 slide share ss
ch8_multiplexing cs553 st07 slide share ssch8_multiplexing cs553 st07 slide share ss
ch8_multiplexing cs553 st07 slide share ss
 
VIP Girls Call Mumbai 9910780858 Provide Best And Top Girl Service And No1 in...
VIP Girls Call Mumbai 9910780858 Provide Best And Top Girl Service And No1 in...VIP Girls Call Mumbai 9910780858 Provide Best And Top Girl Service And No1 in...
VIP Girls Call Mumbai 9910780858 Provide Best And Top Girl Service And No1 in...
 
Fine-Tuning of Small/Medium LLMs for Business QA on Structured Data
Fine-Tuning of Small/Medium LLMs for Business QA on Structured DataFine-Tuning of Small/Medium LLMs for Business QA on Structured Data
Fine-Tuning of Small/Medium LLMs for Business QA on Structured Data
 
the potential of the development of the Ford–Fulkerson algorithm to solve the...
the potential of the development of the Ford–Fulkerson algorithm to solve the...the potential of the development of the Ford–Fulkerson algorithm to solve the...
the potential of the development of the Ford–Fulkerson algorithm to solve the...
 
CHAPTER-1-Introduction-to-Marketing.pptx
CHAPTER-1-Introduction-to-Marketing.pptxCHAPTER-1-Introduction-to-Marketing.pptx
CHAPTER-1-Introduction-to-Marketing.pptx
 
DataScienceConcept_Kanchana_Weerasinghe.pptx
DataScienceConcept_Kanchana_Weerasinghe.pptxDataScienceConcept_Kanchana_Weerasinghe.pptx
DataScienceConcept_Kanchana_Weerasinghe.pptx
 
BDSM Girls Call Delhi 🎈🔥9711199171 🔥💋🎈 Provide Best And Top Girl Service And ...
BDSM Girls Call Delhi 🎈🔥9711199171 🔥💋🎈 Provide Best And Top Girl Service And ...BDSM Girls Call Delhi 🎈🔥9711199171 🔥💋🎈 Provide Best And Top Girl Service And ...
BDSM Girls Call Delhi 🎈🔥9711199171 🔥💋🎈 Provide Best And Top Girl Service And ...
 
Mumbai Girls Call Mumbai 🛵🚡9910780858 💃 Choose Best And Top Girl Service And ...
Mumbai Girls Call Mumbai 🛵🚡9910780858 💃 Choose Best And Top Girl Service And ...Mumbai Girls Call Mumbai 🛵🚡9910780858 💃 Choose Best And Top Girl Service And ...
Mumbai Girls Call Mumbai 🛵🚡9910780858 💃 Choose Best And Top Girl Service And ...
 
Celebrity Girls Call Andheri 9930245274 Unlimited Short Providing Girls Servi...
Celebrity Girls Call Andheri 9930245274 Unlimited Short Providing Girls Servi...Celebrity Girls Call Andheri 9930245274 Unlimited Short Providing Girls Servi...
Celebrity Girls Call Andheri 9930245274 Unlimited Short Providing Girls Servi...
 
NPS_Presentation_V3.pptx it is regarding National pension scheme
NPS_Presentation_V3.pptx it is regarding National pension schemeNPS_Presentation_V3.pptx it is regarding National pension scheme
NPS_Presentation_V3.pptx it is regarding National pension scheme
 
Coimbatore Girls call Service 000XX00000 Provide Best And Top Girl Service An...
Coimbatore Girls call Service 000XX00000 Provide Best And Top Girl Service An...Coimbatore Girls call Service 000XX00000 Provide Best And Top Girl Service An...
Coimbatore Girls call Service 000XX00000 Provide Best And Top Girl Service An...
 
Nipissing University degree offer Nipissing diploma Transcript
Nipissing University degree offer Nipissing diploma TranscriptNipissing University degree offer Nipissing diploma Transcript
Nipissing University degree offer Nipissing diploma Transcript
 
High Girls Call Nagpur 000XX00000 Provide Best And Top Girl Service And No1 i...
High Girls Call Nagpur 000XX00000 Provide Best And Top Girl Service And No1 i...High Girls Call Nagpur 000XX00000 Provide Best And Top Girl Service And No1 i...
High Girls Call Nagpur 000XX00000 Provide Best And Top Girl Service And No1 i...
 
Female Service Girls Call Navi Mumbai 9930245274 Provide Best And Top Girl Se...
Female Service Girls Call Navi Mumbai 9930245274 Provide Best And Top Girl Se...Female Service Girls Call Navi Mumbai 9930245274 Provide Best And Top Girl Se...
Female Service Girls Call Navi Mumbai 9930245274 Provide Best And Top Girl Se...
 
Beautiful Girls Call Pune 000XX00000 Provide Best And Top Girl Service And No...
Beautiful Girls Call Pune 000XX00000 Provide Best And Top Girl Service And No...Beautiful Girls Call Pune 000XX00000 Provide Best And Top Girl Service And No...
Beautiful Girls Call Pune 000XX00000 Provide Best And Top Girl Service And No...
 
Welcome back to Instagram. Sign in to check out what your
Welcome back to Instagram. Sign in to check out what yourWelcome back to Instagram. Sign in to check out what your
Welcome back to Instagram. Sign in to check out what your
 

Research data management for historians

  • 1. CSC – Suomalainen tutkimuksen, koulutuksen ja julkishallinnon ICT-osaamiskeskus Get the most out of your data! Data publication, tracking and citation Jessica PvE 28.2.2018
  • 2. Working with data: amount of work 28.2.20182 Data Science Report 2016 http://visit.crowdflower.com/rs/416-ZBE- 142/images/CrowdFlower_DataScienceReport_2016.pdf
  • 3. Monya Baker: 1,500 scientists lift the lid on reproducibility. Survey sheds light on the ‘crisis’ rocking research. Nature 533, 2016. doi:10.1038/533452a
  • 4. Data citation • Project by Finnish Committee for Research • Research data should be FAIR • Data should have: Creator, title, publisher, publication time, identifier • Recommended additional information: Version, resource type, copyright status 28.2.20184
  • 6. • Include principles of data as evidence and data transparency in next version of Finnish RCR guideline byTENK. • Recognise data authorship as a distinct issue and discussion in the TENK authorship guideline http://www.tenk.fi/sites/tenk.fi/files/TENK_suositus_tek ijyys.pdf • Create a multi-institutional, multidisciplinary working group to define principles for defining data authorship, coordinated f.e. byTENK, OR assign national representation to a relevant international activity with the same goal. • Organize multidisciplinary discussion on data management and citation, with the aim of creating interoperable practices. • Promote the use of data reference model also when referring to authors own primary source data. • Define field specific level of granularity for data citation. Recommendations 6
  • 7. • data licensing • metadata standards • open data formats • shared vocabularies • informed decisions to share data • access rights • descriptive metadata • persistent identifiers Findable Accessible ReusableInter-operable force11.org/group/fairgroup/fairprinciples
  • 8. FAIR Findable Accessible Interoperable Reusable By Martin Grandjean - Own work : http://www.martingrandjean.ch/wp-content/uploads/2013/10/Graphe3.png, CC BY-SA 3.0, https://commons.wikimedia.org/w/index.php?curid=29364647
  • 9. 1.(Meta)data are assigned a globally unique and persistent identifier 2. Data are described with rich metadata 3. Metadata clearly and explicitly include the identifier of the data it describes 4. (Meta)data are registered or indexed in a searchable resource URN
  • 10. 1.(Meta)data are retrievable by their identifier using a standardized communications protocol 2.Metadata are accessible, even when the data are no longer available
  • 11. 1.(Meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation. 2.(Meta)data use vocabularies that follow FAIR principles 3. (Meta)data include qualified references to other (meta)data
  • 12. Meta(data) are richly described with a plurality of accurate and relevant attributes
  • 13. Persistent Identifiers (DOI, URN) Data Catalog Resolver PID Data file License Configuration files Read me Landing page
  • 14. Nano publications, linking data and compact identifiers • https://www.go-fair.org/ • http://identifiers.org/ 28.2.201814
  • 15. Schofield et al (2015) http://digitalhumanities.org:8081/dhq/vol/9/3/000227/000227.html Rutgers lib guide
  • 17. Organising your data • Sort and classify your information • For instance: don’t mix different types of information in excel columns: it is usually easier to combine datasets than sort out ill structured data later • Think about granularity (file size) and metadata • Decide on formats, units, codes etc and be consequent 28.2.201817
  • 18. Organising your data • Write a code book, document • Think about intelligibility • Be careful when rearranging, reformatting, sorting or copy- pasting data • Use common file formats, preferably open • Have processes in place for checking the data quality and completeness • Be clear about master copies and other copies 28.2.201818
  • 19. Organising your data • Be careful and plan well for sensitive data and anonymization • Think about security and access rights • Plan and agree on which versions of a dataset will be archived and/or published • Think about reproducibility and citing data 28.2.201819
  • 20. Files and folders • Create and agree on a system for naming files and folders and be consequent • Avoid very deep folder structures, since they can be difficult to handle • Use meaningful, unique file and folder names • Keep names as short as possible but relevant. 25 characters is usually considered maximum. 28.2.201820
  • 21. Files and folders • Dates inYYYY-MM-DD format allows you to sort and search your files • Avoid using special characters such as % & / : ; * . ? < > ^! “ () and Scandinavians • Use three digits (or 4 if you have a large number of files) i.e. 001, 002…….201, 202 (not 1, 2, 21). • Use underscores (_) instead of spaces 28.2.201821
  • 22. Files and folders • If using a personal name in the name give the surname first followed by first name. • Though, be very careful with personal data when naming files and folders • Indicate version number by using ‘V’ or “version” and number (and subversions with more digits if minor changes) 28.2.201822
  • 23. Types of data in research Open Data Generic Research Data Fixed Research Datasets Description • Public data for any use, commercial • May be dynamic • PSI • Mature data products • Not produced /monitored by researchers • Ex. Weather data, public transport • Produced by/with/for researchers • Validated, good quality • Well documented • Raw or processed • Datasets stable/version controlled • Ex. Corpus, SMEAR • Produced for specific research question • Underpins research, for replication • Might be very processed • Reuse value might be low unless mature field Format Stable, standardised May vary over time Must be preserved according to plan 28.2.201823
  • 24. License and publish your data • FSDTietoarkisto • Language Bank of Finland • Figshare • Zenodo • EUDAT • IDA and Etsin 28.2.201824

Editor's Notes

  1. It gets funded It’s reproducible It’s well documented
  2. Fair data is good It’s linked to other information be It’s sustainable, the links are unique and persistent It’s standardized and interoperable