Presentation on how research data can be divided into categories and how this can help data management for both service providers and researchers. The paper will be published in the journal Informaatiotutkimus in December 2018.
IC-SDV 2019: Competitive Intelligence: how to optimize the analysis of pipeli... – Dr. Haxel Consult
The document discusses two methods for optimizing data visualization for competitive intelligence analysis of pipeline and clinical trials data:
1) Using BizInt and VantagePoint solutions to combine pipeline, clinical trial, and other data sources and generate customizable reports and visualizations.
2) Developing an in-house tool called VALEM360 to provide a 360-degree view of pancreatic cancer competitive landscape data through multiple interactive visualization dashboards and a treatment decision tree.
The VALEM360 tool demonstrated the potential of a data-driven approach but would benefit from improved data quality, automated updates, and application of the methodology to other disease areas. Overall, data visualization is very useful for competitive intelligence analysis but requires expertise in the relevant topic areas.
The OntoChem IT Solutions GmbH ...
... was founded in 2015 as a purely IT-oriented offshoot of OntoChem GmbH. Even before then we had many years of experience, and it has always been our mission to provide added value to our customers by helping them navigate today’s complex information world: developing cognitive computing solutions, indexing intranet and internet data, and applying semantic search solutions for pharmaceutical, materials science and technology-driven businesses.
We strive to support our customers with the most useful tools for knowledge discovery possible, encompassing up-to-date data sources, optimized ontologies and high-throughput semantic document processing and annotation techniques.
We create new knowledge from structured and unstructured data by extracting relationships, thereby exploiting the full potential of full-text documents & databases while also scanning social media and news flows and analyzing web pages.
We aim at an unprecedented machine understanding of text and subsequent knowledge extraction and inference. Applying our methods to chemical compounds and their properties supports our customers in generating intellectual property and in using those compounds as novel therapeutics, agrochemical products, nutraceuticals, cosmetics and novel materials.
It's our mission to provide added value to customers by:
developing and applying cognitive computing solutions
creating intranet and internet data indexing and semantic search solutions
applying Big Data analytics for technology-driven businesses
supporting product development and surveillance.
We deliver useful tools for knowledge discovery for:
creating background knowledge ontologies
high-throughput semantic document processing and annotation
knowledge mining by extracting relationships
exploiting the full potential of full-text documents & databases while also scanning social media and news flows and analyzing web pages.
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09... – dkNET
Abstract
In this presentation, Susan Gregurick, Ph.D., Associate Director of Data Science and Director, Office of Data Science Strategy at the National Institutes of Health, will share the NIH’s vision for a modernized, integrated FAIR biomedical data ecosystem and the strategic roadmap that NIH is following to achieve this vision. Dr. Gregurick will highlight projects being implemented by team members across the NIH’s 27 institutes and centers and will discuss ways that industry, academia, and other communities can help NIH enable a FAIR data ecosystem. Finally, she will weave in how this strategy is being leveraged to address the COVID-19 pandemic.
Presenter: Susan Gregurick, Ph.D., Associate Director of Data Science and Director, Office of Data Science Strategy at the National Institutes of Health
dkNET Webinar Information: https://dknet.org/about/webinar
Investigating plant systems using data integration and network analysis – Catherine Canevet
The document discusses challenges in integrating plant data from multiple sources and proposes solutions. It notes that plant data is sparse, distributed across many databases in various formats, and focused primarily on the model plant Arabidopsis. Data integration is necessary to address key biological questions by consolidating information from pathway databases, gene annotations, protein interactions, and more. The document outlines approaches to data integration including controlled vocabularies, ontologies, data standards, and integration applications specifically designed to combine data sources like Ondex. Effective integration is important to fully leverage available plant data.
The document discusses data sharing requirements for publishing in PLOS journals. PLOS requires authors to share all underlying data without restriction. Acceptable methods include depositing data in public repositories like Dryad or Figshare. The document also discusses other data repositories and journals, as well as obtaining identifiers like Data DOIs from services like Cite My Data. It provides an example of how the Hawkesbury Institute for the Environment publishes data using their HIEv application and Figshare to obtain DOIs for datasets associated with journal publications.
PA webinar on benefits & costs of FAIR implementation in life sciences – Pistoia Alliance
Slides from the Pistoia Alliance Debates webinar, where a panel of experts from technology support providers and the biopharma industry were invited to share their views on the "Benefits and costs of FAIR implementation for the life science industry".
Research data sharing enables validation and new analyses of results, ensures efficient use of public funds, and counters misconduct. Funding agencies can encourage open data practices by requiring long-term storage, promoting data publication, and helping make data findable through catalogs. They should work with research communities to understand infrastructure needs, partner with libraries on preservation, and consider discipline-specific approaches rather than one-size-fits-all solutions.
Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ... – Tom Plasterer
As scientists in the life sciences we are trained to pursue singular goals around a publication or a validated target or a drug submission. Our failure rates are exceedingly high especially as we move closer to patients in the attempt to collect sufficient clinical evidence to demonstrate the value of novel therapeutics. This wastes resources as well as time for patients depending upon us for the next breakthrough.
Edge Informatics is an approach to ameliorate these failures. By using technical and social solutions together, knowledge can be shared and leveraged across the drug development process. This is accomplished by making data assets discoverable, accessible, self-described, reusable and annotatable. The Open PHACTS project pioneered this approach and has provided a number of the technical and social solutions that enable Edge Informatics. A number of pre-competitive consortia and some content providers have also embraced this approach, facilitating networks of collaborators within and outside a given organization. Taken together, these measures foster more accurate, timely and inclusive decision-making.
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources – Pistoia Alliance
The FAIR (Findable, Accessible, Interoperable and Reusable) principles aim to maximize the discovery and reuse of digital resources. Using recently developed software and metrics to assess FAIRness and supported through an ELIXIR Implementation Study, Michel worked with a subset of ELIXIR Core Data Resources to apply these technologies. In this webinar, he will discuss their approach, findings, and lessons learned towards the understanding and promotion of the FAIR principles.
II-SDV 2016 Stefan Geißler Navigating complex information landscapes – Semant... – Dr. Haxel Consult
Information that is relevant for researchers and decision makers in the life sciences comes from many different backgrounds: scientific publications, patents, news, clinical reports and user-generated content may all be required to understand trends, opportunities and threats. A key to providing a quick and comprehensive overview is having information from the various sources in one place, semantically enriched and normalized, and related to one another.
We present the key principles of a platform that serves that purpose and that provides users with insights into the scientific, clinical and competitive intelligence landscape of their respective area of interest. Forged in close collaboration with industry practitioners, the Luxid Biopharma Navigator is today used in production by hundreds of experts.
Is that a scientific report or just some cool pictures from the lab? Reproduc... – Greg Landrum
Requirements for reproducibility in computational chemistry publications include making available the data, code or algorithms, and results from the study. Authors should provide all data necessary to understand and assess their conclusions. Source code or detailed algorithm descriptions should also be included to allow independent reproduction of the work. Finally, publications must contain the actual results from applying the method rather than just describing results. Adopting these standards of transparency helps ensure others can evaluate and build upon published research claims.
Slides to be presented at a webinar arranged by Metasolution as part of a Vinnova project http://metasolutions.se/2014/03/webbinarium-med-kerstin-forsberg-om-lankade-data-i-lakemedelsforskningen/
The document proposes a federated in-memory database system for life sciences that addresses the needs of patients, clinicians, and researchers by enabling real-time analysis of big medical data while maintaining data privacy and locality. It describes key actors and a use case in cancer treatment. The proposed solution incorporates local compute resources through a federated in-memory database with a cloud service provider managing shared algorithms and master data, while sensitive patient data resides locally.
International perspective for sharing publicly funded medical research data – ARDC
Presentation by Olivier Salvado, CSIRO, to the 'Unlocking value from publicly funded Clinical Research Data' workshop, cohosted by ARDC and CSIRO at ANU on 6 March 2019.
Presentation by Hugo Leroux and Liming Zhu, CSIRO, to the 'Unlocking value from publicly funded Clinical Research Data' workshop, cohosted by ARDC and CSIRO at ANU on 6 March 2019.
This document provides guidance on creating a Data Management Plan (DMP). It discusses the key elements that should be included in a DMP, such as the types of data that will be collected, metadata standards, data sharing and access policies, plans for reusing and redistributing data, and archiving data for long-term preservation. It also notes that costs for implementing the DMP may be included in the proposal budget and that the DMP will be reviewed as part of the NSF proposal process. Template codes for elements like variable names and labels that could be included in a DMP are also provided.
This document discusses data management requirements for predictive modeling using large datasets from multiple clinical, specimen, and lab repositories. It notes the need to assemble complete and up-to-date datasets while maintaining quality assurance and transparency. Over time, data storage systems experience problems with exponential data growth, manual data curation difficulties, and challenges integrating heterogeneous databases across different research groups. The document examines a spectrum of potential data management approaches and highlights collaborative networks and use of open source platforms as ways to address these issues.
A presentation I gave at the 2018 Molecular Med Tri-Con in San Francisco, February 2018. It addresses the general challenge of biomedical data management and some of the things to consider when evaluating solutions in this space, and concludes with a brief summary of some of the tools and platforms in this space.
This document summarizes the work of developing a Data Discovery Index prototype that helps users find and access shared biomedical data from various repositories. It ingests metadata from different standards and sources using ElasticSearch. It was presented at the Alan Turing Institute Symposium in April 2016. The project aims to organize data through an aggregator framework and portal. It involves mapping various metadata standards to have maximum coverage of use cases with minimal data elements. More information can be found at the listed websites.
This document summarizes Catriona MacCallum's presentation on data publishing at PLOS. The key points are:
1) PLOS requires authors to make all underlying data openly available without restriction, with rare exceptions. Authors must provide a Data Availability Statement describing compliance.
2) Over 47,000 PLOS papers have included a data statement. Most data is found within submission files or repositories like Dryad and Figshare. PLOS checks data accessibility and ensures anonymity of clinical datasets.
3) PLOS supports initiatives like CRediT for attributing research contributions and data citation principles for giving credit to data producers. PLOS is also involved in projects beyond traditional publishing, such as preprints and experimental publishing platforms.
Clinical Data Models - The Hyve - Bio IT World April 2019 – Kees van Bochove
Population genetics and genomics is an emerging topic for the application of machine learning methods in healthcare and biomedical sciences. Currently, several large genomics initiatives, such as Genomics England, UK Biobank, the All of Us Project, and Europe's 1 Million Genomes Initiative, are all in the process of making both clinical and genomics data available from large numbers of patients to benefit biomedical research. However, a key challenge in these initiatives is the standardization of the clinical and outcomes data in such a way that machine learning methods can be effectively trained to discover useful medical and scientific insights. In this talk, we will look at what data is available at scale, and review some examples of applying common data and evidence models such as OMOP, FHIR and GA4GH to achieve this, based on projects which The Hyve has executed with some of these initiatives to harmonize their clinical, genomics, imaging and wearables data and make it FAIR.
Presentation by Dr Steve McEachern, ADA, to the 'Unlocking value from publicly funded Clinical Research Data' workshop, cohosted by ARDC and CSIRO at ANU on 6 March 2019.
Is one enough? Data warehousing for biomedical research – Greg Landrum
The document discusses challenges in storing and managing real-world biomedical data from multiple sources for analysis. It describes three different data warehouse case studies used at Novartis - Avalon, MAGMA, and the Entity Warehouse. The Entity Warehouse takes a novel approach of modeling data as entities that can be linked together, with results stored in tables by type. It is designed to integrate both internal and external data while allowing broad access. However, the document concludes that no single warehouse fits all needs, and multiple solutions may be required to fully enable data analysis.
Open science and medical evidence generation - Kees van Bochove - The Hyve – Kees van Bochove
Presentation about open science, the FAIR principles, and medical evidence generation with the OHDSI COVID-19 study-a-thon as an example. I've used variations on this deck in a couple of classroom and online courses for PhD and master students early 2020.
Enabling Precise Identification and Citability of Dynamic Data: Recommendatio... – LEARN Project
Enabling Precise Identification and Citability of Dynamic Data: Recommendations of the RDA Working Group, by Andreas Rauber – 2nd LEARN Workshop, Vienna, 6th April 2016
Data Science Provenance: From Drug Discovery to Fake Fans – Jameel Syed
Knowledge work adds value to raw data; how this activity is performed is critical for how reliably results can be reproduced and scrutinized. With a brief diversion into epistemology, the presentation will outline the challenges for practitioners and consumers of Big Data analysis, and demonstrate how these were tackled at Inforsense (life sciences workflow analytics platform) and Musicmetric (social media analytics for music).
The talk covers the following issues with concrete examples:
- Representations of provenance
- Considerations to allow analysis computation to be recreated
- Reliable collection of noisy data from the internet
- Archiving of data and accommodating retrospective changes
- Using linked data to direct Big Data analytics
FAIR data in trustworthy repositories: the basics – OpenAIRE
This video illustrates how certified digital repositories contribute to making and keeping research data findable, accessible, interoperable and reusable (FAIR). Trustworthy repositories support Open Access to data, as well as Restricted Access when necessary, and they offer support for metadata, sustainable and interoperable file formats, and persistent identifiers for future citation. Presented by Marjan Grootveld (DANS, OpenAIRE).
Main references
• Core Trust Seal for trustworthy digital repositories: https://www.coretrustseal.org/
• EUDAT FAIR checklist: https://doi.org/10.5281/zenodo.1065991
• European Commission’s Guidelines on FAIR data management: http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_pilot/h2020-hi-oa-data-mgt_en.pdf
• FAIR data principles: www.force11.org/group/fairgroup/fairprinciples
• Overview of metadata standards and tools: https://rdamsc.dcc.ac.uk/
How to best manage your data to make the most of it for your research with the ODAM framework (Open Data for Access and Mining): give open access to your data and make them ready to be mined.
Managing and Sharing Research Data - Workshop at UiO - December 04, 2017 – Michel Heeremans
These slides were presented during a workshop on Research Data Management, given at the University of Oslo, Department of Geosciences on December 04, 2017
This document provides biographical and contact information for Professor Aboul Ella Hassanien, including that he is the founder and chair of the Scientific Research Group in Egypt and formerly served as dean of the faculty of computers and information at Beni-Suef University. It announces an upcoming presentation by Professor Hassanien on sharing scientific data, ethics, and consent taking place on January 20, 2018 at Cairo University.
The document discusses sharing research data through open data platforms. It describes the CGIAR as uniquely positioned to collect agricultural data worldwide and argues that most CGIAR data should be archived and shared to increase its value. However, data archiving across CGIAR centers is currently poor. The document then discusses using the Dataverse platform to improve data sharing. Dataverse allows researchers to publish, share, cite, and analyze data. It also facilitates making data available while giving credit to data authors and institutions.
The document provides an introduction to data management, defining data and describing requirements for data sharing from federal funding agencies. It discusses best practices for data management, such as developing data management plans and file organization, as well as options for data preservation, sharing, and archiving. Resources for data management assistance at Northwestern University are also outlined.
Presentation by Ruth Wilson on Nature Publishing Group's Scientific Data journal given at the Now and Future of Data Publishing Symposium, 22 May 2013, Oxford, UK
The document summarizes the Jisc Managing Research Data Programme which aims to support universities in improving research data management. It discusses why managing research data is important, highlighting funder policies and the benefits of open data. It provides an overview of Jisc's activities including training projects, guidance resources, and funding for institutional infrastructure services and repositories. The presentation emphasizes the importance of institutional policies, support services, skills development and cultural change to effectively manage research data in line with funder expectations.
Research data can be categorized as observational, experimental, simulation, derived or compiled, and reference or canonical. A highly effective data pyramid outlines key aspects for research data: being stored, preserved, accessible, discoverable, citable, comprehensible, reviewed, reproducible, reusable, and integrated. A data-driven company is one where decision makers have independent access to data when needed and the company continuously measures business metrics. Properties of data-driven companies include being comfortable with uncertainty, adapting culture, being agile, making forward-looking technology acquisitions, updating processes, having CEO leadership, removing organizational barriers, allocating resources differently, and productizing data.
Pine Biotech conducts monthly informational workshops on the topics related to high-throughput data analysis, interpretation and integration. The workshops highlight our research tools and educational resources developed with collaborators in the US and across the world.
A Generic Scientific Data Model and Ontology for Representation of Chemical DataStuart Chalk
The current movement toward openness and sharing of data is likely to have a profound effect on the speed of scientific research and the complexity of questions we can answer. However, a fundamental problem with currently available datasets (and their metadata) is heterogeneity in terms of implementation, organization, and representation.
To address this issue we have developed a generic scientific data model (SDM) to organize and annotate raw and processed data, and the associated metadata. This paper will present the current status of the SDM, implementation of the SDM in JSON-LD, and the associated scientific data model ontology (SDMO). Example usage of the SDM to store data from a variety of sources will be discussed along with future plans for the work.
Big Data – Shining the Light on Enterprise Dark Data – Hitachi Vantara
Content stored for a business purpose often lacks the structure or metadata required to determine its original purpose. With Hitachi Data Discovery Suite and Hitachi Content Platform, businesses can uncover dark data that could be leveraged for better business insight and surface compliance issues, preventing business risks. View this session and learn: What is enterprise dark data? How can enterprise dark data impact business decisions? How can you augment your underutilized data and deliver more value? How can you decrease the headache and challenges created by dark data? For more information please visit: http://www.hds.com/products/file-and-content/
I o dav data workshop prof wafula final 19.9.17 – Tom Nyongesa
The document summarizes an iODaV Data Workshop held at JKUAT in Kenya on open data and the JORD policy. It discusses why open data is important for reproducibility, innovation and scientific discovery. It outlines the FAIR principles for open data and metadata to make data findable, accessible, interoperable and reusable. It also discusses opportunities and challenges of open data for universities, including developing skills and infrastructure. Finally, it provides examples of open data initiatives at JKUAT including developing an open data policy, the iODaV program, contributions to national ICT policies, and the digital health applied research centre.
DRIVE CENTRAL STUDY PLATFORM: Data flow, data quality and statistical analysi... – DRIVE research
DRIVE annual forum 2019, Helsinki, Finland, 17th-18th September
Development of Robust and Innovative Vaccine Effectiveness
Increasing understanding of influenza vaccine effectiveness in Europe
This presentation was provided by Dr. Paul Burton of the University of Bristol during the NISO Symposium, Privacy Implications of Research Data, held on September 11, 2016, in conjunction with the International Data Week in Denver, Colorado.
Similar to Supporting FAIR data principles with data categorization
The document outlines a road map for PID Forum Finland with 3 key steps: 1) Creating engagement around PIDs by raising awareness and building skills and trust. 2) Organizing management and funding by describing use cases, creating proofs of concept, and defining requirements. 3) Creating infrastructure by ensuring interoperability, building a resolver, and organizing support services. The overall goal is to make information traceable across different channels now and in the future.
Presentation at the Library Network Days (Kirjastoverkkopäivät) in October 2021. I spoke about research datasets as objects of description, persistent identifiers, and some other topics related to the special characteristics of research data.
Presentation at the National Library of Finland's online event Kulttuuriperintöaineistot ja tutkimusdata – yhteistyön rajapintoja (Cultural heritage materials and research data – interfaces of collaboration) on 4 March 2021. In this presentation I discussed research data management and how the Fairdata services enable implementing the FAIR data principles in research data publication.
Presentation at Digital Humanities in the Nordics 2020 conference in panel: Towards deterioration, disappearance or destruction? Discussing the critical issue of long-term sustainability of digital humanities projects
In an expert webinar on April 15th 2020 we discussed (in Finnish) how the FAIR data principles affect service development in RDM services. I presented some relevant outputs from the FAIRsFAIR project. These are the slides (in English). The webinar will be published on the fairdata.fi service site https://www.fairdata.fi/koulutus/koulutuksen-tallenteet/
1) The document summarizes a report on requirements for FAIR (Findable, Accessible, Interoperable, Reusable) data persistence and interoperability.
2) It describes a 36-month, 10 million euro project involving 22 partners from 8 EU member states working on practical implementations of semantic interoperability across research infrastructures.
3) The report analyzes the current landscape of FAIR technologies, semantic artifacts, and infrastructure initiatives; identifies challenges around scope, terminology, and rapid development; and concludes that solutions must be user-friendly, context-sensitive, and transparent while promoting adoption of standards and registries.
Collections meet the researcher. Digitalization, disintegration and disillusi... – Jessica Parland-von Essen
Presentation at the LAM3 seminar in Uppsala, 9th of October 2019. On digitalization, researchers and data in the context of cultural heritage collections. The slides mostly contain headings, but the two last slides include a list of relevant reading on the subject.
This document discusses best practices for organizing, managing, and publishing research data. It recommends using standardized file naming and folder structures, documenting data through code books and metadata, selecting open formats, and considering issues like data security, versions, and citations. FAIR principles of findable, accessible, interoperable and reusable data are presented. Options in Finland for publishing and archiving research data include repositories like FSD Tietoarkisto and Zenodo. Adopting these practices helps ensure well-organized, documented data that can enable reproducibility and reuse.
This document discusses making data Findable, Accessible, Interoperable and Reusable (FAIR). It provides principles for each component and examples of metadata standards and repositories that help achieve FAIR data. Resources referenced include guidelines for assigning persistent identifiers to data and metadata, describing data with rich metadata using shared vocabularies, and indexing metadata in searchable resources to enable discovery and access.
The document discusses open science and how it has changed research practices. It defines open science as making research data, notes, and processes openly available for collaboration and reuse. It outlines benefits like increasing quality, impact and innovation. Barriers like publishing costs are mentioned. The document recommends openly licensing data and publications, using open peer review and platforms, and sharing materials like code and presentations. Proper data management is important for openness, reproducibility and ensuring research integrity.
This document discusses data management practices in research. It defines research data and emphasizes the importance of good data management for ensuring integrity, reproducibility and excellence in science. Key aspects of data management include planning, documentation, metadata, sustainability, and publication. Funders increasingly require and support open access to publications and research data. The document provides guidance and considerations for implementing responsible data management and open science practices.
Discovering Digital Process Twins for What-if Analysis: a Process Mining Appr... – Marlon Dumas
This webinar discusses the limitations of traditional approaches to business process simulation based on hand-crafted models with restrictive assumptions. It shows how process mining techniques can be assembled to discover high-fidelity digital twins of end-to-end processes from event data.
Discover the cutting-edge telemetry solution implemented for Alan Wake 2 by Remedy Entertainment in collaboration with AWS. This comprehensive presentation dives into our objectives, detailing how we utilized advanced analytics to drive gameplay improvements and player engagement.
Key highlights include:
Primary Goals: Implementing gameplay and technical telemetry to capture detailed player behavior and game performance data, fostering data-driven decision-making.
Tech Stack: Leveraging AWS services such as EKS for hosting, WAF for security, Karpenter for instance optimization, S3 for data storage, and OpenTelemetry Collector for data collection. EventBridge and Lambda were used for data compression, while Glue ETL and Athena facilitated data transformation and preparation.
Data Utilization: Transforming raw data into actionable insights with technologies like Glue ETL (PySpark scripts), Glue Crawler, and Athena, culminating in detailed visualizations with Tableau.
Achievements: Successfully managing 700 million to 1 billion events per month at a cost-effective rate, with significant savings compared to commercial solutions. This approach has enabled simplified scaling and substantial improvements in game design, reducing player churn through targeted adjustments.
Community Engagement: Enhanced ability to engage with player communities by leveraging precise data insights, despite having a small community management team.
This presentation is an invaluable resource for professionals in game development, data analytics, and cloud computing, offering insights into how telemetry and analytics can revolutionize player experience and game performance optimization.
PyData London 2024: Mistakes were made (Dr. Rebecca Bilbro) – Rebecca Bilbro
To honor ten years of PyData London, join Dr. Rebecca Bilbro as she takes us back in time to reflect on a little over ten years working as a data scientist. One of the many renegade PhDs who joined the fledgling field of data science in the 2010s, Rebecca will share lessons learned the hard way, often from watching data science projects go sideways and learning to fix broken things. Through the lens of these canon events, she'll identify some of the anti-patterns and red flags she's learned to steer around.
Supporting FAIR data principles with data categorization
1. CSC – Finnish ICT center of expertise for research, education, culture and public administration
Supporting FAIR data. Categorization of research data as a tool in data management
Jessica Parland-von Essen https://orcid.org/0000-0003-4460-3906, Katja Fält https://orcid.org/0000-0002-6172-5377, Zubair Maalick https://orcid.org/0000-0002-0975-1471, Miika Alonen https://orcid.org/0000-0002-0065-0017, Eduardo Gonzalez https://orcid.org/0000-0003-1400-0995
3. Persistent identifiers
a) Cite a specific slice or subset (the set of updates to the dataset made during a particular period of time or to a particular area of the dataset).
b) Cite a specific snapshot (a copy of the entire dataset made at a specific time).
c) Cite the continuously updated dataset, but add Access Date and Time to the citation. (Does not necessarily ensure reproducibility.)
d) Cite a query, time-stamped for re-execution against a versioned database.
DYNAMIC DATASETS
IMMUTABLE DATASETS
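Option d) is the approach generally associated with the RDA Working Group recommendations for dynamic data. As an illustration only, here is a minimal Python sketch of what such a time-stamped query citation record could look like; the identifier, field names and query are hypothetical assumptions, not part of the original slides.

```python
import hashlib
import json
from datetime import datetime, timezone

def make_query_citation(query: str, dataset_pid: str) -> dict:
    """Build a citable record for option d): a time-stamped query that can
    later be re-executed against a versioned database."""
    # Normalize whitespace and case so semantically identical queries get
    # the same fingerprint (useful for detecting query drift on re-execution).
    normalized = " ".join(query.split()).lower()
    return {
        "dataset_pid": dataset_pid,                             # PID of the versioned dataset
        "query": query,                                         # exact query to re-execute
        "executed_at": datetime.now(timezone.utc).isoformat(),  # as-of time for versioned rows
        "query_fingerprint": hashlib.sha256(normalized.encode()).hexdigest(),
    }

if __name__ == "__main__":
    citation = make_query_citation(
        "SELECT * FROM observations WHERE station = 'FI-0001'",
        "urn:nbn:fi:csc-example-dataset",  # hypothetical identifier
    )
    print(json.dumps(citation, indent=2))
```

Re-executing the stored query at the stored timestamp against a versioned store would then reproduce the cited slice without minting a new identifier for every state of the data.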
4. Maybe we need to be more specific and find common ground in concepts?
CHUNKING UP RESEARCH DATA
5. Categorization according to technical properties
• Modality, DCMI types
  o Dublin Core type of thinking
• Format, DCMI format
  o MIME types
  o Software related
• Language, coding
  o Human interpretation
By Lin Kristensen from New Jersey, USA (Timeless Books) [CC BY 2.0 (https://creativecommons.org/licenses/by/2.0)], via Wikimedia Commons
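To make the technical categorization concrete, the following small Python sketch derives DCMI-style format and type metadata from a file name using the standard library's mimetypes module. The MIME-to-DCMI mapping is an illustrative assumption, not an official crosswalk.

```python
import mimetypes

# Rough mapping from MIME top-level type to a DCMI Type vocabulary term.
# This mapping is an illustrative assumption, not an official crosswalk.
MIME_TO_DCMI = {
    "text": "Text",
    "image": "StillImage",
    "audio": "Sound",
    "video": "MovingImage",
    "application": "Dataset",
}

def describe_file(filename: str) -> dict:
    """Guess technical-category metadata (DCMI format and type) for a file."""
    mime, _encoding = mimetypes.guess_type(filename)
    top_level = mime.split("/")[0] if mime else None
    return {
        "filename": filename,
        "format": mime or "unknown",                     # DCMI format (MIME type)
        "type": MIME_TO_DCMI.get(top_level, "Dataset"),  # DCMI type (modality)
    }

# Example: a CSV of observations is text/csv, which maps here to DCMI 'Text'.
print(describe_file("observations.csv"))
```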
6. Categorization according to contextual traits
• Origin
  o Observational, experimental, simulation, derived etc.
• Use category
  o Source, output, method
• Provenance, lifecycle
  o Primary, secondary, data levels, qualitative, quantitative
By David Monniaux CC-BY-SA-3.0 (http://creativecommons.org/licenses/by-sa/3.0/), from Wikimedia Commons
7. Categorization according to inherent traits
• Access type (availability)
  o Open data, sensitive data
• Semantic structure
  o Coherence, levels of measurement, groupings, classifications
• Research data type (stability)
  o Generic data, generic research data, research data publications
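Slides 6 and 7 describe categorization axes that can be combined in a single metadata record. The Python sketch below shows one possible way to model them; the field names and enum values simply mirror the slides and are not an established schema.

```python
from dataclasses import dataclass
from enum import Enum

class Origin(Enum):
    # Contextual trait: how the data came to be (slide 6).
    OBSERVATIONAL = "observational"
    EXPERIMENTAL = "experimental"
    SIMULATION = "simulation"
    DERIVED = "derived"

class AccessType(Enum):
    # Inherent trait: availability (slide 7).
    OPEN = "open data"
    SENSITIVE = "sensitive data"

@dataclass
class DatasetCategory:
    """One record combining contextual and inherent categorization traits."""
    origin: Origin           # contextual: observational, experimental, ...
    use_category: str        # contextual: source, output or method
    access_type: AccessType  # inherent: open vs. sensitive
    stability: str           # inherent: generic data / generic research data / research data publication

# Example record for a clinical survey dataset (values are illustrative).
survey = DatasetCategory(
    origin=Origin.OBSERVATIONAL,
    use_category="source",
    access_type=AccessType.SENSITIVE,
    stability="generic research data",
)
print(survey)
```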
9. Dynamic and growing datasets
• URN allows use of fragments
• Avoid PID inflation
• Consider costs and sustainability
• Ad hoc creation rather than automatic minting and allocation?
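One way to read the URN point: a single persistent identifier can cover a growing dataset, with fragments identifying slices, so each update does not need its own PID. A minimal sketch, assuming a hypothetical URN and a monthly slicing scheme:

```python
# One PID for the whole growing dataset; fragments identify slices,
# which helps avoid "PID inflation" from minting a PID per update.
BASE_URN = "urn:nbn:fi:csc-example-growing-dataset"  # hypothetical URN

def cite_slice(year: int, month: int) -> str:
    """Return a citable reference to one monthly slice of the dataset."""
    return f"{BASE_URN}#slice-{year:04d}-{month:02d}"

print(cite_slice(2018, 11))  # urn:nbn:fi:csc-example-growing-dataset#slice-2018-11
```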
10. Operational data / Generic research data / Research dataset

Operational data
- Description: Data for any use, private or government owned; might fall within PSI.
- Format: May be dynamic; mature solutions; active or even hot data.
- Examples: weather data; data catalogue; big data from social media.

Generic research data
- Description: Produced by/with/for researchers; validated, good quality, well documented; might be raw or processed.
- Format: Coherent and well documented formats. Data should be quite stable, with versioning. Should be possible to cite and to enable reproducible research.
- Examples: corpora; time series of experimental or observational data from technical instruments; similar social or clinical surveys.

Research dataset
- Description: Dataset produced for a certain research question. Might be highly processed; reuse difficult unless the field is mature. The main purpose is assessment and reproducibility.
- Format: Usually in files, but might also be a database with applications. Citation does not require a date. Two-tier resolver for identifier and landing page, with metadata available even after the data is gone. Might have a defined lifespan.
- Examples: data paper; data cited in an article and published in Zenodo, EUDAT B2Share, or another general or journal repository.
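Because the three types differ in stability and citation needs, a service could derive its handling rules directly from the type. The sketch below is one hypothetical encoding of the table's citation and landing-page rules in Python, not an implemented CSC service.

```python
from dataclasses import dataclass

@dataclass
class DataTypePolicy:
    """Hypothetical service-side policy derived from the research data type."""
    citation_needs_access_date: bool  # dynamic content: cite with access date
    versioned: bool                   # stable content: cite a version/snapshot
    landing_page_outlives_data: bool  # keep landing-page metadata after data is gone

POLICIES = {
    "operational data": DataTypePolicy(True, False, False),
    "generic research data": DataTypePolicy(False, True, False),
    "research dataset": DataTypePolicy(False, True, True),
}

def citation_hint(data_type: str) -> str:
    """Suggest how to cite a dataset of the given type."""
    policy = POLICIES[data_type]
    if policy.citation_needs_access_date:
        return "Cite with access date and time; content may change."
    return "Cite the versioned snapshot via its persistent identifier."

print(citation_hint("operational data"))
print(citation_hint("research dataset"))
```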
11. Using research data types …
… makes it easier to describe services
… makes it easier for researchers to plan data life cycle
… makes developing solutions for citation and FAIR data creation and use easier
… makes it easier to describe and manage research data