Identity Awareness: Toward an Invisible e-Infrastructure for Identifying Data and Authors

•Download as PPTX, PDF•

2 likes•605 views

amiraryani

Presented by Amir Aryani and Adrian Burton at eResearch Australasia Conference 2012, Sydney, Australia

Identity Awareness
• Connecting data to
• Researchers
• Grants
• Publications
• Licence

ODIN Project
• Interoperability between
• ORCID
• DataCite

“My vision is a scientific community that
does not waste resources on recreating
data that have already been produced, in
particular if public money has helped to
collect those data in the first place.”
Neelie Kroes, Vice-President of the European Commission, Digital Agenda

Research Data Australia (RDA)

Number of published research
collections in RDA

40,000"

30,000"

20,000"

10,000"

0"
2009)11" 2011)09" 2012)01" 2012)05" 2012)07" 2012)09" 2012)10"

Research Data Australia
Coverage of Published Research Collections

Identity Awareness
means knowing how to
• Identify the researchers who contributed to a dataset
• Identify the publications that use a dataset
• Identify the related grant or the research project

• Identify the licence for a dataset

Researcher Licence
Data

Grants and
Publication
projects
7

RDA Quality Model for Data
RIF-CS Elements Requirement 1 2 3
1 registry object Required * * *
2 originating source Required * * *
3 group Required * * *
4 key Required * * *
5 collection type Required * * *
6 name/title Required * *
7 related party (researcher or organisation) Required * *
8 description Required * *
9 location/address Required * *
10 rights (Licence) Required * *
11 activity (grant or research project) Required if available *
12 subject Recommended *
13 spatial coverage Recommended *
14 temporal coverage Recommended *
15 citation Recommended *
16 identifier Recommended *

Open Researcher & Contributor ID

Work
Connecting researcher to (Publication)
(Data)

Partners: Grants
• American Physical Society
• CrossRef Affiliations
• Elsevier
• Thomson Reuters Patents
• Wellcome Trust
• …

DataCite
1,104,998
Digital Object Identifiers (DOI) by DataCite

The information on this slide was captured on 30 Oct 2012

Data Creator,
Researcher, Author
Birth Cohort Study
dataset
Non- Birth Cohort
Study dataset
Derived dataset

Grey Literature
1958
Published article

Citation
Data Creator
Derived Data Creator
External Data input
Author: Grey lit
External Data
Author: Article
(Census, Health etc )
1970

Acknowledgment
ANDS is supported by the The ODIN project is funded by the
Australian Government through European Union under FP7 call
the National Collaborative INFRA-2012-3.3 (Grant Agreement
Research Infrastructure Strategy number 312788)
Program and the Education
Investment Fund (EIF) Super
Science Initiative

Conclusion
Enabling identity awareness is an international challenge
that requires a collaborative effort. ANDS encourage your
collaboration in this area and particularly to investigate
these questions:

• How can we measure and improve identity awareness
of research data?
• How can we measure and improve data reuse?
• How can we measure research impact?
• How can your organisation take advantage of the some
of the emerging global identity infrastructures?

"Towards a Science of Reproducible Science?" DPRMA Workshop talk at JCDL 2013, Indianapolis, 25th July 2013. Workshop website is http://dprma.oerc.ox.ac.uk/ Paper is David De Roure. 2013. Towards computational research objects. In Proceedings of the 1st International Workshop on Digital Preservation of Research Methods and Artefacts (DPRMA '13). ACM, New York, NY, USA, 16-19. DOI=10.1145/2499583.2499590 http://doi.acm.org/10.1145/2499583.2499590

The Future of Digital Science - World Science Forum 2011Kaitlin Thaney

The Research Object Initiative:Frameworks and Use Cases

Carole Goble

Mining Whole Museum Collections Datasets for Expanding Understanding of Colle...

Matthew J Collins

Introduces the Global Unified Open Data Architecture (GUODA) collaboration between iDigBio, independent developers, and EOL which aims to provide support for processing large biodiversity data sets using Apache Spark. A specific example with text mining is described. This presentation was given during the 31st Annual Meeting in 2016 of the Society for Presentation of Natural History Collections (SPNHC) in Berlin, Germany

20151102koyama

Yukinobu Koyama

Data Equivalence Mark Parsons, Lead Project Manager, Senior Associate Scientist, National Snow and Ice Data Center Data citation, especially using persistent identifiers like Digital Object Identifiers (DOIs), is an increasingly accepted scientific practice. Recently, several, respected organizations have developed guidelines for data citation. The different guidelines are largely congruent in that they agree on the basic practice and elements of data citation, especially for relatively static, whole data collections. There is less agreement on the more subtle nuances of data citation that are sometimes necessary to ensure precise reference and scientific reproducibility--the core purpose of data citation. We need to be sure that if you follow a data reference you get to the precise data that were used or at least their scientific equivalent. Identifiers such as DOIs are necessary but not sufficient for the precise, detailed, references necessary. This talk discusses issues around data set versioning, micro-citation, and scientific equivalence. I propose some interim solutions and suggest research strategies for the future.

Identifying psychological research data in the digital environment.

Leibniz-Zentrum für Psychologische Information & Dokumentation

Mendeley Data: Enhancing Data Discovery, Sharing and Reuse

Anita de Waard

NISO Forum, Denver, Sept. 24, 2012: Scientific discovery and innovation in an...

National Information Standards Organization (NISO)

Scientific discovery and innovation in an era of data-intensive science William (Bill) Michener, Professor and Director of e-Science Initiatives for University Libraries, University of New Mexico; DataONE Principal Investigator The scope and nature of biological, environmental and earth sciences research are evolving rapidly in response to environmental challenges such as global climate change, invasive species and emergent diseases. Scientific studies are increasingly focusing on long-term, broad-scale, and complex questions that require massive amounts of diverse data collected by remote sensing platforms and embedded environmental sensor networks; collaborative, interdisciplinary science teams; and new tools that promote scientific data preservation, discovery, and innovation. This talk describes the challenges facing scientists as they transition into this new era of data intensive science, presents current solutions, and lays out a roadmap to the future where new information technologies significantly increase the pace of scientific discovery and innovation.

Visualising Research Graph using Neo4j and Gephi

amiraryani

Using the Research Graph and Data Switchboard for cross-platform discovery

amiraryani

RDA EU Webinar - DDRI WG / April2017 Overview: Driven by the rapid development of data storage technology, the number of data repositories is growing fast. Researchers now have access to a range of data infrastructures such as discipline-specific repositories and national (regional) data infrastructures. The problem is that these infrastructures are often operating in silos; that is, they do not connect their datasets to related research information in other platforms. One solution to this problem is the work undertaken by the Data Description Registry Interoperability (DDRI) WG of Research Data Alliance (RDA). The group has developed the Research Data Switchboard which connects datasets and related information across research data repositories using information on co-authorship and jointly funded projects. In this webinar, Dr Amir Aryani presents an overview of the Switchboard project and discuss how it enables connecting datasets to the Research Graph -- a distributed graph of scholarly works derived by the Switchboard project. Also, we will show a live demo of traversing the graph of connections between publications, datasets, researchers and research projects across repositories and data infrastructures. Target Audience: Research data managers, government agency representatives, data infrastructure managers, and technologists who are interested in interoperabilities between research infrastructures

Research Data Alliance Plenary 9: DDRI Working Group Session

amiraryani

Research Graph: Connecting Identifiers across Research Data Infrastructures

amiraryani

Using Neo4j for exploring the research graph connections made by RD-Switchboard

amiraryani

In this talk, Jingbo Wang (NCI) and Amir Aryani (ANDS) have presented the Neo4j queries that can help data managers to explore the connections between datasets, researchers, grants, and publications using the graph model and Research Data Switchboard. In addition, they have discussed a paper on "Graph connections made by RD-Switchboard using NCI’s metadata", presented in the Reproducible Open Science workshop in Hannover September 2016.

ORCID in RD-Switchboard

amiraryani

Similar to Identity Awareness: Toward an Invisible e-Infrastructure for Identifying Data and Authors

Zooniverse teachers workshopLaura Whyte

Publishing of Scientific Data - Science Foundation Ireland Summit 2010

jodischneider

Exploring Process Barriers to Release Public Sector Information in Local Gove...

Peter Conradie

Research Data Sharing LERU

LIBER Europe

DataCite: the Perfect Complement to CrossRefCrossref

Profile of an Industry: Research Data Services

Tanner Jessel

Publishing biodiversity: The interplay between Scratchpads and the new Biodiv...

Dimitrios Koureas

Scott Edmunds: Data publication in the data deluge

GigaScience, BGI Hong Kong

Scott Edmunds: GigaScience - Big-Data, Data Citation and Future Data Handling

GigaScience, BGI Hong Kong

J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...

MusicNet

Scott Edmunds: Data Dissemination in the era of "Big-Data"

GigaScience, BGI Hong Kong

Scalable Identifiers for Natural History CollectionsJohn Kunze

NISO Forum, Denver, Sept. 24, 2012: Data Equivalence

National Information Standards Organization (NISO)

Identifying psychological research data in the digital environment.

Leibniz-Zentrum für Psychologische Information & Dokumentation

Mendeley Data: Enhancing Data Discovery, Sharing and Reuse

Anita de Waard

NISO Forum, Denver, Sept. 24, 2012: Scientific discovery and innovation in an...

National Information Standards Organization (NISO)

Similar to Identity Awareness: Toward an Invisible e-Infrastructure for Identifying Data and Authors (16)

Zooniverse teachers workshop

Publishing of Scientific Data - Science Foundation Ireland Summit 2010

Exploring Process Barriers to Release Public Sector Information in Local Gove...

Research Data Sharing LERU

DataCite: the Perfect Complement to CrossRef

Profile of an Industry: Research Data Services

Publishing biodiversity: The interplay between Scratchpads and the new Biodiv...

Scott Edmunds: Data publication in the data deluge

Scott Edmunds: GigaScience - Big-Data, Data Citation and Future Data Handling

J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...

Scott Edmunds: Data Dissemination in the era of "Big-Data"

Scalable Identifiers for Natural History Collections

NISO Forum, Denver, Sept. 24, 2012: Data Equivalence

Identifying psychological research data in the digital environment.

Mendeley Data: Enhancing Data Discovery, Sharing and Reuse

NISO Forum, Denver, Sept. 24, 2012: Scientific discovery and innovation in an...

More from amiraryani

Visualising Research Graph using Neo4j and Gephi

amiraryani

Using the Research Graph and Data Switchboard for cross-platform discovery

amiraryani

Research Data Alliance Plenary 9: DDRI Working Group Session

amiraryani

Research Graph: Connecting Identifiers across Research Data Infrastructures

amiraryani

Using Neo4j for exploring the research graph connections made by RD-Switchboard

amiraryani

ORCID in RD-Switchboard

amiraryani

Research Data and the Future of Software Engineering

amiraryani

Report from RDAPlenary 3 to DataCitation Community in Australia

amiraryani

Data Description Registry Interoperability WG at Research Data Alliance Third...

amiraryani

ORCID integration: A case study from ANDS and international development

amiraryani

Can we predict dependencies using domain information?

amiraryani

More from amiraryani (11)

Visualising Research Graph using Neo4j and Gephi

Using the Research Graph and Data Switchboard for cross-platform discovery

Research Data Alliance Plenary 9: DDRI Working Group Session

Research Graph: Connecting Identifiers across Research Data Infrastructures

Using Neo4j for exploring the research graph connections made by RD-Switchboard

ORCID in RD-Switchboard

Research Data and the Future of Software Engineering

Report from RDAPlenary 3 to DataCitation Community in Australia

Data Description Registry Interoperability WG at Research Data Alliance Third...

ORCID integration: A case study from ANDS and international development

Can we predict dependencies using domain information?

Identity Awareness: Toward an Invisible e-Infrastructure for Identifying Data and Authors

1. Identity Awareness: Toward an Invisible e-Infrastructure for Identifying Data and Authors Amir Aryani, Adrian Burton Australian National Data Service

2. Identity Awareness • Connecting data to • Researchers • Grants • Publications • Licence ODIN Project • Interoperability between • ORCID • DataCite

3. Identity Awareness

4. “My vision is a scientific community that does not waste resources on recreating data that have already been produced, in particular if public money has helped to collect those data in the first place.” Neelie Kroes, Vice-President of the European Commission, Digital Agenda

5. Research Data Australia (RDA) Number of published research collections in RDA 40,000" 30,000" 20,000" 10,000" 0" 2009)11" 2011)09" 2012)01" 2012)05" 2012)07" 2012)09" 2012)10"

6. Research Data Australia Coverage of Published Research Collections

7. Identity Awareness means knowing how to • Identify the researchers who contributed to a dataset • Identify the publications that use a dataset • Identify the related grant or the research project • Identify the licence for a dataset Researcher Licence Data Grants and Publication projects 7

8. RDA Quality Model for Data RIF-CS Elements Requirement 1 2 3 1 registry object Required * * * 2 originating source Required * * * 3 group Required * * * 4 key Required * * * 5 collection type Required * * * 6 name/title Required * * 7 related party (researcher or organisation) Required * * 8 description Required * * 9 location/address Required * * 10 rights (Licence) Required * * 11 activity (grant or research project) Required if available * 12 subject Recommended * 13 spatial coverage Recommended * 14 temporal coverage Recommended * 15 citation Recommended * 16 identifier Recommended *

9. ODIN Project

10. 10

11. Open Researcher & Contributor ID Work Connecting researcher to (Publication) (Data) Partners: Grants • American Physical Society • CrossRef Affiliations • Elsevier • Thomson Reuters Patents • Wellcome Trust • …

12. DataCite 1,104,998 Digital Object Identifiers (DOI) by DataCite The information on this slide was captured on 30 Oct 2012

13. High energy physics data

14.

15.

16. Social Sciences Cohort Data

17. Data Creator, Researcher, Author Birth Cohort Study dataset Non- Birth Cohort Study dataset Derived dataset Grey Literature 1958 Published article Citation Data Creator Derived Data Creator External Data input Author: Grey lit External Data Author: Article (Census, Health etc ) 1970

18. Data Creator, Researcher, Author Birth Cohort Study dataset Non- Birth Cohort Study dataset Derived dataset Grey Literature 1958 Published article Citation Data Creator Derived Data Creator External Data input Author: Grey lit External Data Author: Article (Census, Health etc ) 1970

19. Data Creator, Researcher, Author Birth Cohort Study dataset Non- Birth Cohort Study dataset Derived dataset Grey Literature 1958 Published article Citation Data Creator Derived Data Creator External Data input Author: Grey lit External Data Author: Artticle (Census, Health etc ) 1970

20. Data Creator, Researcher, Author Birth Cohort Study dataset Non- Birth Cohort Study dataset Derived dataset Grey Literature 1958 Published article Citation Data Creator Derived Data Creator External Data input Author: Grey lit External Data Author: Article (Census, Health etc ) 1970

21. Acknowledgment ANDS is supported by the The ODIN project is funded by the Australian Government through European Union under FP7 call the National Collaborative INFRA-2012-3.3 (Grant Agreement Research Infrastructure Strategy number 312788) Program and the Education Investment Fund (EIF) Super Science Initiative

22. Conclusion Enabling identity awareness is an international challenge that requires a collaborative effort. ANDS encourage your collaboration in this area and particularly to investigate these questions: • How can we measure and improve identity awareness of research data? • How can we measure and improve data reuse? • How can we measure research impact? • How can your organisation take advantage of the some of the emerging global identity infrastructures?

Identity Awareness: Toward an Invisible e-Infrastructure for Identifying Data and Authors

Recommended

Recommended

More Related Content

Similar to Identity Awareness: Toward an Invisible e-Infrastructure for Identifying Data and Authors

Similar to Identity Awareness: Toward an Invisible e-Infrastructure for Identifying Data and Authors (16)

More from amiraryani

More from amiraryani (11)

Identity Awareness: Toward an Invisible e-Infrastructure for Identifying Data and Authors