Scott Edmunds, GigaScience/HKU
Quantifying how FAIR is Hong Kong: The Hong Kong
Shareability of Hong Kong University Research Experiment
The Hong Kong experience.
Asia’s Academic City?
8 Universities, many ranked top 50 worldwide
100K students (UG/PG/FT/PT)
1 major research funder (UGC/RGC)
UGC Policy: “Realization of making Hong Kong Asia's world city is only possible if it is based upon the platform of a very strong education and higher education sector.”
http://www.ugc.edu.hk/eng/ugc/policy/policy.htm
Research Data policies growing globally
http://ec.europa.eu/research/openscience/index.cfm?section=monitor&pg=researchdata#1
http://dx.doi.org/10.17477/jcea.2018.17.2.200
…meanwhile in Hong Kong
“This ambivalence was reflected by the chairman of the Research Grants Council, who stated in an interview that ‘there is no relationship between world-class research and release of data’, questioning whether anyone might be interested in the completeness of data.
The chairman also saw a conflict between competitiveness and openness, arguing that the reputation of a researcher is built on publications, not on the underlying data.”
No policies, Mo’ problems
If Government doesn’t act, Universities need to lead the way
http://www.rss.hku.hk/integrity/research-data-records-management
First CRIS in HK, built upon Scholars Hub
http://hub.hku.hk/advanced-search?location=crisdataset
(CRIS = current research information system)
http://lib.hku.hk/researchdata/rpg.htm
“Beginning with the September 2017 intake, all HKU
research postgraduate (rpg) students have responsibility
for 1) using a data management plan (DMP), where
applicable, to describe the use of data in preparation for,
or in the generation of their theses, and 2) depositing,
where applicable, a dataset in the HKU Scholars Hub.”
Growing # of OA journals addressing this
http://dx.doi.org/10.1371/journal.pmed.1001607
CAN WE QUANTIFY IF THIS IS
WORKING?
http://reproducibility.cs.arizona.edu/
Arizona Repeatability in
Computer Science Experiment
• 2015 study examining the extent to which Computer Systems researchers share their research artifacts (code)
• NSF policies on sharing code since 2005
• Examined 613 papers from ACM conferences & journals
• Attempted to locate source code that backed up results
• If found, tried to build the code.
• Manual curation: look for code that backed up results
• If missing, emailed authors
• Chased if no reply
• If found, tried to build the code
• Resolve issues
• Survey results
613 papers tested, 123 successful reproductions (20%)
Can we do something similar in HK?
Teaching HKU MLIM students module on data curation and management.
HKU Repeatability in HK
Research Experiment
• HKU policy on data sharing from 2015
• PLOS policy mandating sharing of supporting data from March 1, 2014
• HKU has published ≈400 PLOS ONE papers 2014-date
• Can we quantify reproducibility in a sample of these?
• Compare with other less stringent journals (e.g. Springer Nature data policy ranked journals [1])
• Can we follow Arizona and harness crowdsourced
(student) power?
1. https://www.springernature.com/gp/authors/research-data-policy/data-policy-types/12327096
• Easy exercise in literature curation for HKU MLIM
students
• Set as a project for 59 students, 2017-2019
http://hub.hku.hk/simple-search?query=&location=publication&sort_by=score&order=desc&rpp=25&filter_field_1=journal&filter_type_1=equals&filter_value_1=plos+one&etal=0&filtername=dateIssued&filterquery=[2014+TO+2019]&filtertype=equals
Another question:
Rise (and fall) of megajournals
NPG (Scientific Reports) copies the PLOS ONE model…
https://scholarlykitchen.sspnet.org/2018/01/10/future-oa-megajournal/
https://scholarlykitchen.sspnet.org/2016/01/06/plos-one-shrinks-by-11-percent/
Driven by impact factor or “easier” data policies?
“Because data requirements are not uniform
across all journals, PLOS has put itself at a
disadvantage as far as attracting authors because
other journals offer an easier path. If strictly
enforced, this new policy is likely to result in a
drop in submissions to PLOS journals. While no
other mega-journal has been able to shake PLOS
ONE’s hold on the market, this policy may provide
an opening for competitors to gain on PLOS ONE
and even overtake it.”
Can we quantify this?
• Students assigned 2 PLOS + 2 SciRep papers (268 total)
• Quickly scan the paper looking for supporting data
• If no data, go to the next paper
• If it uses data, is it all associated with the paper?
• If external data, is it available from a URL or accession?
• If “data available on request”, are the authors contactable?
• Spend up to about 10 minutes per article
• Add the data into a Google Doc; the teacher double-checks & marks students on accuracy
Homework/Case study: literature curation exercise
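The curation questions above amount to a small decision procedure. The following is a minimal sketch of it, not the protocol's own tooling: the `PaperRecord` fields, the `classify` function and the category labels are all hypothetical names chosen for illustration, and treating a working external link as sufficient for "accessible" is a simplifying assumption.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class PaperRecord:
    """One row of the (hypothetical) curation sheet for a single paper."""
    has_data: bool                                # does the paper present data at all?
    all_data_with_paper: bool = False             # is all of it attached to the paper?
    external_link_works: Optional[bool] = None    # None means no external data cited
    on_request: bool = False                      # "data available on request" statement
    author_responded: Optional[bool] = None       # None means not contacted yet


def classify(record: PaperRecord) -> str:
    """Walk the checklist in order and return an illustrative category label."""
    if not record.has_data:
        return "no data"
    if record.all_data_with_paper or record.external_link_works:
        return "accessible"
    if record.on_request:
        if record.author_responded is None:
            return "on request (not yet contacted)"
        return "accessible" if record.author_responded else "on request, no response"
    return "missing"


# A paper whose data is only "available upon request" and whose authors never replied:
print(classify(PaperRecord(has_data=True, on_request=True, author_responded=False)))
```

Keeping the checks in the same order as the checklist makes it easy for a second marker to see why a paper landed in a given category.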
Alternative: webscraping option (code in GitHub)…
https://github.com/jessesiu/hku_scholars_hub
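As a rough illustration of where a scraper might start, the sketch below rebuilds the Scholars Hub search URL quoted earlier in the deck from its query parameters. It is not the code in the repository above: the function name `build_search_url` is hypothetical, the parameter names are copied from that quoted URL, and it is an assumption that the server accepts the percent-encoded form of the date-range brackets that `urlencode` produces.

```python
from urllib.parse import urlencode

# Base address taken from the search URL quoted in the slides.
BASE = "http://hub.hku.hk/simple-search"


def build_search_url(journal: str, start_year: int, end_year: int, per_page: int = 25) -> str:
    """Assemble a journal-plus-date-range search URL (illustrative only)."""
    params = [
        ("query", ""),
        ("location", "publication"),
        ("sort_by", "score"),
        ("order", "desc"),
        ("rpp", per_page),
        ("filter_field_1", "journal"),
        ("filter_type_1", "equals"),
        ("filter_value_1", journal),
        ("etal", 0),
        ("filtername", "dateIssued"),
        ("filterquery", f"[{start_year} TO {end_year}]"),
        ("filtertype", "equals"),
    ]
    # urlencode turns spaces into "+" and escapes the square brackets.
    return f"{BASE}?{urlencode(params)}"


url = build_search_url("plos one", 2014, 2019)
print(url)
```

Building the query from named parameters rather than pasting a long string makes it simpler to rerun the same search for another journal or date range.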
See protocols in protocols.io: http://dx.doi.org/10.17504/protocols.io.6x7hfrn
Teachers protocol: http://dx.doi.org/10.17504/protocols.io.6x8hfrw
Students protocol: http://dx.doi.org/10.17504/protocols.io.6yahfse
Example
http://hub.hku.hk/handle/10722/223364
Is there data presented in the paper? – Yes
Is there external data, and if so what is the link/accession? – No
Is all the data in the paper available? – No
Comments – Has the questionnaire, but not the data, as it says “minimal anonymized dataset will be made available upon request”
If data “available on request”, do the authors respond if contacted?
Interesting examples
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0165978
Several examples of missing Infectious Disease data
http://www.vox.com/2015/6/17/8796225/mers-virus-data-sharing
http://www.nature.com/news/data-sharing-make-outbreak-research-open-access-1.16966
Results
148 papers
• 114 with data
• 27 data on request (respond 7, bounce 5, no response 17)
• 7 missing
• 121 with accessible data (114 + 7): 82% data accessibility
120 papers
• 79 with data
• 16 data on request (respond 8, no response 8)
• 25 missing
• 87 with accessible data (79 + 8): 72.5% data accessibility
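The percentages can be rechecked from the counts on the slide. This is a small worked calculation, assuming "accessible" means papers with data plus on-request papers whose authors responded (114 + 7 of 148, and 79 + 8 of 120); the function and variable names are illustrative.

```python
def accessibility_pct(with_data: int, responded: int, total: int) -> float:
    """Share of papers whose data could actually be obtained, as a percentage."""
    return 100 * (with_data + responded) / total


first_sample = accessibility_pct(with_data=114, responded=7, total=148)
second_sample = accessibility_pct(with_data=79, responded=8, total=120)

# "Data on request" outcomes pooled across both samples.
requests_made = 27 + 16
requests_answered = 7 + 8
failed_pct = 100 * (requests_made - requests_answered) / requests_made

print(f"{first_sample:.1f}% {second_sample:.1f}% {failed_pct:.1f}%")
# prints: 81.8% 72.5% 65.1%
```

The pooled failure rate of about 65% agrees with the figure quoted for failed requests in the lessons learned.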
External Data Sources
• A growing number of papers host data in general-purpose open-access repositories:
– figshare (12), Dryad (5), OSF (4), Zenodo (2), Dataverse (2), PANGAEA (2), DANS (1)
– Since 2016 figshare use has been dropping & OSF/Zenodo use increasing
– Large numbers of government, IR & institutional websites
– Other than one broken Dryad link, OA data repositories are much more stable than other URLs (many broken)
https://figshare.com/projects/HKU_Repeatability_in_HK_Research_Experiment/64118
Lessons Learned
Do not rely on handles
Instability of older HKU Scholars Hub Identifiers & data
• Going back to older papers (collected in early 2017), 3/49 (6%) handles have changed
• Checking back over time, the number of 2016/2017/2018 PLOS/SR papers listed keeps increasing (we have had to update our results)
Do not rely on “data available from our website”
http://bioinformatics.oxfordjournals.org/content/24/11/1381.long
Do not rely on “data available on request”
https://doi.org/10.1101/633255
Do not rely on “data available from the government”
HK Hospital Authority only shares data with researchers at UGC-funded universities in Hong Kong, with data access charges averaging 35,700 HKD per request [1]
1. https://www.accessinfo.hk/en/request/request_for_statistics_on_data_c
2. https://www.nature.com/articles/s41598-017-15579-z
Emailing the authors for the data:
“Thanks for your interest. I'm afraid we can't as the data came from our hospital authority which is highly strict in using of their data and would not allow us to use the data other the purposed we stated before.”
So why say it was available upon request?
Do not rely on GitHub (or google)
https://dev.to/mjraadi/if-you-don-t-know-now-you-know-github-is-restricting-access-for-users-from-iran-and-a-few-other-embargoed-countries-5ga9
Lessons Learned: never trust “data on request”
• “Data available on request” does not work (65% of requests failed after 2 attempts).
• Hong Kong Government (esp. Hospital Authority) data access policies are incompatible with international journal policies
• Email addresses are not checked by journals: 5 bounced (one wasn’t even in the correct format), and 1 paper gave a postal address only.
• The Data Access Committee (DAC) system is not working: none of the DACs of the listed consortia/cohort projects responded to emails (Children of 1997, Guangzhou Biobank Cohort Study, JAGES, and China Research Center on Aging DACs).
• Even if authors respond there are often problems:
– Terms and conditions, e.g. MTAs or co-authorship demanded, or only a sample of the processed data (not the raw data) could be shared as the authors were still writing publications.
– Data missing, e.g. they deleted the raw sequencing data.
https://figshare.com/projects/HKU_Repeatability_in_HK_Research_Experiment/64118
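A badly formed contact address is the kind of problem a submission system could catch automatically. The sketch below shows one deliberately loose shape check; it is an illustration only, not something any journal is known to run, the pattern and the name `looks_like_email` are assumptions, and passing the check says nothing about whether the mailbox still exists or will bounce.

```python
import re

# Loose shape check: something@something.something, with no spaces or extra "@".
EMAIL_SHAPE = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")


def looks_like_email(address: str) -> bool:
    """Return True if the string has the basic shape of an email address."""
    return EMAIL_SHAPE.match(address.strip()) is not None


for candidate in ["author@example.org", "author.example.org", "author@example"]:
    print(candidate, looks_like_email(candidate))
```

A check like this would only flag the malformed address; the bounced and unanswered cases still need a test email or a repository deposit instead of a contact address.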
Lessons Learned: problems with Scholars Hub
• Unstable identifiers – 6% (3/49) examples changed in 2
years
• Unstable indexing – numbers of historic publications
keep increasing (self-reporting by authors?)
• Unstable source of datasets: one example of data in a
thesis that was blocked for a period
• Inconsistent indexing/metadata – one example lacked a
link/DOI to the paper, inconsistent keywords & tagging
• Inconsistent authorship – multiple, unused ORCID IDs
registered by HKU
https://figshare.com/projects/HKU_Repeatability_in_HK_Research_Experiment/64118
Importance of FAIR snapshots
Why GigaScience set up
http://gigadb.org/
https://doi.org/10.1093/database/baz016
Foundational Principles
• Can’t trust “data available on request”: need an independent, trusted broker
• Follow FAIR principles (Findability, Accessibility, Interoperability, and Reusability) for data stewardship & offer unlimited data hosting
• Use globally unique and persistent (stable) identifiers, e.g. DataCite DOIs
• Need to take unlimited-size snapshots of the “version of record” (data, code…)
• Increase Reusability with interoperable CC licensing (we use CC0)
• Increase Findability & Reusability with rich open metadata (field specific, DataCite, schema.org) and wide indexing (DataCite, NIH datamed, DCI, etc.)
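To make the metadata point concrete, here is a minimal sketch of a schema.org `Dataset` description that combines a DOI-based identifier with a CC0 licence. It is not GigaDB's actual metadata record: the function name `dataset_jsonld`, the dataset title, the description and the DOI `10.1234/example` are all hypothetical placeholders.

```python
import json


def dataset_jsonld(name: str, doi: str, description: str) -> dict:
    """Build a small schema.org Dataset record as a JSON-LD dictionary."""
    return {
        "@context": "https://schema.org",
        "@type": "Dataset",
        "name": name,
        "description": description,
        # Persistent identifier expressed as a resolvable DOI link.
        "identifier": f"https://doi.org/{doi}",
        # CC0 public domain dedication, as mentioned in the principles above.
        "license": "https://creativecommons.org/publicdomain/zero/1.0/",
    }


record = dataset_jsonld(
    name="Example dataset (hypothetical)",
    doi="10.1234/example",  # placeholder DOI, not a real record
    description="Placeholder description of the supporting data.",
)
print(json.dumps(record, indent=2))
```

Embedding a block like this in a dataset landing page is one common way for indexers to pick up the identifier and licence without scraping the page text.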
Thanks to:
Laurie Goodman, Editor in Chief
Nicole Nogoy, Editor
Hans Zauner, Assistant Editor
Hongling Zhao, Assistant Editor
Peter Li, Lead Data Manager
Chris Hunter, Lead BioCurator
Chris Armit, Data Scientist
Mary Ann Tulli, Data Editor
Xiao (Jesse) Si Zhe, Database Developer
Chen Qi, Shenzhen Office.
+ HKU MLIM students
Follow us:
@GigaScience
facebook.com/GigaScience
http://gigasciencejournal.com/blog/
Weibo & WeChat
www.gigasciencejournal.com
www.gigadb.org

 
Chris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
Chris Armit at IDW2018: Democratising Data Publishing: A Global PerspectiveChris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
Chris Armit at IDW2018: Democratising Data Publishing: A Global Perspective
GigaScience, BGI Hong Kong
 
EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...
EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...
EMBL OA Week: FAIR or unfair? Principled publishing for more Open & Democrati...
GigaScience, BGI Hong Kong
 
Reproducible method and benchmarking publishing for the data (and evidence) d...
Reproducible method and benchmarking publishing for the data (and evidence) d...Reproducible method and benchmarking publishing for the data (and evidence) d...
Reproducible method and benchmarking publishing for the data (and evidence) d...
GigaScience, BGI Hong Kong
 
Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...
Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...
Mary Ann Tuli: What MODs can learn from Journals – a GigaDB curator’s perspec...
GigaScience, BGI Hong Kong
 
Laurie Goodman: Sharing and Reusing Cell Image Data, ASCB/EMBO 2017 Subgroup ...
Laurie Goodman: Sharing and Reusing Cell Image Data, ASCB/EMBO 2017 Subgroup ...Laurie Goodman: Sharing and Reusing Cell Image Data, ASCB/EMBO 2017 Subgroup ...
Laurie Goodman: Sharing and Reusing Cell Image Data, ASCB/EMBO 2017 Subgroup ...
GigaScience, BGI Hong Kong
 

Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability of Hong Kong University Research Experiment

  • 1. Scott Edmunds, GigaScience/HKU Quantifying how FAIR is Hong Kong: The Hong Kong Shareability of Hong Kong University Research Experiment
  • 2. The Hong Kong experience. Asia’s Academic City? 8 universities, many ranked top 50 worldwide; 100K students (UG/PG/FT/PT); 1 major research funder (UGC/RGC). UGC Policy: “Realization of making Hong Kong Asia's world city is only possible if it is based upon the platform of a very strong education and higher education sector.” http://www.ugc.edu.hk/eng/ugc/policy/policy.htm
  • 3. Research Data policies growing globally http://ec.europa.eu/research/openscience/index.cfm?section=monitor&pg=researchdata#1
  • 4. http://dx.doi.org/10.17477/jcea.2018.17.2.200 …meanwhile in Hong Kong “This ambivalence was reflected by the chairman of the Research Grants Council, who stated in an interview that ‘there is no relationship between world-class research and release of data’, questioning whether anyone might be interested in the completeness of data. The chairman also saw a conflict between competitiveness and openness, arguing that the reputation of a researcher is built on publications, not on the underlying data.”
  • 6. If Government doesn’t act, Universities need to lead the way http://www.rss.hku.hk/integrity/research-data-records-management
  • 7. First CRIS in HK, built upon Scholars Hub http://hub.hku.hk/advanced-search?location=crisdataset (CRIS = current research information system)
  • 8. First CRIS in HK, built upon Scholars Hub http://lib.hku.hk/researchdata/rpg.htm “Beginning with the September 2017 intake, all HKU research postgraduate (rpg) students have responsibility for 1) using a data management plan (DMP), where applicable, to describe the use of data in preparation for, or in the generation of their theses, and 2) depositing, where applicable, a dataset in the HKU Scholars Hub.”
  • 9. Growing # of OA journals addressing this http://dx.doi.org/10.1371/journal.pmed.1001607
  • 10. CAN WE QUANTIFY IF THIS IS WORKING?
  • 11. http://reproducibility.cs.arizona.edu/ Arizona Repeatability in Computer Science Experiment • 2015 study examining the extent to which Computer Systems researchers share their research artifacts (code) • NSF policies on sharing code since 2005 • Examined 613 papers from ACM conferences & journals • Attempted to locate source code that backed up results • If found, tried to build the code.
  • 12. http://reproducibility.cs.arizona.edu/ Arizona Repeatability in Computer Science Experiment • Manual curation/look for code that backed up results • If missing, emailed authors • Chased if no reply • If found, tried to build the code • Resolve issues • Survey results
  • 13. http://reproducibility.cs.arizona.edu/ Arizona Repeatability in Computer Science Experiment: 613 papers tested, 123 successful reproductions (20%)
  • 14. Can we do something similar in HK? Teaching HKU MLIM students module on data curation and management.
  • 15. HKU Repeatability in HK Research Experiment • HKU policy on data sharing from 2015 • PLOS policy mandating sharing of supporting data from March 1, 2014 • HKU has published ≈400 PLOS ONE papers from 2014 to date • Can we quantify reproducibility in a sample of these? • Compare with other less stringent journals (e.g. journals ranked under the Springer Nature data policy types [1]) • Can we follow Arizona and harness crowdsourced (student) power? 1. https://www.springernature.com/gp/authors/research-data-policy/data-policy-types/12327096
  • 16. HKU Repeatability in HK Research Experiment • Easy exercise in literature curation for HKU MLIM students • Set as a project for 59 students, 2017-2019 http://hub.hku.hk/simple-search?query=&location=publication&sort_by=score&order=desc&rpp=25&filter_field_1=journal&filter_type_1=equals&filter_value_1=plos+one&etal=0&filtername=dateIssued&filterquery=[2014+TO+2019]&filtertype=equals
  • 17. https://scholarlykitchen.sspnet.org/2018/01/10/future-oa-megajournal/ NPG (Scientific Reports) copies the PLOS One model… Another question: Rise (and fall) of megajournals
  • 18. HKU Repeatability in HK Research Experiment https://scholarlykitchen.sspnet.org/2016/01/06/plos-one-shrinks-by-11-percent/ Rise (and fall) of megajournals Driven by impact factor or “easier” data policies? “Because data requirements are not uniform across all journals, PLOS has put itself at a disadvantage as far as attracting authors because other journals offer an easier path. If strictly enforced, this new policy is likely to result in a drop in submissions to PLOS journals. While no other mega-journal has been able to shake PLOS ONE’s hold on the market, this policy may provide an opening for competitors to gain on PLOS ONE and even overtake it.” Can we quantify this?
  • 19. HKU Repeatability in HK Research Experiment Homework/Case study: literature curation exercise • Students assigned 2 PLOS + 2 SciRep papers (268 total) • Quickly scan the paper looking for supporting data • If no data, go to the next paper • If it uses data, is it all associated with the paper? • If external data, is it available from a URL or accession? • If “data available on request”, are the authors contactable? • Spend up to about 10 minutes per article • Add the data to a Google Doc; the teacher double-checks and marks students on accuracy
  • 20. HKU Repeatability in HK Research Experiment Alternative: webscraping option (code in GitHub)… https://github.com/jessesiu/hku_scholars_hub
  • 21. HKU Repeatability in HK Research Experiment See protocols in protocols.io: http://dx.doi.org/10.17504/protocols.io.6x7hfrn Teachers protocol: http://dx.doi.org/10.17504/protocols.io.6x8hfrw Students protocol: http://dx.doi.org/10.17504/protocols.io.6yahfse
  • 22. HKU Repeatability in HK Research Experiment Example http://hub.hku.hk/handle/10722/223364
  • 23. HKU Repeatability in HK Research Experiment Example • Is there data presented in the paper? – Yes • Is there external data, and if so what is the link/accession? – No • Is all the data in the paper available? – No • Comments – Has questionnaire, but not data, as it says “minimal anonymized dataset will be made available upon request”
  • 24. HKU Repeatability in HK Research Experiment If data “available on request”, do the authors respond if contacted? Example
  • 26. Interesting examples Several examples of missing Infectious Disease data http://www.vox.com/2015/6/17/8796225/mers-virus-data-sharing http://www.nature.com/news/data-sharing-make-outbreak-research-open-access-1.16966
  • 28. Data accessibility (flow diagram): 148 papers; 114 with data; 27 data on request (respond 7, bounce 5, no response 17); 7 missing; 121 with accessible data (82%)
  • 29. Data accessibility (flow diagram): 120 papers; 79 with data; 16 data on request (respond 8, no response 8); 25 missing; 87 with accessible data (72.5%)
  • 30. External Data Sources • Growing number of papers host data via general-purpose open-access repositories: – figshare (12), Dryad (5), OSF (4), Zenodo (2), Dataverse (2), PANGAEA (2), DANS (1) – Since 2016 figshare use has been dropping & OSF/Zenodo use increasing – Large numbers of government, IR & institutional websites – Other than one broken Dryad link, OA data repositories are much more stable than other URLs (many broken) https://figshare.com/projects/HKU_Repeatability_in_HK_Research_Experiment/64118
  • 32. Do not rely on handles Instability of older HKU Scholars Hub identifiers & data • Going back to older papers (collected in early 2017), 3/49 (6%) handles have changed • Checking back over time, the number of 2016/2017/2018 PLOS/SR papers listed keeps increasing (we have had to update our results)
  • 33. Do not rely on “data available from our website” http://bioinformatics.oxfordjournals.org/content/24/11/1381.long
  • 34. Do not rely on “data available on request” https://doi.org/10.1101/633255
  • 35. Do not rely on “data available from the government” HK Hospital Authority only shares data with researchers at UGC-funded universities in Hong Kong, with data access charges averaging 35,700 HKD per request [1]. Emailing the authors for the data: “Thanks for your interest. I'm afraid we can't as the data came from our hospital authority which is highly strict in using of their data and would not allow us to use the data other the purposed we stated before.” So why say it was available upon request? 1. https://www.accessinfo.hk/en/request/request_for_statistics_on_data_c 2. https://www.nature.com/articles/s41598-017-15579-z
  • 36. Do not rely on GitHub (or Google) https://dev.to/mjraadi/if-you-don-t-know-now-you-know-github-is-restricting-access-for-users-from-iran-and-a-few-other-embargoed-countries-5ga9
  • 37. Lessons Learned: never trust “data on request” • “Data available on request” does not work (65% of requests failed after 2 attempts). • Hong Kong Government (esp. Hospital Authority) data access policies are incompatible with international journal policies. • Email addresses are not checked by journals: 5 bounced (one wasn’t even in the correct format), and 1 example gave a postal address only. • The Data Access Committee (DAC) system is not working: none of the DACs of the listed consortia/cohort projects responded to emails (Children of 1997, Guangzhou Biobank Cohort Study, JAGES, and China Research Center on Aging DACs). • Even if authors respond there are often problems: • Terms and conditions, e.g. MTAs or co-authorship; some could share a sample of the processed data but not the raw data, as they were still writing publications. • Data missing, e.g. they deleted the raw sequencing data. https://figshare.com/projects/HKU_Repeatability_in_HK_Research_Experiment/64118
  • 38. Lessons Learned: problems with Scholars Hub • Unstable identifiers – 6% (3/49) examples changed in 2 years • Unstable indexing – numbers of historic publications keep increasing (self-reporting by authors?) • Unstable source of datasets: one example of data in a thesis that was blocked for a period • Inconsistent indexing/metadata – one example lacked a link/DOI to the paper, inconsistent keywords & tagging • Inconsistent authorship – multiple, unused ORCID IDs registered by HKU https://figshare.com/projects/HKU_Repeatability_in_HK_Research_Experiment/64118
  • 39. Importance of FAIR snapshots Why GigaScience set up http://gigadb.org/
  • 40. Importance of FAIR snapshots Why GigaScience set up https://doi.org/10.1093/database/baz016 Foundational Principles • Can’t trust “data available on request”: need an independent, trusted broker • Follow FAIR principles (Findability, Accessibility, Interoperability, and Reusability) for data stewardship & offer unlimited data hosting • Use globally unique and persistent (stable) identifiers, e.g. DataCite DOIs • Need to take unlimited-size snapshots of the “version of record” (data, code…) • Increase Reusability with Interoperable CC licensing (we use CC0) • Increase Findability & Reusability with rich open metadata (field specific, DataCite, schema.org) and wide indexing (DataCite, NIH datamed, DCI, etc.)
  • 41. Thanks to: Laurie Goodman, Editor in Chief; Nicole Nogoy, Editor; Hans Zauner, Assistant Editor; Hongling Zhao, Assistant Editor; Peter Li, Lead Data Manager; Chris Hunter, Lead BioCurator; Chris Armit, Data Scientist; Mary Ann Tuli, Data Editor; Xiao (Jesse) Si Zhe, Database Developer; Chen Qi, Shenzhen Office; + HKU MLIM students. Follow us: @GigaScience, facebook.com/GigaScience, http://gigasciencejournal.com/blog/, www.gigasciencejournal.com, www.gigadb.org, + Weibo & WeChat
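
The headline percentages on slides 28, 29 and 37 can be re-derived from the tallies printed on those slides. The following is a minimal Python sketch, not the authors' analysis code. The names `SETS`, `set_a`, `set_b`, `pct`, `accessible` and `failed_request_rate` are hypothetical. It assumes that "accessible" means papers with data plus papers whose authors responded to a request, and that a request "failed" when no data was obtained. Under that assumption the accessible count for the 120-paper set is 79 + 8 = 87, which is the value consistent with the stated 72.5%.

```python
# Tallies copied from slides 28 and 29. The set names are placeholders:
# the slides do not say which journal each set corresponds to.
SETS = {
    "set_a": {"papers": 148, "with_data": 114, "on_request": 27, "responded": 7},
    "set_b": {"papers": 120, "with_data": 79, "on_request": 16, "responded": 8},
}


def pct(numerator: int, denominator: int) -> float:
    """Return numerator/denominator as a percentage, rounded to one decimal."""
    return round(100 * numerator / denominator, 1)


def accessible(tally: dict) -> int:
    """Assumed definition: data in the paper, or obtained after a request."""
    return tally["with_data"] + tally["responded"]


def failed_request_rate(sets: dict) -> float:
    """Share of 'data on request' cases, pooled over sets, that yielded no data."""
    requested = sum(t["on_request"] for t in sets.values())
    responded = sum(t["responded"] for t in sets.values())
    return pct(requested - responded, requested)


if __name__ == "__main__":
    for name, tally in SETS.items():
        n_ok = accessible(tally)
        print(f"{name}: {n_ok}/{tally['papers']} accessible "
              f"({pct(n_ok, tally['papers'])}%)")
    print(f"failed requests: {failed_request_rate(SETS)}%")
```

The two sets sum to the 268 papers assigned on slide 19, and pooling the 27 + 16 requests reproduces the roughly 65% failure rate quoted on slide 37.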