This document discusses managing sensitive data in repositories. It notes that repositories can support fully open, mediated access, or closed/private data, and most contain a combination of open and mediated access data. Sensitive health data can be managed in repositories by de-identifying data where possible, restricting access otherwise, and ensuring proper licensing, metadata, and identifiers are included to aid discovery and reuse while maintaining privacy. The biggest challenge is having longitudinal individual data needed for research while maintaining privacy through de-identification, as some data may need to be omitted from datasets as a result.
The challenge of sharing data well, how publishers can helpVarsha Khodiyar
Researchers, academic institutes and funders are increasingly recognizing the importance of data sharing for reproducible science. However, it is not always straightforward and clear to researchers as to how best to share data in a useful way. At Springer Nature we are working on several initiatives to help facilitate the sharing of research data in a reusable way, with our overarching goal being to publish research that is robust and reproducible. I will talk about the effort that goes into our flagship data journal, Scientific Data, to facilitate best practices in publication and sharing of research data, and share some of our experiences publishing Challenge datasets. I will also describe some of the newer Research Data Services that are now available to help all researchers (not only Springer Nature authors) to share their data in a useful way.
Presentation by Dr Jeff Christiansen, Intersect. Presented at the ANDS/Intersect sharing health-y data: challenges and solutions II workshop on 26th October 2016
Presentation by Dr Steve McEachern, ADA, to the 'Unlocking value from publicly funded Clinical Research Data' workshop, cohosted by ARDC and CSIRO at ANU on 6 March 2019.
The challenge of sharing data well, how publishers can helpVarsha Khodiyar
Researchers, academic institutes and funders are increasingly recognizing the importance of data sharing for reproducible science. However, it is not always straightforward and clear to researchers as to how best to share data in a useful way. At Springer Nature we are working on several initiatives to help facilitate the sharing of research data in a reusable way, with our overarching goal being to publish research that is robust and reproducible. I will talk about the effort that goes into our flagship data journal, Scientific Data, to facilitate best practices in publication and sharing of research data, and share some of our experiences publishing Challenge datasets. I will also describe some of the newer Research Data Services that are now available to help all researchers (not only Springer Nature authors) to share their data in a useful way.
Presentation by Dr Jeff Christiansen, Intersect. Presented at the ANDS/Intersect sharing health-y data: challenges and solutions II workshop on 26th October 2016
Presentation by Dr Steve McEachern, ADA, to the 'Unlocking value from publicly funded Clinical Research Data' workshop, cohosted by ARDC and CSIRO at ANU on 6 March 2019.
This brief presentation about Airtable by Christy Navarro and Amy Studer was delivered via Zoom on February 8, 2022, as part of the MLA 2022 My Favorite Tool competition.
Introduction to the Environmental Data Initiative (EDI)Corinna Gries
The Environmental Data Initiative enables the environmental science community to maximize knowledge development through the reusability of FAIR environmental data by providing curation services, training, and a robust and modern data repository.
Please cite as: Gries, Corinna. (2018, December). Introduction to the Environmental Data Initiative (EDI) (Version 1.0). Zenodo. http://doi.org/10.5281/zenodo.4672376
Long-term storage – will it fill up with the good stuff, or the big, bad, an...DCC-info
Presentation by Angus Whyte at DCC-Arkivum event 'Data Storage & Preservation Strategies for Research Data Management' at University of Edinburgh 27 October 2014
This is module 6 in the EDI Data Publishing training course. In this module, you will learn how to create quality metadata and be introduced to the landscape of data repositories and their functions.
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...ARDC
Dr Jacobs' introduction to the RIA Data Management Workshop in Brisbane on 31 March 2017. The RIA Data Management Workshop series is a joint collaboration of the Australian Research Council, the National Health and Medical Research Council, the Australasian Research Management Society and the Australian National Data Service.
Research data spring: extending the OPD to cover RDMJisc RDM
The research data spring project "Extending the Organisational Profile Document to cover Research Data Management" slides for the third sandpit workshop. Project led by Joy Davidson from the Digital Curation Centre.
Data management and sharing principles for health and medical data: CDUARDC
Presentation by Kate LeMay to researchers at Charles Darwin University on the topic "Data management and sharing principles for health and medical data" on 12 June 1028
Stop press: should embargo conditions apply to metadata?Jisc RDM
Sarah Middle of Cambridge University discusses whether embargo conditions should apply to metadata. Session held at the Research Data Network event in May 2016, Cardiff University.
OU Library Research Support webinar: Data sharingDaniel Crane
Slides from a webinar delivered on 06th February 2018 for OU research staff and students. Covers data sharing policies; Benefits of data sharing; Data repositories; Preparing data for sharing; and Re-using data.
ANDS: Increasing connections to add valueSimon Huggard
Andrew Treloar describes how the Australian National Data Service are working with partners to ensure data is available and linked together to add value to research impact.
What is sensitive data?
Can sensitive data be shared? 23 (research data) Things - Thing 10 Sharing Sensitive Data
Top five tips for sharing sensitive data
This brief presentation about Airtable by Christy Navarro and Amy Studer was delivered via Zoom on February 8, 2022, as part of the MLA 2022 My Favorite Tool competition.
Introduction to the Environmental Data Initiative (EDI)Corinna Gries
The Environmental Data Initiative enables the environmental science community to maximize knowledge development through the reusability of FAIR environmental data by providing curation services, training, and a robust and modern data repository.
Please cite as: Gries, Corinna. (2018, December). Introduction to the Environmental Data Initiative (EDI) (Version 1.0). Zenodo. http://doi.org/10.5281/zenodo.4672376
Long-term storage – will it fill up with the good stuff, or the big, bad, an...DCC-info
Presentation by Angus Whyte at DCC-Arkivum event 'Data Storage & Preservation Strategies for Research Data Management' at University of Edinburgh 27 October 2014
This is module 6 in the EDI Data Publishing training course. In this module, you will learn how to create quality metadata and be introduced to the landscape of data repositories and their functions.
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...ARDC
Dr Jacobs' introduction to the RIA Data Management Workshop in Brisbane on 31 March 2017. The RIA Data Management Workshop series is a joint collaboration of the Australian Research Council, the National Health and Medical Research Council, the Australasian Research Management Society and the Australian National Data Service.
Research data spring: extending the OPD to cover RDMJisc RDM
The research data spring project "Extending the Organisational Profile Document to cover Research Data Management" slides for the third sandpit workshop. Project led by Joy Davidson from the Digital Curation Centre.
Data management and sharing principles for health and medical data: CDUARDC
Presentation by Kate LeMay to researchers at Charles Darwin University on the topic "Data management and sharing principles for health and medical data" on 12 June 1028
Stop press: should embargo conditions apply to metadata?Jisc RDM
Sarah Middle of Cambridge University discusses whether embargo conditions should apply to metadata. Session held at the Research Data Network event in May 2016, Cardiff University.
OU Library Research Support webinar: Data sharingDaniel Crane
Slides from a webinar delivered on 06th February 2018 for OU research staff and students. Covers data sharing policies; Benefits of data sharing; Data repositories; Preparing data for sharing; and Re-using data.
ANDS: Increasing connections to add valueSimon Huggard
Andrew Treloar describes how the Australian National Data Service are working with partners to ensure data is available and linked together to add value to research impact.
What is sensitive data?
Can sensitive data be shared? 23 (research data) Things - Thing 10 Sharing Sensitive Data
Top five tips for sharing sensitive data
Brisbane Health-y Data: The guide to publishing and sharing sensitive dataARDC
Presentation given by Kate LeMay at the 'Sharing Health-y Data Workshop: Challenges and Solutions' event co-hosted by ANDS and HISA. Held on Wednesday 16th March 2016 at the Translational Research Institute, Brisbane, Australia.
Presentation given by Kate LeMay at the 'Sharing Health-y Data: Challenges and Solutions' workshop, held at The Menzies Research Institute (Hobart, Tasmania) on 28th June 2016. The event was co-hosted by ANDS and the University of Tasmania library
Lesson 2 in a set of 10 created by DataONE on Best Practices fo Data Management. The full module can be downloaded from the DataONE.org website at: http://www.dataone.org/educaiton-modules. Released under a CC0 license, attribution and citation requested.
Slides from Thursday 2nd August 2018 - Data in the Scholarly Communications Life Cycle Course which is part of the FORCE11 Scholarly Communications Institute.
Presenter - Natasha Simons
"Open Science, Open Data" training for participants of Software Writing Skills for Your Research - Workshop for Proficient, Helmholtz Centre Potsdam - GFZ German Research Centre for Geosciences, Telegrafenberg, December 16, 2015
Workshop - finding and accessing data - Cambridge August 22 2016Fiona Nielsen
Finding and accessing human genomic data for research
University of Cambridge, United Kingdom | Seminar Room G
Monday, 22 August 2016 from 10:00 to 12:00 (BST)
Charlotte, Nadia and Fiona presented an overview of data sources around the world where you can find genomics data for your research and gave examples of the data access application for dbGaP and EGA with specific details relevant for University of Cambridge researchers.
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...Tom Plasterer
Edge Informatics is an approach to accelerate collaboration in the BioPharma pipeline. By combining technical and social solutions knowledge can be shared and leveraged across the multiple internal and external silos participating in the drug development process. This is accomplished by making data assets findable, accessible, interoperable and reusable (FAIR). Public consortia and internal efforts embracing FAIR data and Edge Informatics are highlighted, in both preclinical and clinical domains.
This talk was presented at the Molecular Medicine Tri-Conference in San Francisco, CA on February 20, 2017
Research Data Management Services at UWA (November 2015)Katina Toufexis
Research Data Management Services at the University of Western Australia (November 2015).
Created by Katina Toufexis of the eResearch Support Unit (University Library).
CC-BY
Presentation by Hugo Leroux and Liming Zhu, CSIRO, to the 'Unlocking value from publicly funded Clinical Research Data' workshop, cohosted by ARDC and CSIRO at ANU on 6 March 2019.
Presentation by Kelly Hart, ONDC in PM&C, to the 'Unlocking value from publicly funded Clinical Research Data' workshop, cohosted by ARDC and CSIRO at ANU on 6 March 2019.
Presentation by Prof Chris Rowe, ADNet, to the 'Unlocking value from publicly funded Clinical Research Data' workshop, cohosted by ARDC and CSIRO at ANU on 6 March 2019.
Investigator-initiated clinical trials: a community perspectiveARDC
Presentation by Miranda Cumpston, ACTA, to the 'Unlocking value from publicly funded Clinical Research Data' workshop, cohosted by ARDC and CSIRO at ANU on 6 March 2019.
Presentation by Dr Merran Smith, PHRN, to the 'Unlocking value from publicly funded Clinical Research Data' workshop, cohosted by ARDC and CSIRO at ANU on 6 March 2019.
International perspective for sharing publicly funded medical research dataARDC
Presentation by Olivier Salvado, CSIRO, to the 'Unlocking value from publicly funded Clinical Research Data' workshop, cohosted by ARDC and CSIRO at ANU on 6 March 2019.
Presentation by Prof Lisa Askie, ANZCTR, to the 'Unlocking value from publicly funded Clinical Research Data' workshop, cohosted by ARDC and CSIRO at ANU on 6 March 2019.
Presentation by Dr Davina Ghersi, NHMRC, to the 'Unlocking value from publicly funded Clinical Research Data' workshop, cohosted by ARDC and CSIRO at ANU on 6 March 2019.
Presentation by Dr Adrian Burton, ARDC, to the 'Unlocking value from publicly funded Clinical Research Data' workshop, cohosted by ARDC and CSIRO at ANU on 6 March 2019.
FAIR for the future: embracing all things dataARDC
FAIR for the future: embracing all things data - Natasha Simons, Keith Russell and Liz Stokes, presented at Taylor & Francis Scholarly Summits in Sydney 11 Feb 2019 and Melbourne 14 Feb 2019.
How to make your data count webinar, 26 Nov 2018ARDC
Slides from the 26 Nov Make your data count webinar. The research community has long grappled with the problem of assessing and tracking the results of scholarship. Research data is no exception. The Make Data Count (MDC) project (https://makedatacount.org/), funded by the Sloan Foundation, has delivered a data usage metric standard (Code of Practice) and a workflow for the retrieval and display of standardised usage and citation metrics in your repository interface.
Listen to this webinar to learn more about the Make Data Count project and the 5 steps you can take to make the data in your repository count. Hear from MDC project team members who have already implemented MDC in the dash (https://dash.ucop.edu) and DataOne (https://search.dataone.org/data) repositories. Learn from their experience, see the results.
Our international speaker line-up includes Daniella Lowenberg (California Digital Library) and Patricia Cruse (DataCite).
Recording available: https://youtu.be/Lkysz0Mc7fo
As Europe's leading economic powerhouse and the fourth-largest hashtag#economy globally, Germany stands at the forefront of innovation and industrial might. Renowned for its precision engineering and high-tech sectors, Germany's economic structure is heavily supported by a robust service industry, accounting for approximately 68% of its GDP. This economic clout and strategic geopolitical stance position Germany as a focal point in the global cyber threat landscape.
In the face of escalating global tensions, particularly those emanating from geopolitical disputes with nations like hashtag#Russia and hashtag#China, hashtag#Germany has witnessed a significant uptick in targeted cyber operations. Our analysis indicates a marked increase in hashtag#cyberattack sophistication aimed at critical infrastructure and key industrial sectors. These attacks range from ransomware campaigns to hashtag#AdvancedPersistentThreats (hashtag#APTs), threatening national security and business integrity.
🔑 Key findings include:
🔍 Increased frequency and complexity of cyber threats.
🔍 Escalation of state-sponsored and criminally motivated cyber operations.
🔍 Active dark web exchanges of malicious tools and tactics.
Our comprehensive report delves into these challenges, using a blend of open-source and proprietary data collection techniques. By monitoring activity on critical networks and analyzing attack patterns, our team provides a detailed overview of the threats facing German entities.
This report aims to equip stakeholders across public and private sectors with the knowledge to enhance their defensive strategies, reduce exposure to cyber risks, and reinforce Germany's resilience against cyber threats.
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
Explore our comprehensive data analysis project presentation on predicting product ad campaign performance. Learn how data-driven insights can optimize your marketing strategies and enhance campaign effectiveness. Perfect for professionals and students looking to understand the power of data analysis in advertising. for more details visit: https://bostoninstituteofanalytics.org/data-science-and-artificial-intelligence/
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Managing sensitive data in your repository
1. Managing sensitive data in your
repository
Natasha Simons
Sharing Health-y and Sensitive Data: Challenges and Solutions Workshop
Perth 3 September 2015
2. What is a data repository?
1
A research data repository is a
managed environment capable of
storing and sharing (largely)
digital data. The data repository
supports the process of curating,
preserving, and sharing research
data.
4. Are repositories for open data only?
3
Yes and no….because it depends on the purpose/scope
Repositories can support data that is:
1. Open access only
2. Mediated access only
3. Closed/private only
Most data repositories are a combination of 1 & 2
5. Are there health data repositories?
4
Yes, many!
http://www.nlm.nih.gov/NIHbmic/nih_data_sharing_repositories.html
6. What’s the point of data repositories?
5
Data repositories assist researchers and
the research community to:
1. Support data sharing, data discovery &
reuse, data preservation
2. Comply with publisher requirements
3. Comply with funder requirements
4. Comply with institutional or govt policy
requirements
5. Support institutional goals Illustration credit: Ainsley Seago. doi:10.1371/journal.pbio.1001779.g001
7. Can sensitive data be managed in a repository?
6
Yes!
Ask:
• Can the raw data be (de-identified and)
made completely open? Or will access be
restricted? Mediated?
• What licence should be applied to enable
data reuse?
• What metadata elements, links (e.g. to
publications) and identifiers (e.g. DOIs,
ORCIDs) will aid discovery and reuse of the
data? Source: http://www.slideshare.net/WLSA_ORG/wh2014-workshop-health-data-consortium
8. Can sensitive data be managed in a repository?
7
Also ask:
• Can a citation element be added to
support attribution and reuse
tracking?
• Who/what will be the method of
contact for the data?
• Are there other conditions that the
data is subject to e.g. release subject
to an embargo period?
15. What’s really challenging?
14
“Having longitudinal data on individuals is a part of many observational designs, and is
needed for research into outcomes, efficacy and many mechanistic studies. Most
repositories thus have longitudinal observations. To build such a database you need some
way to link observations on the same identified person. Therefore most repositories contain
personally identified data, but, because of privacy concerns, they often release only de-
identified data. Difficulties in the de-identification process can cause some data to be
omitted in a dataset. A lack of direct identifiers in a data collection or federation could
prevent linking of data for some patients.
From: Wade, T. Traits and Types of Health Data Repositories. Health Information Science
and Systems 2014, 2:4 doi:10.1186/2047-2501-2-4
http://www.hissjournal.com/content/2/1/4