Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Research Data Management in practice, RIA Data Management Workshop Adelaide 2017


Published on

Dr Paul Wong presented at the Adelaide Research Integrity Advisors Data Management Workshop, 16 June 2017

Published in: Education
  • Be the first to comment

  • Be the first to like this

Research Data Management in practice, RIA Data Management Workshop Adelaide 2017

  1. 1. Dr Paul Wong Research Integrity Advisor Data Management Workshop Senior Data Management Specialist 16 June 2017 Adelaide
  2. 2. The Australian National Data Service (ANDS) makes Australia’s research data assets more valuable for researchers, research institutions and the nation. Partnering Australian research organisations and co-funded 294 data projects, totally $54M Research Data Australia Cite My Data / DOIs minting Vocabulary service etc. In 2015, conducted over 100 workshops, forums, and webinars with over 4000 participants, developed online resources e.g. guides, videos etc.
  3. 3. • 40+ guides organised around different topics • Content is a moving target – changing policy landscape, new practices etc. • Designed as a community resource • If you see gaps, we want your help to make them better
  4. 4. • A dedicated set of webpages on data management • A community resource • If you see gaps, we want your help to make them better
  5. 5.
  6. 6. Research data: as input & output Research data may include: ü Laboratory and field notes ü Raw experimental data ü Analysed data ü Simulations and software ü Databases ü Clinical data, including clinical records ü Questionnaires/surveys ü Images and photographs ü Audio-visual materials Moynihan's field notes, Panama, 1958 – CC BY Screen capture of “Computer simulation of March 22, 2014 landslide event near Oso, Washington, by David L. George and Richard M. Iverson, USGS” CC BY
  7. 7. Creative arts research data Research data in the creative arts may include: ü Audio-visual recordings of a creative work ü Visual diaries ü Journals ü Drawings ü Photographs ü Manuscripts ü Musical annotations ü 3D models
  8. 8. Research Data: a Broad Church Hand written letters Images or photos Soil samples Tissue samples Archeological dig sites ….. Scanned & OCR version Scanned digital version Analysed result of samples Analysed result of samples 3D models of the dig site ….. Physical Digital ANDS’ primary focus is digital data
  9. 9. Why Bother? Why managing (digital) research data? In fact, why bother managing anything? • Prevent bad things from happening. • Enable good things to happen.
  10. 10. Data and Research Integrity Nature 533, 452–454 (26 May 2016) doi:10.1038/533452a. Reprint with permission © 2016 Macmillan
  11. 11. Data and Research Integrity “The Availability of Research Data Declines Rapidly with Article Age”, Vine et al, Current Biology, Volume 24, Issue 1, p94–97, 6 January 2014 • “For papers where authors reported the status of their data, the odds of the data being extant decreased by 17% per year...” • “Responses included authors being sure that the data were lost (e.g., on a stolen computer) or thinking that they might be stored in some distant location (e.g., their parent’s attic) to authors having some degree of certainty that the data are on a Zip or floppy disk in their possession but no longer having the appropriate hardware to access it.”
  12. 12. Make Data Awesome Open Research Data Collection Showcase communities/projects/open-research-data-collection #Dataimpact stories The companion case studies report of the Watt review 151202_case_studies_volume_nc_0.pdf
  13. 13. Data Management in Practice • One of ANDS’ guides to outline, in an easy to understand practical framework, how research data can be managed effectively in an institutional setting. • 15 key points – with short descriptions, 7 pages long. • Incorporating project management best practice • Shared responsibilities model • Continual data curation approach • Road tested with librarians, data managers, researchers and research support staff
  14. 14. The Current Thinking: FAIR Findable, Accessible, Interoperable, Reusable 15 principles to ensure research data is FAIR Mark D. Wilkinson et al. The FAIR Guiding Principles for scientific data management and stewardship, Scientific Data (2016). DOI: 10.1038/sdata.2016.18 “FAIRness is a prerequisite for proper data management and data stewardship”
  15. 15.
  16. 16. Continual data curation across domains
  17. 17. Data Curation as Documentation Assigning metadata (structured data about the data) • Who collected the data? • Who funded the research project? • When (and where) was it collected? • Instruments and setting for collecting the data? • Title of the dataset • Methods used to process the data • Etc. etc.
  18. 18. Light Touch Heavy Duty Ecological Geographic Biological Metadata Structured Detailed Machine readable Structured Minimal Human readable
  19. 19. What is Data Citation? Data citation refers to the practice of providing a reference to data in the same way as researchers routinely provide a bibliographic reference to outputs such as journal articles, reports and conference papers. Citing data is now recognised as one of the key practices leading to recognition of data as a primary research output. identifiers/data-citation
  20. 20. Data Citation Standard A standard citation would include the following elements: Author(s) (Year) : Title. Publisher(s). DOI (if used) Hanigan, Ivan (2012): Monthly drought data for Australia 1890-2008 using the Hutchinson Drought Index. The Australian National University Australian Data Archive. Alternatively, Author(s) (Year): Title. Version. Publisher(s). ResourceType. Identifier Bradford, Matt; Murphy, Helen; Ford, Andrew; Hogan, Dominic; Metcalfe, Dan (2014): CSIRO Permanent Rainforest Plots of North Queensland. v2. CSIRO. Data Collection.
  21. 21. Institutional Policy and Procedures Support services - people and other means of providing advice and support IT Infrastructure - the hardware, software and other facilities Metadata management - so that data records can be meaningful and fit for purpose Institutional Data Management Framework Pre Research
  22. 22. Data Management Plan • data organisation and storage; • metadata standards and guidelines; • backups; • archiving for long-term preservation; • version control and derived data products; • data sharing or publishing intentions, including licensing; • ensuring security of confidential data; • data synchronisation; and • governance, roles and responsibilities. Pre Research
  23. 23. Storage requirements may vary across domains
  24. 24. Publishing and Sharing Data Metadata Research Data Open Open Open Closed Closed Open Closed Closed Publishing and Sharing data ≠ Open Access to data “Open” and “Closed” are relative concepts. “Closed” ≈ conditional access based on individual permission “Closed” ≈ conditional access based on roles Post Research
  25. 25. Ethics Clearance and Data Access: A Case Study Data Managing and Sharing Research Data: A Guide to Good Practice, SAGE 2014 data/book240297 Colin Smith [CC BY-SA 2.0] (], via Wikimedia Commons from Wikimedia Commons)
  26. 26. Ethics Clearance and Data Access: A Case Study Health and Social Consequences of the Foot and Mouth Disease Epidemic in North Cumbria, 2001- 2003 (M. Mort Lancaster University 2006, funded by the Department of Health UK, Study Number 5407) • 54 local people were recruited to write weekly diaries over 18 months to describe their lives and the recovery they observed around the area • The study was supplemented with interviews and focus group discussions that included other stakeholders • The study obtained consent from participants before the research but did not get consent for sharing and archiving data • The research team and the Department of Health wanted to share and archive the data after the completion of the research. • Had to get consent retrospectively and needed expert advice from copyright specialists
  27. 27. The framework treats DM as a set of coordinated activities to preserve the evidence base of research findings and to make the evidence base more accessible and reusable in the long run.
  28. 28. Wise Advise an-early-career-researcher/ Mistakes I’ve made as an early career researcher APRIL 5, 2016 Nicola Hemmings (post-doc, University of Sheffield) Failing to organise my data adequately (circa 2007). Prepare your datasets like you would if you were giving them to a stranger who knew nothing about them. Label, annotate and meticulously file your R scripts. Incorporate read-me files into everything and write them for the monkey that will be you in five years, when you return to your data and/or analyses for some unforeseen but vitally important reason. Don’t get this wrong. You will regret it.
  29. 29. Special Healthy Data Year ‘Sharing health-y data: challenges and solutions’ workshops ANDS ran in all capital cities in 2016-2017 Attended by researchers, and staff from the library, research office and ethics office Topics covered § The data sharing landscape: funders and publishers § Data de-identification § Ethics and informed consent § Licensing data § How research data can be published (mediated access, metadata, repositories)
  30. 30. Special Healthy Data Year Coming up: Health and Medical Data: 3 Short Lunchtime Bites Webinars in May 2017 Workshops with Health Libraries Australia: 10 medical and health research data Things 'train the trainer' workshops. 31 May in Brisbane, 13 June in Melbourne, 14 July in Perth. For health librarians.
  31. 31. Senior Data Management Specialist +61 2 6125 0586 Dr Paul Wong With the exception of logos, third party images or where otherwise indicated, this work is licensed under the Creative Commons 4.0 International Attribution Licence. ANDS is supported by the Australian Government through the National Collaborative Research Infrastructure Strategy Program. Monash University leads the partnership with the Australian National University and CSIRO.