RARE and FAIR Science: Reproducibility and Research ObjectsCarole Goble
Keynote at JISC Digifest 2015 on Reproducibility and Research Objects in Scholarly Communication
Includes hidden slides
All material except maybe the IT Crowd screengrab reusable
Capturing Context in Scientific Experiments: Towards Computer-Driven Sciencedgarijo
Scientists publish computational experiments in ways that do not facilitate reproducibility or reuse. Significant domain expertise, time and effort are required to understand scientific experiments and their research outputs. In order to improve this situation, mechanisms are needed to capture the exact details and the context of computational experiments. Only then, Intelligent Systems would be able help researchers understand, discover, link and reuse products of existing research.
In this presentation I will introduce my work and vision towards enabling scientists share, link, curate and reuse their computational experiments and results. In the first part of the talk, I will present my work for capturing and sharing the context of scientific experiments by using scientific workflows and machine readable representations. Thanks to this approach, experiment results are described in an unambiguous manner, have a clear trace of their creation process and include a pointer to the sources used for their generation. In the second part of the talk, I will describe examples on how the context of scientific experiments may be exploited to browse, explore and inspect research results. I will end the talk by presenting new ideas for improving and benefiting from the capture of context of scientific experiments and how to involve scientists in the process of curating and creating abstractions on available research metadata.
My talk at the Open PHACTS last ever project meeting in Vienna 2016 where i was asked to talk about the challenges we addressed in open phacts with semantic web technology and what still needed to be done.
Recomendations for infrastructure and incentives for open science, presented to the Research Data Alliance 6th Plenary. Presenter: William Gunn, Director of Scholarly Communications for Mendeley.
FSUJena Environmental Cheminformatics to Identify Unknowns April 2019Emma Schymanski
The Environmental Cheminformatics group at the Luxembourg Centre for Systems Biomedicine focuses on the comprehensive identification of known and unknown chemicals in our environment to investigate their effects on health and disease. The environment and the chemicals to which we are exposed is incredibly complex, with over 125 million chemicals registered in the largest chemical registry and over 70,000 in household use alone. Detectable molecules in complex samples can now be captured using high resolution mass spectrometry (HRMS), which provides a “snapshot” of all chemicals present in a sample and allows for retrospective data analysis through digital archiving. However, scientists cannot yet identify the vast majority of the tens of thousands of features in each sample, leading to critical bottlenecks in identification and data interpretation. For instance, recent studies indicate a strong connection between the gut microbiome and Parkinson’s disease, yet over 60 % of significant metabolites in microbiome experiments are unknown. Unknown identification remains extremely time consuming and, in many cases, a matter of luck. Prioritizing efforts to find significant metabolites or potentially toxic substances responsible for observed effects is the key, which involves reconciling highly complex samples with expert knowledge and careful validation. This talk will cover European, US and worldwide community initiatives to help connect knowledge on chemistry and toxicity with environmental observations - from compound databases to spectral libraries and retrospective screening. It will touch on the challenges of standardized structure representations, data curation, deposition and communication between resources. Finally, it will show how interdisciplinary efforts and data sharing can facilitate research in metabolomics, exposomics and beyond. Kindly hosted by Prof. Christoph Steinbeck
RARE and FAIR Science: Reproducibility and Research ObjectsCarole Goble
Keynote at JISC Digifest 2015 on Reproducibility and Research Objects in Scholarly Communication
Includes hidden slides
All material except maybe the IT Crowd screengrab reusable
Capturing Context in Scientific Experiments: Towards Computer-Driven Sciencedgarijo
Scientists publish computational experiments in ways that do not facilitate reproducibility or reuse. Significant domain expertise, time and effort are required to understand scientific experiments and their research outputs. In order to improve this situation, mechanisms are needed to capture the exact details and the context of computational experiments. Only then, Intelligent Systems would be able help researchers understand, discover, link and reuse products of existing research.
In this presentation I will introduce my work and vision towards enabling scientists share, link, curate and reuse their computational experiments and results. In the first part of the talk, I will present my work for capturing and sharing the context of scientific experiments by using scientific workflows and machine readable representations. Thanks to this approach, experiment results are described in an unambiguous manner, have a clear trace of their creation process and include a pointer to the sources used for their generation. In the second part of the talk, I will describe examples on how the context of scientific experiments may be exploited to browse, explore and inspect research results. I will end the talk by presenting new ideas for improving and benefiting from the capture of context of scientific experiments and how to involve scientists in the process of curating and creating abstractions on available research metadata.
My talk at the Open PHACTS last ever project meeting in Vienna 2016 where i was asked to talk about the challenges we addressed in open phacts with semantic web technology and what still needed to be done.
Recomendations for infrastructure and incentives for open science, presented to the Research Data Alliance 6th Plenary. Presenter: William Gunn, Director of Scholarly Communications for Mendeley.
FSUJena Environmental Cheminformatics to Identify Unknowns April 2019Emma Schymanski
The Environmental Cheminformatics group at the Luxembourg Centre for Systems Biomedicine focuses on the comprehensive identification of known and unknown chemicals in our environment to investigate their effects on health and disease. The environment and the chemicals to which we are exposed is incredibly complex, with over 125 million chemicals registered in the largest chemical registry and over 70,000 in household use alone. Detectable molecules in complex samples can now be captured using high resolution mass spectrometry (HRMS), which provides a “snapshot” of all chemicals present in a sample and allows for retrospective data analysis through digital archiving. However, scientists cannot yet identify the vast majority of the tens of thousands of features in each sample, leading to critical bottlenecks in identification and data interpretation. For instance, recent studies indicate a strong connection between the gut microbiome and Parkinson’s disease, yet over 60 % of significant metabolites in microbiome experiments are unknown. Unknown identification remains extremely time consuming and, in many cases, a matter of luck. Prioritizing efforts to find significant metabolites or potentially toxic substances responsible for observed effects is the key, which involves reconciling highly complex samples with expert knowledge and careful validation. This talk will cover European, US and worldwide community initiatives to help connect knowledge on chemistry and toxicity with environmental observations - from compound databases to spectral libraries and retrospective screening. It will touch on the challenges of standardized structure representations, data curation, deposition and communication between resources. Finally, it will show how interdisciplinary efforts and data sharing can facilitate research in metabolomics, exposomics and beyond. Kindly hosted by Prof. Christoph Steinbeck
Reproducibility in Scientific Data Analysis - BioScience SeminarSamuel Lampa
Slides for a talk held at BioScience Seminar at Dept. of Pharmaceutical BioSciences at Uppsala University on December 16, 2016.
The event webpage: http://www.farmbio.uu.se/calendar/kalendarium-detaljsida/?eventId=22496
Structure of the talk:
Reproducibility in Scientific Data Analysis ...
● What is it?
● Why is it important?
● Why is it a problem?
● What can we do about it?
● What does pharmb.io do about it?
The increasing popularity of high mass accuracy non-target mass spectrometry methods has yielded extensive identification efforts based on spectral and chemical compound databases in the environmental community and beyond. Increasingly, new methods are relying on open data resources. Candidate structures are often retrieved with either exact mass or molecular formula from large resources such as PubChem, ChemSpider or the EPA CompTox Chemistry Dashboard. Smaller, selective lists of chemicals (also called “suspect lists”) can be used to perform more efficient annotation. Mass spectral libraries can then be used to increase the confidence in tentative identification. Additional metadata (e.g. exposure and hazard information, reference and data source information) can be extremely useful to prioritize substances of high environmental interest. Exchanging information and “sharing structural linkages” between these resources requires extensive curation to ensure that the correct information is shared correctly, yet many valuable datasets arise from scientists and regulators with little official cheminformatics training. This talk will cover curation efforts undertaken to map spectral libraries (e.g. MassBank.EU, mzCloud) and suspect lists from the NORMAN Suspect Exchange (http://www.norman-network.com/?q=node/236) to unique chemical identifiers associated with the US EPA CompTox Chemistry Dashboard. The curation workflow takes advantage of years of experience, as well as contact with the original data providers, to enable open access to valuable, curated datasets to support environmental scientists and the broader research community (e.g. https://comptox.epa.gov/dashboard/chemical_lists). Note: This abstract does not reflect US EPA policy.
Internal NIH Seminar to the BISTI Team on some early thoughts from the Associate Director for Data Science (ADDS). These ideas are for discussion only and in no way reflect what might happen subsequently. Presented April 1, 2014 (the date is purely a coincidence).
Mobile devices are now mainstream handheld computers providing access to computational power and storage that a decade ago was available only on desktop computers. In terms of chemistry informatics the majority of capabilities that were previously found only on desktop computers is fast migrating to mobile devices making use of the combination of powerful visualization capabilities, fast cloud-based calculations, websites optimized for the mobile platforms, and delivering “apps”. This presentation will provide an overview of how access to chemistry continues to be made increasingly mobile and specifically on how the Royal Society of Chemistry is contributing to this computing environment.
Workshop finding and accessing data - fiona nadia charlotte - cambridge apr...Fiona Nielsen
Workshop presentation on finding and accessing human genomics data for research.
Including statistics of publicly available data sources and tips on how to save time in your workflow of data access.
Organised in collaboration between DNAdigest and Open Data Cambridge.
Read more about our work:
http://DNAdigest.org
http://repositive.io
https://uk.linkedin.com/in/fionanielsen
http://www.data.cam.ac.uk
“Open Research Data: Implications for Science and Society”, Warsaw, Poland, May 28–29, 2015, conference organized by the Open Science Platform — an initiative of the Interdisciplinary Centre for Mathematical and Computational Modelling at the University of Warsaw. pon.edu.pl @OpenSciPlatform #ORD2015
Why i left my job in genomics R&D - Lunteren - april 18 - 2016Fiona Nielsen
Career talk about how I moved from bioinformatics scientist to become an entrepreneur.
Presented at BioSB2016, pre-conference PhD retreat for young researchers in bioinformatics and systems biology at Congrescentrum De Werelt in Lunteren. #BioSB2016 #BioSB16
Link to event:
http://www.youngcb.nl/events/biosb-phd-retreat-2016/
Read more about my work:
http://DNAdigest.org
http://repositive.io
https://uk.linkedin.com/in/fionanielsen
Slides of the 2015 Bio Data World Congress show how our analyzegenomes.com services are combined to support precision medicine in the context of modern oncology treatment.
Reproducibility in Scientific Data Analysis - BioScience SeminarSamuel Lampa
Slides for a talk held at BioScience Seminar at Dept. of Pharmaceutical BioSciences at Uppsala University on December 16, 2016.
The event webpage: http://www.farmbio.uu.se/calendar/kalendarium-detaljsida/?eventId=22496
Structure of the talk:
Reproducibility in Scientific Data Analysis ...
● What is it?
● Why is it important?
● Why is it a problem?
● What can we do about it?
● What does pharmb.io do about it?
The increasing popularity of high mass accuracy non-target mass spectrometry methods has yielded extensive identification efforts based on spectral and chemical compound databases in the environmental community and beyond. Increasingly, new methods are relying on open data resources. Candidate structures are often retrieved with either exact mass or molecular formula from large resources such as PubChem, ChemSpider or the EPA CompTox Chemistry Dashboard. Smaller, selective lists of chemicals (also called “suspect lists”) can be used to perform more efficient annotation. Mass spectral libraries can then be used to increase the confidence in tentative identification. Additional metadata (e.g. exposure and hazard information, reference and data source information) can be extremely useful to prioritize substances of high environmental interest. Exchanging information and “sharing structural linkages” between these resources requires extensive curation to ensure that the correct information is shared correctly, yet many valuable datasets arise from scientists and regulators with little official cheminformatics training. This talk will cover curation efforts undertaken to map spectral libraries (e.g. MassBank.EU, mzCloud) and suspect lists from the NORMAN Suspect Exchange (http://www.norman-network.com/?q=node/236) to unique chemical identifiers associated with the US EPA CompTox Chemistry Dashboard. The curation workflow takes advantage of years of experience, as well as contact with the original data providers, to enable open access to valuable, curated datasets to support environmental scientists and the broader research community (e.g. https://comptox.epa.gov/dashboard/chemical_lists). Note: This abstract does not reflect US EPA policy.
Internal NIH Seminar to the BISTI Team on some early thoughts from the Associate Director for Data Science (ADDS). These ideas are for discussion only and in no way reflect what might happen subsequently. Presented April 1, 2014 (the date is purely a coincidence).
Mobile devices are now mainstream handheld computers providing access to computational power and storage that a decade ago was available only on desktop computers. In terms of chemistry informatics the majority of capabilities that were previously found only on desktop computers is fast migrating to mobile devices making use of the combination of powerful visualization capabilities, fast cloud-based calculations, websites optimized for the mobile platforms, and delivering “apps”. This presentation will provide an overview of how access to chemistry continues to be made increasingly mobile and specifically on how the Royal Society of Chemistry is contributing to this computing environment.
Workshop finding and accessing data - fiona nadia charlotte - cambridge apr...Fiona Nielsen
Workshop presentation on finding and accessing human genomics data for research.
Including statistics of publicly available data sources and tips on how to save time in your workflow of data access.
Organised in collaboration between DNAdigest and Open Data Cambridge.
Read more about our work:
http://DNAdigest.org
http://repositive.io
https://uk.linkedin.com/in/fionanielsen
http://www.data.cam.ac.uk
“Open Research Data: Implications for Science and Society”, Warsaw, Poland, May 28–29, 2015, conference organized by the Open Science Platform — an initiative of the Interdisciplinary Centre for Mathematical and Computational Modelling at the University of Warsaw. pon.edu.pl @OpenSciPlatform #ORD2015
Why i left my job in genomics R&D - Lunteren - april 18 - 2016Fiona Nielsen
Career talk about how I moved from bioinformatics scientist to become an entrepreneur.
Presented at BioSB2016, pre-conference PhD retreat for young researchers in bioinformatics and systems biology at Congrescentrum De Werelt in Lunteren. #BioSB2016 #BioSB16
Link to event:
http://www.youngcb.nl/events/biosb-phd-retreat-2016/
Read more about my work:
http://DNAdigest.org
http://repositive.io
https://uk.linkedin.com/in/fionanielsen
Slides of the 2015 Bio Data World Congress show how our analyzegenomes.com services are combined to support precision medicine in the context of modern oncology treatment.
Results Vary: The Pragmatics of Reproducibility and Research Object FrameworksCarole Goble
Keynote presentation at the iConference 2015, Newport Beach, Los Angeles, 26 March 2015.
Results Vary: The Pragmatics of Reproducibility and Research Object Frameworks
http://ischools.org/the-iconference/
BEWARE: presentation includes hidden slides AND in situ build animations - best viewed by downloading.
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
Lecture 1:
Being FAIR: FAIR data and model management
In recent years we have seen a change in expectations for the management of all the outcomes of research – that is the “assets” of data, models, codes, SOPs, workflows. The “FAIR” (Findable, Accessible, Interoperable, Reusable) Guiding Principles for scientific data management and stewardship [1] have proved to be an effective rallying-cry. Funding agencies expect data (and increasingly software) management retention and access plans. Journals are raising their expectations of the availability of data and codes for pre- and post- publication. The multi-component, multi-disciplinary nature of Systems and Synthetic Biology demands the interlinking and exchange of assets and the systematic recording of metadata for their interpretation.
Our FAIRDOM project (http://www.fair-dom.org) supports Systems Biology research projects with their research data, methods and model management, with an emphasis on standards smuggled in by stealth and sensitivity to asset sharing and credit anxiety. The FAIRDOM Platform has been installed by over 30 labs or projects. Our public, centrally hosted Asset Commons, the FAIRDOMHub.org, supports the outcomes of 50+ projects.
Now established as a grassroots association, FAIRDOM has over 8 years of experience of practical asset sharing and data infrastructure at the researcher coal-face ranging across European programmes (SysMO and ERASysAPP ERANets), national initiatives (Germany's de.NBI and Systems Medicine of the Liver; Norway's Digital Life) and European Research Infrastructures (ISBE) as well as in PI's labs and Centres such as the SynBioChem Centre at Manchester.
In this talk I will show explore how FAIRDOM has been designed to support Systems Biology projects and show examples of its configuration and use. I will also explore the technical and social challenges we face.
I will also refer to European efforts to support public archives for the life sciences. ELIXIR (http:// http://www.elixir-europe.org/) the European Research Infrastructure of 21 national nodes and a hub funded by national agreements to coordinate and sustain key data repositories and archives for the Life Science community, improve access to them and related tools, support training and create a platform for dataset interoperability. As the Head of the ELIXIR-UK Node and co-lead of the ELIXIR Interoperability Platform I will show how this work relates to your projects.
[1] Wilkinson et al, The FAIR Guiding Principles for scientific data management and stewardship Scientific Data 3, doi:10.1038/sdata.2016.18
LIS 532G: Final Project Presentation
Case Study: What is it? How do you write it? How do you use it?
Preparation for Final Project to design and teach a case study.
Open Data in a Big Data World: easy to say, but hard to do?LEARN Project
Presentation at 3rd LEARN workshop on Research Data Management, “Make research data management policies work”
Helsinki, 28 June 2016, by Sarah Callaghan, STFC Rutherford Appleton Laboratory
In order to be reused, research data must be discoverable.
The EPSRC Research Data Expectations* requires research organisations to maintain a data catalogue to record metadata about research data generated by EPSRC-funded research projects.
Universities are increasingly making research data assets available through repositories or other data portals.
The requirement for a UK research data discovery service has grown as universities become more involved in RDM and capacity develops.
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...GigaScience, BGI Hong Kong
Scot Edmunds talk at CODATA2019 on Quantifying how FAIR is Hong Kong: The Hong Kong Shareability of Hong Kong University Research Experiment. 19th September 2019 in Beijing
Keynote speech - Carole Goble - Jisc Digital Festival 2015Jisc
Carole Goble is a professor in the school of computer science at the University of Manchester.
In this keynote, Carole offered her insights into research data management and data centres.
Overview of the UKRDDS pilot project at Univwersity of Edinburgh employing PhD interns to validate metadata about research data created by University of Edinburgh researchers and held in local RDM services solutions. This was presented at IASSIST in June 2016, Bergen, Norway.
Similar to USTLG Talk: The future of laboratory data: Libraries, Librarians and Digital Laboratory Notebooks (20)
Responsible, Reproducible Research & FAIR DATAJeremy Frey
Introductory talk at the Royal Society of Chemistry Faraday Division Software Tools for Physical Chemists Meeting.
http://www.rsc.org/events/detail/35843/faraday-division-chemistry-software-tools-meeting
Digital IUPAC: The need for global representation of chemistry and chemical i...Jeremy Frey
Frey, Jeremy G. (2016) Digital IUPAC: The need for global representation of chemistry and chemical information in the digital age At 251st American Chemical Society National Meeting and Exposition - Computers in Chemistry, United States. 13 - 17 Mar 2016
Is open science an inevitable outcome of e-science?Jeremy Frey
Frey, Jeremy G. (2016) Is open science an inevitable outcome of e-science? At 251st American Chemical Society National Meeting & Exposition - Computers in Chemistry, United States. 13 - 17 Mar 2016.
Developing an open source ELN to support open & interdisciplinary scienceJeremy Frey
Frey, Jeremy (2017) Developing an open source ELN to support open & interdisciplinary science At Electronic Lab Notebooks: Solutions for Paperless Research, United Kingdom. 13 Jan 2017.
Frey, Jeremy G. (2017) Reducing uncertainty: The raison d'Être for Open Science At Open Science and the Chemistry Lab of the Future, Rüdesheim, Germany. 22 - 24 May 2017.
ESR spectroscopy in liquid food and beverages.pptxPRIYANKA PATEL
With increasing population, people need to rely on packaged food stuffs. Packaging of food materials requires the preservation of food. There are various methods for the treatment of food to preserve them and irradiation treatment of food is one of them. It is the most common and the most harmless method for the food preservation as it does not alter the necessary micronutrients of food materials. Although irradiated food doesn’t cause any harm to the human health but still the quality assessment of food is required to provide consumers with necessary information about the food. ESR spectroscopy is the most sophisticated way to investigate the quality of the food and the free radicals induced during the processing of the food. ESR spin trapping technique is useful for the detection of highly unstable radicals in the food. The antioxidant capability of liquid food and beverages in mainly performed by spin trapping technique.
Seminar of U.V. Spectroscopy by SAMIR PANDASAMIR PANDA
Spectroscopy is a branch of science dealing the study of interaction of electromagnetic radiation with matter.
Ultraviolet-visible spectroscopy refers to absorption spectroscopy or reflect spectroscopy in the UV-VIS spectral region.
Ultraviolet-visible spectroscopy is an analytical method that can measure the amount of light received by the analyte.
Nutraceutical market, scope and growth: Herbal drug technologyLokesh Patil
As consumer awareness of health and wellness rises, the nutraceutical market—which includes goods like functional meals, drinks, and dietary supplements that provide health advantages beyond basic nutrition—is growing significantly. As healthcare expenses rise, the population ages, and people want natural and preventative health solutions more and more, this industry is increasing quickly. Further driving market expansion are product formulation innovations and the use of cutting-edge technology for customized nutrition. With its worldwide reach, the nutraceutical industry is expected to keep growing and provide significant chances for research and investment in a number of categories, including vitamins, minerals, probiotics, and herbal supplements.
Professional air quality monitoring systems provide immediate, on-site data for analysis, compliance, and decision-making.
Monitor common gases, weather parameters, particulates.
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills MN
Travis Hills of Minnesota developed a method to convert waste into high-value dry fertilizer, significantly enriching soil quality. By providing farmers with a valuable resource derived from waste, Travis Hills helps enhance farm profitability while promoting environmental stewardship. Travis Hills' sustainable practices lead to cost savings and increased revenue for farmers by improving resource efficiency and reducing waste.
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...University of Maribor
Slides from:
11th International Conference on Electrical, Electronics and Computer Engineering (IcETRAN), Niš, 3-6 June 2024
Track: Artificial Intelligence
https://www.etran.rs/2024/en/home-english/
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...University of Maribor
Slides from talk:
Aleš Zamuda: Remote Sensing and Computational, Evolutionary, Supercomputing, and Intelligent Systems.
11th International Conference on Electrical, Electronics and Computer Engineering (IcETRAN), Niš, 3-6 June 2024
Inter-Society Networking Panel GRSS/MTT-S/CIS Panel Session: Promoting Connection and Cooperation
https://www.etran.rs/2024/en/home-english/
This presentation explores a brief idea about the structural and functional attributes of nucleotides, the structure and function of genetic materials along with the impact of UV rays and pH upon them.
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
USTLG Talk: The future of laboratory data: Libraries, Librarians and Digital Laboratory Notebooks
1. The future of experimental
data: Libraries and
Laboratory Notebooks
USTLG Meeting
Promoting the role of libraries, library services
and librarians
Jeremy Frey
16 May 2014 1
3. Outline
• Introduction
• Open Access vs Intelligent Access to Data
• The Trove Software
• Digital Notebooks - Advantages
• Data Citation
• The Future and labs and data
16/05/2014 The future of laboratory data 3
6. Review Article
Laboratory notebooks in the digital era: the role of
ELNs in record keeping for chemistry and other
sciences
Colin L. Bird, Cerys Willoughby and Jeremy G. Frey
Chem. Soc. Rev., 2013,42, 8157-8175
DOI: 10.1039/C3CS60122F
This article is licensed under a Creative Commons
Attribution 3.0
16/05/2014 The future of laboratory data 6
7. Data Curation in the Chemical Sciences
16/05/2014 The future of laboratory data 7
http://dx.doi.org/10.3789/isqv25no3.2013.02
8. Subversive
and furtive
sharing &
exploitation of
data in virtual
space
RDF
OAI
Taxi
E-
user
Labs
Digital Repository
Data
16/05/2014
The future of laboratory data
Some projects need large amounts of data from the literature
8
12. THE NARRATIVE
What is the story? What is the why?
16/05/2014 The future of laboratory data 12
13. Faraday’s laboratory notebooks
are also remarkable in the amount
of detail that they give about the
design and setting up of
experiments, interspersed with
comments about their outcome
and thoughts of a more
philosophical kind. All are couched
in plain language, with many vivid
phrases of delightful spontaneity….
Peter Day, ‘The Philosopher’s Tree: A Selection of Michael
Faraday’s Writings’
16/05/2014 The future of laboratory data 13
14. The integrity of science as a discipline rests on the ability of
scientists to reproduce the claims of others.
While none of the organic chemistry journals go to the same
lengths as Organic Syntheses, where each procedure must be
reproduced as described in an independent laboratory before
publication, ……..
sufficient detail so that the procedures can be reproduced and
provide sufficient data to establish the structures …….
This information is necessary for the review process …… to base
their experiments on published work.Methods are as important as the data
16/05/2014 The future of laboratory data 14
15. 16/05/2014 The future of laboratory data
If only I knew exactly how
she did this experiments
I know all this supplementary
information could be useful but will
people really remember the format? Is
it worth all the hassle?I wish I could get the
numbers from this
graph - the pdf is not
much use.
I wish I had
recorded things at
the start the way I
do now…..
Typical Laboratory
15
16. Archiving of data
• Experiments are often repeated
– Data stored locally on a
computer and can’t be found
– Handwriting can’t be read
– Laboratory notebooks lost or
damaged
– Correct data not recorded
first time round
http://www.ecs.soton.ac.uk/regenesis/pictures
16/05/2014 The future of laboratory data 16
20. LabTrove and User defined metadata
16/05/2014 The future of laboratory data 20
21. A big proportion of
our users are not
really adding
metadata!
Librarians can help
here!
16/05/2014 The future of laboratory data 21
22. Figure 7. Visualization of posts as a network of resources.
Milsted AJ, Hale JR, Frey JG, Neylon C (2013) LabTrove: A Lightweight, Web Based, Laboratory “Blog” as a Route towards a
Marked Up Record of Work in a Bioscience Research Laboratory. PLoS ONE 8(7): e67460. doi:10.1371/journal.pone.0067460
http://www.plosone.org/article/info:doi/10.1371/journal.pone.0067460
16/05/2014 The future of laboratory data 22
25. doi: 10.1098/rsta.2005.1630 Phil. Trans.
R. Soc. A 15 August 2005 vol. 363 no.
1833 2075-2095
16/05/2014 The future of laboratory data 25
26. Kenji Takeda, Graeme Earl, Jeremy Frey, Simon
Keay, and Alex Wade
Enhancing research publications using Rich
Interactive Narratives Phil. Trans. R. Soc. A. 2013
371 20120090; doi:10.1098/rsta.2012.0090
16/05/2014 The future of laboratory data
Use of Deep Zoom technique
Data covers a huge range of scale and type
New ideas for presenting to avoid overload
26
29. Chemical Biology
Paper in PLoS
• Export of the entire laboratory
notebook (LabTrove) and deposit with
FigShare.
• At 50 Mb it exceeded the upload limit
of the journal!
16/05/2014 The future of laboratory data 29
32. Open Notebook
Science
• Certainly not always the way to work!
– IPR, Commercial, long term projects, recognition
issues, etc
• But
– Makes connection much easier if the data and
processes are “Open”
– Easy to export & access of “Linked-Data”
16/05/2014 The future of laboratory data
It is not necessary to change.
Survival is not mandatory.
W. Edwards Deming
32
33. 16/05/2014 The future of laboratory data
LabTrove Open Notebooks
Mat Todd’s Malaria Project
33
35. Unavailable Information
• Not just lots of data but why are many of the
structures unpublished so certainly
unavailable?
• The E-Crystals and E-Bank
Project looked at how to
address this issue
• Is making data available
the same as depositing
a copy with someone else?
16/05/2014 The future of laboratory data 35
36. 16/05/2014 The future of laboratory data
Graph/Network provides
intuitive navigation
36
37. DISSEMINATION IS PART
OF THE RESEARCH
Bringing dissemination in to the lab… use and
re-use of data… the digital research notebook
16/05/2014 The future of laboratory data 37
38. Change in the whole way we design and build
16/05/2014 The future of laboratory data
3D Printers: A radical change to the experimental section
of a paper!
38
39. All I am saying is that now is the time to
develop the technology to deflect an
asteroid
We must speed up the knowledge discovery process
16/05/2014 The future of laboratory data 39
40. IT as a Utility
funded under Research Council UK's Digital Economy theme
The Digital Economy vision is of the
transformational impact of digital
technologies on all aspects of life.
16/05/2014 The future of laboratory data 40
41. Join the ITaaU Network+
• Web: www.itutility.ac.uk
• Mailing list: http://jiscmail.ac.uk/itutility
• Hashtag #itaaun
• Twitter: SteveITaaU
• Facebook page: http://www.facebook.com/ITutilityNetwork
• Email: info@itutility.ac.uk
• LinkedIn group: IT-as-Utility-Network
16/05/2014 The future of laboratory data 41
ITaaU Network meeting 19 & 20th June
Southampton Hilton
42. Trust me Mort - no electronic communications
superhighway, no matter how vast and sophisticated,
will ever replace the art of the schmooze
Thank you for listening
The future of laboratory data16/05/2014
42
Editor's Notes
a researcher-centric web- and cloud-based ELN Blogging style – and so is flexible, lets the researchers record whatever they want to
Comments on computational models – in this case GODIVA is a way to show ocean models over the web (University of Reading) and with LabTrove added people can comment on geo-coded regions of the models results and have the video in the post – metadata taken from the models and put in the Trove.
Users can define their own metadata – two types, section and key/value – these aid with navigation in the notebook as shown on the right hand side and also with search in a LabTrove instance.
1/3 use only 1 section½ use a ‘catch-all’ section70% use less than 5 sections50% use no keysLess than 20% use more than 5 keys
Wells H.G(1937) World brain: the idea of a permanent world encyclopaedia. EncyclopédieFrançaise, Tome 18, 24-11-12..