Logs, Blogs and Pods Smart Electronic Laboratory Notebooks e-Research Open Meeting 2009 Reading Jeremy G. Frey School of Chemistry  University of Southampton
Talk Laboratory Notebooks  Laboratory BlogBooks Instruments and Blogjects Semantic Blogs
“ The internet wasn't created for mockery! It was created so scientists from different universities could share datasets....”  Simpson, H.   The Simpsons  (2005), Eds. Groening, M., Brooks, J.L. & Simon, S.,  Series 16, Episode 8, Original air date (US) 06-Feb-2005. http://www.tvtome.com/tvtome/servlet/GuidePageServlet/showid-146/epid-346864/
The Comb e Chem Project ‘ End to End’ linking Data (life-)cycle Do things ‘right’ at the start Make sure the metadata is of high quality Record properly at source in Digital Form Extensive provenance [email_address] The Chemistry Lab People & Machines working together
The Data Explosion Exponential growth The future overwhelms the past, but the past must not be lost PDB deposited  structures CCDC deposited structures
Chemists and programming Many Chemists think that they can program! You still use FORTRAN!! What about that! His brain uses old vacuum tubes
Typical laboratory conversations If only I knew exactly how she did this experiments I know all this supplementary information could be useful but will people really remember the format? Is it worth all the hassle? I wish I could get the numbers from this graph - the pdf is not much use. I wish I had recorded things at the start the way I do now…..
Some problems are due to the lack of information recorded at the time, others are due to loss of information over time. Supervisors and Managers I am sure we collected that  information a few years ago… The details should be  in her lab book…..   Can you read what it  says here….? Can you find the files  of data that were used  to make the plot?
Faraday’s laboratory notebooks are also remarkable in the amount of detail that they give about the design and setting up of experiments, interspersed with comments about their outcome and thoughts of a more philosophical kind. All are couched in plain language, with many vivid phrases of delightful spontaneity…. Peter Day, ‘The Philosopher’s Tree: A Selection of Michael Faraday’s Writings’
Mixture of text, images, plans, data
Electronic Laboratory Notebooks Permanent,  documented and primary record of  laboratory  observations
If you are caught using the  “ scrap of paper” technique,  your improperly recorded data  may be confiscated by your TA Observations   are   never collected on note pads, filter paper or other  temporary  paper for  later transfer into a  notebook
Attach Date Cut and Paste
Cross reference Ontology and Folksonomy  Folksonomy (also known as collaborative tagging, social classification, social indexing, and social tagging)  Provenance, Probity and Priority
Discussions
BUT – the data needs to be recorded somewhere! The data only lives if connected to the laboratory notebook to provide the context.  This link while essential is often fragile.
Electronic Laboratory Notebooks
He is charged with expressing contempt for meta-data meta Key is to make recording of the metadata easy and as automated as possible
COSHH Leverage off things we already have to do “ We have a cunning plan”
 
Smart Tea Project - User Centred Design, Design by Analogy to ensure the correct information is captured simply and easily.
The Two Interfaces Planning and Review Implementation
Procedure
Into the Lab!
Into the Lab!
 
Plans Plans in advance are useful This is the way things are supposed to be done The Plan provides a digital context so increases the value of planning Key to our ‘Smart Lab’ approach…. But is it the best way?
Laboratory “Blogs” Laboratory notebook is a Blog Encourage and facilitate collaboration Flexible Need a data repositories behind the  B log R4L E-Bank
Implementation of e-lab book Blog based format Purpose built engine Fully flexible system with arbitrary metadata Full record of changes (not currently easily accessible) http://chemtools.chem.soton.ac.uk/projects/blog /   “Bio Blogs” http://blogs.openwetware.org/scienceintheopen   Discussion
Implementation of e-lab book One post, one item approach Procedures can be tracked back to starting materials (or forwards to products) by clicking through Aim to ultimately be interpretable by machine and human
Templates
LIVECOP LINK <METADATA> <TITLE>album09 - jrh4880_19_competent_transformation_from_ligation</TITLE> <SIZE_X>1300</SIZE_X> <SIZE_Y>1026</SIZE_Y> <THUMB_SRC> http://imgstore.chem.soton.ac.uk/albums/album09/jrh4880_19_competent_transformation_from_ligation.thumb.jpg </THUMB_SRC> <PREVIEW_SRC> http://imgstore.chem.soton.ac.uk/albums/album09/jrh4880_19_competent_transformation_from_ligation.sized.jpg </PREVIEW_SRC> <PICTURE_URL> http://imgstore.chem.soton.ac.uk/album09/jrh4880_19_competent_transformation_from_ligation </PICTURE_URL> </METADATA>
Link to objects
Issues The Physical World Safety documentation Patent/IP – sign-off Trust Will computers survive in the laboratory? Remember we do have a physical world to keep in sync
 
  19 June 2008 10:32   / Journal publication Test Data RESULTS! Conference reports Grant Applications Analysis Management Time Line View
An rdf graph of posts and links between them rendered using Welkin (simile.mit.edu/welkin) Sortase Experiment Map of the X-Ray Blog (comments not shown)
 
Environment Automatically record as much of the laboratory environment as possible Blog for the day
Pub-Sub systems provide the flexible & extensible approach to distribution of real time laboratory monitoring & archiving Smart Laboratory Spaces
“ I just realized, Howard, that everything in this apartment is more sophisticated than we are”
Blog-jects Equipment become first class members of the web Interacts well with Pub-Sub as items are attached to topics, topics relate the Bog items With automation this evolves to a two-way communication Everything has a network connection – research equipment will catch up with the fridge & other commodity goods
Blog-jects Equipment become first class members of the web Interacts well with Pub-Sub as items are attached to topics, topics relate the Bog items With automation this evolves to a two-way communication Live Copy essential
Lab environment data and experimental output linked
Comments and Annotation A picture worth a thousand words!  Chemists like to sketch!
 
Can we have both the web 2.0 Blog style and the Semantics of the ELN? YES!
 
Simple Experiment Ontology
 
 
 
 
Impact on researchers Higher Quality Record Easier Collaboration Improved planning Improved discussions Efficiency gain in production of presentations/reports Change the nature of Professor/Student interactions
Influence on Meetings and Discussions Enable geographically / temporally separated discussions Meeting preparation much less of an imposition Posted material is discussed, comparison with older materials  is easy Change from ‘can I look at your data’ to ‘have you seen my blog post’
 
Growing need for the global (virtual) equivalent of the “Tea Room”
Separating Data from Interpretations: A crystallography example   Underlying data Intellect & Interpretation
Access to  ALL  underlying data eBank & eCrystals
Information Providers Information Consumers These are the same people – if we can ‘talk’ to ourselves efficiently over time then that is a good start to be able to ‘talk’ to others
 
 
Thanks RC UK, EPSRC, JISC for funding Colleagues and Students from the Schools of Chemistry, Electronics & Computer Science, Mathematics IBM, Microsoft www.combechem.org www.ecrystals.soton.ac.uk chemtools.chem.soton.ac.uk
Excerpted from  the Onion : The Recording Industry Association of America announced Tuesday that it will be taking legal action against anyone discovered telling friends, acquaintances, or associates about new songs, artists, or albums.  Data Sharing  &quot;We are merely exercising our right to defend our intellectual properties from unauthorized peer-to-peer notification of the existence of copyrighted material.&quot;  A daring daylight raid of copyright material ACS
Validation Increasing the value of data  How to bring all the necessary information together to enable appropriate validation Increasingly difficult & expensive to achieve Need provenance and context otherwise just a collection of items

Blogs Logs Pods: Smart Labs

  • 1.
    Logs, Blogs andPods Smart Electronic Laboratory Notebooks e-Research Open Meeting 2009 Reading Jeremy G. Frey School of Chemistry University of Southampton
  • 2.
    Talk Laboratory Notebooks Laboratory BlogBooks Instruments and Blogjects Semantic Blogs
  • 3.
    “ The internetwasn't created for mockery! It was created so scientists from different universities could share datasets....” Simpson, H. The Simpsons (2005), Eds. Groening, M., Brooks, J.L. & Simon, S., Series 16, Episode 8, Original air date (US) 06-Feb-2005. http://www.tvtome.com/tvtome/servlet/GuidePageServlet/showid-146/epid-346864/
  • 4.
    The Comb eChem Project ‘ End to End’ linking Data (life-)cycle Do things ‘right’ at the start Make sure the metadata is of high quality Record properly at source in Digital Form Extensive provenance [email_address] The Chemistry Lab People & Machines working together
  • 5.
    The Data ExplosionExponential growth The future overwhelms the past, but the past must not be lost PDB deposited structures CCDC deposited structures
  • 6.
    Chemists and programmingMany Chemists think that they can program! You still use FORTRAN!! What about that! His brain uses old vacuum tubes
  • 7.
    Typical laboratory conversationsIf only I knew exactly how she did this experiments I know all this supplementary information could be useful but will people really remember the format? Is it worth all the hassle? I wish I could get the numbers from this graph - the pdf is not much use. I wish I had recorded things at the start the way I do now…..
  • 8.
    Some problems aredue to the lack of information recorded at the time, others are due to loss of information over time. Supervisors and Managers I am sure we collected that information a few years ago… The details should be in her lab book….. Can you read what it says here….? Can you find the files of data that were used to make the plot?
  • 9.
    Faraday’s laboratory notebooksare also remarkable in the amount of detail that they give about the design and setting up of experiments, interspersed with comments about their outcome and thoughts of a more philosophical kind. All are couched in plain language, with many vivid phrases of delightful spontaneity…. Peter Day, ‘The Philosopher’s Tree: A Selection of Michael Faraday’s Writings’
  • 10.
    Mixture of text,images, plans, data
  • 11.
    Electronic Laboratory NotebooksPermanent, documented and primary record of laboratory observations
  • 12.
    If you arecaught using the “ scrap of paper” technique, your improperly recorded data may be confiscated by your TA Observations are never collected on note pads, filter paper or other temporary paper for later transfer into a notebook
  • 13.
    Attach Date Cutand Paste
  • 14.
    Cross reference Ontologyand Folksonomy Folksonomy (also known as collaborative tagging, social classification, social indexing, and social tagging) Provenance, Probity and Priority
  • 15.
  • 16.
    BUT – thedata needs to be recorded somewhere! The data only lives if connected to the laboratory notebook to provide the context. This link while essential is often fragile.
  • 17.
  • 18.
    He is chargedwith expressing contempt for meta-data meta Key is to make recording of the metadata easy and as automated as possible
  • 19.
    COSHH Leverage offthings we already have to do “ We have a cunning plan”
  • 20.
  • 21.
    Smart Tea Project- User Centred Design, Design by Analogy to ensure the correct information is captured simply and easily.
  • 22.
    The Two InterfacesPlanning and Review Implementation
  • 23.
  • 24.
  • 25.
  • 26.
  • 27.
    Plans Plans inadvance are useful This is the way things are supposed to be done The Plan provides a digital context so increases the value of planning Key to our ‘Smart Lab’ approach…. But is it the best way?
  • 28.
    Laboratory “Blogs” Laboratorynotebook is a Blog Encourage and facilitate collaboration Flexible Need a data repositories behind the B log R4L E-Bank
  • 29.
    Implementation of e-labbook Blog based format Purpose built engine Fully flexible system with arbitrary metadata Full record of changes (not currently easily accessible) http://chemtools.chem.soton.ac.uk/projects/blog / “Bio Blogs” http://blogs.openwetware.org/scienceintheopen Discussion
  • 30.
    Implementation of e-labbook One post, one item approach Procedures can be tracked back to starting materials (or forwards to products) by clicking through Aim to ultimately be interpretable by machine and human
  • 31.
  • 32.
    LIVECOP LINK <METADATA><TITLE>album09 - jrh4880_19_competent_transformation_from_ligation</TITLE> <SIZE_X>1300</SIZE_X> <SIZE_Y>1026</SIZE_Y> <THUMB_SRC> http://imgstore.chem.soton.ac.uk/albums/album09/jrh4880_19_competent_transformation_from_ligation.thumb.jpg </THUMB_SRC> <PREVIEW_SRC> http://imgstore.chem.soton.ac.uk/albums/album09/jrh4880_19_competent_transformation_from_ligation.sized.jpg </PREVIEW_SRC> <PICTURE_URL> http://imgstore.chem.soton.ac.uk/album09/jrh4880_19_competent_transformation_from_ligation </PICTURE_URL> </METADATA>
  • 33.
  • 34.
    Issues The PhysicalWorld Safety documentation Patent/IP – sign-off Trust Will computers survive in the laboratory? Remember we do have a physical world to keep in sync
  • 35.
  • 36.
      19 June2008 10:32   / Journal publication Test Data RESULTS! Conference reports Grant Applications Analysis Management Time Line View
  • 37.
    An rdf graphof posts and links between them rendered using Welkin (simile.mit.edu/welkin) Sortase Experiment Map of the X-Ray Blog (comments not shown)
  • 38.
  • 39.
    Environment Automatically recordas much of the laboratory environment as possible Blog for the day
  • 40.
    Pub-Sub systems providethe flexible & extensible approach to distribution of real time laboratory monitoring & archiving Smart Laboratory Spaces
  • 41.
    “ I justrealized, Howard, that everything in this apartment is more sophisticated than we are”
  • 42.
    Blog-jects Equipment becomefirst class members of the web Interacts well with Pub-Sub as items are attached to topics, topics relate the Bog items With automation this evolves to a two-way communication Everything has a network connection – research equipment will catch up with the fridge & other commodity goods
  • 43.
    Blog-jects Equipment becomefirst class members of the web Interacts well with Pub-Sub as items are attached to topics, topics relate the Bog items With automation this evolves to a two-way communication Live Copy essential
  • 44.
    Lab environment dataand experimental output linked
  • 45.
    Comments and AnnotationA picture worth a thousand words! Chemists like to sketch!
  • 46.
  • 47.
    Can we haveboth the web 2.0 Blog style and the Semantics of the ELN? YES!
  • 48.
  • 49.
  • 50.
  • 51.
  • 52.
  • 53.
  • 54.
    Impact on researchersHigher Quality Record Easier Collaboration Improved planning Improved discussions Efficiency gain in production of presentations/reports Change the nature of Professor/Student interactions
  • 55.
    Influence on Meetingsand Discussions Enable geographically / temporally separated discussions Meeting preparation much less of an imposition Posted material is discussed, comparison with older materials is easy Change from ‘can I look at your data’ to ‘have you seen my blog post’
  • 56.
  • 57.
    Growing need forthe global (virtual) equivalent of the “Tea Room”
  • 58.
    Separating Data fromInterpretations: A crystallography example Underlying data Intellect & Interpretation
  • 59.
    Access to ALL underlying data eBank & eCrystals
  • 60.
    Information Providers InformationConsumers These are the same people – if we can ‘talk’ to ourselves efficiently over time then that is a good start to be able to ‘talk’ to others
  • 61.
  • 62.
  • 63.
    Thanks RC UK,EPSRC, JISC for funding Colleagues and Students from the Schools of Chemistry, Electronics & Computer Science, Mathematics IBM, Microsoft www.combechem.org www.ecrystals.soton.ac.uk chemtools.chem.soton.ac.uk
  • 64.
    Excerpted from the Onion : The Recording Industry Association of America announced Tuesday that it will be taking legal action against anyone discovered telling friends, acquaintances, or associates about new songs, artists, or albums. Data Sharing &quot;We are merely exercising our right to defend our intellectual properties from unauthorized peer-to-peer notification of the existence of copyrighted material.&quot; A daring daylight raid of copyright material ACS
  • 65.
    Validation Increasing thevalue of data How to bring all the necessary information together to enable appropriate validation Increasingly difficult & expensive to achieve Need provenance and context otherwise just a collection of items