SlideShare a Scribd company logo
1 of 35
Download to read offline
Putting together the pieces:
Building a reaction-centric
electronic lab notebook for
mobile devices
Alex M. Clark, Ph.D.
November 2013

© 2013 Molecular Materials Informatics, Inc.
http://molmatinf.com
MOLECULAR MATERIALS INFORMATICS

2

Introduction

mainframes
minicomputers

personal computers
portable laptops

mobile tablets
smartphones

• The mobile platform is revolutionary: a clean break
-

entirely new user interface / user experience
no backward compatibility
highly constrained resources
applicable to entirely new situations
MOLECULAR MATERIALS INFORMATICS

3

Disruption
• Entire software industry ➠ mobile, like it or not

incredibly
simple

no learning
curve

just
works

app user
expectations

delightful
to use

cheap

trivial
install
MOLECULAR MATERIALS INFORMATICS

Technical Challenges
• Tiny screens (palm-sized)
• Fingers instead of mouse
• Reduced computing power
• Limited storage
• Delegate tasks to cloud servers:
split between UI & API
• Multiple platforms

4
MOLECULAR MATERIALS INFORMATICS

Chemistry App Ecosystem
• Reference data
• Education
• Structure drawing
• Database searching
• 3D viewing
• Reactions & collections
• Property calculation
• Model building
• Graphical presentation
• Data sharing

5
MOLECULAR MATERIALS INFORMATICS

Reaction Lab Notebook?
• Electronic Lab Notebooks: huge business
• Some products targeting iPad or mobile web
• Reaction-centric lab book app:
-

should understand chemistry
easy access to reference information
incorporate green chemistry metrics
make it easy to share or centralise data
appeal to academia & industry

6
MOLECULAR MATERIALS INFORMATICS

7

Reaction Canvas
• Reactants and products: atoms & bonds (mostly)

o

• Component separation: layout dependent
• Reagents, solvents, conditions: essentially
meaningless
Hong et al, JACS,
135, 11704 (2013)
MOLECULAR MATERIALS INFORMATICS

8

Reaction Representation
• Markup as component structures:
reactants

reagents

products

• Properties can be associated with each
component, e.g. name, stoichiometry
• Draw each structure separately
MOLECULAR MATERIALS INFORMATICS

Sketcher
• Drawing reactants & products
on an iPhone is quick

• Reagents & solvents added
subsequently

9
MOLECULAR MATERIALS INFORMATICS

Organometallics
• Global & local style conventions anathema to
machine-readability...
bonding?
oxidation state?

carbonyl?

graphic
object

salt?

formula?

10
MOLECULAR MATERIALS INFORMATICS

Organometallic Anatomy
• Commonly used connection table formats are
unable to provide aesthetics or accuracy

11
MOLECULAR MATERIALS INFORMATICS

12

SketchEl Format
SketchEl!(8,7)
Ru=0.0143,-1.3143;0,0,i0
H=2.2104,-1.3143;0,0,i0
H=1.5672,0.2386;0,0,i0
P|Ph{3}=0.0143,-3.9763;0,0,i0,
➥aSketchEl!(20002C22)000A*
P|Ph{3}=0.0143,1.3477;0,0,i0,
➥aSketchEl!(20002C22)000A*
P|Ph{3}=-1.9857,-1.3143;0,0,i0,
➥aSketchEl!(20002C22)000A*
C=-1.3975,-2.7260;0,0,e0
O=-2.4581,-3.7867;0,0,i0
1-2=1,0
1-3=1,0
1-4=0,0
1-5=0,0
1-6=0,0
1-7=0,0
C55H47OP3Ru
7-8=2,0
917.9517 g/mol
!End

J. Chem. Inf. Model., 52, 3149 (2011)

• Used by open source
SketchEl & mobile apps
• Core principles:
-

extreme minimalism
extensibility (forward)
zero-order bonds
virtual hydrogen control
inline abbreviations

• Lowest common
denominator:
SketchEl

MDL Molfile
MOLECULAR MATERIALS INFORMATICS

Extensible DataSheet
• Need a well designed format
• Surprisingly few widespread
options, all inadequate:
- MDL SDfile/RDfile
- tab/comma-separated text
- Excel
• Tabular, typed columns
• Native molecules
• Extensible meta-layers

13
MOLECULAR MATERIALS INFORMATICS

14

DataSheet XML
<?xml version="1.0" encoding="UTF-8"?>
<DataSheet>
<Summary>
<Title>Reaction Components</Title>
<Description><![CDATA[Compounds involved in amide formation reaction]]></Description>
</Summary>
<Header nrows="6" ncols="4">
<Column name="Molecule" type="molecule" id="1">Molecular structure</Column>
<Column name="Role" type="string" id="2">Role in reaction</Column>
<Column name="MW" type="real" id="3">Molecular weight (g/mol)</Column>
<Column name="MF" type="string" id="4">Molecular formula</Column>
</Header>
<Content>
<Row id="1">
<Cell id="1"><![CDATA[SketchEl!(10,10)
C=-0.8571,-0.7714;0,0,i0
C=0.4419,-0.0214;0,0,i2
C=1.7409,-0.7714;0,0,i2
C=-2.1562,-0.0214;0,0,i1
C=-3.4552,-0.7714;0,0,i1
C=-3.4552,-2.2714;0,0,i1
C=-2.1562,-3.0214;0,0,i1
C=-0.8571,-2.2714;0,0,i1
C=3.0400,-0.0214;0,0,i0
N=4.3391,0.7286;0,0,i0
1-2=1,0
2-3=1,0
1-4=1,0
4-5=2,0
5-6=1,0
6-7=2,0
7-8=1,0
8-1=2,0
3-9=1,0
9-10=3,0
!End]]></Cell>
<Cell id="2"><![CDATA[reactant 1]]></Cell>
<Cell id="3">131.174</Cell>
<Cell id="4"><![CDATA[C9H9N]]></Cell>
</Row>
<Row id="2">
...

• Minimalistic baseline
• Data types:
- molecule
- string
- boolean

- integer
- real
- extend

• Header, column
definitions
• Content: row-major
• Streamable
MOLECULAR MATERIALS INFORMATICS

Reaction Aspect
<?xml version="1.0" encoding="UTF-8"?>
<DataSheet>
<Summary>
<Title>Amide Formation</Title>
<Description><![CDATA[]]></Description>
</Summary>
<Header nrows="1" ncols="15">
<Column name="ReactantMol1" type="molecule" id="1"/>
<Column name="ReactantName1" type="string" id="2"/>
<Column name="ReactantStoich1" type="string" id="3"/>
<Column name="ReactantMol2" type="molecule" id="4"/>
<Column name="ReactantName2" type="string" id="5"/>
<Column name="ReactantStoich2" type="string" id="6"/>
<Column name="ProductMol1" type="molecule" id="7"/>
<Column name="ProductName1" type="string" id="8"/>
<Column name="ProductStoich1" type="string" id="9"/>
<Column name="ReagentMol1" type="molecule" id="10"/>
<Column name="ReagentName1" type="string" id="11"/>
<Column name="ReagentMol2" type="molecule" id="12"/>
<Column name="ReagentName2" type="string" id="13"/>
<Column name="ReagentMol3" type="molecule" id="14"/>
<Column name="ReagentName3" type="string" id="15"/>
</Header>
<Extension>
<Ext type="org.mmi.aspect.Reaction" name="Reaction">
<![CDATA[nreactants=2
nproducts=1
nreagents=3
]]>
</Ext>
</Extension>
<Content>
<Row id="1">
<Cell id="1"><![CDATA[SketchEl!(10,10)
C=-0.8571,-0.7714;0,0,i0
C=0.4419,-0.0214;0,0,i2
C=1.7409,-0.7714;0,0,i2
C=-2.1562,-0.0214;0,0,i1
C=-3.4552,-0.7714;0,0,i1
C=-3.4552,-2.2714;0,0,i1
C=-2.1562,-3.0214;0,0,i1
...

• An aspect is an extension field
• Implies additional behaviour &
visualisation features
• Parser recognition optional:
- fallback to baseline specification
- datasheet still highly editable
- backward and forward
compatible
• org.mmi.aspect.Reaction:
- comparable to MDL RDfile
- includes reagent structures,
stoichiometry
- readily extendable

15
MOLECULAR MATERIALS INFORMATICS

Yield101

• Prototype lab notebook already exists on the market
• Designed for undergraduates, more general use

16
MOLECULAR MATERIALS INFORMATICS

Quantities
• Adds quantities to the Reaction
aspect: auto-interconverted
• Green chemistry:
- solvent reference
- process mass intensity
calculation
• Automatic lookup of structures,
cross reference to Mobile
Reagents
• Graphics creation
• Private & public sharing of data

17
MOLECULAR MATERIALS INFORMATICS

18

Graphics Creation
hardcopy:
print or
export
PDF
MOLECULAR MATERIALS INFORMATICS

19

Public Sharing
• Scheme
uploaded
• Public
• Persistent
• URL access
• Formats
• Graphics
MOLECULAR MATERIALS INFORMATICS

19

Public Sharing
• Scheme
uploaded
• Public
• Persistent
• URL access
• Formats
• Graphics
MOLECULAR MATERIALS INFORMATICS

Manuscripts

• Export MS Word/Excel documents: DrawingML
• Encapsulated PostScript (EPS)
• Scalable Vector Graphics (SVG)

20
MOLECULAR MATERIALS INFORMATICS

Reaction Lab Notebook
Core Technology

• structure drawing
• reaction drawing
• quantity calculation
• green metrics
• graphics
• public databases
• data sharing

Enhancements

• professional chemists
• calculation services
• reference data
• green chemistry
• usability
• data centralisation
• sustainable business

21
MOLECULAR MATERIALS INFORMATICS

22

The Green Lab Notebook (GLN)
Transforms
Tutorial

Experiment
Folders

Feedstocks
Solvents

Green
Chemistry
Principles

Scratch
Sheet

Templates
MOLECULAR MATERIALS INFORMATICS

Folders of Experiments
• Groups of experiments
• Stored locally on device:
network optional
• Synchronised to central
location
• Expand on the basic reaction
definition...

23
MOLECULAR MATERIALS INFORMATICS

24

Reaction Experiments
• Multistep reaction editor, more supporting fields

Multistep:
★ reactant
★ reagent
★ intermediate
★ product
★ waste

★ quantities
★ conditions
★ literature
★ freeform
★ metadata
MOLECULAR MATERIALS INFORMATICS

Green Metrics
Process Mass Intensity (PMI) =

E-factor =

Atom Economy =

mass of all reactants
mass of products

mass of waste
mass of products

Σ molecular weight reactants
Σ molecular weight products

• Databases: e.g. Toxics Release Inventory (TRI), Persistent
Bioaccumulative Toxic Chemicals (PBT)
• Curate as structure-searchable database, automatic recall

25
MOLECULAR MATERIALS INFORMATICS

Green Solvents

• Environmental data from ACS GCI & GSK

26
MOLECULAR MATERIALS INFORMATICS

Feedstocks
• Reference collection:
- info about supply chain
- encourage renewable use
• Link lab-available quantities to
experiment records
• Could synchronise with
inventory software
• Lookup in vendor catalogs...

27
MOLECULAR MATERIALS INFORMATICS

28

Vendor Lookup
• MetaSearch
engine:
- PubChem
- ChEBI
- ChemSpider
• Vendor links
available
• Can integrate
more services
MOLECULAR MATERIALS INFORMATICS

Templates
• Editable list
• Advanced
placement
algorithms
• Complicated ring
systems
• Inorganic ligands

Journal of Cheminformatics, 2:8 (2010)

29
MOLECULAR MATERIALS INFORMATICS

30

Reaction Transforms
4
1

2

3

6

4
5

1

2

6

3
5

• Provide a pre-curated list of "green"
reaction transforms
• Promote user entered experiments
into transforms (numbering, clipping)
• Associate with:
-

reagents, catalysts & solvents
stoichiometry & quantities
yield & experimental conditions
literature & green reference data
MOLECULAR MATERIALS INFORMATICS

Matching Transforms
Reactants Products

• Search by Reactants or
Products
• Defer to webservice
• User-defined transforms
& server collection

• Use sketch coordinates of query & transform
molecules to produce consistent orientations
• Allow filtering & sorting by yield, green metrics,
availability of ingredients, etc.

31
MOLECULAR MATERIALS INFORMATICS

32

Tutorials
• Learned the hard way: intolerance for steep
learning curves
• Scientific software is inherently complicated
• Two strategies:
- workthrough tutorials
- feature level-up: unlock

?

• Careful to associate features with value
MOLECULAR MATERIALS INFORMATICS

Consumer vs. Enterprise
• Business paradox:
- apps expected to be low cost
- not all markets are high volume
Consumer

• App is cheap
• Webservices are free
• Data access unsecured
• No integration

Enterprise

• App cost negligible
• Services licensed
• Secure in-house data
• Hybrid infrastructure

33
Acknowledgments
• GDCh organisers
• RSC & ChemSpider,
Eidogen-Sertanty,
InfoChem, CDD,
PubChem, ChEBI,
CollabChem
• Inquiries to
info@molmatinf.com

http://molmatinf.com
http://molsync.com
http://cheminf20.org
@aclarkxyz

More Related Content

Viewers also liked

Managing samples in an electronic lab notebook
Managing samples in an electronic lab notebookManaging samples in an electronic lab notebook
Managing samples in an electronic lab notebookAxiope Limited
 
ESD Basics By Transforming Technologies
ESD Basics By Transforming TechnologiesESD Basics By Transforming Technologies
ESD Basics By Transforming TechnologiesEric Puszczewicz
 
Esd control-in-electronic-assembly
Esd control-in-electronic-assemblyEsd control-in-electronic-assembly
Esd control-in-electronic-assemblyTransformingTech
 
What are the methods of soldering electronic components
What are the methods of soldering electronic componentsWhat are the methods of soldering electronic components
What are the methods of soldering electronic componentselprocus
 
Electro Static Discharge Basics
Electro Static Discharge BasicsElectro Static Discharge Basics
Electro Static Discharge BasicsTransformingTech
 
PCBA Assembly Process Flow / PCB Assembly Manufacturing
PCBA Assembly Process Flow / PCB Assembly ManufacturingPCBA Assembly Process Flow / PCB Assembly Manufacturing
PCBA Assembly Process Flow / PCB Assembly ManufacturingAgile Circuit Co., Ltd
 
4 Content Marketing Challenges in 2016
4 Content Marketing Challenges in 20164 Content Marketing Challenges in 2016
4 Content Marketing Challenges in 2016Katai Robert
 

Viewers also liked (11)

Managing samples in an electronic lab notebook
Managing samples in an electronic lab notebookManaging samples in an electronic lab notebook
Managing samples in an electronic lab notebook
 
ESD Basics By Transforming Technologies
ESD Basics By Transforming TechnologiesESD Basics By Transforming Technologies
ESD Basics By Transforming Technologies
 
Esd control-in-electronic-assembly
Esd control-in-electronic-assemblyEsd control-in-electronic-assembly
Esd control-in-electronic-assembly
 
Presentation1
Presentation1Presentation1
Presentation1
 
Esd basics
Esd basicsEsd basics
Esd basics
 
What are the methods of soldering electronic components
What are the methods of soldering electronic componentsWhat are the methods of soldering electronic components
What are the methods of soldering electronic components
 
Electro Static Discharge Basics
Electro Static Discharge BasicsElectro Static Discharge Basics
Electro Static Discharge Basics
 
SMT
SMTSMT
SMT
 
PCBA Assembly Process Flow / PCB Assembly Manufacturing
PCBA Assembly Process Flow / PCB Assembly ManufacturingPCBA Assembly Process Flow / PCB Assembly Manufacturing
PCBA Assembly Process Flow / PCB Assembly Manufacturing
 
4 Content Marketing Challenges in 2016
4 Content Marketing Challenges in 20164 Content Marketing Challenges in 2016
4 Content Marketing Challenges in 2016
 
Marketing Is Like Kissing by imNylon
Marketing Is Like Kissing by imNylonMarketing Is Like Kissing by imNylon
Marketing Is Like Kissing by imNylon
 

Similar to Reaction Lab Notebooks for Mobile Devices - Alex M. Clark - GDCh 2013

Alex Clark : NETTAB 2013
Alex Clark : NETTAB 2013Alex Clark : NETTAB 2013
Alex Clark : NETTAB 2013Alex Clark
 
Bojan Veljanovski - Modular Software Architecture and Design (Code Camp 2016)
Bojan Veljanovski - Modular Software Architecture and Design (Code Camp 2016)Bojan Veljanovski - Modular Software Architecture and Design (Code Camp 2016)
Bojan Veljanovski - Modular Software Architecture and Design (Code Camp 2016)Bojan Veljanovski
 
Novo Nordisk's journey in developing an open-source application on Neo4j
Novo Nordisk's journey in developing an open-source application on Neo4jNovo Nordisk's journey in developing an open-source application on Neo4j
Novo Nordisk's journey in developing an open-source application on Neo4jNeo4j
 
Multi phase mixture media
Multi phase mixture mediaMulti phase mixture media
Multi phase mixture mediaModelon
 
Application of domain engineering to generate customized information dashboards
Application of domain engineering to generate customized information dashboardsApplication of domain engineering to generate customized information dashboards
Application of domain engineering to generate customized information dashboardsFranciscoJosGarcaPea1
 
Practical cheminformatics workflows with mobile apps
Practical cheminformatics workflows with mobile appsPractical cheminformatics workflows with mobile apps
Practical cheminformatics workflows with mobile appsAlex Clark
 
A Collaborative Data Science Development Workflow
A Collaborative Data Science Development WorkflowA Collaborative Data Science Development Workflow
A Collaborative Data Science Development WorkflowDatabricks
 
MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database S...
MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database S...MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database S...
MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database S...Institute of Information Systems (HES-SO)
 
Exascale Computing Project - Driving a HUGE Change in a Changing World
Exascale Computing Project - Driving a HUGE Change in a Changing WorldExascale Computing Project - Driving a HUGE Change in a Changing World
Exascale Computing Project - Driving a HUGE Change in a Changing Worldinside-BigData.com
 
[Capella Days 2020] MBSE and the High-Tech Equipment Industry, how do they ma...
[Capella Days 2020] MBSE and the High-Tech Equipment Industry, how do they ma...[Capella Days 2020] MBSE and the High-Tech Equipment Industry, how do they ma...
[Capella Days 2020] MBSE and the High-Tech Equipment Industry, how do they ma...Obeo
 
Model-Driven Cloud Data Storage
Model-Driven Cloud Data StorageModel-Driven Cloud Data Storage
Model-Driven Cloud Data Storagejccastrejon
 
Chem4Word Wade
Chem4Word WadeChem4Word Wade
Chem4Word WadeAlex Wade
 
(ATS4-DEV02) Accelrys Query Service: Technology and Tools
(ATS4-DEV02) Accelrys Query Service: Technology and Tools(ATS4-DEV02) Accelrys Query Service: Technology and Tools
(ATS4-DEV02) Accelrys Query Service: Technology and ToolsBIOVIA
 
High Performance Computing and Big Data
High Performance Computing and Big Data High Performance Computing and Big Data
High Performance Computing and Big Data Geoffrey Fox
 
NoSQLDatabases
NoSQLDatabasesNoSQLDatabases
NoSQLDatabasesAdi Challa
 
The Open Chemistry Project
The Open Chemistry ProjectThe Open Chemistry Project
The Open Chemistry ProjectMarcus Hanwell
 

Similar to Reaction Lab Notebooks for Mobile Devices - Alex M. Clark - GDCh 2013 (20)

Alex Clark : NETTAB 2013
Alex Clark : NETTAB 2013Alex Clark : NETTAB 2013
Alex Clark : NETTAB 2013
 
Bojan Veljanovski - Modular Software Architecture and Design (Code Camp 2016)
Bojan Veljanovski - Modular Software Architecture and Design (Code Camp 2016)Bojan Veljanovski - Modular Software Architecture and Design (Code Camp 2016)
Bojan Veljanovski - Modular Software Architecture and Design (Code Camp 2016)
 
Novo Nordisk's journey in developing an open-source application on Neo4j
Novo Nordisk's journey in developing an open-source application on Neo4jNovo Nordisk's journey in developing an open-source application on Neo4j
Novo Nordisk's journey in developing an open-source application on Neo4j
 
DBMS - Introduction.ppt
DBMS - Introduction.pptDBMS - Introduction.ppt
DBMS - Introduction.ppt
 
Multi phase mixture media
Multi phase mixture mediaMulti phase mixture media
Multi phase mixture media
 
Cloud
CloudCloud
Cloud
 
Application of domain engineering to generate customized information dashboards
Application of domain engineering to generate customized information dashboardsApplication of domain engineering to generate customized information dashboards
Application of domain engineering to generate customized information dashboards
 
Practical cheminformatics workflows with mobile apps
Practical cheminformatics workflows with mobile appsPractical cheminformatics workflows with mobile apps
Practical cheminformatics workflows with mobile apps
 
Developing Digital Twins
Developing Digital TwinsDeveloping Digital Twins
Developing Digital Twins
 
A Collaborative Data Science Development Workflow
A Collaborative Data Science Development WorkflowA Collaborative Data Science Development Workflow
A Collaborative Data Science Development Workflow
 
MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database S...
MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database S...MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database S...
MUSYOP: Towards a Query Optimization for Heterogeneous Distributed Database S...
 
Exascale Computing Project - Driving a HUGE Change in a Changing World
Exascale Computing Project - Driving a HUGE Change in a Changing WorldExascale Computing Project - Driving a HUGE Change in a Changing World
Exascale Computing Project - Driving a HUGE Change in a Changing World
 
[Capella Days 2020] MBSE and the High-Tech Equipment Industry, how do they ma...
[Capella Days 2020] MBSE and the High-Tech Equipment Industry, how do they ma...[Capella Days 2020] MBSE and the High-Tech Equipment Industry, how do they ma...
[Capella Days 2020] MBSE and the High-Tech Equipment Industry, how do they ma...
 
Model-Driven Cloud Data Storage
Model-Driven Cloud Data StorageModel-Driven Cloud Data Storage
Model-Driven Cloud Data Storage
 
Chem4Word Wade
Chem4Word WadeChem4Word Wade
Chem4Word Wade
 
(ATS4-DEV02) Accelrys Query Service: Technology and Tools
(ATS4-DEV02) Accelrys Query Service: Technology and Tools(ATS4-DEV02) Accelrys Query Service: Technology and Tools
(ATS4-DEV02) Accelrys Query Service: Technology and Tools
 
High Performance Computing and Big Data
High Performance Computing and Big Data High Performance Computing and Big Data
High Performance Computing and Big Data
 
Design patterns
Design patternsDesign patterns
Design patterns
 
NoSQLDatabases
NoSQLDatabasesNoSQLDatabases
NoSQLDatabases
 
The Open Chemistry Project
The Open Chemistry ProjectThe Open Chemistry Project
The Open Chemistry Project
 

More from Alex Clark

Mixtures QSAR: modelling collections of chemicals
Mixtures QSAR: modelling collections of chemicalsMixtures QSAR: modelling collections of chemicals
Mixtures QSAR: modelling collections of chemicalsAlex Clark
 
Mixtures InChI: a story of how standards drive upstream products
Mixtures InChI: a story of how standards drive upstream productsMixtures InChI: a story of how standards drive upstream products
Mixtures InChI: a story of how standards drive upstream productsAlex Clark
 
Mixtures as first class citizens in the realm of informatics
Mixtures as first class citizens in the realm of informaticsMixtures as first class citizens in the realm of informatics
Mixtures as first class citizens in the realm of informaticsAlex Clark
 
Mixtures: informatics for formulations and consumer products
Mixtures: informatics for formulations and consumer productsMixtures: informatics for formulations and consumer products
Mixtures: informatics for formulations and consumer productsAlex Clark
 
Coordination InChI (2019)
Coordination InChI (2019)Coordination InChI (2019)
Coordination InChI (2019)Alex Clark
 
Chemical mixtures: File format, open source tools, example data, and mixtures...
Chemical mixtures: File format, open source tools, example data, and mixtures...Chemical mixtures: File format, open source tools, example data, and mixtures...
Chemical mixtures: File format, open source tools, example data, and mixtures...Alex Clark
 
Bringing bioassay protocols to the world of informatics, using semantic annot...
Bringing bioassay protocols to the world of informatics, using semantic annot...Bringing bioassay protocols to the world of informatics, using semantic annot...
Bringing bioassay protocols to the world of informatics, using semantic annot...Alex Clark
 
ACS CINF Luncheon talk (Boston 2018)
ACS CINF Luncheon talk (Boston 2018)ACS CINF Luncheon talk (Boston 2018)
ACS CINF Luncheon talk (Boston 2018)Alex Clark
 
Autonomous model building with a preponderance of well annotated assay protocols
Autonomous model building with a preponderance of well annotated assay protocolsAutonomous model building with a preponderance of well annotated assay protocols
Autonomous model building with a preponderance of well annotated assay protocolsAlex Clark
 
Representing molecules with minimalism: A solution to the entropy of informatics
Representing molecules with minimalism: A solution to the entropy of informaticsRepresenting molecules with minimalism: A solution to the entropy of informatics
Representing molecules with minimalism: A solution to the entropy of informaticsAlex Clark
 
CDD BioAssay Express: Expanding the target dimension: How to visualize a lot ...
CDD BioAssay Express: Expanding the target dimension: How to visualize a lot ...CDD BioAssay Express: Expanding the target dimension: How to visualize a lot ...
CDD BioAssay Express: Expanding the target dimension: How to visualize a lot ...Alex Clark
 
BioAssay Express
BioAssay ExpressBioAssay Express
BioAssay ExpressAlex Clark
 
SLAS2016: Why have one model when you could have thousands?
SLAS2016: Why have one model when you could have thousands?SLAS2016: Why have one model when you could have thousands?
SLAS2016: Why have one model when you could have thousands?Alex Clark
 
The anatomy of a chemical reaction: Dissection by machine learning algorithms
The anatomy of a chemical reaction: Dissection by machine learning algorithmsThe anatomy of a chemical reaction: Dissection by machine learning algorithms
The anatomy of a chemical reaction: Dissection by machine learning algorithmsAlex Clark
 
Compact models for compact devices: Visualisation of SAR using mobile apps
Compact models for compact devices: Visualisation of SAR using mobile appsCompact models for compact devices: Visualisation of SAR using mobile apps
Compact models for compact devices: Visualisation of SAR using mobile appsAlex Clark
 
Green chemistry in chemical reactions: informatics by design
Green chemistry in chemical reactions: informatics by designGreen chemistry in chemical reactions: informatics by design
Green chemistry in chemical reactions: informatics by designAlex Clark
 
ICCE 2014: The Green Lab Notebook
ICCE 2014: The Green Lab NotebookICCE 2014: The Green Lab Notebook
ICCE 2014: The Green Lab NotebookAlex Clark
 
Cloud hosted APIs for cheminformatics on mobile devices (ACS Dallas 2014)
Cloud hosted APIs for cheminformatics on mobile devices (ACS Dallas 2014)Cloud hosted APIs for cheminformatics on mobile devices (ACS Dallas 2014)
Cloud hosted APIs for cheminformatics on mobile devices (ACS Dallas 2014)Alex Clark
 
Building a mobile reaction lab notebook (ACS Dallas 2014)
Building a mobile reaction lab notebook (ACS Dallas 2014)Building a mobile reaction lab notebook (ACS Dallas 2014)
Building a mobile reaction lab notebook (ACS Dallas 2014)Alex Clark
 
Open Drug Discovery Teams @ Hacking Health Montreal
Open Drug Discovery Teams @ Hacking Health MontrealOpen Drug Discovery Teams @ Hacking Health Montreal
Open Drug Discovery Teams @ Hacking Health MontrealAlex Clark
 

More from Alex Clark (20)

Mixtures QSAR: modelling collections of chemicals
Mixtures QSAR: modelling collections of chemicalsMixtures QSAR: modelling collections of chemicals
Mixtures QSAR: modelling collections of chemicals
 
Mixtures InChI: a story of how standards drive upstream products
Mixtures InChI: a story of how standards drive upstream productsMixtures InChI: a story of how standards drive upstream products
Mixtures InChI: a story of how standards drive upstream products
 
Mixtures as first class citizens in the realm of informatics
Mixtures as first class citizens in the realm of informaticsMixtures as first class citizens in the realm of informatics
Mixtures as first class citizens in the realm of informatics
 
Mixtures: informatics for formulations and consumer products
Mixtures: informatics for formulations and consumer productsMixtures: informatics for formulations and consumer products
Mixtures: informatics for formulations and consumer products
 
Coordination InChI (2019)
Coordination InChI (2019)Coordination InChI (2019)
Coordination InChI (2019)
 
Chemical mixtures: File format, open source tools, example data, and mixtures...
Chemical mixtures: File format, open source tools, example data, and mixtures...Chemical mixtures: File format, open source tools, example data, and mixtures...
Chemical mixtures: File format, open source tools, example data, and mixtures...
 
Bringing bioassay protocols to the world of informatics, using semantic annot...
Bringing bioassay protocols to the world of informatics, using semantic annot...Bringing bioassay protocols to the world of informatics, using semantic annot...
Bringing bioassay protocols to the world of informatics, using semantic annot...
 
ACS CINF Luncheon talk (Boston 2018)
ACS CINF Luncheon talk (Boston 2018)ACS CINF Luncheon talk (Boston 2018)
ACS CINF Luncheon talk (Boston 2018)
 
Autonomous model building with a preponderance of well annotated assay protocols
Autonomous model building with a preponderance of well annotated assay protocolsAutonomous model building with a preponderance of well annotated assay protocols
Autonomous model building with a preponderance of well annotated assay protocols
 
Representing molecules with minimalism: A solution to the entropy of informatics
Representing molecules with minimalism: A solution to the entropy of informaticsRepresenting molecules with minimalism: A solution to the entropy of informatics
Representing molecules with minimalism: A solution to the entropy of informatics
 
CDD BioAssay Express: Expanding the target dimension: How to visualize a lot ...
CDD BioAssay Express: Expanding the target dimension: How to visualize a lot ...CDD BioAssay Express: Expanding the target dimension: How to visualize a lot ...
CDD BioAssay Express: Expanding the target dimension: How to visualize a lot ...
 
BioAssay Express
BioAssay ExpressBioAssay Express
BioAssay Express
 
SLAS2016: Why have one model when you could have thousands?
SLAS2016: Why have one model when you could have thousands?SLAS2016: Why have one model when you could have thousands?
SLAS2016: Why have one model when you could have thousands?
 
The anatomy of a chemical reaction: Dissection by machine learning algorithms
The anatomy of a chemical reaction: Dissection by machine learning algorithmsThe anatomy of a chemical reaction: Dissection by machine learning algorithms
The anatomy of a chemical reaction: Dissection by machine learning algorithms
 
Compact models for compact devices: Visualisation of SAR using mobile apps
Compact models for compact devices: Visualisation of SAR using mobile appsCompact models for compact devices: Visualisation of SAR using mobile apps
Compact models for compact devices: Visualisation of SAR using mobile apps
 
Green chemistry in chemical reactions: informatics by design
Green chemistry in chemical reactions: informatics by designGreen chemistry in chemical reactions: informatics by design
Green chemistry in chemical reactions: informatics by design
 
ICCE 2014: The Green Lab Notebook
ICCE 2014: The Green Lab NotebookICCE 2014: The Green Lab Notebook
ICCE 2014: The Green Lab Notebook
 
Cloud hosted APIs for cheminformatics on mobile devices (ACS Dallas 2014)
Cloud hosted APIs for cheminformatics on mobile devices (ACS Dallas 2014)Cloud hosted APIs for cheminformatics on mobile devices (ACS Dallas 2014)
Cloud hosted APIs for cheminformatics on mobile devices (ACS Dallas 2014)
 
Building a mobile reaction lab notebook (ACS Dallas 2014)
Building a mobile reaction lab notebook (ACS Dallas 2014)Building a mobile reaction lab notebook (ACS Dallas 2014)
Building a mobile reaction lab notebook (ACS Dallas 2014)
 
Open Drug Discovery Teams @ Hacking Health Montreal
Open Drug Discovery Teams @ Hacking Health MontrealOpen Drug Discovery Teams @ Hacking Health Montreal
Open Drug Discovery Teams @ Hacking Health Montreal
 

Recently uploaded

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityVictorSzoltysek
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Simplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxSimplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxMarkSteadman7
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...WSO2
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Modernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaModernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaWSO2
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightSafe Software
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)Samir Dash
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAnitaRaj43
 

Recently uploaded (20)

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps Productivity
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Simplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxSimplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptx
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Modernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaModernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using Ballerina
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and Insight
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 

Reaction Lab Notebooks for Mobile Devices - Alex M. Clark - GDCh 2013

  • 1. Putting together the pieces: Building a reaction-centric electronic lab notebook for mobile devices Alex M. Clark, Ph.D. November 2013 © 2013 Molecular Materials Informatics, Inc. http://molmatinf.com
  • 2. MOLECULAR MATERIALS INFORMATICS 2 Introduction mainframes minicomputers personal computers portable laptops mobile tablets smartphones • The mobile platform is revolutionary: a clean break - entirely new user interface / user experience no backward compatibility highly constrained resources applicable to entirely new situations
  • 3. MOLECULAR MATERIALS INFORMATICS 3 Disruption • Entire software industry ➠ mobile, like it or not incredibly simple no learning curve just works app user expectations delightful to use cheap trivial install
  • 4. MOLECULAR MATERIALS INFORMATICS Technical Challenges • Tiny screens (palm-sized) • Fingers instead of mouse • Reduced computing power • Limited storage • Delegate tasks to cloud servers: split between UI & API • Multiple platforms 4
  • 5. MOLECULAR MATERIALS INFORMATICS Chemistry App Ecosystem • Reference data • Education • Structure drawing • Database searching • 3D viewing • Reactions & collections • Property calculation • Model building • Graphical presentation • Data sharing 5
  • 6. MOLECULAR MATERIALS INFORMATICS Reaction Lab Notebook? • Electronic Lab Notebooks: huge business • Some products targeting iPad or mobile web • Reaction-centric lab book app: - should understand chemistry easy access to reference information incorporate green chemistry metrics make it easy to share or centralise data appeal to academia & industry 6
  • 7. MOLECULAR MATERIALS INFORMATICS 7 Reaction Canvas • Reactants and products: atoms & bonds (mostly) o • Component separation: layout dependent • Reagents, solvents, conditions: essentially meaningless Hong et al, JACS, 135, 11704 (2013)
  • 8. MOLECULAR MATERIALS INFORMATICS 8 Reaction Representation • Markup as component structures: reactants reagents products • Properties can be associated with each component, e.g. name, stoichiometry • Draw each structure separately
  • 9. MOLECULAR MATERIALS INFORMATICS Sketcher • Drawing reactants & products on an iPhone is quick • Reagents & solvents added subsequently 9
  • 10. MOLECULAR MATERIALS INFORMATICS Organometallics • Global & local style conventions anathema to machine-readability... bonding? oxidation state? carbonyl? graphic object salt? formula? 10
  • 11. MOLECULAR MATERIALS INFORMATICS Organometallic Anatomy • Commonly used connection table formats are unable to provide aesthetics or accuracy 11
  • 12. MOLECULAR MATERIALS INFORMATICS 12 SketchEl Format SketchEl!(8,7) Ru=0.0143,-1.3143;0,0,i0 H=2.2104,-1.3143;0,0,i0 H=1.5672,0.2386;0,0,i0 P|Ph{3}=0.0143,-3.9763;0,0,i0, ➥aSketchEl!(20002C22)000A* P|Ph{3}=0.0143,1.3477;0,0,i0, ➥aSketchEl!(20002C22)000A* P|Ph{3}=-1.9857,-1.3143;0,0,i0, ➥aSketchEl!(20002C22)000A* C=-1.3975,-2.7260;0,0,e0 O=-2.4581,-3.7867;0,0,i0 1-2=1,0 1-3=1,0 1-4=0,0 1-5=0,0 1-6=0,0 1-7=0,0 C55H47OP3Ru 7-8=2,0 917.9517 g/mol !End J. Chem. Inf. Model., 52, 3149 (2011) • Used by open source SketchEl & mobile apps • Core principles: - extreme minimalism extensibility (forward) zero-order bonds virtual hydrogen control inline abbreviations • Lowest common denominator: SketchEl MDL Molfile
  • 13. MOLECULAR MATERIALS INFORMATICS Extensible DataSheet • Need a well designed format • Surprisingly few widespread options, all inadequate: - MDL SDfile/RDfile - tab/comma-separated text - Excel • Tabular, typed columns • Native molecules • Extensible meta-layers 13
  • 14. MOLECULAR MATERIALS INFORMATICS 14 DataSheet XML <?xml version="1.0" encoding="UTF-8"?> <DataSheet> <Summary> <Title>Reaction Components</Title> <Description><![CDATA[Compounds involved in amide formation reaction]]></Description> </Summary> <Header nrows="6" ncols="4"> <Column name="Molecule" type="molecule" id="1">Molecular structure</Column> <Column name="Role" type="string" id="2">Role in reaction</Column> <Column name="MW" type="real" id="3">Molecular weight (g/mol)</Column> <Column name="MF" type="string" id="4">Molecular formula</Column> </Header> <Content> <Row id="1"> <Cell id="1"><![CDATA[SketchEl!(10,10) C=-0.8571,-0.7714;0,0,i0 C=0.4419,-0.0214;0,0,i2 C=1.7409,-0.7714;0,0,i2 C=-2.1562,-0.0214;0,0,i1 C=-3.4552,-0.7714;0,0,i1 C=-3.4552,-2.2714;0,0,i1 C=-2.1562,-3.0214;0,0,i1 C=-0.8571,-2.2714;0,0,i1 C=3.0400,-0.0214;0,0,i0 N=4.3391,0.7286;0,0,i0 1-2=1,0 2-3=1,0 1-4=1,0 4-5=2,0 5-6=1,0 6-7=2,0 7-8=1,0 8-1=2,0 3-9=1,0 9-10=3,0 !End]]></Cell> <Cell id="2"><![CDATA[reactant 1]]></Cell> <Cell id="3">131.174</Cell> <Cell id="4"><![CDATA[C9H9N]]></Cell> </Row> <Row id="2"> ... • Minimalistic baseline • Data types: - molecule - string - boolean - integer - real - extend • Header, column definitions • Content: row-major • Streamable
  • 15. MOLECULAR MATERIALS INFORMATICS Reaction Aspect <?xml version="1.0" encoding="UTF-8"?> <DataSheet> <Summary> <Title>Amide Formation</Title> <Description><![CDATA[]]></Description> </Summary> <Header nrows="1" ncols="15"> <Column name="ReactantMol1" type="molecule" id="1"/> <Column name="ReactantName1" type="string" id="2"/> <Column name="ReactantStoich1" type="string" id="3"/> <Column name="ReactantMol2" type="molecule" id="4"/> <Column name="ReactantName2" type="string" id="5"/> <Column name="ReactantStoich2" type="string" id="6"/> <Column name="ProductMol1" type="molecule" id="7"/> <Column name="ProductName1" type="string" id="8"/> <Column name="ProductStoich1" type="string" id="9"/> <Column name="ReagentMol1" type="molecule" id="10"/> <Column name="ReagentName1" type="string" id="11"/> <Column name="ReagentMol2" type="molecule" id="12"/> <Column name="ReagentName2" type="string" id="13"/> <Column name="ReagentMol3" type="molecule" id="14"/> <Column name="ReagentName3" type="string" id="15"/> </Header> <Extension> <Ext type="org.mmi.aspect.Reaction" name="Reaction"> <![CDATA[nreactants=2 nproducts=1 nreagents=3 ]]> </Ext> </Extension> <Content> <Row id="1"> <Cell id="1"><![CDATA[SketchEl!(10,10) C=-0.8571,-0.7714;0,0,i0 C=0.4419,-0.0214;0,0,i2 C=1.7409,-0.7714;0,0,i2 C=-2.1562,-0.0214;0,0,i1 C=-3.4552,-0.7714;0,0,i1 C=-3.4552,-2.2714;0,0,i1 C=-2.1562,-3.0214;0,0,i1 ... • An aspect is an extension field • Implies additional behaviour & visualisation features • Parser recognition optional: - fallback to baseline specification - datasheet still highly editable - backward and forward compatible • org.mmi.aspect.Reaction: - comparable to MDL RDfile - includes reagent structures, stoichiometry - readily extendable 15
  • 16. MOLECULAR MATERIALS INFORMATICS Yield101 • Prototype lab notebook already exists on the market • Designed for undergraduates, more general use 16
  • 17. MOLECULAR MATERIALS INFORMATICS Quantities • Adds quantities to the Reaction aspect: auto-interconverted • Green chemistry: - solvent reference - process mass intensity calculation • Automatic lookup of structures, cross reference to Mobile Reagents • Graphics creation • Private & public sharing of data 17
  • 18. MOLECULAR MATERIALS INFORMATICS 18 Graphics Creation hardcopy: print or export PDF
  • 19. MOLECULAR MATERIALS INFORMATICS 19 Public Sharing • Scheme uploaded • Public • Persistent • URL access • Formats • Graphics
  • 20. MOLECULAR MATERIALS INFORMATICS 19 Public Sharing • Scheme uploaded • Public • Persistent • URL access • Formats • Graphics
  • 21. MOLECULAR MATERIALS INFORMATICS Manuscripts • Export MS Word/Excel documents: DrawingML • Encapsulated PostScript (EPS) • Scalable Vector Graphics (SVG) 20
  • 22. MOLECULAR MATERIALS INFORMATICS Reaction Lab Notebook Core Technology • structure drawing • reaction drawing • quantity calculation • green metrics • graphics • public databases • data sharing Enhancements • professional chemists • calculation services • reference data • green chemistry • usability • data centralisation • sustainable business 21
  • 23. MOLECULAR MATERIALS INFORMATICS 22 The Green Lab Notebook (GLN) Transforms Tutorial Experiment Folders Feedstocks Solvents Green Chemistry Principles Scratch Sheet Templates
  • 24. MOLECULAR MATERIALS INFORMATICS Folders of Experiments • Groups of experiments • Stored locally on device: network optional • Synchronised to central location • Expand on the basic reaction definition... 23
  • 25. MOLECULAR MATERIALS INFORMATICS 24 Reaction Experiments • Multistep reaction editor, more supporting fields Multistep: ★ reactant ★ reagent ★ intermediate ★ product ★ waste ★ quantities ★ conditions ★ literature ★ freeform ★ metadata
  • 26. MOLECULAR MATERIALS INFORMATICS Green Metrics Process Mass Intensity (PMI) = E-factor = Atom Economy = mass of all reactants mass of products mass of waste mass of products Σ molecular weight reactants Σ molecular weight products • Databases: e.g. Toxics Release Inventory (TRI), Persistent Bioaccumulative Toxic Chemicals (PBT) • Curate as structure-searchable database, automatic recall 25
  • 27. MOLECULAR MATERIALS INFORMATICS Green Solvents • Environmental data from ACS GCI & GSK 26
  • 28. MOLECULAR MATERIALS INFORMATICS Feedstocks • Reference collection: - info about supply chain - encourage renewable use • Link lab-available quantities to experiment records • Could synchronise with inventory software • Lookup in vendor catalogs... 27
  • 29. MOLECULAR MATERIALS INFORMATICS 28 Vendor Lookup • MetaSearch engine: - PubChem - ChEBI - ChemSpider • Vendor links available • Can integrate more services
  • 30. MOLECULAR MATERIALS INFORMATICS Templates • Editable list • Advanced placement algorithms • Complicated ring systems • Inorganic ligands Journal of Cheminformatics, 2:8 (2010) 29
  • 31. MOLECULAR MATERIALS INFORMATICS 30 Reaction Transforms 4 1 2 3 6 4 5 1 2 6 3 5 • Provide a pre-curated list of "green" reaction transforms • Promote user entered experiments into transforms (numbering, clipping) • Associate with: - reagents, catalysts & solvents stoichiometry & quantities yield & experimental conditions literature & green reference data
  • 32. MOLECULAR MATERIALS INFORMATICS Matching Transforms Reactants Products • Search by Reactants or Products • Defer to webservice • User-defined transforms & server collection • Use sketch coordinates of query & transform molecules to produce consistent orientations • Allow filtering & sorting by yield, green metrics, availability of ingredients, etc. 31
  • 33. MOLECULAR MATERIALS INFORMATICS 32 Tutorials • Learned the hard way: intolerance for steep learning curves • Scientific software is inherently complicated • Two strategies: - workthrough tutorials - feature level-up: unlock ? • Careful to associate features with value
  • 34. MOLECULAR MATERIALS INFORMATICS Consumer vs. Enterprise • Business paradox: - apps expected to be low cost - not all markets are high volume Consumer • App is cheap • Webservices are free • Data access unsecured • No integration Enterprise • App cost negligible • Services licensed • Secure in-house data • Hybrid infrastructure 33
  • 35. Acknowledgments • GDCh organisers • RSC & ChemSpider, Eidogen-Sertanty, InfoChem, CDD, PubChem, ChEBI, CollabChem • Inquiries to info@molmatinf.com http://molmatinf.com http://molsync.com http://cheminf20.org @aclarkxyz