6. 1992
• CAS (and meters of books)
• Access through STN via IBM 3270
terminal emulation and cryptic
commands
• Beilstein Database (and meters of books)
• No open source software libraries for
cheminformatics
13. The CDK after 16 years
•16,521 commits made by 115 contributors
•564,171 lines of code
•mostly written in Java
•well established, mature codebase
•maintained by a large development team
•with stable Y-O-Y commits
•estimated 151 years of effort (COCOMO model)
•first commit in October, 2000
•most recent commit 1 day ago
The Chemistry Development Kit (CDK)
Open Source Cheminformatics in Java
28. There are known knowns; there are things we know
we know.
We also know there are known unknowns; that is to
say, we know there are some things we do not know.
But there are also unknown unknowns – the ones we
don’t know we don’t know.
—United States Secretary of Defense,
Donald Rumsfeld
38. Building upon extensive genomics research, we argue that the time is
now right to focus intensively on model organism metabolomes. We
propose a grand challenge for metabolomics studies of model
organisms: to identify and map all metabolites onto metabolic
pathways, to develop quantitative metabolic models for model
organisms, and to relate organism metabolic pathways within the
context of evolutionary metabolomics, i.e., phylometabolomics. These
efforts should focus on a series of established model organisms in
microbial, animal and plant research.
Metabolites. 2016 Feb 15;6(1)
41. •8.7 mio eukaryotic species on earth (+- 1.3mio)
•1.2 mio species identified and classified
42. •8.7 mio eukaryotic species on earth (+- 1.3mio)
•1.2 mio species identified and classified
•3000 - 4000 complete species genomes sequenced
43. •8.7 mio eukaryotic species on earth (+- 1.3mio)
•1.2 mio species identified and classified
•3000 - 4000 complete species genomes sequenced
44. •8.7 mio eukaryotic species on earth (+- 1.3mio)
•1.2 mio species identified and classified
•3000 - 4000 complete species genomes sequenced
What about completed metabolomes?
45. •8.7 mio eukaryotic species on earth (+- 1.3mio)
•1.2 mio species identified and classified
•3000 - 4000 complete species genomes sequenced
What about completed metabolomes?
47. Experimental Repository
Reference Layer
Chemistry Spectroscopy Biology
AnalysisTools
Primary Literature
Primary data and Meta-Data, Spectra, Protocols, Synopses, ...
MetaboLights Database at the EBI