Semantic pipelines to molecular properties
Upcoming SlideShare
Loading in...5
×
 

Semantic pipelines to molecular properties

on

  • 7,089 views

In our quest to replace answers in molecular sciences by recipes to get answers, the semantic web technologies play the important role of giving meaning to numbers and characters. The Resource ...

In our quest to replace answers in molecular sciences by recipes to get answers, the semantic web technologies play the important role of giving meaning to numbers and characters. The Resource Description Framework (RDF) complements (and not replaces) earlier work with eXtensible Markup Language (XML) applications by providing a more clear separation between syntax and meaning. This creates an environment where multiple serialization formats can be used, that grows and shrinks in complexity where needed, and, for example, that can be easily embedded in document formats like HTML. We here present recent work in the dissemination and prediction of molecular properties, where data is shared in RDF, read into statistical and life science software including Bioclipse and R, and where molecular properties are predicted.

Statistics

Views

Total Views
7,089
Views on SlideShare
1,191
Embed Views
5,898

Actions

Likes
1
Downloads
6
Comments
0

64 Embeds 5,898

http://mnemosyne.de-blog.jp 3500
http://planetrdf.com 807
http://chem-bla-ics.blogspot.com 481
http://pele.farmbio.uu.se 205
http://www.r-bloggers.com 199
http://lanyrd.com 99
http://chem-bla-ics.blogspot.co.uk 97
http://chem-bla-ics.blogspot.nl 66
http://chem-bla-ics.blogspot.de 54
http://feeds.feedburner.com 46
http://chem-bla-ics.blogspot.ca 30
http://chem-bla-ics.blogspot.se 23
http://www.mnemosyne.de-blog.jp 20
http://chem-bla-ics.blogspot.in 19
http://nrnb.org 17
http://chem-bla-ics.blogspot.com.es 17
http://chem-bla-ics.blogspot.fr 15
http://chem-bla-ics.blogspot.dk 14
http://chem-bla-ics.blogspot.it 13
http://chem-bla-ics.blogspot.jp 13
http://chem-bla-ics.blogspot.ch 11
http://www.newsblur.com 11
http://chem-bla-ics.blogspot.com.au 9
http://chem-bla-ics.blogspot.ie 8
http://chem-bla-ics.blogspot.mx 8
http://chem-bla-ics.blogspot.sg 8
http://newsblur.com 8
http://webcache.googleusercontent.com 7
http://chem-bla-ics.blogspot.fi 7
http://www.infominder.com 7
http://chem-bla-ics.blogspot.ru 6
http://chem-bla-ics.blogspot.com.br 6
http://chem-bla-ics.blogspot.no 5
http://chem-bla-ics.blogspot.kr 5
http://chem-bla-ics.blogspot.be 5
http://127.0.0.1 4
http://chem-bla-ics.blogspot.co.nz 4
http://chem-bla-ics.blogspot.co.at 3
http://chem-bla-ics.blogspot.hu 3
http://chem-bla-ics.blogspot.cz 3
http://chem-bla-ics.blogspot.pt 3
http://chem-bla-ics.blogspot.tw 2
http://www.tuicool.com 2
http://translate.googleusercontent.com 2
http://chem-bla-ics.blogspot.gr 2
http://wmail.shinsegae.com 2
http://ranksit.com 2
http://web-18.citromail.hu 2
https://plu.mx 2
http://chem-bla-ics.blogspot.hk 2
More...

Accessibility

Upload Details

Uploaded via as Adobe PDF

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

Semantic pipelines to molecular properties Semantic pipelines to molecular properties Presentation Transcript

  • semantic pipelines to Molecular Properties Egon Willighagen (@egonwillighagen) Dept. of Bioinformatics – BiGCaT, Maastricht University #ACSPhilly 2012Department of Bioinformatics - BiGCaT 1
  • Properties: Physical, chemical, biological• From experiment – with experimental error: interlab differences, method differences, …• No experiment? Predict – with prediction error, originating from experimental error, modeling error, incorrect interpretation, …• Where does the prediction error come from? Predicted BPs →Department of Bioinformatics - BiGCaT 2
  • Richer Property Prediction1. Select training data (x,y) ● experimental data with errors, detailed description of what the data is2. Find f(x) = y, minimizing error(y) ● use as much as possible information from 1.3. Validation ● not just validate (and quantify) the statistics against a test set, also compare with other data How? → Semantic pipelines E.L. Willighagen, et al. Molecular Chemometrics. Crit. Rev. Anal. Chem. 2006.Department of Bioinformatics - BiGCaT 3
  • Semantic pipelinesBe clear in what you mean ● Use (open) look-up lists, dictionaries, ontologies ● Be flexible and remove format limitation ● Link to other data → from other domains ● Calculation provenanceM. Samwald, et al. Linked open drug data for pharmaceuticalresearch and development. J. Cheminf. 2011. 3:19Department of Bioinformatics - BiGCaT 4
  • Semantic pipelines: what technology? CML: semantic, flexible, embeddable in HTML, RSS, … but only in XML RDF: ~10yrs before adopted! But JSON, XML, Turtle, … Also: linked data.Department of Bioinformatics - BiGCaT 5
  • RDF and friends ... • format independent (JSON, XML, ...) • db technology independent (RDB2RDF) • embeddable in HTML (e.g. RDFa) • Open Standard → widely supportedQuerying • SPARQL: like SQL → access • Federated SPARQL link multiple SPARQL endpoints and other RDF sources Department of Bioinformatics - BiGCaT 6
  • App #1: Bioclipse speaks RDF andOpenToxDepartment of Bioinformatics - BiGCaT 7
  • App #2: NanotoxicityDepartment of Bioinformatics - BiGCaT 8
  • Whats next?Department of Bioinformatics - BiGCaT 9
  • Collaborations / Thanx CDK community Christoph Steinbeck, Rajarshi Guha, many moreUppsala UniversityOla Spjuth et al. OpenTox community Nina Jeliazkova, Barry HardyKarolinska InstitutetRoland Grafström, Bengt CHEMINF communityFadeel, Hanna Karlsson Nico Adams, Michel Dumontier, Janna HastingsW3C Heath Care and LifeScience interest group CML community (you know :)Department of Bioinformatics - BiGCaT 10
  • Take home tweetImprove yourproperty predictiontraining andvalidation byadopting semanticpipelines! Further examples: blog: chem-bla-ics twitter: @egonwillighagen CiteULike: egonw Mendeley: egon-willighagen Blue Obelisk GScholar: u8SjMZ0AAAAJDepartment of Bioinformatics - BiGCaT 11