Navigating the Complex Web of Chemistry Using ChemSpider

Loading...

Flash Player 9 (or above) is needed to view presentations.
We have detected that you do not have it on your computer. To install it, go here.

0 comments

Post a comment

    Post a comment
    Embed Video
    Edit your comment Cancel

    1 Favorite

    Navigating the Complex Web of Chemistry Using ChemSpider - Presentation Transcript

    1. Navigating the Complex Web of Chemistry Using ChemSpider
    2. Antony Williams vs Identifiers Old Passport ID Dad, Tony, others SSN Green Card License 5 email addresses ChemSpiderman (blog, Twitter account, Facebook, Friendfeed) OpenID … .
    3. Aspirin vs Chemical Identifiers
    4. Aspirin names and synonyms
      • Text searches depend on correct association
      • 335 suggested identifiers for Aspirin just on PubChem!
      • Disambiguation dictionaries are necessary
    5. Linked Data Cloud
      • … the premium database producers are using some automatic tools to prepare a ‘first draft’ of a database record, to be refined by eye .
      • Coupled with the public internet as a distribution method of choice, it is becoming possible for the first time to create and distribute new structure based databases at much lower costs, or even free of charge.
    6.  
    7.  
    8. The Final Search Strategy
    9. All Those Names, One Structure
    10. Content is King and Quality Costs
      • Chemistry “content” is big business. Not everyone can afford it.
        • Patent searching
        • Structures and properties
        • Drug databases
        • Literature databases
      • Chemical Abstracts Service (CAS), the “Gold Standard” in Chemistry related information
        • 101 years of content
        • $260 million revenue (2006)
        • >50 million substances
        • Proprietary platform
    11. Searching Chemistry on the Internet
      • How complete a result set will we get if we search for “chemicals” by name?
      • Is there a better way to link chemistry databases? Linking by “names” is dangerous
      • Chemists want structure and SUBstructure searching
    12. The InChI Identifier
    13. Multiple Layers
    14. InChIStrings Hash to InChIKeys
    15. Oleoylethanolamine
    16. InChIKey Searches Work
    17. Search Engine Dependencies
    18. Search Engine Dependencies
    19. InChIs have traction…
    20. RDF Linking of Structures
    21. PubChem
    22. The Simplest Organic Molecule
    23. Question Everything online: www.dhmo.org
    24. The Structure-Based Data Cloud
    25. Vancomycin
    26.  
    27. Vancomycin
      • Who will curate?
      • How would you clean such a large dataset?
    28. Vancomycin on ChemSpider
    29. Vancomycin
    30. Vancomycin Search Molecular SKELETON Search Full Molecule
    31. Full Skeleton Search: 104 Hits
    32. Full Molecule Search: 4 Hits
    33. What is ChemSpider?
      • ChemSpider is:
        • Building a Structure Centric Community for Chemists
        • 22.2 million compounds, >200 data sources
        • A deposition and curation platform
        • A publishing platform for the community
        • Grows daily – more depositions, more links, more data sources
    34. For Chemical Compounds
      • Vendor sites – Aldrich, Alfa Aesar, TCI and 100s of others
      • Government databases – PubChem, DSSTox, FDA databases, ChemIDPlus,…
      • Biological Databases – Protein Database, Stitch, KEGG, ChEBI,…
      • Analytical databases –NMRShiftDB,…
    35. How Was ChemSpider Built?
      • ChemSpider was a “hobby project”
      • Housed in a basement and running off three servers – one bought, two built
      • May 2009
      • 3 servers – 2 homebuilt
      • .NET architecture
      • SQL server
      • Homebuilt structure/substructure
      • Commercial components
      • Open Source Components
        • OpenBabel, Jmol, JSpecView, NCBI Toolkit, InChI Libraries
    36. Search Cholesterol
    37. Search Cholesterol
    38. Search Cholesterol
    39. Search Cholesterol
    40. Linked across the internet
    41. Kyoto Encyclopedia of Genes and Genomes
    42. Links to Patents based on structure
    43.  
    44. Answering Questions for Chemists
      • Questions a chemist might ask…
        • What is the melting point of n-butanol?
        • What is the chemical structure of Xanax?
        • Chemically, what is phenolphthalein?
        • What are the stereocenters of cholesterol?
        • Where can I find publications about xylene?
        • What are the different trade names for Ketoconazole?
        • What is the NMR spectrum of Aspirin?
        • What are the safety handling issues for Thymol Blue?
    45. Complex Data and Information
    46. Remember – QUALITY ISSUES
    47. The FDA’s DailyMed
    48. Incorrect Structures
    49. Does one stereocenter matter?
      • Distaval, Talimol, Nibrol, Sedimide, Quietoplex, Contergan, Neurosedyn, and Softenon
    50. Crowd-sourcing Chemistry Curation
    51. We Need Recognition and Rewards
    52. Master Curators, Curators, Depositors
    53. Collaborating with Wikipedia
      • Long term project to curate chemical compounds
      • Robotically linking ChemSpider to Wikipedia at present
      • Will layer on InChI Strings and InChIKeys shortly and make Wikipedia structure searchable
    54. Blogs need InChIs too!
    55. Blogs need InChIs too!
    56. Use Intelligent Structures : ChemSpider Embed Web Service
    57. ChemSpider Web Services
    58. Semantic Mark-up for Chemistry
      • Semantic mark-up for chemistry is here
        • RSC project prospect
        • Nature publishing group compound linking
        • ChemMantis
    59. Nature Chemistry Compound Pages
    60. Project Prospect
    61. ChemMantis
    62. Deposit Structures
    63. Species – linked to Wikipedia
    64. Semantic Linking of Structures
      • What would you want to link off a structure?
        • Chemical suppliers
        • Other publications
        • Analytical Data
        • Related Reactions
        • Wikipedia
        • Patents
        • “ Everything”
    65. The InChI “Resolver”
    66. InChI Resolver to DOIs Structure Search the Web
    67.  
    68. Conclusions
      • Internet resources provide a collaborative community for chemistry
      • Crowdsourcing to expand, curate and integrate to the benefit of chemists
      • Searching the web for chemistry is arriving
      • InChIs are enabling chemistry on the internet
      • Question Quality!
    69. [email_address] Twitter: ChemSpiderman www.chemspider.com/blog

    + Antony Williams, ChemSpidermanAntony Williams, ChemSpiderman, 1 month ago

    custom

    214 views, 1 favs, 1 embeds more stats

    There is an increasing availability of free and ope more

    More info about this document

    © All Rights Reserved

    Go to text version

    • Total Views 214
      • 211 on SlideShare
      • 3 from embeds
    • Comments 0
    • Favorites 1
    • Downloads 3
    Most viewed embeds
    • 3 views on http://www.chemspider.com

    more

    All embeds
    • 3 views on http://www.chemspider.com

    less

    Flagged as inappropriate Flag as inappropriate
    Flag as inappropriate

    Select your reason for flagging this presentation as inappropriate. If needed, use the feedback form to let us know more details.

    Cancel
    File a copyright complaint
    Having problems? Go to our helpdesk?

    Categories