How Internet Resources Are Providing a Collaborative Community for Chemistry
 

Online chemistry resources have expanded dramatically in the past few years, with PubChem, ChEBI, Wikipedia, ChemSpider and many others offering rich content to scientists seeking data and information. ChemSpider has become one of the primary chemistry portals, delivering a heterogeneous mix of Open and Closed data. It offers a structure-centric community for collaboration, enabling the crowd-sourced deposition and validation of online chemistry data. ChemSpider has also been integrated into ChemMantis (CHEMistry Markup And Nomenclature Transformation Integrated System), a platform that extracts science-related terms from documents using both heuristics and highly curated dictionaries. The resulting documents are marked up so that chemical structures can be viewed and linked out to over 200 different data sources via the ChemSpider database.


    Presentation Transcript

    • How Internet Resources Are Providing a Collaborative Community for Chemistry: 60 slides in 20 minutes
    • Imagine a time when …
      • The internet is searchable by chemical structure and substructure (e.g. Wikipedia, Google Scholar)
      • Chemistry articles are indexed and searchable by a free online service
      • The web is linked together through the “language of chemistry”
    • It’s Coming… Linked Data Cloud
    • Thanks to the Organizers…
    • Antony Williams vs Identifiers
      • Passport ID
      • Dad, Tony, others
      • SSN
      • Green Card
      • License
      • 5 email addresses
      • ChemSpiderman (blog, Twitter account, Facebook, Friendfeed)
      • OpenID
      • …
    • Aspirin vs Chemical Identifiers
    • Aspirin names and synonyms
      • Text searches depend on correct association
      • 335 suggested identifiers for Aspirin just on PubChem!
      • Disambiguation dictionaries are necessary
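The need for disambiguation dictionaries can be illustrated with a minimal sketch. It assumes a small hand-built synonym table; the table contents and the `normalize` and `resolve_name` helpers are illustrative and are not part of PubChem or ChemSpider. Every synonym maps to one structure identifier (here the standard InChIKey for aspirin), so a text search resolves to a structure rather than to a string.

```python
# Hypothetical synonym dictionary: many names, one structure identifier.
ASPIRIN_KEY = "BSYNRYMUTXBXSQ-UHFFFAOYSA-N"  # standard InChIKey for aspirin

SYNONYMS = {
    "aspirin": ASPIRIN_KEY,
    "acetylsalicylic acid": ASPIRIN_KEY,
    "2-acetoxybenzoic acid": ASPIRIN_KEY,
}


def normalize(name: str) -> str:
    """Lower-case and collapse whitespace so trivial variants compare equal."""
    return " ".join(name.lower().split())


def resolve_name(name: str):
    """Return the structure identifier for a name, or None if it is unknown."""
    return SYNONYMS.get(normalize(name))


if __name__ == "__main__":
    for query in ["Aspirin", "  Acetylsalicylic   ACID ", "salicylic acid"]:
        print(repr(query), "->", resolve_name(query))
```

A name missing from the table returns `None` instead of a guess, which is the safer failure mode when a wrong association would silently link the wrong compound.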
    • The Final Search Strategy
    • All Those Names, One Structure
    • Searching Chemistry on the Internet
      • How complete a result set will we get if we search for “chemicals” by name?
      • Is there a better way to link chemistry databases? Linking by “names” is dangerous
      • Chemists want structure and SUBstructure searching
    • The InChI Identifier
    • Multiple Layers
    • InChI Strings Hash to InChIKeys
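How a variable-length InChI string becomes a fixed-length key can be sketched as follows. This is a simplified illustration only and not the official InChIKey algorithm: it assumes SHA-256 from Python's `hashlib` and a naive letter encoding, and the names `letters` and `toy_key` are made up for this example. It shows the general idea that the first block depends only on the formula, connectivity and hydrogen layers, while the second block covers the remaining layers such as stereochemistry.

```python
import hashlib
import string

# Example InChI strings for the two enantiomers of alanine.
L_ALA = "InChI=1S/C3H7NO2/c1-2(4)3(5)6/h2H,4H2,1H3,(H,5,6)/t2-/m0/s1"
D_ALA = "InChI=1S/C3H7NO2/c1-2(4)3(5)6/h2H,4H2,1H3,(H,5,6)/t2-/m1/s1"


def letters(data: str, length: int) -> str:
    """Map a SHA-256 digest of `data` onto `length` upper-case letters."""
    digest = hashlib.sha256(data.encode("utf-8")).digest()
    return "".join(string.ascii_uppercase[b % 26] for b in digest[:length])


def toy_key(inchi: str) -> str:
    """Build a two-block, fixed-length key from an InChI string (toy scheme)."""
    _prefix, _, body = inchi.partition("/")  # prefix is e.g. "InChI=1S"
    layers = body.split("/")
    formula, rest = layers[0], layers[1:]
    # Assumption: formula, connections (c) and hydrogens (h) form the skeleton.
    major = [formula] + [layer for layer in rest if layer[:1] in ("c", "h")]
    minor = [layer for layer in rest if layer[:1] not in ("c", "h")]
    return letters("/".join(major), 14) + "-" + letters("/".join(minor), 8)


if __name__ == "__main__":
    print(toy_key(L_ALA))
    print(toy_key(D_ALA))
```

Because a hash is one-way, a key of this kind cannot be turned back into a structure by computation alone; going from key to structure needs a lookup table of known key and InChI pairs.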
    • Oleoylethanolamine
    • Search Engine Dependencies
    • Search Engine Dependencies
    • InChIs have traction…
    • RDF Linking of Structures
    • PubChem
    • The Simplest Organic Molecule
    • Vancomycin
    • Vancomycin
      • Who will curate?
      • How would you clean such a large dataset?
    • Vancomycin on ChemSpider
    • Vancomycin
    • Vancomycin
      • Search Molecular SKELETON
      • Search Full Molecule
    • Full Skeleton Search: 104 Hits
    • Full Molecule Search: 4 Hits
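The gap between 104 skeleton hits and 4 full-molecule hits can be illustrated with a short sketch. It assumes that a skeleton search compares only the first 14-character block of an InChIKey while a full-molecule search compares the whole key. The index contents and record names are invented placeholders for the example and are not ChemSpider data.

```python
# Toy index of record name -> InChIKey. All keys are invented placeholders.
INDEX = {
    "record-1": "AAAAAAAAAAAAAA-BBBBBBBBSA-N",
    "record-2": "AAAAAAAAAAAAAA-CCCCCCCCSA-N",  # same skeleton, different stereo
    "record-3": "AAAAAAAAAAAAAA-BBBBBBBBSA-N",  # duplicate of record-1
    "record-4": "ZZZZZZZZZZZZZZ-UHFFFAOYSA-N",  # unrelated skeleton
}


def skeleton_search(key: str, index: dict) -> list:
    """Return records whose first InChIKey block matches that of `key`."""
    block = key.split("-")[0]
    return sorted(n for n, k in index.items() if k.split("-")[0] == block)


def full_search(key: str, index: dict) -> list:
    """Return records whose complete InChIKey equals `key`."""
    return sorted(n for n, k in index.items() if k == key)


if __name__ == "__main__":
    query = "AAAAAAAAAAAAAA-BBBBBBBBSA-N"
    print("skeleton:", skeleton_search(query, INDEX))
    print("full:    ", full_search(query, INDEX))
```

The skeleton search returns every record sharing the connectivity, including stereoisomers and records with missing or wrong stereochemistry, which is why it is a useful starting point when curating a large dataset.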
    • The InChI “Resolver”
    • Content is King and Quality Costs
      • Curated Chemistry “content” is expensive to create
        • Patent searching
        • Structures and properties
        • Drug databases
        • Literature databases
      • Chemical Abstracts Service (CAS), the “Gold Standard” in chemistry-related information
        • 102 years of content
        • >50 million substances
        • Proprietary platform
    • The EXPERTS must get it right?!
    • Wikipedia, C&E News, PubChem
      • C&E News (from ACS)
    • Feedback from Steve Ritter
      • “Although CAS and C&EN are both part of the ACS Publications Division, we at C&EN still have to pay for our SciFinder access, strangely enough.”
      • “It would be nice to have an authoritative web-based source of standard, well-drawn structures for chemists to go to so they can freely cut and paste structures into their papers, PowerPoint presentations, and anything else they might need. Maybe Wikipedia will be that source one day.”
    • Maybe it will be ChemSpider?
      • What is ChemSpider?
        • A database of almost 23 million compounds, >200 data sources
        • A deposition and curation platform
        • A publishing platform for the community
        • Grows daily – more depositions, more links, more data sources
    • Search OEA
    • Search OEA
    • Search OEA
    • Search OEA
    • Linked Patents for OEA
    • Linked resources
      • Vendor sites – Aldrich, Alfa Aesar, TCI and 100s of others
      • Government databases – PubChem, DSSTox, FDA databases, ChemIDPlus,…
      • Biological Databases – Protein Database, Stitch, KEGG, ChEBI,…
      • Analytical databases – NMRShiftDB,…
    • Linked across the internet
    • Kyoto Encyclopedia of Genes and Genomes
    • Complex Data and Information
    • Remember – QUALITY ISSUES
    • The FDA’s DailyMed
    • Incorrect Structures
    • Crowd-sourcing Chemistry Curation
    • The Currency of Recognition
      • We need to build a platform for recognition ….
    • Chemistry – A Deposition Platform
      • CAS indexes published literature, patents and chemical vendors
      • CAS indexes ChemSpider – >303,000 records
      • “Lost Chemistry” – syntheses in theses, lab notebooks? Compounds in private collections?
      • ChemSpider accepts public depositions, linking to websites, hosting of details etc. Accepts structures, text, spectra, images.
    • Blogs should be searchable too…
    • Use Intelligent Structures: ChemSpider Embed Web Service
    • ChemSpider Web Services
    • Semantic Linking of Structures
      • What would you want to link off a structure?
        • Chemical suppliers
        • Other publications
        • Analytical Data
        • Related Reactions
        • Wikipedia
        • Patents
        • “Everything”
        • See Richard Kidd’s Talk
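One way to picture linking resources off a structure is as a set of subject, predicate and object statements keyed by the structure's InChIKey. The sketch below is an illustration under stated assumptions: every URI uses the reserved example.org domain, the predicate names and the `structure_uri`, `link_triples` and `to_ntriples` helpers are hypothetical, and the output is written as N-Triples by hand rather than with an RDF library.

```python
BASE = "http://example.org/"  # hypothetical namespace, not a real service


def structure_uri(inchikey: str) -> str:
    """Build a (hypothetical) URI that identifies a structure by InChIKey."""
    return f"{BASE}structure/{inchikey}"


def link_triples(inchikey: str, links: dict) -> list:
    """Turn {relation: target URI} into (subject, predicate, object) tuples."""
    subject = structure_uri(inchikey)
    return [
        (subject, f"{BASE}vocab/{relation}", target)
        for relation, target in sorted(links.items())
    ]


def to_ntriples(triples: list) -> str:
    """Serialize URI-only triples, one N-Triples statement per line."""
    return "\n".join(f"<{s}> <{p}> <{o}> ." for s, p, o in triples)


if __name__ == "__main__":
    links = {
        "supplier": "http://example.org/vendor/item-1",
        "patent": "http://example.org/patent/doc-1",
        "wikipedia": "http://example.org/wiki/Aspirin",
    }
    print(to_ntriples(link_triples("BSYNRYMUTXBXSQ-UHFFFAOYSA-N", links)))
```

Keying every statement on the same structure URI is what lets suppliers, patents and articles from unrelated sites be joined without relying on chemical names.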
    • Conclusions
      • Internet resources provide a collaborative community for chemistry
      • Crowdsourcing to expand, curate and integrate to the benefit of chemists
      • Searching the web for chemistry is arriving
      • InChIs are enabling chemistry on the internet
      • Question Quality!
    • Acknowledgments
      • Valery Tkachenko and Sergey Golotvin
      • RSC infrastructure team
      • The ChemSpider advisory group
      • The Wikipedia Chemistry team
    • [email_address] Twitter: ChemSpiderman www.chemspider.com/blog