Text Mining for Chemistry and Building a Public Platform for Document Markup

    Text Mining for Chemistry and Building a Public Platform for Document Markup - Presentation Transcript

    1. Text Mining for Chemistry and Building a Public Platform for Document Markup, by Antony Williams
    2. Searching and Reading Articles…
      • Online search tools for chemistry articles are generally text-based
      • Searching articles by chemical structure and substructure is very expensive, but this is changing
      • Text mining is a “hot area” of research… but what is public? What depends on public curation?
    3. Text-Based Search Tools
      • Google
      • PubMed
      • Google Scholar
      • Publishers' websites
      • And tens of other resources…
    4. Vancomycin Through PubChem
    5. Vancomycin Text Searches
      • PubMed
      • Google Scholar
    6. Online Structure Searching of Articles
      • Some capabilities from publishers are starting to show up
    7. Publishers should adopt/add InChIs: RSC and Nature Publishing Group have!
    8.  
    9. ChemMantis - Single Click Mark-up
    10. Name-Structure Pairs
    11. Converting Detected Names…
      • Names are searched against a validated dictionary (this expands as ChemSpider is curated)
      • If not found, they are passed through a Name to Structure (NTS) algorithm
      • If they cannot be converted, ChemSpider is searched for non-validated names
    12. RED Underline: Non-validated, Cannot Convert through NTS
      • “Names” can be added to Suppress List
    13. BLUE Underline: Name to Structure Converted
    14. Deposit Structures
      • Entity Extraction built around modified algorithms from SureChem
      • Optimized for “publications”
      • Dictionaries for chemical entities, groups, reactions, elements, families, species…
      • Dictionaries can be expanded – presently adding PDB
    15. Species…
    16. What do you do with a markup system?
      • Test it, show it off, and make it available…
      • Tested on chemistry articles so why not HOST articles?
      • …and create an online journal…
    17. The ChemSpider Journal
    18. Open Access Community Journal
    19. Deposit Article
      • Import URL or Document
      • Copy-Paste
      • Markup
    20. Copy-Paste Version: Martin Walker Monthly Article
    21. Chemical names
    22. Names, Elements, Groups, Families
    23. Outlinks
    24. Mark Up Open Access Article
    25. Online Journals and Live Data
    26. A Community Resource of Spectra
      • Spectra deposited on ChemSpider as “Open Data” are available for anybody to “Embed” in their articles, blogs, wikis, etc.
    27. Present Dictionaries
      • Chemical names - ChemSpider Validated Names
      • Reactions - Wikipedia Named Reactions and RSC Reaction Ontology reactions
      • Species – Wikipedia “species”
      • To add – New Dictionaries
        • PDB codes
        • IUPAC Gold Book
    28. Conclusions
      • The internet enables chemistry – and at a reduced cost
      • Web 2.0 is here and improving quality – to benefit 3.0
      • Question Quality!
      • Crowdsourcing for expansion, curation and integration
      • Classical models may die quite quickly – business models must change soon or fail
      • Publishers – heed the proliferation of InChIs for Chemistry
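
The lookup order described on slides 11 to 13 (validated dictionary first, then Name to Structure conversion, then a search of non-validated names) can be sketched as a simple fallback chain. This is only an illustrative sketch, not ChemMantis code: the function `resolve_name`, the `Resolution` record, the toy dictionaries, and the `toy_nts` converter are all hypothetical. Treating suppressed names as skipped, leaving validated matches without a colored underline, and leaving names found nowhere unmarked are assumptions; the slides only state that BLUE marks Name to Structure conversions and RED marks non-validated names that could not be converted.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional, Set


@dataclass
class Resolution:
    name: str
    source: str               # "suppressed", "validated", "name_to_structure", "non_validated", "unresolved"
    structure: Optional[str]  # placeholder structure identifier, or None
    underline: Optional[str]  # "blue", "red", or None


def resolve_name(
    name: str,
    validated: Dict[str, str],
    name_to_structure: Callable[[str], Optional[str]],
    non_validated: Dict[str, str],
    suppress_list: Set[str],
) -> Resolution:
    """Resolve a detected name using the fallback order from slide 11."""
    key = name.strip().lower()

    # Assumption: names on the Suppress List are not marked up at all.
    if key in suppress_list:
        return Resolution(name, "suppressed", None, None)

    # Step 1: validated dictionary (assumption: no colored underline).
    if key in validated:
        return Resolution(name, "validated", validated[key], None)

    # Step 2: Name to Structure conversion, shown with a BLUE underline.
    structure = name_to_structure(key)
    if structure is not None:
        return Resolution(name, "name_to_structure", structure, "blue")

    # Step 3: non-validated names, shown with a RED underline.
    if key in non_validated:
        return Resolution(name, "non_validated", non_validated[key], "red")

    # Assumption: a name found nowhere is left unmarked.
    return Resolution(name, "unresolved", None, None)


# Toy data, invented purely for illustration.
toy_validated = {"aspirin": "struct:aspirin"}
toy_non_validated = {"compound x": "struct:compound-x"}
toy_suppressed = {"water"}


def toy_nts(name: str) -> Optional[str]:
    # Stand-in for a real Name to Structure algorithm: "converts" only
    # names ending in "ane" so that each branch can be exercised.
    return "struct:" + name if name.endswith("ane") else None


results = [
    resolve_name(n, toy_validated, toy_nts, toy_non_validated, toy_suppressed)
    for n in ["Aspirin", "hexane", "compound X", "water", "foo"]
]
```

Each entry in `results` records which stage resolved the name, which is the information a markup layer would need to choose between a blue underline, a red underline, or no markup.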

    Antony Williams, ChemSpiderman, 7 months ago

    © All Rights Reserved
