SlideShare a Scribd company logo
Using ODIN for a PharmGKB revalidation experiment

       Fabio Rinaldi1 , Simon Clematide1 , Yael Garten2 , Michelle
Whirl-Carrillo2 , Li Gong2 , Joan M. Hebert2 , Katrin Sangkuhl2 , Caroline
              F. Thorn2 , Teri E. Klein2 , Russ B. Altman2 .

                     1 OntoGene   group, University of Zurich
                     2 PharmGKB    group, Stanford University


                            Biocuration 2012
Intro IE Approach Revalidation Results Conclusion Extra   PharmGKB        OntoGene



Introduction
    PharmGKB
    OntoGene
IE Approach
   Entities
   Interactions
Revalidation
Results
Conclusion
   Outlook
   Acknowledgments
Extra
   ME Ranking
   Evaluation

Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB     2 / 42
Intro IE Approach Revalidation Results Conclusion Extra   PharmGKB        OntoGene


PharmGKB



Mission
PharmGKB is a pharmacogenomics knowledge resource that encompasses
clinical information, potentially clinically actionable gene-drug associations
and genotype-phenotype relationships

Approach
PharmGKB collects, curates and disseminates knowledge about the impact
of human genetic variation on drug responses through the many activities,
including Annotating genetic variants and gene-drug-disease relationships
via literature reviews




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB     3 / 42
Intro IE Approach Revalidation Results Conclusion Extra   PharmGKB        OntoGene


PharmGKB



Mission
PharmGKB is a pharmacogenomics knowledge resource that encompasses
clinical information, potentially clinically actionable gene-drug associations
and genotype-phenotype relationships

Approach
PharmGKB collects, curates and disseminates knowledge about the impact
of human genetic variation on drug responses through the many activities,
including Annotating genetic variants and gene-drug-disease relationships
via literature reviews




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB     3 / 42
Intro IE Approach Revalidation Results Conclusion Extra   PharmGKB        OntoGene


http://www.pharmgkb.org/




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB     4 / 42
Intro IE Approach Revalidation Results Conclusion Extra   PharmGKB        OntoGene


OntoGene group


Aims
Develop innovative text mining technologies for the automatic extraction
of information from the biomedical literature.

                                 http://www.ontogene.org/

Selected results
        PPI,IMT BioCreative 2006
        PPI BioCreative 2009 (best results)
        ACT, IMT, IAT, BioCreative 2010




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB     5 / 42
Intro IE Approach Revalidation Results Conclusion Extra   PharmGKB        OntoGene


OntoGene group


Aims
Develop innovative text mining technologies for the automatic extraction
of information from the biomedical literature.

                                 http://www.ontogene.org/

Selected results
        PPI,IMT BioCreative 2006
        PPI BioCreative 2009 (best results)
        ACT, IMT, IAT, BioCreative 2010




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB     5 / 42
Intro IE Approach Revalidation Results Conclusion Extra   PharmGKB        OntoGene


SASEBio: Missions




SASEBio: Semi-automated semantic enrichment of biomedical texts
        Mission I “Relation/Text Mining”: Extraction of semantic relations
        between biomedical entities (proteins, genes, drugs) using linguistic
        text mining methods
        Mission II “Literature Curation”: Development of a flexible interactive
        curation interface for efficient human validation and annotation




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB     6 / 42
Intro IE Approach Revalidation Results Conclusion Extra   PharmGKB        OntoGene


SASEBio: Missions




SASEBio: Semi-automated semantic enrichment of biomedical texts
        Mission I “Relation/Text Mining”: Extraction of semantic relations
        between biomedical entities (proteins, genes, drugs) using linguistic
        text mining methods
        Mission II “Literature Curation”: Development of a flexible interactive
        curation interface for efficient human validation and annotation




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB     6 / 42
Intro IE Approach Revalidation Results Conclusion Extra   PharmGKB        OntoGene


Relation/Text Mining: Automatic Document Analysis




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB     7 / 42
Intro IE Approach Revalidation Results Conclusion Extra   PharmGKB        OntoGene


Relation Mining: Syntactic Approach


Using dependency parses and machine learning




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB     8 / 42
Intro IE Approach Revalidation Results Conclusion Extra   PharmGKB        OntoGene


Literature Curation: Interactive Curation Environment




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB     9 / 42
Intro IE Approach Revalidation Results Conclusion Extra   PharmGKB        OntoGene


ODIN: Interactive Curation Environment
Using client-side Web-based techniques
XML, CSS, DOM manipulation by JavaScript and AJAX




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB    10 / 42
Intro IE Approach Revalidation Results Conclusion Extra   PharmGKB        OntoGene


ODIN: Interactive Curation Environment




Extensive logging facilities




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB    11 / 42
Intro IE Approach Revalidation Results Conclusion Extra   Entities        Interactions



Introduction
    PharmGKB
    OntoGene
IE Approach
   Entities
   Interactions
Revalidation
Results
Conclusion
   Outlook
   Acknowledgments
Extra
   ME Ranking
   Evaluation

Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB       12 / 42
Intro IE Approach Revalidation Results Conclusion Extra   Entities        Interactions


Relations between Genes, Drugs, Diseases


PharmGKB: Pharmacogenomics Knowledge Base as a Gold Standard
Subset of information in PharmGKB used:
        26,122 binary relations between diseases, drugs, and genes
        5062 PubMed abstracts referenced




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB       13 / 42
Intro IE Approach Revalidation Results Conclusion Extra   Entities        Interactions


Relations between Genes, Drugs, Diseases


PharmGKB: Pharmacogenomics Knowledge Base as a Gold Standard
Subset of information in PharmGKB used:
        26,122 binary relations between diseases, drugs, and genes
        5062 PubMed abstracts referenced

Goal
Compute high-quality relation candidates and rank them according to a
confidence score.

Information used for text mining
PubMed abstracts plus MeSH terms and chemical substances terms.



Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB       13 / 42
Intro IE Approach Revalidation Results Conclusion Extra   Entities        Interactions


Baseline: Abstract-wide Co-occurence-based Candidate
Relation Generation

Basic idea
Combine all concepts identified in the abstract into relation candidate
pairs.
However, do not combine concepts stemming from the same ambiguous
term.




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB       14 / 42
Intro IE Approach Revalidation Results Conclusion Extra      Entities             Interactions


Baseline: Abstract-wide Co-occurence-based Candidate
Relation Generation

Basic idea
Combine all concepts identified in the abstract into relation candidate
pairs.
However, do not combine concepts stemming from the same ambiguous
term.

Basic ranking: Occurrences and zoning
Score of a pair of concepts c1 , c2 in an abstract (C = all concepts):

                                                          freq(c1 ) + freq(c2 )
                              score(c1 , c2 ) =
                                                                freq(C )

Text zone boosting: An occurrence in an article title is counted 10 times.

Biocuration 2012                         Rinaldi et al.      ODIN-PharmGKB            14 / 42
Intro IE Approach Revalidation Results Conclusion Extra   Entities        Interactions


Improving Relation Ranking



Core ideas for improved ranking
        Identify noisy concepts recognized by term recognizer and penalize
        them.
        Weight individual concepts according to their likeliness to appear in a
        gold relation!
        Adapt ranking of relations to gold standard.
        Combine the weights of individual concepts for the score of relation
        candidates.
        Generally penalize relations of the same type (rare phenomenon)




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB       15 / 42
Intro IE Approach Revalidation Results Conclusion Extra   Entities        Interactions


Improving Relation Ranking



Core ideas for improved ranking
        Identify noisy concepts recognized by term recognizer and penalize
        them.
        Weight individual concepts according to their likeliness to appear in a
        gold relation!
        Adapt ranking of relations to gold standard.
        Combine the weights of individual concepts for the score of relation
        candidates.
        Generally penalize relations of the same type (rare phenomenon)




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB       15 / 42
Intro IE Approach Revalidation Results Conclusion Extra   Entities        Interactions


Improving Relation Ranking



Core ideas for improved ranking
        Identify noisy concepts recognized by term recognizer and penalize
        them.
        Weight individual concepts according to their likeliness to appear in a
        gold relation!
        Adapt ranking of relations to gold standard.
        Combine the weights of individual concepts for the score of relation
        candidates.
        Generally penalize relations of the same type (rare phenomenon)




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB       15 / 42
Intro IE Approach Revalidation Results Conclusion Extra



Introduction
    PharmGKB
    OntoGene
IE Approach
   Entities
   Interactions
Revalidation
Results
Conclusion
   Outlook
   Acknowledgments
Extra
   ME Ranking
   Evaluation

Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB   16 / 42
Intro IE Approach Revalidation Results Conclusion Extra


Revalidation Experiment

Goal
Revalidation of PharmGKB relations with respect to false positives.
Collaboration with Stanford Center for Biomedical Informatics
Research
                                                                          Relations   Articles
    In 3059 out of 5378 articles we find all                                   2          8
    relations.                                                                3          9
                                                                              4          2
    Keep 1407 where number of relations > 1 and                               5          3
    ≤ 20.                                                                    6-7         1
    Almost half of 3059 contain only 1 relation.                             8-9         1
                                                                           10-20         1
    Each of the 5 curators revalidates 25 articles
    Sampling of articles according to number
    relations per article

Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB                          17 / 42
Intro IE Approach Revalidation Results Conclusion Extra


Revalidation Experiment

Goal
Revalidation of PharmGKB relations with respect to false positives.
Collaboration with Stanford Center for Biomedical Informatics
Research
                                                                          Relations   Articles
    In 3059 out of 5378 articles we find all                                   2          8
    relations.                                                                3          9
                                                                              4          2
    Keep 1407 where number of relations > 1 and                               5          3
    ≤ 20.                                                                    6-7         1
    Almost half of 3059 contain only 1 relation.                             8-9         1
                                                                           10-20         1
    Each of the 5 curators revalidates 25 articles
    Sampling of articles according to number
    relations per article

Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB                          17 / 42
Intro IE Approach Revalidation Results Conclusion Extra


Revalidation Experiment

Goal
Revalidation of PharmGKB relations with respect to false positives.
Collaboration with Stanford Center for Biomedical Informatics
Research
                                                                          Relations   Articles
    In 3059 out of 5378 articles we find all                                   2          8
    relations.                                                                3          9
                                                                              4          2
    Keep 1407 where number of relations > 1 and                               5          3
    ≤ 20.                                                                    6-7         1
    Almost half of 3059 contain only 1 relation.                             8-9         1
                                                                           10-20         1
    Each of the 5 curators revalidates 25 articles
    Sampling of articles according to number
    relations per article

Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB                          17 / 42
Intro IE Approach Revalidation Results Conclusion Extra


Revalidation Experiment

Goal
Revalidation of PharmGKB relations with respect to false positives.
Collaboration with Stanford Center for Biomedical Informatics
Research
                                                                          Relations   Articles
    In 3059 out of 5378 articles we find all                                   2          8
    relations.                                                                3          9
                                                                              4          2
    Keep 1407 where number of relations > 1 and                               5          3
    ≤ 20.                                                                    6-7         1
    Almost half of 3059 contain only 1 relation.                             8-9         1
                                                                           10-20         1
    Each of the 5 curators revalidates 25 articles
    Sampling of articles according to number
    relations per article

Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB                          17 / 42
Intro IE Approach Revalidation Results Conclusion Extra


Revalidation Process and Categories

Revalidation process
        Our initial setup from IAT BioCreative task: Curator deletes
        unwanted relations and exports the wanted.
        But curators didn’t like that: The want checkboxes for revalidation
        categories for each relation




http://kitt.cl.uzh.ch/kitt/bcms/pharmgkbmeB/#pmid=11990384
Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB   18 / 42
Intro IE Approach Revalidation Results Conclusion Extra


Revalidation Process and Categories

Revalidation process
        Our initial setup from IAT BioCreative task: Curator deletes
        unwanted relations and exports the wanted.
        But curators didn’t like that: The want checkboxes for revalidation
        categories for each relation

Revalidation categories
        Our initial setup: verified = true positive; falsified = false positive
        But curators wanted more:
               Need full text: A relation can only be revalidated by recourse to full
               text
               Negative relation: Article denies a relation between two entities

http://kitt.cl.uzh.ch/kitt/bcms/pharmgkbmeB/#pmid=11990384
Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB           18 / 42
Intro IE Approach Revalidation Results Conclusion Extra


Customized ODIN interface




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB   19 / 42
Intro IE Approach Revalidation Results Conclusion Extra


Lessons Learnt for Usability



    1 Ask experienced users what they want (or what they are used to)
    2 Rapidly implement prototypes and get feedback from users!
      (The use of a JavaScript framework allows this easily!)
    3 Let the users test on real data!
    4 Respect user needs (as far as possible or sensible)!
      Goto item 1!
        Prepare simple and good documentation!
        Be prepared for the unforeseeable!




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB   20 / 42
Intro IE Approach Revalidation Results Conclusion Extra


Lessons Learnt for Usability



    1 Ask experienced users what they want (or what they are used to)
    2 Rapidly implement prototypes and get feedback from users!
      (The use of a JavaScript framework allows this easily!)
    3 Let the users test on real data!
    4 Respect user needs (as far as possible or sensible)!
      Goto item 1!
        Prepare simple and good documentation!
        Be prepared for the unforeseeable!




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB   20 / 42
Intro IE Approach Revalidation Results Conclusion Extra


Lessons Learnt for Usability



    1 Ask experienced users what they want (or what they are used to)
    2 Rapidly implement prototypes and get feedback from users!
      (The use of a JavaScript framework allows this easily!)
    3 Let the users test on real data!
    4 Respect user needs (as far as possible or sensible)!
      Goto item 1!
        Prepare simple and good documentation!
        Be prepared for the unforeseeable!




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB   20 / 42
Intro IE Approach Revalidation Results Conclusion Extra


Lessons Learnt for Usability



    1 Ask experienced users what they want (or what they are used to)
    2 Rapidly implement prototypes and get feedback from users!
      (The use of a JavaScript framework allows this easily!)
    3 Let the users test on real data!
    4 Respect user needs (as far as possible or sensible)!
      Goto item 1!
        Prepare simple and good documentation!
        Be prepared for the unforeseeable!




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB   20 / 42
Intro IE Approach Revalidation Results Conclusion Extra


Lessons Learnt for Usability



    1 Ask experienced users what they want (or what they are used to)
    2 Rapidly implement prototypes and get feedback from users!
      (The use of a JavaScript framework allows this easily!)
    3 Let the users test on real data!
    4 Respect user needs (as far as possible or sensible)!
      Goto item 1!
        Prepare simple and good documentation!
        Be prepared for the unforeseeable!




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB   20 / 42
Intro IE Approach Revalidation Results Conclusion Extra


Lessons Learnt for Usability



    1 Ask experienced users what they want (or what they are used to)
    2 Rapidly implement prototypes and get feedback from users!
      (The use of a JavaScript framework allows this easily!)
    3 Let the users test on real data!
    4 Respect user needs (as far as possible or sensible)!
      Goto item 1!
        Prepare simple and good documentation!
        Be prepared for the unforeseeable!




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB   20 / 42
Intro IE Approach Revalidation Results Conclusion Extra



Introduction
    PharmGKB
    OntoGene
IE Approach
   Entities
   Interactions
Revalidation
Results
Conclusion
   Outlook
   Acknowledgments
Extra
   ME Ranking
   Evaluation

Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB   21 / 42
Intro IE Approach Revalidation Results Conclusion Extra


Revalidation Results




                                                reject




                             needs full text



                                   negative
                                                                       confirm




Biocuration 2012                           Rinaldi et al.   ODIN-PharmGKB        22 / 42
Intro IE Approach Revalidation Results Conclusion Extra


Revalidation Results by Relation Types


                                                                              reject
                                                                              needs full text
                                                                              negative
                                                                              confirm
                                          150
                    Number of relations

                                          100
                                          50
                                          0




                                                Disease/Drug Disease/Ds.      Drug/Drug         Drug/Gene   Gene/Gene

                                                                           Relation types
Biocuration 2012                                             Rinaldi et al.        ODIN-PharmGKB                        23 / 42
Intro IE Approach Revalidation Results Conclusion Extra


Revalidation Results by Curators


                                                                                          reject
                                          70                                              needs full text
                                                                                          negative
                                                                                          confirm
                                          60
                                          50
                    Number of relations

                                          40
                                          30
                                          20
                                          10
                                          0




                                               A        B             C           D              E

                                                                    Curator
Biocuration 2012                                   Rinaldi et al.         ODIN-PharmGKB                     24 / 42
Intro IE Approach Revalidation Results Conclusion Extra


Revalidation Results by Confidence Score Ranking



                                                                               1.0
                                                                                                                                          confirm
                                                                                                                                          negative
                    Relative distribution of decisions for curated relations


                                                                                                                                          needs full text
                                                                                                                                          reject
                                                                               0.8
                                                                               0.6
                                                                               0.4
                                                                               0.2
                                                                               0.0




                                                                                     1.                  2.                3−5.              6−20.

                                                                                          Rank of a relation according to the confidence score
Biocuration 2012                                                                              Rinaldi et al.        ODIN-PharmGKB                           25 / 42
Intro IE Approach Revalidation Results Conclusion Extra


Concept Identification Quality as Rated by Curators




                                                      bad
                                                                       N/A

                                         ok




                                                                good




Biocuration 2012                         Rinaldi et al.     ODIN-PharmGKB    26 / 42
Intro IE Approach Revalidation Results Conclusion Extra


Concept Identification Quality as Rated by Curators



                               25
                                                                                    N/A
                                                                                    good
                                                                                    ok
                                                                                    bad
                               20
                               15
                    Articles

                               10
                               5
                               0




                                    A         B             C           D       E

                                                          Curator
Biocuration 2012                         Rinaldi et al.         ODIN-PharmGKB              27 / 42
Intro IE Approach Revalidation Results Conclusion Extra


Meantime for Decision Taking for One Relation


                                                                             q

                                                                       350
                                                                             q
                                                                             q
                                                                             q
                    Meantime of curation time per article in seconds

                                                                       300


                                                                                      q
                                                                       250




                                                                                                               q
                                                                       200




                                                                                                               q
                                                                       150




                                                                                      q
                                                                                                                        q
                                                                       100




                                                                                                                        q
                                                                                                                        q
                                                                                                    q
                                                                       50




                                                                                                    q
                                                                                                    q
                                                                       0




                                                                             A        B             C          D        E

                                                                                                  Curator
Biocuration 2012                                                                 Rinaldi et al.         ODIN-PharmGKB       28 / 42
Intro IE Approach Revalidation Results Conclusion Extra


Concept Identification Quality and Meantime for Decision
Taking


                                                                          350                             q


                                                                                                          q
                                                                                 q                                                  q
                       Meantime of curation time per article in seconds

                                                                          300




                                                                                                          q
                                                                          250




                                                                                                          q
                                                                          200




                                                                                                                                    q
                                                                                                                                    q
                                                                          150




                                                                                                                                    q
                                                                          100
                                                                          50
                                                                          0




                                                                                bad                       ok                      good

                                                                                Rating of quality of concept identification per article
Biocuration 2012                                                                     Rinaldi et al.            ODIN-PharmGKB              29 / 42
Intro IE Approach Revalidation Results Conclusion Extra   Outlook         Acknowledgments



Introduction
    PharmGKB
    OntoGene
IE Approach
   Entities
   Interactions
Revalidation
Results
Conclusion
   Outlook
   Acknowledgments
Extra
   ME Ranking
   Evaluation

Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB           30 / 42
Intro IE Approach Revalidation Results Conclusion Extra   Outlook         Acknowledgments


Conclusion




        The PharmGKB resource is an interesting gold standard for relation
        detection between drugs, genes and diseases (apart from the common
        protein-protein interaction detection task)
        Proper ranking is crucial for real-world applications.
        Supervised machine learning methods improve rankings dramatically.
        Usability of the interface as a crucial acceptability criteria.




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB           31 / 42
Intro IE Approach Revalidation Results Conclusion Extra   Outlook         Acknowledgments


Future Work




        For measuring inter-annotator agreement, each article sample should
        be revalidated by at least two curators
        Another experiment for the detection of false negatives: Select
        PubMed articles where our text mining systems suggests a
        non-existing relation with high confidence score.
        Consider other databases: we are interested in research collaborations.




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB           32 / 42
Intro IE Approach Revalidation Results Conclusion Extra   Outlook         Acknowledgments


Future Work




        For measuring inter-annotator agreement, each article sample should
        be revalidated by at least two curators
        Another experiment for the detection of false negatives: Select
        PubMed articles where our text mining systems suggests a
        non-existing relation with high confidence score.
        Consider other databases: we are interested in research collaborations.




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB           32 / 42
Intro IE Approach Revalidation Results Conclusion Extra   Outlook         Acknowledgments


Future Work




        For measuring inter-annotator agreement, each article sample should
        be revalidated by at least two curators
        Another experiment for the detection of false negatives: Select
        PubMed articles where our text mining systems suggests a
        non-existing relation with high confidence score.
        Consider other databases: we are interested in research collaborations.




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB           32 / 42
Intro IE Approach Revalidation Results Conclusion Extra   Outlook         Acknowledgments


SMBM 2012
Semantic Mining in Biomedicine, Zurich, September 3-4, 2012
http://www.smbm.eu/




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB           33 / 42
Intro IE Approach Revalidation Results Conclusion Extra   Outlook         Acknowledgments


SMBM 2012
Semantic Mining in Biomedicine, Zurich, September 3-4, 2012
http://www.smbm.eu/




Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB           33 / 42
Intro IE Approach Revalidation Results Conclusion Extra   Outlook         Acknowledgments


Acknowledgements


        Yael Garten, Michelle Whirl-Carillo, Li Gong, Joan M. Hebert, Katrin
        Sangkuhl, Caroline F. Thorn, Teri E. Klein, Russ B. Altman from
        Stanford University
        Gerold Schneider and Kaarel Kaljurand
        Martin Romacker from NITAS, Novartis



                           Thank you for your attention!

                                             Questions?

Biocuration 2012                         Rinaldi et al.   ODIN-PharmGKB           34 / 42

More Related Content

Similar to Rinaldi - ODIN

Machine learning in computational docking
Machine learning in computational dockingMachine learning in computational docking
Machine learning in computational docking
Mohamed AbdElAziz Khamis
 
3 d virtual screening of pknb inhibitors using data
3 d virtual screening of pknb inhibitors using data3 d virtual screening of pknb inhibitors using data
3 d virtual screening of pknb inhibitors using data
Abhik Seal
 
Significance of computational tools in drug discovery
Significance of computational tools in drug discoverySignificance of computational tools in drug discovery
Significance of computational tools in drug discovery
DrMopuriDeepaReddy
 
Molecular docking
Molecular dockingMolecular docking
Molecular docking
Maakasaikumar
 
Reproducibility in cheminformatics and computational chemistry research: cert...
Reproducibility in cheminformatics and computational chemistry research: cert...Reproducibility in cheminformatics and computational chemistry research: cert...
Reproducibility in cheminformatics and computational chemistry research: cert...
Greg Landrum
 
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Various Computational Tools used in Drug Design
Various Computational Tools used in Drug DesignVarious Computational Tools used in Drug Design
Various Computational Tools used in Drug Design
FirujAhmed2
 
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS
santosh Kumbhar
 
chandrakant
chandrakantchandrakant
chandrakant
Chandrakant Roy
 
Mining public domain data as a basis for drug repurposing
Mining public domain data as a basis for drug repurposingMining public domain data as a basis for drug repurposing
Omics int conference series analbioanal dr sudeb mandal jr scientist vimta ...
Omics int conference series analbioanal dr sudeb mandal   jr scientist vimta ...Omics int conference series analbioanal dr sudeb mandal   jr scientist vimta ...
Omics int conference series analbioanal dr sudeb mandal jr scientist vimta ...
Dr. Sudeb Mandal
 
Seminários avancados .pptx
Seminários avancados .pptxSeminários avancados .pptx
Seminários avancados .pptx
DrDionatanGomes
 
Docking
DockingDocking
Docking
Monika Verma
 
new drug discovery studies
new drug discovery studiesnew drug discovery studies
new drug discovery studies
Drx Rather Ishfaq
 
Docking Score Functions
Docking Score FunctionsDocking Score Functions
Docking Score Functions
SAKEEL AHMED
 
Sunday (2) lipinski
Sunday (2) lipinskiSunday (2) lipinski
Sunday (2) lipinski
plmiami
 
COMPUTER AIDED DRUG DESIGN BYJayant_Nimkar
COMPUTER AIDED DRUG DESIGN BYJayant_NimkarCOMPUTER AIDED DRUG DESIGN BYJayant_Nimkar
COMPUTER AIDED DRUG DESIGN BYJayant_Nimkar
78JAYANTNIMKAR
 
COMPUTER AISES DRUG DESIGN .BY JAYA NT NIMKAR
COMPUTER AISES DRUG DESIGN .BY JAYA NT NIMKARCOMPUTER AISES DRUG DESIGN .BY JAYA NT NIMKAR
COMPUTER AISES DRUG DESIGN .BY JAYA NT NIMKAR
78JAYANTNIMKAR
 
Webinar: New RMC - Your lead_optimization Solution June082017
Webinar: New RMC - Your lead_optimization Solution June082017Webinar: New RMC - Your lead_optimization Solution June082017
Webinar: New RMC - Your lead_optimization Solution June082017
Ann-Marie Roche
 
Pharmacophore mapping in Drug Development
Pharmacophore mapping in Drug DevelopmentPharmacophore mapping in Drug Development
Pharmacophore mapping in Drug Development
Mbachu Chinedu
 

Similar to Rinaldi - ODIN (20)

Machine learning in computational docking
Machine learning in computational dockingMachine learning in computational docking
Machine learning in computational docking
 
3 d virtual screening of pknb inhibitors using data
3 d virtual screening of pknb inhibitors using data3 d virtual screening of pknb inhibitors using data
3 d virtual screening of pknb inhibitors using data
 
Significance of computational tools in drug discovery
Significance of computational tools in drug discoverySignificance of computational tools in drug discovery
Significance of computational tools in drug discovery
 
Molecular docking
Molecular dockingMolecular docking
Molecular docking
 
Reproducibility in cheminformatics and computational chemistry research: cert...
Reproducibility in cheminformatics and computational chemistry research: cert...Reproducibility in cheminformatics and computational chemistry research: cert...
Reproducibility in cheminformatics and computational chemistry research: cert...
 
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
 
Various Computational Tools used in Drug Design
Various Computational Tools used in Drug DesignVarious Computational Tools used in Drug Design
Various Computational Tools used in Drug Design
 
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS
 
chandrakant
chandrakantchandrakant
chandrakant
 
Mining public domain data as a basis for drug repurposing
Mining public domain data as a basis for drug repurposingMining public domain data as a basis for drug repurposing
Mining public domain data as a basis for drug repurposing
 
Omics int conference series analbioanal dr sudeb mandal jr scientist vimta ...
Omics int conference series analbioanal dr sudeb mandal   jr scientist vimta ...Omics int conference series analbioanal dr sudeb mandal   jr scientist vimta ...
Omics int conference series analbioanal dr sudeb mandal jr scientist vimta ...
 
Seminários avancados .pptx
Seminários avancados .pptxSeminários avancados .pptx
Seminários avancados .pptx
 
Docking
DockingDocking
Docking
 
new drug discovery studies
new drug discovery studiesnew drug discovery studies
new drug discovery studies
 
Docking Score Functions
Docking Score FunctionsDocking Score Functions
Docking Score Functions
 
Sunday (2) lipinski
Sunday (2) lipinskiSunday (2) lipinski
Sunday (2) lipinski
 
COMPUTER AIDED DRUG DESIGN BYJayant_Nimkar
COMPUTER AIDED DRUG DESIGN BYJayant_NimkarCOMPUTER AIDED DRUG DESIGN BYJayant_Nimkar
COMPUTER AIDED DRUG DESIGN BYJayant_Nimkar
 
COMPUTER AISES DRUG DESIGN .BY JAYA NT NIMKAR
COMPUTER AISES DRUG DESIGN .BY JAYA NT NIMKARCOMPUTER AISES DRUG DESIGN .BY JAYA NT NIMKAR
COMPUTER AISES DRUG DESIGN .BY JAYA NT NIMKAR
 
Webinar: New RMC - Your lead_optimization Solution June082017
Webinar: New RMC - Your lead_optimization Solution June082017Webinar: New RMC - Your lead_optimization Solution June082017
Webinar: New RMC - Your lead_optimization Solution June082017
 
Pharmacophore mapping in Drug Development
Pharmacophore mapping in Drug DevelopmentPharmacophore mapping in Drug Development
Pharmacophore mapping in Drug Development
 

Recently uploaded

CLEAR ALIGNER THERAPY IN ORTHODONTICS .pptx
CLEAR ALIGNER THERAPY IN ORTHODONTICS .pptxCLEAR ALIGNER THERAPY IN ORTHODONTICS .pptx
CLEAR ALIGNER THERAPY IN ORTHODONTICS .pptx
Government Dental College & Hospital Srinagar
 
Medical Quiz ( Online Quiz for API Meet 2024 ).pdf
Medical Quiz ( Online Quiz for API Meet 2024 ).pdfMedical Quiz ( Online Quiz for API Meet 2024 ).pdf
Medical Quiz ( Online Quiz for API Meet 2024 ).pdf
Jim Jacob Roy
 
Skin Diseases That Happen During Summer.
 Skin Diseases That Happen During Summer. Skin Diseases That Happen During Summer.
Skin Diseases That Happen During Summer.
Gokuldas Hospital
 
Hemodialysis: Chapter 5, Dialyzers Overview - Dr.Gawad
Hemodialysis: Chapter 5, Dialyzers Overview - Dr.GawadHemodialysis: Chapter 5, Dialyzers Overview - Dr.Gawad
Hemodialysis: Chapter 5, Dialyzers Overview - Dr.Gawad
NephroTube - Dr.Gawad
 
Lecture 6 -- Memory 2015.pptlearning occurs when a stimulus (unconditioned st...
Lecture 6 -- Memory 2015.pptlearning occurs when a stimulus (unconditioned st...Lecture 6 -- Memory 2015.pptlearning occurs when a stimulus (unconditioned st...
Lecture 6 -- Memory 2015.pptlearning occurs when a stimulus (unconditioned st...
AyushGadhvi1
 
Cell Therapy Expansion and Challenges in Autoimmune Disease
Cell Therapy Expansion and Challenges in Autoimmune DiseaseCell Therapy Expansion and Challenges in Autoimmune Disease
Cell Therapy Expansion and Challenges in Autoimmune Disease
Health Advances
 
Test bank for karp s cell and molecular biology 9th edition by gerald karp.pdf
Test bank for karp s cell and molecular biology 9th edition by gerald karp.pdfTest bank for karp s cell and molecular biology 9th edition by gerald karp.pdf
Test bank for karp s cell and molecular biology 9th edition by gerald karp.pdf
rightmanforbloodline
 
Pharmacology of 5-hydroxytryptamine and Antagonist
Pharmacology of 5-hydroxytryptamine and AntagonistPharmacology of 5-hydroxytryptamine and Antagonist
Pharmacology of 5-hydroxytryptamine and Antagonist
Dr. Nikhilkumar Sakle
 
Cervical Disc Arthroplasty ORSI 2024.pptx
Cervical Disc Arthroplasty ORSI 2024.pptxCervical Disc Arthroplasty ORSI 2024.pptx
Cervical Disc Arthroplasty ORSI 2024.pptx
LEFLOT Jean-Louis
 
DECLARATION OF HELSINKI - History and principles
DECLARATION OF HELSINKI - History and principlesDECLARATION OF HELSINKI - History and principles
DECLARATION OF HELSINKI - History and principles
anaghabharat01
 
Osteoporosis - Definition , Evaluation and Management .pdf
Osteoporosis - Definition , Evaluation and Management .pdfOsteoporosis - Definition , Evaluation and Management .pdf
Osteoporosis - Definition , Evaluation and Management .pdf
Jim Jacob Roy
 
CHEMOTHERAPY_RDP_CHAPTER 4_ANTI VIRAL DRUGS.pdf
CHEMOTHERAPY_RDP_CHAPTER 4_ANTI VIRAL DRUGS.pdfCHEMOTHERAPY_RDP_CHAPTER 4_ANTI VIRAL DRUGS.pdf
CHEMOTHERAPY_RDP_CHAPTER 4_ANTI VIRAL DRUGS.pdf
rishi2789
 
Demystifying Fallopian Tube Blockage- Grading the Differences and Implication...
Demystifying Fallopian Tube Blockage- Grading the Differences and Implication...Demystifying Fallopian Tube Blockage- Grading the Differences and Implication...
Demystifying Fallopian Tube Blockage- Grading the Differences and Implication...
FFragrant
 
June 2024 Oncology Cartoons By Dr Kanhu Charan Patro
June 2024 Oncology Cartoons By Dr Kanhu Charan PatroJune 2024 Oncology Cartoons By Dr Kanhu Charan Patro
June 2024 Oncology Cartoons By Dr Kanhu Charan Patro
Kanhu Charan
 
Top Travel Vaccinations in Manchester
Top Travel Vaccinations in ManchesterTop Travel Vaccinations in Manchester
Top Travel Vaccinations in Manchester
NX Healthcare
 
10 Benefits an EPCR Software should Bring to EMS Organizations
10 Benefits an EPCR Software should Bring to EMS Organizations   10 Benefits an EPCR Software should Bring to EMS Organizations
10 Benefits an EPCR Software should Bring to EMS Organizations
Traumasoft LLC
 
vonoprazan A novel drug for GERD presentation
vonoprazan A novel drug for GERD presentationvonoprazan A novel drug for GERD presentation
vonoprazan A novel drug for GERD presentation
Dr.pavithra Anandan
 
SENSORY NEEDS B.SC. NURSING SEMESTER II.
SENSORY NEEDS B.SC. NURSING SEMESTER II.SENSORY NEEDS B.SC. NURSING SEMESTER II.
SENSORY NEEDS B.SC. NURSING SEMESTER II.
KULDEEP VYAS
 
Post-Menstrual Smell- When to Suspect Vaginitis.pptx
Post-Menstrual Smell- When to Suspect Vaginitis.pptxPost-Menstrual Smell- When to Suspect Vaginitis.pptx
Post-Menstrual Smell- When to Suspect Vaginitis.pptx
FFragrant
 
Physical demands in sports - WCSPT Oslo 2024
Physical demands in sports - WCSPT Oslo 2024Physical demands in sports - WCSPT Oslo 2024
Physical demands in sports - WCSPT Oslo 2024
Torstein Dalen-Lorentsen
 

Recently uploaded (20)

CLEAR ALIGNER THERAPY IN ORTHODONTICS .pptx
CLEAR ALIGNER THERAPY IN ORTHODONTICS .pptxCLEAR ALIGNER THERAPY IN ORTHODONTICS .pptx
CLEAR ALIGNER THERAPY IN ORTHODONTICS .pptx
 
Medical Quiz ( Online Quiz for API Meet 2024 ).pdf
Medical Quiz ( Online Quiz for API Meet 2024 ).pdfMedical Quiz ( Online Quiz for API Meet 2024 ).pdf
Medical Quiz ( Online Quiz for API Meet 2024 ).pdf
 
Skin Diseases That Happen During Summer.
 Skin Diseases That Happen During Summer. Skin Diseases That Happen During Summer.
Skin Diseases That Happen During Summer.
 
Hemodialysis: Chapter 5, Dialyzers Overview - Dr.Gawad
Hemodialysis: Chapter 5, Dialyzers Overview - Dr.GawadHemodialysis: Chapter 5, Dialyzers Overview - Dr.Gawad
Hemodialysis: Chapter 5, Dialyzers Overview - Dr.Gawad
 
Lecture 6 -- Memory 2015.pptlearning occurs when a stimulus (unconditioned st...
Lecture 6 -- Memory 2015.pptlearning occurs when a stimulus (unconditioned st...Lecture 6 -- Memory 2015.pptlearning occurs when a stimulus (unconditioned st...
Lecture 6 -- Memory 2015.pptlearning occurs when a stimulus (unconditioned st...
 
Cell Therapy Expansion and Challenges in Autoimmune Disease
Cell Therapy Expansion and Challenges in Autoimmune DiseaseCell Therapy Expansion and Challenges in Autoimmune Disease
Cell Therapy Expansion and Challenges in Autoimmune Disease
 
Test bank for karp s cell and molecular biology 9th edition by gerald karp.pdf
Test bank for karp s cell and molecular biology 9th edition by gerald karp.pdfTest bank for karp s cell and molecular biology 9th edition by gerald karp.pdf
Test bank for karp s cell and molecular biology 9th edition by gerald karp.pdf
 
Pharmacology of 5-hydroxytryptamine and Antagonist
Pharmacology of 5-hydroxytryptamine and AntagonistPharmacology of 5-hydroxytryptamine and Antagonist
Pharmacology of 5-hydroxytryptamine and Antagonist
 
Cervical Disc Arthroplasty ORSI 2024.pptx
Cervical Disc Arthroplasty ORSI 2024.pptxCervical Disc Arthroplasty ORSI 2024.pptx
Cervical Disc Arthroplasty ORSI 2024.pptx
 
DECLARATION OF HELSINKI - History and principles
DECLARATION OF HELSINKI - History and principlesDECLARATION OF HELSINKI - History and principles
DECLARATION OF HELSINKI - History and principles
 
Osteoporosis - Definition , Evaluation and Management .pdf
Osteoporosis - Definition , Evaluation and Management .pdfOsteoporosis - Definition , Evaluation and Management .pdf
Osteoporosis - Definition , Evaluation and Management .pdf
 
CHEMOTHERAPY_RDP_CHAPTER 4_ANTI VIRAL DRUGS.pdf
CHEMOTHERAPY_RDP_CHAPTER 4_ANTI VIRAL DRUGS.pdfCHEMOTHERAPY_RDP_CHAPTER 4_ANTI VIRAL DRUGS.pdf
CHEMOTHERAPY_RDP_CHAPTER 4_ANTI VIRAL DRUGS.pdf
 
Demystifying Fallopian Tube Blockage- Grading the Differences and Implication...
Demystifying Fallopian Tube Blockage- Grading the Differences and Implication...Demystifying Fallopian Tube Blockage- Grading the Differences and Implication...
Demystifying Fallopian Tube Blockage- Grading the Differences and Implication...
 
June 2024 Oncology Cartoons By Dr Kanhu Charan Patro
June 2024 Oncology Cartoons By Dr Kanhu Charan PatroJune 2024 Oncology Cartoons By Dr Kanhu Charan Patro
June 2024 Oncology Cartoons By Dr Kanhu Charan Patro
 
Top Travel Vaccinations in Manchester
Top Travel Vaccinations in ManchesterTop Travel Vaccinations in Manchester
Top Travel Vaccinations in Manchester
 
10 Benefits an EPCR Software should Bring to EMS Organizations
10 Benefits an EPCR Software should Bring to EMS Organizations   10 Benefits an EPCR Software should Bring to EMS Organizations
10 Benefits an EPCR Software should Bring to EMS Organizations
 
vonoprazan A novel drug for GERD presentation
vonoprazan A novel drug for GERD presentationvonoprazan A novel drug for GERD presentation
vonoprazan A novel drug for GERD presentation
 
SENSORY NEEDS B.SC. NURSING SEMESTER II.
SENSORY NEEDS B.SC. NURSING SEMESTER II.SENSORY NEEDS B.SC. NURSING SEMESTER II.
SENSORY NEEDS B.SC. NURSING SEMESTER II.
 
Post-Menstrual Smell- When to Suspect Vaginitis.pptx
Post-Menstrual Smell- When to Suspect Vaginitis.pptxPost-Menstrual Smell- When to Suspect Vaginitis.pptx
Post-Menstrual Smell- When to Suspect Vaginitis.pptx
 
Physical demands in sports - WCSPT Oslo 2024
Physical demands in sports - WCSPT Oslo 2024Physical demands in sports - WCSPT Oslo 2024
Physical demands in sports - WCSPT Oslo 2024
 

Rinaldi - ODIN

  • 1. Using ODIN for a PharmGKB revalidation experiment Fabio Rinaldi1 , Simon Clematide1 , Yael Garten2 , Michelle Whirl-Carrillo2 , Li Gong2 , Joan M. Hebert2 , Katrin Sangkuhl2 , Caroline F. Thorn2 , Teri E. Klein2 , Russ B. Altman2 . 1 OntoGene group, University of Zurich 2 PharmGKB group, Stanford University Biocuration 2012
  • 2. Intro IE Approach Revalidation Results Conclusion Extra PharmGKB OntoGene Introduction PharmGKB OntoGene IE Approach Entities Interactions Revalidation Results Conclusion Outlook Acknowledgments Extra ME Ranking Evaluation Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 2 / 42
  • 3. Intro IE Approach Revalidation Results Conclusion Extra PharmGKB OntoGene PharmGKB Mission PharmGKB is a pharmacogenomics knowledge resource that encompasses clinical information, potentially clinically actionable gene-drug associations and genotype-phenotype relationships Approach PharmGKB collects, curates and disseminates knowledge about the impact of human genetic variation on drug responses through the many activities, including Annotating genetic variants and gene-drug-disease relationships via literature reviews Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 3 / 42
  • 4. Intro IE Approach Revalidation Results Conclusion Extra PharmGKB OntoGene PharmGKB Mission PharmGKB is a pharmacogenomics knowledge resource that encompasses clinical information, potentially clinically actionable gene-drug associations and genotype-phenotype relationships Approach PharmGKB collects, curates and disseminates knowledge about the impact of human genetic variation on drug responses through the many activities, including Annotating genetic variants and gene-drug-disease relationships via literature reviews Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 3 / 42
  • 5. Intro IE Approach Revalidation Results Conclusion Extra PharmGKB OntoGene http://www.pharmgkb.org/ Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 4 / 42
  • 6. Intro IE Approach Revalidation Results Conclusion Extra PharmGKB OntoGene OntoGene group Aims Develop innovative text mining technologies for the automatic extraction of information from the biomedical literature. http://www.ontogene.org/ Selected results PPI,IMT BioCreative 2006 PPI BioCreative 2009 (best results) ACT, IMT, IAT, BioCreative 2010 Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 5 / 42
  • 7. Intro IE Approach Revalidation Results Conclusion Extra PharmGKB OntoGene OntoGene group Aims Develop innovative text mining technologies for the automatic extraction of information from the biomedical literature. http://www.ontogene.org/ Selected results PPI,IMT BioCreative 2006 PPI BioCreative 2009 (best results) ACT, IMT, IAT, BioCreative 2010 Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 5 / 42
  • 8. Intro IE Approach Revalidation Results Conclusion Extra PharmGKB OntoGene SASEBio: Missions SASEBio: Semi-automated semantic enrichment of biomedical texts Mission I “Relation/Text Mining”: Extraction of semantic relations between biomedical entities (proteins, genes, drugs) using linguistic text mining methods Mission II “Literature Curation”: Development of a flexible interactive curation interface for efficient human validation and annotation Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 6 / 42
  • 9. Intro IE Approach Revalidation Results Conclusion Extra PharmGKB OntoGene SASEBio: Missions SASEBio: Semi-automated semantic enrichment of biomedical texts Mission I “Relation/Text Mining”: Extraction of semantic relations between biomedical entities (proteins, genes, drugs) using linguistic text mining methods Mission II “Literature Curation”: Development of a flexible interactive curation interface for efficient human validation and annotation Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 6 / 42
  • 10. Intro IE Approach Revalidation Results Conclusion Extra PharmGKB OntoGene Relation/Text Mining: Automatic Document Analysis Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 7 / 42
  • 11. Intro IE Approach Revalidation Results Conclusion Extra PharmGKB OntoGene Relation Mining: Syntactic Approach Using dependency parses and machine learning Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 8 / 42
  • 12. Intro IE Approach Revalidation Results Conclusion Extra PharmGKB OntoGene Literature Curation: Interactive Curation Environment Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 9 / 42
  • 13. Intro IE Approach Revalidation Results Conclusion Extra PharmGKB OntoGene ODIN: Interactive Curation Environment Using client-side Web-based techniques XML, CSS, DOM manipulation by JavaScript and AJAX Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 10 / 42
  • 14. Intro IE Approach Revalidation Results Conclusion Extra PharmGKB OntoGene ODIN: Interactive Curation Environment Extensive logging facilities Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 11 / 42
  • 15. Intro IE Approach Revalidation Results Conclusion Extra Entities Interactions Introduction PharmGKB OntoGene IE Approach Entities Interactions Revalidation Results Conclusion Outlook Acknowledgments Extra ME Ranking Evaluation Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 12 / 42
  • 16. Intro IE Approach Revalidation Results Conclusion Extra Entities Interactions Relations between Genes, Drugs, Diseases PharmGKB: Pharmacogenomics Knowledge Base as a Gold Standard Subset of information in PharmGKB used: 26,122 binary relations between diseases, drugs, and genes 5062 PubMed abstracts referenced Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 13 / 42
  • 17. Intro IE Approach Revalidation Results Conclusion Extra Entities Interactions Relations between Genes, Drugs, Diseases PharmGKB: Pharmacogenomics Knowledge Base as a Gold Standard Subset of information in PharmGKB used: 26,122 binary relations between diseases, drugs, and genes 5062 PubMed abstracts referenced Goal Compute high-quality relation candidates and rank them according to a confidence score. Information used for text mining PubMed abstracts plus MeSH terms and chemical substances terms. Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 13 / 42
  • 18. Intro IE Approach Revalidation Results Conclusion Extra Entities Interactions Baseline: Abstract-wide Co-occurence-based Candidate Relation Generation Basic idea Combine all concepts identified in the abstract into relation candidate pairs. However, do not combine concepts stemming from the same ambiguous term. Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 14 / 42
  • 19. Intro IE Approach Revalidation Results Conclusion Extra Entities Interactions Baseline: Abstract-wide Co-occurence-based Candidate Relation Generation Basic idea Combine all concepts identified in the abstract into relation candidate pairs. However, do not combine concepts stemming from the same ambiguous term. Basic ranking: Occurrences and zoning Score of a pair of concepts c1 , c2 in an abstract (C = all concepts): freq(c1 ) + freq(c2 ) score(c1 , c2 ) = freq(C ) Text zone boosting: An occurrence in an article title is counted 10 times. Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 14 / 42
  • 20. Intro IE Approach Revalidation Results Conclusion Extra Entities Interactions Improving Relation Ranking Core ideas for improved ranking Identify noisy concepts recognized by term recognizer and penalize them. Weight individual concepts according to their likeliness to appear in a gold relation! Adapt ranking of relations to gold standard. Combine the weights of individual concepts for the score of relation candidates. Generally penalize relations of the same type (rare phenomenon) Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 15 / 42
  • 21. Intro IE Approach Revalidation Results Conclusion Extra Entities Interactions Improving Relation Ranking Core ideas for improved ranking Identify noisy concepts recognized by term recognizer and penalize them. Weight individual concepts according to their likeliness to appear in a gold relation! Adapt ranking of relations to gold standard. Combine the weights of individual concepts for the score of relation candidates. Generally penalize relations of the same type (rare phenomenon) Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 15 / 42
  • 22. Intro IE Approach Revalidation Results Conclusion Extra Entities Interactions Improving Relation Ranking Core ideas for improved ranking Identify noisy concepts recognized by term recognizer and penalize them. Weight individual concepts according to their likeliness to appear in a gold relation! Adapt ranking of relations to gold standard. Combine the weights of individual concepts for the score of relation candidates. Generally penalize relations of the same type (rare phenomenon) Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 15 / 42
  • 23. Intro IE Approach Revalidation Results Conclusion Extra Introduction PharmGKB OntoGene IE Approach Entities Interactions Revalidation Results Conclusion Outlook Acknowledgments Extra ME Ranking Evaluation Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 16 / 42
  • 24. Intro IE Approach Revalidation Results Conclusion Extra Revalidation Experiment Goal Revalidation of PharmGKB relations with respect to false positives. Collaboration with Stanford Center for Biomedical Informatics Research Relations Articles In 3059 out of 5378 articles we find all 2 8 relations. 3 9 4 2 Keep 1407 where number of relations > 1 and 5 3 ≤ 20. 6-7 1 Almost half of 3059 contain only 1 relation. 8-9 1 10-20 1 Each of the 5 curators revalidates 25 articles Sampling of articles according to number relations per article Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 17 / 42
  • 25. Intro IE Approach Revalidation Results Conclusion Extra Revalidation Experiment Goal Revalidation of PharmGKB relations with respect to false positives. Collaboration with Stanford Center for Biomedical Informatics Research Relations Articles In 3059 out of 5378 articles we find all 2 8 relations. 3 9 4 2 Keep 1407 where number of relations > 1 and 5 3 ≤ 20. 6-7 1 Almost half of 3059 contain only 1 relation. 8-9 1 10-20 1 Each of the 5 curators revalidates 25 articles Sampling of articles according to number relations per article Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 17 / 42
  • 26. Intro IE Approach Revalidation Results Conclusion Extra Revalidation Experiment Goal Revalidation of PharmGKB relations with respect to false positives. Collaboration with Stanford Center for Biomedical Informatics Research Relations Articles In 3059 out of 5378 articles we find all 2 8 relations. 3 9 4 2 Keep 1407 where number of relations > 1 and 5 3 ≤ 20. 6-7 1 Almost half of 3059 contain only 1 relation. 8-9 1 10-20 1 Each of the 5 curators revalidates 25 articles Sampling of articles according to number relations per article Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 17 / 42
  • 27. Intro IE Approach Revalidation Results Conclusion Extra Revalidation Experiment Goal Revalidation of PharmGKB relations with respect to false positives. Collaboration with Stanford Center for Biomedical Informatics Research Relations Articles In 3059 out of 5378 articles we find all 2 8 relations. 3 9 4 2 Keep 1407 where number of relations > 1 and 5 3 ≤ 20. 6-7 1 Almost half of 3059 contain only 1 relation. 8-9 1 10-20 1 Each of the 5 curators revalidates 25 articles Sampling of articles according to number relations per article Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 17 / 42
  • 28. Intro IE Approach Revalidation Results Conclusion Extra Revalidation Process and Categories Revalidation process Our initial setup from IAT BioCreative task: Curator deletes unwanted relations and exports the wanted. But curators didn’t like that: The want checkboxes for revalidation categories for each relation http://kitt.cl.uzh.ch/kitt/bcms/pharmgkbmeB/#pmid=11990384 Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 18 / 42
  • 29. Intro IE Approach Revalidation Results Conclusion Extra Revalidation Process and Categories Revalidation process Our initial setup from IAT BioCreative task: Curator deletes unwanted relations and exports the wanted. But curators didn’t like that: The want checkboxes for revalidation categories for each relation Revalidation categories Our initial setup: verified = true positive; falsified = false positive But curators wanted more: Need full text: A relation can only be revalidated by recourse to full text Negative relation: Article denies a relation between two entities http://kitt.cl.uzh.ch/kitt/bcms/pharmgkbmeB/#pmid=11990384 Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 18 / 42
  • 30. Intro IE Approach Revalidation Results Conclusion Extra Customized ODIN interface Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 19 / 42
  • 31. Intro IE Approach Revalidation Results Conclusion Extra Lessons Learnt for Usability 1 Ask experienced users what they want (or what they are used to) 2 Rapidly implement prototypes and get feedback from users! (The use of a JavaScript framework allows this easily!) 3 Let the users test on real data! 4 Respect user needs (as far as possible or sensible)! Goto item 1! Prepare simple and good documentation! Be prepared for the unforeseeable! Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 20 / 42
  • 32. Intro IE Approach Revalidation Results Conclusion Extra Lessons Learnt for Usability 1 Ask experienced users what they want (or what they are used to) 2 Rapidly implement prototypes and get feedback from users! (The use of a JavaScript framework allows this easily!) 3 Let the users test on real data! 4 Respect user needs (as far as possible or sensible)! Goto item 1! Prepare simple and good documentation! Be prepared for the unforeseeable! Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 20 / 42
  • 33. Intro IE Approach Revalidation Results Conclusion Extra Lessons Learnt for Usability 1 Ask experienced users what they want (or what they are used to) 2 Rapidly implement prototypes and get feedback from users! (The use of a JavaScript framework allows this easily!) 3 Let the users test on real data! 4 Respect user needs (as far as possible or sensible)! Goto item 1! Prepare simple and good documentation! Be prepared for the unforeseeable! Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 20 / 42
  • 34. Intro IE Approach Revalidation Results Conclusion Extra Lessons Learnt for Usability 1 Ask experienced users what they want (or what they are used to) 2 Rapidly implement prototypes and get feedback from users! (The use of a JavaScript framework allows this easily!) 3 Let the users test on real data! 4 Respect user needs (as far as possible or sensible)! Goto item 1! Prepare simple and good documentation! Be prepared for the unforeseeable! Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 20 / 42
  • 35. Intro IE Approach Revalidation Results Conclusion Extra Lessons Learnt for Usability 1 Ask experienced users what they want (or what they are used to) 2 Rapidly implement prototypes and get feedback from users! (The use of a JavaScript framework allows this easily!) 3 Let the users test on real data! 4 Respect user needs (as far as possible or sensible)! Goto item 1! Prepare simple and good documentation! Be prepared for the unforeseeable! Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 20 / 42
  • 36. Intro IE Approach Revalidation Results Conclusion Extra Lessons Learnt for Usability 1 Ask experienced users what they want (or what they are used to) 2 Rapidly implement prototypes and get feedback from users! (The use of a JavaScript framework allows this easily!) 3 Let the users test on real data! 4 Respect user needs (as far as possible or sensible)! Goto item 1! Prepare simple and good documentation! Be prepared for the unforeseeable! Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 20 / 42
  • 37. Intro IE Approach Revalidation Results Conclusion Extra Introduction PharmGKB OntoGene IE Approach Entities Interactions Revalidation Results Conclusion Outlook Acknowledgments Extra ME Ranking Evaluation Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 21 / 42
  • 38. Intro IE Approach Revalidation Results Conclusion Extra Revalidation Results reject needs full text negative confirm Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 22 / 42
  • 39. Intro IE Approach Revalidation Results Conclusion Extra Revalidation Results by Relation Types reject needs full text negative confirm 150 Number of relations 100 50 0 Disease/Drug Disease/Ds. Drug/Drug Drug/Gene Gene/Gene Relation types Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 23 / 42
  • 40. Intro IE Approach Revalidation Results Conclusion Extra Revalidation Results by Curators reject 70 needs full text negative confirm 60 50 Number of relations 40 30 20 10 0 A B C D E Curator Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 24 / 42
  • 41. Intro IE Approach Revalidation Results Conclusion Extra Revalidation Results by Confidence Score Ranking 1.0 confirm negative Relative distribution of decisions for curated relations needs full text reject 0.8 0.6 0.4 0.2 0.0 1. 2. 3−5. 6−20. Rank of a relation according to the confidence score Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 25 / 42
  • 42. Intro IE Approach Revalidation Results Conclusion Extra Concept Identification Quality as Rated by Curators bad N/A ok good Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 26 / 42
  • 43. Intro IE Approach Revalidation Results Conclusion Extra Concept Identification Quality as Rated by Curators 25 N/A good ok bad 20 15 Articles 10 5 0 A B C D E Curator Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 27 / 42
  • 44. Intro IE Approach Revalidation Results Conclusion Extra Meantime for Decision Taking for One Relation q 350 q q q Meantime of curation time per article in seconds 300 q 250 q 200 q 150 q q 100 q q q 50 q q 0 A B C D E Curator Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 28 / 42
  • 45. Intro IE Approach Revalidation Results Conclusion Extra Concept Identification Quality and Meantime for Decision Taking 350 q q q q Meantime of curation time per article in seconds 300 q 250 q 200 q q 150 q 100 50 0 bad ok good Rating of quality of concept identification per article Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 29 / 42
  • 46. Intro IE Approach Revalidation Results Conclusion Extra Outlook Acknowledgments Introduction PharmGKB OntoGene IE Approach Entities Interactions Revalidation Results Conclusion Outlook Acknowledgments Extra ME Ranking Evaluation Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 30 / 42
  • 47. Intro IE Approach Revalidation Results Conclusion Extra Outlook Acknowledgments Conclusion The PharmGKB resource is an interesting gold standard for relation detection between drugs, genes and diseases (apart from the common protein-protein interaction detection task) Proper ranking is crucial for real-world applications. Supervised machine learning methods improve rankings dramatically. Usability of the interface as a crucial acceptability criteria. Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 31 / 42
  • 48. Intro IE Approach Revalidation Results Conclusion Extra Outlook Acknowledgments Future Work For measuring inter-annotator agreement, each article sample should be revalidated by at least two curators Another experiment for the detection of false negatives: Select PubMed articles where our text mining systems suggests a non-existing relation with high confidence score. Consider other databases: we are interested in research collaborations. Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 32 / 42
  • 49. Intro IE Approach Revalidation Results Conclusion Extra Outlook Acknowledgments Future Work For measuring inter-annotator agreement, each article sample should be revalidated by at least two curators Another experiment for the detection of false negatives: Select PubMed articles where our text mining systems suggests a non-existing relation with high confidence score. Consider other databases: we are interested in research collaborations. Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 32 / 42
  • 50. Intro IE Approach Revalidation Results Conclusion Extra Outlook Acknowledgments Future Work For measuring inter-annotator agreement, each article sample should be revalidated by at least two curators Another experiment for the detection of false negatives: Select PubMed articles where our text mining systems suggests a non-existing relation with high confidence score. Consider other databases: we are interested in research collaborations. Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 32 / 42
  • 51. Intro IE Approach Revalidation Results Conclusion Extra Outlook Acknowledgments SMBM 2012 Semantic Mining in Biomedicine, Zurich, September 3-4, 2012 http://www.smbm.eu/ Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 33 / 42
  • 52. Intro IE Approach Revalidation Results Conclusion Extra Outlook Acknowledgments SMBM 2012 Semantic Mining in Biomedicine, Zurich, September 3-4, 2012 http://www.smbm.eu/ Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 33 / 42
  • 53. Intro IE Approach Revalidation Results Conclusion Extra Outlook Acknowledgments Acknowledgements Yael Garten, Michelle Whirl-Carillo, Li Gong, Joan M. Hebert, Katrin Sangkuhl, Caroline F. Thorn, Teri E. Klein, Russ B. Altman from Stanford University Gerold Schneider and Kaarel Kaljurand Martin Romacker from NITAS, Novartis Thank you for your attention! Questions? Biocuration 2012 Rinaldi et al. ODIN-PharmGKB 34 / 42