SlideShare a Scribd company logo
Guideline                Introduction                   Methods                 Experimental Results




            Inferring Cancer Subnetwork Markers
                     using Density-Constrained Biclustering


             Presenters: Phuong Dao1 , Alexander Schonhuth2

                     1
                       School of Computing Science, Simon Fraser University
                2
                    Algorithmic Computational Biology Group, CWI, Netherlands
Guideline                Introduction           Methods   Experimental Results




                                        Guideline

        Introduction
            Personalized Medicine
            Biomarker Discovery

        Methods
          Motivations
          Our approach

        Experimental Results
           Data
           Classifier Performance
           Markers
Guideline                 Introduction                     Methods   Experimental Results




                                         Introduction
                                         Personalized Medicine




            •   Exact determination of disease status based on
                patient genetics/genomics
            •   Goal: Specific, individual choice of treatment
Guideline                 Introduction                     Methods   Experimental Results




                                         Introduction
                                         Personalized Medicine




            •   Exact determination of disease status based on
                patient genetics/genomics
            •   Goal: Specific, individual choice of treatment
            •   Necessary: Reliable disease markers
Guideline                Introduction        Methods           Experimental Results




                            Biomarker Discovery


      •     Single gene markers: Each gene is ranked according to
            their ability to distinguish samples of different classes
      •     Multigenic markers: Each subset S of genes is ranked
            based on the aggregation ability of all genes in S to
            distinguish samples of different classes
Guideline                             Introduction                               Methods                                          Experimental Results




                                            Single Gene Markers




                                                                                                                      Control 1

                                                                                                                                  Control 2
                                                                                                                                              Control 3
                                                                                           Case 1
                                                                                                    Case 2

                                                                                                             Case 3
                                            Control 1

                                                        Control 2
                                                                    Control 3
                 Case 1
                          Case 2

                                   Case 3


                                                                                Gene 1
                                                                                Gene 3
        Gene 1
        Gene 2                                                                        Differentially Expressed
        Gene 3
        Gene 4                                                                  Gene 2
        Gene 5                                                                  Gene 4
        Gene 6                                                                  Gene 5
                                                                                Gene 6
                                                                                   Non−Differentially Expressed
Guideline                 Introduction                   Methods   Experimental Results




                                Multigenic Markers
                                         Subnetwork Markers




[Chuang et al., Mol.Sys.Biol. (2007)]:
    • Predicting progression of breast
      cancer
    • Subnetwork markers are
      connected subnetworks with
      aggregate expression profiles
      correlates the most with the labels
      of the samples
    • Greedy heuristics for searching
      for optimal subnetwork markers
Guideline                  Introduction                   Methods       Experimental Results




                                 Multigenic Markers
                                          Subnetwork Markers




        [Chowdhury et al., PSB 2010]:
            • Predicting colon cancer subtypes
            • Each marker is a small connected subnetwork N such that genes
              in N cover all disease samples (gene g covers sample s if g is
              differentially expressed in s)
            • Greedy heuristics for searching for the smallest subnetwork
              markers
Guideline                    Introduction                Methods                                  Experimental Results




                                            Motivations
                                  Heterogeneity of Cancer Genomes




      • Cancer genomes evolve
            (many cells in one
            patient have different
            genomes)
      • No two cancer cells of
            two different patients
            are the same




                                                       [Hampton et al., Genome Research (2009)]
Guideline                    Introduction               Methods               Experimental Results




                                            Motivations
                        Proximity of Disease Related Genes in PPI Network


        [Goh et al., PNAS (2007)]:
            • The protein products of genes related to the same disease tend to
              interact with one another
            • Genes related to a disease have coherent functions with respect to the
              Gene Ontology hierarchy
Guideline              Introduction            Methods        Experimental Results




                                      Our Approach



        Each of our subnetwork markers:
         • includes genes that have higher interaction among
           them than expected (densely connected
           subnetworks)
         • contains differentially expressed genes in a fraction of
           all the samples from cancer tissues (partially
           differential expression)
Guideline   Introduction             Methods   Experimental Results




                           Methods
Guideline                     Introduction                     Methods   Experimental Results




                    Densely Connected Subnetworks
                                               Properties



        Let G = (V , E) be a network with edge weights we , e ∈ E.
            • The density θ(G) of G is

                                                            e∈E     we
                                             θ(G) :=         |V |
                                                              2

                      |V |
              where    2     is the number of possible edges in G.
            • G is called α-dense if
                                                θ(G) ≥ α.
            • An α-dense, connected network G is called α-densely
              connected.
Guideline                                    Introduction                                           Methods                        Experimental Results




                                 Partially Differential Expression




                                                                                                                         S1

                                                                                                                              S2

                                                                                                                                   S3
                                                                                               G1
                                 0.95
                                             0.6        0.8                              0.95
                                                                                                         0.85       G1   1    1     0
                                                                     0.9                        0.75           G3
                             0.45
                                                 0.85
                                                                                                                    G2   1    1     1
                                                                                    G2
                                    0.75                      0.25           0.9          0.8                       G3   1    1     0
                     0.7                                0.9                                            0.9          G4   1    1     1
                          0.55               0.5 0.95
                                                                                                G4
             0.8                                                     0.85
                             0.95                         0.75
                                                                             0.95
                                    0.35 0.65                                                   G4
                      0.45                            0.8
                                                                 0.9




                                                                                                                         S1

                                                                                                                              S2

                                                                                                                                   S3
                                                 0.75 0.8
                                                         0.9          0.7
                           0.3             0.8
                                    0.9                                                  0.9            0.7         G4   1    1     1
               0.65                                      0.85
                                 0.8             0.9                  0.95
                                                                                     G5                       G6    G5   0    1     1
                   0.75
                                                                                                                    G6   0    1     1
                                                                                         0.85                            0    1     1
                                                                                                        0.95        G7
                                                                                               G7


        Compute all densely connected subnetworks whose genes are differentially
        expressed in a subset of patients of size at least k (here: k = 2).
Guideline                                    Introduction                                                             Methods                                                                Experimental Results




                               Density Constrained Biclustering
                                                                           Search Strategy

        Theorem: Let α ≥ 0.5. Every α-densely connected network of size n
        contains an α-densely connected subnetwork of size n − 1.


                                                          0.4   A                     0.6   A             0.9     A                             C             0.8   D                    C
                                                      B                           C                   D                         B                         B                        D




                                         C
                                                                    0.6    A                        0.6   A                               0.9       A                       0.8    D
                         0.4         0.6
                    B          A                                C           0.4                 C           0.9                       D             0.4                 B
                                   0.9                                     B                              D                               0.8       B                              C
                        0.8
                               D


                    Density: 0.45
            = [(0.8 + 0.9 + 0.6 + 0.4) / 6]                                                                                           C
                                                                          Not Dense                                                                                               wDCB
                                                                                                                      0.4         0.6
                                                                                                                B           A
                                                                                                                                0.9
                                                                                                                    0.8
                                                                          Not Connected                                     D                                                     maximal wDCB



        Figure: Toy example for computation of densely connected subnetworks,
        density threshold θ = 0.5.
Guideline                   Introduction                    Methods                                   Experimental Results




                             Classifier Construction
                                                                                           G4
                                                       G1
                                              0.95                                 0.9
                                                                   0.85                         0.7
                                                       0.75            G3        G5
     1. Rank density constrained            G2                                                         G6
        biclusters according to density          0.8
                                                               0.9
                                                                                   0.85
        significance                                     G4                                      0.95
                                                                                      G7
     2. Keep only high-ranked
                                              Gene 1        1.25
        subnetworks with little overlap       Gene 2         1.5
     3. Feature space dimension =             Gene 3         1.0
                                                                                    Marker 1 1.25
                                              Gene 4        1.25       Average
        number of markers                     Gene 5         0.5
                                                                                    Marker 2 0.5

     4. SVM classification                     Gene 6         0.0
                                              Gene 7        0.25

                                           Gene Expression Profile          Average Gene Expression Profile
Guideline   Introduction              Methods   Experimental Results




                      Experimental Results
Guideline                    Introduction                  Methods                                                         Experimental Results




                                            Network Data


       Confidence-scored PPI network
     [STRING, von Mering et al., NAR 2009]


      • Edges reflect physical
        protein-protein interactions
      • Confidence scores reflect the
        probability that the interaction is                             0.95
                                                                                    0.6        0.8
                                                                                                            0.9
        associated with a cellular                                  0.45

                                                                           0.75
                                                                                        0.85
                                                                                               0.9
                                                                                                     0.25           0.9
                                                            0.7
        phenomenon (and not an                      0.8          0.55
                                                                    0.95
                                                                                    0.5 0.95
                                                                                                 0.75
                                                                                                            0.85
                                                                                                                    0.95
        experimental artifact)                               0.45
                                                                           0.35 0.65
                                                                                             0.8
                                                                                        0.75 0.8
                                                                                                        0.9
                                                                                                0.9          0.7
                                                                  0.3             0.8

      • Scoring system based on KEGG                  0.65
                                                          0.75          0.8
                                                                           0.9

                                                                                        0.9
                                                                                                0.85
                                                                                                             0.95

        pathways
Guideline                    Introduction              Methods                 Experimental Results




                              Gene Expression Data


                              Three experiments on colon cancer

            • GSE8671, 32 patients / tissue pairs
            • GSE10950, 24 patients / tissue pairs
            • GSE6988, 123 samples across several cancer subtypes

                                One experiment on breast cancer

            • GSE3494, 251 patients with different mutation status (wildtype vs.
              mutant)
Guideline                     Introduction                       Methods                  Experimental Results




                             GSE 8671 −→ GSE 10950

                                             GSE8671 >> GSE10950
                    1


                  0.95


                   0.9
            AUC




                  0.85


                   0.8                                           SGM
                                                                 GMI
                                                            NETCOVER
                                                                wDCB
                  0.75
                         0    5     10       15   20   25   30    35       40   45   50
                                              # Subnetworks/Genes
Guideline                    Introduction                       Methods                  Experimental Results




            GSE 8671 −→ GSE 6988 - Colon Cancer

                                            GSE8671 >> GSE6988
                    1

                  0.95

                   0.9

                  0.85
            AUC




                   0.8

                  0.75

                   0.7                                          SGM
                                                                GMI
                  0.65                                     NETCOVER
                                                               wDCB
                   0.6
                         0   5     10       15   20   25   30    35       40   45   50
                                             #Subnetworks/Genes
Guideline                    Introduction                       Methods                  Experimental Results




            GSE 8671 −→ GSE 6988 - Colon Cancer

                                            GSE8671 >> GSE6988
                    1

                  0.95

                   0.9

                  0.85
            AUC




                   0.8

                  0.75

                   0.7                                          SGM
                                                                GMI
                  0.65                                     NETCOVER
                                                               wDCB
                   0.6
                         0   5     10       15   20   25   30    35       40   45   50
                                             #Subnetworks/Genes
Guideline    Introduction   Methods    Experimental Results




            GSE 3494 - Breast Cancer
Guideline                    Introduction               Methods               Experimental Results




                     Subnetwork Marker Statistics


                                     Avg AUC                             Avg AUC
                    #      ER-50 6988 10950             #      ER-50 6988 8671
             GMI   806      0.38   0.86    0.95        755      0.34   0.84   0.99
             NC    923      0.12   0.87    0.99        N/A      N/A    0.86    N/A
            wDCB   282      0.76   0.91    1.00        216      0.74   0.91   1.00
                          8671 Subnetworks                   10950 Subnetworks

                          GMI = Greedy Mutual Information (Chuang et al.)
                                 NC = NetCover (Chowdhury et al.)
                         wDCB = weighted Density Constrained Biclustering
                            # = total number of subnetworks computed
                           ER-50 = enrichment rate of the top-50 markers
Guideline   Introduction        Methods                 Experimental Results




                     Top Marker 8671


                                          • DNA replication
                                            initiation
                                          • DNA metabolic
                                            process
                                          • TP53, BRCA1: tumor
                                            suppressor genes
                                          • Minichromosome
                                            maintenance (MCM)
                                            complex
                                          • Protein kinase CDC7
                                            phosphorylates
                                            MCM2
Guideline   Introduction       Methods                 Experimental Results




                   Top Marker 10950




                                         • Nukleotide excision
                                         • DNA clamp (PCNA)
                                           loader activity
                                         • Polymorphisms in
                                           WRN ↔ colon cancer
                                         • DNMT1: methyl
                                           transferase, silences
                                           cell growth repressors
Guideline                 Introduction             Methods                 Experimental Results




                                         Future Works



     1. Comparison subnetwork signatures of different cancers or subtypes of a
        particular cancer
     2. Extend the interaction network with for example ncRNA-protein interactions
     3. Redesign novel methods to work with real valued continuous phenotype
        variables
Guideline   Introduction             Methods   Experimental Results




                  Thanks for the attention!

More Related Content

Similar to Eccb

Whole Exome Sequencing .pptx
Whole Exome Sequencing .pptxWhole Exome Sequencing .pptx
Whole Exome Sequencing .pptx
Karthikeyans636697
 
GENETIC BASIS OF PSYCHIATRIC DISRODERS AND THE RELEVANCE OF CLINICAL PRACTICE
 GENETIC BASIS OF  PSYCHIATRIC DISRODERS AND THE RELEVANCE OF CLINICAL  PRACTICE GENETIC BASIS OF  PSYCHIATRIC DISRODERS AND THE RELEVANCE OF CLINICAL  PRACTICE
GENETIC BASIS OF PSYCHIATRIC DISRODERS AND THE RELEVANCE OF CLINICAL PRACTICE
PRASHNATH javali
 
metagenomics.pptx
metagenomics.pptxmetagenomics.pptx
metagenomics.pptx
priyadharshinip21bbt
 
Gene hunting strategies
Gene hunting strategiesGene hunting strategies
Gene hunting strategies
Ashfaq Ahmad
 
Genomics In Personal Care Product Development
Genomics In Personal Care Product DevelopmentGenomics In Personal Care Product Development
Genomics In Personal Care Product Development
Genemarkers
 
Exome sequencing for disease gene identification and patient diagnostics, Gen...
Exome sequencing for disease gene identification and patient diagnostics, Gen...Exome sequencing for disease gene identification and patient diagnostics, Gen...
Exome sequencing for disease gene identification and patient diagnostics, Gen...
Copenhagenomics
 
New drug discovery
New drug discoveryNew drug discovery
New drug discovery
RashadKafeel
 
Genomics
GenomicsGenomics
Genomics
Komal Rajgire
 
Concept of genomics, proteomics and metabolomics
Concept of genomics, proteomics and metabolomicsConcept of genomics, proteomics and metabolomics
Concept of genomics, proteomics and metabolomics
Muragendraswami Astagimath
 
Prashanth_Seminar.pptx
Prashanth_Seminar.pptxPrashanth_Seminar.pptx
Prashanth_Seminar.pptx
prashanthbabu31
 
Functional genomics,Pharmaco genomics, and Meta genomics.
Functional genomics,Pharmaco genomics, and Meta genomics.Functional genomics,Pharmaco genomics, and Meta genomics.
Functional genomics,Pharmaco genomics, and Meta genomics.
sangeeta jadav
 
Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformat...
Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformat...Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformat...
Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformat...
VHIR Vall d’Hebron Institut de Recerca
 
Advances and Applications Enabled by Single Cell Technology
Advances and Applications Enabled by Single Cell TechnologyAdvances and Applications Enabled by Single Cell Technology
Advances and Applications Enabled by Single Cell Technology
QIAGEN
 
microarrary
microarrarymicroarrary
microarrary
Dr Nisha Singh
 
Novel network pharmacology methods for drug mechanism of action identificatio...
Novel network pharmacology methods for drug mechanism of action identificatio...Novel network pharmacology methods for drug mechanism of action identificatio...
Novel network pharmacology methods for drug mechanism of action identificatio...
laserxiong
 
Seftah DNA fingerprint 2007NEW.ppt
Seftah DNA fingerprint 2007NEW.pptSeftah DNA fingerprint 2007NEW.ppt
Seftah DNA fingerprint 2007NEW.ppt
SamerPaser
 
Fornsic medicine dna barcoading
Fornsic medicine dna barcoadingFornsic medicine dna barcoading
Fornsic medicine dna barcoading
University Of Wuerzburg,Germany
 
Session 4 part 1
Session 4 part 1Session 4 part 1
Session 4 part 1
plmiami
 
NGS in Clinical Research: Meet the NGS Experts Series Part 1
NGS in Clinical Research: Meet the NGS Experts Series Part 1NGS in Clinical Research: Meet the NGS Experts Series Part 1
NGS in Clinical Research: Meet the NGS Experts Series Part 1
QIAGEN
 
Types of genomics ppt
Types of genomics pptTypes of genomics ppt
Types of genomics ppt
Hina Zamir Noori
 

Similar to Eccb (20)

Whole Exome Sequencing .pptx
Whole Exome Sequencing .pptxWhole Exome Sequencing .pptx
Whole Exome Sequencing .pptx
 
GENETIC BASIS OF PSYCHIATRIC DISRODERS AND THE RELEVANCE OF CLINICAL PRACTICE
 GENETIC BASIS OF  PSYCHIATRIC DISRODERS AND THE RELEVANCE OF CLINICAL  PRACTICE GENETIC BASIS OF  PSYCHIATRIC DISRODERS AND THE RELEVANCE OF CLINICAL  PRACTICE
GENETIC BASIS OF PSYCHIATRIC DISRODERS AND THE RELEVANCE OF CLINICAL PRACTICE
 
metagenomics.pptx
metagenomics.pptxmetagenomics.pptx
metagenomics.pptx
 
Gene hunting strategies
Gene hunting strategiesGene hunting strategies
Gene hunting strategies
 
Genomics In Personal Care Product Development
Genomics In Personal Care Product DevelopmentGenomics In Personal Care Product Development
Genomics In Personal Care Product Development
 
Exome sequencing for disease gene identification and patient diagnostics, Gen...
Exome sequencing for disease gene identification and patient diagnostics, Gen...Exome sequencing for disease gene identification and patient diagnostics, Gen...
Exome sequencing for disease gene identification and patient diagnostics, Gen...
 
New drug discovery
New drug discoveryNew drug discovery
New drug discovery
 
Genomics
GenomicsGenomics
Genomics
 
Concept of genomics, proteomics and metabolomics
Concept of genomics, proteomics and metabolomicsConcept of genomics, proteomics and metabolomics
Concept of genomics, proteomics and metabolomics
 
Prashanth_Seminar.pptx
Prashanth_Seminar.pptxPrashanth_Seminar.pptx
Prashanth_Seminar.pptx
 
Functional genomics,Pharmaco genomics, and Meta genomics.
Functional genomics,Pharmaco genomics, and Meta genomics.Functional genomics,Pharmaco genomics, and Meta genomics.
Functional genomics,Pharmaco genomics, and Meta genomics.
 
Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformat...
Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformat...Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformat...
Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformat...
 
Advances and Applications Enabled by Single Cell Technology
Advances and Applications Enabled by Single Cell TechnologyAdvances and Applications Enabled by Single Cell Technology
Advances and Applications Enabled by Single Cell Technology
 
microarrary
microarrarymicroarrary
microarrary
 
Novel network pharmacology methods for drug mechanism of action identificatio...
Novel network pharmacology methods for drug mechanism of action identificatio...Novel network pharmacology methods for drug mechanism of action identificatio...
Novel network pharmacology methods for drug mechanism of action identificatio...
 
Seftah DNA fingerprint 2007NEW.ppt
Seftah DNA fingerprint 2007NEW.pptSeftah DNA fingerprint 2007NEW.ppt
Seftah DNA fingerprint 2007NEW.ppt
 
Fornsic medicine dna barcoading
Fornsic medicine dna barcoadingFornsic medicine dna barcoading
Fornsic medicine dna barcoading
 
Session 4 part 1
Session 4 part 1Session 4 part 1
Session 4 part 1
 
NGS in Clinical Research: Meet the NGS Experts Series Part 1
NGS in Clinical Research: Meet the NGS Experts Series Part 1NGS in Clinical Research: Meet the NGS Experts Series Part 1
NGS in Clinical Research: Meet the NGS Experts Series Part 1
 
Types of genomics ppt
Types of genomics pptTypes of genomics ppt
Types of genomics ppt
 

Recently uploaded

Your Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective UpskillingYour Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Excellence Foundation for South Sudan
 
How to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP ModuleHow to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP Module
Celine George
 
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
GeorgeMilliken2
 
How to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRMHow to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRM
Celine George
 
A Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdfA Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdf
Jean Carlos Nunes Paixão
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
Nguyen Thanh Tu Collection
 
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
National Information Standards Organization (NISO)
 
The simplified electron and muon model, Oscillating Spacetime: The Foundation...
The simplified electron and muon model, Oscillating Spacetime: The Foundation...The simplified electron and muon model, Oscillating Spacetime: The Foundation...
The simplified electron and muon model, Oscillating Spacetime: The Foundation...
RitikBhardwaj56
 
Pengantar Penggunaan Flutter - Dart programming language1.pptx
Pengantar Penggunaan Flutter - Dart programming language1.pptxPengantar Penggunaan Flutter - Dart programming language1.pptx
Pengantar Penggunaan Flutter - Dart programming language1.pptx
Fajar Baskoro
 
Hindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdfHindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdf
Dr. Mulla Adam Ali
 
BBR 2024 Summer Sessions Interview Training
BBR  2024 Summer Sessions Interview TrainingBBR  2024 Summer Sessions Interview Training
BBR 2024 Summer Sessions Interview Training
Katrina Pritchard
 
How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17
Celine George
 
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Dr. Vinod Kumar Kanvaria
 
Digital Artifact 1 - 10VCD Environments Unit
Digital Artifact 1 - 10VCD Environments UnitDigital Artifact 1 - 10VCD Environments Unit
Digital Artifact 1 - 10VCD Environments Unit
chanes7
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
tarandeep35
 
MARY JANE WILSON, A “BOA MÃE” .
MARY JANE WILSON, A “BOA MÃE”           .MARY JANE WILSON, A “BOA MÃE”           .
MARY JANE WILSON, A “BOA MÃE” .
Colégio Santa Teresinha
 
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptxChapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 
Smart-Money for SMC traders good time and ICT
Smart-Money for SMC traders good time and ICTSmart-Money for SMC traders good time and ICT
Smart-Money for SMC traders good time and ICT
simonomuemu
 
The basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptxThe basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptx
heathfieldcps1
 
How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17
Celine George
 

Recently uploaded (20)

Your Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective UpskillingYour Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective Upskilling
 
How to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP ModuleHow to Add Chatter in the odoo 17 ERP Module
How to Add Chatter in the odoo 17 ERP Module
 
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
What is Digital Literacy? A guest blog from Andy McLaughlin, University of Ab...
 
How to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRMHow to Manage Your Lost Opportunities in Odoo 17 CRM
How to Manage Your Lost Opportunities in Odoo 17 CRM
 
A Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdfA Independência da América Espanhola LAPBOOK.pdf
A Independência da América Espanhola LAPBOOK.pdf
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 CẢ NĂM - GLOBAL SUCCESS - NĂM HỌC 2023-2024 (CÓ FI...
 
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
Pollock and Snow "DEIA in the Scholarly Landscape, Session One: Setting Expec...
 
The simplified electron and muon model, Oscillating Spacetime: The Foundation...
The simplified electron and muon model, Oscillating Spacetime: The Foundation...The simplified electron and muon model, Oscillating Spacetime: The Foundation...
The simplified electron and muon model, Oscillating Spacetime: The Foundation...
 
Pengantar Penggunaan Flutter - Dart programming language1.pptx
Pengantar Penggunaan Flutter - Dart programming language1.pptxPengantar Penggunaan Flutter - Dart programming language1.pptx
Pengantar Penggunaan Flutter - Dart programming language1.pptx
 
Hindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdfHindi varnamala | hindi alphabet PPT.pdf
Hindi varnamala | hindi alphabet PPT.pdf
 
BBR 2024 Summer Sessions Interview Training
BBR  2024 Summer Sessions Interview TrainingBBR  2024 Summer Sessions Interview Training
BBR 2024 Summer Sessions Interview Training
 
How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17How to Fix the Import Error in the Odoo 17
How to Fix the Import Error in the Odoo 17
 
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...
 
Digital Artifact 1 - 10VCD Environments Unit
Digital Artifact 1 - 10VCD Environments UnitDigital Artifact 1 - 10VCD Environments Unit
Digital Artifact 1 - 10VCD Environments Unit
 
S1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptxS1-Introduction-Biopesticides in ICM.pptx
S1-Introduction-Biopesticides in ICM.pptx
 
MARY JANE WILSON, A “BOA MÃE” .
MARY JANE WILSON, A “BOA MÃE”           .MARY JANE WILSON, A “BOA MÃE”           .
MARY JANE WILSON, A “BOA MÃE” .
 
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptxChapter 4 - Islamic Financial Institutions in Malaysia.pptx
Chapter 4 - Islamic Financial Institutions in Malaysia.pptx
 
Smart-Money for SMC traders good time and ICT
Smart-Money for SMC traders good time and ICTSmart-Money for SMC traders good time and ICT
Smart-Money for SMC traders good time and ICT
 
The basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptxThe basics of sentences session 6pptx.pptx
The basics of sentences session 6pptx.pptx
 
How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17How to Make a Field Mandatory in Odoo 17
How to Make a Field Mandatory in Odoo 17
 

Eccb

  • 1. Guideline Introduction Methods Experimental Results Inferring Cancer Subnetwork Markers using Density-Constrained Biclustering Presenters: Phuong Dao1 , Alexander Schonhuth2 1 School of Computing Science, Simon Fraser University 2 Algorithmic Computational Biology Group, CWI, Netherlands
  • 2. Guideline Introduction Methods Experimental Results Guideline Introduction Personalized Medicine Biomarker Discovery Methods Motivations Our approach Experimental Results Data Classifier Performance Markers
  • 3. Guideline Introduction Methods Experimental Results Introduction Personalized Medicine • Exact determination of disease status based on patient genetics/genomics • Goal: Specific, individual choice of treatment
  • 4. Guideline Introduction Methods Experimental Results Introduction Personalized Medicine • Exact determination of disease status based on patient genetics/genomics • Goal: Specific, individual choice of treatment • Necessary: Reliable disease markers
  • 5. Guideline Introduction Methods Experimental Results Biomarker Discovery • Single gene markers: Each gene is ranked according to their ability to distinguish samples of different classes • Multigenic markers: Each subset S of genes is ranked based on the aggregation ability of all genes in S to distinguish samples of different classes
  • 6. Guideline Introduction Methods Experimental Results Single Gene Markers Control 1 Control 2 Control 3 Case 1 Case 2 Case 3 Control 1 Control 2 Control 3 Case 1 Case 2 Case 3 Gene 1 Gene 3 Gene 1 Gene 2 Differentially Expressed Gene 3 Gene 4 Gene 2 Gene 5 Gene 4 Gene 6 Gene 5 Gene 6 Non−Differentially Expressed
  • 7. Guideline Introduction Methods Experimental Results Multigenic Markers Subnetwork Markers [Chuang et al., Mol.Sys.Biol. (2007)]: • Predicting progression of breast cancer • Subnetwork markers are connected subnetworks with aggregate expression profiles correlates the most with the labels of the samples • Greedy heuristics for searching for optimal subnetwork markers
  • 8. Guideline Introduction Methods Experimental Results Multigenic Markers Subnetwork Markers [Chowdhury et al., PSB 2010]: • Predicting colon cancer subtypes • Each marker is a small connected subnetwork N such that genes in N cover all disease samples (gene g covers sample s if g is differentially expressed in s) • Greedy heuristics for searching for the smallest subnetwork markers
  • 9. Guideline Introduction Methods Experimental Results Motivations Heterogeneity of Cancer Genomes • Cancer genomes evolve (many cells in one patient have different genomes) • No two cancer cells of two different patients are the same [Hampton et al., Genome Research (2009)]
  • 10. Guideline Introduction Methods Experimental Results Motivations Proximity of Disease Related Genes in PPI Network [Goh et al., PNAS (2007)]: • The protein products of genes related to the same disease tend to interact with one another • Genes related to a disease have coherent functions with respect to the Gene Ontology hierarchy
  • 11. Guideline Introduction Methods Experimental Results Our Approach Each of our subnetwork markers: • includes genes that have higher interaction among them than expected (densely connected subnetworks) • contains differentially expressed genes in a fraction of all the samples from cancer tissues (partially differential expression)
  • 12. Guideline Introduction Methods Experimental Results Methods
  • 13. Guideline Introduction Methods Experimental Results Densely Connected Subnetworks Properties Let G = (V , E) be a network with edge weights we , e ∈ E. • The density θ(G) of G is e∈E we θ(G) := |V | 2 |V | where 2 is the number of possible edges in G. • G is called α-dense if θ(G) ≥ α. • An α-dense, connected network G is called α-densely connected.
  • 14. Guideline Introduction Methods Experimental Results Partially Differential Expression S1 S2 S3 G1 0.95 0.6 0.8 0.95 0.85 G1 1 1 0 0.9 0.75 G3 0.45 0.85 G2 1 1 1 G2 0.75 0.25 0.9 0.8 G3 1 1 0 0.7 0.9 0.9 G4 1 1 1 0.55 0.5 0.95 G4 0.8 0.85 0.95 0.75 0.95 0.35 0.65 G4 0.45 0.8 0.9 S1 S2 S3 0.75 0.8 0.9 0.7 0.3 0.8 0.9 0.9 0.7 G4 1 1 1 0.65 0.85 0.8 0.9 0.95 G5 G6 G5 0 1 1 0.75 G6 0 1 1 0.85 0 1 1 0.95 G7 G7 Compute all densely connected subnetworks whose genes are differentially expressed in a subset of patients of size at least k (here: k = 2).
  • 15. Guideline Introduction Methods Experimental Results Density Constrained Biclustering Search Strategy Theorem: Let α ≥ 0.5. Every α-densely connected network of size n contains an α-densely connected subnetwork of size n − 1. 0.4 A 0.6 A 0.9 A C 0.8 D C B C D B B D C 0.6 A 0.6 A 0.9 A 0.8 D 0.4 0.6 B A C 0.4 C 0.9 D 0.4 B 0.9 B D 0.8 B C 0.8 D Density: 0.45 = [(0.8 + 0.9 + 0.6 + 0.4) / 6] C Not Dense wDCB 0.4 0.6 B A 0.9 0.8 Not Connected D maximal wDCB Figure: Toy example for computation of densely connected subnetworks, density threshold θ = 0.5.
  • 16. Guideline Introduction Methods Experimental Results Classifier Construction G4 G1 0.95 0.9 0.85 0.7 0.75 G3 G5 1. Rank density constrained G2 G6 biclusters according to density 0.8 0.9 0.85 significance G4 0.95 G7 2. Keep only high-ranked Gene 1 1.25 subnetworks with little overlap Gene 2 1.5 3. Feature space dimension = Gene 3 1.0 Marker 1 1.25 Gene 4 1.25 Average number of markers Gene 5 0.5 Marker 2 0.5 4. SVM classification Gene 6 0.0 Gene 7 0.25 Gene Expression Profile Average Gene Expression Profile
  • 17. Guideline Introduction Methods Experimental Results Experimental Results
  • 18. Guideline Introduction Methods Experimental Results Network Data Confidence-scored PPI network [STRING, von Mering et al., NAR 2009] • Edges reflect physical protein-protein interactions • Confidence scores reflect the probability that the interaction is 0.95 0.6 0.8 0.9 associated with a cellular 0.45 0.75 0.85 0.9 0.25 0.9 0.7 phenomenon (and not an 0.8 0.55 0.95 0.5 0.95 0.75 0.85 0.95 experimental artifact) 0.45 0.35 0.65 0.8 0.75 0.8 0.9 0.9 0.7 0.3 0.8 • Scoring system based on KEGG 0.65 0.75 0.8 0.9 0.9 0.85 0.95 pathways
  • 19. Guideline Introduction Methods Experimental Results Gene Expression Data Three experiments on colon cancer • GSE8671, 32 patients / tissue pairs • GSE10950, 24 patients / tissue pairs • GSE6988, 123 samples across several cancer subtypes One experiment on breast cancer • GSE3494, 251 patients with different mutation status (wildtype vs. mutant)
  • 20. Guideline Introduction Methods Experimental Results GSE 8671 −→ GSE 10950 GSE8671 >> GSE10950 1 0.95 0.9 AUC 0.85 0.8 SGM GMI NETCOVER wDCB 0.75 0 5 10 15 20 25 30 35 40 45 50 # Subnetworks/Genes
  • 21. Guideline Introduction Methods Experimental Results GSE 8671 −→ GSE 6988 - Colon Cancer GSE8671 >> GSE6988 1 0.95 0.9 0.85 AUC 0.8 0.75 0.7 SGM GMI 0.65 NETCOVER wDCB 0.6 0 5 10 15 20 25 30 35 40 45 50 #Subnetworks/Genes
  • 22. Guideline Introduction Methods Experimental Results GSE 8671 −→ GSE 6988 - Colon Cancer GSE8671 >> GSE6988 1 0.95 0.9 0.85 AUC 0.8 0.75 0.7 SGM GMI 0.65 NETCOVER wDCB 0.6 0 5 10 15 20 25 30 35 40 45 50 #Subnetworks/Genes
  • 23. Guideline Introduction Methods Experimental Results GSE 3494 - Breast Cancer
  • 24. Guideline Introduction Methods Experimental Results Subnetwork Marker Statistics Avg AUC Avg AUC # ER-50 6988 10950 # ER-50 6988 8671 GMI 806 0.38 0.86 0.95 755 0.34 0.84 0.99 NC 923 0.12 0.87 0.99 N/A N/A 0.86 N/A wDCB 282 0.76 0.91 1.00 216 0.74 0.91 1.00 8671 Subnetworks 10950 Subnetworks GMI = Greedy Mutual Information (Chuang et al.) NC = NetCover (Chowdhury et al.) wDCB = weighted Density Constrained Biclustering # = total number of subnetworks computed ER-50 = enrichment rate of the top-50 markers
  • 25. Guideline Introduction Methods Experimental Results Top Marker 8671 • DNA replication initiation • DNA metabolic process • TP53, BRCA1: tumor suppressor genes • Minichromosome maintenance (MCM) complex • Protein kinase CDC7 phosphorylates MCM2
  • 26. Guideline Introduction Methods Experimental Results Top Marker 10950 • Nukleotide excision • DNA clamp (PCNA) loader activity • Polymorphisms in WRN ↔ colon cancer • DNMT1: methyl transferase, silences cell growth repressors
  • 27. Guideline Introduction Methods Experimental Results Future Works 1. Comparison subnetwork signatures of different cancers or subtypes of a particular cancer 2. Extend the interaction network with for example ncRNA-protein interactions 3. Redesign novel methods to work with real valued continuous phenotype variables
  • 28. Guideline Introduction Methods Experimental Results Thanks for the attention!