SlideShare a Scribd company logo
1 of 37
Low Duplicability and Network Fragility of Cancer Genes Davide Rambaldi
Background and Aim of the Project high heterogeneity and high number (~600) of genes mutated in cancer Identification of Systems-level properties Better understanding of the genetic determinants of cancer progression  Identification of candidate cancer genes
Choice of Systems-level properties Genomic Duplicability Tendency to retain conserved and/or recent duplicates Network topology Position of the protein in a protein-protein interaction network Duplicability (Zhang, 2006) (Sun, 2006) Network connectivity (Wu, 2005) ( Prachumwat, 2006)   fragility  (Veitia, 2002)
~ 600 genes mutated in cancer ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Detection of Genomic Duplicates reference set  N=349 benchmark set  N=254 reference set 83.68% 16.32% benchmark set 10.3% 89.7% reference set benchmark set
Example of duplicable gene:  rara RARA -  RETINOIC ACID RECEPTOR ALPHA First duplication:  Coverage  68% Second duplication:  Coverage  65% Best Hit:  Coverage  99% Spurious Hit:  Coverage  9%
Do Cancer and CAN-genes duplicate more or less than the rest of human genes? Reference Set Benchmark Set Comparison to other human genes 83.7% Singletons 16.3% Duplicable genes % 89.7% Singletons 10.3% Duplicable genes %
Comparison to other human genes Human genes = 24.202
Genes mutated in cancer tend to duplicate less than other human genes Reference Set Benchmark Set
Is this really a systems-level property? Human genes = 24.202
Genes mutated in cancer duplicate less than other human genes with the same functional distribution
From Genomes to Network ,[object Object],[object Object],[object Object],Does this apply also for cancer genes? Duplicability Network connectivity fragility
Human Interaction Network ,[object Object],[object Object],[object Object],[object Object],[object Object],154/254 Benchmark set 24% Duplicable proteins 304/349 Reference set 76% Singletons 34564 edges (interactions) 9264 nodes (proteins)
Resulting Network
Network Analysis Global Topology DEGREE (d) Measure of connectivity of each node CLUSTERING COEFFICIENT (cc) Measure of interconnectivity of each node d=4 cc=0 d=4 cc=0.3
global topology Scale free network : few nodes with many connections, many nodes with few connections (Barabási and Albert,1999)
How do cancer genes behave in the network? Duplicability Network connectivity fragility
Global topology of singleton and duplicable proteins In the entire network, singletons proteins are  less  connected than duplicable proteins but have an  higher  clustering coefficient P < 0.0001 (Wilcoxon Test) P = 0.0163 (Wilcoxon Test) singleton duplicable
Global topology of cancer proteins P < 0.0001 (Wilcoxon Test) P < 0.0001 (Wilcoxon Test) Unlike most singletons, proteins mutated in cancer are  more  connected than other proteins and have an  higher  clustering coefficient singleton cancer
Local Topology Measure the enrichment of subgraphs in the network  ,[object Object],[object Object],[object Object],We analyzed 3-nodes and 4-nodes subgraphs 3-nodes 4-nodes
Local Topology of the entire network The human network  is enriched  in the most interconnected subgraphs.
Local topology of duplicable and singleton proteins No significant difference  between singleton and duplicable proteins in the network motifs.
Local topology of cancer and CAN-proteins
Summary Singletons are  less  connected but  more  interconnected than duplicable proteins Cancer genes, mainly singletons, code for  protein HUBS of highly interconnected modules   of the human network Singletons and duplicable proteins  are equally  represented in the network motifs BUT In the entire network:
Data interpretation ,[object Object],[object Object],[object Object],[object Object],[object Object],~94% of the entire network ~6% of the entire network
Data interpretation Duplicability Network connectivity fragility candidates ,[object Object],[object Object],[object Object],[object Object],[object Object]
Possible candidates 101 singletons genes with >20 connections and cc>0.1 Significantly enriched in Gene Ontology terms related to cancer
Network of candidate cancer genes
Network of Cancer genes (developed by Federico Giorgi) http://bio.ifom-ieo-campus.it/ncg/
Many thanks to … Ciccarelli Group Francesca Ciccarelli Anna DeGrassi Federico Giorgi Matteo Dantonio Ciliberto Group Andrea Ciliberto Fabrizio Capuani Romilde Manzoni Federico Vaggi And all the bioinfo crew … Statistics Giovanni d’Ario Lara Lusa IT support Davide Cittaro
[object Object],[object Object],[object Object],[object Object],Duplicates definition: 60% coverage
RARA and NR2C2 RARA NR2C2
A singleton gene: FEV FEV -  ETS oncogene family (coverage= 100%  identity= 100% ) (coverage= 35%  identity= 86% )
Changing threshold Changing the threshold of 10% doesn’t change the results: our observation are independent from the chosen coverage threshold value
Is this signal real? EXIST A CORRELATION BETWEEN  CONNECTIVITY  IN HPRD AND  ABSTRACTS  IN PUBMED? HOW IS THE  CONNECTIVITY  OF CANCER PROTEINS USING ONLY INTERACTIONS COMING FROM  HIGH-THROUGHPUT  EXPERIMENTS? HPRD is a database based on literature:  is it biased towards well-studied genes?  (… and cancer genes are among them)
Network Randomization Real Network Edges Randomization
Network of Cancer genes: public access to our data (developed by F.M. Giorgi)

More Related Content

What's hot

Level of Tumor Protein Indicates Chances Cancer Will Spread AND Malfunctionin...
Level of Tumor Protein Indicates Chances Cancer Will Spread AND Malfunctionin...Level of Tumor Protein Indicates Chances Cancer Will Spread AND Malfunctionin...
Level of Tumor Protein Indicates Chances Cancer Will Spread AND Malfunctionin...espontanea
 
Genomics seminar
Genomics seminarGenomics seminar
Genomics seminarS Rasouli
 
NetBioSIG2013-Talk Thomas Kelder
NetBioSIG2013-Talk Thomas KelderNetBioSIG2013-Talk Thomas Kelder
NetBioSIG2013-Talk Thomas KelderAlexander Pico
 
NetBioSIG2014-Talk by David Amar
NetBioSIG2014-Talk by David AmarNetBioSIG2014-Talk by David Amar
NetBioSIG2014-Talk by David AmarAlexander Pico
 
DNA Technology
DNA TechnologyDNA Technology
DNA Technologymgsonline
 
Human genome project
Human genome projectHuman genome project
Human genome projectmah neem mah
 
NGS in cancer treatment
NGS in cancer treatmentNGS in cancer treatment
NGS in cancer treatmentNur Suhaida
 
Microbial Metagenomics Drives a New Cyberinfrastructure
Microbial Metagenomics Drives a New CyberinfrastructureMicrobial Metagenomics Drives a New Cyberinfrastructure
Microbial Metagenomics Drives a New CyberinfrastructureLarry Smarr
 
High-Throughput Sequencing
High-Throughput SequencingHigh-Throughput Sequencing
High-Throughput SequencingMark Pallen
 
Kurrey_et_al-2009-STEM_CELLS
Kurrey_et_al-2009-STEM_CELLSKurrey_et_al-2009-STEM_CELLS
Kurrey_et_al-2009-STEM_CELLSSwati Jalgaonkar
 
Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14mhaendel
 
Cancer genome databases & Ecological databases
Cancer genome databases & Ecological databases Cancer genome databases & Ecological databases
Cancer genome databases & Ecological databases Waliullah Wali
 
Reconstruction and analysis of cancerspecific Gene regulatory networks from G...
Reconstruction and analysis of cancerspecific Gene regulatory networks from G...Reconstruction and analysis of cancerspecific Gene regulatory networks from G...
Reconstruction and analysis of cancerspecific Gene regulatory networks from G...ijbbjournal
 
Bioinformatics as a tool for understanding clinically significant variations ...
Bioinformatics as a tool for understanding clinically significant variations ...Bioinformatics as a tool for understanding clinically significant variations ...
Bioinformatics as a tool for understanding clinically significant variations ...Despoina Kalfakakou
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicshemantbreeder
 

What's hot (20)

Level of Tumor Protein Indicates Chances Cancer Will Spread AND Malfunctionin...
Level of Tumor Protein Indicates Chances Cancer Will Spread AND Malfunctionin...Level of Tumor Protein Indicates Chances Cancer Will Spread AND Malfunctionin...
Level of Tumor Protein Indicates Chances Cancer Will Spread AND Malfunctionin...
 
Genomics seminar
Genomics seminarGenomics seminar
Genomics seminar
 
NetBioSIG2013-Talk Thomas Kelder
NetBioSIG2013-Talk Thomas KelderNetBioSIG2013-Talk Thomas Kelder
NetBioSIG2013-Talk Thomas Kelder
 
Genomics
GenomicsGenomics
Genomics
 
NetBioSIG2014-Talk by David Amar
NetBioSIG2014-Talk by David AmarNetBioSIG2014-Talk by David Amar
NetBioSIG2014-Talk by David Amar
 
Cancer genome
Cancer genomeCancer genome
Cancer genome
 
DNA Technology
DNA TechnologyDNA Technology
DNA Technology
 
Human genome project
Human genome projectHuman genome project
Human genome project
 
Cancer and CNV
Cancer and CNVCancer and CNV
Cancer and CNV
 
NGS in cancer treatment
NGS in cancer treatmentNGS in cancer treatment
NGS in cancer treatment
 
Microbial Metagenomics Drives a New Cyberinfrastructure
Microbial Metagenomics Drives a New CyberinfrastructureMicrobial Metagenomics Drives a New Cyberinfrastructure
Microbial Metagenomics Drives a New Cyberinfrastructure
 
High-Throughput Sequencing
High-Throughput SequencingHigh-Throughput Sequencing
High-Throughput Sequencing
 
Kurrey_et_al-2009-STEM_CELLS
Kurrey_et_al-2009-STEM_CELLSKurrey_et_al-2009-STEM_CELLS
Kurrey_et_al-2009-STEM_CELLS
 
Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14Haendel clingenetics.3.14.14
Haendel clingenetics.3.14.14
 
Cancer genome databases & Ecological databases
Cancer genome databases & Ecological databases Cancer genome databases & Ecological databases
Cancer genome databases & Ecological databases
 
Reconstruction and analysis of cancerspecific Gene regulatory networks from G...
Reconstruction and analysis of cancerspecific Gene regulatory networks from G...Reconstruction and analysis of cancerspecific Gene regulatory networks from G...
Reconstruction and analysis of cancerspecific Gene regulatory networks from G...
 
Bioinformatics as a tool for understanding clinically significant variations ...
Bioinformatics as a tool for understanding clinically significant variations ...Bioinformatics as a tool for understanding clinically significant variations ...
Bioinformatics as a tool for understanding clinically significant variations ...
 
20140711 5 s_pond_ercc2.0_workshop
20140711 5 s_pond_ercc2.0_workshop20140711 5 s_pond_ercc2.0_workshop
20140711 5 s_pond_ercc2.0_workshop
 
Genetic mapping
Genetic mappingGenetic mapping
Genetic mapping
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 

Viewers also liked

Laboratorio Probabilidad 1/3
Laboratorio Probabilidad 1/3Laboratorio Probabilidad 1/3
Laboratorio Probabilidad 1/3cbpresentaciones
 
Linguaggio R, principi e concetti
Linguaggio R, principi e concettiLinguaggio R, principi e concetti
Linguaggio R, principi e concettiVincenzo De Maio
 
Elisa Teodoro, Clase 5, Funciones
Elisa Teodoro, Clase 5, FuncionesElisa Teodoro, Clase 5, Funciones
Elisa Teodoro, Clase 5, FuncionesInfoUdo.com.ve
 
Ruby es un lenguaje de programación interpretado
Ruby es un lenguaje de programación interpretadoRuby es un lenguaje de programación interpretado
Ruby es un lenguaje de programación interpretadoYulgrecia2011
 
Scheduling power-aware abstract
Scheduling power-aware abstractScheduling power-aware abstract
Scheduling power-aware abstractVincenzo De Maio
 
Introduzione a R
Introduzione a RIntroduzione a R
Introduzione a RMCalderisi
 
Elisa Teodoro, Aplicacion de Derivadas, Clase 2
Elisa Teodoro, Aplicacion de Derivadas, Clase 2Elisa Teodoro, Aplicacion de Derivadas, Clase 2
Elisa Teodoro, Aplicacion de Derivadas, Clase 2InfoUdo.com.ve
 
Introduction to R by David Lucy Cap 12-16
Introduction to R by David Lucy Cap 12-16Introduction to R by David Lucy Cap 12-16
Introduction to R by David Lucy Cap 12-16Luis Pons
 
ECUACIONES DIFERENCIALES CON DERIVE
ECUACIONES DIFERENCIALES CON DERIVEECUACIONES DIFERENCIALES CON DERIVE
ECUACIONES DIFERENCIALES CON DERIVEJorge Paz
 
Ejercicios resueltos en r
Ejercicios resueltos en rEjercicios resueltos en r
Ejercicios resueltos en rzasque11
 
An introduction to structural equation models in R using the Lavaan package
An introduction to structural equation models in R using the Lavaan packageAn introduction to structural equation models in R using the Lavaan package
An introduction to structural equation models in R using the Lavaan packageNoam Ross
 
LENGUAJE DE PROGRAMACION R
LENGUAJE DE PROGRAMACION RLENGUAJE DE PROGRAMACION R
LENGUAJE DE PROGRAMACION RShirley Armas
 
Narrative codes
Narrative codesNarrative codes
Narrative codesATith
 

Viewers also liked (20)

R Vectors
R VectorsR Vectors
R Vectors
 
R Graphics
R GraphicsR Graphics
R Graphics
 
Laboratorio Probabilidad 1/3
Laboratorio Probabilidad 1/3Laboratorio Probabilidad 1/3
Laboratorio Probabilidad 1/3
 
Linguaggio R, principi e concetti
Linguaggio R, principi e concettiLinguaggio R, principi e concetti
Linguaggio R, principi e concetti
 
R_note_ODE_ver1.0
R_note_ODE_ver1.0R_note_ODE_ver1.0
R_note_ODE_ver1.0
 
Elisa Teodoro, Clase 5, Funciones
Elisa Teodoro, Clase 5, FuncionesElisa Teodoro, Clase 5, Funciones
Elisa Teodoro, Clase 5, Funciones
 
Ruby es un lenguaje de programación interpretado
Ruby es un lenguaje de programación interpretadoRuby es un lenguaje de programación interpretado
Ruby es un lenguaje de programación interpretado
 
Scheduling power-aware abstract
Scheduling power-aware abstractScheduling power-aware abstract
Scheduling power-aware abstract
 
Introduzione a R
Introduzione a RIntroduzione a R
Introduzione a R
 
Elisa Teodoro, Aplicacion de Derivadas, Clase 2
Elisa Teodoro, Aplicacion de Derivadas, Clase 2Elisa Teodoro, Aplicacion de Derivadas, Clase 2
Elisa Teodoro, Aplicacion de Derivadas, Clase 2
 
Programacion en R
Programacion en RProgramacion en R
Programacion en R
 
Abstract tesi
Abstract tesiAbstract tesi
Abstract tesi
 
Introduction to R by David Lucy Cap 12-16
Introduction to R by David Lucy Cap 12-16Introduction to R by David Lucy Cap 12-16
Introduction to R by David Lucy Cap 12-16
 
ECUACIONES DIFERENCIALES CON DERIVE
ECUACIONES DIFERENCIALES CON DERIVEECUACIONES DIFERENCIALES CON DERIVE
ECUACIONES DIFERENCIALES CON DERIVE
 
9 introduzione r
9   introduzione r9   introduzione r
9 introduzione r
 
Ejercicios resueltos en r
Ejercicios resueltos en rEjercicios resueltos en r
Ejercicios resueltos en r
 
An introduction to structural equation models in R using the Lavaan package
An introduction to structural equation models in R using the Lavaan packageAn introduction to structural equation models in R using the Lavaan package
An introduction to structural equation models in R using the Lavaan package
 
Apuntes de prácticas de DERIVE
Apuntes de prácticas de DERIVEApuntes de prácticas de DERIVE
Apuntes de prácticas de DERIVE
 
LENGUAJE DE PROGRAMACION R
LENGUAJE DE PROGRAMACION RLENGUAJE DE PROGRAMACION R
LENGUAJE DE PROGRAMACION R
 
Narrative codes
Narrative codesNarrative codes
Narrative codes
 

Similar to PhD midterm report

OKC Grand Rounds 2009
OKC Grand Rounds 2009OKC Grand Rounds 2009
OKC Grand Rounds 2009Sean Davis
 
Genomics Technologies
Genomics TechnologiesGenomics Technologies
Genomics TechnologiesSean Davis
 
Bioinformatics as a tool for understanding carcinogenesis
Bioinformatics as a tool for understanding carcinogenesisBioinformatics as a tool for understanding carcinogenesis
Bioinformatics as a tool for understanding carcinogenesisDespoina Kalfakakou
 
Developing a framework for for detection of low frequency somatic genetic alt...
Developing a framework for for detection of low frequency somatic genetic alt...Developing a framework for for detection of low frequency somatic genetic alt...
Developing a framework for for detection of low frequency somatic genetic alt...Ronak Shah
 
Bioinformatic Analysis of Synthetic Lethality in Breast Cancer
Bioinformatic Analysis of Synthetic Lethality in Breast CancerBioinformatic Analysis of Synthetic Lethality in Breast Cancer
Bioinformatic Analysis of Synthetic Lethality in Breast CancerTom Kelly
 
Efficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
Efficiency of Using Sequence Discovery for Polymorphism in DNA SequenceEfficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
Efficiency of Using Sequence Discovery for Polymorphism in DNA SequenceIJSTA
 
Visual Exploration of Clinical and Genomic Data for Patient Stratification
Visual Exploration of Clinical and Genomic Data for Patient StratificationVisual Exploration of Clinical and Genomic Data for Patient Stratification
Visual Exploration of Clinical and Genomic Data for Patient StratificationNils Gehlenborg
 
Prediction of protein function
Prediction of protein functionPrediction of protein function
Prediction of protein functionLars Juhl Jensen
 
STRING - Prediction of a functional association network for the yeast mitocho...
STRING - Prediction of a functional association network for the yeast mitocho...STRING - Prediction of a functional association network for the yeast mitocho...
STRING - Prediction of a functional association network for the yeast mitocho...Lars Juhl Jensen
 
How to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical informationHow to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical informationJoaquin Dopazo
 
A New Generation Of Mechanism-Based Biomarkers For The Clinic
A New Generation Of Mechanism-Based Biomarkers For The ClinicA New Generation Of Mechanism-Based Biomarkers For The Clinic
A New Generation Of Mechanism-Based Biomarkers For The ClinicJoaquin Dopazo
 
From reads to pathways for efficient disease gene finding
From reads to pathways for efficient disease gene findingFrom reads to pathways for efficient disease gene finding
From reads to pathways for efficient disease gene findingJoaquin Dopazo
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Ian Foster
 
GENOME DATA ANALYSIS
GENOME DATA ANALYSISGENOME DATA ANALYSIS
GENOME DATA ANALYSISAmeldaAkoijam
 
Metagenomics and it’s applications
Metagenomics and it’s applicationsMetagenomics and it’s applications
Metagenomics and it’s applicationsSham Sadiq
 
CAMERA Presentation at KNAW ICoMM Colloquium May 2008
CAMERA Presentation at KNAW ICoMM Colloquium May 2008CAMERA Presentation at KNAW ICoMM Colloquium May 2008
CAMERA Presentation at KNAW ICoMM Colloquium May 2008Saul Kravitz
 
Pre-clinical drug prioritization via prognosis-guided genetic interaction net...
Pre-clinical drug prioritization via prognosis-guided genetic interaction net...Pre-clinical drug prioritization via prognosis-guided genetic interaction net...
Pre-clinical drug prioritization via prognosis-guided genetic interaction net...laserxiong
 
Next Generation Sequencing
Next Generation SequencingNext Generation Sequencing
Next Generation SequencingShelomi Karoon
 
metagenomicsanditsapplications-161222180924.pdf
metagenomicsanditsapplications-161222180924.pdfmetagenomicsanditsapplications-161222180924.pdf
metagenomicsanditsapplications-161222180924.pdfVisheshMishra20
 

Similar to PhD midterm report (20)

OKC Grand Rounds 2009
OKC Grand Rounds 2009OKC Grand Rounds 2009
OKC Grand Rounds 2009
 
Genomics Technologies
Genomics TechnologiesGenomics Technologies
Genomics Technologies
 
10.1.1.80.2149
10.1.1.80.214910.1.1.80.2149
10.1.1.80.2149
 
Bioinformatics as a tool for understanding carcinogenesis
Bioinformatics as a tool for understanding carcinogenesisBioinformatics as a tool for understanding carcinogenesis
Bioinformatics as a tool for understanding carcinogenesis
 
Developing a framework for for detection of low frequency somatic genetic alt...
Developing a framework for for detection of low frequency somatic genetic alt...Developing a framework for for detection of low frequency somatic genetic alt...
Developing a framework for for detection of low frequency somatic genetic alt...
 
Bioinformatic Analysis of Synthetic Lethality in Breast Cancer
Bioinformatic Analysis of Synthetic Lethality in Breast CancerBioinformatic Analysis of Synthetic Lethality in Breast Cancer
Bioinformatic Analysis of Synthetic Lethality in Breast Cancer
 
Efficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
Efficiency of Using Sequence Discovery for Polymorphism in DNA SequenceEfficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
Efficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
 
Visual Exploration of Clinical and Genomic Data for Patient Stratification
Visual Exploration of Clinical and Genomic Data for Patient StratificationVisual Exploration of Clinical and Genomic Data for Patient Stratification
Visual Exploration of Clinical and Genomic Data for Patient Stratification
 
Prediction of protein function
Prediction of protein functionPrediction of protein function
Prediction of protein function
 
STRING - Prediction of a functional association network for the yeast mitocho...
STRING - Prediction of a functional association network for the yeast mitocho...STRING - Prediction of a functional association network for the yeast mitocho...
STRING - Prediction of a functional association network for the yeast mitocho...
 
How to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical informationHow to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical information
 
A New Generation Of Mechanism-Based Biomarkers For The Clinic
A New Generation Of Mechanism-Based Biomarkers For The ClinicA New Generation Of Mechanism-Based Biomarkers For The Clinic
A New Generation Of Mechanism-Based Biomarkers For The Clinic
 
From reads to pathways for efficient disease gene finding
From reads to pathways for efficient disease gene findingFrom reads to pathways for efficient disease gene finding
From reads to pathways for efficient disease gene finding
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009
 
GENOME DATA ANALYSIS
GENOME DATA ANALYSISGENOME DATA ANALYSIS
GENOME DATA ANALYSIS
 
Metagenomics and it’s applications
Metagenomics and it’s applicationsMetagenomics and it’s applications
Metagenomics and it’s applications
 
CAMERA Presentation at KNAW ICoMM Colloquium May 2008
CAMERA Presentation at KNAW ICoMM Colloquium May 2008CAMERA Presentation at KNAW ICoMM Colloquium May 2008
CAMERA Presentation at KNAW ICoMM Colloquium May 2008
 
Pre-clinical drug prioritization via prognosis-guided genetic interaction net...
Pre-clinical drug prioritization via prognosis-guided genetic interaction net...Pre-clinical drug prioritization via prognosis-guided genetic interaction net...
Pre-clinical drug prioritization via prognosis-guided genetic interaction net...
 
Next Generation Sequencing
Next Generation SequencingNext Generation Sequencing
Next Generation Sequencing
 
metagenomicsanditsapplications-161222180924.pdf
metagenomicsanditsapplications-161222180924.pdfmetagenomicsanditsapplications-161222180924.pdf
metagenomicsanditsapplications-161222180924.pdf
 

Recently uploaded

SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 

Recently uploaded (20)

SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 

PhD midterm report

  • 1. Low Duplicability and Network Fragility of Cancer Genes Davide Rambaldi
  • 2. Background and Aim of the Project high heterogeneity and high number (~600) of genes mutated in cancer Identification of Systems-level properties Better understanding of the genetic determinants of cancer progression Identification of candidate cancer genes
  • 3. Choice of Systems-level properties Genomic Duplicability Tendency to retain conserved and/or recent duplicates Network topology Position of the protein in a protein-protein interaction network Duplicability (Zhang, 2006) (Sun, 2006) Network connectivity (Wu, 2005) ( Prachumwat, 2006) fragility (Veitia, 2002)
  • 4.
  • 5. Detection of Genomic Duplicates reference set N=349 benchmark set N=254 reference set 83.68% 16.32% benchmark set 10.3% 89.7% reference set benchmark set
  • 6. Example of duplicable gene: rara RARA - RETINOIC ACID RECEPTOR ALPHA First duplication: Coverage 68% Second duplication: Coverage 65% Best Hit: Coverage 99% Spurious Hit: Coverage 9%
  • 7. Do Cancer and CAN-genes duplicate more or less than the rest of human genes? Reference Set Benchmark Set Comparison to other human genes 83.7% Singletons 16.3% Duplicable genes % 89.7% Singletons 10.3% Duplicable genes %
  • 8. Comparison to other human genes Human genes = 24.202
  • 9. Genes mutated in cancer tend to duplicate less than other human genes Reference Set Benchmark Set
  • 10. Is this really a systems-level property? Human genes = 24.202
  • 11. Genes mutated in cancer duplicate less than other human genes with the same functional distribution
  • 12.
  • 13.
  • 15. Network Analysis Global Topology DEGREE (d) Measure of connectivity of each node CLUSTERING COEFFICIENT (cc) Measure of interconnectivity of each node d=4 cc=0 d=4 cc=0.3
  • 16. global topology Scale free network : few nodes with many connections, many nodes with few connections (Barabási and Albert,1999)
  • 17. How do cancer genes behave in the network? Duplicability Network connectivity fragility
  • 18. Global topology of singleton and duplicable proteins In the entire network, singletons proteins are less connected than duplicable proteins but have an higher clustering coefficient P < 0.0001 (Wilcoxon Test) P = 0.0163 (Wilcoxon Test) singleton duplicable
  • 19. Global topology of cancer proteins P < 0.0001 (Wilcoxon Test) P < 0.0001 (Wilcoxon Test) Unlike most singletons, proteins mutated in cancer are more connected than other proteins and have an higher clustering coefficient singleton cancer
  • 20.
  • 21. Local Topology of the entire network The human network is enriched in the most interconnected subgraphs.
  • 22. Local topology of duplicable and singleton proteins No significant difference between singleton and duplicable proteins in the network motifs.
  • 23. Local topology of cancer and CAN-proteins
  • 24. Summary Singletons are less connected but more interconnected than duplicable proteins Cancer genes, mainly singletons, code for protein HUBS of highly interconnected modules of the human network Singletons and duplicable proteins are equally represented in the network motifs BUT In the entire network:
  • 25.
  • 26.
  • 27. Possible candidates 101 singletons genes with >20 connections and cc>0.1 Significantly enriched in Gene Ontology terms related to cancer
  • 28. Network of candidate cancer genes
  • 29. Network of Cancer genes (developed by Federico Giorgi) http://bio.ifom-ieo-campus.it/ncg/
  • 30. Many thanks to … Ciccarelli Group Francesca Ciccarelli Anna DeGrassi Federico Giorgi Matteo Dantonio Ciliberto Group Andrea Ciliberto Fabrizio Capuani Romilde Manzoni Federico Vaggi And all the bioinfo crew … Statistics Giovanni d’Ario Lara Lusa IT support Davide Cittaro
  • 31.
  • 32. RARA and NR2C2 RARA NR2C2
  • 33. A singleton gene: FEV FEV - ETS oncogene family (coverage= 100% identity= 100% ) (coverage= 35% identity= 86% )
  • 34. Changing threshold Changing the threshold of 10% doesn’t change the results: our observation are independent from the chosen coverage threshold value
  • 35. Is this signal real? EXIST A CORRELATION BETWEEN CONNECTIVITY IN HPRD AND ABSTRACTS IN PUBMED? HOW IS THE CONNECTIVITY OF CANCER PROTEINS USING ONLY INTERACTIONS COMING FROM HIGH-THROUGHPUT EXPERIMENTS? HPRD is a database based on literature: is it biased towards well-studied genes? (… and cancer genes are among them)
  • 36. Network Randomization Real Network Edges Randomization
  • 37. Network of Cancer genes: public access to our data (developed by F.M. Giorgi)

Editor's Notes

  1. Hello, my names is Davide Rambaldi and I work in the Bioinformatics and Evolutionary genomics of cancer group. I will present the results of my first 2 year of PhD. In this 2 years I focused on the analysis of human genes mutated in cancer and today I will talk of their properties at the genomic level and in the context of a protein-protein interaction network.