SlideShare a Scribd company logo
Cancer hallmarks, “omic”
data, and data resources
Anthony Gitter
Cancer Bioinformatics (BMI 826/CS 838)
January 22, 2015
What computational analysis
contributes to cancer research
1. Predicting driver alterations
2. Defining properties of cancer (sub)types
3. Predicting prognosis and therapy
4. Integrating complementary data
5. Detecting affected pathways and processes
6. Explaining tumor heterogeneity
7. Detecting mutations and variants
8. Organizing, visualizing, and distributing data
Convergence of driver events
• Amid the complexity and heterogeneity, there is
some order
• Finite number of major pathways that are affected
by drivers
Hanahan2011
Vogelstein2013
Similar pathway effects
Vogelstein2013
• Tumor 1: EGFR receptor
mutation makes it
hypersensitive
• Tumor 2: KRAS
hyperactive
• Tumor 3: NF1 inactivated
and no longer modulates
KRAS
• Tumor 4: BRAF over
responsive to KRAS
signals
Detecting affected pathways
Ding2014
Pathway enrichment
DAVID
Pathway discovery
BioCarta EGF Signaling Pathway
Stimulate receptor
31% of pathway
is activated
98% of activity
is not covered
Phosphorylation data from Alejandro Wolf-Yadlin
Hallmarks of cancer
Hanahan2011
Sustaining proliferative signaling
• Cells receive signals from the local environment
telling them to grow (proliferate)
• Specialized receptors detect these signals
• Feedback in pathways carefully controls the
response to these signals
Evading growth suppressors
• Override tumor suppressor genes
• Some proteins control the cell’s decision to grow or
switch to an alternate track
• Apoptosis: programmed cell death
• Senescence: halt the cell cycle
• External or internal signals can affect these
decisions
Cell cycle
Biology of Cancer
Resisting cell death
• One self-defense mechanism against cancer
• Apoptosis triggers include:
• DNA damage sensors
• Limited survival cues
• Overactive signaling proteins
• Necrosis causes cells to explode
• Destroys a (pre)cancerous cell
• Releases chemicals that can promote growth in other
cells
O’Day
Enabling replicative immortality
• Cells typically have a limited number of divisions
• Immortalization: unlimited replicative potential
• Telomeres protect the ends of DNA
• Shorten over time
• Encode the number of cell divisions remaining
• Can be artificially upregulated in cancer
Patton2013
Telomere shortening
Wall Street Journal
Inducing angiogenesis
• Tumors must receive nutrients like other cells
• Certain proteins promote growth of blood vessels
LKT Laboratories
Activating invasion and metastasis
• Cancer progresses through the aforementioned
stages
• Epithelial-mesenchymal transition (EMT)
Emerging hallmarks
Hanahan2011
Genome instability and mutation
• Cancer cells mutate more frequently
• Increased sensitivity to mutagens
• Loss of telomeres increases copy number
alterations
Model systems in oncology
• Cell lines: Cells that reproduce in a lab indefinitely
(e.g. Hela cells)
• Genetically engineered mice: Manipulate mice to
make them predisposed to cancer
• Xenograft: Implant human tumor cells into mice
“Omic” data types
• DNA (genome)
• Mutations
• Copy number variation
• Other structural variation
• RNA expression (transcriptome)
• Gene expression (mRNA)
• Micro RNA expression (miRNA)
• Protein (proteome)
• Protein abundance
• Protein state (e.g. phosphorylation)
• Protein DNA binding
• DNA state and accessibility (epigenome)
• DNA methylation (methylome)
• Histone modification / chromatin marks
• DNase I hypersensitivity
“Next-generation” sequencing
(NGS)
• Revolutionized high-throughput data collection
• *-seq strategy
• Decide what you want to measure in cells
• Figure out how to select or synthesize the right DNA
• Dump it into a DNA sequencer
• ~100 different *-seq applications
NODAI
*-seq examples
Rizzo2012
Generating DNA templates
Rizzo2012
Generating reads
Rizzo2012
Assembly and alignment
Rizzo2012
Microarrays
• High-throughput measurement of gene expression,
protein DNA binding, etc.
• Mostly replaced by *-seq
• Fixed probes as opposed to DNA reads
Microarray quantification
University of Utah Wikipedia Wikimedia
DNA mutations
• Whole-exome most prevalent in cancer
• Only covers exons that form genes, less expensive
• Whole-genome becoming more widespread as
sequencing costs continue to decrease
DNA Link
Copy number variation
• Often represented as relative to normal 2 copies
• Ranges from a few bases to whole chromosomes
• Quantitative, not discrete, representation
MindSpec
Gene expression
• Transcript (messenger RNA) abundance
Graz
Appling lab
Genome-wide gene expression
• Quantitative state of the cell
1
35
…
…
5
Gene 1
Gene 2
Gene
20000
Brain
15
32
…
…
0
Heart
87
2
…
…
65
Blood (normal)
85
2
…
…
3
Blood (infected)
miRNA expression
• microRNA (miRNA)
• ~22 nucleotides
• Does not code for a protein
• Regulates gene expression levels by binding mRNA
NIH
Protein abundance
• Protein abundance is analogous to gene expression
• Not perfectly correlated with gene expression
• Harder to measure
• Mass spectrometry is almost proteome-wide
• Vaporize molecules
• Determine what was vaporized
based on mass/charge
David Darling
Protein state
• Chemical groups added to mature protein
• Phosphorylation is the most-studied
• Analogous to Boolean state
Pierce
Protein arrays
• Currently more common in cancer datasets
• Measure a limited number of specific proteins
using antibodies
• Protein abundance or state
R&D MD Anderson
Transcriptional regulation
• ChIP-seq directly measures transcription factor (TF)
binding but requires a matching antibody
• Various indirect strategies
Wang2012
Predicting regulator binding sites
• Motifs are signatures of
the DNA sequence
recognized by a TF
• TFs block DNA cleavage
• Combining accessible
DNA and DNA motifs
produces binding
predictions for hundreds
of TFs
Neph2012
DNA methylation
• Methylation is a DNA modification (state change)
• Hyper-methylation suppresses transcription
• Methylation almost always at C
Learn NC
Wikimedia
Clinical data
• Age, sex, cancer stage, survival
• Kaplan–Meier plot
Wikipedia
Large cancer datasets
• Tumors
• The Cancer Genome Atlas (TCGA)
• Broad Firehose and FireBrowse access to TCGA data
• International Cancer Genome Consortium (ICGC)
• Cell lines
• Cancer Cell Line Encyclopedia (CCLE)
• Catalogue of Somatic Mutations in Cancer (COSMIC)
• Cancer gene lists
• COSMIC Gene Census
• Vogelstein2013 drivers
Interactive tools for cancer data
• cBioPortal
• TumorPortal
• Cancer Regulome
• Cancer Genomics Browser
• StratomeX
Gene and protein information
• TP53 example
• GeneCards
• UniProt
• Entrez Gene
Pathway and function enrichment
• Database for Annotation, Visualization and
Integrated Discovery (DAVID)
• Molecular Signatures Database (MSigDB)
Gene expression data
• Gene Expression Omnibus (GEO)
• ArrayExpress
Protein interaction networks
• iRefIndex and iRefWeb
• Search Tool for the Retrieval of Interacting
Genes/Proteins (STRING)
• High-quality INTeractomes (HINT)
Transcriptional regulation
• Encyclopedia of DNA Elements (ENCODE)
• DNA binding motifs
• TRANSFAC
• JASPAR
• UniPROBE
miRNA binding
• miRBase
• TargetScan

More Related Content

Similar to CancerBioinformatics_DataTypesResources_012215.pptx

14 march seminar
14 march seminar14 march seminar
14 march seminar
Pooja Goswami
 
Big Data OncoAnalytics Architecture
Big Data OncoAnalytics ArchitectureBig Data OncoAnalytics Architecture
Big Data OncoAnalytics Architecture
Philippe Julio
 
Cancer Biology.pptx
Cancer  Biology.pptxCancer  Biology.pptx
Cancer Biology.pptx
Egizeru Enedalew
 
Cancer Biology For a Surgeon to understand
Cancer Biology For a Surgeon to understandCancer Biology For a Surgeon to understand
Cancer Biology For a Surgeon to understand
tgbk100
 
Flow cytometry
Flow cytometryFlow cytometry
Flow cytometry
Sreenivasa Reddy Thalla
 
NORMAL-CELLS-versus-CANCER-CELLS.pptxssss
NORMAL-CELLS-versus-CANCER-CELLS.pptxssssNORMAL-CELLS-versus-CANCER-CELLS.pptxssss
NORMAL-CELLS-versus-CANCER-CELLS.pptxssss
RashadHamada
 
Liquid Biopsy Overview, Challenges and New Solutions: Liquid Biopsy Series Pa...
Liquid Biopsy Overview, Challenges and New Solutions: Liquid Biopsy Series Pa...Liquid Biopsy Overview, Challenges and New Solutions: Liquid Biopsy Series Pa...
Liquid Biopsy Overview, Challenges and New Solutions: Liquid Biopsy Series Pa...
QIAGEN
 
Integrative analysis and visualization of clinical and molecular data for can...
Integrative analysis and visualization of clinical and molecular data for can...Integrative analysis and visualization of clinical and molecular data for can...
Integrative analysis and visualization of clinical and molecular data for can...
Lake Como School of Advanced Studies
 
VALIDATION OF NGS SEQUENCING BY SANGER SEQUENCING
VALIDATION OF NGS SEQUENCING BY SANGER SEQUENCINGVALIDATION OF NGS SEQUENCING BY SANGER SEQUENCING
VALIDATION OF NGS SEQUENCING BY SANGER SEQUENCING
NARRANAGAPAVANKUMAR
 
Cancer genome
Cancer genomeCancer genome
Cancer genome
Kundan Singh
 
Introduction to epigenetics and study design
Introduction to epigenetics and study designIntroduction to epigenetics and study design
Introduction to epigenetics and study design
amlbinder
 
Liquid biopsy
Liquid biopsyLiquid biopsy
Liquid biopsy
Dr. Shashank Agrawal
 
Molecular techniques for pathology research - MDX .pdf
Molecular techniques for pathology research - MDX .pdfMolecular techniques for pathology research - MDX .pdf
Molecular techniques for pathology research - MDX .pdf
sabyabby
 
Genomics seminar
Genomics seminarGenomics seminar
Genomics seminar
S Rasouli
 
Kloning Gen.ppt
Kloning Gen.pptKloning Gen.ppt
Kloning Gen.ppt
TatangShabur1
 
Bioinformatics tools for development, analysis, and preclinical testing of in...
Bioinformatics tools for development, analysis, and preclinical testing of in...Bioinformatics tools for development, analysis, and preclinical testing of in...
Bioinformatics tools for development, analysis, and preclinical testing of in...
Malachi Griffith
 
Bea lehming memorial lectures cacs - washington dc 11-15-2014
Bea lehming memorial lectures   cacs - washington dc 11-15-2014Bea lehming memorial lectures   cacs - washington dc 11-15-2014
Bea lehming memorial lectures cacs - washington dc 11-15-2014
CACSNETS
 
Advances and Applications Enabled by Single Cell Technology
Advances and Applications Enabled by Single Cell TechnologyAdvances and Applications Enabled by Single Cell Technology
Advances and Applications Enabled by Single Cell Technology
QIAGEN
 
LIQUID BIOPSY.pptx
LIQUID  BIOPSY.pptxLIQUID  BIOPSY.pptx
LIQUID BIOPSY.pptx
SeenaSusanItty1
 
Ian Day Edited Presentation
Ian Day Edited PresentationIan Day Edited Presentation
Ian Day Edited Presentation
Bath & Bristol Enterprise Network
 

Similar to CancerBioinformatics_DataTypesResources_012215.pptx (20)

14 march seminar
14 march seminar14 march seminar
14 march seminar
 
Big Data OncoAnalytics Architecture
Big Data OncoAnalytics ArchitectureBig Data OncoAnalytics Architecture
Big Data OncoAnalytics Architecture
 
Cancer Biology.pptx
Cancer  Biology.pptxCancer  Biology.pptx
Cancer Biology.pptx
 
Cancer Biology For a Surgeon to understand
Cancer Biology For a Surgeon to understandCancer Biology For a Surgeon to understand
Cancer Biology For a Surgeon to understand
 
Flow cytometry
Flow cytometryFlow cytometry
Flow cytometry
 
NORMAL-CELLS-versus-CANCER-CELLS.pptxssss
NORMAL-CELLS-versus-CANCER-CELLS.pptxssssNORMAL-CELLS-versus-CANCER-CELLS.pptxssss
NORMAL-CELLS-versus-CANCER-CELLS.pptxssss
 
Liquid Biopsy Overview, Challenges and New Solutions: Liquid Biopsy Series Pa...
Liquid Biopsy Overview, Challenges and New Solutions: Liquid Biopsy Series Pa...Liquid Biopsy Overview, Challenges and New Solutions: Liquid Biopsy Series Pa...
Liquid Biopsy Overview, Challenges and New Solutions: Liquid Biopsy Series Pa...
 
Integrative analysis and visualization of clinical and molecular data for can...
Integrative analysis and visualization of clinical and molecular data for can...Integrative analysis and visualization of clinical and molecular data for can...
Integrative analysis and visualization of clinical and molecular data for can...
 
VALIDATION OF NGS SEQUENCING BY SANGER SEQUENCING
VALIDATION OF NGS SEQUENCING BY SANGER SEQUENCINGVALIDATION OF NGS SEQUENCING BY SANGER SEQUENCING
VALIDATION OF NGS SEQUENCING BY SANGER SEQUENCING
 
Cancer genome
Cancer genomeCancer genome
Cancer genome
 
Introduction to epigenetics and study design
Introduction to epigenetics and study designIntroduction to epigenetics and study design
Introduction to epigenetics and study design
 
Liquid biopsy
Liquid biopsyLiquid biopsy
Liquid biopsy
 
Molecular techniques for pathology research - MDX .pdf
Molecular techniques for pathology research - MDX .pdfMolecular techniques for pathology research - MDX .pdf
Molecular techniques for pathology research - MDX .pdf
 
Genomics seminar
Genomics seminarGenomics seminar
Genomics seminar
 
Kloning Gen.ppt
Kloning Gen.pptKloning Gen.ppt
Kloning Gen.ppt
 
Bioinformatics tools for development, analysis, and preclinical testing of in...
Bioinformatics tools for development, analysis, and preclinical testing of in...Bioinformatics tools for development, analysis, and preclinical testing of in...
Bioinformatics tools for development, analysis, and preclinical testing of in...
 
Bea lehming memorial lectures cacs - washington dc 11-15-2014
Bea lehming memorial lectures   cacs - washington dc 11-15-2014Bea lehming memorial lectures   cacs - washington dc 11-15-2014
Bea lehming memorial lectures cacs - washington dc 11-15-2014
 
Advances and Applications Enabled by Single Cell Technology
Advances and Applications Enabled by Single Cell TechnologyAdvances and Applications Enabled by Single Cell Technology
Advances and Applications Enabled by Single Cell Technology
 
LIQUID BIOPSY.pptx
LIQUID  BIOPSY.pptxLIQUID  BIOPSY.pptx
LIQUID BIOPSY.pptx
 
Ian Day Edited Presentation
Ian Day Edited PresentationIan Day Edited Presentation
Ian Day Edited Presentation
 

Recently uploaded

Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
IshaGoswami9
 
Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.
Aditi Bajpai
 
aziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobelaziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobel
İsa Badur
 
Cytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptxCytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptx
Hitesh Sikarwar
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
University of Maribor
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
TinyAnderson
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
yqqaatn0
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills MN
 
SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
KrushnaDarade1
 
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
AbdullaAlAsif1
 
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
vluwdy49
 
NuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyerNuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyer
pablovgd
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
HongcNguyn6
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
David Osipyan
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
Sérgio Sacani
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
Gokturk Mehmet Dilci
 
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero WaterSharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Texas Alliance of Groundwater Districts
 
Bob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdfBob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdf
Texas Alliance of Groundwater Districts
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
kejapriya1
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
PRIYANKA PATEL
 

Recently uploaded (20)

Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
 
Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.
 
aziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobelaziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobel
 
Cytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptxCytokines and their role in immune regulation.pptx
Cytokines and their role in immune regulation.pptx
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
 
SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
 
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
 
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
 
NuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyerNuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyer
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
 
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
3D Hybrid PIC simulation of the plasma expansion (ISSS-14)
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
 
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero WaterSharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
 
Bob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdfBob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdf
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
 

CancerBioinformatics_DataTypesResources_012215.pptx

  • 1. Cancer hallmarks, “omic” data, and data resources Anthony Gitter Cancer Bioinformatics (BMI 826/CS 838) January 22, 2015
  • 2. What computational analysis contributes to cancer research 1. Predicting driver alterations 2. Defining properties of cancer (sub)types 3. Predicting prognosis and therapy 4. Integrating complementary data 5. Detecting affected pathways and processes 6. Explaining tumor heterogeneity 7. Detecting mutations and variants 8. Organizing, visualizing, and distributing data
  • 3. Convergence of driver events • Amid the complexity and heterogeneity, there is some order • Finite number of major pathways that are affected by drivers Hanahan2011 Vogelstein2013
  • 4. Similar pathway effects Vogelstein2013 • Tumor 1: EGFR receptor mutation makes it hypersensitive • Tumor 2: KRAS hyperactive • Tumor 3: NF1 inactivated and no longer modulates KRAS • Tumor 4: BRAF over responsive to KRAS signals
  • 7. Pathway discovery BioCarta EGF Signaling Pathway Stimulate receptor 31% of pathway is activated 98% of activity is not covered Phosphorylation data from Alejandro Wolf-Yadlin
  • 9. Sustaining proliferative signaling • Cells receive signals from the local environment telling them to grow (proliferate) • Specialized receptors detect these signals • Feedback in pathways carefully controls the response to these signals
  • 10. Evading growth suppressors • Override tumor suppressor genes • Some proteins control the cell’s decision to grow or switch to an alternate track • Apoptosis: programmed cell death • Senescence: halt the cell cycle • External or internal signals can affect these decisions
  • 12. Resisting cell death • One self-defense mechanism against cancer • Apoptosis triggers include: • DNA damage sensors • Limited survival cues • Overactive signaling proteins • Necrosis causes cells to explode • Destroys a (pre)cancerous cell • Releases chemicals that can promote growth in other cells O’Day
  • 13. Enabling replicative immortality • Cells typically have a limited number of divisions • Immortalization: unlimited replicative potential • Telomeres protect the ends of DNA • Shorten over time • Encode the number of cell divisions remaining • Can be artificially upregulated in cancer Patton2013
  • 15. Inducing angiogenesis • Tumors must receive nutrients like other cells • Certain proteins promote growth of blood vessels LKT Laboratories
  • 16. Activating invasion and metastasis • Cancer progresses through the aforementioned stages • Epithelial-mesenchymal transition (EMT)
  • 18. Genome instability and mutation • Cancer cells mutate more frequently • Increased sensitivity to mutagens • Loss of telomeres increases copy number alterations
  • 19. Model systems in oncology • Cell lines: Cells that reproduce in a lab indefinitely (e.g. Hela cells) • Genetically engineered mice: Manipulate mice to make them predisposed to cancer • Xenograft: Implant human tumor cells into mice
  • 20. “Omic” data types • DNA (genome) • Mutations • Copy number variation • Other structural variation • RNA expression (transcriptome) • Gene expression (mRNA) • Micro RNA expression (miRNA) • Protein (proteome) • Protein abundance • Protein state (e.g. phosphorylation) • Protein DNA binding • DNA state and accessibility (epigenome) • DNA methylation (methylome) • Histone modification / chromatin marks • DNase I hypersensitivity
  • 21. “Next-generation” sequencing (NGS) • Revolutionized high-throughput data collection • *-seq strategy • Decide what you want to measure in cells • Figure out how to select or synthesize the right DNA • Dump it into a DNA sequencer • ~100 different *-seq applications NODAI
  • 26. Microarrays • High-throughput measurement of gene expression, protein DNA binding, etc. • Mostly replaced by *-seq • Fixed probes as opposed to DNA reads
  • 27. Microarray quantification University of Utah Wikipedia Wikimedia
  • 28. DNA mutations • Whole-exome most prevalent in cancer • Only covers exons that form genes, less expensive • Whole-genome becoming more widespread as sequencing costs continue to decrease DNA Link
  • 29. Copy number variation • Often represented as relative to normal 2 copies • Ranges from a few bases to whole chromosomes • Quantitative, not discrete, representation MindSpec
  • 30. Gene expression • Transcript (messenger RNA) abundance Graz Appling lab
  • 31. Genome-wide gene expression • Quantitative state of the cell 1 35 … … 5 Gene 1 Gene 2 Gene 20000 Brain 15 32 … … 0 Heart 87 2 … … 65 Blood (normal) 85 2 … … 3 Blood (infected)
  • 32. miRNA expression • microRNA (miRNA) • ~22 nucleotides • Does not code for a protein • Regulates gene expression levels by binding mRNA NIH
  • 33. Protein abundance • Protein abundance is analogous to gene expression • Not perfectly correlated with gene expression • Harder to measure • Mass spectrometry is almost proteome-wide • Vaporize molecules • Determine what was vaporized based on mass/charge David Darling
  • 34. Protein state • Chemical groups added to mature protein • Phosphorylation is the most-studied • Analogous to Boolean state Pierce
  • 35. Protein arrays • Currently more common in cancer datasets • Measure a limited number of specific proteins using antibodies • Protein abundance or state R&D MD Anderson
  • 36. Transcriptional regulation • ChIP-seq directly measures transcription factor (TF) binding but requires a matching antibody • Various indirect strategies Wang2012
  • 37. Predicting regulator binding sites • Motifs are signatures of the DNA sequence recognized by a TF • TFs block DNA cleavage • Combining accessible DNA and DNA motifs produces binding predictions for hundreds of TFs Neph2012
  • 38. DNA methylation • Methylation is a DNA modification (state change) • Hyper-methylation suppresses transcription • Methylation almost always at C Learn NC Wikimedia
  • 39. Clinical data • Age, sex, cancer stage, survival • Kaplan–Meier plot Wikipedia
  • 40. Large cancer datasets • Tumors • The Cancer Genome Atlas (TCGA) • Broad Firehose and FireBrowse access to TCGA data • International Cancer Genome Consortium (ICGC) • Cell lines • Cancer Cell Line Encyclopedia (CCLE) • Catalogue of Somatic Mutations in Cancer (COSMIC) • Cancer gene lists • COSMIC Gene Census • Vogelstein2013 drivers
  • 41. Interactive tools for cancer data • cBioPortal • TumorPortal • Cancer Regulome • Cancer Genomics Browser • StratomeX
  • 42. Gene and protein information • TP53 example • GeneCards • UniProt • Entrez Gene
  • 43. Pathway and function enrichment • Database for Annotation, Visualization and Integrated Discovery (DAVID) • Molecular Signatures Database (MSigDB)
  • 44. Gene expression data • Gene Expression Omnibus (GEO) • ArrayExpress
  • 45. Protein interaction networks • iRefIndex and iRefWeb • Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) • High-quality INTeractomes (HINT)
  • 46. Transcriptional regulation • Encyclopedia of DNA Elements (ENCODE) • DNA binding motifs • TRANSFAC • JASPAR • UniPROBE