SlideShare a Scribd company logo
One gene – one transcript… mostly
Mar Gonzales Porta, Adam Frankish, Johan Rung,
Jennifer Harrow, Alvis Brazma
European Bioinformatics Institute
European Molecular Biology Laboratory
Wellcome Trust Sanger Institute
Analysis of RNA-seq data across different tissues
and cell lines reveals that the majority of genes have
a single dominant transcript
Mar Gonzàlez-Porta
Adam Frankish (Sanger)
Johan Rung (EBI)
Jennifer Harrow (Sanger)
Alvis Brazma (EBI)
To appear in Genome Biology on July 1, 2013
http://genomebiology.com/
aaaaCAP
ATG
ter
polypeptide
transcription
splicing
translation
Figure 1
Central dogma of molecular biolgoy
Gene
Transcript
Spliced
transcript
The number of genes in human genome
•  Estimates before the Human Genome Project
•  ~100,000 genes
•  After
•  ~21,000 genes
•  By comparison
•  Yeast ~6,000 genes
•  C. Elegance ~17,000
Central gogma revised – one gene many
transcritps and proteins via alternative
splicing
In human, there are 21,405 protein coding genes and
141,031 different isoforms (92,581 of which are protein
coding) annotated, 17,413 genes have >1 isoform
From RNA-seq and other recent experiments
•  Most human genes have more than one splice-form
expressed [Pan et al, Nat Genetics 2008], [Wang et al, Nat
Genetics 2008] [Mortazavi, Nat Methods 2008]	

•  Several isoforms per gene are often expressed to significant
levels either in the same cell type or across different [Wang
et al, Nat Genetics 2008], [Tang et al, Nat Methods 2009],
[Trapnell et al, Nat Biotechy 2010]	

•  Isoform expression is regulated [Waks et al, Mol Syst Biol
2011] but splicing can be noisy [Melamud, NAR 2009]
ENCODE – Nature, September 2012
•  ‘Isoform expression by a gene does not follow a
minimalistic expression strategy, resulting in a tendency
for genes to express many isoforms simultaneously’
•  ‘[…] alternative isoforms within a gene are not expressed
at similar levels, and one isoform dominates in a given
condition)’
•  Which is it then?
A fundamental question still remains – are
different isoforms of the same gene
expressed at similar levels?
Expression
level
Isoforms 1 2 3 4 5 6
?
Isoforms 1 2 3 4 5 6
We think this is a fundamental question related to the complexity
of transcritpome
Data and methods
Illumina Body Map	

16 tissues	

PE (80M)	

no replicates	

HiSeq 2000	

ENCODE	

5 cell lines	

PE (40M)	

technical replicates	

GAII	

•  2 different datasets: 46 samples	

•  Three different state-of-art tools for transcript quantification
(MISO, Cufflinks and mmseq)	

•  Direct evidence from splice junctions	

•  Simulated data – can the methods distinuish between the two
scenarios? 	

•  All approaches produced very consistent results
Most genes express one predominant
transcript
Expression
level
79% of cases 	

 56% of cases 	

2-fold dominance 5-fold dominance
Consistent across all studied tissues
What are the RPKM distributions of first,
second, … most abundant isoforms for all
genes?
Most genes express one predominant
transcript
Most genes express one predominant
transcript
We detect a total of 31,902 transcripts expressed above 1 FPKM in at least one
tissue and 26,641 of these are major transcripts (ratio 1.12)
There is just over one highly expressed
transcript per gene!
85% of transcriptome comes from the
dominant transcripts
Is it the same dominant transcript in all
tissues or different tissues tend to have
different dominant transcripts?
Major transcripts tend to be recurrent across
tissues
Switch events exist ~ 100 strong switches
that change the protein sequence
Some of the major transcripts are non-
canonical
AES
Do the dominant transcripts of protein coding
genes always code proteins?
•  Only in about 80% cases
•  However much more often in cytosol than in nucleolus
•  In nucleolus the retained intron is predominantly located
towards the 3’ end of the transcript
•  Is this because we extract mRNA before the splicing is
completed or can a retained intron be a form of expression
regulation?
Conclusions
•  Most genes express one predominant transcript over the
rest
•  ~85% of the mRNA pool comes from major transcripts
•  Major transcripts tend to be recurrent across samples,
switch events exist but only a small number of these are
likely to express different proteins
•  Despite the transcriptome complexity the central dogma
of molecular biology may be closer to the truth than
recently believed!
Mar Gonzales Porta
Funding
•  EMBL member countries
•  European Commission FP7 grants

More Related Content

What's hot

Cracking the code of life
Cracking the code of lifeCracking the code of life
Cracking the code of lifegmtrainor3
 
Genomics
GenomicsGenomics
Application of genetic mapping
Application of genetic mappingApplication of genetic mapping
Application of genetic mapping
banker smit
 
Genomics experimental-methods
Genomics experimental-methodsGenomics experimental-methods
Genomics experimental-methods
Prof. Wim Van Criekinge
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicshemantbreeder
 
Comparative and functional genomics
Comparative and functional genomicsComparative and functional genomics
Comparative and functional genomics
Jalormi Parekh
 
GENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSGENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICS
sandeshGM
 
Genomics seminar
Genomics seminarGenomics seminar
Genomics seminar
S Rasouli
 
Genes, Genomics and Proteomics
Genes, Genomics and Proteomics Genes, Genomics and Proteomics
Genes, Genomics and Proteomics
Garry D. Lasaga
 
Human Genome Project
Human Genome ProjectHuman Genome Project
Human Genome Project
Peyman Ghoraishizadeh
 
Genomics 101 jun 15 2012
Genomics 101 jun 15 2012Genomics 101 jun 15 2012
Genomics 101 jun 15 2012Genome Alberta
 
Genomics
GenomicsGenomics
Genomics
Amna Jalil
 
Non – coding dna presentation
Non – coding dna presentationNon – coding dna presentation
Non – coding dna presentation
Chaitrali Jadhav
 
Comparative genomics in eukaryotes, organelles
Comparative genomics in eukaryotes, organellesComparative genomics in eukaryotes, organelles
Comparative genomics in eukaryotes, organelles
KAUSHAL SAHU
 
Dna sequencing pp
Dna sequencing ppDna sequencing pp
Dna sequencing pplibs6359
 
Oncogene
OncogeneOncogene
Oncogene
Bhabesh Kalita
 
Genomics and bioinformatics
Genomics and bioinformatics Genomics and bioinformatics
Genomics and bioinformatics Senthil Natesan
 
Transcriptomics and metabolomics
Transcriptomics and metabolomicsTranscriptomics and metabolomics
Transcriptomics and metabolomics
Sukhjinder Singh
 

What's hot (20)

Cracking the code of life
Cracking the code of lifeCracking the code of life
Cracking the code of life
 
Genomics
GenomicsGenomics
Genomics
 
Application of genetic mapping
Application of genetic mappingApplication of genetic mapping
Application of genetic mapping
 
Genomics experimental-methods
Genomics experimental-methodsGenomics experimental-methods
Genomics experimental-methods
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Comparative and functional genomics
Comparative and functional genomicsComparative and functional genomics
Comparative and functional genomics
 
GENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICSGENOMICS AND BIOINFORMATICS
GENOMICS AND BIOINFORMATICS
 
Genomics seminar
Genomics seminarGenomics seminar
Genomics seminar
 
Genes, Genomics and Proteomics
Genes, Genomics and Proteomics Genes, Genomics and Proteomics
Genes, Genomics and Proteomics
 
Human Genome Project
Human Genome ProjectHuman Genome Project
Human Genome Project
 
Genomics 101 jun 15 2012
Genomics 101 jun 15 2012Genomics 101 jun 15 2012
Genomics 101 jun 15 2012
 
Genomics
GenomicsGenomics
Genomics
 
Non – coding dna presentation
Non – coding dna presentationNon – coding dna presentation
Non – coding dna presentation
 
Comparative genomics in eukaryotes, organelles
Comparative genomics in eukaryotes, organellesComparative genomics in eukaryotes, organelles
Comparative genomics in eukaryotes, organelles
 
Dna sequencing pp
Dna sequencing ppDna sequencing pp
Dna sequencing pp
 
Genomics
GenomicsGenomics
Genomics
 
Oncogene
OncogeneOncogene
Oncogene
 
Hgp info
Hgp infoHgp info
Hgp info
 
Genomics and bioinformatics
Genomics and bioinformatics Genomics and bioinformatics
Genomics and bioinformatics
 
Transcriptomics and metabolomics
Transcriptomics and metabolomicsTranscriptomics and metabolomics
Transcriptomics and metabolomics
 

Similar to Mar Gonzales Porta, One gene One transcript, fged_seattle_2013

Nuclear Genomes(Short Answers and questions)
Nuclear Genomes(Short Answers and questions)Nuclear Genomes(Short Answers and questions)
Nuclear Genomes(Short Answers and questions)
Zohaib HUSSAIN
 
2014 intro-genetics
2014 intro-genetics2014 intro-genetics
2014 intro-genetics
Andrew Hutabarat
 
IB Biology 3.1 Slides: Genes
IB Biology 3.1 Slides: GenesIB Biology 3.1 Slides: Genes
IB Biology 3.1 Slides: Genes
Jacob Cedarbaum
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
Jajati Keshari Nayak
 
Comparative transcriptomics
Comparative transcriptomicsComparative transcriptomics
Comparative transcriptomics
Sayak Ghosh
 
2014 whitney-research
2014 whitney-research2014 whitney-research
2014 whitney-researchc.titus.brown
 
Genome organization ,gene expression sand regulation
Genome organization ,gene expression sand regulation Genome organization ,gene expression sand regulation
Genome organization ,gene expression sand regulation
sukanyakk
 
Lecture bioinformatics Part2.next generation
Lecture bioinformatics Part2.next generationLecture bioinformatics Part2.next generation
Lecture bioinformatics Part2.next generation
MohamedHasan816582
 
Microbial Genetics
Microbial GeneticsMicrobial Genetics
Microbial Genetics
Roshni Mehta
 
Microbial Genetics
Microbial GeneticsMicrobial Genetics
Microbial Genetics
Roshni Mehta
 
Microbial Genetics
Microbial GeneticsMicrobial Genetics
Microbial Genetics
Roshni Mehta
 
Chapter 7 genome structure, chromatin, and the nucleosome (1)
Chapter 7   genome structure, chromatin, and the nucleosome (1)Chapter 7   genome structure, chromatin, and the nucleosome (1)
Chapter 7 genome structure, chromatin, and the nucleosome (1)Roger Mendez
 
GENETICS IN ORTHODONTICS WORD.docx
GENETICS IN ORTHODONTICS WORD.docxGENETICS IN ORTHODONTICS WORD.docx
GENETICS IN ORTHODONTICS WORD.docx
sidharth779721
 
Terminology related to molecular biology and genetics
Terminology related to molecular biology and geneticsTerminology related to molecular biology and genetics
Terminology related to molecular biology and genetics
enamifat
 
Pertemuan 1. introduction
Pertemuan 1. introductionPertemuan 1. introduction
Pertemuan 1. introduction
Suryati Purba
 
Human genome project (2) converted
Human genome project (2) convertedHuman genome project (2) converted
Human genome project (2) converted
GAnchal
 
Content of the genome
Content of the genomeContent of the genome
Content of the genome
Kiran Modi
 
genetic variation
genetic variationgenetic variation
genetic variation
Nawfal Aldujaily
 
Sci cafe humangenome&health
Sci cafe humangenome&healthSci cafe humangenome&health
Sci cafe humangenome&health
Toby Rossman
 

Similar to Mar Gonzales Porta, One gene One transcript, fged_seattle_2013 (20)

Nuclear Genomes(Short Answers and questions)
Nuclear Genomes(Short Answers and questions)Nuclear Genomes(Short Answers and questions)
Nuclear Genomes(Short Answers and questions)
 
2014 intro-genetics
2014 intro-genetics2014 intro-genetics
2014 intro-genetics
 
IB Biology 3.1 Slides: Genes
IB Biology 3.1 Slides: GenesIB Biology 3.1 Slides: Genes
IB Biology 3.1 Slides: Genes
 
Comparitive genomics
Comparitive genomicsComparitive genomics
Comparitive genomics
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Comparative transcriptomics
Comparative transcriptomicsComparative transcriptomics
Comparative transcriptomics
 
2014 whitney-research
2014 whitney-research2014 whitney-research
2014 whitney-research
 
Genome organization ,gene expression sand regulation
Genome organization ,gene expression sand regulation Genome organization ,gene expression sand regulation
Genome organization ,gene expression sand regulation
 
Lecture bioinformatics Part2.next generation
Lecture bioinformatics Part2.next generationLecture bioinformatics Part2.next generation
Lecture bioinformatics Part2.next generation
 
Microbial Genetics
Microbial GeneticsMicrobial Genetics
Microbial Genetics
 
Microbial Genetics
Microbial GeneticsMicrobial Genetics
Microbial Genetics
 
Microbial Genetics
Microbial GeneticsMicrobial Genetics
Microbial Genetics
 
Chapter 7 genome structure, chromatin, and the nucleosome (1)
Chapter 7   genome structure, chromatin, and the nucleosome (1)Chapter 7   genome structure, chromatin, and the nucleosome (1)
Chapter 7 genome structure, chromatin, and the nucleosome (1)
 
GENETICS IN ORTHODONTICS WORD.docx
GENETICS IN ORTHODONTICS WORD.docxGENETICS IN ORTHODONTICS WORD.docx
GENETICS IN ORTHODONTICS WORD.docx
 
Terminology related to molecular biology and genetics
Terminology related to molecular biology and geneticsTerminology related to molecular biology and genetics
Terminology related to molecular biology and genetics
 
Pertemuan 1. introduction
Pertemuan 1. introductionPertemuan 1. introduction
Pertemuan 1. introduction
 
Human genome project (2) converted
Human genome project (2) convertedHuman genome project (2) converted
Human genome project (2) converted
 
Content of the genome
Content of the genomeContent of the genome
Content of the genome
 
genetic variation
genetic variationgenetic variation
genetic variation
 
Sci cafe humangenome&health
Sci cafe humangenome&healthSci cafe humangenome&health
Sci cafe humangenome&health
 

More from Functional Genomics Data Society

Cshl minseqe 2013_ouellette
Cshl minseqe 2013_ouelletteCshl minseqe 2013_ouellette
Cshl minseqe 2013_ouellette
Functional Genomics Data Society
 
Michael Reich, GenomeSpace Workshop, fged_seattle_2013
Michael Reich, GenomeSpace Workshop, fged_seattle_2013Michael Reich, GenomeSpace Workshop, fged_seattle_2013
Michael Reich, GenomeSpace Workshop, fged_seattle_2013
Functional Genomics Data Society
 
Stefano Volinia, miRNA Signature - Breast Cancer, fged_seattle_2013
Stefano Volinia, miRNA Signature - Breast Cancer, fged_seattle_2013Stefano Volinia, miRNA Signature - Breast Cancer, fged_seattle_2013
Stefano Volinia, miRNA Signature - Breast Cancer, fged_seattle_2013
Functional Genomics Data Society
 
Kimberly Glass, Network model - Ovarian Cancer, fged_seattle_2013
Kimberly Glass, Network model - Ovarian Cancer, fged_seattle_2013Kimberly Glass, Network model - Ovarian Cancer, fged_seattle_2013
Kimberly Glass, Network model - Ovarian Cancer, fged_seattle_2013
Functional Genomics Data Society
 
Jenny Giannopoulou, Prostate cancer methylome, fged_seattle_2013
Jenny Giannopoulou, Prostate cancer methylome, fged_seattle_2013Jenny Giannopoulou, Prostate cancer methylome, fged_seattle_2013
Jenny Giannopoulou, Prostate cancer methylome, fged_seattle_2013
Functional Genomics Data Society
 
Ishwar Chandramouliswaran, Cancer Research, fged_seattle_2013
Ishwar Chandramouliswaran, Cancer Research, fged_seattle_2013Ishwar Chandramouliswaran, Cancer Research, fged_seattle_2013
Ishwar Chandramouliswaran, Cancer Research, fged_seattle_2013
Functional Genomics Data Society
 
Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013
Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013
Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013
Functional Genomics Data Society
 
Gary bader fged_toronto_2012
Gary bader fged_toronto_2012Gary bader fged_toronto_2012
Gary bader fged_toronto_2012
Functional Genomics Data Society
 

More from Functional Genomics Data Society (8)

Cshl minseqe 2013_ouellette
Cshl minseqe 2013_ouelletteCshl minseqe 2013_ouellette
Cshl minseqe 2013_ouellette
 
Michael Reich, GenomeSpace Workshop, fged_seattle_2013
Michael Reich, GenomeSpace Workshop, fged_seattle_2013Michael Reich, GenomeSpace Workshop, fged_seattle_2013
Michael Reich, GenomeSpace Workshop, fged_seattle_2013
 
Stefano Volinia, miRNA Signature - Breast Cancer, fged_seattle_2013
Stefano Volinia, miRNA Signature - Breast Cancer, fged_seattle_2013Stefano Volinia, miRNA Signature - Breast Cancer, fged_seattle_2013
Stefano Volinia, miRNA Signature - Breast Cancer, fged_seattle_2013
 
Kimberly Glass, Network model - Ovarian Cancer, fged_seattle_2013
Kimberly Glass, Network model - Ovarian Cancer, fged_seattle_2013Kimberly Glass, Network model - Ovarian Cancer, fged_seattle_2013
Kimberly Glass, Network model - Ovarian Cancer, fged_seattle_2013
 
Jenny Giannopoulou, Prostate cancer methylome, fged_seattle_2013
Jenny Giannopoulou, Prostate cancer methylome, fged_seattle_2013Jenny Giannopoulou, Prostate cancer methylome, fged_seattle_2013
Jenny Giannopoulou, Prostate cancer methylome, fged_seattle_2013
 
Ishwar Chandramouliswaran, Cancer Research, fged_seattle_2013
Ishwar Chandramouliswaran, Cancer Research, fged_seattle_2013Ishwar Chandramouliswaran, Cancer Research, fged_seattle_2013
Ishwar Chandramouliswaran, Cancer Research, fged_seattle_2013
 
Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013
Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013
Martin Ringwald, Mouse Gene Expression DB, fged_seattle_2013
 
Gary bader fged_toronto_2012
Gary bader fged_toronto_2012Gary bader fged_toronto_2012
Gary bader fged_toronto_2012
 

Recently uploaded

FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
ThomasParaiso2
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
RinaMondal9
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 

Recently uploaded (20)

FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 

Mar Gonzales Porta, One gene One transcript, fged_seattle_2013

  • 1. One gene – one transcript… mostly Mar Gonzales Porta, Adam Frankish, Johan Rung, Jennifer Harrow, Alvis Brazma European Bioinformatics Institute European Molecular Biology Laboratory Wellcome Trust Sanger Institute
  • 2. Analysis of RNA-seq data across different tissues and cell lines reveals that the majority of genes have a single dominant transcript Mar Gonzàlez-Porta Adam Frankish (Sanger) Johan Rung (EBI) Jennifer Harrow (Sanger) Alvis Brazma (EBI) To appear in Genome Biology on July 1, 2013 http://genomebiology.com/
  • 3. aaaaCAP ATG ter polypeptide transcription splicing translation Figure 1 Central dogma of molecular biolgoy Gene Transcript Spliced transcript
  • 4. The number of genes in human genome •  Estimates before the Human Genome Project •  ~100,000 genes •  After •  ~21,000 genes •  By comparison •  Yeast ~6,000 genes •  C. Elegance ~17,000
  • 5. Central gogma revised – one gene many transcritps and proteins via alternative splicing In human, there are 21,405 protein coding genes and 141,031 different isoforms (92,581 of which are protein coding) annotated, 17,413 genes have >1 isoform
  • 6. From RNA-seq and other recent experiments •  Most human genes have more than one splice-form expressed [Pan et al, Nat Genetics 2008], [Wang et al, Nat Genetics 2008] [Mortazavi, Nat Methods 2008] •  Several isoforms per gene are often expressed to significant levels either in the same cell type or across different [Wang et al, Nat Genetics 2008], [Tang et al, Nat Methods 2009], [Trapnell et al, Nat Biotechy 2010] •  Isoform expression is regulated [Waks et al, Mol Syst Biol 2011] but splicing can be noisy [Melamud, NAR 2009]
  • 7. ENCODE – Nature, September 2012 •  ‘Isoform expression by a gene does not follow a minimalistic expression strategy, resulting in a tendency for genes to express many isoforms simultaneously’ •  ‘[…] alternative isoforms within a gene are not expressed at similar levels, and one isoform dominates in a given condition)’ •  Which is it then?
  • 8. A fundamental question still remains – are different isoforms of the same gene expressed at similar levels? Expression level Isoforms 1 2 3 4 5 6 ? Isoforms 1 2 3 4 5 6 We think this is a fundamental question related to the complexity of transcritpome
  • 9. Data and methods Illumina Body Map 16 tissues PE (80M) no replicates HiSeq 2000 ENCODE 5 cell lines PE (40M) technical replicates GAII •  2 different datasets: 46 samples •  Three different state-of-art tools for transcript quantification (MISO, Cufflinks and mmseq) •  Direct evidence from splice junctions •  Simulated data – can the methods distinuish between the two scenarios? •  All approaches produced very consistent results
  • 10. Most genes express one predominant transcript Expression level 79% of cases 56% of cases 2-fold dominance 5-fold dominance
  • 11. Consistent across all studied tissues
  • 12. What are the RPKM distributions of first, second, … most abundant isoforms for all genes?
  • 13. Most genes express one predominant transcript
  • 14. Most genes express one predominant transcript We detect a total of 31,902 transcripts expressed above 1 FPKM in at least one tissue and 26,641 of these are major transcripts (ratio 1.12)
  • 15.
  • 16. There is just over one highly expressed transcript per gene! 85% of transcriptome comes from the dominant transcripts Is it the same dominant transcript in all tissues or different tissues tend to have different dominant transcripts?
  • 17. Major transcripts tend to be recurrent across tissues
  • 18. Switch events exist ~ 100 strong switches that change the protein sequence
  • 19.
  • 20. Some of the major transcripts are non- canonical AES
  • 21. Do the dominant transcripts of protein coding genes always code proteins? •  Only in about 80% cases •  However much more often in cytosol than in nucleolus •  In nucleolus the retained intron is predominantly located towards the 3’ end of the transcript •  Is this because we extract mRNA before the splicing is completed or can a retained intron be a form of expression regulation?
  • 22. Conclusions •  Most genes express one predominant transcript over the rest •  ~85% of the mRNA pool comes from major transcripts •  Major transcripts tend to be recurrent across samples, switch events exist but only a small number of these are likely to express different proteins •  Despite the transcriptome complexity the central dogma of molecular biology may be closer to the truth than recently believed!
  • 24. Funding •  EMBL member countries •  European Commission FP7 grants