SlideShare a Scribd company logo
Human genome diversity: Frequently asked questions Guido Barbujani Dipartimento di Biologia ed Evoluzione, Università di Ferrara [email_address]
Total size   3 272 480 987(haploid) N of protein-coding genes   22 320 N of RNA-coding genes    9 922 N of gene exons   530 906 N of transcripts  142 707 N of segregating sites   15 040 632 Nucleotide differences with chimp   1.23% Chimp orthologue genes   13 454 Human genes missing in chimp 36 totally, 17 largely Classes of genes with max. differences  immune response,  reproduction, olfaction A few human genome statistics From www.ensembl.org version 57.37b (Jan. 2010)
Human-mouse alignment
Human-chimp alignment Chimp chromosomes 2 and 2a The human genome is very similar to the chimpanzee genome
Phylogenetic tree of human (n=70), chimpanzee (n=30), bonobo (n=5), gorilla (n=11) and orang-utan (n=14), based on 10,000 bp sequences of a noncoding Xq13.3 region. Kaessmann et al. (2001). Individual genetic diversity among humans is the lowest  of all primates
Genomic estimates of  F ST  for the global human population are    0.12  Human populations display    12% of the maximum possible diversity, given their allele frequencies   N of markers Samples F ST Reference 599,356 SNPs 209 individuals from 4 populations: Caucasian, Chinese, Japanese, Yoruba 0.13 Weir et al. 2005 1,034,741 SNPs 71 individuals from 4 populations: Caucasian, Chinese, Japanese, Yoruba 0.10 Weir et al. 2005 1,007,329 SNPs 269 individuals from 4 populations: Caucasian, Chinese, Japanese, Yoruba 0.12 International HapMap Consortium 2005 443,434 SNPs 3845 worldwide distributed individuals 0.052 Auton et al. 2009 2,841,354 SNPs 210 individuals from 4 populations: Caucasian, Chinese, Japanese, Yoruba 0.11 Barreiro et al. 2008 243,855 SNPs 554 individuals from 27 worldwide populations 0.123 Xing et al. 2009 100 Alu insertions 710 individuals from 23 worldwide populations 0.095 Watkins et al. 2008 67 CNVs  270 individuals from 4 populations with ancestry in Europe, Africa or Asia 0.11 Redon et al. 2006
0.38 0.32 0.12 Genetic diversity among human populations is the lowest  of all primates F ST Geographically-variable selection Small population sizes Little gene flow Isolation Stabilizing selection Large population sizes Extensive gene flow Admixture
Li et al. (2009) Clinal variation in the geographical space is the rule for human populations Cavalli-Sfdorza et al. (1994)
Methods 1 : Estimating variances from sequence comparisons - TA C GAACATC A GGC - - TA T GAACATC A GGC - - TA T GAACATC G GGC - Polymorphic DNA sites
Genetic variances within and between populations Population 1  Population 2  variance between pops. 100% 19% 0%
Independent studies of genetic variances yield very similar results: 85, 5, 10 Lewontin (1972)    17 loci 85% 8% 6% Latter (1973)  18 86% 5% 9% Barbujani et al. (1997) 109 85% 5%  10% Jorde et al. (2000) 100 85% 2%  13% Romualdi et al. (2002)    32 83% 8% 9% Rosenberg et al. (2002)  377 93%   3%  4% Excoffier & Hamilton (2003) 377  88% 3% 9% Ramachandran et al. (2005)   17 90%   5%   5% Bastos-Rodriguez et al. (2006)  40 86%   2%   12% Li et al. (2008)  650 000   89%   2%   9% MEDIANA within populations between populations between races or continents 85%   5%   10%
What does it mean, in practice? 100% 100% 100% Members of our community are only slightly less different from us than members of distant populations 85% 85% 85%
Mind the numbers Humans and chimps share >98% of their genomes Among the 1.8% differences, 1.7% are fixed differences within species The remaining fraction, 0.1%, contains all human genomic variation The differences among the main continental groups represent 10% of 0.1% of the total, that is, 0.01% But 0.1% of >3 billion DNA sites means >3 million polymorphic DNA sites (3,213,401 according to Levy et al. 2007)
Methods 2 : Clustering genotypes or haplotypes K=3 K=4 Rosenberg et al., 2002
SNPs Haplotypes CNV Jakobsson et al. (2008) Structure inferred from SNPs and haplotypes differs from that inferred from Copy Number Variation
Genes, as well as morphology, suggest inconsistent clusterings of genotypes Y chromosome: Romualdi et al. 2002 X chromosome: Wilson et al. 2001 377 STR loci: Barbujani and Belle 2006 377 STR loci: Rosenberg et al. 2002 Europe, Ethiopia S. Africa   N. Guinea Asia Africa Asia, Europe, Australia, Americas Americas Melanesia Eurasia N Africa N America Maya S. Africa E Africa C Africa Piapoco Suruì Karitiana Kalash W. Eurasia E. Asia Africa Americas Oceania
Sampling assumptions have a large effect on the apparent structuring
Sampling assumptions have a large effect on the apparent structuring Serre and P ääbo (2004)
ASIP A8818G MATP C374G Genetic variation is discordant across loci
Similar skin colors are due to different combinations of alleles MATP C374G ASIP A8818G
16 completely sequenced genomes  (as of May 1st, 2010)   And 5 more published on May 6th:
Two persons from the same continent may share fewer SNPs than persons of different continents
81% of SNPs cosmopolitan. Alleles present in one continent only: 0.91% in Africa, 0.75% in Asia, practically 0 elsewhere. Jakobsson et al. 2008 (525910 SNPs, 396 CNVs)
In  the 117 megabases (Mb) of sequenced exome-containing intervals, the average rate of nucleotide difference between a pair of the Bushmen was 1.2 per kb, compared to an average of 1.0 per kb between a European and Asian individual. Schuster et al. (2010) Greater differences between Africans than between European and Asians
Genetic diversity out of Africa is often a subset ot the African genetic diversity   Tishkoff et al. (1998)
LD decreasing with physical distance between loci and with geographic distance from East Africa Jakobsson et al. 2008
Gene diversity declines as a function of distance from Africa Best fit of the model for an African exit 56,000 years ago Liu et al. (2006)
Patterns of  morphological  and  genetic  variation are compatible with the effects of dispersal from Africa Manica et al. (2007)
Models with an African population replacing previous human continental groups explain the data better than any alternative models Fagundes et al. (2007)
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],To make a long story short: So, what happened?
100,000 years ago
70,000 years ago
60,000 years ago
30,000 years ago
10,000 years ago
 
[object Object],[object Object],[object Object],[object Object],Preliminary analyses
Divergence from modern humans Neandertals  fall inside the variation of present-day humans. Overall divergence greater for the three Neandertal genomes (modes ~11%), whereas the San mode is ~9% and for the other present-day humans ~8%. For the Neandertals, 13% of windows have a divergence above 20%, whereas this is the case for 2.5% to 3.7% of windows in the current humans
Segments of Neandertal genome in non-African genomes ,[object Object],[object Object],[object Object],[object Object],[object Object],?
1. Comparison with the HapMap sequences and 5 newly sequenced individuals 2. No comparison within Eurasia  (Papuan-French-Han) or within Africa (Yoruba- San) shows significant skews in D  3. All comparisons of non-Africans and Africans show that the Neandertal is closer to the  non-Africans 4. All or almost all the gene flow detected was from Neandertals into modern humans 5. Some old haplotypes most likely owe their presence in non Africans to gene flow from Neandertals
Four processes potentially accounting for the data  Between 1 and 4% of the genomes of people in Eurasia are derived from Neandertals 1. From erectus to Neandertal 2. From late Neandertals into the first Europeans 3. From early Neandertals into the first Eurasians 4. Ancient population structure, preserved from before the Neandertal – modern sapiens separation
Enza Colonna And if you want to read more about all this  Trends in Genetics , July 2010

More Related Content

What's hot

Genomic databases
Genomic databasesGenomic databases
Genomic databases
DrSatyabrataSahoo
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
Somdutt Sharma
 
Data retrieval
Data retrievalData retrieval
RNA-Seq
RNA-SeqRNA-Seq
Introduction to ncbi, embl, ddbj
Introduction to ncbi, embl, ddbjIntroduction to ncbi, embl, ddbj
Introduction to ncbi, embl, ddbj
KAUSHAL SAHU
 
Epigenomics
EpigenomicsEpigenomics
Epigenomics
Adonis Sfera, MD
 
Population genetics
Population geneticsPopulation genetics
Population genetics
LubnaSSubair
 
Primary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyanaPrimary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyana
Puneet Kulyana
 
Genotyping, linkage mapping and binary data
Genotyping, linkage mapping and binary dataGenotyping, linkage mapping and binary data
Genotyping, linkage mapping and binary data
FAO
 
Lecture 6 candidate gene association full
Lecture 6 candidate gene association fullLecture 6 candidate gene association full
Lecture 6 candidate gene association full
Lekki Frazier-Wood
 
RNA-Seq with R-Bioconductor
RNA-Seq with R-BioconductorRNA-Seq with R-Bioconductor
Bioinformatics workshop presentation
Bioinformatics   workshop presentationBioinformatics   workshop presentation
Bioinformatics workshop presentation
SKUAST-Kashmir
 
Uni prot presentation
Uni prot presentationUni prot presentation
Uni prot presentation
Rida Khalid
 
Phylogenetic analysis
Phylogenetic analysisPhylogenetic analysis
Phylogenetic analysis
National Institute of Biologics
 
Supporting Genomics in the Practice of Medicine by Heidi Rehm
Supporting Genomics in the Practice of Medicine by Heidi RehmSupporting Genomics in the Practice of Medicine by Heidi Rehm
Supporting Genomics in the Practice of Medicine by Heidi Rehm
Knome_Inc
 
TYPES OF MOLECULAR MARKERS,ITS ADVANTAGES AND DISADVANTAGES
TYPES OF MOLECULAR MARKERS,ITS ADVANTAGES AND DISADVANTAGESTYPES OF MOLECULAR MARKERS,ITS ADVANTAGES AND DISADVANTAGES
TYPES OF MOLECULAR MARKERS,ITS ADVANTAGES AND DISADVANTAGES
ANFAS KT
 
BioInformatics MCQ
BioInformatics MCQBioInformatics MCQ
BioInformatics MCQ
Afra Fathima
 
Similarity
SimilaritySimilarity
Similarity
hiratufail
 
Snp
SnpSnp
sequence of file formats in bioinformatics
sequence of file formats in bioinformaticssequence of file formats in bioinformatics
sequence of file formats in bioinformatics
nadeem akhter
 

What's hot (20)

Genomic databases
Genomic databasesGenomic databases
Genomic databases
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Data retrieval
Data retrievalData retrieval
Data retrieval
 
RNA-Seq
RNA-SeqRNA-Seq
RNA-Seq
 
Introduction to ncbi, embl, ddbj
Introduction to ncbi, embl, ddbjIntroduction to ncbi, embl, ddbj
Introduction to ncbi, embl, ddbj
 
Epigenomics
EpigenomicsEpigenomics
Epigenomics
 
Population genetics
Population geneticsPopulation genetics
Population genetics
 
Primary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyanaPrimary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyana
 
Genotyping, linkage mapping and binary data
Genotyping, linkage mapping and binary dataGenotyping, linkage mapping and binary data
Genotyping, linkage mapping and binary data
 
Lecture 6 candidate gene association full
Lecture 6 candidate gene association fullLecture 6 candidate gene association full
Lecture 6 candidate gene association full
 
RNA-Seq with R-Bioconductor
RNA-Seq with R-BioconductorRNA-Seq with R-Bioconductor
RNA-Seq with R-Bioconductor
 
Bioinformatics workshop presentation
Bioinformatics   workshop presentationBioinformatics   workshop presentation
Bioinformatics workshop presentation
 
Uni prot presentation
Uni prot presentationUni prot presentation
Uni prot presentation
 
Phylogenetic analysis
Phylogenetic analysisPhylogenetic analysis
Phylogenetic analysis
 
Supporting Genomics in the Practice of Medicine by Heidi Rehm
Supporting Genomics in the Practice of Medicine by Heidi RehmSupporting Genomics in the Practice of Medicine by Heidi Rehm
Supporting Genomics in the Practice of Medicine by Heidi Rehm
 
TYPES OF MOLECULAR MARKERS,ITS ADVANTAGES AND DISADVANTAGES
TYPES OF MOLECULAR MARKERS,ITS ADVANTAGES AND DISADVANTAGESTYPES OF MOLECULAR MARKERS,ITS ADVANTAGES AND DISADVANTAGES
TYPES OF MOLECULAR MARKERS,ITS ADVANTAGES AND DISADVANTAGES
 
BioInformatics MCQ
BioInformatics MCQBioInformatics MCQ
BioInformatics MCQ
 
Similarity
SimilaritySimilarity
Similarity
 
Snp
SnpSnp
Snp
 
sequence of file formats in bioinformatics
sequence of file formats in bioinformaticssequence of file formats in bioinformatics
sequence of file formats in bioinformatics
 

Viewers also liked

Barbujani leicester
Barbujani leicesterBarbujani leicester
Comparing genes across linguistic families
Comparing genes across linguistic familiesComparing genes across linguistic families
Comparing genes across linguistic families
Genetica, Ferrara University, Italy
 
Genpop10coal e abc
Genpop10coal e abcGenpop10coal e abc
Milano darwinday
Milano darwindayMilano darwinday
Genpop11a dna
Genpop11a dnaGenpop11a dna
Barbujani abt lecture
Barbujani abt lectureBarbujani abt lecture

Viewers also liked (16)

Gen pop7geneflow
Gen pop7geneflowGen pop7geneflow
Gen pop7geneflow
 
Barbujani leicester
Barbujani leicesterBarbujani leicester
Barbujani leicester
 
Comparing genes across linguistic families
Comparing genes across linguistic familiesComparing genes across linguistic families
Comparing genes across linguistic families
 
Gen pop1var
Gen pop1varGen pop1var
Gen pop1var
 
Genpop10coal e abc
Genpop10coal e abcGenpop10coal e abc
Genpop10coal e abc
 
Gen pop4ld
Gen pop4ldGen pop4ld
Gen pop4ld
 
Gen pop5mut
Gen pop5mutGen pop5mut
Gen pop5mut
 
Gen pop9mantpol
Gen pop9mantpolGen pop9mantpol
Gen pop9mantpol
 
Genetica di popolazioni 3
Genetica di popolazioni 3Genetica di popolazioni 3
Genetica di popolazioni 3
 
Milano darwinday
Milano darwindayMilano darwinday
Milano darwinday
 
Gen pop8selezione
Gen pop8selezioneGen pop8selezione
Gen pop8selezione
 
Genpop11a dna
Genpop11a dnaGenpop11a dna
Genpop11a dna
 
Genetica di Popolazioni 2
Genetica di Popolazioni 2Genetica di Popolazioni 2
Genetica di Popolazioni 2
 
Bari1 Darwin
Bari1 DarwinBari1 Darwin
Bari1 Darwin
 
Barbujani abt lecture
Barbujani abt lectureBarbujani abt lecture
Barbujani abt lecture
 
Gen pop6drift
Gen pop6driftGen pop6drift
Gen pop6drift
 

Similar to Lisbon genome diversity

Colloquium Presentation 2009 Fall Bongsoo
Colloquium Presentation 2009 Fall BongsooColloquium Presentation 2009 Fall Bongsoo
Colloquium Presentation 2009 Fall Bongsoo
Bongsoo Park
 
Nucleotide Variation and Selective Pressure in the Mitochondrial Genome of Af...
Nucleotide Variation and Selective Pressure in the Mitochondrial Genome of Af...Nucleotide Variation and Selective Pressure in the Mitochondrial Genome of Af...
Nucleotide Variation and Selective Pressure in the Mitochondrial Genome of Af...
International Journal of Science and Research (IJSR)
 
Insights into the genetic diversity and structure of indigenous ovi-caprine p...
Insights into the genetic diversity and structure of indigenous ovi-caprine p...Insights into the genetic diversity and structure of indigenous ovi-caprine p...
Insights into the genetic diversity and structure of indigenous ovi-caprine p...
ILRI
 
Honors ~ Evolution 1011
Honors ~ Evolution 1011Honors ~ Evolution 1011
Honors ~ Evolution 1011
Michael Edgar
 
Biodiverse - Rosauer talk @ iEvoBio conference June 2010
Biodiverse - Rosauer talk @ iEvoBio conference June 2010Biodiverse - Rosauer talk @ iEvoBio conference June 2010
Biodiverse - Rosauer talk @ iEvoBio conference June 2010
Dan Rosauer
 
Rym kefi (1)
Rym kefi (1)Rym kefi (1)
Rym kefi (1)
Gonçalo Figueira
 
Human Evolution Talk
Human Evolution TalkHuman Evolution Talk
Human Evolution Talk
Steven Hamblin
 
Rice
RiceRice
An integrated map of genetic variation from 1,092
An integrated map of genetic variation from 1,092An integrated map of genetic variation from 1,092
An integrated map of genetic variation from 1,092
Grigory Sapunov
 
Unveiling Hidden Treasures of Indigenous Cattle In Zambia Using Macrosatilltes
Unveiling Hidden Treasures of Indigenous Cattle In Zambia Using MacrosatilltesUnveiling Hidden Treasures of Indigenous Cattle In Zambia Using Macrosatilltes
Unveiling Hidden Treasures of Indigenous Cattle In Zambia Using Macrosatilltes
MSIMUKO ELLISON
 
J. Mollus. Stud.-2015-Carvalho-mollus-eyv014
J. Mollus. Stud.-2015-Carvalho-mollus-eyv014J. Mollus. Stud.-2015-Carvalho-mollus-eyv014
J. Mollus. Stud.-2015-Carvalho-mollus-eyv014
Carolina Ruivo Pereira
 
Bioinformatics & biostatistics tools for monogenic and multifactorial disease...
Bioinformatics & biostatistics tools for monogenic and multifactorial disease...Bioinformatics & biostatistics tools for monogenic and multifactorial disease...
Bioinformatics & biostatistics tools for monogenic and multifactorial disease...
Pasteur_Tunis
 
Lise Sandenbergh poster
Lise Sandenbergh poster Lise Sandenbergh poster
Lise Sandenbergh poster
Lise Sandenbergh
 
Talk at Institut Jean Nicod on 6 October 2010
Talk at Institut Jean Nicod on 6 October 2010Talk at Institut Jean Nicod on 6 October 2010
Talk at Institut Jean Nicod on 6 October 2010
Robin Ryder
 
Human genetic diversity and origin of major human groups
Human genetic diversity and origin of major human groupsHuman genetic diversity and origin of major human groups
Human genetic diversity and origin of major human groups
Mayank Sagar
 
Comparing the Amount and Quality of Information from Different Sequencing Str...
Comparing the Amount and Quality of Information from Different Sequencing Str...Comparing the Amount and Quality of Information from Different Sequencing Str...
Comparing the Amount and Quality of Information from Different Sequencing Str...
jembrown
 
Genetic variation and its role in health pharmacology
Genetic variation and its role in health pharmacologyGenetic variation and its role in health pharmacology
Genetic variation and its role in health pharmacology
Deepak Kumar
 
Human genetic variation and its contribution to complex traits
Human genetic variation and its contribution to complex traitsHuman genetic variation and its contribution to complex traits
Human genetic variation and its contribution to complex traits
groovescience
 
Early Human Origins
Early Human OriginsEarly Human Origins
Early Human Origins
bdrydyk
 
Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)
Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)
Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)
Kate Hertweck
 

Similar to Lisbon genome diversity (20)

Colloquium Presentation 2009 Fall Bongsoo
Colloquium Presentation 2009 Fall BongsooColloquium Presentation 2009 Fall Bongsoo
Colloquium Presentation 2009 Fall Bongsoo
 
Nucleotide Variation and Selective Pressure in the Mitochondrial Genome of Af...
Nucleotide Variation and Selective Pressure in the Mitochondrial Genome of Af...Nucleotide Variation and Selective Pressure in the Mitochondrial Genome of Af...
Nucleotide Variation and Selective Pressure in the Mitochondrial Genome of Af...
 
Insights into the genetic diversity and structure of indigenous ovi-caprine p...
Insights into the genetic diversity and structure of indigenous ovi-caprine p...Insights into the genetic diversity and structure of indigenous ovi-caprine p...
Insights into the genetic diversity and structure of indigenous ovi-caprine p...
 
Honors ~ Evolution 1011
Honors ~ Evolution 1011Honors ~ Evolution 1011
Honors ~ Evolution 1011
 
Biodiverse - Rosauer talk @ iEvoBio conference June 2010
Biodiverse - Rosauer talk @ iEvoBio conference June 2010Biodiverse - Rosauer talk @ iEvoBio conference June 2010
Biodiverse - Rosauer talk @ iEvoBio conference June 2010
 
Rym kefi (1)
Rym kefi (1)Rym kefi (1)
Rym kefi (1)
 
Human Evolution Talk
Human Evolution TalkHuman Evolution Talk
Human Evolution Talk
 
Rice
RiceRice
Rice
 
An integrated map of genetic variation from 1,092
An integrated map of genetic variation from 1,092An integrated map of genetic variation from 1,092
An integrated map of genetic variation from 1,092
 
Unveiling Hidden Treasures of Indigenous Cattle In Zambia Using Macrosatilltes
Unveiling Hidden Treasures of Indigenous Cattle In Zambia Using MacrosatilltesUnveiling Hidden Treasures of Indigenous Cattle In Zambia Using Macrosatilltes
Unveiling Hidden Treasures of Indigenous Cattle In Zambia Using Macrosatilltes
 
J. Mollus. Stud.-2015-Carvalho-mollus-eyv014
J. Mollus. Stud.-2015-Carvalho-mollus-eyv014J. Mollus. Stud.-2015-Carvalho-mollus-eyv014
J. Mollus. Stud.-2015-Carvalho-mollus-eyv014
 
Bioinformatics & biostatistics tools for monogenic and multifactorial disease...
Bioinformatics & biostatistics tools for monogenic and multifactorial disease...Bioinformatics & biostatistics tools for monogenic and multifactorial disease...
Bioinformatics & biostatistics tools for monogenic and multifactorial disease...
 
Lise Sandenbergh poster
Lise Sandenbergh poster Lise Sandenbergh poster
Lise Sandenbergh poster
 
Talk at Institut Jean Nicod on 6 October 2010
Talk at Institut Jean Nicod on 6 October 2010Talk at Institut Jean Nicod on 6 October 2010
Talk at Institut Jean Nicod on 6 October 2010
 
Human genetic diversity and origin of major human groups
Human genetic diversity and origin of major human groupsHuman genetic diversity and origin of major human groups
Human genetic diversity and origin of major human groups
 
Comparing the Amount and Quality of Information from Different Sequencing Str...
Comparing the Amount and Quality of Information from Different Sequencing Str...Comparing the Amount and Quality of Information from Different Sequencing Str...
Comparing the Amount and Quality of Information from Different Sequencing Str...
 
Genetic variation and its role in health pharmacology
Genetic variation and its role in health pharmacologyGenetic variation and its role in health pharmacology
Genetic variation and its role in health pharmacology
 
Human genetic variation and its contribution to complex traits
Human genetic variation and its contribution to complex traitsHuman genetic variation and its contribution to complex traits
Human genetic variation and its contribution to complex traits
 
Early Human Origins
Early Human OriginsEarly Human Origins
Early Human Origins
 
Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)
Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)
Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)
 

More from Genetica, Ferrara University, Italy

13 estensioni mendel
13 estensioni mendel13 estensioni mendel
08 genomica
08 genomica08 genomica
06 traduzione
06 traduzione06 traduzione
Perché non possiamo non dirci africani. Otto cose da ricordare sulla biodiver...
Perché non possiamo non dirci africani. Otto cose da ricordare sulla biodiver...Perché non possiamo non dirci africani. Otto cose da ricordare sulla biodiver...
Perché non possiamo non dirci africani. Otto cose da ricordare sulla biodiver...
Genetica, Ferrara University, Italy
 

More from Genetica, Ferrara University, Italy (17)

13 estensioni mendel
13 estensioni mendel13 estensioni mendel
13 estensioni mendel
 
08 genomica
08 genomica08 genomica
08 genomica
 
06 traduzione
06 traduzione06 traduzione
06 traduzione
 
04 funzione del gene
04 funzione del gene04 funzione del gene
04 funzione del gene
 
Perché non possiamo non dirci africani. Otto cose da ricordare sulla biodiver...
Perché non possiamo non dirci africani. Otto cose da ricordare sulla biodiver...Perché non possiamo non dirci africani. Otto cose da ricordare sulla biodiver...
Perché non possiamo non dirci africani. Otto cose da ricordare sulla biodiver...
 
Rovereto
RoveretoRovereto
Rovereto
 
Genpop9coal e abc
Genpop9coal e abcGenpop9coal e abc
Genpop9coal e abc
 
21 genetica di popolazioni
21 genetica di popolazioni21 genetica di popolazioni
21 genetica di popolazioni
 
20 genetica del cancro
20 genetica del cancro20 genetica del cancro
20 genetica del cancro
 
18 regolazione eucarioti
18 regolazione eucarioti18 regolazione eucarioti
18 regolazione eucarioti
 
17 regolazione procarioti
17 regolazione procarioti17 regolazione procarioti
17 regolazione procarioti
 
16 variazione cromosomi
16 variazione cromosomi16 variazione cromosomi
16 variazione cromosomi
 
15 mappe genetiche procarioti
15 mappe genetiche procarioti15 mappe genetiche procarioti
15 mappe genetiche procarioti
 
14 mappe genetiche eucarioti
14 mappe genetiche eucarioti14 mappe genetiche eucarioti
14 mappe genetiche eucarioti
 
12 basi cromosomiche
12 basi cromosomiche12 basi cromosomiche
12 basi cromosomiche
 
11 genetica mendeliana
11 genetica mendeliana11 genetica mendeliana
11 genetica mendeliana
 
05 trascrizione
05 trascrizione05 trascrizione
05 trascrizione
 

Recently uploaded

"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
Fwdays
 
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectorsConnector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
DianaGray10
 
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin..."$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
Fwdays
 
Mutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented ChatbotsMutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented Chatbots
Pablo Gómez Abajo
 
From Natural Language to Structured Solr Queries using LLMs
From Natural Language to Structured Solr Queries using LLMsFrom Natural Language to Structured Solr Queries using LLMs
From Natural Language to Structured Solr Queries using LLMs
Sease
 
Christine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptxChristine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptx
christinelarrosa
 
Getting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
Getting the Most Out of ScyllaDB Monitoring: ShareChat's TipsGetting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
Getting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
ScyllaDB
 
ScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking ReplicationScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking Replication
ScyllaDB
 
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdfLee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
leebarnesutopia
 
Session 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdfSession 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdf
UiPathCommunity
 
"What does it really mean for your system to be available, or how to define w...
"What does it really mean for your system to be available, or how to define w..."What does it really mean for your system to be available, or how to define w...
"What does it really mean for your system to be available, or how to define w...
Fwdays
 
Day 2 - Intro to UiPath Studio Fundamentals
Day 2 - Intro to UiPath Studio FundamentalsDay 2 - Intro to UiPath Studio Fundamentals
Day 2 - Intro to UiPath Studio Fundamentals
UiPathCommunity
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
c5vrf27qcz
 
Discover the Unseen: Tailored Recommendation of Unwatched Content
Discover the Unseen: Tailored Recommendation of Unwatched ContentDiscover the Unseen: Tailored Recommendation of Unwatched Content
Discover the Unseen: Tailored Recommendation of Unwatched Content
ScyllaDB
 
Principle of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptxPrinciple of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptx
BibashShahi
 
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
Fwdays
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
Antonios Katsarakis
 
The Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptxThe Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptx
operationspcvita
 
Apps Break Data
Apps Break DataApps Break Data
Apps Break Data
Ivo Velitchkov
 
Christine's Product Research Presentation.pptx
Christine's Product Research Presentation.pptxChristine's Product Research Presentation.pptx
Christine's Product Research Presentation.pptx
christinelarrosa
 

Recently uploaded (20)

"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
 
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectorsConnector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
Connector Corner: Seamlessly power UiPath Apps, GenAI with prebuilt connectors
 
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin..."$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
 
Mutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented ChatbotsMutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented Chatbots
 
From Natural Language to Structured Solr Queries using LLMs
From Natural Language to Structured Solr Queries using LLMsFrom Natural Language to Structured Solr Queries using LLMs
From Natural Language to Structured Solr Queries using LLMs
 
Christine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptxChristine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptx
 
Getting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
Getting the Most Out of ScyllaDB Monitoring: ShareChat's TipsGetting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
Getting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
 
ScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking ReplicationScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking Replication
 
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdfLee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
Lee Barnes - Path to Becoming an Effective Test Automation Engineer.pdf
 
Session 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdfSession 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdf
 
"What does it really mean for your system to be available, or how to define w...
"What does it really mean for your system to be available, or how to define w..."What does it really mean for your system to be available, or how to define w...
"What does it really mean for your system to be available, or how to define w...
 
Day 2 - Intro to UiPath Studio Fundamentals
Day 2 - Intro to UiPath Studio FundamentalsDay 2 - Intro to UiPath Studio Fundamentals
Day 2 - Intro to UiPath Studio Fundamentals
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
 
Discover the Unseen: Tailored Recommendation of Unwatched Content
Discover the Unseen: Tailored Recommendation of Unwatched ContentDiscover the Unseen: Tailored Recommendation of Unwatched Content
Discover the Unseen: Tailored Recommendation of Unwatched Content
 
Principle of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptxPrinciple of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptx
 
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba"NATO Hackathon Winner: AI-Powered Drug Search",  Taras Kloba
"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
 
The Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptxThe Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptx
 
Apps Break Data
Apps Break DataApps Break Data
Apps Break Data
 
Christine's Product Research Presentation.pptx
Christine's Product Research Presentation.pptxChristine's Product Research Presentation.pptx
Christine's Product Research Presentation.pptx
 

Lisbon genome diversity

  • 1. Human genome diversity: Frequently asked questions Guido Barbujani Dipartimento di Biologia ed Evoluzione, Università di Ferrara [email_address]
  • 2. Total size 3 272 480 987(haploid) N of protein-coding genes 22 320 N of RNA-coding genes 9 922 N of gene exons 530 906 N of transcripts 142 707 N of segregating sites 15 040 632 Nucleotide differences with chimp 1.23% Chimp orthologue genes 13 454 Human genes missing in chimp 36 totally, 17 largely Classes of genes with max. differences immune response, reproduction, olfaction A few human genome statistics From www.ensembl.org version 57.37b (Jan. 2010)
  • 4. Human-chimp alignment Chimp chromosomes 2 and 2a The human genome is very similar to the chimpanzee genome
  • 5. Phylogenetic tree of human (n=70), chimpanzee (n=30), bonobo (n=5), gorilla (n=11) and orang-utan (n=14), based on 10,000 bp sequences of a noncoding Xq13.3 region. Kaessmann et al. (2001). Individual genetic diversity among humans is the lowest of all primates
  • 6. Genomic estimates of F ST for the global human population are  0.12 Human populations display  12% of the maximum possible diversity, given their allele frequencies N of markers Samples F ST Reference 599,356 SNPs 209 individuals from 4 populations: Caucasian, Chinese, Japanese, Yoruba 0.13 Weir et al. 2005 1,034,741 SNPs 71 individuals from 4 populations: Caucasian, Chinese, Japanese, Yoruba 0.10 Weir et al. 2005 1,007,329 SNPs 269 individuals from 4 populations: Caucasian, Chinese, Japanese, Yoruba 0.12 International HapMap Consortium 2005 443,434 SNPs 3845 worldwide distributed individuals 0.052 Auton et al. 2009 2,841,354 SNPs 210 individuals from 4 populations: Caucasian, Chinese, Japanese, Yoruba 0.11 Barreiro et al. 2008 243,855 SNPs 554 individuals from 27 worldwide populations 0.123 Xing et al. 2009 100 Alu insertions 710 individuals from 23 worldwide populations 0.095 Watkins et al. 2008 67 CNVs 270 individuals from 4 populations with ancestry in Europe, Africa or Asia 0.11 Redon et al. 2006
  • 7. 0.38 0.32 0.12 Genetic diversity among human populations is the lowest of all primates F ST Geographically-variable selection Small population sizes Little gene flow Isolation Stabilizing selection Large population sizes Extensive gene flow Admixture
  • 8. Li et al. (2009) Clinal variation in the geographical space is the rule for human populations Cavalli-Sfdorza et al. (1994)
  • 9. Methods 1 : Estimating variances from sequence comparisons - TA C GAACATC A GGC - - TA T GAACATC A GGC - - TA T GAACATC G GGC - Polymorphic DNA sites
  • 10. Genetic variances within and between populations Population 1 Population 2 variance between pops. 100% 19% 0%
  • 11. Independent studies of genetic variances yield very similar results: 85, 5, 10 Lewontin (1972) 17 loci 85% 8% 6% Latter (1973) 18 86% 5% 9% Barbujani et al. (1997) 109 85% 5% 10% Jorde et al. (2000) 100 85% 2% 13% Romualdi et al. (2002) 32 83% 8% 9% Rosenberg et al. (2002) 377 93% 3% 4% Excoffier & Hamilton (2003) 377 88% 3% 9% Ramachandran et al. (2005) 17 90% 5% 5% Bastos-Rodriguez et al. (2006) 40 86% 2% 12% Li et al. (2008) 650 000 89% 2% 9% MEDIANA within populations between populations between races or continents 85% 5% 10%
  • 12. What does it mean, in practice? 100% 100% 100% Members of our community are only slightly less different from us than members of distant populations 85% 85% 85%
  • 13. Mind the numbers Humans and chimps share >98% of their genomes Among the 1.8% differences, 1.7% are fixed differences within species The remaining fraction, 0.1%, contains all human genomic variation The differences among the main continental groups represent 10% of 0.1% of the total, that is, 0.01% But 0.1% of >3 billion DNA sites means >3 million polymorphic DNA sites (3,213,401 according to Levy et al. 2007)
  • 14. Methods 2 : Clustering genotypes or haplotypes K=3 K=4 Rosenberg et al., 2002
  • 15. SNPs Haplotypes CNV Jakobsson et al. (2008) Structure inferred from SNPs and haplotypes differs from that inferred from Copy Number Variation
  • 16. Genes, as well as morphology, suggest inconsistent clusterings of genotypes Y chromosome: Romualdi et al. 2002 X chromosome: Wilson et al. 2001 377 STR loci: Barbujani and Belle 2006 377 STR loci: Rosenberg et al. 2002 Europe, Ethiopia S. Africa N. Guinea Asia Africa Asia, Europe, Australia, Americas Americas Melanesia Eurasia N Africa N America Maya S. Africa E Africa C Africa Piapoco Suruì Karitiana Kalash W. Eurasia E. Asia Africa Americas Oceania
  • 17. Sampling assumptions have a large effect on the apparent structuring
  • 18. Sampling assumptions have a large effect on the apparent structuring Serre and P ääbo (2004)
  • 19. ASIP A8818G MATP C374G Genetic variation is discordant across loci
  • 20. Similar skin colors are due to different combinations of alleles MATP C374G ASIP A8818G
  • 21. 16 completely sequenced genomes (as of May 1st, 2010) And 5 more published on May 6th:
  • 22. Two persons from the same continent may share fewer SNPs than persons of different continents
  • 23. 81% of SNPs cosmopolitan. Alleles present in one continent only: 0.91% in Africa, 0.75% in Asia, practically 0 elsewhere. Jakobsson et al. 2008 (525910 SNPs, 396 CNVs)
  • 24. In the 117 megabases (Mb) of sequenced exome-containing intervals, the average rate of nucleotide difference between a pair of the Bushmen was 1.2 per kb, compared to an average of 1.0 per kb between a European and Asian individual. Schuster et al. (2010) Greater differences between Africans than between European and Asians
  • 25. Genetic diversity out of Africa is often a subset ot the African genetic diversity Tishkoff et al. (1998)
  • 26. LD decreasing with physical distance between loci and with geographic distance from East Africa Jakobsson et al. 2008
  • 27. Gene diversity declines as a function of distance from Africa Best fit of the model for an African exit 56,000 years ago Liu et al. (2006)
  • 28. Patterns of morphological and genetic variation are compatible with the effects of dispersal from Africa Manica et al. (2007)
  • 29. Models with an African population replacing previous human continental groups explain the data better than any alternative models Fagundes et al. (2007)
  • 30.
  • 36.  
  • 37.
  • 38. Divergence from modern humans Neandertals fall inside the variation of present-day humans. Overall divergence greater for the three Neandertal genomes (modes ~11%), whereas the San mode is ~9% and for the other present-day humans ~8%. For the Neandertals, 13% of windows have a divergence above 20%, whereas this is the case for 2.5% to 3.7% of windows in the current humans
  • 39.
  • 40. 1. Comparison with the HapMap sequences and 5 newly sequenced individuals 2. No comparison within Eurasia (Papuan-French-Han) or within Africa (Yoruba- San) shows significant skews in D 3. All comparisons of non-Africans and Africans show that the Neandertal is closer to the non-Africans 4. All or almost all the gene flow detected was from Neandertals into modern humans 5. Some old haplotypes most likely owe their presence in non Africans to gene flow from Neandertals
  • 41. Four processes potentially accounting for the data Between 1 and 4% of the genomes of people in Eurasia are derived from Neandertals 1. From erectus to Neandertal 2. From late Neandertals into the first Europeans 3. From early Neandertals into the first Eurasians 4. Ancient population structure, preserved from before the Neandertal – modern sapiens separation
  • 42. Enza Colonna And if you want to read more about all this Trends in Genetics , July 2010