PROPOSALS FOR THE FORMAT
FOR POPULATION DATA BASES
AND THEIR ANALYSIS
A. G. Smolyanitsky1, N. N. Khromov-Borisov1, G. B.
Lazzarotto2 and T. B. L. Kist2
1Forensic Medicine Bureau of Leningrad District, Saint
Petersburg, Russia
2Institute of Biosciences, Federal University of Rio Grande do
Sul, Porto Alegre, Brazil
Andrew.Smolyanitsky@yandex.ru
Nikita.KhromovBorisov@gmail.com
Gustavo.Lazzarotto@terra.com.br
Kist@molgen.mpg.de
DNA-PCR Data Banks
DNA-PCR Databank:
http://www.uni-duesseldorf.de/WWW/MedFak/Serology/database.html
DB on Nuclear DNA
http://www.ertzaintza.net/cgi-bin/db2www.exe/adn.d2w/INPUT?IDIOMA=INGLES
World population data
J. Forensic Sci. 45 (1) 118-146 (2000)
CODIS STR loci data
J. Forensic Sci. 46 (3) 453-489 (2001)
Precision and accuracy
Relative allele frequencies are sometimes
calculated or presented inaccurately
Precision of up to three significant
digits appears to be insufficient
Round-off
Sometimes the sum of the frequencies is not
equal to unity because of low precision or
round-off errors, e.g., 0.879 or 1.123
Sometimes it is difficult to round off correctly
the recalculated absolute frequencies, e.g.,
18.51 or 75.48
As a result, their sum may be an odd number
(allele counts for a diploid locus must sum to
the even number 2n) or may not equal the
published value
Uncertainties
Some data sets appear to be completely
identical
Such duplications may arise because the
same data are reproduced in
different publications
SANCT software can identify them
automatically, even in very large databases
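A minimal sketch of how exact duplicates could be flagged is given here. It is not the SANCT algorithm, which the slides do not describe; the function name, the data set names and the dictionary layout are all hypothetical. Each data set is reduced to an order-independent key built from its allele counts, and data sets sharing a key are reported together.

```python
from collections import defaultdict


def find_duplicate_datasets(datasets):
    """Group data set names whose allele counts are identical.

    `datasets` maps a (hypothetical) data set name to {allele: count}.
    Returns a list of name groups, one per set of identical data sets.
    """
    groups = defaultdict(list)
    for name, counts in datasets.items():
        # Canonical key: sorted (allele, count) pairs, zero counts ignored.
        key = tuple(sorted((a, c) for a, c in counts.items() if c > 0))
        groups[key].append(name)
    return [sorted(names) for names in groups.values() if len(names) > 1]


# Illustrative data only; the names and counts are made up for the example.
datasets = {
    "paper_1": {"A": 131, "B": 45, "C": 220},
    "paper_2": {"C": 220, "A": 131, "B": 45},  # same counts, different order
    "paper_3": {"A": 120, "B": 50, "C": 226},
}
duplicates = find_duplicate_datasets(datasets)
print(duplicates)
```

Identical counts are only a warning sign: two genuinely independent small samples can coincide by chance, so flagged pairs still need to be checked against the original publications.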
Independence
Some data sets seem to be non-
independent: preliminary data
published earlier are later combined
with new data in subsequent
publications
SANCT software facilitates their
detection
Collapsibility
Sometimes rare alleles are combined with
the nearest ones, e.g., 14+15+16
SANCT puts this manipulation on a solid
statistical footing:
Categories (alleles and/or samples)
are combined (collapsed) not arbitrarily
but only when they are statistically
homogeneous, e.g., 14+21
Precision
Compute relative frequencies with at least
four significant digits, or more (GDA)
Check that their sum equals unity:
Sum (pi) = 1.0000
Check the “re-computability” of the initial
absolute counts:
Sum (pi × N) = N
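The two checks can be sketched in a few lines. This is an illustrative sketch only: the function name and the `digits` parameter are assumptions, and the frequencies are the GC-locus values from the template slide (N = 396 allele copies), shown once with four digits and once rounded to three.

```python
def check_frequencies(freqs, n_total, digits=4):
    """Check published relative frequencies against the two rules above.

    Returns (sum_ok, recomputed_counts, counts_ok).
    """
    # Rule 1: the frequencies must sum to 1 at the stated precision.
    sum_ok = round(sum(freqs), digits) == 1.0
    # Rule 2: absolute counts recomputed as p_i * N must add up to N.
    # Python's round() is used here; exact .5 ties do not occur in the example.
    counts = [round(p * n_total) for p in freqs]
    counts_ok = sum(counts) == n_total
    return sum_ok, counts, counts_ok


four_digit = check_frequencies([0.3308, 0.1136, 0.5556], 396)
three_digit = check_frequencies([0.331, 0.114, 0.556], 396)
print(four_digit)   # sum check passes, counts recomputed as 131, 45, 220
print(three_digit)  # sum is 1.001, so the sum check fails
```

In this example the three-digit frequencies still return the correct counts, but they already fail the sum check; with more alleles or rarer alleles the recomputed counts can go wrong as well.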
Show individual genotypes
when feasible
ID        Locus A   Locus B   Locus Z
Xx-xxx    3.2/7     --/--     6/6
1         3207      0000      0606
Yy-yyy    6/14      17/18     9/9.3
2         0614      1718      0093
FSTAT is able to detect 0093 as an error
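The kind of check involved can be sketched as follows. The slides do not say which rule FSTAT applies, so the rule used here is an assumption: a four-digit code is read as two two-digit alleles, `00` means "not typed", and a genotype with only one half equal to `00` is treated as an error. The function name is hypothetical, and the sketch does not reproduce FSTAT's own validation.

```python
def check_genotype_code(code):
    """Classify a four-digit genotype code as 'ok', 'missing' or 'error'.

    Assumed convention (not taken from FSTAT documentation): two digits
    per allele, '00' for a missing allele.
    """
    if len(code) != 4 or not code.isdigit():
        return "error"
    first, second = code[:2], code[2:]
    if first == "00" and second == "00":
        return "missing"      # locus not typed at all
    if first == "00" or second == "00":
        return "error"        # only one allele recorded
    return "ok"


codes = ["3207", "0000", "0606", "0614", "1718", "0093"]
results = {code: check_genotype_code(code) for code in codes}
print(results)
```

Under this assumed rule, `0000` is accepted as a missing genotype, while `0093` is rejected because it reads as a missing allele paired with allele 93.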
Convertibility
Program   Import      Export
GDA       BIOSYS      BIOSYS
          GeneStrut   GeneStat-PC
          Weir        GeneStrut
                      Nexus
                      SAS
                      Weir
Convertibility
Program   Import      Export
GENETIX   GENEPOP     Arlequin
          FSTAT       BIOSYS
          Text        GENEPOP
                      FSTAT
Show absolute counts
Present genotype counts in the form of a
triangular matrix.
Such a presentation visualizes the
“saturation” of the data and allows
important information on the partial
fixation indices to be shown compactly
in the same matrix.
Template for genotype and allele counts,
partial fixation indices and relative allele
frequencies
(genotype counts on and below the diagonal,
partial fixation indices above it)
Locus: GC   n = 198
Allele    A     B     C      fii     Ni     pi
A        25   0.06  0.08    0.08    131   0.3308
B        14     2   0.06   -0.03     45   0.1136
C        67    27    63     0.04    220   0.5556
Total                       0.044   396   1.0000
GDA software computes fii
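The count-derived columns of the template can be recomputed from the triangular genotype matrix alone. The sketch below assumes standard moment estimators, which are not stated on the slide and are not claimed to be GDA's exact formulas: fii = (Pii - pi^2) / (pi (1 - pi)), and for the total, f = 1 - Ho/He with He multiplied by 2n/(2n - 1). The function and variable names are hypothetical. The off-diagonal partial indices are not recomputed.

```python
alleles = ["A", "B", "C"]
# Lower-triangular genotype counts from the template: row i holds
# the counts for genotypes (i, 0) .. (i, i).
genotype_counts = [
    [25],
    [14, 2],
    [67, 27, 63],
]


def summarize_locus(tri):
    """Return (n, allele_counts, freqs, f_ii, f_total) for one locus."""
    k = len(tri)
    n = sum(sum(row) for row in tri)        # number of genotyped individuals
    allele_counts = [0] * k
    for i, row in enumerate(tri):
        for j, count in enumerate(row):
            allele_counts[i] += count       # a homozygote (i == j) adds
            allele_counts[j] += count       # two copies of the same allele
    total = 2 * n                           # allele copies sampled
    freqs = [c / total for c in allele_counts]
    # Assumed per-allele fixation index from homozygote excess.
    f_ii = [
        (tri[i][i] / n - freqs[i] ** 2) / (freqs[i] * (1 - freqs[i]))
        for i in range(k)
    ]
    h_obs = 1 - sum(tri[i][i] for i in range(k)) / n
    h_exp = (1 - sum(p * p for p in freqs)) * total / (total - 1)
    f_total = 1 - h_obs / h_exp
    return n, allele_counts, freqs, f_ii, f_total


n, allele_counts, freqs, f_ii, f_total = summarize_locus(genotype_counts)
print(n, allele_counts)
print([round(p, 4) for p in freqs])
print([round(f, 2) for f in f_ii], round(f_total, 3))
```

With these assumed estimators the sketch returns the Ni, pi, fii and total values shown in the template, which is exactly the kind of re-computability the absolute counts are meant to guarantee.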
Availability
“Open and show all your data”,
visualization and “statistification”,
or GSP (Good Statistics Practice),
must be the main principles of
database building.
Make all your data available to
users, preferably online or on
request from the authors.
Sincere thanks
Drs.
Carsten Hohoff
Edwin Ehrlich
Kurt Trübner
for the invitation, help and support
