SlideShare a Scribd company logo
1 of 19
Minia University
Faculty of Engineering
Biomedical Engineering Department
Haplotype Block Partitioning for
NARAC Dataset Using Interval Graph
Modeling of Clusters Algorithm
Authors:
Fatma S. Ibrahim
Mohamed N. Saad
Ashraf M. Said
Hesham F. A. Hamed
Background and introduction
Literature Review
Dataset and system overview
Implementation and findings
Conclusion 2
3
4
5
(SNP) is a mutation at a single nucleotide position, where a possible nucleotide type
is called an allele.
FGT MATILDE
2002
2008
CIT SSLD MIG++
2002
2005
2014
S-MIG++
2016
BigLD
2018
6
7
7
Extracting
Chromosome
21 from entire
NARAC genomic
dataset
NARAC 22 chromosomes input files
NARAC genomic data
(2,062 individuals)
Perl
Choosing chromosome
21 under study
Genotyped
ch21 dataset
Map file for
ch21
NARAC map file
(545,080 SNPs)
R
Chromosomes
separated data
Chromosomes
separated map file
8
System
description
start
NARAC genotype
dataset ch21
NARAC map file
Position ch21
Reformatting Data for
codeGeno function
Imputation of missing data
Biomarker check
Recoding to 0,1,2
Genotype format
Processed
Genotype data
Applying Big-LD algorithm
Haplotype blocks
Plotting the heatmap
Calculating the parameters of the blocks
and make the comparison among methods
9
System
description
start
NARAC genotype
dataset ch21
NARAC map file
Position ch21
Reformatting Data for
codeGeno function
Imputation of missing data
Biomarker check
Recoding to 0,1,2
Genotype format
Processed
Genotype data
Applying Big-LD algorithm
Haplotype blocks
Plotting the heatmap
Calculating the parameters of the blocks
and make the comparison among methods
BigLD
SSLD
CIT
FGT
10
Implementation
Haplotype block partitioning based on BigLD
method
11
Blocks features and
parameters
Haplotype block portioning
based on BigLD method
Pre-processing phase 2
(recoding)
Pre-processing phase 1
(reformatting)
Reading and cropping
The steps of haplotype block partitioning based on BigLD
method
12
Heatmap for the haplotype
blocks detected by interval
graph modeling of clusters for a
portion of chromosome 21 from
9,993,822 bp to 14,137,685 bp.
13
1182 blocks
8051 SNPs
14
7787 SNPs
after
biomarker
check
FGT
CI
SSLD
BigLD
Base-pair position
Plot of a sample of chromosome 21 haplotype blocks produced
by FGT, CIT, SSLD, and Big-LD.
15
16
17
• Big-LD method provided robust blocks partitioning in terms of the
block size and genomic coverage.
• Moreover, it produced the least total number of blocks compared to
CIT, FGT, and SSLD.
• Big-LD produces larger LD blocks compared to other existing
methods
17
18
• The results showed the similar intersections and inconsistencies among
resulted blocks.
• This analysis shows that Big-LD matched more with FGT and produced much
more stable large haplotype blocks for NARAC dataset
18
CIBEC Presentation Fatma Sayed.pptx

More Related Content

Similar to CIBEC Presentation Fatma Sayed.pptx

New methods draft v4alpha small variant benchmark
New methods   draft v4alpha small variant benchmarkNew methods   draft v4alpha small variant benchmark
New methods draft v4alpha small variant benchmarkGenomeInABottle
 
Genome in a Bottle- reference materials to benchmark challenging variants and...
Genome in a Bottle- reference materials to benchmark challenging variants and...Genome in a Bottle- reference materials to benchmark challenging variants and...
Genome in a Bottle- reference materials to benchmark challenging variants and...GenomeInABottle
 
Protocol Type Based Intrusion Detection Using RBF Neural Network
Protocol Type Based Intrusion Detection Using RBF Neural NetworkProtocol Type Based Intrusion Detection Using RBF Neural Network
Protocol Type Based Intrusion Detection Using RBF Neural NetworkWaqas Tariq
 
Comparing Cahn-Ingold-Prelog Rule Implementations
Comparing Cahn-Ingold-Prelog Rule ImplementationsComparing Cahn-Ingold-Prelog Rule Implementations
Comparing Cahn-Ingold-Prelog Rule ImplementationsNextMove Software
 
CRISPR Screening: the What, Why and How
CRISPR Screening: the What, Why and HowCRISPR Screening: the What, Why and How
CRISPR Screening: the What, Why and HowHorizonDiscovery
 
Giab agbt small_var_2019
Giab agbt small_var_2019Giab agbt small_var_2019
Giab agbt small_var_2019GenomeInABottle
 
Making effective use of graphics processing units (GPUs) in computations
Making effective use of graphics processing units (GPUs) in computationsMaking effective use of graphics processing units (GPUs) in computations
Making effective use of graphics processing units (GPUs) in computationsOregon State University
 
GIAB ASHG 2019 Small Variant poster
GIAB ASHG 2019 Small Variant posterGIAB ASHG 2019 Small Variant poster
GIAB ASHG 2019 Small Variant posterGenomeInABottle
 
GPCR PROTEIN FEATURE REPRESENTATION USING DISCRETE WAVELET TRANSFORM AND PART...
GPCR PROTEIN FEATURE REPRESENTATION USING DISCRETE WAVELET TRANSFORM AND PART...GPCR PROTEIN FEATURE REPRESENTATION USING DISCRETE WAVELET TRANSFORM AND PART...
GPCR PROTEIN FEATURE REPRESENTATION USING DISCRETE WAVELET TRANSFORM AND PART...ijma
 
Integration of single molecule, genome mapping data in a web-based genome bro...
Integration of single molecule, genome mapping data in a web-based genome bro...Integration of single molecule, genome mapping data in a web-based genome bro...
Integration of single molecule, genome mapping data in a web-based genome bro...William Chow
 
GPU-accelerated Virtual Screening
GPU-accelerated Virtual ScreeningGPU-accelerated Virtual Screening
GPU-accelerated Virtual ScreeningOlexandr Isayev
 

Similar to CIBEC Presentation Fatma Sayed.pptx (20)

Homology directed repair (HDR) Knock-in
Homology directed repair (HDR) Knock-inHomology directed repair (HDR) Knock-in
Homology directed repair (HDR) Knock-in
 
New methods draft v4alpha small variant benchmark
New methods   draft v4alpha small variant benchmarkNew methods   draft v4alpha small variant benchmark
New methods draft v4alpha small variant benchmark
 
May 15 workshop
May 15  workshopMay 15  workshop
May 15 workshop
 
May workshop
May workshopMay workshop
May workshop
 
Genome in a Bottle- reference materials to benchmark challenging variants and...
Genome in a Bottle- reference materials to benchmark challenging variants and...Genome in a Bottle- reference materials to benchmark challenging variants and...
Genome in a Bottle- reference materials to benchmark challenging variants and...
 
Protocol Type Based Intrusion Detection Using RBF Neural Network
Protocol Type Based Intrusion Detection Using RBF Neural NetworkProtocol Type Based Intrusion Detection Using RBF Neural Network
Protocol Type Based Intrusion Detection Using RBF Neural Network
 
Agbt2015 workshop schneider
Agbt2015 workshop schneiderAgbt2015 workshop schneider
Agbt2015 workshop schneider
 
Comparing Cahn-Ingold-Prelog Rule Implementations
Comparing Cahn-Ingold-Prelog Rule ImplementationsComparing Cahn-Ingold-Prelog Rule Implementations
Comparing Cahn-Ingold-Prelog Rule Implementations
 
P0126557 slides
P0126557 slidesP0126557 slides
P0126557 slides
 
CRISPR Screening: the What, Why and How
CRISPR Screening: the What, Why and HowCRISPR Screening: the What, Why and How
CRISPR Screening: the What, Why and How
 
HUG @ NGCLE@e-Novia 15.11.2017
HUG @ NGCLE@e-Novia 15.11.2017HUG @ NGCLE@e-Novia 15.11.2017
HUG @ NGCLE@e-Novia 15.11.2017
 
Giab agbt small_var_2019
Giab agbt small_var_2019Giab agbt small_var_2019
Giab agbt small_var_2019
 
Making effective use of graphics processing units (GPUs) in computations
Making effective use of graphics processing units (GPUs) in computationsMaking effective use of graphics processing units (GPUs) in computations
Making effective use of graphics processing units (GPUs) in computations
 
GIAB ASHG 2019 Small Variant poster
GIAB ASHG 2019 Small Variant posterGIAB ASHG 2019 Small Variant poster
GIAB ASHG 2019 Small Variant poster
 
Ashg2014 grc workshop_schneider
Ashg2014 grc workshop_schneiderAshg2014 grc workshop_schneider
Ashg2014 grc workshop_schneider
 
2015-03-31_MotifGP
2015-03-31_MotifGP2015-03-31_MotifGP
2015-03-31_MotifGP
 
GPCR PROTEIN FEATURE REPRESENTATION USING DISCRETE WAVELET TRANSFORM AND PART...
GPCR PROTEIN FEATURE REPRESENTATION USING DISCRETE WAVELET TRANSFORM AND PART...GPCR PROTEIN FEATURE REPRESENTATION USING DISCRETE WAVELET TRANSFORM AND PART...
GPCR PROTEIN FEATURE REPRESENTATION USING DISCRETE WAVELET TRANSFORM AND PART...
 
Integration of single molecule, genome mapping data in a web-based genome bro...
Integration of single molecule, genome mapping data in a web-based genome bro...Integration of single molecule, genome mapping data in a web-based genome bro...
Integration of single molecule, genome mapping data in a web-based genome bro...
 
GPU-accelerated Virtual Screening
GPU-accelerated Virtual ScreeningGPU-accelerated Virtual Screening
GPU-accelerated Virtual Screening
 
Jan2016 nabsys giab
Jan2016 nabsys giabJan2016 nabsys giab
Jan2016 nabsys giab
 

More from Fatma Sayed Ibrahim

Introduction to computer architecture .pptx
Introduction to computer architecture .pptxIntroduction to computer architecture .pptx
Introduction to computer architecture .pptxFatma Sayed Ibrahim
 
Introduction to haplotype blocks .pptx
Introduction to haplotype blocks .pptxIntroduction to haplotype blocks .pptx
Introduction to haplotype blocks .pptxFatma Sayed Ibrahim
 
installationoftensorflowandkeras-190310121258.pptx
installationoftensorflowandkeras-190310121258.pptxinstallationoftensorflowandkeras-190310121258.pptx
installationoftensorflowandkeras-190310121258.pptxFatma Sayed Ibrahim
 
Algorithm Implementation of Genetic Association ‎Analysis for Rheumatoid Arth...
Algorithm Implementation of Genetic Association ‎Analysis for Rheumatoid Arth...Algorithm Implementation of Genetic Association ‎Analysis for Rheumatoid Arth...
Algorithm Implementation of Genetic Association ‎Analysis for Rheumatoid Arth...Fatma Sayed Ibrahim
 
Automatic System for Detection and Classification of Brain Tumors
Automatic System for Detection and Classification of Brain TumorsAutomatic System for Detection and Classification of Brain Tumors
Automatic System for Detection and Classification of Brain TumorsFatma Sayed Ibrahim
 
Hospital architecture design planning
Hospital architecture design  planningHospital architecture design  planning
Hospital architecture design planningFatma Sayed Ibrahim
 

More from Fatma Sayed Ibrahim (7)

Introduction to computer architecture .pptx
Introduction to computer architecture .pptxIntroduction to computer architecture .pptx
Introduction to computer architecture .pptx
 
Introduction to haplotype blocks .pptx
Introduction to haplotype blocks .pptxIntroduction to haplotype blocks .pptx
Introduction to haplotype blocks .pptx
 
The steps of R code Master.pptx
The steps of R code Master.pptxThe steps of R code Master.pptx
The steps of R code Master.pptx
 
installationoftensorflowandkeras-190310121258.pptx
installationoftensorflowandkeras-190310121258.pptxinstallationoftensorflowandkeras-190310121258.pptx
installationoftensorflowandkeras-190310121258.pptx
 
Algorithm Implementation of Genetic Association ‎Analysis for Rheumatoid Arth...
Algorithm Implementation of Genetic Association ‎Analysis for Rheumatoid Arth...Algorithm Implementation of Genetic Association ‎Analysis for Rheumatoid Arth...
Algorithm Implementation of Genetic Association ‎Analysis for Rheumatoid Arth...
 
Automatic System for Detection and Classification of Brain Tumors
Automatic System for Detection and Classification of Brain TumorsAutomatic System for Detection and Classification of Brain Tumors
Automatic System for Detection and Classification of Brain Tumors
 
Hospital architecture design planning
Hospital architecture design  planningHospital architecture design  planning
Hospital architecture design planning
 

Recently uploaded

UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduitsrknatarajan
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdfankushspencer015
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)simmis5
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college projectTonystark477637
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 

Recently uploaded (20)

UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduits
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 

CIBEC Presentation Fatma Sayed.pptx

  • 1. Minia University Faculty of Engineering Biomedical Engineering Department Haplotype Block Partitioning for NARAC Dataset Using Interval Graph Modeling of Clusters Algorithm Authors: Fatma S. Ibrahim Mohamed N. Saad Ashraf M. Said Hesham F. A. Hamed
  • 2. Background and introduction Literature Review Dataset and system overview Implementation and findings Conclusion 2
  • 3. 3
  • 4. 4
  • 5. 5 (SNP) is a mutation at a single nucleotide position, where a possible nucleotide type is called an allele.
  • 6. FGT MATILDE 2002 2008 CIT SSLD MIG++ 2002 2005 2014 S-MIG++ 2016 BigLD 2018 6
  • 7. 7 7
  • 8. Extracting Chromosome 21 from entire NARAC genomic dataset NARAC 22 chromosomes input files NARAC genomic data (2,062 individuals) Perl Choosing chromosome 21 under study Genotyped ch21 dataset Map file for ch21 NARAC map file (545,080 SNPs) R Chromosomes separated data Chromosomes separated map file 8
  • 9. System description start NARAC genotype dataset ch21 NARAC map file Position ch21 Reformatting Data for codeGeno function Imputation of missing data Biomarker check Recoding to 0,1,2 Genotype format Processed Genotype data Applying Big-LD algorithm Haplotype blocks Plotting the heatmap Calculating the parameters of the blocks and make the comparison among methods 9
  • 10. System description start NARAC genotype dataset ch21 NARAC map file Position ch21 Reformatting Data for codeGeno function Imputation of missing data Biomarker check Recoding to 0,1,2 Genotype format Processed Genotype data Applying Big-LD algorithm Haplotype blocks Plotting the heatmap Calculating the parameters of the blocks and make the comparison among methods BigLD SSLD CIT FGT 10
  • 12. Blocks features and parameters Haplotype block portioning based on BigLD method Pre-processing phase 2 (recoding) Pre-processing phase 1 (reformatting) Reading and cropping The steps of haplotype block partitioning based on BigLD method 12
  • 13. Heatmap for the haplotype blocks detected by interval graph modeling of clusters for a portion of chromosome 21 from 9,993,822 bp to 14,137,685 bp. 13
  • 14. 1182 blocks 8051 SNPs 14 7787 SNPs after biomarker check
  • 15. FGT CI SSLD BigLD Base-pair position Plot of a sample of chromosome 21 haplotype blocks produced by FGT, CIT, SSLD, and Big-LD. 15
  • 16. 16
  • 17. 17 • Big-LD method provided robust blocks partitioning in terms of the block size and genomic coverage. • Moreover, it produced the least total number of blocks compared to CIT, FGT, and SSLD. • Big-LD produces larger LD blocks compared to other existing methods 17
  • 18. 18 • The results showed the similar intersections and inconsistencies among resulted blocks. • This analysis shows that Big-LD matched more with FGT and produced much more stable large haplotype blocks for NARAC dataset 18

Editor's Notes

  1. I've organized my talk into eight topics shown here.
  2. SNP: single base mutation in DNA
  3. Therefore, Haplotype block partitioning is important for population genetics, association analysis, and genetic epidemiology
  4. This research provided an empirical comparison of haplotype blocks partitioned by four different methods for chromosome 21 of the NARAC dataset The proposed analysis demonstrated the differences and similarities among the most common four methods of haplotype partitioning. This analysis shows that Big-LD matched more with FGT and produced much more stable large haplotype blocks for NARAC dataset