The presentation presents basic concepts and describes various advanced uses of Genome Wide SNP markers for Admixture Analysis and Selection Signatures. The usage of SNP markers has made tremendous progress in recent times and it shall help in application of genomic selection in developing countries including India
Development of a high-throughput high-density SNP genotyping array for bovineAffymetrix
Ali Pirani, Bioinformatics, Affymetrix.Our bioinformatics team first developed a workflow for validating de novo SNPs in animals. They have since applied this successfully in multiple plant species.
Genomic selection changing Breeding programe around the world, talk consist of concept of Breeding, breeding value, Genomic breeding value, Genotype imputation, male calf procurement on basis of GEBV under SAG PT Project and 1000 bull genome project.
Population Structure & Genetic Improvement in LivestockGolden Helix Inc
The genetic improvement of livestock has been a hot topic for almost a century, bringing together researchers, industry, and producers to work towards a common goal. Many countries currently employ extensive genetic selection programs in their cattle with pigs, sheep, and chicken close behind.
In this webcast, Heather J. Huson, Ph.D. from Cornell University will focus on population dynamics and trait association in cattle and goats using high density SNP datasets. Population structure plays a critical role in understanding the relatedness among livestock, ancestral origins of traits, and identification of unique sub-populations or breeds for production improvement and conservation. This also lays the foundation for understanding and improving species such as the goat which is a vital food source in developing countries but has little recorded production or health data.
Understanding population structure is essential for designing complex trait association studies such as those related to production and health characteristics. Here, Huson shows examples of her lab's investigation into population structure in both goats and cattle to identify distinct groups and study traits such as thermo-tolerance.
Research Program Genetic Gains (RPGG) Review Meeting 2021: From Discovery to ...ICRISAT
A number of advances in genetics and genomics research of pigeonpea. These advances have enhanced our understanding of structural and functional aspects of genome and also provided us opportunities to deal with constraints impeding production of pigeonpea in precise and faster manner. Availability of the draft genome sequence and large-scale molecular markers has made it possible to map traits of interest in speedy manner. Although germplasm re-sequencing has already been started in pigeonpea, large-scale germplasm including elite breeding line, landraces and wild species is expected to be fully sequenced very soon.
Axiom® Genome-Wide LAT 1 Array World Array 4Affymetrix
Coverage-Optimized World Arrays: next-generation designs combining GWAS, replication, and fine-mapping into one array. Dense marker coverage of disease-associated SNPs, genes, and regions, plus whole-genome coverage of common and rare variants. Optimized for diverse ancestries including European, African and Native American.
Development of a high-throughput high-density SNP genotyping array for bovineAffymetrix
Ali Pirani, Bioinformatics, Affymetrix.Our bioinformatics team first developed a workflow for validating de novo SNPs in animals. They have since applied this successfully in multiple plant species.
Genomic selection changing Breeding programe around the world, talk consist of concept of Breeding, breeding value, Genomic breeding value, Genotype imputation, male calf procurement on basis of GEBV under SAG PT Project and 1000 bull genome project.
Population Structure & Genetic Improvement in LivestockGolden Helix Inc
The genetic improvement of livestock has been a hot topic for almost a century, bringing together researchers, industry, and producers to work towards a common goal. Many countries currently employ extensive genetic selection programs in their cattle with pigs, sheep, and chicken close behind.
In this webcast, Heather J. Huson, Ph.D. from Cornell University will focus on population dynamics and trait association in cattle and goats using high density SNP datasets. Population structure plays a critical role in understanding the relatedness among livestock, ancestral origins of traits, and identification of unique sub-populations or breeds for production improvement and conservation. This also lays the foundation for understanding and improving species such as the goat which is a vital food source in developing countries but has little recorded production or health data.
Understanding population structure is essential for designing complex trait association studies such as those related to production and health characteristics. Here, Huson shows examples of her lab's investigation into population structure in both goats and cattle to identify distinct groups and study traits such as thermo-tolerance.
Research Program Genetic Gains (RPGG) Review Meeting 2021: From Discovery to ...ICRISAT
A number of advances in genetics and genomics research of pigeonpea. These advances have enhanced our understanding of structural and functional aspects of genome and also provided us opportunities to deal with constraints impeding production of pigeonpea in precise and faster manner. Availability of the draft genome sequence and large-scale molecular markers has made it possible to map traits of interest in speedy manner. Although germplasm re-sequencing has already been started in pigeonpea, large-scale germplasm including elite breeding line, landraces and wild species is expected to be fully sequenced very soon.
Axiom® Genome-Wide LAT 1 Array World Array 4Affymetrix
Coverage-Optimized World Arrays: next-generation designs combining GWAS, replication, and fine-mapping into one array. Dense marker coverage of disease-associated SNPs, genes, and regions, plus whole-genome coverage of common and rare variants. Optimized for diverse ancestries including European, African and Native American.
SNP genotyping using Illumina BeadXpress for germplasm diversity studies in c...ICRISAT
Single nucleotide polymorphisms (SNPs) have become an ideal marker system due to their greater abundance in the plant genomes. In cases like marker-assisted selection (MAS) where only few markers are required for genotyping a set of potential lines, a cost-effective SNP genotyping platform is required. 26 Sep 2012
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...Reid Robison
Tute Genomics is cloud-based software that can rapidly analyze entire human genomes. The cost of whole genome sequencing is dropping rapidly and we are in the middle of a genomic revolution. Tute is opening a new door for personalized medicine by helping researchers & healthcare organizations analyze human genomes.
Next-generation sequencing has enabled clinicians and researchers alike to identify novel genetic variants associated with rare Mendelian Diseases across the human genome. To help enable researchers and clinicians understand the role of CNVs in human health and disease, Golden Helix has integrated a specialized NGS-based CNV caller capable of detecting deletion and duplication events as small as single-exons and as large as whole chromosome aneuploidy events. In this webcast, we will present our workflows that integrates the NGS-based CNV caller into SVS.
Breed composition evaluation based on genetic makersILRI
Presented by Yi Zhang (College of Animal Science and Technology, China Agricultural University, Beijing) at the Inception workshop of the AgriTT project: Evaluation of breed composition, productivity and fitness for smallholder dairy cattle in Tanzania, Dar es Salaam, 10-11 June 2014
Potential for genomic selection in indigenous cattle breeds and results of GWAS in Gir dairy cattle of Gujrat by Dr.Pravin Kandhani and Dr. Vijay Trivedi KAMDHENU UNIVERSITY GANDHINAGAR
The present study was conducted with the aim of reducing the cost of implementing Genomic Selection(GS) by using Genotype imputation methodology in Gir cattle. Application of GS mainly depends upon the cost of genotyping and reduce its cost, imputation approaches have been used. Imputation strategies and GS have been comprehensively studied in several taurine dairy cattle populations but very limited information is available on indigenous populations. Factors that affect the efficiency of imputation and GS are population structure, linkage disequilibrium between markers and differing marker density between indigenous and taurine breeds. The objective of the study was to evaluate the performance of INDUSCHIP-1, a customized Illumina bovine microarray chip for indigenous cattle breeds, designed by National Dairy Development Board, Anand and design one (7-15K) LD panel, and evaluate the performance of two panels of INDUSCHIP-1, and a 13K subset of the same for its imputation accuracy to HD (777K or INDUSCHIP-1 level). Thus, the study was planned with the aim to design LD panel for genotype imputation to INDUSCHIP-1 level with the strategy to maximize the accuracy of imputation in Gir cattle.
Using Public Access Clinical Databases to Interpret NGS VariantsGolden Helix Inc
In this webcast on February 19th, Gabe Rudy, Vice President of Product Development, will showcase publicly available databases and resources available for interpreting rare and novel mutations in the context of his own personal exome obtained through a limited 23andMe pilot in 2012.
The last couple years have seen many changes in well-established resources such as OMIM and dbSNP, while motivating new efforts such as ClinVar and PhenoDB to bring NGS interpretation to clinical grade through a global data sharing effort.
In this webcast, Gabe will cover:
The changing landscape of public annotations: Then, Now, and Soon.
Will the new human reference (GRCh38) released in December be a game changer?
Specific examples of improvements in annotation and algorithms that result in more accurate analysis of his own exome.
The utility and progress of NGS to different clinical applications in terms of public resources: carrier screening, hereditary cancer risk, pharmacogenomics, oncology care, and genetic disorder diagnosis.
Sharing of new clinical data: How both variation and phenotype level data is currently being shared and what will be the way forward to match rare and undiagnosed cases at a global scale.
The increasing availability of SNP (single nucleotide polymorphisms) genotype data in livestock is stimulating the development of new data analysis strategies, which can be applied in animal breeding. One possible application is the prediction of carriers of specific haplotypes, especially if they impact animal health. It is therefore convenient to have a practical and easy-to-implement statistical method for the accurate classification of individuals into carriers and non-carriers. In this paper, we present a procedure for the identification of carriers of the haplotype HH1 on BTA5 (Bos Taurus autosome 5), which is known to be associated with reduced cow fertility in Holstein-Friesian cattle. A population of 1104 Holstein bulls genotyped with the 54K SNP-chip was available for the analysis. There were 45 carriers (5.3%) and 1045 non-carriers (94.7%). Two complementary multivariate statistical techniques were used for the identification of haplotype carriers: Backward Stepwise Selection (BSS) to select the SNP that best fit the model, and Linear Discriminant Analysis (LDA) to classify observations, based on the selected SNP, into carriers and non-carriers. In order to explore the minimum-sized set of SNP that correctly identifies haplotype carriers, different proportions of SNP were tested: 2.5; 10; 15; 30; 50 and 100%. For each proportion of SNP, BSS and LDA were applied, and the classification error rate was estimated in a 10-fold cross-validation scheme. Data were split in 10 subsets. The first subset was treated as validation set, while the model was fit on the remaining nine subsets (the training set). The overall error rate for the prediction of haplotype carriers was on average very low (∼1%) both in the training and in the validation datasets. The error rate was found to depend on the number of SNPs in the model and their density around the region of the haplotype on BTA5. The minimum set of SNPs to achieve accurate predictions was 18, with a total test error rate of 1.27. This paper describes a procedure to accurately identify haplotype carriers from SNP genotypes in cattle populations. Very few misclassifications were observed, which indicates that this is a very reliable approach for potential applications in cattle breeding.
Animal genetic research for Africa—Strategies and opportunities for improving...ILRI
Presented by Richard Osei-Amponsah, University of Ghana, at the Workshop on Animal Genetic Research for Africa (Biosciences for Farming in Africa), Nairobi, 10-11 September 2015
Genome-wide association studies (GWAS) have been providing valuable insight to the genetics of common and complex diseases for many years. In this webcast we will walk through one possible workflow for completing GWAS in Golden Helix SNP & Variation Suite (SVS) with special attention paid to adjusting analysis for population stratification.
The webcast will include:
Visualizations including Manhattan Plots, linkage disequilibrium plots, and genomic annotation sources.
Quality assurance including cryptic relatedness, population stratification, as well as sample and marker statistics.
Genotype association tests and statistics including Corr/Trend tests, logistic and linear regression, Mixed Linear Models, and more.
SNP genotyping using Illumina BeadXpress for germplasm diversity studies in c...ICRISAT
Single nucleotide polymorphisms (SNPs) have become an ideal marker system due to their greater abundance in the plant genomes. In cases like marker-assisted selection (MAS) where only few markers are required for genotyping a set of potential lines, a cost-effective SNP genotyping platform is required. 26 Sep 2012
Towards Precision Medicine: Tute Genomics, a cloud-based application for anal...Reid Robison
Tute Genomics is cloud-based software that can rapidly analyze entire human genomes. The cost of whole genome sequencing is dropping rapidly and we are in the middle of a genomic revolution. Tute is opening a new door for personalized medicine by helping researchers & healthcare organizations analyze human genomes.
Next-generation sequencing has enabled clinicians and researchers alike to identify novel genetic variants associated with rare Mendelian Diseases across the human genome. To help enable researchers and clinicians understand the role of CNVs in human health and disease, Golden Helix has integrated a specialized NGS-based CNV caller capable of detecting deletion and duplication events as small as single-exons and as large as whole chromosome aneuploidy events. In this webcast, we will present our workflows that integrates the NGS-based CNV caller into SVS.
Breed composition evaluation based on genetic makersILRI
Presented by Yi Zhang (College of Animal Science and Technology, China Agricultural University, Beijing) at the Inception workshop of the AgriTT project: Evaluation of breed composition, productivity and fitness for smallholder dairy cattle in Tanzania, Dar es Salaam, 10-11 June 2014
Potential for genomic selection in indigenous cattle breeds and results of GWAS in Gir dairy cattle of Gujrat by Dr.Pravin Kandhani and Dr. Vijay Trivedi KAMDHENU UNIVERSITY GANDHINAGAR
The present study was conducted with the aim of reducing the cost of implementing Genomic Selection(GS) by using Genotype imputation methodology in Gir cattle. Application of GS mainly depends upon the cost of genotyping and reduce its cost, imputation approaches have been used. Imputation strategies and GS have been comprehensively studied in several taurine dairy cattle populations but very limited information is available on indigenous populations. Factors that affect the efficiency of imputation and GS are population structure, linkage disequilibrium between markers and differing marker density between indigenous and taurine breeds. The objective of the study was to evaluate the performance of INDUSCHIP-1, a customized Illumina bovine microarray chip for indigenous cattle breeds, designed by National Dairy Development Board, Anand and design one (7-15K) LD panel, and evaluate the performance of two panels of INDUSCHIP-1, and a 13K subset of the same for its imputation accuracy to HD (777K or INDUSCHIP-1 level). Thus, the study was planned with the aim to design LD panel for genotype imputation to INDUSCHIP-1 level with the strategy to maximize the accuracy of imputation in Gir cattle.
Using Public Access Clinical Databases to Interpret NGS VariantsGolden Helix Inc
In this webcast on February 19th, Gabe Rudy, Vice President of Product Development, will showcase publicly available databases and resources available for interpreting rare and novel mutations in the context of his own personal exome obtained through a limited 23andMe pilot in 2012.
The last couple years have seen many changes in well-established resources such as OMIM and dbSNP, while motivating new efforts such as ClinVar and PhenoDB to bring NGS interpretation to clinical grade through a global data sharing effort.
In this webcast, Gabe will cover:
The changing landscape of public annotations: Then, Now, and Soon.
Will the new human reference (GRCh38) released in December be a game changer?
Specific examples of improvements in annotation and algorithms that result in more accurate analysis of his own exome.
The utility and progress of NGS to different clinical applications in terms of public resources: carrier screening, hereditary cancer risk, pharmacogenomics, oncology care, and genetic disorder diagnosis.
Sharing of new clinical data: How both variation and phenotype level data is currently being shared and what will be the way forward to match rare and undiagnosed cases at a global scale.
The increasing availability of SNP (single nucleotide polymorphisms) genotype data in livestock is stimulating the development of new data analysis strategies, which can be applied in animal breeding. One possible application is the prediction of carriers of specific haplotypes, especially if they impact animal health. It is therefore convenient to have a practical and easy-to-implement statistical method for the accurate classification of individuals into carriers and non-carriers. In this paper, we present a procedure for the identification of carriers of the haplotype HH1 on BTA5 (Bos Taurus autosome 5), which is known to be associated with reduced cow fertility in Holstein-Friesian cattle. A population of 1104 Holstein bulls genotyped with the 54K SNP-chip was available for the analysis. There were 45 carriers (5.3%) and 1045 non-carriers (94.7%). Two complementary multivariate statistical techniques were used for the identification of haplotype carriers: Backward Stepwise Selection (BSS) to select the SNP that best fit the model, and Linear Discriminant Analysis (LDA) to classify observations, based on the selected SNP, into carriers and non-carriers. In order to explore the minimum-sized set of SNP that correctly identifies haplotype carriers, different proportions of SNP were tested: 2.5; 10; 15; 30; 50 and 100%. For each proportion of SNP, BSS and LDA were applied, and the classification error rate was estimated in a 10-fold cross-validation scheme. Data were split in 10 subsets. The first subset was treated as validation set, while the model was fit on the remaining nine subsets (the training set). The overall error rate for the prediction of haplotype carriers was on average very low (∼1%) both in the training and in the validation datasets. The error rate was found to depend on the number of SNPs in the model and their density around the region of the haplotype on BTA5. The minimum set of SNPs to achieve accurate predictions was 18, with a total test error rate of 1.27. This paper describes a procedure to accurately identify haplotype carriers from SNP genotypes in cattle populations. Very few misclassifications were observed, which indicates that this is a very reliable approach for potential applications in cattle breeding.
Animal genetic research for Africa—Strategies and opportunities for improving...ILRI
Presented by Richard Osei-Amponsah, University of Ghana, at the Workshop on Animal Genetic Research for Africa (Biosciences for Farming in Africa), Nairobi, 10-11 September 2015
Genome-wide association studies (GWAS) have been providing valuable insight to the genetics of common and complex diseases for many years. In this webcast we will walk through one possible workflow for completing GWAS in Golden Helix SNP & Variation Suite (SVS) with special attention paid to adjusting analysis for population stratification.
The webcast will include:
Visualizations including Manhattan Plots, linkage disequilibrium plots, and genomic annotation sources.
Quality assurance including cryptic relatedness, population stratification, as well as sample and marker statistics.
Genotype association tests and statistics including Corr/Trend tests, logistic and linear regression, Mixed Linear Models, and more.
Similar to Genome Wide SNPs for Admixture Analysis and Selection Signatures (20)
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...Scintica Instrumentation
Intravital microscopy (IVM) is a powerful tool utilized to study cellular behavior over time and space in vivo. Much of our understanding of cell biology has been accomplished using various in vitro and ex vivo methods; however, these studies do not necessarily reflect the natural dynamics of biological processes. Unlike traditional cell culture or fixed tissue imaging, IVM allows for the ultra-fast high-resolution imaging of cellular processes over time and space and were studied in its natural environment. Real-time visualization of biological processes in the context of an intact organism helps maintain physiological relevance and provide insights into the progression of disease, response to treatments or developmental processes.
In this webinar we give an overview of advanced applications of the IVM system in preclinical research. IVIM technology is a provider of all-in-one intravital microscopy systems and solutions optimized for in vivo imaging of live animal models at sub-micron resolution. The system’s unique features and user-friendly software enables researchers to probe fast dynamic biological processes such as immune cell tracking, cell-cell interaction as well as vascularization and tumor metastasis with exceptional detail. This webinar will also give an overview of IVM being utilized in drug development, offering a view into the intricate interaction between drugs/nanoparticles and tissues in vivo and allows for the evaluation of therapeutic intervention in a variety of tissues and organs. This interdisciplinary collaboration continues to drive the advancements of novel therapeutic strategies.
Richard's entangled aventures in wonderlandRichard Gill
Since the loophole-free Bell experiments of 2020 and the Nobel prizes in physics of 2022, critics of Bell's work have retreated to the fortress of super-determinism. Now, super-determinism is a derogatory word - it just means "determinism". Palmer, Hance and Hossenfelder argue that quantum mechanics and determinism are not incompatible, using a sophisticated mathematical construction based on a subtle thinning of allowed states and measurements in quantum mechanics, such that what is left appears to make Bell's argument fail, without altering the empirical predictions of quantum mechanics. I think however that it is a smoke screen, and the slogan "lost in math" comes to my mind. I will discuss some other recent disproofs of Bell's theorem using the language of causality based on causal graphs. Causal thinking is also central to law and justice. I will mention surprising connections to my work on serial killer nurse cases, in particular the Dutch case of Lucia de Berk and the current UK case of Lucy Letby.
Introduction:
RNA interference (RNAi) or Post-Transcriptional Gene Silencing (PTGS) is an important biological process for modulating eukaryotic gene expression.
It is highly conserved process of posttranscriptional gene silencing by which double stranded RNA (dsRNA) causes sequence-specific degradation of mRNA sequences.
dsRNA-induced gene silencing (RNAi) is reported in a wide range of eukaryotes ranging from worms, insects, mammals and plants.
This process mediates resistance to both endogenous parasitic and exogenous pathogenic nucleic acids, and regulates the expression of protein-coding genes.
What are small ncRNAs?
micro RNA (miRNA)
short interfering RNA (siRNA)
Properties of small non-coding RNA:
Involved in silencing mRNA transcripts.
Called “small” because they are usually only about 21-24 nucleotides long.
Synthesized by first cutting up longer precursor sequences (like the 61nt one that Lee discovered).
Silence an mRNA by base pairing with some sequence on the mRNA.
Discovery of siRNA?
The first small RNA:
In 1993 Rosalind Lee (Victor Ambros lab) was studying a non- coding gene in C. elegans, lin-4, that was involved in silencing of another gene, lin-14, at the appropriate time in the
development of the worm C. elegans.
Two small transcripts of lin-4 (22nt and 61nt) were found to be complementary to a sequence in the 3' UTR of lin-14.
Because lin-4 encoded no protein, she deduced that it must be these transcripts that are causing the silencing by RNA-RNA interactions.
Types of RNAi ( non coding RNA)
MiRNA
Length (23-25 nt)
Trans acting
Binds with target MRNA in mismatch
Translation inhibition
Si RNA
Length 21 nt.
Cis acting
Bind with target Mrna in perfect complementary sequence
Piwi-RNA
Length ; 25 to 36 nt.
Expressed in Germ Cells
Regulates trnasposomes activity
MECHANISM OF RNAI:
First the double-stranded RNA teams up with a protein complex named Dicer, which cuts the long RNA into short pieces.
Then another protein complex called RISC (RNA-induced silencing complex) discards one of the two RNA strands.
The RISC-docked, single-stranded RNA then pairs with the homologous mRNA and destroys it.
THE RISC COMPLEX:
RISC is large(>500kD) RNA multi- protein Binding complex which triggers MRNA degradation in response to MRNA
Unwinding of double stranded Si RNA by ATP independent Helicase
Active component of RISC is Ago proteins( ENDONUCLEASE) which cleave target MRNA.
DICER: endonuclease (RNase Family III)
Argonaute: Central Component of the RNA-Induced Silencing Complex (RISC)
One strand of the dsRNA produced by Dicer is retained in the RISC complex in association with Argonaute
ARGONAUTE PROTEIN :
1.PAZ(PIWI/Argonaute/ Zwille)- Recognition of target MRNA
2.PIWI (p-element induced wimpy Testis)- breaks Phosphodiester bond of mRNA.)RNAse H activity.
MiRNA:
The Double-stranded RNAs are naturally produced in eukaryotic cells during development, and they have a key role in regulating gene expression .
A brief information about the SCOP protein database used in bioinformatics.
The Structural Classification of Proteins (SCOP) database is a comprehensive and authoritative resource for the structural and evolutionary relationships of proteins. It provides a detailed and curated classification of protein structures, grouping them into families, superfamilies, and folds based on their structural and sequence similarities.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.Sérgio Sacani
The return of a sample of near-surface atmosphere from Mars would facilitate answers to several first-order science questions surrounding the formation and evolution of the planet. One of the important aspects of terrestrial planet formation in general is the role that primary atmospheres played in influencing the chemistry and structure of the planets and their antecedents. Studies of the martian atmosphere can be used to investigate the role of a primary atmosphere in its history. Atmosphere samples would also inform our understanding of the near-surface chemistry of the planet, and ultimately the prospects for life. High-precision isotopic analyses of constituent gases are needed to address these questions, requiring that the analyses are made on returned samples rather than in situ.
Richard's aventures in two entangled wonderlandsRichard Gill
Since the loophole-free Bell experiments of 2020 and the Nobel prizes in physics of 2022, critics of Bell's work have retreated to the fortress of super-determinism. Now, super-determinism is a derogatory word - it just means "determinism". Palmer, Hance and Hossenfelder argue that quantum mechanics and determinism are not incompatible, using a sophisticated mathematical construction based on a subtle thinning of allowed states and measurements in quantum mechanics, such that what is left appears to make Bell's argument fail, without altering the empirical predictions of quantum mechanics. I think however that it is a smoke screen, and the slogan "lost in math" comes to my mind. I will discuss some other recent disproofs of Bell's theorem using the language of causality based on causal graphs. Causal thinking is also central to law and justice. I will mention surprising connections to my work on serial killer nurse cases, in particular the Dutch case of Lucia de Berk and the current UK case of Lucy Letby.
This pdf is about the Schizophrenia.
For more details visit on YouTube; @SELF-EXPLANATORY;
https://www.youtube.com/channel/UCAiarMZDNhe1A3Rnpr_WkzA/videos
Thanks...!
Genome Wide SNPs for Admixture Analysis and Selection Signatures
1. Genome Wide SNPs for Admixture Analysis and Selection
Signatures
Dr. Sheikh Firdous Ahmad
Scientist
Division of Animal Genetics
ICAR-Indian Veterinary Research Institute, Izatnagar, 243 122
7. Investigation of genetic diversity/population structure
Genomic signatures of selection, Admixture studies
Identification of genetic variants and QTLs related to economically
important traits
Genome-wide association studies (GWAS), copy number variation
Genomic selection, CNV detection and association analysis
GENOMICS AND LIVESTOCK
1
2
3
4
5
8. Elite indigenous/ exotic
breed (A) Bull
Non-descript breed (B) cow
F1 progeny
(50% A + 50% B)
Backcross progeny
75% A + 25% B
Bull of breed A (100%)
--- after 6-8
Generations --
Vrindavani
Frieswal
Karan Fries
Karan Swiss
Phule Triveni
CROSSBREEDING IN INDIA
9. x
1/2 1/2
x
3/4 1/4
A crossbred/
composite
population
Inheritance generally ranges between 12.5 and 87.5
percent of two breeds
ADMIXTURE GENERATION
Admixture proportion related to production, reproduction and fitness performance
Studying any trait in crossbred population --- take care of admixture correction
12. Genome-wide data generation is based on hybridization and
imaging principles
Already fixed primers (specific for SNPs) at specific locations
GENERATION OF GENOME-WIDE DATA
Genotyping or sequencing
Whole-genome
sequencing
Reduced representation
approaches
BeadChip
Genotyping
13. GENOMIC ERA IN LIVESTOCK
SNP Beadchip Array principle
Immobilized ssDNA probes on the chip Hybridization Scanning
14. Bovine 50K SNP
chip
2008 2008 2009 2009 2010 2012
Porcine 60K SNP
chip
Ovine 50K SNP
chip
Bovine HD chip
(777K)
Chicken 60K SNP
chip
Caprine 50K SNP
chip
Micro-array bead chips
available for cattle
Buffalo 90K SNP
chip
2017
Chip SNPs
3K 2,900
LD (7K) 6909
LD2(7K) 9912
50K V1 54,001
50K V2 54,609
50K V3 53,714
HD(777K) 777,962
Chip SNPs
G 7K 7083
GGP 9K 8762
GP220K 19,809
GP3 27K 26,151
GP4 30K 30,112
GHD 75K 77,068
GGP 100K 100,000
GH2 140K 1,39,480
Chip SNPs
Affy10K 9713
Affy15K 15,036
Affy25K 25,068
Affy700K 6,48,875
BEADCHIP AVAILABILITY FOR LIVESTOCK
15. Breeds used for development of these chips
Angus
Beefmaster
Bos indicus Gyr
Bos indicus Nelore
Brahman
Charolais
Guernsey
Hereford
Holstein
Jersey
Limousin
N’Dama
Santa Gertrudis
Sheko
Red Angus
Romagnola
Novel SNPs derived from sequencing
HapMap data
Btau assembly SNPs
Whole-genome shortgun reads
Holstein AC sequence data
Other sources
Content sources
GENOME-WIDE DATA – DEVELOPMENT OF SNP ARRAY
Bovine 50K BeadChip
16. Data formats
PLINK format EIGENSTRAT format HapMap format
.ped and .map format
Binary file format
VCF format
GENOME-WIDE DATA FORMATS
17. PED file is white-space (space or tab) delimited file
With six mandatory columns
Column 7 onwards
Genotypes at fixed chromosomal coordinates
MAP file pertains to chromosomal coordinates of
SNPs in PED files
Mandatory 4 columns
Chromosome
rs# or SNP identifier
Genetic distance (morgans)
Base pair position (bp units)
Family ID
Individual ID
Paternal ID
Maternal ID
Sex info
Phenotype
X-chr -- 23
Y chr -- 24
PAR (X) -- 25
Mit. SNP -- 26
GENOME-WIDE DATA FORMATS
Plink format files
19. Binary file format
.bed file
Binary PED file
.fam file
Pedigree/phenotype file
.bim file
Extended MAP file
Binary file
Genotype information
First six columns of
PED file
Contains two extra
columns
Allele names
GENOME-WIDE DATA FORMATS
20. Assess the admixture and ancestry levels of a particular population
Global ancestry: Ancestry proportions averaged across the genome
of an individual
Local ancestry: Inferences made at a certain loci –
Whether the admixed individual inherited both, single or no alleles
from ancestral/ founder population
ACTUAL ANALYSIS USING GENOME-WIDE SNP DATA
21. Remove sex-chromosome and Mitochondrial SNP markers
Filter based on
Genotype coverage for markers: Markers whose genotype is known for less than X%
of individuals from at least one population will be ignored
Genotype coverage of individuals: Individuals whose genotype is known for less than
X% of markers will be ignored
Minor allele frequency (MAF): Markers for which MAF<X within the selected
dataset will be ignored
Hardy-Weinberg equilibrium: Markers for which pValue<X for at least one
population will be ignored
GENOME-WIDE SNP DATA PRUNING
23. Input genome-wide SNP data Admixture analysis
Likelihood based approach
Based on estimating log likelihood
ADMIXTURE software
Model-based Bayesian approach
Based on priori distribution
Structure software for admixture
analysis
Approaches available
STATISTICAL APPROACHES FOR ADMIXTURE ANALYSIS
24. Probability that a person named X will attend
an online lecture
Basic answer -- 1/2
Timing of training – office
timing – more participation
1/n1
Age of participant-- 1/n2
Importance and
relevance of topic-- 1/n3
Background
information
SNP1
SNP2
SNP3
SNP4
SNP5
SNP6
SNP7
SNP8
SNP9
SNP -
n
ADMIXTURE ANALYSIS – BAYESIAN STATISTICS
28. India is home to 50 cattle breeds Prioritize 5 topmost breeds
Based on population size and milk production status
Tharparkar
Gir
Sahiwal Kankrej Rathi
Design selective breeding programmes with open nucleus herds for their propagation
Progeny testing programmes
MANAGING PUREBRED INDIGENOUS POPULATION
30. Gene
frequency = 0
Gene frequency
>0=x<1
Gene
frequency ~1
SELECTION SIGNATURES
Interested in studying evidences of evolutionary pressures?
Normal gene frequency
Evolutionary forces Evolutionary forces
31. METHODS OF DETECTING SELECTION SIGNATURES
Nature of signatures being investigated Recent versus ancient signatures
Fst statistic -- --50000-75000 years back ~2000-3000 genx.
Reduction in genetic diversity
Tajima’s D statistic – 250,000 years back – 10000 genx.
Tajima’s D statistic – Accounts for ascertainment bias
1
2
Fay and Wu’s H
Most recent positive signatures
Haplotype analysis
Within population iHS methodology
Across population XP-EHH methodology
Extended haplotype homozygosity
3
Random drift Population bottleneck Population expansion
Beware
33. METHODS OF DETECTING SELECTION SIGNATURES
Adopted from https://doi.org/10.1016/j.livsci.2020.104257
34. METHODS OF DETECTING SELECTION SIGNATURES
Application Use in workflow
R programming
software >3.0
Rehh and detectRUNS package and customization
Rehh v1.11 Calculation of iHS and EHH measures
vcfools Calculate FST and Tajima’s D statistic
SHAPEIT Produce phased haplotype files
Beagle Phasing and imputation
SelScan Various measurements related to selection sweeps
Variscan Calculation of Fay and Wu’s H statistic
35. National Mission on Bovine Productivity
• Establishment of National Bovine Genomic Centre for Indigenous
Breeds (NBGC-IB)
• Autonomous body for undertaking all activities related to the
introduction of genomic selection
INITIATIVES IN INDIA W.R.T. GENOMIC SELECTION
36. 36
• BAIF (Bharatiya Agro Industries Foundation)
(http://www.baif.org.in)
• INAPH (Information Network for Animal Productivity and
Health) recording system of NDDB – integration with state
AHDs
INITIATIVES IN INDIA W.R.T. GENOMIC SELECTION