The document discusses the development and applications of plant pangenomics. It begins by defining what a pangenome is and explaining the difference between core and dispensable genes. It then provides a timeline of key developments in pangenomic research. Some of the major driving forces shaping structural variation in plant pangenomes are discussed. The processes of generating a pangenome through de novo and reference-based assembly methods are outlined. Two case studies on constructing chickpea and rice pangenomes are summarized. Applications of pangenomics in plant genetic studies and breeding like domestication, heterosis, and identifying rare alleles are highlighted.
Role of Pangenomics for crop ImprovementPatelSupriya
It describes about the role of pangenomics in the crop improvement.It includes pangenome,superpangenome,databases,tools used in pangenomics,utilisation in crop improvement
Genome to pangenome : A doorway into crops genome explorationKiranKm11
This seminar underpins the significance and need of formulating pan-genome oriented crop improvement strategies over single reference genome based studies. Pangenome graphs uncovers large repository of genetic variation which could we useful for planning and executing strategic crop improvement programmed
Association mapping approaches for tagging quality traits in maizeSenthil Natesan
Association mapping has been widely used to study the genetic basis of complex traits in human and animal systems and is a very efficient and effective method for confirming candidate genes or for identifying new genes (Altshuler et al., 2008). Association mapping is now being increasingly used in a wide range of plants (Rafalski, 2010), where it appears to be more powerful than in humans or animals (Zhu et al., 2008). Unlike linkage mapping, association mapping can explore all the recombination events and mutations in a given population and with a higher resolution (Yu and Buckler, 2006). However, association mapping has a lower power to detect rare alleles in a population, even those with large effects, than linkage mapping (Hill et al., 2008). Yan et al., (2010) demonstrated that the gene encoding β-carotene hydroxylase 1 (crtRB1) underlies a principal quantitative trait locus associated with β-carotene concentration and conversion in maize kernels has been identified through candidate gene strategy of association mapping.
AgBioData: Complexity and Diversity of the Pan-Genome AgBioData
A pan-genome represents the full complement of diversity within a clade, or the union of all genes or SNPs across a representative selection of genomes. One of the first pan-genomes was that of Streptococcus agalactiae, introduced in 2005. Since then, with the acceleration of whole-genome sequencing technology, pan-genomes have been generated across a wide range of multicellular eukaryotes. This presentation will outline the history of pan-genomes, the categories of pan-genomes, advances in pan-genome assessment, and the challenges of representing the diversity of a taxonomic clade in complex eukaryotes.
Role of Pangenomics for crop ImprovementPatelSupriya
It describes about the role of pangenomics in the crop improvement.It includes pangenome,superpangenome,databases,tools used in pangenomics,utilisation in crop improvement
Genome to pangenome : A doorway into crops genome explorationKiranKm11
This seminar underpins the significance and need of formulating pan-genome oriented crop improvement strategies over single reference genome based studies. Pangenome graphs uncovers large repository of genetic variation which could we useful for planning and executing strategic crop improvement programmed
Association mapping approaches for tagging quality traits in maizeSenthil Natesan
Association mapping has been widely used to study the genetic basis of complex traits in human and animal systems and is a very efficient and effective method for confirming candidate genes or for identifying new genes (Altshuler et al., 2008). Association mapping is now being increasingly used in a wide range of plants (Rafalski, 2010), where it appears to be more powerful than in humans or animals (Zhu et al., 2008). Unlike linkage mapping, association mapping can explore all the recombination events and mutations in a given population and with a higher resolution (Yu and Buckler, 2006). However, association mapping has a lower power to detect rare alleles in a population, even those with large effects, than linkage mapping (Hill et al., 2008). Yan et al., (2010) demonstrated that the gene encoding β-carotene hydroxylase 1 (crtRB1) underlies a principal quantitative trait locus associated with β-carotene concentration and conversion in maize kernels has been identified through candidate gene strategy of association mapping.
AgBioData: Complexity and Diversity of the Pan-Genome AgBioData
A pan-genome represents the full complement of diversity within a clade, or the union of all genes or SNPs across a representative selection of genomes. One of the first pan-genomes was that of Streptococcus agalactiae, introduced in 2005. Since then, with the acceleration of whole-genome sequencing technology, pan-genomes have been generated across a wide range of multicellular eukaryotes. This presentation will outline the history of pan-genomes, the categories of pan-genomes, advances in pan-genome assessment, and the challenges of representing the diversity of a taxonomic clade in complex eukaryotes.
Association mapping, also known as "linkage disequilibrium mapping", is a method of mapping quantitative trait loci (QTLs) that takes advantage of linkage disequilibrium to link phenotypes to genotypes.Varioius strategey involved in association mapping is discussed in this presentation
A concise and well fabricated presentation the current techniques used for plant genome editing including CRISPER/cas9 system, TALENS, TELES, ZINC FINGER NUCLEASES(ZFN), HEJ (homologous endjoing) and many other high throughout techniques along references.
Quantitative trait loci (QTL) analysis and its applications in plant breedingPGS
Abstract
Many agriculturally important traits such as grain yield, protein content and relative disease resistance are controlled by many genes and are known as quantitative traits (also polygenic or complex traits). A quantitative trait depends on the cumulative actions of many genes and the environment. The genomic regions that contain genes associated with a quantitative trait are known as quantitative trait loci (QTLs). Thus, a QTL could be defined as a genomic region responsible for a part of the observed phenotypic variation for a quantitative trait. A QTL can be a single gene or a cluster of linked genes that affect the trait. The effects of individual QTLs may differ from each other and change from environment to environment. The genetics of a quantitative trait can often be deduced from the statistical analysis of several segregating populations. Recently, by using molecular markers, it is feasible to analyze quantitative traits and identify individual QTLs or genes controlling the traits of interest in breeding programs.
Multiple inbred founder lines are inter-mated for several generations prior to creating inbred lines, resulting in a diverse population whose genomes are fine scale mosaics of contributions from all founders.
Genomics, proteomics and metabolomics are the three core omics technologies, which respectively deal with the analysis of genome, proteome and metabolome of cells and tissues of an organism.
Genomic aided selection for crop improvementtanvic2
In last Several years novel genetic and genomics approaches are expended. Genetics and genomics have greatly enhanced our understanding of the structural and functional aspects of plant genomes.
Presentation delivered by Dr. Jesse Poland (Kansas State University, USA) at Borlaug Summit on Wheat for Food Security. March 25 - 28, 2014, Ciudad Obregon, Mexico.
http://www.borlaug100.org
Association mapping, also known as "linkage disequilibrium mapping", is a method of mapping quantitative trait loci (QTLs) that takes advantage of linkage disequilibrium to link phenotypes to genotypes.Varioius strategey involved in association mapping is discussed in this presentation
A concise and well fabricated presentation the current techniques used for plant genome editing including CRISPER/cas9 system, TALENS, TELES, ZINC FINGER NUCLEASES(ZFN), HEJ (homologous endjoing) and many other high throughout techniques along references.
Quantitative trait loci (QTL) analysis and its applications in plant breedingPGS
Abstract
Many agriculturally important traits such as grain yield, protein content and relative disease resistance are controlled by many genes and are known as quantitative traits (also polygenic or complex traits). A quantitative trait depends on the cumulative actions of many genes and the environment. The genomic regions that contain genes associated with a quantitative trait are known as quantitative trait loci (QTLs). Thus, a QTL could be defined as a genomic region responsible for a part of the observed phenotypic variation for a quantitative trait. A QTL can be a single gene or a cluster of linked genes that affect the trait. The effects of individual QTLs may differ from each other and change from environment to environment. The genetics of a quantitative trait can often be deduced from the statistical analysis of several segregating populations. Recently, by using molecular markers, it is feasible to analyze quantitative traits and identify individual QTLs or genes controlling the traits of interest in breeding programs.
Multiple inbred founder lines are inter-mated for several generations prior to creating inbred lines, resulting in a diverse population whose genomes are fine scale mosaics of contributions from all founders.
Genomics, proteomics and metabolomics are the three core omics technologies, which respectively deal with the analysis of genome, proteome and metabolome of cells and tissues of an organism.
Genomic aided selection for crop improvementtanvic2
In last Several years novel genetic and genomics approaches are expended. Genetics and genomics have greatly enhanced our understanding of the structural and functional aspects of plant genomes.
Presentation delivered by Dr. Jesse Poland (Kansas State University, USA) at Borlaug Summit on Wheat for Food Security. March 25 - 28, 2014, Ciudad Obregon, Mexico.
http://www.borlaug100.org
Genomics platform for agriculture-CAT lectureSenthil Natesan
The popular lecture for the undergraduate students of agriculture to know about the application of biotechnology in agriculture science graduates. Some of the major break through inventions how it impact on agriculture research and development
CD Genomics is dedicated to providing a comprehensive list of genomics and microarray solutions for agriculture, including genome, exome, transcriptome, and metagenome sequencing, genome-wide association studies (GWAS), and targeted sequencing and genotyping that focus on a subset of regions or genes such as single nucleotide polymorphisms (SNPs). https://www.cd-genomics.com/Transcriptomics.html
Genotyping by Sequencing is a robust,fast and cheap approach for high throughput marker discovery.It has applications in crop improvement programs by enhancing identification of superior genotypes.
Functional genomics is a general approach toward understanding how the genes of an organism work together by assigning new functions to unknown genes. Information about the hypothesized function of an unknown gene may be deduced from its sequence structure using already known functions of similar genes as the basis for comparison. Gene function analysis therefore necessitates the analysis of temporal and spatial gene expression patterns (Yunbi Xu et al , Plant Molecular Biology (2005) ).
A number of developments have been made in the molecular biology of oat (Avena spp.) in recent years. Many of these were recently described at the Fourth International Oat Conference, held on 18 to 23 October, in Adelaide, South Australia. These advances include a report of oat transformation and regeneration, the characterisation of J3-glucanase genes in oat, the further development of a molecular genetic map in oats, and the characterisation of genes encoding novel oat grain proteins. A technique for assessing pedigrees in the oat and other cereal crops has been reported using a modified electrophoretic technique.
This pdf is about the Schizophrenia.
For more details visit on YouTube; @SELF-EXPLANATORY;
https://www.youtube.com/channel/UCAiarMZDNhe1A3Rnpr_WkzA/videos
Thanks...!
Introduction:
RNA interference (RNAi) or Post-Transcriptional Gene Silencing (PTGS) is an important biological process for modulating eukaryotic gene expression.
It is highly conserved process of posttranscriptional gene silencing by which double stranded RNA (dsRNA) causes sequence-specific degradation of mRNA sequences.
dsRNA-induced gene silencing (RNAi) is reported in a wide range of eukaryotes ranging from worms, insects, mammals and plants.
This process mediates resistance to both endogenous parasitic and exogenous pathogenic nucleic acids, and regulates the expression of protein-coding genes.
What are small ncRNAs?
micro RNA (miRNA)
short interfering RNA (siRNA)
Properties of small non-coding RNA:
Involved in silencing mRNA transcripts.
Called “small” because they are usually only about 21-24 nucleotides long.
Synthesized by first cutting up longer precursor sequences (like the 61nt one that Lee discovered).
Silence an mRNA by base pairing with some sequence on the mRNA.
Discovery of siRNA?
The first small RNA:
In 1993 Rosalind Lee (Victor Ambros lab) was studying a non- coding gene in C. elegans, lin-4, that was involved in silencing of another gene, lin-14, at the appropriate time in the
development of the worm C. elegans.
Two small transcripts of lin-4 (22nt and 61nt) were found to be complementary to a sequence in the 3' UTR of lin-14.
Because lin-4 encoded no protein, she deduced that it must be these transcripts that are causing the silencing by RNA-RNA interactions.
Types of RNAi ( non coding RNA)
MiRNA
Length (23-25 nt)
Trans acting
Binds with target MRNA in mismatch
Translation inhibition
Si RNA
Length 21 nt.
Cis acting
Bind with target Mrna in perfect complementary sequence
Piwi-RNA
Length ; 25 to 36 nt.
Expressed in Germ Cells
Regulates trnasposomes activity
MECHANISM OF RNAI:
First the double-stranded RNA teams up with a protein complex named Dicer, which cuts the long RNA into short pieces.
Then another protein complex called RISC (RNA-induced silencing complex) discards one of the two RNA strands.
The RISC-docked, single-stranded RNA then pairs with the homologous mRNA and destroys it.
THE RISC COMPLEX:
RISC is large(>500kD) RNA multi- protein Binding complex which triggers MRNA degradation in response to MRNA
Unwinding of double stranded Si RNA by ATP independent Helicase
Active component of RISC is Ago proteins( ENDONUCLEASE) which cleave target MRNA.
DICER: endonuclease (RNase Family III)
Argonaute: Central Component of the RNA-Induced Silencing Complex (RISC)
One strand of the dsRNA produced by Dicer is retained in the RISC complex in association with Argonaute
ARGONAUTE PROTEIN :
1.PAZ(PIWI/Argonaute/ Zwille)- Recognition of target MRNA
2.PIWI (p-element induced wimpy Testis)- breaks Phosphodiester bond of mRNA.)RNAse H activity.
MiRNA:
The Double-stranded RNAs are naturally produced in eukaryotic cells during development, and they have a key role in regulating gene expression .
Nutraceutical market, scope and growth: Herbal drug technologyLokesh Patil
As consumer awareness of health and wellness rises, the nutraceutical market—which includes goods like functional meals, drinks, and dietary supplements that provide health advantages beyond basic nutrition—is growing significantly. As healthcare expenses rise, the population ages, and people want natural and preventative health solutions more and more, this industry is increasing quickly. Further driving market expansion are product formulation innovations and the use of cutting-edge technology for customized nutrition. With its worldwide reach, the nutraceutical industry is expected to keep growing and provide significant chances for research and investment in a number of categories, including vitamins, minerals, probiotics, and herbal supplements.
The increased availability of biomedical data, particularly in the public domain, offers the opportunity to better understand human health and to develop effective therapeutics for a wide range of unmet medical needs. However, data scientists remain stymied by the fact that data remain hard to find and to productively reuse because data and their metadata i) are wholly inaccessible, ii) are in non-standard or incompatible representations, iii) do not conform to community standards, and iv) have unclear or highly restricted terms and conditions that preclude legitimate reuse. These limitations require a rethink on data can be made machine and AI-ready - the key motivation behind the FAIR Guiding Principles. Concurrently, while recent efforts have explored the use of deep learning to fuse disparate data into predictive models for a wide range of biomedical applications, these models often fail even when the correct answer is already known, and fail to explain individual predictions in terms that data scientists can appreciate. These limitations suggest that new methods to produce practical artificial intelligence are still needed.
In this talk, I will discuss our work in (1) building an integrative knowledge infrastructure to prepare FAIR and "AI-ready" data and services along with (2) neurosymbolic AI methods to improve the quality of predictions and to generate plausible explanations. Attention is given to standards, platforms, and methods to wrangle knowledge into simple, but effective semantic and latent representations, and to make these available into standards-compliant and discoverable interfaces that can be used in model building, validation, and explanation. Our work, and those of others in the field, creates a baseline for building trustworthy and easy to deploy AI models in biomedicine.
Bio
Dr. Michel Dumontier is the Distinguished Professor of Data Science at Maastricht University, founder and executive director of the Institute of Data Science, and co-founder of the FAIR (Findable, Accessible, Interoperable and Reusable) data principles. His research explores socio-technological approaches for responsible discovery science, which includes collaborative multi-modal knowledge graphs, privacy-preserving distributed data mining, and AI methods for drug discovery and personalized medicine. His work is supported through the Dutch National Research Agenda, the Netherlands Organisation for Scientific Research, Horizon Europe, the European Open Science Cloud, the US National Institutes of Health, and a Marie-Curie Innovative Training Network. He is the editor-in-chief for the journal Data Science and is internationally recognized for his contributions in bioinformatics, biomedical informatics, and semantic technologies including ontologies and linked data.
Multi-source connectivity as the driver of solar wind variability in the heli...Sérgio Sacani
The ambient solar wind that flls the heliosphere originates from multiple
sources in the solar corona and is highly structured. It is often described
as high-speed, relatively homogeneous, plasma streams from coronal
holes and slow-speed, highly variable, streams whose source regions are
under debate. A key goal of ESA/NASA’s Solar Orbiter mission is to identify
solar wind sources and understand what drives the complexity seen in the
heliosphere. By combining magnetic feld modelling and spectroscopic
techniques with high-resolution observations and measurements, we show
that the solar wind variability detected in situ by Solar Orbiter in March
2022 is driven by spatio-temporal changes in the magnetic connectivity to
multiple sources in the solar atmosphere. The magnetic feld footpoints
connected to the spacecraft moved from the boundaries of a coronal hole
to one active region (12961) and then across to another region (12957). This
is refected in the in situ measurements, which show the transition from fast
to highly Alfvénic then to slow solar wind that is disrupted by the arrival of
a coronal mass ejection. Our results describe solar wind variability at 0.5 au
but are applicable to near-Earth observatories.
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Sérgio Sacani
Since volcanic activity was first discovered on Io from Voyager images in 1979, changes
on Io’s surface have been monitored from both spacecraft and ground-based telescopes.
Here, we present the highest spatial resolution images of Io ever obtained from a groundbased telescope. These images, acquired by the SHARK-VIS instrument on the Large
Binocular Telescope, show evidence of a major resurfacing event on Io’s trailing hemisphere. When compared to the most recent spacecraft images, the SHARK-VIS images
show that a plume deposit from a powerful eruption at Pillan Patera has covered part
of the long-lived Pele plume deposit. Although this type of resurfacing event may be common on Io, few have been detected due to the rarity of spacecraft visits and the previously low spatial resolution available from Earth-based telescopes. The SHARK-VIS instrument ushers in a new era of high resolution imaging of Io’s surface using adaptive
optics at visible wavelengths.
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...Scintica Instrumentation
Intravital microscopy (IVM) is a powerful tool utilized to study cellular behavior over time and space in vivo. Much of our understanding of cell biology has been accomplished using various in vitro and ex vivo methods; however, these studies do not necessarily reflect the natural dynamics of biological processes. Unlike traditional cell culture or fixed tissue imaging, IVM allows for the ultra-fast high-resolution imaging of cellular processes over time and space and were studied in its natural environment. Real-time visualization of biological processes in the context of an intact organism helps maintain physiological relevance and provide insights into the progression of disease, response to treatments or developmental processes.
In this webinar we give an overview of advanced applications of the IVM system in preclinical research. IVIM technology is a provider of all-in-one intravital microscopy systems and solutions optimized for in vivo imaging of live animal models at sub-micron resolution. The system’s unique features and user-friendly software enables researchers to probe fast dynamic biological processes such as immune cell tracking, cell-cell interaction as well as vascularization and tumor metastasis with exceptional detail. This webinar will also give an overview of IVM being utilized in drug development, offering a view into the intricate interaction between drugs/nanoparticles and tissues in vivo and allows for the evaluation of therapeutic intervention in a variety of tissues and organs. This interdisciplinary collaboration continues to drive the advancements of novel therapeutic strategies.
Lateral Ventricles.pdf very easy good diagrams comprehensive
Pangenomics.pptx
1.
2. UNIVERSITY OF AGRICULTURAL SCIENCES, BANGALORE
DEPARTMENT OF GENETICS AND PLANT BREEDING
PRESENTED BY-
MARUTHI PRASAD B P
ID. No. - PAMB 1066
3. POPULATION INCREASE!!!! CLIMATE CHANGE!!!!
PEST AND DISEASE
OUTBREAK!!!!
Capturing maximum genetic
variation
Time & Accuracy
Understanding genome of crop
3
4. Reference genome - Tool which serve as a base for crop improvement
Numerous sequencing efforts have been undertaken in plants and, as a result, reference
genome sequences have become available for several crops, which serve as a base for
crop improvement efforts.
Tao et al. (2019)
4
5. Single reference genome is adequate?
SINGLE-REFERENCE GENOME
Single reference genome oriented Comparative
genome analysis
What if our reference genome is incomplete to
capture whole information's ?
5
6. Incomplete representation of genetic diversity Biases in the reference genome
Dynamic nature of the genome
Limitations of using single reference genome
6
7. BEYOND A SINGLE REFERENCE GENOME !!!
A Single Reference
Is Insufficient to
Fully Capture the
Diversity within a
Crop Species
Multiple Reference
Genomes Facilitate
Exploration of
Genetic Diversity.
Bayer et al. (2020)
The move from a single reference genome to multiple reference genomes will better
illuminate the mining of genetic diversity for crop improvement by providing a more
precise and comprehensive guiding principle
7
9. Outline of Presentation
9
Introduction
Pangenome
Structural variants
How is pangenome generated?
Case studies
Utilizing pangenomics for crop improvement
Current developments in pangenomics
Super pangenome
Challenges
Future prospects
Conclusions
10. Genomic data
derived from multiple
accessions and
cultivars
Full extent of
sequence variations
within a species
PAN-GENOMIC
approach to figure out new genes and alleles
directly related to phenotype
“A pangenome refers to the full complement of genes of a biological
clade, such as a species, which can be partitioned into a set of core genes that are
shared by all individuals and a set of dispensable genes that are partially shared or
individual specific.”
What is Pangenome?
10
13. 13
Difference between Core Genes and Dispensable Genes
Core Genes Dispensable genes
Highly conserved More variable
Ratio of non-synonymous to
synonymous mutations- Low
Mutation rate is high
Highly conserved functionally Less functionally conserved
House keeping genes Adaptive / Defense response genes
14. Herve Tettelin Duccio Medini
✔ Pangenomes were first
introduced by Tettelin et al., in
2005 to describe gene diversity in
Streptococcus agalactiae
Michele Morgante
✔ Pangenomics in plants was first proposed
by Morgante et al. (2007)
14
When and Who ?
15. Timeline of Developments in Pangenomic Research
2005 2006 2008
2007 2009 2013 2014 2015 2016 2017 2018 2019
The pangenome
introduced by Tettelin
Plant pangenome
concept proposed by
Morgante et al.
1. Human pangenome
2. Bacterial upper
kingdom pangenome
Pangenome of
phytoplankton
Emiliania huxleyi
1. Review of analytical tool and model
developed over 10 years of
pangenome research (Vernikos et al.,
2015)
2. E. Coli pangenome built using 1085
genomes
3. Rice accessory genome characterized
Pangenome of bread
wheat and stiff brome
1. Human
pangenome
2. Pig pangenome
Streptococcus
pneumoniae
pangenome
Escherichia coli
pangenome
1. Soybean and
wild relatives
pangenome
2. Maize
transcriptome
1. Brassica
oleracea
pangenome
2. Poplar
genome
Rice pangenome built using 3010
accessions
Saccharomyces cerevisiae
pangenome built using 10 isolates
15
Golicz et al. (2019)
16. Timeline of Developments in Pangenomic Research
‘‘map-to-pan’’ strategy 16
First graph-based plant pan-
genome was constructed in
soybean
17. MAJOR DRIVING FORCES FOR
SVs UNDERLYING THE
VARIABLE SEQUENCES OF
PLANT PAN-GENOMES
17
29. De Novo genome assembly
1. Short/Long reads
2. Contig assembly
3. Scaffold/Chromosome
assembly
4. Multiple alignment of
genomic regions
5. Pan-genome construction
29
30. • Less expensive
• It requires much less data
• Permits the assessment of large
numbers of individuals with
relatively low sequencing
coverage.
30
Changes in the gene
order
31. Mapping of the reads to the reference
sequence
Assembly of the unmapped reads
Building pangenome
31
Reference genome
32. 32
Graph structure to represent
the diversity of genomic
sequences
Presents variation across
multiple genomes as different
paths along a graph of
sequence or variant nodes
33. Steps involved in graph-based pangenome
assembly
Read pre-processing
K-mer construction
Graph traversal
Pangenome construction
33
36. Crop species
PAN GENOME
Reference
No pan genes Core% Dispensable%
Triticum
aestivum
140 500 genes 57.70 42.30 Montenegro et al.,
2017
Glycine soja 59 080 genes
families
48.60 51.40 Li et al., 2014
Zea mays 41 903
transcripts
39.12 60.88 Hirsch et al., 2014
Brassica
oleracea
61 379 genes 81.29 18.71 Golicz et al., 2016
Brassica rapa 36 882 genes 84.56 15.44 Lin et al., 2014
Brachypodiu
m distachyon
37 886 genes 55.00 45.00 Gordon et al., 2017
Helianthus
annuus
61,205 genes 73.00 27.00 Hubner et al., 2018
A higher ploidy and outcrossing rate provides extra level of
diversity and therefore a larger pangenome with higher
percentage of dispensable genes.
36
Ratio of Core vs Dispensable genes
37. Case study 1
Objective:
To construct a chickpea pan-genome which provide insights into
species divergence, the migration of the cultigen (C. arietinum) and
identification of rare allele burden and fitness loss in chickpea.
Varshney, R. K. et al. (2021)
37
38. Results
Chickpea pan-genome (592.58 Mb) developed using an iterative
mapping and assembly approach.
38
A total of 29,870 genes were identified, of which 1,582 were to our
knowledge novel compared to previously reported genes.
Gene ontology (GO) annotations identified genes that encode response to
oxidative stress, response to stimulus, heat shock protein, cellular response
to acidic pH and response to cold, suggesting a possible role in adaptation.
The modeling curve analysis showed that chickpea pan-genome is closed
39. • Cultivated (2,258) and C. reticulatum (22)
accessions were analysed to discover structural
variations, as compared to the CDC Frontier
genome.
• More structural variations in the C. reticulatum
accessions because of their high divergence
from cultivated chickpea.
• They further identified 793 gene-gain copy
number variants (CNVs) and 209 gene-loss
CNVs in cultivated accessions, and 643 gene-
gain and 247 gene-loss CNVs in C. reticulatum
accessions.
39
40. Reconstructed the past history of effective size of
chickpea population using 150 randomly chosen
cultivated genotypes of chickpea using markovian
coalescent as implemented in SMC++ (Terhorst
et al., 2017).
1. Chickpea experienced a strong
bottleneck beginning around
10,000 years ago
2. The population size reaching its
minimum around 1,000 years ago
3. Followed by a very strong
expansion of the population
within the last 400 years, suggest
a strong recent expansion of
chickpea agriculture.
40
41. Neighbour-joining tree constructed
indicates a clear out-grouping of wild
species accessions from cultivated
accessions
The cultivated accessions formed
three distinct clusters
One landrace from East Africa (ICC
16369) grouped together with wild
species accessions indicating that it is
mislabeled as belonging to the
cultivated chickpea 41
42. Conclusions from this study
They constructed a chickpea pan-genome and identified the novel genes which are
not reported earlier
Divergence tree constructed allowed them to estimate the divergence of cicer over
the last 21 million years
Identified selective sweeps of genes under domestication & bottleneck leading to
reduced genetic diversity
42
43. Case study 2
To develop a high-quality rice pan-genome of genetically diverse rice
accessions through de novo genome assemblies
Demonstration of the impact of structural variation on environmental
adaptations and agronomic traits
2021
Objective:
2021
43
44. Materials and methods
PacBio SMRT sequencing De novo assembled Assemblies were evaluated for completeness using
BUSCO
(Benchmarking Universal Single-Copy Orthologs)
44
45. Results
• They had built a pan-genome of cultivated rice
comprising 66,636 genes.
• Distribution analysis showed that 20,374 genes were
categorized as ‘‘core genes’’ and 46,262 genes were
categorized as ‘‘dispensable genes’’ which included
14,609 accession-private genes.
• They identified an average 24,469 SVs per accession
relative to Nipponbare.
45
46. Contribution of SVs in rice environmental adaptation
OsWAK112d gene, a known
negative regulator of blast
resistance
Two Independent deletions in
OsWAK112d gene contributed to
environmental adaptation by
enhancing blast resistance in
rice.
Fig 2. The distributions of the deletion of OsWAK112d in subpopulations of O. sativa and wild rice population
Fig 1.Schematic illustrating the deletions of OsWAK112d in the LJ and
N22 accessions
46
47. Association of Gene CNVs with variations in agronomic traits
In addition to SVs, gCNVs were inferred for 25,549(38.34%) of the protein
coding genes in the rice pan genome.
47
Short day Early flowering
Long day Delayed flowering & Increased grain number
CNV of OsVIL1 is likely associated with flowering time and grain number
48. Conclusions from this study
• De novo assembly of 31 high-quality genomes for genetically diverse
accessions
• Pan-genome-scale resources and a graph-based genome reveal hidden SVs
and gCNVs
• The derived state of O. sativa SVs was inferred using the O. glaberrima
assembly
• SVs and gCNVs have shaped gene expression profiles and agronomic trait
variations
48
51. De novo domestication is essential for utilization of diversity present in CWRs
Zsogon et al. edited six loci [SELF PRUNING, OVATE, FASCIATED, FRUIT WEIGHT3.2,
MULTIFLORA, LYCOPENE BETA CYCLASE] in S. pimpinellifolium significantly increased its yield,
productivity, and nutritional value resulting in de novo domestication of tomato
Variation for fruit weight QTL fw3.2 caused by tandem duplication of the cytochrome P450 gene
elucidated with the help of tomato pangenome.
Alonge et al. (2020) 51
52. A model of heterosis proposed by Swanson-
Wagner
❖ Pan-genomics plays important role in
identifying gene members and families
contributing to heterosis
❖ A new gene and variant finding is
essential to explaining and utilizing
heterosis for crop improvement.
Dominance Hypothesis-
Complementarity of Genes between
Parental Lines drives Heterosis
52
53. The tomato pangenome un-covers new genes and a
rare allele regulating fruit flavor
4. Pangenome un-covers rare alleles
Objectives:
Construction of Pangenome of Tomato
Identification of PAVs and substantial gene loss during domestication
Identification of rare alleles
53
54. Material and Methods
Species Group No. of Accessions
Solanum pimpinellifolium (SP)
Wild groups
78
Solanum cheesmaniae ssp galapagense (SCG) 8
Solanum lycopersicum L. lycopersicum (SLL)
Domesticated
group
372
Solanum lycopersicum L. var. cerasiforme (SLC) 267
Total 725
The genome for each accession was de novo assembled producing a total of
306Gb of contigs longer than 500bp with N50 value of 3180bp
54
55. Violin graph
SP
Wild groups
SCG
SLC Domesticated
group
SLL
Selection of gene PAVs during tomato domestication & breeding
Gene loss during tomato domestication and subsequent improvement
55
56. “Who will last in the Run?”
Identification gene PAVs under selection
Scatter plots Gene selection preference during tomato domestication and improvement
Domestication (SLC Vs SP) Improvement (SLLheirlooms vs SLC)
Favorable Unfavorable
Domestication
Phase
120 1213
Improvement
Phase
12 665
Results suggest that more genes were selected against
than selected for during both domestication and
improvement of tomato
56
57. • Many of the favorable alleles lost in recent years as a result of breeding
emphasizing production over quality trait
Identification of rare alleles
Present in Heinz 1706
(Reference)
Non reference allele found
in pangenome
TomLoxC (Solyc01g006540) essential for C5 and C6 green-leaf
volatile production in tomato fruit
57
4,151-bp (~4-kb substitution) nonreference allele of the TomLoxC promoter
captured in Pan-genome
Rare allele in cultivated tomatoes
that reflects strong negative selection during domestication.
58. S. pimpinellifolium
SP
(47.4 %)
Modern SLL cultivars
(7.2%)
All heterozygotes
S. cheesmaniae SLC
(8.4 %)
SLL heirlooms
(1.1 %),
The frequency of the non-reference allele
Wild group Cultivated group
Most likely because of recent introgressions of rare allle
from wild into cultivated tomatoes to induce stress
tolerance
Cultivated land races Modern cultivars
58
59. Conclusions from this study
• 4,873 novel genes identified which are not in reference genome
• PAVs analyses revealed substantial gene loss and intense negative
selection of genes during domestication and improvement.
• Identification of rare allele in the TomLoxC promoter selected against
during domestication.
• Lost or negatively selected genes are enriched for important traits in
present breeding scenario
59
60. A major concern in QTL mapping and GWAS based on SNPs from a single
reference genome is reference bias.
Maize gene resistance to sugarcane mosaic virus identified by GWAS using
markers based on the B73 but not the PH207 because gene was absent in
PH207 assembly. (Coletta et al., 2021).
Using a pan-genome as the reference can reduce the misalignment
60
62. Genomic selection is an alternative approach for complex traits controlled
by QTLs with small effects
Uses SNPs as predictors and it is biased with the use of single reference
genome.
Pangenomic data can be
used to identify new markers
to improve prediction
accuracy
Practical haplotype graph
approach (PHG).
62
67. • Dispensable genome is enriched with environmental response genes
• Pangenomes can be used in detection of sequences associated with
agronomically relevant traits
• This lead to the transition from so-called genomic-assisted to pan
genomic-assisted breeding strategies for….
1.Resistance to
biotic
stress/Disease
resistance
2.Vernalization
and flowering
time
3.Fruit, grain,
yield and seed
quality
4.Abiotic stress
tolerance and
Resistance
5.Plant
architecture
67
68. CNVs at Rhg1 locus –resistance
to cyst nematode (soybean)
(Cook et al., 2012)
Absence of sulfotransferase gene
in PAVs with various sizes-
resistance to striga (Sorghum)
(Gobena et al., 2019)
Deletions in the Pi21 gene
results in quantitative and
durable resistance against blast
disease (Rice) (Fukuoka et al.,
2009)
Duplications
68
69. Gene encoding a Fe2+/Zn2+
regulated transporter
associated with iron deficiency
chlorosis in soybean
The PAVs in Sub1A gene -
submergence tolerance &
2 ERF genes SNORKEL
1&SNORKEL 2 - deep water
response in rice.
(Liu et al., 2020)
(Xu et al., 2006)
69
70. Brachypodium distachyon reference (Bd21) having
shortest flowering.
This dispensable gene Brdisv1ABR21022861m is present
in delayed and extremely delayed flowering species
(Gordon et al., 2017)
In Brassica napus pangenome analysis based GWAS
identified variations causing several agronomically
relevant traits including silique length, seed weight and
flowering time (Song et al., 2020)
70
71. The pepper pangenome GWAS
revealed the deletions in genes
involved in carotenoid and
capsaicinoid biosynthetic
pathways (Ou et al., 2018)
In rice, 1212bp deletion of the
GW5 gene causes variation of
grain width and grain weight.(Liu
et al., 2017)
71
72. In wheat, a extra copy of
Rht-D1b resulting in
reduction of plant height
(Li et al., 2012)
In rice, a deletion in qPE9-
1, resulted in in erect
panicles(Zhou et al., 2009)
72
73. SV’s causing variation of agronomically important traits in major crops
Species Gene(locus) Trait Type
Rice GW5/qSW5/GSE5 Grain size PAV
Rice GL7 Grain size CNV
Rice SGDP7 Grain size, grain number, yield CNV
Rice Pikm1-TS Blast resistance PAV
Rice Pikm2-TS Blast resistance PAV
Rice Sub1A Submergence tolerance PAV
Rice Pup1 Phosphorus-starvation tolerance PAV
Rice SNORKEL1 Deep water response PAV
Rice SNORKEL2 Deep water response PAV
Rice qPE9-1 Plant architecture PAV
Rice Pi21 Blast disease PAV
Rice Sc Hybrid male sterility CNV
Rice DPL1/DPL2 Hybrid male sterility PAV
Rice S27/S28 Hybrid male sterility PAV
Rice OsSh1 Shattering PAV
Maize KRN4 Kernel row number PAV
CNV –Copy Number Variation PAV – Presence Absence Variations
73
75. Current developments in pangenomics
CURRENT STATUS AND FUTURE
ASPECTS OF PANGENOMIC STUDIES
75
76. This database provides:
1. Basic information of 3,010 (3k) rice accessions
2. Sequences and gene annotations for the rice pan-genome
3. Gene presence-absence variations (PAVs) of rice accessions
4. 260Mbp novel sequences
5. At least 12,000 novel genes absent in the reference genome were
included
6. Expression profiles for rice pan-genome
76
78. Uses:
It is capable of scanning presence/absence variants (PAVs) and constructing a
fully annotated pan-genome
Overview of the ppsPCP pipeline.
(github DOI: https://doi.org/10. 5281/zenodo.2567390 and webpage
http://cbi.hzau.edu.cn/ppsPCP/)
78
80. 80
Software / Tool Description / Role URI link
PanSeq Extract the regions unique in the genome, Identify the SNPs and
construct the file for phylogeny programme
https://lfz.corefacility.ca/pans
eq/
PanFunPro x Homology detection and pairwise genome analysis in pan/core genome. https://zenodo.org/record/758
3#.YTR36p0zY2w
PGAP Detection of homologous genes, orthologous genes, SNP, phylogenetic
studies, pangenome plotting and functional annotation.
http://pgap.sf.net
PanACEA Identification of genomic regions those are phylogenetically dissimilar. https://github.com/JCVenterIn
stitute/PanACEA
PGAP-X Genome diversity and visualize genome structure and gene content to
understand the evolution.
http://pgapx.ybzhao.com/
PAN2HGENE To identify new products, resulting in altering the α value behavior in
the pangenome without altering the original genomic sequence.
https://sourceforgenet/projects
/pan2hgene-software
BGDMdocker For pangenome analysis, visualization, clustering and genome
annotation
https://www.docker.com/what
isdocker
Tools-Pangenome analysis
81. Cross-species pangenomes and evolutionary studies
Single species
Multiple related species
using superpangenome
81
82. Super Pangenome
• Useful to transfer genes from the
species belonging to distantly
related gene pools.
• Breeding of crops better adapted to
diverse environments and more
resilient to climate change
82
83. Challenges in Pangenome studies
Several species have large, complex genomes, making numerous
assemblies per taxon cost-prohibitive
Assembly errors can lead to the detection of false SVs
Consolidation of pangenome variation into a single reference or
coordinate system
Polyploidy and heterozygosity are the challenging in genome
assembly
83
84. Li et al. (2022) 84
FUTURE PROSPECTS
New tools are required to support
variation graph assembly, pangenome
construction and visualization
An integrated pangenome browser
should be developed, capable of
representing SNPs and SVs for genome
analysis
Expanding the pangenome beyond
species will increase the use of wild gene
sequence diversity in crop improvement.