This document summarizes research on predicting risk from Phytophthora genome evolution by analyzing newly sequenced genomes of three less damaging Phytophthora species: P. europeae, P. foliorum, and P. obscura. PacBio sequencing generated long reads that assembled into high quality draft genomes with over 98% completeness. Comparative analysis of gene families like xylanases involved in pathogenesis will provide insight into variations influencing host range and damage among Phytophthora species. Further improvement and comparative studies of these and other Phytophthora genomes aim to further understanding of pathogen adaptation and evolution.
The genome of Fraxinus excelsior (European ash)LizzySollars
Slides presented at the Plant and Animal Genome XXIII conference in San Diego, 11th Jan 2015.
The talk includes assembly and annotation of the ash tree genome, as well as analysis of gene sharing with other plants and an preliminary look at the diversity of European ash trees.
Recombinant DNA (rDNA) molecules are DNA molecules formed by laboratory methods of genetic recombination (such as molecular cloning) to bring together genetic material from multiple sources, creating sequences that would not otherwise be found in the genome.
The genome of Fraxinus excelsior (European ash)LizzySollars
Slides presented at the Plant and Animal Genome XXIII conference in San Diego, 11th Jan 2015.
The talk includes assembly and annotation of the ash tree genome, as well as analysis of gene sharing with other plants and an preliminary look at the diversity of European ash trees.
Recombinant DNA (rDNA) molecules are DNA molecules formed by laboratory methods of genetic recombination (such as molecular cloning) to bring together genetic material from multiple sources, creating sequences that would not otherwise be found in the genome.
Genome: The entire chromosomal genetic material of an organism.
Sequencing a genome: Determining the identity and order of nucleotides in the genetic material – usually DNA, sometimes RNA, of an organism.
This presentation entitled 'Molecular phylogenetics and its application' deals with all the developmental ideas and basics in the field of bioinformatics.
Molecular marker analysis of A few Capsicum annum varietiesAnkitha Hirematha
The hybrid variety and parental varieties among the 3 chilly varieties were identified by finding out the genetic polymorphism between them. It helps to identify different plant varieties, disputed plant varieties, genetic polymorphism between intraspecific crosses of plants and also to protect Plant Breeder’s Rights (PBR). Based on banding pattern on gel, identification of KA, KS and HK chilly varieties using SSR & ISSR markers was successfully carried out.
Genome evolution - tales of scales DNA to crops,months to billions of years, ...Pat (JS) Heslop-Harrison
Pat Heslop-Harrison: Lecture to University of Malaya, Kuala Lumpur, Malaysia December 2013
Some DNA sequences are recognizable in all organisms and originated with the start of life. Others are unique to a single species. Some sequences are present in single copies in genomes, while others are present as millions of copies. The total amount of DNA in cells of an advanced eukaryotic species can vary over three orders of magnitude, and chromosome number can vary similarly. How can such huge variations be accommodated within the constraints of organism growth, development and reproduction? What are the evolutionary implications of these huge variations? How can we use the information to understand plant evolution, cytogenetics, genetics and epigenetics? What are the implications for future evolution, biodiversity and responses of plants during plant breeding or climate change?
Genome: The entire chromosomal genetic material of an organism.
Sequencing a genome: Determining the identity and order of nucleotides in the genetic material – usually DNA, sometimes RNA, of an organism.
This presentation entitled 'Molecular phylogenetics and its application' deals with all the developmental ideas and basics in the field of bioinformatics.
Molecular marker analysis of A few Capsicum annum varietiesAnkitha Hirematha
The hybrid variety and parental varieties among the 3 chilly varieties were identified by finding out the genetic polymorphism between them. It helps to identify different plant varieties, disputed plant varieties, genetic polymorphism between intraspecific crosses of plants and also to protect Plant Breeder’s Rights (PBR). Based on banding pattern on gel, identification of KA, KS and HK chilly varieties using SSR & ISSR markers was successfully carried out.
Genome evolution - tales of scales DNA to crops,months to billions of years, ...Pat (JS) Heslop-Harrison
Pat Heslop-Harrison: Lecture to University of Malaya, Kuala Lumpur, Malaysia December 2013
Some DNA sequences are recognizable in all organisms and originated with the start of life. Others are unique to a single species. Some sequences are present in single copies in genomes, while others are present as millions of copies. The total amount of DNA in cells of an advanced eukaryotic species can vary over three orders of magnitude, and chromosome number can vary similarly. How can such huge variations be accommodated within the constraints of organism growth, development and reproduction? What are the evolutionary implications of these huge variations? How can we use the information to understand plant evolution, cytogenetics, genetics and epigenetics? What are the implications for future evolution, biodiversity and responses of plants during plant breeding or climate change?
Professional air quality monitoring systems provide immediate, on-site data for analysis, compliance, and decision-making.
Monitor common gases, weather parameters, particulates.
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Sérgio Sacani
We characterize the earliest galaxy population in the JADES Origins Field (JOF), the deepest
imaging field observed with JWST. We make use of the ancillary Hubble optical images (5 filters
spanning 0.4−0.9µm) and novel JWST images with 14 filters spanning 0.8−5µm, including 7 mediumband filters, and reaching total exposure times of up to 46 hours per filter. We combine all our data
at > 2.3µm to construct an ultradeep image, reaching as deep as ≈ 31.4 AB mag in the stack and
30.3-31.0 AB mag (5σ, r = 0.1” circular aperture) in individual filters. We measure photometric
redshifts and use robust selection criteria to identify a sample of eight galaxy candidates at redshifts
z = 11.5 − 15. These objects show compact half-light radii of R1/2 ∼ 50 − 200pc, stellar masses of
M⋆ ∼ 107−108M⊙, and star-formation rates of SFR ∼ 0.1−1 M⊙ yr−1
. Our search finds no candidates
at 15 < z < 20, placing upper limits at these redshifts. We develop a forward modeling approach to
infer the properties of the evolving luminosity function without binning in redshift or luminosity that
marginalizes over the photometric redshift uncertainty of our candidate galaxies and incorporates the
impact of non-detections. We find a z = 12 luminosity function in good agreement with prior results,
and that the luminosity function normalization and UV luminosity density decline by a factor of ∼ 2.5
from z = 12 to z = 14. We discuss the possible implications of our results in the context of theoretical
models for evolution of the dark matter halo mass function.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.Sérgio Sacani
The return of a sample of near-surface atmosphere from Mars would facilitate answers to several first-order science questions surrounding the formation and evolution of the planet. One of the important aspects of terrestrial planet formation in general is the role that primary atmospheres played in influencing the chemistry and structure of the planets and their antecedents. Studies of the martian atmosphere can be used to investigate the role of a primary atmosphere in its history. Atmosphere samples would also inform our understanding of the near-surface chemistry of the planet, and ultimately the prospects for life. High-precision isotopic analyses of constituent gases are needed to address these questions, requiring that the analyses are made on returned samples rather than in situ.
Cancer cell metabolism: special Reference to Lactate PathwayAADYARAJPANDEY1
Normal Cell Metabolism:
Cellular respiration describes the series of steps that cells use to break down sugar and other chemicals to get the energy we need to function.
Energy is stored in the bonds of glucose and when glucose is broken down, much of that energy is released.
Cell utilize energy in the form of ATP.
The first step of respiration is called glycolysis. In a series of steps, glycolysis breaks glucose into two smaller molecules - a chemical called pyruvate. A small amount of ATP is formed during this process.
Most healthy cells continue the breakdown in a second process, called the Kreb's cycle. The Kreb's cycle allows cells to “burn” the pyruvates made in glycolysis to get more ATP.
The last step in the breakdown of glucose is called oxidative phosphorylation (Ox-Phos).
It takes place in specialized cell structures called mitochondria. This process produces a large amount of ATP. Importantly, cells need oxygen to complete oxidative phosphorylation.
If a cell completes only glycolysis, only 2 molecules of ATP are made per glucose. However, if the cell completes the entire respiration process (glycolysis - Kreb's - oxidative phosphorylation), about 36 molecules of ATP are created, giving it much more energy to use.
IN CANCER CELL:
Unlike healthy cells that "burn" the entire molecule of sugar to capture a large amount of energy as ATP, cancer cells are wasteful.
Cancer cells only partially break down sugar molecules. They overuse the first step of respiration, glycolysis. They frequently do not complete the second step, oxidative phosphorylation.
This results in only 2 molecules of ATP per each glucose molecule instead of the 36 or so ATPs healthy cells gain. As a result, cancer cells need to use a lot more sugar molecules to get enough energy to survive.
Unlike healthy cells that "burn" the entire molecule of sugar to capture a large amount of energy as ATP, cancer cells are wasteful.
Cancer cells only partially break down sugar molecules. They overuse the first step of respiration, glycolysis. They frequently do not complete the second step, oxidative phosphorylation.
This results in only 2 molecules of ATP per each glucose molecule instead of the 36 or so ATPs healthy cells gain. As a result, cancer cells need to use a lot more sugar molecules to get enough energy to survive.
introduction to WARBERG PHENOMENA:
WARBURG EFFECT Usually, cancer cells are highly glycolytic (glucose addiction) and take up more glucose than do normal cells from outside.
Otto Heinrich Warburg (; 8 October 1883 – 1 August 1970) In 1931 was awarded the Nobel Prize in Physiology for his "discovery of the nature and mode of action of the respiratory enzyme.
WARNBURG EFFECT : cancer cells under aerobic (well-oxygenated) conditions to metabolize glucose to lactate (aerobic glycolysis) is known as the Warburg effect. Warburg made the observation that tumor slices consume glucose and secrete lactate at a higher rate than normal tissues.
Richard's aventures in two entangled wonderlandsRichard Gill
Since the loophole-free Bell experiments of 2020 and the Nobel prizes in physics of 2022, critics of Bell's work have retreated to the fortress of super-determinism. Now, super-determinism is a derogatory word - it just means "determinism". Palmer, Hance and Hossenfelder argue that quantum mechanics and determinism are not incompatible, using a sophisticated mathematical construction based on a subtle thinning of allowed states and measurements in quantum mechanics, such that what is left appears to make Bell's argument fail, without altering the empirical predictions of quantum mechanics. I think however that it is a smoke screen, and the slogan "lost in math" comes to my mind. I will discuss some other recent disproofs of Bell's theorem using the language of causality based on causal graphs. Causal thinking is also central to law and justice. I will mention surprising connections to my work on serial killer nurse cases, in particular the Dutch case of Lucia de Berk and the current UK case of Lucy Letby.
A brief information about the SCOP protein database used in bioinformatics.
The Structural Classification of Proteins (SCOP) database is a comprehensive and authoritative resource for the structural and evolutionary relationships of proteins. It provides a detailed and curated classification of protein structures, grouping them into families, superfamilies, and folds based on their structural and sequence similarities.
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...Scintica Instrumentation
Intravital microscopy (IVM) is a powerful tool utilized to study cellular behavior over time and space in vivo. Much of our understanding of cell biology has been accomplished using various in vitro and ex vivo methods; however, these studies do not necessarily reflect the natural dynamics of biological processes. Unlike traditional cell culture or fixed tissue imaging, IVM allows for the ultra-fast high-resolution imaging of cellular processes over time and space and were studied in its natural environment. Real-time visualization of biological processes in the context of an intact organism helps maintain physiological relevance and provide insights into the progression of disease, response to treatments or developmental processes.
In this webinar we give an overview of advanced applications of the IVM system in preclinical research. IVIM technology is a provider of all-in-one intravital microscopy systems and solutions optimized for in vivo imaging of live animal models at sub-micron resolution. The system’s unique features and user-friendly software enables researchers to probe fast dynamic biological processes such as immune cell tracking, cell-cell interaction as well as vascularization and tumor metastasis with exceptional detail. This webinar will also give an overview of IVM being utilized in drug development, offering a view into the intricate interaction between drugs/nanoparticles and tissues in vivo and allows for the evaluation of therapeutic intervention in a variety of tissues and organs. This interdisciplinary collaboration continues to drive the advancements of novel therapeutic strategies.
Predicting property prices with machine learning algorithms.pdf
Paul Sharp and Ewan Mollison wp4 Nov 2018
1. Phytothreats WP4
20th November 2018
Predicting risk via analysis of Phytophthora genome evolution
Ewan Mollison, Paul Sharp – University of Edinburgh
Sarah Green – Forest Research
Leighton Pritchard, David Cooke – James Hutton Institute
2. Introduction
• What can drive evolution of a pathogen?
• “Intrinsic” factors: duplication, rearrangement, insertion, deletion of DNA
regions
• “Extrinsic” factors: hybridisation between species, transfer of genes between
species
• Allow pathogens to
• Adapt to evolving host defences
• Expand host range
• Increase virulence
3. NOWELL, LAUE, SHARP & GREEN (2016)
“Comparative genomics reveals genes significantly
associated with woody hosts in the plant pathogen
Pseudomonas syringae”
Molec. Plant Path. 17:1409-1424
Genomes of 64 strains
of Pseudomonas syringae
38 from woody hosts
26 others
5. Aims
• Compare genes from available sequenced Phytophthora genomes
• Identify a core set of Phytophthora genes, common to all species
• Identify species-specific genes or variation
• Sequence genomes from three less damaging species, which are
closely related to highly damaging species
• Topic of this talk
• Study target genes / gene families known to be important for
virulence
• How do variations in these influence the pathogen, e.g. host range, damage
caused, etc.?
6. Phytophthora species
Genome assemblies
available for 27 species
from 10 clades
(11 are from clades 7 & 8)
CladeSpecies
1a P. cactorum
1c P. infestans
1 P. parasitica
2a P. colocasiae
2b P. capsici
2c P. multivora
2c P. plurivora
3 P. pluvialis
3? P. taxon totara
4 P. litchii
4 P. megakarya
4 P. palmivora
5 P. agathidicida
6b P. pinifolia
Clade Species
7a P. alni
7a P. cambivora
7a P. fragariae
7a P. rubi
7b P. pisi
7b P. sojae
7c P. cinnamomi
8a P. cryptogea
8c P. lateralis
8c P. ramorum
8d P. austrocedri
9b P. fallax
10 P. kernoviae
8. P. europeae
• Mainly infects European Oak (roots),
also identified in North America
• Clade 7 (Subclade 7a): most closely related to
P. alni, P. cambivora (both woody host), P.
fragariae, P. rubi (both soft fruit host)
9. P. foliorum
• Causes leaf blight in azaleas
• Clade 8 (Subclade 8c): most
closely related to P. lateralis, P.
ramorum (both woody host)
10. P. obscura
• Found associated with horse
chestnut and pieris
• Clade 8 (subclade 8d): most
closely related to P. austrocedri
(juniper and other cypress
species)
12. Three less damaging, although still pathogenic, Phytophthora species:
• P. europeae
• P. foliorum
• P. obscura
Sequencing now complete!
PacBio sequencing of 2 SMRT cells for each species (Exeter)
DNA prepared by Carolyn Riddell (Forest Research)
13. Why PacBio rather than Illumina?
• MUCH longer read lengths can be achieved
• Tens of Kbp rather than 150 – 300bp
• Repeats more easily resolved
• Greater overall contiguity
• Random source of error rather than systematic bias
• Over-coverage can be used to help error-correct rather than amplify bias
• P. austrocedri – hybrid Illumina/PacBio
• Other assembled Phytophthoras – Illumina only
14. P. austrocedri reassembly (Peter Thorpe)
• Hybrid sequencing – both PacBio and
Illumina
• Hampered by not quite enough
coverage of either for optimal assembly
• Reassembled P. austrocedri using only
the PacBio reads
• Error-corrected using trimmed, de-
duplicated Illumina reads
• Purged “haplotigs” to produce
consensus haploid assembly
Hybrid
assembly
Corrected
PacBio
No. scaffolds 43,700 862
Scaffold N50 41,889 213,073
Max scaffold length 422,335 861,531
Mean scaffold length 3,089 121,524
Total length (Mbp) 135.01 104.75
% GC 51.4 51.5
% Repeat masked 49.0 39.3
No. gene models 38,492 26,960
15. Raw sequence generated
P. foliorum SMRT 1 SMRT 2 Combined
No. reads 634,588 836,118 1,470,706
Max read length 77,730 82,231 82,231
Mean read length 9,669 8,144 8,802
Total length (Gbp) 6.1 6.8 12.9
P. obscura SMRT 1 SMRT 2 Combined
No. reads 475,375 564,534 1,039,909
Max read length 79,519 80,795 80,795
Mean read length 12,308 10,876 11,531
Total length (Gbp) 5.9 6.1 12.0
P. europeae SMRT 1 SMRT 2 Combined
No. reads 739,454 723,675 1,463,129
Max read length 85,983 81,473 85,983
Mean read length 9,956 8,983 9,475
Total length (Gbp) 7.4 6.5 13.9 • Variable read length but
high read N50 indicates
good overall read length
achieved
• Max read length >80Kbp for
all three species
• Generally good consistency
across both SMRT cells
16. Overall strategy
PacBio
sequencing
Canu
assembly
SSPACE long read
scaffolding
BUSCO
completeness
estimate
Additional error-
correction &
assembly polishing
Repeat masking
Gene
model
prediction
Final assembly
• Conflicting opinions on whether best to
error-correct and polish before or after
scaffolding
• Correction can take a few weeks, so have
run early repeat mask and gene prediction
on initial scaffolds to get preliminary values
17. Sequencing and assembly summary
• Canu assembly of first cell from each to get “quick” picture of what’s in there
• Run with initial assumption of approx. 100Mbp genome size
• Early estimate of genome size from k-mer analysis of corrected, trimmed reads (k=31)
before assembling full data sets
• P. europeae: 95Mbp
• P. foliorum: 70Mbp
• P. obscura: 63Mbp
• Run full assembly with estimated genome size of 100Mbp for all three
18. Canu contig level assembly
P. europeae P. foliorum P. obscura
No. contigs 112 103 127
Contig N50 (Mbp) 2.83 2.42 2.99
Max contig length (Mbp) 9.61 5.60 6.83
Mean contig length (Mbp) 0.68 0.60 0.48
Total length (Mbp) 76.5 61.8 60.4
Process duration
(correct, trim, assemble) 12d 7h 3d 20h 3d 2h
• High N50, low number of contigs shows very high degree of contiguity in all
three assemblies
• N50: 50% of the sequence is contained within fragments of length N, or
greater
19. Scaffolding
• Scaffold contigs using full set of PacBio reads with SSPACE long-read
• Scaffolding links contigs together with gaps of known length padded
out with “N” characters
• Reduced number of scaffolds, N50 now >2.5Mbp for each assembly
P. europeae P. foliorum P. obscura
No. scaffolds 69 67 77
Scaffold N50 (Mbp) 4.28 2.88 5.42
Max scaffold length (Mbp) 9.61 8.01 7.11
Mean scaffold length (Mbp) 1.11 0.92 0.79
Total length (Mbp) 76.7 61.9 61.1
No. N's 124,828 99,240 440,106
20. • Comparison of
scaffold count and
N50 across all 30
genomes
• Assembly is
comparable to
that of P. sojae
21.
22. “BUSCO” completeness (n = 234)
P. europeae P. foliorum P. obscura
Complete BUSCOs 230 (98.3%) 230 (98.3%) 230 (98.3%)
Complete/single 227 (97.0%) 229 (97.9%) 229 (97.9%)
Complete/duplicated 3 (1.3%) 1 (0.4%) 1 (0.4%)
Fragmented 2 (0.9%) 0 (0.0%) 1 (0.4%)
Missing 2 (0.9%) 4 (1.7%) 3 (1.3%)
• High estimate of completeness for all three assemblies (98%)
• Good coverage of the “gene-space” achieved
• Very low level of duplication seen in all three (~1%)
• Suggests good resolution of haplotypes within assembly
• Also suggests polyploidy unlikely
24. Repeat content and gene model estimation
• RepeatMasker run vs. scaffolded assemblies with models derived
from multiple Phytophthora species (generated by RepeatModeler)
• Augustus run vs. masked assemblies using training set from closest
available relative
• P. europeae – P. rubi based set
• P. foliorum, P. obscura – P. austrocedri based set
• No. predicted gene models comparable to P. infestans, P. ramorum, etc. –
realistic looking figure
P. europeae P. foliorum P. obscura
% GC 53.6 51.9 53.3
% Repeat masked 45 38 35
No. gene models 15,863 15,907 17,178
25.
26.
27. Sample gene family: Xylanases
• Class of cell wall degrading enzymes which
break down hemicellulose by degrading b-1-4-
xylan into xylose
• Hemicellulose is a major constituent of the
plant cell wall
• Xylanase enzymes play a major role in the ability of
micro-organisms to degrade plant material
• Help the pathogen enter host tissues by breaking
down the cell wall
• Expand previous xylanase analysis to include
new genome assemblies
28. full Xylanase tree
• Two major clades
• One containing xyn1 and
xyn2
• The other containing
xyn3 and xyn4
xyn1
xyn2
xyn3
xyn4
29. 8: 3
6: 4
9: 2
10: 2
7: 4
3: 4
2: 4
4: 4
1: 4
5: 4
Number of xylanase genes
varies among clades
30. Next stages
• Finalise assembly improvement
• Remove contaminant reads, polishing, gap-filling, etc.
• Re-scaffold assemblies
• Re-run repeat masking, gene model prediction
• Bring these assemblies together with the others for downstream
comparative analysis
• Identify orthologous groups, targeted gene family studies, etc.