Invezz.com - Grow your wealth with trading signals
Diversity of Crinkler Effector Genes in Phytophthora infestans isolates.
1. Diversity of Crinkler Effector Genes
in Phytophthora infestans isolates.
Graham Etherington
8 Feb 2013
2. Outline
• Phytophthora infestans
– Biology and evolution
– Effector genes
• Crinklers
– Gene architecture
– VLVVVP recombination motif
• Aims
• Methods
– Looking for the VLVVVP motif
– Making use of all the data
• Applying methods to data
• Confirming results
• Comments/To-do
3. Phytophthora infestans
• Oomycete
• Cause of Late Blight in Solanum species (e.g. Potato)
• Responsible for the Irish Potato Famine (1845)
• Global potato yield losses of 16%
• Economic losses of £4.3 billion annually
4. • Capable of sexual and asexual reproduction
• High evolutionary potential
- striking capacity for genetic change
- rapidly adapts to overcome resistant plants
• Phytophthora - from Greek (phytón) “plant” and
(phthorá) “destruction” - the plant-destroyer!
Phytophthora infestans
5. P. infestans effector genes
• Secreted from pathogen – alters host to enable infection
• High polymorphism and positive selection
• Forms a ‘haustorium’ which delivers effector molecules to
plant cells.
• Two main cytoplasmic effectors:
– RXLRs
– Crinklers
Beynon 2006
8. Aims
• Do P.infestans isolates contain unique
Crinkler genes?
• What is the diversity of Crinklers within
P.infestans?
• How many Crinklers are shared between
different isolates?
9. The data
• 76nt PE Illumina genomic sequences of P.
infestans isolates
– EC3527 (51x)
– EC3626 (64x)
– UK 3928 (50x)
– US 22 (50x)
– NL 07434 (50x)
Test data (99% seq similarity)
10. Looking for VLVVVP in genome
• Take isolate reads, make 6-frame translations and
look for ‘VLVVVP’ recombination site?
• Control experiment – available resources:
– P. infestans reference sequence (T30-4)
– All annotated transcripts
• including Crinklers
• Make 6-frame translation of genome and look for
VLVVVP motif
– Where are the motif hits?
11. Looking for VLVVVP in genome
• 231 VLVVVP hits:
– 81 in annotated Crinklers
– 18 in other transcripts
– 132 outside any annotated feature
• Conclusion: Looking for the VLVVVP motif in 6-
frame translations results in a very high
number of false positives.
12. DNA sequence
• Extract Crinkler gene sequences using P.
infestans annotation.
• Align sequences
• Locate VLVVVP motif coding sequence
• Create ‘variable consensus’ DNA sequence
from alignment.
14. Testing the consensus sequence
• No. of consensus motifs found in P. infestans genome:
194
– in annotated Crinklers: 185
– in any other annotated genes: 0
– not found in any annotated region: 9
• Blast 300nt upstream region
– 8 Crinkler gene hits, 1 false positive
• 193/194 hits were Crinklers
• Conclusion: Looking for the ‘variable consensus’
sequence in reads should result in very low number of
false positives.
15. Methods
• Search reads for variable consensus.
• Truncate reads at consensus.
– ATCGGTTCATGTGCTGGTGGTGTTTCCGCCGTG becomes
GTGCTGGTGGTGTTTCCGCCGTG
• Count the occurrence of each sequence:
GTGCTGGTGGTGGTTCCGCCGTG 100
GTGCTGGTGGCGTTGCCATGTG 70
GTGCTGGTGGCGTTGCCA 5
GTGCTGGTGGCGTTGCCCGCGTG 1
GTGCTGGTGGTGGTTCCGCC 30
• Remove singletons
16. Making use of all the data
• In the datasets there are read-counts for short ‘sub-
sequences’ which are part of one or more longer
unique ‘super-sequences’.
GTGCTGGTGGTGGTTCCGCCGTG 2
GTGCTGGTGGTGGTTCC 8
GTGCTGGTGGT 55
GTGCTGGTGGTTAACGGT 40
GTGCTGGTGGTTAACGGTACGTAA 25
• Interested in the longest unique super-sequences,
but don’t want to throw away information in the
shorter sub-sequences.
17. Proportional Representation
• Distribute read counts of smaller sub-
sequences to longer super-sequences
proportionately:
– assign more read counts to common super-
sequences
– assign less read counts to rarer super-sequences
18. Proportional Representation
A ABCDE 100
Bi ABCDEFGH 144
Bii ABCDEFGA 6
A ABCDE 100
Bi ABCDEFGH 48
Bii ABCDEFGA 2
100
50
48 96
100
50
2 4
sub-sequence A ABCDE 100
super-sequence Bi ABCDEFGH 48
super-sequence Bii ABCDEFGA 2
Sub-sequence A is shared between super-sequences Bi and Bii as follows:
50
Assign to Bi
Assign to Bii
+48
+2
19. Recap
• Identify the best way to find Crinkler
recombination motif (VLVVVP) in P. infestans =
variable consensus sequence.
• Search for the variable consensus in the isolate
reads.
• Create ‘proportionally representative’ longest
super-sequences.
• Produce Venn Diagram of shared and unique
sequences (both as DNA seqs and protein seqs).
21. Proportional representation of all DNA
super-sequences.
Shared and unique motifs
Number in brackets refers to percent of dataset.
…as amino acid sequences
23. • UK 3928, US 22, NL 07434
Proportional representation of all
DNA super-sequences ..as amino acid sequences
Shared and unique motifs
24. Are Crinklers being correctly
identified?
Left paired-read = Left primer
Right paired-read = Right primer
• Create primers from reads for sequencing
VLVVVP
25. Shared by NL/US
Shared by UK/US
NL
US
CRN PITG_12090
CRN PITG_12094
UK
US
CRN PITG_19373
CRN11
Crinklers correctly identified?
26. Results
• EC data
– reflects the similarity of the two isolates (99%
identical at amino acid level).
– from the 64 longest unique sequences, 59 are found in
both isolates.
– at least one isolate has unique Crinklers.
• UK/US/NL data
– Most Crinklers are shared
– US genotype shares Crinklers with UK and NL, but very
few shared between UK and NL
– each isolate has 2-3 unique Crinklers.
27. Results
• EC v UK/US/NL
– 64 Crinklers in EC, 59 in UK/US/NL
• 52 shared by both
• 12 exclusive to EC isolates,
• 7 exclusive to UK/US/NL
• ‘VLVVVP’ not the only recombination motif.
– ‘VLVALP’ is also found in 8/64 in EC super-
sequences and 7/59 in UK/US/NL data.
28. Conclusions
• Variable consensus sequence
• ‘Proportional representation’
– identifies Crinkler diversity
– estimates abundance.
• P.infestans isolates do contain ‘unique’ Crinklers.
– Present in a few or totally unique?
• Application of methods on UK/US/NL data
reflects the greater sequence diversity between
the data.
• Crinklers prediction verified through sequencing.
29. Comments
• Method could bias the isolate with the most
reads – normalisation.
• Method shows diversity at the start of the
recombination hotspot
– can’t say much about the rest of the gene.
VLVVVPEQDGTISNDMSAVTTPLTV 1000
VLVVVPEQDGTISNDMSAVTTPLTVABCDEFG 250
VLVVVPEQDGTISNDMSAVTTPLTVHIJKLMN 250
VLVVVPEQDGTISNDMSAVTTPLTVOPQRST 200
VLVVVPEQDGTISNDMSAVTTPLTVUVWXYZ 300
30. • To-do:
– Larger-scale PCR/sequencing to confirm
shared/unique Crinklers in isolates.
– What would we find with longer (targeted?)
sequences?
PHYTOPHTHORA INFESTANSTwo mating types: A1 and A2Until recently, the A1 mating type was predominant in growing areas outside Mexico, but the A2 has lately become as important. The presence of both mating types allows for sexual recombination and production of new and possibly more aggressive isolates.Belongs to Clade 1 within the genus PhytophthoraGenerally no visible symptoms for at least 2 days, after which time small areas of necrosis are visible. Within 1-2 more days sporangiophores are produced, with many sporangia (from which zoospores can germinate).Leaves, stems and tubers are susceptibleTransmitted by spores and infected tubers