The document summarizes the process of predicting miRNAs related to late blight disease of potato using bioinformatics approaches. It discusses how miRNAs play an important role in host-pathogen interactions and regulates genes. The author proposes to identify potential pathogenic miRNAs and their targets in Solanum tuberosum in response to Phytophthora infestans infection using available EST sequences. A multi-step computational approach including screening ESTs, identifying pre-miRNAs, predicting secondary structures, and identifying targets is outlined. Relevant literature on plant miRNA prediction and P. infestans is also reviewed.
3. Many recent studies show that miRNA (~22 nt) play a very important role in
host-pathogen interaction by silencing genes either by destructing or blocking of
translation of mRNA.
They are often well conserved within plant and animal kingdoms (Millar and
Waterhouse, 2005), and are produced from either their own genes or from introns.
Primary miRNA transcripts (pri-miRNA) are transcribed by RNA polymerase II
which contains self-complementary regions that fold to form imperfect double-
stranded stem-loop structures
3
4. 4
Bhatt et al. (2011). J Am Soc
Nephrology. 23(3): 400–404.
Passenger strand associates with an AGO protein to form the miRNA induced
silencing complex (miRISC) and guides the complex to the target transcript (Bartel,
2004; Ghildiyal and Zamore, 2009).
5. 5
Late blight of potato is the most important fungal disease in potato cultivation.
P. infestans genome (~240 megabases), is the largest and most complex genomes in
Phytophthora family, which results from repetitive DNA accounting for ~74% of the
genome (Brian et al., 2009)
Fig. Repeat-driven genome expansion in Phytophthora infestans.
BJ Haas et al. (2009) Nature 000, 1-6 ,doi:10.1038/nature08358
6. 6
The fast-evolving genes are localized to highly dynamic and expanded regions of the P.
infestans genome.
Sexual reproduction in P. infestans develops genetic variation via recombination.
This helps in rapid adaptability of the pathogen to host plants.
7. 7
In P. infestans populations it is seen that there is constant genetic variation which is
destructive threat to a world that relies heavily on potato production (Garelik, 2002)
(Fry and Goodwin, 1997).
Due to dynamic nature of P. infestans genome, study of miRNA in this is not very
imperative.
So it is highly imperative to study the miRNA in counterpart of P. infestans in Solanum
tuberosum.
8. 8
Different strategy are used to detect miRNAs such as through forward genetics, direct
cloning or using Bioinformatics approaches.
Sometimes it becomes difficult to validate miRNAs using experimental approaches due
to the presence of degraded products of mRNAs, endogenous non-coding RNAs etc. in
the sample.
As we know miRNAs have a unique secondary structure ranging from ~21 to 24
nucleotides in length and are conserved between different species.
Also a large amount of data is available such as genomes and EST sequences in
database
9. 9
On such basis researcher developed an expressed sequence tag (EST) approach to
identify miRNAs (Zhang et al., 2005)
EST (expressed sequence tag) is a short sub-sequence of a cDNA sequence
cDNA is complementary to mRNA, so the ESTs represent portions of expressed genes.
10. 10
EST analysis has some substantial advantages over the other approaches such as:
(1) provides direct evidence for miRNA expression that cannot be inferred from genomic
sequence surveys, and
(2) miRNA identification can be conducted without highly specialized software
11. 11
To predict potential pathogenic miRNAs related to late blight disease of potato
To predict the targets of predicted miRNAs and their network.
12. Prediction of potential pathogenic miRNAs related to late blight disease of potato
12
Author Year Work done
Grad et al. 2003 Identified computationally and experimentally
microRNAs of C. elegans.
Bonnet et al. 2004 Presented a genome-wide computational approach to
detect miRNA genes in the Arabidopsis thaliana
genome
Wang et al. 2004 Predicted and identified 83 new microRNAs of
Arabidopsis thaliana and their mRNA targets.
Zhang et al. 2007 Identified 30 potential microRNAs and their targets in
cotton using bioinformatics approaches.
13. 13
Author Year Work done
Xie et al. 2007 Used previously known miRNAs from Arabidopsis, rice
and other plant species against both expressed sequence
tags (EST) and genomic survey sequence (GSS) databases
to search for potential miRNAs in B. napus.
Vetukuri et al. 2012 Gave evidence for small RNAs homologous to effector-
encoding genes and transposable elements in the oomycete,
Phytophthora infestans.
YongJun et al. 2012 Identified 43 new miRNAs using a homology search based
on expressed sequence tag (EST) analysis and miRNA
precursor secondary structure in Panicum miliaceum
Fahlgren et al. 2013 Showed Phytophthora have distinct endogenous small
RNA populations that include short interfering and
microRNAs.
14. 14
Author Year Work done
Pandey et al. 2007 Identified new stress-induced microRNA and their targets
in wheat using computational approach (which is
amalgamation of bioinformatics software and perl script).
In addition, 14 potential target genes were subsequently
predicted
Panda et al. 2014 Identified and characterized conserved miRNAs in garlic
expressed sequence tags (ESTs) through computational
means.
Cui et al. 2014 Predicted and validated potential pathogenic microRNAs
involved in Phytophthora infestans infection through
Bioinformatics approaches.
15. Prediction of the targets of the predicted miRNAs and the target network
15
Author Year Work done
Rhoades et al. 2002 Predicted regulatory targets for 14 Arabidopsis microRNAs
(miRNAs) by identifying mRNAs with near complementarity.
This suggests that many plant miRNAs act similarly to small
interfering RNAs and direct mRNA cleavage.
The targeting of developmental transcription factors suggests
that many plant miRNAs function during cellular differentiation
to clear key regulatory transcripts from daughter cell lineages.
Thomson et al. 2014 Summarizes and critiques the existing experimental techniques for
miRNA target identification. They laid more emphasis on
combining multiple strategies to obtain a comprehensive high-
confidence description of miRNA targeting networks.
16. 16
To search potential miRNAs, one has to go through the following steps
Contin…
Remove protein encoding sequence
Remove redundant miRNAs
Remove candidate not meeting the
criteria
Blastx
Candidate pathogenic Pre-miRNA
Novel potential miRNA related to late blight disease of potato
Protein sequence database
Prediction of secondary structure
Select sequences with 0-4 mismatch
without any gap and minimum E-value
IdentificationofmiRNAs
Screening of EST related to
Phytoplasma infection
sequences
EST sequences
EST sequences of Solanum tuberosum All known mature miRNAs
Blastn
Non- redundant miRNAs
17. Steps continue …
17
Figure – Flowchart of computational prediction of potential pathogenic miRNAs related to late
blight disease of potato
GO annotation
Refseq mRNA and assembled
EST sequences of potato
Characterization of Target Genes
miRNA Target Gene identification
PredictedmiRNAsTargetGene
Identification
Formation of miRNA- miRNA network
Identification of the key genes related to
Phytoplasma infection
Novel potential miRNA
related to late blight
disease of potato
18. Library facility available at IARI/IASRI and computational facility available
will be utilised.
18
19. Pandey, B. et al. (2013). Identification of new stress-induced microRNA and their targets in
wheat using computational approach. Plant Signaling & Behavior, 8, 5
Panda, D. et al. (2014). Computational identification and characterization of conserved miRNAs
and their target genes in garlic (Allium sativum L.) expressed sequence tags. Gene, 537, 333–
342
Thomson, D.W. et al. (2011). Experimental strategies for microRNA target identification. Nucleic
Acids Research, 39, 16
Bonnet, E. et al. (2004). Detection of 91 potential conserved plant microRNAs in Arabidopsis
thaliana and Oryza sativa identifies important target genes. PNAS, 101(31), 11511–11516
19
20. Xie, F.L. et al. (2007). Computational identification of novel microRNAs and targets in
Brassica napus. FEBS Letters 581, 1464–1474
Ghosh, Z., Mallick, B. and Chakrabarti, J. (2009). Cellular versus viral microRNAs in host-
virus interaction. Nucleic Acids Research, 37(4), 1035–1048
Grad, Y., Aach, J., Hayes, G.D., Reinhart, B.J., Church, G.M., Ruvkun, G., and Kim, J., (2003).
Computational and experimental identification of C. elegans microRNAs. Molecular Cell
11, 1253–1263
Telles, G.P. and da Silva, F.R. (2001). Trimming and clustering sugarcane ESTs. Genetics and
Molecular Biology, 24(1-4), 17-23
Haas, B.J. et al. (2009). Genome sequence and analysis of the Irish potato famine pathogen
Phytophthora infestans. Nature, 461(7262), 393–398
20
21. Han, Y.S., Zhu, B.Z., Luan, F.L., Zhu, H.L., Shao, Y., Chen, A.J., Lu, C.W. and Luo, Y.B.
(2010). Conserved miRNAs and their targets identified in lettuce (lactuca) by EST
analysis. Gene, 463(1–2), 1–7
Chou, H. and Holmes, M.H., (2001). DNA sequence quality trimming and vector removal.
Bioinformatics, 17(12), 1093-1104
Janga, S.C. and Vallabhaneni, S., (2011). MicroRNAs as post-transcriptional machines and
their interplay with cellular networks. Adv Exp Med Biology, 722(2), 59–74
Cui, J., Luan, Y., Wang, W. and Zhai, J., (2014). Prediction and validation of potential
pathogenic microRNAs involved in Phytophthora infestans infection. Molecular Bology
Report 41, 1879-1889
Kale, S.D., (2012). Oomycete and fungal effector entry, a microbial trojan horse. New
Phytology 193(4), 874–881
21
22. Luo, Y. and Zhang, S., (2009). Computational prediction of amphioxus microRNA genes and
their targets. Gene, 428(1–2), 41–46
Planell-Saguera, M. and Rodiciob, M.C. (2011). Analytical aspects of microRNA in
diagnostics: A review. Analytica Chimica Acta, 699, 134– 152
Rhoades, M.W. et al. (2002). Prediction of Plant MicroRNA Targets. Cell, 110, 513–520
Rhoades, M.W. and Bartel, D.P., (2004). Computational Identification of Plant MicroRNAs
and Their Targets, Including a Stress-Induced miRNA. Molecular Cell, 14, 787–799
Fahlgren, N., et al. (2013). Phytophthora Have Distinct Endogenous Small RNA Populations
That Include Short Interfering and microRNAs. PLOS ONE, 8, 10
22
23. Nowicki, M., Fooled, M.R., Nowakowska, M. and Kozik, E.U., (2012). Potato and tomato late
blight caused by Phytophthora infestans: an overview of pathology and resistance breeding.
Plant Dis, 96(1), 4–17
Vetukuri, R.R., et al. (2012). Evidence for Small RNAs Homologous to Effector- Encoding
Genes and Transposable Elements in the Oomycete Phytophthora infestans. PLOS ONE, 7,12
Scaria, V., Hariharan, M., Maiti, S., Pillai, B. and Brahmachari, S.K., (2006). Host-virus
interaction: a new role for microRNAs. Retrovirology, 3, 68
Sharma, K., Butz, A.F. and Finckh, M.R., (2010). Effects of host and pathogen genotypes on
inducibility of resistance in tomato (solanum lycopersicum) to Phytophthora infestans. Plant
Pathology, 59(6), 1062–1071
Wang, X.J., Reyes, J.L., Chua, N.H. and Gaasterland, T., (2004). Prediction and identification of
Arabidopsis thaliana microRNAs and their mRNA targets. Genome Biology, 5(9), R65
23
24. YongJun, W.U. et al. (2012). Computational prediction and experimental verification of
miRNAs in Panicum miliaceum L.. Science China Life Science, 55, 807–817
Xu, Z.Q., Qin, Q., Ge, J.C., Pan, J.L., Xu, X.F. (2012). Bioinformatic identification and
validation of conservative microRNAs in Ictalurus punctatus. Molecular Biology
Report, 39(12), 10395–10405
Zhang, B., Pan, X., Cobb, G.P., Anderson, T.A. (2006). Plant microRNA: a small regulatory
molecule with big impact. Devlelopmental Biology, 289(1), 3–16
Zhang, B.H., et al. (2005). Identification and characterization of new plant microRNAs using
EST analysis. Cell Research, 15, 336–360
Zhang, B.H., Wang, Q.L., Wang, K.B., et al. (2007). Identification of cotton microRNAs and
their targets. Gene, 397, 26-37
24
Conserved gene order across three homologous Phytophthora scaffolds. Genome expansion is evident in regions of conserved gene order, a consequence of repeat expansion in intergenic regions. Genes are shown as turquoise boxes, repeats as black boxes. Collinear orthologous gene pairs are connected by pink (direct) or blue (inverted) bands. The expansion of regions between conserved blocks results from increased density of repetitive elements. A scaffold is composed of contigs and gaps.
Conserved gene order across three homologous Phytophthora scaffolds. Genome expansion is evident in regions of conserved gene order, a consequence of repeat expansion in intergenic regions. Genes are shown as turquoise boxes, repeats as black boxes. Collinear orthologous gene pairs are connected by pink (direct) or blue (inverted) bands. The expansion of regions between conserved blocks results from increased density of repetitive elements. A scaffold is composed of contigs and gaps.
asexual -hyphal growth, sporulation, sporangia germination (zoospore/direct germination). Sexual- A1 and A2 meet & oospores which develop genetic variation via recombination. Center of ori for both Andean region. Solanum demissum- R gene overcome by new strains of pathogen. Europe only the A1 strain. United States, in Philadelphia and New York City in early 1843. Atlantic Ocean with a shipment of seed potatoes for Belgian farmers in 1845. copper sulfate & Lack of genetic variability & dependency on a single variety of potato, the Irish Lumper, Biotroph & necrotroph. many recent studies show that miRNA (~22 nt) play a very important role in host-pathogen interaction by silencing genes either by destructing or blocking of translation of mRNA.
RISC utilize sRNAs to program effector protein complex to recognize specific target nucleic acids in a sequence-dependent manner. Target recognition results in the suppression of activity of the target (degradation/recruitment of additional silencing factors). Effector are secreted by both fungal and oomycetes pathogens and are able to disrupt or change the host plant’s immune response enabling a successful infection (Kamoun, 2003). Most studied oomycete effectors are the RxLR and Crinkler (CRN) groups (have conserved peptide motifs required for translocation into the host).
caenorhabditis
Why remove protein encoding sequences? Prediction of secondary structure??
miRNA secondary structures must satisfy the following criteria
The minimum length of the pre-miRNA to be 60 ntd.
The pre-miRNA should be folded into appropriate stem loop hairpin secondary structure.
The mature miRNA sequence and its opposite miRNA strand should not have more than 6 ntd mismatch.
No loops or breaks should be allowed between the miRNA duplex.
The A+U content should be within 30-70 %.
Predicted secondary structures must have higher minimal folding free energy index (MFEI) and negative minimal folding free energy.
Refseq and assembled seq??
RISC utilize sRNAs to program effector protein complex to recognize specific target nucleic acids in a sequence-dependent manner. Target recognition results in the suppression of activity of the target (degradation/recruitment of additional silencing factors). Effector are secreted by both fungal and oomycetes pathogens and are able to disrupt or change the host plant’s immune response enabling a successful infection (Kamoun, 2003). Most studied oomycete effectors are the RxLR and Crinkler (CRN) groups (have conserved peptide motifs required for translocation into the host).