Identifying conserved promoter motifs and transcription factor binding sites in orthologous plant promoter collections

Loading...

Flash Player 9 (or above) is needed to view presentations.
We have detected that you do not have it on your computer. To install it, go here.

0 comments

Post a comment

    Post a comment
    Embed Video
    Edit your comment Cancel

    Favorites, Groups & Events

    Identifying conserved promoter motifs and transcription factor binding sites in orthologous plant promoter collections - Presentation Transcript

    1. Identifying conserved promoter motifs and transcription factor binding sites in orthologous plant promoter collections Endre Sebestyén, ARI-HAS, Martonvásár 8th International Symposium on “New development in green gene technology” 1-4 September 2009, Szeged, Hungary
    2. Transcription factors and binding sites
      • TFs bind short, often degenerate DNA sequences, the TFBSs
      • Variable length upstream sequence as promoter, with TFBSs
      • The TFBSs are usually conserved, in a nonconserved surrounding sequence
      • Some well known TFBSs
        • TATA-box
        • GC-box
        • CpG islands
      • Lots of other, less general TFBSs
      • Similarly expressed genes, or homologs should contain similar TFBSs in their promoters
    3. Transcription factors and binding sites
    4. Promoter analysis/TFBS search
      • Wet-lab methods
        • DNAse footprinting
        • Electrophoretic mobility shift assay
        • ChIP-Chip, ChIP-Seq
      • Bioinformatic methods
        • Search for experimentally verified sites
          • Consensus sequences
          • Matrices
        • De novo motif discovery
          • Oligo frequency
          • Phylogenetic footprinting
          • Other methods
      A 30 0 0 0 20 C 0 25 1 0 2 G 0 1 31 0 3 T 2 6 0 32 7
    5. Experimentally verified sites
      • TRANSFAC, JASPAR, PLACE, PlantCARE
    6. De-novo motif discovery
      • Orthologous gene groups
        • Evolutionary conserved functional sites
      • Co-regulated genes
        • In same tissue, body part
        • Developmental stage
        • Etc
    7. “Real” promoter structure
      • No general motifs
        • No TATA box, GC-box, etc
      • Lots of false positive TFBSs
        • With wet lab and most of the in silico methods
      • Sometimes no apparent common TFBSs between co-regulated genes
    8. Database of Orthologous Promoters
      • Orthologous promoter sequence collections
        • Based on a BLAST search with first exons of reference species
        • Plants (Viridiplantae)
          • Reference species: Arabidopsis thaliana
        • Chordates
          • Reference species: human
        • 500/1000/3000 bp 5’ upstream regions
          • Conserved sequence regions
          • Annotations
          • Xrefs to other databases
          • Annotated transcription start sites
    9. Database of Orthologous Promoters http://doop.abc.hu
    10. DoOP - Clusters
    11. DoOP - Subsets
      • Cluster > Subset
        • Subset: collections of evolutionary monophyletic sequences in a cluster
        • Plant subsets
          • Brassicaceae
            • Arabidopsis thaliana
            • Brassica species
          • Eudicotyledons
            • Grape, Solanum species, papaya, tobacco
          • Magnoliophyta
            • Maize, rice
          • Viridiplantae
    12. DoOP - Subsets
    13.  
    14.  
    15. Gene types (Gene Ontology)
      • Standardized annotation for genes
        • Biological process
          • What does it do?
            • transcription, translation, stress response, etc
        • Cellular component
          • Where is it located?
            • membrane, ribosome, cytosol, etc
        • Molecular function
          • How does it work?
            • dehydrogenase, ATP binding, etc
    16. Gene types (Gene Ontology)
      • 500 bp promoters
      • Search for significantly enriched terms in annotation
        • Brassicaceae
        • Eudicotyledons
        • Magnoliophyta
        • Viridiplantae
          • BP: transcription, translation, protein folding, stress response
          • CC: plasma membrane, ribosome parts
          • MF: ATP/GTP binding, DNA binding, ribosome parts
    17. Motif generation
      • Phylogenetic footprinting
        • Functional TFBSs should be conserved
        • Local sequence alignment
        • Define conserved regions
    18. Motif generation Brassicaceae eudicotyledons Magnoliophyta
    19. Motif statistics Motif number 500 1000 3000 Brassicaceae 323411 410720 893788 eudicotyledons 13863 20192 34353 Magnoliophyta 2009 2211 1938 Viridiplantae 589 565 372
    20. Motif statistics % conserved 500 1000 3000 Brassicaceae 22 19 16 eudicotyledons 5 3 2 Magnoliophyta 6 5 2 Viridiplantae 4 2 1 Avg length 500 1000 3000 Brassicaceae 9 9 9 eudicotyledons 7 7 7 Magnoliophyta 8 9 8 Viridiplantae 9 9 9
    21. TFBS databases
      • Lots of redundant data
      • Low quality, not updated
      • More than a 100 different version for TATA box
      Database TFBSs TRANSFAC 977 JASPAR 18 PLACE 416 PlantCARE 646 ABS 650 AGRIS 72
    22. Synthetic biology
      • Synthetic biology
        • iGEM competition
        • BioBricks
        • MIT Registry of Standard Biological Parts
          • UV responsive promoter
          • Promoter expressed in roots
          • Etc
      • Synthetic promoters
        • Define basic promoter elements
        • Build and use custom-made promoters
        • Gene expression more or less where and when you want it
    23. SNP conservation
      • Gene expression level changes because
        • regulatory elements change
        • usually NOT protein coding regions
      • Conserved promoter regions might be functional regulatory elements
        • Search for SNPs in this regions
        • These SNPs might be interesting for breeders as they are likely to be functional ones
    24. Thank you for your attention! Acknowledgements: Endre Barta Tibor Nagy
    SlideShare Zeitgeist 2009

    + Endre SebestyénEndre Sebestyén Nominate

    custom

    330 views, 0 favs, 2 embeds more stats

    More info about this document

    © All Rights Reserved

    Go to text version

    • Total Views 330
      • 310 on SlideShare
      • 20 from embeds
    • Comments 0
    • Favorites 0
    • Downloads 3
    Most viewed embeds
    • 19 views on http://biopunk.hu
    • 1 views on http://74.125.115.132

    more

    All embeds
    • 19 views on http://biopunk.hu
    • 1 views on http://74.125.115.132

    less

    Flagged as inappropriate Flag as inappropriate
    Flag as inappropriate

    Select your reason for flagging this presentation as inappropriate. If needed, use the feedback form to let us know more details.

    Cancel
    File a copyright complaint
    Having problems? Go to our helpdesk?

    Categories