Motif and Pattern Databases And
some practical approaches
Sucheta Tripathy
10/2/2016
Motifs
• Defined as a nucleotide or amino acid
sequence pattern that is widespread and is
associated with a biological function.
– A sequence motif is not the same as a structural motif.
– A sequence motif residing in the coding region
may encode a structural motif.
– Non-coding nucleotide motifs may have a regulatory
role, e.g. as recognition sites for DNA-binding
proteins.
Motifs, profiles and patterns
• Conserved region of a DNA or protein – Motif
• Qualitative expression of a motif – Pattern
– Regular Expression
– C[TA]TTG{X}
• Quantitative expression of a motif – Profile
– Position Specific Scoring Matrices (PSSMs)
– Weight matrices
Motifs/Patterns
N{P}[ST]{P}
[FILV]Qxxx[RK]Gxxx[RK]xx[FILVWY]
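[] -> any one of the listed residues (probability information is lost)
{} -> any residue except those listed
() -> repeat count for the preceding element
x -> any residue
^ -> beginning of the sequence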
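The notation above maps closely onto ordinary regular expressions. The following is a minimal sketch of that mapping, assuming lowercase `x` means "any residue" and that `(n)` repeats the preceding element; the function name `prosite_to_regex` and the test sequences are hypothetical and only for illustration.

```python
import re


def prosite_to_regex(pattern: str) -> str:
    """Translate the slide's pattern notation into a Python regular expression."""
    out = []
    i = 0
    while i < len(pattern):
        ch = pattern[i]
        if ch == "{":        # {P} -> any residue except P
            j = pattern.index("}", i)
            out.append("[^" + pattern[i + 1:j] + "]")
            i = j + 1
        elif ch == "(":      # (3) -> repeat the preceding element 3 times
            j = pattern.index(")", i)
            out.append("{" + pattern[i + 1:j] + "}")
            i = j + 1
        elif ch == "x":      # x -> any residue (assumed lowercase only)
            out.append(".")
            i += 1
        elif ch == "-":      # element separator, carries no meaning in a regex
            i += 1
        else:                # literals, [..] groups and ^ pass through unchanged
            out.append(ch)
            i += 1
    return "".join(out)


# The two patterns shown on the slide.
glyco = prosite_to_regex("N{P}[ST]{P}")
long_pattern = prosite_to_regex("[FILV]Qxxx[RK]Gxxx[RK]xx[FILVWY]")

# Hypothetical test sequences: the first contains N-G-S-A, the second N-P-S-A.
hit = re.search(glyco, "MKNGSAP")
miss = re.search(glyco, "MKNPSAP")
```

Because `[ST]` only records which residues are allowed, a match says nothing about how often S or T occurs at that position, which is the information loss noted above.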
Profiles
• Quantitative representation.
• More useful when a training dataset of aligned sequences is available.
TCTAGAAGATGGCAGTGGCGAAGA
TCTAGAAAATGACAGTGGCGAAGA
TCTAGAAAATGGCAGTAGCGAAGA
TCTACTAAATGATAGTAGCGAAGA
A   0,   0,   0, 100,   0,  75, 100,  75  ATG
T 100,   0, 100,   0,   0,  25,   0,   0  ATG
G   0,   0,   0,   0,  75,   0,   0,  25  ATG
C   0, 100,   0,   0,  25,   0,   0,   0  ATG
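The percentages in the matrix can be recomputed from the four aligned sequences by counting bases column by column. This is a small sketch of that calculation; the function name `build_profile` is hypothetical, and the sequences are the ones from the slide with the stray space removed.

```python
from collections import Counter

sequences = [
    "TCTAGAAGATGGCAGTGGCGAAGA",
    "TCTAGAAAATGACAGTGGCGAAGA",
    "TCTAGAAAATGGCAGTAGCGAAGA",
    "TCTACTAAATGATAGTAGCGAAGA",
]


def build_profile(seqs, alphabet="ATGC"):
    """Return {base: [percentage at each alignment column]}."""
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("sequences must be aligned to the same length")
    profile = {base: [] for base in alphabet}
    for column in zip(*seqs):          # walk the alignment one column at a time
        counts = Counter(column)
        for base in alphabet:
            profile[base].append(100 * counts[base] / len(seqs))
    return profile


profile = build_profile(sequences)

# Print the first eight columns, the part shown on the slide before "ATG".
for base in "ATGC":
    print(base, [int(p) for p in profile[base][:8]])
```

Unlike the bracket notation of a pattern, each column here keeps the relative frequency of every base.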
De novo prediction of Motifs
• MEME, EXTREME, AlignACE, Amadeus,
CisModule, FIRE, Gibbs Motif Sampler,
PhyloGibbs, SeSiMCMC, ChIPMunk,
Weeder, SCOPE, MotifVoter, and Mprofiler
MEME (Multiple Expectation Maximization for
Motif Elicitation)
Figure 3. Resources
MacIsaac KD, Fraenkel E (2006) Practical Strategies for Discovering Regulatory DNA Sequence Motifs. PLoS Comput Biol 2(4): e36.
doi:10.1371/journal.pcbi.0020036
http://journals.plos.org/ploscompbiol/article?id=info:doi/10.1371/journal.pcbi.0020036
MRLSFVPLLQLSRLVVSTQHSTKMSTVYRTCKMNEIALSLLAPTQPLDADQGVMSPMASSDQ
TTSIGDFRFLRTHHDKEERGLLVTSLTKGLAETSFPYRYTSMCATICSITHSRADAAPAKQAH
Prosite
ATGCGTCTCTCCTTCGTTCCACTACTGCAGCTCTCTCGTCTGGTCGTTAGCACACAACATAGTACGAAAATGA
GCACAGTATACCGTACCTGCAAAATGAATGAAATAGCTCTCTCGTTGCTGGCGCCAACGCAGCCATTGGACG
CTGACCAGGGTGTTATGTCACCGATGGCCTCATCAGACCAGACAACCTCAATTGGTGACTTTCGGTTCCTGA
GAACCCACCACGATAAAGAAGAGCGGGGCTTGCTGGTTACCAGCCTCACAAAAGGTTTGGCTGAAACATCAT
TTCCGTATCGATACACTTCGATGTGCGCAACTATTTGTTCAATTACGCATTCTCGGGCAGATGCTGCGCCTGC
GAAGCAGGCGCACTA
Scan this sequence to find the motif
OR build a PSSM from aligned sites
ATGCGTCTCTC
ATGCCTCTGTC
ATGCGTCTCTC
ATGCGTCTCTC
ATGCGTCTATC
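One way to build a PSSM from the five aligned sites and use it to scan a sequence is sketched below. It assumes a uniform background frequency of 0.25 per base and a pseudocount of 1, neither of which is stated on the slides; the function names `build_pssm` and `scan` are hypothetical, and `target` is the first 30 bases of the nucleotide sequence shown earlier.

```python
import math

aligned_sites = [
    "ATGCGTCTCTC",
    "ATGCCTCTGTC",
    "ATGCGTCTCTC",
    "ATGCGTCTCTC",
    "ATGCGTCTATC",
]


def build_pssm(sites, alphabet="ACGT", pseudocount=1.0, background=0.25):
    """Return one {base: log2-odds score} dictionary per alignment column."""
    n = len(sites)
    pssm = []
    for column in zip(*sites):
        scores = {}
        for base in alphabet:
            # Pseudocounts keep unseen bases from producing log(0).
            freq = (column.count(base) + pseudocount) / (n + pseudocount * len(alphabet))
            scores[base] = math.log2(freq / background)
        pssm.append(scores)
    return pssm


def scan(sequence, pssm):
    """Score every window of the sequence; return a list of (start, score)."""
    width = len(pssm)
    hits = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        score = sum(pssm[i][base] for i, base in enumerate(window))
        hits.append((start, score))
    return hits


pssm = build_pssm(aligned_sites)
target = "ATGCGTCTCTCCTTCGTTCCACTACTGCAG"
best_start, best_score = max(scan(target, pssm), key=lambda hit: hit[1])
print(best_start, round(best_score, 2))
```

The highest-scoring window starts at offset 0, where the target matches the consensus of the aligned sites. In practice a score threshold would be chosen to decide which windows count as motif occurrences.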
