SlideShare a Scribd company logo
An introduction to
Protein Families
and databases
Jamia Millia Islamia
Date 2
Pitching in
Protein Families and the need
for classification
Domains & Motifs with GPCRs
as example
Vrinda Sharma
Groundwork
Sequence Features
Protein Signatures
Patterns & Profiles
HMMs
Wanchha Maurya
Showstopper
DUFs- a story worth reciting
Databases of Protein Families
Demistifying the Hypotheticals
Rohit Satyam
Need for classification
Date 3
Proteins can be classified into
groups based on sequence or
structural similarity.
These groups often contain
well characterised proteins
whose function is known.
Thus, when a novel protein is
identified, its functional
properties can be proposed
based on the group to which
it is predicted to belong.
Source: EMBL-EBI Training Course:
https://www.ebi.ac.uk/training-
beta/online/courses/protein-classification-intro-ebi-
resources/protein-classification/what-are-protein-
families/
Protein Families in Brief
Date Your Footer Here
Group of Proteins which
• Shares a common evolutionary origin
• Performs related functions
• Similar in sequence or structure.
Superfamily
Family A
Subfamily
A1
Subfamily
A2
Family B Family C
Subfamily
C1
Subfamily
C2
Subfamily
C3
Domain and Motifs
aren’t synonyms
Date
Domains are distinct functional and/or structural
units in a protein.
They are responsible for a particular function or
interaction, contributing to the overall role of a
protein.
Motifs are secondary structure that are formed
due to interaction between alpha-helices and
beta-sheets.
Structure of the SH3 domain
Domain composition of Nck. Nck contains three
SH3 domains plus another domain known as SH2
G-Protein Coupled receptors
An example to understand Protein
Families
G-Protein Signaling
Date Your Footer Here
• Regulator of GPS domains are protein
structural units that activate GTPase.
• sequences belonging to RGS protein
family(multifunctional GTPase accelerating
protein).
• All RGS protein family member contains RGS
domain ,some (RGS1) consist little more than
domain .
• RGS3 and RGS6 contain additional domains for
other functions .
They have seven transmembrane
domains, and interact with
specialized proteins (called G
proteins) to influence intracellular
pathways after binding
extracellular signals
G-protein-coupled receptors
and cancer
Dorsam et al 2007
Date Your Footer Here 9
Level2
Level 1
Sub-family
Superfamily GPCRs
Rhodopsin
-like GPCRs
Opsins
Red-
sensitive
opsins
Green-
sensitive
opsins
Blue-
sensitive
opsins
APJ
receptors
Relaxin
Receptors
cAMP
Receptors
Secretin like-
GPCRs
Etc…
The GPCR superfamily hierarchy. Families and subfamilies to which the short-wave-sensitive opsin 1
protein belongs are highlighted in violet.
GPCRs
Regulates: Biological processes, including photoreception, regulation of the immune system, and nervous system
transmission.
Similarity
increases
Date 10
What Are Sequence Features?
1.Active Site
2.Binding Site
3. Post Translational Modifications (PTMs)
4. Repeats
Group of amino acid that confer certain characteristics upon a protein ,and maybe important for
overall function
Date 11
Protein Signatures
• To classify protein’s family and to
predict the domains or sequence
features we use computational tools
and that tools are the predictive
models known as protein signatures.
• Model refines distantly related
sequences in database are identified.
• Once the model is mature, signature
is ready for protein sequence
analysis.
The Purpose and the Process
Date 12
How do Protein Signature compare to other
ways of classifying proteins?
• Multiple sequence alignment gives
us information about classification
which we use to identify amino acid
residues that are conserved in
distantly proteins.
• Protein signature built from
multiple sequence alignment are
usually better at detecting
divergent homologues than
pairwise comparison method.
Identifying the conserved residues
Date 13
Signature types
Patterns
Profiles
Fingerprints
Hidden Markov Models (HMMs)
Approaches to generate signatures
Patterns & Profiles
Date 14
Signature Types
Patterns can recognize sequence
features such as binding sites or
active sites of enzymes consist of a
only few amino acids.
Ex: PROSITE database.
1 2
Profiles are built by converting
multiple sequence alignment into
position specific scoring system
(PMMs).
Ex: CDD, HAMAP, PROSITE and
PRODOM.
Fingerprints and HMMs
Date 15
Signature Types
3 4
Fingerprints are composed of multiple
short conserved motifs which are drawn
from sequence alignment. They can
distinguish individual subfamilies within
protein families.
Ex : PRINTS database.
Hidden Markov models (HMMs) are
used to convert multiple sequence
alignment into position specific
scoring system.
Ex: Pfam, SMART, TIGRFAM,
PANTHER, SFLD, Superfamily
and Gene 3D.
Date 16
Families in search of function
Domains of unknown function (DUFs)
Popovic et al., 2017.,Scientific reports,
The function of the Domain is yet to be discovered
The DUF naming scheme was introduced by Chris
Ponting through the addition of DUF1 and DUF2 to
the SMART database
Goodacre et al 2014.
Databases at Glance
Date 18
Databases of
Protein Families
5. PRINTS
Combine Multidomain/motif
information for family categorization.
MSA and Fuzzy Logic (Regex)
6. MobiDB
Homology, Predicted, Curated
Intrinsically Disordered regions
database
7. TIGRFAM
MSA, HMM mainly for prokaryotic
proteins
8. SUPERFAMILY2
Using HMM and protein Sequences
Domain organisation, sequence alignments
and protein sequence details can be
obtained for query sequence
4. PRIDE
Mass-Spec based identification
Provide PTM information and Literature
Evidences
3. Prosite
MSA of homologous Proteins;Based on
Prorules
2. PIRSF
MSA and Clustering with hight similarity
thresholds
1. Pfam
Protein Family, Domains, Motifs and Repeats
(Generated from MSA and HMMs)
1
3 5
7
2
4
8
6
Date 19
Interpro-A Protein Family Compendium
Date 20
GOFeat Tutorial
Lorem ipsum dolor sit amet, consectetur adipiscing elit.
Protein Under Investigation: LOC645967
Date 21
InterPro Tutorial
Protein Under Investigation: LOC645967
Date 22
References
• Dorsam, R.T. and Gutkind, J.S., 2007. G-protein-coupled receptors and cancer. Nature reviews
cancer, 7(2), pp.79-94.
• Bateman, Alex, Penny Coggill, and Robert D. Finn. "DUFs: families in search of function." Acta
Crystallographica Section F: Structural Biology and Crystallization Communications 66, no. 10
(2010): 1148-1152.
• Goodacre, Norman F., Dietlind L. Gerloff, and Peter Uetz. "Protein domains of unknown function are
essential in bacteria." MBio 5, no. 1 (2014).
• EMBL-EBI Training Course: https://www.ebi.ac.uk/training-beta/online/courses/protein-
classification-intro-ebi-resources/protein-classification/what-are-protein-families/
Date 23
Thanks
Drop in
@RohitSatyam1
+91 9870953351
Jamia Millia Islamia University

More Related Content

What's hot

Rasmol
RasmolRasmol
Biological database
Biological databaseBiological database
Biological database
Iqbal college Peringammala TVM
 
Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASE
PrashantSharma807
 
Whole genome shotgun sequencing
Whole genome shotgun sequencingWhole genome shotgun sequencing
Whole genome shotgun sequencing
Goutham Sarovar
 
Shotgun and clone contig method
Shotgun and clone contig methodShotgun and clone contig method
Shotgun and clone contig method
Dr. Naveen Gaurav srivastava
 
BLAST
BLASTBLAST
Swiss prot database
Swiss prot databaseSwiss prot database
Swiss prot database
sagrika chugh
 
Electrophoretic mobility shift assay
Electrophoretic mobility shift assay Electrophoretic mobility shift assay
Electrophoretic mobility shift assay
iqraakbar8
 
Protein sequence databases
Protein sequence databasesProtein sequence databases
Protein sequence databases
Vidya Kalaivani Rajkumar
 
Protein information resource (PIR)
Protein information resource (PIR)Protein information resource (PIR)
Protein information resource (PIR)
ShivaniShewale2
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
ALLIENU
 
History and scope in bioinformatics
History and scope in bioinformaticsHistory and scope in bioinformatics
History and scope in bioinformatics
KAUSHAL SAHU
 
BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)
Ariful Islam Sagar
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
Asad Afridi
 
Major resources of bioinformatics 2
Major resources of bioinformatics 2Major resources of bioinformatics 2
Major resources of bioinformatics 2
Mohd Affan
 
NCBI
NCBINCBI
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
SATHIYA NARAYANAN
 
ENTREZ.ppt
ENTREZ.pptENTREZ.ppt
ENTREZ.ppt
kishoreGupta17
 
Fasta
FastaFasta
EMBL-EBI
EMBL-EBIEMBL-EBI
EMBL-EBI
Sayma Zerin
 

What's hot (20)

Rasmol
RasmolRasmol
Rasmol
 
Biological database
Biological databaseBiological database
Biological database
 
Introduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASEIntroduction OF BIOLOGICAL DATABASE
Introduction OF BIOLOGICAL DATABASE
 
Whole genome shotgun sequencing
Whole genome shotgun sequencingWhole genome shotgun sequencing
Whole genome shotgun sequencing
 
Shotgun and clone contig method
Shotgun and clone contig methodShotgun and clone contig method
Shotgun and clone contig method
 
BLAST
BLASTBLAST
BLAST
 
Swiss prot database
Swiss prot databaseSwiss prot database
Swiss prot database
 
Electrophoretic mobility shift assay
Electrophoretic mobility shift assay Electrophoretic mobility shift assay
Electrophoretic mobility shift assay
 
Protein sequence databases
Protein sequence databasesProtein sequence databases
Protein sequence databases
 
Protein information resource (PIR)
Protein information resource (PIR)Protein information resource (PIR)
Protein information resource (PIR)
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
History and scope in bioinformatics
History and scope in bioinformaticsHistory and scope in bioinformatics
History and scope in bioinformatics
 
BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)
 
Introduction to Bioinformatics
Introduction to BioinformaticsIntroduction to Bioinformatics
Introduction to Bioinformatics
 
Major resources of bioinformatics 2
Major resources of bioinformatics 2Major resources of bioinformatics 2
Major resources of bioinformatics 2
 
NCBI
NCBINCBI
NCBI
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
 
ENTREZ.ppt
ENTREZ.pptENTREZ.ppt
ENTREZ.ppt
 
Fasta
FastaFasta
Fasta
 
EMBL-EBI
EMBL-EBIEMBL-EBI
EMBL-EBI
 

Similar to Introduction to Protein Families and Databases

Protein database
Protein databaseProtein database
Protein database
Khalid Hakeem
 
Protein Chemistry-Proteomics-Lec1_Intro.ppt
Protein Chemistry-Proteomics-Lec1_Intro.pptProtein Chemistry-Proteomics-Lec1_Intro.ppt
Protein Chemistry-Proteomics-Lec1_Intro.ppt
Sachin Teotia
 
Lecture__on__Proteomics_Introduction.ppt
Lecture__on__Proteomics_Introduction.pptLecture__on__Proteomics_Introduction.ppt
Lecture__on__Proteomics_Introduction.ppt
Sachin Teotia
 
An Overview to Protein bioinformatics
An Overview to Protein bioinformaticsAn Overview to Protein bioinformatics
An Overview to Protein bioinformatics
Joel Ricci-López
 
Characterizing Protein Families of Unknown Function
Characterizing Protein Families of Unknown FunctionCharacterizing Protein Families of Unknown Function
Characterizing Protein Families of Unknown Function
Morgan Langille
 
1Pfam.pptx
1Pfam.pptx1Pfam.pptx
1Pfam.pptx
Vetico
 
Bioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sirBioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sir
KAUSHAL SAHU
 
NIH-mar2604.rm.ppt
NIH-mar2604.rm.pptNIH-mar2604.rm.ppt
NIH-mar2604.rm.ppt
Chandrakanth R
 
Data retrieval
Data retrievalData retrieval
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
Arockiyajainmary
 
Theoretical evaluation of shotgun proteomic analysis strategies; Peptide obse...
Theoretical evaluation of shotgun proteomic analysis strategies; Peptide obse...Theoretical evaluation of shotgun proteomic analysis strategies; Peptide obse...
Theoretical evaluation of shotgun proteomic analysis strategies; Peptide obse...
Keiji Takamoto
 
Research presentation-wd
Research presentation-wdResearch presentation-wd
Research presentation-wd
Wagied Davids
 
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptxBTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
ChijiokeNsofor
 
Proteomics: lecture (1) introduction to proteomics
Proteomics: lecture (1) introduction to proteomicsProteomics: lecture (1) introduction to proteomics
Proteomics: lecture (1) introduction to proteomics
Claudine83
 
Gene identification using bioinformatic tools.pptx
Gene identification using bioinformatic tools.pptxGene identification using bioinformatic tools.pptx
Gene identification using bioinformatic tools.pptx
University of Petroleum and Energy studies
 
Presage database
Presage databasePresage database
Presage database
Akshay More
 
www.ijerd.com
www.ijerd.comwww.ijerd.com
www.ijerd.com
IJERD Editor
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
maulikchaudhary8
 
6. protein secondry structure ppt
6. protein secondry structure ppt6. protein secondry structure ppt
6. protein secondry structure ppt
VinaKhan1
 
Genome and Proteome data integration in RDF
Genome and Proteome data integration in RDFGenome and Proteome data integration in RDF
Genome and Proteome data integration in RDF
Nadia Anwar
 

Similar to Introduction to Protein Families and Databases (20)

Protein database
Protein databaseProtein database
Protein database
 
Protein Chemistry-Proteomics-Lec1_Intro.ppt
Protein Chemistry-Proteomics-Lec1_Intro.pptProtein Chemistry-Proteomics-Lec1_Intro.ppt
Protein Chemistry-Proteomics-Lec1_Intro.ppt
 
Lecture__on__Proteomics_Introduction.ppt
Lecture__on__Proteomics_Introduction.pptLecture__on__Proteomics_Introduction.ppt
Lecture__on__Proteomics_Introduction.ppt
 
An Overview to Protein bioinformatics
An Overview to Protein bioinformaticsAn Overview to Protein bioinformatics
An Overview to Protein bioinformatics
 
Characterizing Protein Families of Unknown Function
Characterizing Protein Families of Unknown FunctionCharacterizing Protein Families of Unknown Function
Characterizing Protein Families of Unknown Function
 
1Pfam.pptx
1Pfam.pptx1Pfam.pptx
1Pfam.pptx
 
Bioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sirBioinformatics, application by kk sahu sir
Bioinformatics, application by kk sahu sir
 
NIH-mar2604.rm.ppt
NIH-mar2604.rm.pptNIH-mar2604.rm.ppt
NIH-mar2604.rm.ppt
 
Data retrieval
Data retrievalData retrieval
Data retrieval
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Theoretical evaluation of shotgun proteomic analysis strategies; Peptide obse...
Theoretical evaluation of shotgun proteomic analysis strategies; Peptide obse...Theoretical evaluation of shotgun proteomic analysis strategies; Peptide obse...
Theoretical evaluation of shotgun proteomic analysis strategies; Peptide obse...
 
Research presentation-wd
Research presentation-wdResearch presentation-wd
Research presentation-wd
 
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptxBTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
BTC 506 Gene Identification using Bioinformatic Tools-230302130331.pptx
 
Proteomics: lecture (1) introduction to proteomics
Proteomics: lecture (1) introduction to proteomicsProteomics: lecture (1) introduction to proteomics
Proteomics: lecture (1) introduction to proteomics
 
Gene identification using bioinformatic tools.pptx
Gene identification using bioinformatic tools.pptxGene identification using bioinformatic tools.pptx
Gene identification using bioinformatic tools.pptx
 
Presage database
Presage databasePresage database
Presage database
 
www.ijerd.com
www.ijerd.comwww.ijerd.com
www.ijerd.com
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
6. protein secondry structure ppt
6. protein secondry structure ppt6. protein secondry structure ppt
6. protein secondry structure ppt
 
Genome and Proteome data integration in RDF
Genome and Proteome data integration in RDFGenome and Proteome data integration in RDF
Genome and Proteome data integration in RDF
 

More from Rohit Satyam

Best Practices in Structural Biology
Best Practices in Structural BiologyBest Practices in Structural Biology
Best Practices in Structural Biology
Rohit Satyam
 
Tridax procumbens and its Antidiarrhoeal property
Tridax procumbens and its Antidiarrhoeal propertyTridax procumbens and its Antidiarrhoeal property
Tridax procumbens and its Antidiarrhoeal property
Rohit Satyam
 
Bermuda Triangle and Its associated Secrets
Bermuda Triangle and Its associated SecretsBermuda Triangle and Its associated Secrets
Bermuda Triangle and Its associated Secrets
Rohit Satyam
 
Job interviews and How to get through
Job interviews and How to get throughJob interviews and How to get through
Job interviews and How to get through
Rohit Satyam
 
Immunisation against bacteria
Immunisation against bacteriaImmunisation against bacteria
Immunisation against bacteria
Rohit Satyam
 
Golgi bodies
Golgi bodiesGolgi bodies
Golgi bodies
Rohit Satyam
 
Cell division
Cell divisionCell division
Cell division
Rohit Satyam
 
Renewa ble energy
Renewa ble energyRenewa ble energy
Renewa ble energy
Rohit Satyam
 
Induced Pluripotent Stem Cells, iPSCs
Induced Pluripotent Stem Cells, iPSCsInduced Pluripotent Stem Cells, iPSCs
Induced Pluripotent Stem Cells, iPSCs
Rohit Satyam
 

More from Rohit Satyam (9)

Best Practices in Structural Biology
Best Practices in Structural BiologyBest Practices in Structural Biology
Best Practices in Structural Biology
 
Tridax procumbens and its Antidiarrhoeal property
Tridax procumbens and its Antidiarrhoeal propertyTridax procumbens and its Antidiarrhoeal property
Tridax procumbens and its Antidiarrhoeal property
 
Bermuda Triangle and Its associated Secrets
Bermuda Triangle and Its associated SecretsBermuda Triangle and Its associated Secrets
Bermuda Triangle and Its associated Secrets
 
Job interviews and How to get through
Job interviews and How to get throughJob interviews and How to get through
Job interviews and How to get through
 
Immunisation against bacteria
Immunisation against bacteriaImmunisation against bacteria
Immunisation against bacteria
 
Golgi bodies
Golgi bodiesGolgi bodies
Golgi bodies
 
Cell division
Cell divisionCell division
Cell division
 
Renewa ble energy
Renewa ble energyRenewa ble energy
Renewa ble energy
 
Induced Pluripotent Stem Cells, iPSCs
Induced Pluripotent Stem Cells, iPSCsInduced Pluripotent Stem Cells, iPSCs
Induced Pluripotent Stem Cells, iPSCs
 

Recently uploaded

C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptxC1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
mulvey2
 
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
imrankhan141184
 
Mule event processing models | MuleSoft Mysore Meetup #47
Mule event processing models | MuleSoft Mysore Meetup #47Mule event processing models | MuleSoft Mysore Meetup #47
Mule event processing models | MuleSoft Mysore Meetup #47
MysoreMuleSoftMeetup
 
Gender and Mental Health - Counselling and Family Therapy Applications and In...
Gender and Mental Health - Counselling and Family Therapy Applications and In...Gender and Mental Health - Counselling and Family Therapy Applications and In...
Gender and Mental Health - Counselling and Family Therapy Applications and In...
PsychoTech Services
 
Walmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdfWalmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdf
TechSoup
 
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem studentsRHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
Himanshu Rai
 
Pharmaceutics Pharmaceuticals best of brub
Pharmaceutics Pharmaceuticals best of brubPharmaceutics Pharmaceuticals best of brub
Pharmaceutics Pharmaceuticals best of brub
danielkiash986
 
Standardized tool for Intelligence test.
Standardized tool for Intelligence test.Standardized tool for Intelligence test.
Standardized tool for Intelligence test.
deepaannamalai16
 
RESULTS OF THE EVALUATION QUESTIONNAIRE.pptx
RESULTS OF THE EVALUATION QUESTIONNAIRE.pptxRESULTS OF THE EVALUATION QUESTIONNAIRE.pptx
RESULTS OF THE EVALUATION QUESTIONNAIRE.pptx
zuzanka
 
Nutrition Inc FY 2024, 4 - Hour Training
Nutrition Inc FY 2024, 4 - Hour TrainingNutrition Inc FY 2024, 4 - Hour Training
Nutrition Inc FY 2024, 4 - Hour Training
melliereed
 
math operations ued in python and all used
math operations ued in python and all usedmath operations ued in python and all used
math operations ued in python and all used
ssuser13ffe4
 
Wound healing PPT
Wound healing PPTWound healing PPT
Wound healing PPT
Jyoti Chand
 
Educational Technology in the Health Sciences
Educational Technology in the Health SciencesEducational Technology in the Health Sciences
Educational Technology in the Health Sciences
Iris Thiele Isip-Tan
 
Benner "Expanding Pathways to Publishing Careers"
Benner "Expanding Pathways to Publishing Careers"Benner "Expanding Pathways to Publishing Careers"
Benner "Expanding Pathways to Publishing Careers"
National Information Standards Organization (NISO)
 
مصحف القراءات العشر أعد أحرف الخلاف سمير بسيوني.pdf
مصحف القراءات العشر   أعد أحرف الخلاف سمير بسيوني.pdfمصحف القراءات العشر   أعد أحرف الخلاف سمير بسيوني.pdf
مصحف القراءات العشر أعد أحرف الخلاف سمير بسيوني.pdf
سمير بسيوني
 
Leveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit InnovationLeveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit Innovation
TechSoup
 
Juneteenth Freedom Day 2024 David Douglas School District
Juneteenth Freedom Day 2024 David Douglas School DistrictJuneteenth Freedom Day 2024 David Douglas School District
Juneteenth Freedom Day 2024 David Douglas School District
David Douglas School District
 
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptxNEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
iammrhaywood
 
How Barcodes Can Be Leveraged Within Odoo 17
How Barcodes Can Be Leveraged Within Odoo 17How Barcodes Can Be Leveraged Within Odoo 17
How Barcodes Can Be Leveraged Within Odoo 17
Celine George
 
Bonku-Babus-Friend by Sathyajith Ray (9)
Bonku-Babus-Friend by Sathyajith Ray  (9)Bonku-Babus-Friend by Sathyajith Ray  (9)
Bonku-Babus-Friend by Sathyajith Ray (9)
nitinpv4ai
 

Recently uploaded (20)

C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptxC1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
C1 Rubenstein AP HuG xxxxxxxxxxxxxx.pptx
 
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
 
Mule event processing models | MuleSoft Mysore Meetup #47
Mule event processing models | MuleSoft Mysore Meetup #47Mule event processing models | MuleSoft Mysore Meetup #47
Mule event processing models | MuleSoft Mysore Meetup #47
 
Gender and Mental Health - Counselling and Family Therapy Applications and In...
Gender and Mental Health - Counselling and Family Therapy Applications and In...Gender and Mental Health - Counselling and Family Therapy Applications and In...
Gender and Mental Health - Counselling and Family Therapy Applications and In...
 
Walmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdfWalmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdf
 
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem studentsRHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
RHEOLOGY Physical pharmaceutics-II notes for B.pharm 4th sem students
 
Pharmaceutics Pharmaceuticals best of brub
Pharmaceutics Pharmaceuticals best of brubPharmaceutics Pharmaceuticals best of brub
Pharmaceutics Pharmaceuticals best of brub
 
Standardized tool for Intelligence test.
Standardized tool for Intelligence test.Standardized tool for Intelligence test.
Standardized tool for Intelligence test.
 
RESULTS OF THE EVALUATION QUESTIONNAIRE.pptx
RESULTS OF THE EVALUATION QUESTIONNAIRE.pptxRESULTS OF THE EVALUATION QUESTIONNAIRE.pptx
RESULTS OF THE EVALUATION QUESTIONNAIRE.pptx
 
Nutrition Inc FY 2024, 4 - Hour Training
Nutrition Inc FY 2024, 4 - Hour TrainingNutrition Inc FY 2024, 4 - Hour Training
Nutrition Inc FY 2024, 4 - Hour Training
 
math operations ued in python and all used
math operations ued in python and all usedmath operations ued in python and all used
math operations ued in python and all used
 
Wound healing PPT
Wound healing PPTWound healing PPT
Wound healing PPT
 
Educational Technology in the Health Sciences
Educational Technology in the Health SciencesEducational Technology in the Health Sciences
Educational Technology in the Health Sciences
 
Benner "Expanding Pathways to Publishing Careers"
Benner "Expanding Pathways to Publishing Careers"Benner "Expanding Pathways to Publishing Careers"
Benner "Expanding Pathways to Publishing Careers"
 
مصحف القراءات العشر أعد أحرف الخلاف سمير بسيوني.pdf
مصحف القراءات العشر   أعد أحرف الخلاف سمير بسيوني.pdfمصحف القراءات العشر   أعد أحرف الخلاف سمير بسيوني.pdf
مصحف القراءات العشر أعد أحرف الخلاف سمير بسيوني.pdf
 
Leveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit InnovationLeveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit Innovation
 
Juneteenth Freedom Day 2024 David Douglas School District
Juneteenth Freedom Day 2024 David Douglas School DistrictJuneteenth Freedom Day 2024 David Douglas School District
Juneteenth Freedom Day 2024 David Douglas School District
 
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptxNEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
 
How Barcodes Can Be Leveraged Within Odoo 17
How Barcodes Can Be Leveraged Within Odoo 17How Barcodes Can Be Leveraged Within Odoo 17
How Barcodes Can Be Leveraged Within Odoo 17
 
Bonku-Babus-Friend by Sathyajith Ray (9)
Bonku-Babus-Friend by Sathyajith Ray  (9)Bonku-Babus-Friend by Sathyajith Ray  (9)
Bonku-Babus-Friend by Sathyajith Ray (9)
 

Introduction to Protein Families and Databases

  • 1. An introduction to Protein Families and databases Jamia Millia Islamia
  • 2. Date 2 Pitching in Protein Families and the need for classification Domains & Motifs with GPCRs as example Vrinda Sharma Groundwork Sequence Features Protein Signatures Patterns & Profiles HMMs Wanchha Maurya Showstopper DUFs- a story worth reciting Databases of Protein Families Demistifying the Hypotheticals Rohit Satyam
  • 3. Need for classification Date 3 Proteins can be classified into groups based on sequence or structural similarity. These groups often contain well characterised proteins whose function is known. Thus, when a novel protein is identified, its functional properties can be proposed based on the group to which it is predicted to belong. Source: EMBL-EBI Training Course: https://www.ebi.ac.uk/training- beta/online/courses/protein-classification-intro-ebi- resources/protein-classification/what-are-protein- families/
  • 4. Protein Families in Brief Date Your Footer Here Group of Proteins which • Shares a common evolutionary origin • Performs related functions • Similar in sequence or structure. Superfamily Family A Subfamily A1 Subfamily A2 Family B Family C Subfamily C1 Subfamily C2 Subfamily C3
  • 5. Domain and Motifs aren’t synonyms Date Domains are distinct functional and/or structural units in a protein. They are responsible for a particular function or interaction, contributing to the overall role of a protein. Motifs are secondary structure that are formed due to interaction between alpha-helices and beta-sheets. Structure of the SH3 domain Domain composition of Nck. Nck contains three SH3 domains plus another domain known as SH2
  • 6. G-Protein Coupled receptors An example to understand Protein Families
  • 7. G-Protein Signaling Date Your Footer Here • Regulator of GPS domains are protein structural units that activate GTPase. • sequences belonging to RGS protein family(multifunctional GTPase accelerating protein). • All RGS protein family member contains RGS domain ,some (RGS1) consist little more than domain . • RGS3 and RGS6 contain additional domains for other functions .
  • 8. They have seven transmembrane domains, and interact with specialized proteins (called G proteins) to influence intracellular pathways after binding extracellular signals G-protein-coupled receptors and cancer Dorsam et al 2007
  • 9. Date Your Footer Here 9 Level2 Level 1 Sub-family Superfamily GPCRs Rhodopsin -like GPCRs Opsins Red- sensitive opsins Green- sensitive opsins Blue- sensitive opsins APJ receptors Relaxin Receptors cAMP Receptors Secretin like- GPCRs Etc… The GPCR superfamily hierarchy. Families and subfamilies to which the short-wave-sensitive opsin 1 protein belongs are highlighted in violet. GPCRs Regulates: Biological processes, including photoreception, regulation of the immune system, and nervous system transmission. Similarity increases
  • 10. Date 10 What Are Sequence Features? 1.Active Site 2.Binding Site 3. Post Translational Modifications (PTMs) 4. Repeats Group of amino acid that confer certain characteristics upon a protein ,and maybe important for overall function
  • 11. Date 11 Protein Signatures • To classify protein’s family and to predict the domains or sequence features we use computational tools and that tools are the predictive models known as protein signatures. • Model refines distantly related sequences in database are identified. • Once the model is mature, signature is ready for protein sequence analysis. The Purpose and the Process
  • 12. Date 12 How do Protein Signature compare to other ways of classifying proteins? • Multiple sequence alignment gives us information about classification which we use to identify amino acid residues that are conserved in distantly proteins. • Protein signature built from multiple sequence alignment are usually better at detecting divergent homologues than pairwise comparison method. Identifying the conserved residues
  • 13. Date 13 Signature types Patterns Profiles Fingerprints Hidden Markov Models (HMMs) Approaches to generate signatures
  • 14. Patterns & Profiles Date 14 Signature Types Patterns can recognize sequence features such as binding sites or active sites of enzymes consist of a only few amino acids. Ex: PROSITE database. 1 2 Profiles are built by converting multiple sequence alignment into position specific scoring system (PMMs). Ex: CDD, HAMAP, PROSITE and PRODOM.
  • 15. Fingerprints and HMMs Date 15 Signature Types 3 4 Fingerprints are composed of multiple short conserved motifs which are drawn from sequence alignment. They can distinguish individual subfamilies within protein families. Ex : PRINTS database. Hidden Markov models (HMMs) are used to convert multiple sequence alignment into position specific scoring system. Ex: Pfam, SMART, TIGRFAM, PANTHER, SFLD, Superfamily and Gene 3D.
  • 16. Date 16 Families in search of function Domains of unknown function (DUFs) Popovic et al., 2017.,Scientific reports, The function of the Domain is yet to be discovered The DUF naming scheme was introduced by Chris Ponting through the addition of DUF1 and DUF2 to the SMART database Goodacre et al 2014.
  • 17.
  • 18. Databases at Glance Date 18 Databases of Protein Families 5. PRINTS Combine Multidomain/motif information for family categorization. MSA and Fuzzy Logic (Regex) 6. MobiDB Homology, Predicted, Curated Intrinsically Disordered regions database 7. TIGRFAM MSA, HMM mainly for prokaryotic proteins 8. SUPERFAMILY2 Using HMM and protein Sequences Domain organisation, sequence alignments and protein sequence details can be obtained for query sequence 4. PRIDE Mass-Spec based identification Provide PTM information and Literature Evidences 3. Prosite MSA of homologous Proteins;Based on Prorules 2. PIRSF MSA and Clustering with hight similarity thresholds 1. Pfam Protein Family, Domains, Motifs and Repeats (Generated from MSA and HMMs) 1 3 5 7 2 4 8 6
  • 19. Date 19 Interpro-A Protein Family Compendium
  • 20. Date 20 GOFeat Tutorial Lorem ipsum dolor sit amet, consectetur adipiscing elit. Protein Under Investigation: LOC645967
  • 21. Date 21 InterPro Tutorial Protein Under Investigation: LOC645967
  • 22. Date 22 References • Dorsam, R.T. and Gutkind, J.S., 2007. G-protein-coupled receptors and cancer. Nature reviews cancer, 7(2), pp.79-94. • Bateman, Alex, Penny Coggill, and Robert D. Finn. "DUFs: families in search of function." Acta Crystallographica Section F: Structural Biology and Crystallization Communications 66, no. 10 (2010): 1148-1152. • Goodacre, Norman F., Dietlind L. Gerloff, and Peter Uetz. "Protein domains of unknown function are essential in bacteria." MBio 5, no. 1 (2014). • EMBL-EBI Training Course: https://www.ebi.ac.uk/training-beta/online/courses/protein- classification-intro-ebi-resources/protein-classification/what-are-protein-families/
  • 23. Date 23 Thanks Drop in @RohitSatyam1 +91 9870953351 Jamia Millia Islamia University