SlideShare a Scribd company logo
1 of 12
Interview with
Carol Scott
PhD, Bioengineering
Bioinformatics Scientist and Curator
Conserved Domain Database
A project of the U.S. National Library of Medicine at the
National Institutes of Health,
National Center for Biotechnology Information
Katie Rapp
LBSC 690
March 1, 2011
Protein is Everything!
 Every living thing is made up of unique, identifiable proteins
 Examples: human hemoglobin, insulin, proteins in
fungus, bacteria, plants
 Proteins are made of different combinations of amino acids
 20 naturally-occurring amino acids; they are like beads in a necklace
and their order determines the type of protein
 Proteins do the work inside cells
 Examples: Hemoglobin carries oxygen in the blood, insulin regulates
glucose metabolism
Problems with Proteins
 Proteins do the work inside cells, so when there are
problems, such as diseases, they are often caused by a
defective protein
 Example: Sickle Cell Anemia (one change in one amino acid in
hemoglobin and you go from healthy to ill)
 Medical researchers study proteins at the molecular level in
order to find cures to diseases
Conserved Domains –
Motivation behind the
database
 The amino acid chains that make up proteins are coiled and
folded. Repeated blocks of coiled and folded amino acids are
referred to as “conserved domains.”
 Conserved domains have specific functions and 3-
dimensional shapes
 It is useful for researchers to be able to compare related
conserved domains in different proteins, but there was no
real way to do this in the past
Conserved Domain Database -
Development
 This database was developed to meet the needs of
researchers
 Project begun in 2001; Carol Scott has worked on it since
2002
 Worked with software developers to produce highly-
interactive database
Conserved Domain Database
Curators
 Carol Scott and other curators create the data in the
database from lists of amino acid sequences found in other
databases
 They take amino acid sequences from millions of proteins
and link them based on structural and functional similarities
 They work with programmers to create the interface and
visual output of the database
 Curators also find and provide links to information about each
protein, journal articles and other resources, related proteins
Conserved Domain Database -
Challenges
 Not all amino acid sequence information is reliable – curators
must pick and choose where they get the basic data to put
into their database
 The process of creating the comparisons in the CDD is very
complex and time-consuming
 Software exists to help find these comparisons, but much
work must be done manually based on knowledge of the
chemical attributes of the amino acids
 The project is currently facing budgetary cutbacks which
affect staffing and perhaps the future of the database
Conserved Domain Database
Results
 Enables scientists to search on specific amino acid chains of
interest to them
 Genetic studies, mutation studies, studying size, shape and
function of proteins
 They can find and compare similar chemical alignments in
different proteins
 These alignments can provide insight into the functions of
different parts of protein molecules
Conserved Domain Database
Output – 3-Dimensional Structures
Conserved Domain Database
Output - Superfamilies
Conserved Domain Database
Users – Who Are They?
 The database is freely accessible to anyone over the internet
 It is used frequently by researchers around the world
 Users include anyone studying proteins – everyone from high
school and college students up to very high level researchers
at NIH, pharmaceutical companies, genetic researchers,
bioengineering firms, etc.
 Can be used to spur further research into areas where
defects in proteins could be repaired using genetic
engineering
Conserved Domain Database
 Questions?

More Related Content

Similar to Interview with NCBI Staff Scientist Carol Scott

Biochemistry-Student-Copy.pptx
Biochemistry-Student-Copy.pptxBiochemistry-Student-Copy.pptx
Biochemistry-Student-Copy.pptxEllahdulpina
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introductionDrGopaSarma
 
Protein motif analysis and optimization using neural algorithms
Protein motif analysis and optimization using neural algorithmsProtein motif analysis and optimization using neural algorithms
Protein motif analysis and optimization using neural algorithmsSamvo Chowdhury
 
The Importance of an Amino Acid Library Iroa
The Importance of an Amino Acid Library IroaThe Importance of an Amino Acid Library Iroa
The Importance of an Amino Acid Library IroaIROA Technologies
 
Bioinformatics biological databases
Bioinformatics biological databasesBioinformatics biological databases
Bioinformatics biological databasesSangeeta Das
 
Types of biological databases-protein database
Types of biological databases-protein databaseTypes of biological databases-protein database
Types of biological databases-protein databasechinmayeec
 
Presentation.pptx
Presentation.pptxPresentation.pptx
Presentation.pptxAshuAsh15
 
SooryaKiran Bioinformatics
SooryaKiran BioinformaticsSooryaKiran Bioinformatics
SooryaKiran Bioinformaticscontactsoorya
 
Intro to in silico drug discovery 2014
Intro to in silico drug discovery 2014Intro to in silico drug discovery 2014
Intro to in silico drug discovery 2014Lee Larcombe
 
57.insilico studies of cellulase from Aspergillus terreus
57.insilico studies of cellulase from Aspergillus terreus57.insilico studies of cellulase from Aspergillus terreus
57.insilico studies of cellulase from Aspergillus terreusAnnadurai B
 
biological databases.pptx
biological databases.pptxbiological databases.pptx
biological databases.pptxscience lover
 

Similar to Interview with NCBI Staff Scientist Carol Scott (20)

Intro bioinfo
Intro bioinfoIntro bioinfo
Intro bioinfo
 
Intro bioinfo
Intro bioinfoIntro bioinfo
Intro bioinfo
 
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Biochemistry-Student-Copy.pptx
Biochemistry-Student-Copy.pptxBiochemistry-Student-Copy.pptx
Biochemistry-Student-Copy.pptx
 
Databases
DatabasesDatabases
Databases
 
Bioinformatics introduction
Bioinformatics introductionBioinformatics introduction
Bioinformatics introduction
 
Introduction to databases.pptx
Introduction to databases.pptxIntroduction to databases.pptx
Introduction to databases.pptx
 
Improving online chemistry one structure at a time
Improving online chemistry one structure at a timeImproving online chemistry one structure at a time
Improving online chemistry one structure at a time
 
Protein motif analysis and optimization using neural algorithms
Protein motif analysis and optimization using neural algorithmsProtein motif analysis and optimization using neural algorithms
Protein motif analysis and optimization using neural algorithms
 
The Importance of an Amino Acid Library Iroa
The Importance of an Amino Acid Library IroaThe Importance of an Amino Acid Library Iroa
The Importance of an Amino Acid Library Iroa
 
Chibucos annot go_final
Chibucos annot go_finalChibucos annot go_final
Chibucos annot go_final
 
Bioinformatics biological databases
Bioinformatics biological databasesBioinformatics biological databases
Bioinformatics biological databases
 
Types of biological databases-protein database
Types of biological databases-protein databaseTypes of biological databases-protein database
Types of biological databases-protein database
 
Presentation.pptx
Presentation.pptxPresentation.pptx
Presentation.pptx
 
ChemSpider – A Platform to Gather, Host and Integrate Structure Based Data Ac...
ChemSpider – A Platform to Gather, Host and Integrate Structure Based Data Ac...ChemSpider – A Platform to Gather, Host and Integrate Structure Based Data Ac...
ChemSpider – A Platform to Gather, Host and Integrate Structure Based Data Ac...
 
SooryaKiran Bioinformatics
SooryaKiran BioinformaticsSooryaKiran Bioinformatics
SooryaKiran Bioinformatics
 
Intro to in silico drug discovery 2014
Intro to in silico drug discovery 2014Intro to in silico drug discovery 2014
Intro to in silico drug discovery 2014
 
57.insilico studies of cellulase from Aspergillus terreus
57.insilico studies of cellulase from Aspergillus terreus57.insilico studies of cellulase from Aspergillus terreus
57.insilico studies of cellulase from Aspergillus terreus
 
biological databases.pptx
biological databases.pptxbiological databases.pptx
biological databases.pptx
 

Recently uploaded

Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 

Recently uploaded (20)

Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 

Interview with NCBI Staff Scientist Carol Scott

  • 1. Interview with Carol Scott PhD, Bioengineering Bioinformatics Scientist and Curator Conserved Domain Database A project of the U.S. National Library of Medicine at the National Institutes of Health, National Center for Biotechnology Information Katie Rapp LBSC 690 March 1, 2011
  • 2. Protein is Everything!  Every living thing is made up of unique, identifiable proteins  Examples: human hemoglobin, insulin, proteins in fungus, bacteria, plants  Proteins are made of different combinations of amino acids  20 naturally-occurring amino acids; they are like beads in a necklace and their order determines the type of protein  Proteins do the work inside cells  Examples: Hemoglobin carries oxygen in the blood, insulin regulates glucose metabolism
  • 3. Problems with Proteins  Proteins do the work inside cells, so when there are problems, such as diseases, they are often caused by a defective protein  Example: Sickle Cell Anemia (one change in one amino acid in hemoglobin and you go from healthy to ill)  Medical researchers study proteins at the molecular level in order to find cures to diseases
  • 4. Conserved Domains – Motivation behind the database  The amino acid chains that make up proteins are coiled and folded. Repeated blocks of coiled and folded amino acids are referred to as “conserved domains.”  Conserved domains have specific functions and 3- dimensional shapes  It is useful for researchers to be able to compare related conserved domains in different proteins, but there was no real way to do this in the past
  • 5. Conserved Domain Database - Development  This database was developed to meet the needs of researchers  Project begun in 2001; Carol Scott has worked on it since 2002  Worked with software developers to produce highly- interactive database
  • 6. Conserved Domain Database Curators  Carol Scott and other curators create the data in the database from lists of amino acid sequences found in other databases  They take amino acid sequences from millions of proteins and link them based on structural and functional similarities  They work with programmers to create the interface and visual output of the database  Curators also find and provide links to information about each protein, journal articles and other resources, related proteins
  • 7. Conserved Domain Database - Challenges  Not all amino acid sequence information is reliable – curators must pick and choose where they get the basic data to put into their database  The process of creating the comparisons in the CDD is very complex and time-consuming  Software exists to help find these comparisons, but much work must be done manually based on knowledge of the chemical attributes of the amino acids  The project is currently facing budgetary cutbacks which affect staffing and perhaps the future of the database
  • 8. Conserved Domain Database Results  Enables scientists to search on specific amino acid chains of interest to them  Genetic studies, mutation studies, studying size, shape and function of proteins  They can find and compare similar chemical alignments in different proteins  These alignments can provide insight into the functions of different parts of protein molecules
  • 9. Conserved Domain Database Output – 3-Dimensional Structures
  • 11. Conserved Domain Database Users – Who Are They?  The database is freely accessible to anyone over the internet  It is used frequently by researchers around the world  Users include anyone studying proteins – everyone from high school and college students up to very high level researchers at NIH, pharmaceutical companies, genetic researchers, bioengineering firms, etc.  Can be used to spur further research into areas where defects in proteins could be repaired using genetic engineering