CHEMOINFORMATICS
Presented by
Miss. Rupali D. Salunkhe
PG Department of Bioinformatics,
WCBT, Solapur
CONTENTS
 Introduction
 Representation of molecules
 File formats
 Databases
 Applications
 References
INTRODUCTION
 Chemoinformatics combines chemistry and information
technology to process and analyze chemical data.
 It covers the design, creation, organization, management,
analysis, visualization and use of chemical information.
 It is the application of informatics methods to solve
chemical problems.
 It is relevant to biologists because chemical data are
important in many areas of molecular biology, e.g., in the
study of protein interactions.
 The term chemoinformatics was defined by F.K.
Brown in 1998.
 It combines the fields of chemistry, computer science
and information science.
 It draws on areas such as topology, chemical graph
theory, information retrieval and data mining in
chemical space.
 It can also be applied to data analysis in industries
such as paper, pulp and dyes.
REPRESENTATION OF MOLECULES
 1D – computed/experimental global properties
 2D – the chemical structure diagram
 3D – atomic coordinate data
FILE FORMATS
 Mol format
 SDF format
 SMILES format
MOL FORMAT
 An MDL Molfile is a file format for holding information
about the atoms, bonds, connectivity and coordinates of
a molecule.
 The format was created by MDL Information Systems
(MDL) [Molecular Design Limited, Inc.].
 It is also supported by some computational software
such as Mathematica.
 The molfile consists of header information followed by
the Connection Table (CT), which lists the atom
information and then the bond connections and types.
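The layout described above can be sketched in a few lines of Python. This is a minimal illustration, assuming the common fixed-width V2000 variant of the molfile; the sample molecule, its coordinates and the function name `parse_molfile` are made up for the example, and only the counts line, atom block and bond block are read.

```python
# Minimal V2000 molfile reader (illustrative sketch, not a full parser).

# Hypothetical sample: the heavy atoms of ethanol with made-up coordinates.
SAMPLE_MOLFILE = "\n".join([
    "ethanol",                                  # header line 1: molecule name
    "  example-program",                        # header line 2: program info (free text here)
    "",                                         # header line 3: comment
    "  3  2  0  0  0  0  0  0  0  0999 V2000",  # counts line: 3 atoms, 2 bonds
    "    0.0000    0.0000    0.0000 C   0  0  0  0  0  0  0  0  0  0  0  0",
    "    1.5000    0.0000    0.0000 C   0  0  0  0  0  0  0  0  0  0  0  0",
    "    2.2500    1.3000    0.0000 O   0  0  0  0  0  0  0  0  0  0  0  0",
    "  1  2  1  0",                             # bond: atom 1 - atom 2, single
    "  2  3  1  0",                             # bond: atom 2 - atom 3, single
    "M  END",
])


def parse_molfile(text):
    """Return (name, atoms, bonds) from a V2000 molfile string."""
    lines = text.splitlines()
    name = lines[0].strip()

    # The counts line is fixed-width: columns 0-2 atoms, columns 3-5 bonds.
    counts = lines[3]
    n_atoms = int(counts[0:3])
    n_bonds = int(counts[3:6])

    atoms = []
    for line in lines[4:4 + n_atoms]:
        # x, y, z occupy 10 columns each; the element symbol follows.
        x, y, z = float(line[0:10]), float(line[10:20]), float(line[20:30])
        symbol = line[31:34].strip()
        atoms.append((symbol, x, y, z))

    bonds = []
    for line in lines[4 + n_atoms:4 + n_atoms + n_bonds]:
        # first atom index, second atom index, bond type (1 = single, 2 = double, ...)
        bonds.append((int(line[0:3]), int(line[3:6]), int(line[6:9])))

    return name, atoms, bonds


name, atoms, bonds = parse_molfile(SAMPLE_MOLFILE)
print(name, [a[0] for a in atoms], bonds)
```

Fixed-width slicing is used instead of `split()` because adjacent count fields can run together once a molecule has 100 or more atoms.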
SDF FORMAT
 SDF is one of a family of chemical-data file formats
developed by MDL.
 SDF stands for structure-data file.
 SDF files actually wrap the molfile (MDL Molfile) format.
 Multiple compounds are delimited by lines consisting of
four dollar signs ($$$$).
 A feature of the SDF format is its ability to include
associated data.
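As a rough sketch of how the `$$$$` delimiter and the associated data work together, the Python below splits an SDF string into records and collects the data items of each record. It assumes the usual `> <FIELD>` header convention for data items; the sample records, their values and the function names are invented for illustration.

```python
# Split an SDF string into records and read the data items of each record.
# Illustrative sketch only; real files have more edge cases.

def _sample_record(name, symbol, mw):
    """Build one hypothetical single-atom SDF record as a list of lines."""
    atom_line = f"    0.0000    0.0000    0.0000 {symbol:<3} 0" + "  0" * 11
    return [
        name, "  example-program", "",
        "  1  0  0  0  0  0  0  0  0  0999 V2000",
        atom_line,
        "M  END",
        "> <NAME>", name, "",
        "> <MW>", mw, "",
        "$$$$",
    ]


SAMPLE_SDF = "\n".join(
    _sample_record("water", "O", "18.015")
    + _sample_record("methane", "C", "16.043")
)


def _parse_record(lines):
    """Return (molblock, data) for the lines of one record."""
    end = next(i for i, line in enumerate(lines) if line.startswith("M  END"))
    molblock = "\n".join(lines[:end + 1])

    data = {}
    key = None
    for line in lines[end + 1:]:
        if line.startswith(">"):
            # Data header such as "> <NAME>": take the text inside <...>.
            key = line[line.index("<") + 1:line.rindex(">")]
            data[key] = []
        elif not line.strip():
            key = None                      # a blank line ends the current item
        elif key is not None:
            data[key].append(line)
    return molblock, {k: "\n".join(v) for k, v in data.items()}


def read_sdf(text):
    """Return a list of (molblock, data) tuples, one per compound."""
    records, current = [], []
    for line in text.splitlines():
        if line.strip() == "$$$$":          # record delimiter
            records.append(_parse_record(current))
            current = []
        else:
            current.append(line)
    return records


for molblock, data in read_sdf(SAMPLE_SDF):
    print(molblock.splitlines()[0], data)
```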
SMILES FORMAT
 The Simplified Molecular Input Line Entry Specification
(SMILES) is a line notation for molecules.
 SMILES strings include connectivity but do not include 2D
or 3D coordinates.
 Hydrogen atoms are usually implicit and are not written out.
 Other atoms are represented by their element symbols,
e.g. C, N, O, S, Cl, Br and I.
 The symbol "=" represents a double bond and "#" a triple
bond; branching is indicated by parentheses () and ring
closures by matching pairs of digits.
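The notation rules above can be illustrated with a small tokenizer. This is a sketch that covers only a subset of SMILES (bracket atoms, the common organic element symbols, aromatic lowercase atoms, bond symbols, branches and ring-closure digits); the function names are hypothetical and no chemical validation is attempted.

```python
import re

# One token per atom, bond symbol, branch parenthesis or ring-closure label.
TOKEN = re.compile(r"\[[^\]]+\]|Br|Cl|[BCNOPSFI]|[bcnops]|%\d{2}|\d|[=#\-:/\\.()]")
ATOM = re.compile(r"\[[^\]]+\]|Br|Cl|[BCNOPSFI]|[bcnops]")


def tokenize(smiles):
    """Split a SMILES string into tokens; raise if any character is unrecognized."""
    tokens = TOKEN.findall(smiles)
    if "".join(tokens) != smiles:
        raise ValueError(f"unsupported SMILES for this sketch: {smiles!r}")
    return tokens


def summarize(smiles):
    """Count the features named in the rules above."""
    tokens = tokenize(smiles)
    ring_labels = sum(1 for t in tokens if t.isdigit() or t.startswith("%"))
    return {
        "atoms": sum(1 for t in tokens if ATOM.fullmatch(t)),
        "double_bonds": tokens.count("="),
        "triple_bonds": tokens.count("#"),
        "branches": tokens.count("("),
        "ring_closures": ring_labels // 2,   # each ring uses a pair of labels
    }


print(tokenize("CC(=O)O"))      # acetic acid
print(summarize("CC(=O)O"))
print(summarize("c1ccccc1"))    # benzene: one ring, closed by the pair of 1s
```

Listing `Cl` and `Br` before the single-letter symbols keeps `Cl` from being read as carbon followed by an unknown character.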
DATABASES
 PubChem
 ChemBank
 ChEMBL
 DrugBank
PUBCHEM DATABASE
 https://pubchem.ncbi.nlm.nih.gov/
 PubChem is a database of chemical molecules and their
activities against biological assays.
 It is maintained by the NCBI.
 It contains substance descriptions and small molecules
with fewer than 1000 atoms and 1000 bonds.
 It can be searched by chemical structure, name
fragments, chemical formula and molecular weight.
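Besides the web search form, PubChem can be queried programmatically through its PUG REST interface, which the slides do not cover. The sketch below only builds a lookup URL for compound properties by name or compound ID (CID); the helper name `compound_property_url` is hypothetical, and no request is sent.

```python
from urllib.parse import quote

# Base address of the PubChem PUG REST service.
PUG_REST = "https://pubchem.ncbi.nlm.nih.gov/rest/pug"


def compound_property_url(namespace, identifier, properties, fmt="JSON"):
    """Build a PUG REST URL that requests properties for one compound.

    namespace  -- "name" or "cid" (the only lookups handled in this sketch)
    identifier -- a compound name such as "aspirin" or a CID such as 2244
    properties -- property names, e.g. ["MolecularFormula", "MolecularWeight"]
    """
    if namespace not in ("name", "cid"):
        raise ValueError("this sketch only supports 'name' and 'cid' lookups")
    ident = quote(str(identifier), safe="")   # escape spaces and other characters
    props = ",".join(properties)
    return f"{PUG_REST}/compound/{namespace}/{ident}/property/{props}/{fmt}"


url = compound_property_url("name", "aspirin", ["MolecularFormula", "MolecularWeight"])
print(url)
```

The resulting URL can be opened in a browser or fetched with `urllib.request.urlopen`; the call is left out here so the example runs without network access.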
HOME PAGE OF PUBCHEM DATABASE
CHEMBANK DATABASE
 http://chembank.broadinstitute.org/
 It is a public, web-based informatics environment.
 It was created by the Broad Institute’s Chemical Biology
Program.
 It stores information on small molecules and biomedically
relevant assays.
 It is intended to guide chemists synthesizing novel
compounds or libraries and to assist biologists searching
for small molecules.
HOME PAGE OF CHEMBANK DATABASE
CHEMBL DATABASE
 https://www.ebi.ac.uk/chembl/
 ChEMBLdb is a manually curated chemical
database of bioactive molecules.
 It is maintained by the European Bioinformatics
Institute (EBI), part of the EMBL.
 The ChEMBL database contains compound bioactivity
data against drug targets.
 The version cited here is release 2 (ChEMBL_02).
HOME PAGE OF CHEMBL DATABASE
DRUGBANK DATABASE
 https://www.drugbank.ca/
 The DrugBank database is a comprehensive, freely
accessible, online database containing information on
drugs and drug targets.
 It is widely used by the drug industry, medicinal
chemists, pharmacists, physicians and students.
 The latest release cited here (version 5.0.11) contains
11,002 drug entries.
HOME PAGE OF DRUGBANK DATABASE
APPLICATIONS
 Storage and retrieval – The primary application is the
storage, indexing and search of information relating to
compounds.
 Virtual libraries – Chemical data can relate to real or
virtual molecules. Virtual libraries cover classes of
compounds such as drugs, natural products and
diversity-oriented synthetic products.
 Virtual screening – Computationally screening in silico
libraries of compounds.
 Quantitative structure-activity relationship (QSAR) – Used
to predict the activity of compounds from their structures.
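One common way to screen a virtual library, not described in the slides, is to rank compounds by their similarity to a known active using the Tanimoto coefficient on molecular fingerprints. The sketch below represents fingerprints as plain Python sets of feature identifiers; the library contents, feature numbers and the threshold are invented for illustration.

```python
# Similarity-based virtual screening with the Tanimoto coefficient.
# Fingerprints are modelled as sets of "on" feature identifiers.

def tanimoto(fp_a, fp_b):
    """Tanimoto coefficient: shared features divided by all features present."""
    union = len(fp_a | fp_b)
    if union == 0:
        return 0.0              # two empty fingerprints: define similarity as 0
    return len(fp_a & fp_b) / union


def screen(query_fp, library, threshold=0.5):
    """Return (name, score) pairs at or above the threshold, best first."""
    hits = [(name, tanimoto(query_fp, fp)) for name, fp in library.items()]
    hits = [hit for hit in hits if hit[1] >= threshold]
    return sorted(hits, key=lambda hit: hit[1], reverse=True)


# Hypothetical library: compound name -> set of fingerprint features.
LIBRARY = {
    "mol_a": {1, 2, 3, 4},
    "mol_b": {1, 2, 5},
    "mol_c": {7, 8},
}
QUERY = {1, 2, 3}               # fingerprint of a hypothetical known active

for name, score in screen(QUERY, LIBRARY):
    print(f"{name}: {score:.2f}")
```

In practice the fingerprints would come from a cheminformatics toolkit and the threshold would be chosen for the fingerprint type; the ranking logic stays the same.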
REFERENCES
 A. R. Leach and V. J. Gillet, An Introduction to
Chemoinformatics.
 J. Gasteiger and T. Engel (Eds.), Chemoinformatics:
A Textbook.
 A. R. Leach, Molecular Modelling: Principles and
Applications.
 J. Gasteiger (Ed.), Handbook of Chemoinformatics:
From Data to Knowledge, Wiley-VCH, Weinheim.
THANK YOU
