SlideShare a Scribd company logo
CHEMOINFORMATICS
Presented by
Miss. Rupali D. Salunkhe
PG Department of Bioinformatics,
WCBT, Solapur
CONTENTS
 Introduction
 Representation of molecules
 File formats
 Databases
 Applications
 References
INTRODUCTION
 Chemoinformatics is the combination of chemistry and
information technology, is required for the processing
and analysis of chemical data.
 Design, creation, organization, management, analysis,
visualization and use of chemical information.
 It is the application of informatics methods to solve
chemical problems.
 It is relevant to biologists because chemistry data are
important in many areas of molecular biology, e.g, in the
study of protein interactions.
 The term chemoinformatics was defined by F.K.
Brown in 1998.
 It combines the scientific working fields
of chemistry, computer science and information
science.
 For example in the areas of topology, chemical
graph theory, information retrieval and data
mining in the chemical space.
 It can also be applied to data analysis for various
industries like paper , pulp and dyes industries.
REPRESENTATION OF MOLECULES
 1D– computed/experimental global properties
 2D – the chemical structure diagram
 3D – atomic coordinate data
FILE FORMATS
 Mol format
 SDF format
 SMILES format
MOL FORMAT
 An MDL Molfile is a file format for holding information
about the atoms, bonds, connectivity and coordinates of
a molecule.
 The format was created by MDL Information Systems
(MDL) [Molecular Design Limited, Inc.].
 It is also supported by some computational software
such as Mathematica.
 The molfile consists of - Header information ,the
Connection Table (CT) containing atom information,
then bond connections and types.
SDF FORMAT
 SDF is one of a family of chemical-data file formats
developed by MDL.
 SDF stands for structure-data file.
 SDF files actually wrap the molfile (MDL Molfile) format.
 Multiple compounds are delimited by lines consisting of
four dollar signs ($$$$).
 A feature of the SDF format is its ability to include
associated data.
SMILES FORMAT
 The Simplified Molecular Input Line Entry Specification
(SMILES) is a line notation for molecules.
 SMILES strings include connectivity but do not include 2D
or 3D coordinates.
 Hydrogen atoms are not represented.
 Other atoms are represented by their element symbols C,
N,O, S, Cl, Br, and I.
 The symbol "=" represents double bonds and "#"
represents triple bonds, branching is indicated by () and rings
are indicated by pairs of digits.
DATABASES
 PubChem
 ChemBank
 ChEMBL
 DrugBank
PUBCHEM DATABASE
 https://pubchem.ncbi.nlm.nih.gov/
 PubChem is a database of chemical molecules and their
activities against biological assays.
 Is maintained by the NCBI.
 It contains substance descriptions and small molecules
with more than 1000 atoms and 1000 bonds.
 Searching - chemical structure, name
fragments, chemical formula, molecular weight.
HOME PAGE OF PUBCHEM DATABASE
CHEMBANK DATABASE
 http://chembank.broadinstitute.org/
 It is a public, web-based informatics environment.
 Created by the Broad Institute’s Chemical Biology
Program.
 It stores information of small molecules and biomedically
relevant assays.
 It is intended to guide chemists synthesizing novel
compounds or libraries, to assist biologists searching for
small molecules.
HOME PAGE OF CHEMBANK DATABASE
CHEMBL DATABASE
 https://www.ebi.ac.uk/chembl/
 ChEMBLdb is a manually curated chemical
database of bioactive molecules.
 It is maintained by the European Bioinformatics
Institute (EBI), of the EMBL.
 ChEMBL database contains compound bioactivity
data against drug targets.
 The latest version 2 (ChEMBL_02).
HOME PAGE OF CHEMBL DATABASE
DRUGBANK DATABASE
 https://www.drugbank.ca/
 The DrugBank database is a comprehensive, freely
accessible, online database.
 Containing information on drugs and drug targets.
 It is widely used by the drug industry, medicinal
chemists, pharmacists, physicians, students.
 The latest release of the database (version 5.0.11).
 It contains 11,002 drug entries.
HOME PAGE OF DRUGBANK DATABASE
APPLICATIONS
 Storage and retrieval- The primary application is the
storage, indexing and search of information relating to
compounds.
 Virtual libraries- Chemical data can related to real or
virtual molecules,Virtual libraries of classes of
compounds: drugs, natural products, diversity-oriented
synthetic products.
 Virtual screening-It involves computationally screening in
silico libraries of compounds.
 Quantitative structure-activity relationship (QSAR)-Used
to predict the activity of compounds from their structures.
REFERENCES
 An introduction to cheminformatics, A. R. Leach, V. J.
Gillet.
 Cheminformatics, Johann Gasteiger and Thomas
Engel (Eds).
 Molecular modelling – Principles and Applications, A.
R. Leach.
 Gasteiger, Editor, Handbook of Chemoinformatics -
From Data to Knowledge, Wiley-VCH, Weinheim.
THANK YOU

More Related Content

What's hot

MOLECULAR DOCKING.pptx
MOLECULAR DOCKING.pptxMOLECULAR DOCKING.pptx
MOLECULAR DOCKING.pptx
E Poovarasan
 
A Brief Overview of Cheminformatics
A Brief Overview of CheminformaticsA Brief Overview of Cheminformatics
A Brief Overview of Cheminformatics
Sunghwan Kim
 
Homology modeling
Homology modelingHomology modeling
Molecular docking
Molecular dockingMolecular docking
Molecular docking
Rahul B S
 
Structure based drug designing
Structure based drug designingStructure based drug designing
Structure based drug designing
Seenam Iftikhar
 
Molecular Mechanics in Molecular Modeling
Molecular Mechanics in Molecular ModelingMolecular Mechanics in Molecular Modeling
Molecular Mechanics in Molecular Modeling
Akshay Kank
 
Molecular modelling
Molecular modelling Molecular modelling
Molecular modelling
Pharmaceutical
 
Protein 3 d structure prediction
Protein 3 d structure predictionProtein 3 d structure prediction
Protein 3 d structure prediction
Samvartika Majumdar
 
Role of bioinformatics in drug designing
Role of bioinformatics in drug designingRole of bioinformatics in drug designing
Role of bioinformatics in drug designing
W Roseybala Devi
 
CADD
CADDCADD
Molecular modelling and docking studies
Molecular modelling and docking studiesMolecular modelling and docking studies
Molecular modelling and docking studies
routhusree
 
molecular docking its types and de novo drug design and application and softw...
molecular docking its types and de novo drug design and application and softw...molecular docking its types and de novo drug design and application and softw...
molecular docking its types and de novo drug design and application and softw...
GAUTAM KHUNE
 
energy minimization
energy minimizationenergy minimization
energy minimizationpradeep kore
 
Pharmacophore
PharmacophorePharmacophore
Pharmacophore
sachin panwar
 
Computer aided drug designing
Computer aided drug designingComputer aided drug designing
Computer aided drug designing
Muhammed sadiq
 
MOLECULAR DOCKING
MOLECULAR DOCKINGMOLECULAR DOCKING
MOLECULAR DOCKING
Saramita De Chakravarti
 
Protein-Ligand Docking
Protein-Ligand DockingProtein-Ligand Docking
Protein-Ligand Dockingbaoilleach
 
Docking
DockingDocking
Docking
Monika Verma
 
Virtual sreening
Virtual sreeningVirtual sreening
Virtual sreening
Mahendra G S
 

What's hot (20)

MOLECULAR DOCKING.pptx
MOLECULAR DOCKING.pptxMOLECULAR DOCKING.pptx
MOLECULAR DOCKING.pptx
 
A Brief Overview of Cheminformatics
A Brief Overview of CheminformaticsA Brief Overview of Cheminformatics
A Brief Overview of Cheminformatics
 
Homology modeling
Homology modelingHomology modeling
Homology modeling
 
Molecular docking
Molecular dockingMolecular docking
Molecular docking
 
Structure based drug designing
Structure based drug designingStructure based drug designing
Structure based drug designing
 
Molecular Mechanics in Molecular Modeling
Molecular Mechanics in Molecular ModelingMolecular Mechanics in Molecular Modeling
Molecular Mechanics in Molecular Modeling
 
Computer Aided Drug Design ppt
Computer Aided Drug Design pptComputer Aided Drug Design ppt
Computer Aided Drug Design ppt
 
Molecular modelling
Molecular modelling Molecular modelling
Molecular modelling
 
Protein 3 d structure prediction
Protein 3 d structure predictionProtein 3 d structure prediction
Protein 3 d structure prediction
 
Role of bioinformatics in drug designing
Role of bioinformatics in drug designingRole of bioinformatics in drug designing
Role of bioinformatics in drug designing
 
CADD
CADDCADD
CADD
 
Molecular modelling and docking studies
Molecular modelling and docking studiesMolecular modelling and docking studies
Molecular modelling and docking studies
 
molecular docking its types and de novo drug design and application and softw...
molecular docking its types and de novo drug design and application and softw...molecular docking its types and de novo drug design and application and softw...
molecular docking its types and de novo drug design and application and softw...
 
energy minimization
energy minimizationenergy minimization
energy minimization
 
Pharmacophore
PharmacophorePharmacophore
Pharmacophore
 
Computer aided drug designing
Computer aided drug designingComputer aided drug designing
Computer aided drug designing
 
MOLECULAR DOCKING
MOLECULAR DOCKINGMOLECULAR DOCKING
MOLECULAR DOCKING
 
Protein-Ligand Docking
Protein-Ligand DockingProtein-Ligand Docking
Protein-Ligand Docking
 
Docking
DockingDocking
Docking
 
Virtual sreening
Virtual sreeningVirtual sreening
Virtual sreening
 

Similar to Chemoinformatics

Lecture 12 – chemoinformatic
Lecture 12 – chemoinformatic Lecture 12 – chemoinformatic
Lecture 12 – chemoinformatic
RAJAN ROLTA
 
An Introduction to Chemoinformatics for the postgraduate students of Agriculture
An Introduction to Chemoinformatics for the postgraduate students of AgricultureAn Introduction to Chemoinformatics for the postgraduate students of Agriculture
An Introduction to Chemoinformatics for the postgraduate students of Agriculture
Devakumar Jain
 
Chemoinformatic File Format.pptx
Chemoinformatic File Format.pptxChemoinformatic File Format.pptx
Chemoinformatic File Format.pptx
wadhava gurumeet
 
The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Tox...
The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Tox...The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Tox...
The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Tox...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Precompetitive preclinical ADME/tox data and set it free on the web to facili...
Precompetitive preclinical ADME/tox data and set it free on the web to facili...Precompetitive preclinical ADME/tox data and set it free on the web to facili...
Precompetitive preclinical ADME/tox data and set it free on the web to facili...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Assignment 105B.pptx
Assignment 105B.pptxAssignment 105B.pptx
Assignment 105B.pptx
RajendraRawat27
 
Implications of structural and chemical data bases
Implications of structural and chemical data basesImplications of structural and chemical data bases
Implications of structural and chemical data bases
Bhavitha Pulaparthi
 
Automated identification and conversion of chemical names to structure search...
Automated identification and conversion of chemical names to structure search...Automated identification and conversion of chemical names to structure search...
Automated identification and conversion of chemical names to structure search...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Automated Identification and Conversion of Chemical Names to Structure Search...
Automated Identification and Conversion of Chemical Names to Structure Search...Automated Identification and Conversion of Chemical Names to Structure Search...
Automated Identification and Conversion of Chemical Names to Structure Search...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Pallavi gupta
Pallavi guptaPallavi gupta
Pallavi gupta
PallaviGupta220
 
Protein Data Bank ( PDB ) - Bioinformatics
Protein Data Bank ( PDB ) - BioinformaticsProtein Data Bank ( PDB ) - Bioinformatics
Protein Data Bank ( PDB ) - Bioinformatics
karmandeepkaur7
 
Peptide line notations for biologics registration and patent filings
Peptide line notations for biologics registration and patent filingsPeptide line notations for biologics registration and patent filings
Peptide line notations for biologics registration and patent filings
NextMove Software
 
US-EPA CompTox Chemicals Dashboard as a web-based data resource to help ident...
US-EPA CompTox Chemicals Dashboard as a web-based data resource to help ident...US-EPA CompTox Chemicals Dashboard as a web-based data resource to help ident...
US-EPA CompTox Chemicals Dashboard as a web-based data resource to help ident...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
II-SDV 2017: The "International Chemical Ontology Network"
II-SDV 2017: The "International Chemical Ontology Network" II-SDV 2017: The "International Chemical Ontology Network"
II-SDV 2017: The "International Chemical Ontology Network"
Dr. Haxel Consult
 
La chemoinformatica: uno strumento computazionale per la chimica farmaceutica
La chemoinformatica: uno strumento computazionale per la chimica farmaceuticaLa chemoinformatica: uno strumento computazionale per la chimica farmaceutica
La chemoinformatica: uno strumento computazionale per la chimica farmaceutica
CRS4 Research Center in Sardinia
 
20090511 Manchester Biochemistry
20090511 Manchester Biochemistry20090511 Manchester Biochemistry
20090511 Manchester Biochemistry
Michel Dumontier
 
INFORMATICS 2.pptx
INFORMATICS 2.pptxINFORMATICS 2.pptx
INFORMATICS 2.pptx
ramadevi824914
 
INFORMATICS 2.pptx
INFORMATICS 2.pptxINFORMATICS 2.pptx
INFORMATICS 2.pptx
Oramadevi1
 

Similar to Chemoinformatics (20)

Lecture 12 – chemoinformatic
Lecture 12 – chemoinformatic Lecture 12 – chemoinformatic
Lecture 12 – chemoinformatic
 
An Introduction to Chemoinformatics for the postgraduate students of Agriculture
An Introduction to Chemoinformatics for the postgraduate students of AgricultureAn Introduction to Chemoinformatics for the postgraduate students of Agriculture
An Introduction to Chemoinformatics for the postgraduate students of Agriculture
 
Chemoinformatic File Format.pptx
Chemoinformatic File Format.pptxChemoinformatic File Format.pptx
Chemoinformatic File Format.pptx
 
The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Tox...
The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Tox...The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Tox...
The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Tox...
 
Precompetitive preclinical ADME/tox data and set it free on the web to facili...
Precompetitive preclinical ADME/tox data and set it free on the web to facili...Precompetitive preclinical ADME/tox data and set it free on the web to facili...
Precompetitive preclinical ADME/tox data and set it free on the web to facili...
 
Assignment 105B.pptx
Assignment 105B.pptxAssignment 105B.pptx
Assignment 105B.pptx
 
Protein Data Bank
Protein Data BankProtein Data Bank
Protein Data Bank
 
Implications of structural and chemical data bases
Implications of structural and chemical data basesImplications of structural and chemical data bases
Implications of structural and chemical data bases
 
Automated identification and conversion of chemical names to structure search...
Automated identification and conversion of chemical names to structure search...Automated identification and conversion of chemical names to structure search...
Automated identification and conversion of chemical names to structure search...
 
Automated Identification and Conversion of Chemical Names to Structure Search...
Automated Identification and Conversion of Chemical Names to Structure Search...Automated Identification and Conversion of Chemical Names to Structure Search...
Automated Identification and Conversion of Chemical Names to Structure Search...
 
Pallavi gupta
Pallavi guptaPallavi gupta
Pallavi gupta
 
Protein Data Bank ( PDB ) - Bioinformatics
Protein Data Bank ( PDB ) - BioinformaticsProtein Data Bank ( PDB ) - Bioinformatics
Protein Data Bank ( PDB ) - Bioinformatics
 
Ppi
PpiPpi
Ppi
 
Peptide line notations for biologics registration and patent filings
Peptide line notations for biologics registration and patent filingsPeptide line notations for biologics registration and patent filings
Peptide line notations for biologics registration and patent filings
 
US-EPA CompTox Chemicals Dashboard as a web-based data resource to help ident...
US-EPA CompTox Chemicals Dashboard as a web-based data resource to help ident...US-EPA CompTox Chemicals Dashboard as a web-based data resource to help ident...
US-EPA CompTox Chemicals Dashboard as a web-based data resource to help ident...
 
II-SDV 2017: The "International Chemical Ontology Network"
II-SDV 2017: The "International Chemical Ontology Network" II-SDV 2017: The "International Chemical Ontology Network"
II-SDV 2017: The "International Chemical Ontology Network"
 
La chemoinformatica: uno strumento computazionale per la chimica farmaceutica
La chemoinformatica: uno strumento computazionale per la chimica farmaceuticaLa chemoinformatica: uno strumento computazionale per la chimica farmaceutica
La chemoinformatica: uno strumento computazionale per la chimica farmaceutica
 
20090511 Manchester Biochemistry
20090511 Manchester Biochemistry20090511 Manchester Biochemistry
20090511 Manchester Biochemistry
 
INFORMATICS 2.pptx
INFORMATICS 2.pptxINFORMATICS 2.pptx
INFORMATICS 2.pptx
 
INFORMATICS 2.pptx
INFORMATICS 2.pptxINFORMATICS 2.pptx
INFORMATICS 2.pptx
 

Recently uploaded

Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
Jean Carlos Nunes Paixão
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
TechSoup
 
Multithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race conditionMultithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race condition
Mohammed Sikander
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
Celine George
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
Balvir Singh
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
Tamralipta Mahavidyalaya
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
vaibhavrinwa19
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
Thiyagu K
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
siemaillard
 
Marketing internship report file for MBA
Marketing internship report file for MBAMarketing internship report file for MBA
Marketing internship report file for MBA
gb193092
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
heathfieldcps1
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
EduSkills OECD
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
Atul Kumar Singh
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
EverAndrsGuerraGuerr
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
Vikramjit Singh
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
TechSoup
 

Recently uploaded (20)

Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
 
Multithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race conditionMultithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race condition
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Marketing internship report file for MBA
Marketing internship report file for MBAMarketing internship report file for MBA
Marketing internship report file for MBA
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
 

Chemoinformatics

  • 1. CHEMOINFORMATICS Presented by Miss. Rupali D. Salunkhe PG Department of Bioinformatics, WCBT, Solapur
  • 2. CONTENTS  Introduction  Representation of molecules  File formats  Databases  Applications  References
  • 3. INTRODUCTION  Chemoinformatics is the combination of chemistry and information technology, is required for the processing and analysis of chemical data.  Design, creation, organization, management, analysis, visualization and use of chemical information.  It is the application of informatics methods to solve chemical problems.  It is relevant to biologists because chemistry data are important in many areas of molecular biology, e.g, in the study of protein interactions.
  • 4.  The term chemoinformatics was defined by F.K. Brown in 1998.  It combines the scientific working fields of chemistry, computer science and information science.  For example in the areas of topology, chemical graph theory, information retrieval and data mining in the chemical space.  It can also be applied to data analysis for various industries like paper , pulp and dyes industries.
  • 5. REPRESENTATION OF MOLECULES  1D– computed/experimental global properties  2D – the chemical structure diagram  3D – atomic coordinate data
  • 6. FILE FORMATS  Mol format  SDF format  SMILES format
  • 7. MOL FORMAT  An MDL Molfile is a file format for holding information about the atoms, bonds, connectivity and coordinates of a molecule.  The format was created by MDL Information Systems (MDL) [Molecular Design Limited, Inc.].  It is also supported by some computational software such as Mathematica.  The molfile consists of - Header information ,the Connection Table (CT) containing atom information, then bond connections and types.
  • 8. SDF FORMAT  SDF is one of a family of chemical-data file formats developed by MDL.  SDF stands for structure-data file.  SDF files actually wrap the molfile (MDL Molfile) format.  Multiple compounds are delimited by lines consisting of four dollar signs ($$$$).  A feature of the SDF format is its ability to include associated data.
  • 9. SMILES FORMAT  The Simplified Molecular Input Line Entry Specification (SMILES) is a line notation for molecules.  SMILES strings include connectivity but do not include 2D or 3D coordinates.  Hydrogen atoms are not represented.  Other atoms are represented by their element symbols C, N,O, S, Cl, Br, and I.  The symbol "=" represents double bonds and "#" represents triple bonds, branching is indicated by () and rings are indicated by pairs of digits.
  • 11. PUBCHEM DATABASE  https://pubchem.ncbi.nlm.nih.gov/  PubChem is a database of chemical molecules and their activities against biological assays.  Is maintained by the NCBI.  It contains substance descriptions and small molecules with more than 1000 atoms and 1000 bonds.  Searching - chemical structure, name fragments, chemical formula, molecular weight.
  • 12. HOME PAGE OF PUBCHEM DATABASE
  • 13. CHEMBANK DATABASE  http://chembank.broadinstitute.org/  It is a public, web-based informatics environment.  Created by the Broad Institute’s Chemical Biology Program.  It stores information of small molecules and biomedically relevant assays.  It is intended to guide chemists synthesizing novel compounds or libraries, to assist biologists searching for small molecules.
  • 14. HOME PAGE OF CHEMBANK DATABASE
  • 15. CHEMBL DATABASE  https://www.ebi.ac.uk/chembl/  ChEMBLdb is a manually curated chemical database of bioactive molecules.  It is maintained by the European Bioinformatics Institute (EBI), of the EMBL.  ChEMBL database contains compound bioactivity data against drug targets.  The latest version 2 (ChEMBL_02).
  • 16. HOME PAGE OF CHEMBL DATABASE
  • 17. DRUGBANK DATABASE  https://www.drugbank.ca/  The DrugBank database is a comprehensive, freely accessible, online database.  Containing information on drugs and drug targets.  It is widely used by the drug industry, medicinal chemists, pharmacists, physicians, students.  The latest release of the database (version 5.0.11).  It contains 11,002 drug entries.
  • 18. HOME PAGE OF DRUGBANK DATABASE
  • 19. APPLICATIONS  Storage and retrieval- The primary application is the storage, indexing and search of information relating to compounds.  Virtual libraries- Chemical data can related to real or virtual molecules,Virtual libraries of classes of compounds: drugs, natural products, diversity-oriented synthetic products.  Virtual screening-It involves computationally screening in silico libraries of compounds.  Quantitative structure-activity relationship (QSAR)-Used to predict the activity of compounds from their structures.
  • 20. REFERENCES  An introduction to cheminformatics, A. R. Leach, V. J. Gillet.  Cheminformatics, Johann Gasteiger and Thomas Engel (Eds).  Molecular modelling – Principles and Applications, A. R. Leach.  Gasteiger, Editor, Handbook of Chemoinformatics - From Data to Knowledge, Wiley-VCH, Weinheim.