SlideShare a Scribd company logo
1 of 13
Download to read offline
Chemoinformatics in action:
     some question for audience

 Yuriy Sushko, Sergii Novotarskyi
Practical example
Story:
A company that produces or intends
   to produce some particular
   compound (drug, make up, paint,
   glue, toilet refresher, whatever..) is
   obliged to test, if this compound is
   toxic for human and how toxic it is.
   What are the options to check
   this?



                                            Teuthrin, Cyclopropanecarboxylic acid
Practical example

       Bioassay                      Computer modeling

                                    In silico: using QSAR (QSPR) based
                                    on machine learning to predict
In vivo and in vitro assays with
                                    properties of interest without direct
mice, dogs, rats or other species
                                    experiment.
Option 1: Bioassay
Classical and currently widely used method
  for measuring toxicity is bioassay with
  mice, rats, dogs or other species.

What are advantages and disadvantages
Option 1: Bioassay
For bioassay we would typically need:
• Dozens of mice for checking several concentrations of
  tested compound
• In some assays we need to wait for next generation
• We may need to test against several organisms (rat,
  mouse) and dierent administration routes (oral, skin, IV
  injection)
• Test can take upto several months
• Test would cost upto dozens of thousands dollars
     What if we need to measure toxicity for 100 000 compounds?
Option 2: Modeling
What are the steps required to build
 predictive model for physicochemical or
 biological property?

• Prepare dataset of experimental data
• Choose and calculate molecular
  descriptors
• Apply machine learning method
Molecular descriptors
What is descriptor? Most simple examples?

Descriptor is some numerical property of chemical
  compound.

•   Simplest constitutional descriptors: MW, NA, nDB, ..
•   Molecular properties: LogP, hydrophilic factor, ..
•   Randic molecular profiles
•   Various topological and 3D indices and profiles
Molecular descriptors
         2.54
         4.25
         -5.71
         3.26
         0.57
         -0.07



         1.45
         6.34
         8.28
         2.78
         -5.67
         -2.33



         1.45
         7.34
         8.35
         1.64
         -5.56
         -4.45
Machine learning
What kind of machine learning methods do
  you know?
• Linear regression
• K nearest neighbors (KNN)
• Partial Least Regression
• Neural networks
• Support Vector Machines
Some additional facts
Popular formats for representing molecules
  in databases
• SDF
• SMILES
• INCHI
SDF — a plain text file
benzene
ACD/Labs0812062058                                                                     header
 6   6 0 0 0 0 0       0     0 0     1 V2000
    1.9050  -0.7932        0.0000   C   0 0    0   0   0   0   0   0   0   0   0   0
    1.9050  -2.1232        0.0000   C   0 0    0   0   0   0   0   0   0   0   0   0
    0.7531  -0.1282        0.0000   C   0 0    0   0   0   0   0   0   0   0   0   0
    0.7531  -2.7882        0.0000   C   0 0    0   0   0   0   0   0   0   0   0   0   atom information
   -0.3987  -0.7932        0.0000   C   0 0    0   0   0   0   0   0   0   0   0   0
   -0.3987  -2.1232        0.0000   C   0 0    0   0   0   0   0   0   0   0   0   0
  2 1 1 0 0 0 0
  3 1 2 0 0 0 0
  4 2 2 0 0 0 0
  5 3 1 0 0 0 0                                                                        bond information
  6 4 1 0 0 0 0
  6 5 2 0 0 0 0
 M END
 $$$$
> <Unique_ID>
XCA3464366

> <ClogP>
5.825
                                                                                       tags
> <Vendor>
Sigma

> <Molecular Weight>
499.611
SMILES — a string representation

                     C1=CC=C(C=C1)Br



                     CC(F)F




                     COC(C(Cl)Cl)(F)
                     F
InChI — one more approach
 InChI (international chemical identifier) — a standart, developed by IUPAC
    for a textual identifier of chemical substances


              InChI: InChI=1S/C6H5Br/c7-6-4-2-1-3-5-6/h1-5H
              InChIKey: QARVLSVVCXYDNA-UHFFFAOYSA


             InChI: InChI=1S/C2H4F2/c1-2(3)4/h2H,1H3
             InChIKey: NPNPZTNLOVBDOC-UHFFFAOYSA


              InChI: InChI=1S/C3H4Cl2F2O/c1-8-3(6,7)2(4)5/h2H,1H3
              InChIKey: RFKMCNOHBTXSMU-UHFFFAOYSA

More Related Content

Viewers also liked

2012 mda navarra marcelo ranzini_ casa vilassar
2012 mda navarra marcelo ranzini_ casa vilassar2012 mda navarra marcelo ranzini_ casa vilassar
2012 mda navarra marcelo ranzini_ casa vilassar
mdanavarra
 

Viewers also liked (14)

Evaluation question 1 Nikitha
Evaluation question 1 Nikitha Evaluation question 1 Nikitha
Evaluation question 1 Nikitha
 
Maria Machlowska i Elżbieta Sądel - "Appium: automatyzacja testów w Mobile"
Maria Machlowska i Elżbieta Sądel - "Appium: automatyzacja testów w Mobile"Maria Machlowska i Elżbieta Sądel - "Appium: automatyzacja testów w Mobile"
Maria Machlowska i Elżbieta Sądel - "Appium: automatyzacja testów w Mobile"
 
INOBACION TECNICA Y DESARROLLO SOSTENIBLE
INOBACION TECNICA Y DESARROLLO SOSTENIBLE INOBACION TECNICA Y DESARROLLO SOSTENIBLE
INOBACION TECNICA Y DESARROLLO SOSTENIBLE
 
Exj 5
Exj 5Exj 5
Exj 5
 
Smith middle school Guthrie
Smith middle school GuthrieSmith middle school Guthrie
Smith middle school Guthrie
 
Selenium
SeleniumSelenium
Selenium
 
Social media strategies in the workplace
Social media strategies in the workplaceSocial media strategies in the workplace
Social media strategies in the workplace
 
2012 mda navarra marcelo ranzini_ casa vilassar
2012 mda navarra marcelo ranzini_ casa vilassar2012 mda navarra marcelo ranzini_ casa vilassar
2012 mda navarra marcelo ranzini_ casa vilassar
 
The Internet Is Dying
The Internet Is DyingThe Internet Is Dying
The Internet Is Dying
 
SORACOM UG Miyagi #1 | IoT通信プラットフォーム SORACOM のご紹介と最新情報
SORACOM UG Miyagi #1 | IoT通信プラットフォーム SORACOM のご紹介と最新情報SORACOM UG Miyagi #1 | IoT通信プラットフォーム SORACOM のご紹介と最新情報
SORACOM UG Miyagi #1 | IoT通信プラットフォーム SORACOM のご紹介と最新情報
 
Samsung galaxy s® iii
Samsung galaxy s® iiiSamsung galaxy s® iii
Samsung galaxy s® iii
 
Quiz prelims
Quiz prelimsQuiz prelims
Quiz prelims
 
Brochure Salvo website
Brochure Salvo websiteBrochure Salvo website
Brochure Salvo website
 
here is the Shift
here is the Shifthere is the Shift
here is the Shift
 

Similar to Chemoinformatics in Action

Robots, Small Molecules & R
Robots, Small Molecules & RRobots, Small Molecules & R
Robots, Small Molecules & R
Rajarshi Guha
 
Chemistry data: Distortion and dissemination in the Internet Era
Chemistry data: Distortion and dissemination in the Internet EraChemistry data: Distortion and dissemination in the Internet Era
Chemistry data: Distortion and dissemination in the Internet Era
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Prediction of pKa from chemical structure using free and open source tools
Prediction of pKa from chemical structure using free and open source toolsPrediction of pKa from chemical structure using free and open source tools
Prediction of pKa from chemical structure using free and open source tools
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 

Similar to Chemoinformatics in Action (20)

ACS Meeting New Orleans 2013 (CINF)
ACS Meeting New Orleans 2013 (CINF)ACS Meeting New Orleans 2013 (CINF)
ACS Meeting New Orleans 2013 (CINF)
 
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
 
A Global Commons for Scientific Data: Molecules and Wikidata
A Global Commons for Scientific Data: Molecules and WikidataA Global Commons for Scientific Data: Molecules and Wikidata
A Global Commons for Scientific Data: Molecules and Wikidata
 
Data integration and building a profile for yourself as an online scientist
Data integration and building a profile for yourself as an online scientistData integration and building a profile for yourself as an online scientist
Data integration and building a profile for yourself as an online scientist
 
The anatomy of a chemical reaction: Dissection by machine learning algorithms
The anatomy of a chemical reaction: Dissection by machine learning algorithmsThe anatomy of a chemical reaction: Dissection by machine learning algorithms
The anatomy of a chemical reaction: Dissection by machine learning algorithms
 
Reading and Writing Molecular File Formats for Data Exchange of Small Molecul...
Reading and Writing Molecular File Formats for Data Exchange of Small Molecul...Reading and Writing Molecular File Formats for Data Exchange of Small Molecul...
Reading and Writing Molecular File Formats for Data Exchange of Small Molecul...
 
|QAB> : Quantum Computing, AI and Blockchain
|QAB> : Quantum Computing, AI and Blockchain|QAB> : Quantum Computing, AI and Blockchain
|QAB> : Quantum Computing, AI and Blockchain
 
EUGM 2013 - Anh Kiet Tran Minh (CNRS): French Academic Compound Library: the ...
EUGM 2013 - Anh Kiet Tran Minh (CNRS): French Academic Compound Library: the ...EUGM 2013 - Anh Kiet Tran Minh (CNRS): French Academic Compound Library: the ...
EUGM 2013 - Anh Kiet Tran Minh (CNRS): French Academic Compound Library: the ...
 
OPERA: A free and open source QSAR tool for predicting physicochemical proper...
OPERA: A free and open source QSAR tool for predicting physicochemical proper...OPERA: A free and open source QSAR tool for predicting physicochemical proper...
OPERA: A free and open source QSAR tool for predicting physicochemical proper...
 
Nirs
NirsNirs
Nirs
 
151 performance of a localized fiber optic
151 performance of a localized fiber optic151 performance of a localized fiber optic
151 performance of a localized fiber optic
 
151 performance of a localized fiber optic
151 performance of a localized fiber optic151 performance of a localized fiber optic
151 performance of a localized fiber optic
 
Substructure Search Face-off
Substructure Search Face-offSubstructure Search Face-off
Substructure Search Face-off
 
Robots, Small Molecules & R
Robots, Small Molecules & RRobots, Small Molecules & R
Robots, Small Molecules & R
 
An examination of data quality on QSAR Modeling in regards to the environment...
An examination of data quality on QSAR Modeling in regards to the environment...An examination of data quality on QSAR Modeling in regards to the environment...
An examination of data quality on QSAR Modeling in regards to the environment...
 
An examination of data quality on QSAR Modeling in regards to the environment...
An examination of data quality on QSAR Modeling in regards to the environment...An examination of data quality on QSAR Modeling in regards to the environment...
An examination of data quality on QSAR Modeling in regards to the environment...
 
Chemistry data: Distortion and dissemination in the Internet Era
Chemistry data: Distortion and dissemination in the Internet EraChemistry data: Distortion and dissemination in the Internet Era
Chemistry data: Distortion and dissemination in the Internet Era
 
Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...
 
Prediction of pKa from chemical structure using free and open source tools
Prediction of pKa from chemical structure using free and open source toolsPrediction of pKa from chemical structure using free and open source tools
Prediction of pKa from chemical structure using free and open source tools
 
20160219 - M. Agostini - Nuove tecnologie per lo studio del DNA tumorale libe...
20160219 - M. Agostini - Nuove tecnologie per lo studio del DNA tumorale libe...20160219 - M. Agostini - Nuove tecnologie per lo studio del DNA tumorale libe...
20160219 - M. Agostini - Nuove tecnologie per lo studio del DNA tumorale libe...
 

More from SSA KPI

Germany presentation
Germany presentationGermany presentation
Germany presentation
SSA KPI
 
Grand challenges in energy
Grand challenges in energyGrand challenges in energy
Grand challenges in energy
SSA KPI
 
Engineering role in sustainability
Engineering role in sustainabilityEngineering role in sustainability
Engineering role in sustainability
SSA KPI
 
Consensus and interaction on a long term strategy for sustainable development
Consensus and interaction on a long term strategy for sustainable developmentConsensus and interaction on a long term strategy for sustainable development
Consensus and interaction on a long term strategy for sustainable development
SSA KPI
 
Competences in sustainability in engineering education
Competences in sustainability in engineering educationCompetences in sustainability in engineering education
Competences in sustainability in engineering education
SSA KPI
 
Introducatio SD for enginers
Introducatio SD for enginersIntroducatio SD for enginers
Introducatio SD for enginers
SSA KPI
 

More from SSA KPI (20)

Germany presentation
Germany presentationGermany presentation
Germany presentation
 
Grand challenges in energy
Grand challenges in energyGrand challenges in energy
Grand challenges in energy
 
Engineering role in sustainability
Engineering role in sustainabilityEngineering role in sustainability
Engineering role in sustainability
 
Consensus and interaction on a long term strategy for sustainable development
Consensus and interaction on a long term strategy for sustainable developmentConsensus and interaction on a long term strategy for sustainable development
Consensus and interaction on a long term strategy for sustainable development
 
Competences in sustainability in engineering education
Competences in sustainability in engineering educationCompetences in sustainability in engineering education
Competences in sustainability in engineering education
 
Introducatio SD for enginers
Introducatio SD for enginersIntroducatio SD for enginers
Introducatio SD for enginers
 
DAAD-10.11.2011
DAAD-10.11.2011DAAD-10.11.2011
DAAD-10.11.2011
 
Talking with money
Talking with moneyTalking with money
Talking with money
 
'Green' startup investment
'Green' startup investment'Green' startup investment
'Green' startup investment
 
From Huygens odd sympathy to the energy Huygens' extraction from the sea waves
From Huygens odd sympathy to the energy Huygens' extraction from the sea wavesFrom Huygens odd sympathy to the energy Huygens' extraction from the sea waves
From Huygens odd sympathy to the energy Huygens' extraction from the sea waves
 
Dynamics of dice games
Dynamics of dice gamesDynamics of dice games
Dynamics of dice games
 
Energy Security Costs
Energy Security CostsEnergy Security Costs
Energy Security Costs
 
Naturally Occurring Radioactivity (NOR) in natural and anthropic environments
Naturally Occurring Radioactivity (NOR) in natural and anthropic environmentsNaturally Occurring Radioactivity (NOR) in natural and anthropic environments
Naturally Occurring Radioactivity (NOR) in natural and anthropic environments
 
Advanced energy technology for sustainable development. Part 5
Advanced energy technology for sustainable development. Part 5Advanced energy technology for sustainable development. Part 5
Advanced energy technology for sustainable development. Part 5
 
Advanced energy technology for sustainable development. Part 4
Advanced energy technology for sustainable development. Part 4Advanced energy technology for sustainable development. Part 4
Advanced energy technology for sustainable development. Part 4
 
Advanced energy technology for sustainable development. Part 3
Advanced energy technology for sustainable development. Part 3Advanced energy technology for sustainable development. Part 3
Advanced energy technology for sustainable development. Part 3
 
Advanced energy technology for sustainable development. Part 2
Advanced energy technology for sustainable development. Part 2Advanced energy technology for sustainable development. Part 2
Advanced energy technology for sustainable development. Part 2
 
Advanced energy technology for sustainable development. Part 1
Advanced energy technology for sustainable development. Part 1Advanced energy technology for sustainable development. Part 1
Advanced energy technology for sustainable development. Part 1
 
Fluorescent proteins in current biology
Fluorescent proteins in current biologyFluorescent proteins in current biology
Fluorescent proteins in current biology
 
Neurotransmitter systems of the brain and their functions
Neurotransmitter systems of the brain and their functionsNeurotransmitter systems of the brain and their functions
Neurotransmitter systems of the brain and their functions
 

Recently uploaded

Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Krashi Coaching
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 

Recently uploaded (20)

Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 

Chemoinformatics in Action

  • 1. Chemoinformatics in action: some question for audience Yuriy Sushko, Sergii Novotarskyi
  • 2. Practical example Story: A company that produces or intends to produce some particular compound (drug, make up, paint, glue, toilet refresher, whatever..) is obliged to test, if this compound is toxic for human and how toxic it is. What are the options to check this? Teuthrin, Cyclopropanecarboxylic acid
  • 3. Practical example Bioassay Computer modeling In silico: using QSAR (QSPR) based on machine learning to predict In vivo and in vitro assays with properties of interest without direct mice, dogs, rats or other species experiment.
  • 4. Option 1: Bioassay Classical and currently widely used method for measuring toxicity is bioassay with mice, rats, dogs or other species. What are advantages and disadvantages
  • 5. Option 1: Bioassay For bioassay we would typically need: • Dozens of mice for checking several concentrations of tested compound • In some assays we need to wait for next generation • We may need to test against several organisms (rat, mouse) and dierent administration routes (oral, skin, IV injection) • Test can take upto several months • Test would cost upto dozens of thousands dollars What if we need to measure toxicity for 100 000 compounds?
  • 6. Option 2: Modeling What are the steps required to build predictive model for physicochemical or biological property? • Prepare dataset of experimental data • Choose and calculate molecular descriptors • Apply machine learning method
  • 7. Molecular descriptors What is descriptor? Most simple examples? Descriptor is some numerical property of chemical compound. • Simplest constitutional descriptors: MW, NA, nDB, .. • Molecular properties: LogP, hydrophilic factor, .. • Randic molecular profiles • Various topological and 3D indices and profiles
  • 8. Molecular descriptors 2.54 4.25 -5.71 3.26 0.57 -0.07 1.45 6.34 8.28 2.78 -5.67 -2.33 1.45 7.34 8.35 1.64 -5.56 -4.45
  • 9. Machine learning What kind of machine learning methods do you know? • Linear regression • K nearest neighbors (KNN) • Partial Least Regression • Neural networks • Support Vector Machines
  • 10. Some additional facts Popular formats for representing molecules in databases • SDF • SMILES • INCHI
  • 11. SDF — a plain text file benzene ACD/Labs0812062058 header 6 6 0 0 0 0 0 0 0 0 1 V2000 1.9050 -0.7932 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0 1.9050 -2.1232 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0 0.7531 -0.1282 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0 0.7531 -2.7882 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0 atom information -0.3987 -0.7932 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0 -0.3987 -2.1232 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0 2 1 1 0 0 0 0 3 1 2 0 0 0 0 4 2 2 0 0 0 0 5 3 1 0 0 0 0 bond information 6 4 1 0 0 0 0 6 5 2 0 0 0 0 M END $$$$ > <Unique_ID> XCA3464366 > <ClogP> 5.825 tags > <Vendor> Sigma > <Molecular Weight> 499.611
  • 12. SMILES — a string representation C1=CC=C(C=C1)Br CC(F)F COC(C(Cl)Cl)(F) F
  • 13. InChI — one more approach InChI (international chemical identifier) — a standart, developed by IUPAC for a textual identifier of chemical substances InChI: InChI=1S/C6H5Br/c7-6-4-2-1-3-5-6/h1-5H InChIKey: QARVLSVVCXYDNA-UHFFFAOYSA InChI: InChI=1S/C2H4F2/c1-2(3)4/h2H,1H3 InChIKey: NPNPZTNLOVBDOC-UHFFFAOYSA InChI: InChI=1S/C3H4Cl2F2O/c1-8-3(6,7)2(4)5/h2H,1H3 InChIKey: RFKMCNOHBTXSMU-UHFFFAOYSA