12. Parsing Log Files to CML Coordinates Molecular Formula Calculation Type Point Group Dipole Total Energy Computational Chemistry Log Files
13. CompChem Output Coordinates Energy Levels Vibrations Coordinates Energy Level Vibration CML File CMLCore CMLCore CMLComp CMLSpect Input/jobControl General Parsers
14. Dissemination of results LOG FILE CML FILE HUMAN DISPLAY WWMM* Server and DSpace Outside world JUMBOMarker NLP-based log file parser * World Wide Molecular Matrix
15.
16. Proteus molecules * Calculation JUNK Cured by MOPAC * Proteus was a shape changing ocean deity
25. Analysis of different computational methods Mean - Overall difference Normality - Distribution of values Outliers - Unusual molecules? Variance - Spread of the data, depends on both distributions. (standard deviation)
27. Mean of distribution (Approx - 0.03 Å ) Range over which sample distribution is approximately normal Outliers Probability Plot (Normal QQ plot) S.D. 0.020 Å
28. All bonds* r (MOPAC – GAMESS) / Å * Excludes bonds to Hydrogenc
29. All bonds* r (MOPAC – GAMESS) / Å Good agreement Nearly normal Outliers S.D. 0.005 Å * Excludes bonds to Hydrogenc
31. Mean r (M - G) / Å Standard Error of the Mean / Å All values given to 3 significant figures C N O F S Cl C -0.006 0.020 -0.010 -0.014 -0.040 -0.037 0.000 0.000 0.000 0.001 0.001 0.001 N 0.006 -0.037 -0.055 0.001 0.001 0.009 O -0.087 -0.070 0.004 0.014
40. Most common fragments found in Left set but not Right set C(sp 3 ) C(sp 3 ) (sp 3 ) S(sp 2 ) N(ar) N (ar) C(sp 2 ) S(sp 2 ) N(ar) N (ar) C(sp 2 ) Or
41. GAMESS Log Files Comparison of theory and experiment CIF* CIF* CIF* CIF* CIF* CIF 2 CML * CIF: Crystallographic Information File
43. All bonds* r (Cryst. – GAMESS) / Å Single molecules, no disorder * Excludes bonds to Hydrogenc
44. All bonds* r (Cryst. – GAMESS) / Å Single molecules, no disorder Mean r - 0.011 Å Nearly normal Outliers S.D. 0.014 Å * Excludes bonds to Hydrogenc