Open source


Published on

In many research disciplines, including chemical informatics, open software is successfully challenging the age old paradigm of proprietary vendor software. Not only is this good for research vitality, but there are also avenues through which open source development can be lucrative and rewarding for developers themselves! This presentation discusses the open source philosophy and provides suggestions of useful open source tools covering nearly every practical aspect of chemical informatics.

Two important notes: the original presentation included practical applications that have been deleted to honor client confidentiality. Secondly, more details regarding key open source tools are being published periodically on my Tumblr and WordPress blogs:

  • Be the first to comment

  • Be the first to like this

No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide

Open source

  1. 1. Informatics for AllThe Open Source and FreewareRevolution in Chemical Biology Applications to Selected SRI Projects
  2. 2. Proprietary software: black box model● Exclusive code control● Exclusive development / customization controlUser-initiated customization mustawait proprietor implementation Orreinvent all requisite wheels
  3. 3. Open source empowers users to pursue novelenhancements according to their needs and theirtimelines!
  4. 4. Open source = philosophically goodBut in practice: can you replace well established proprietary tools with open source and still sustain ● Effective ● Accurate ● Efficient science?
  5. 5. Sometimes it helps toYes! have a guinea pig(mostly) = me July – Oct. 16 distinct projects for 8 clients 98+% open source
  6. 6. Synthesis & Intellectual AssayProcurement Property Development WWW Meta Data Omics Meta Data Data Chemical Structures Screening Data Target Discovery Scope Structure-Based Design SAR, ADME, Tox, PK
  7. 7. Meta Data Omics Meta Data Data Chemical Structures Screening DataAcquire& managedata
  8. 8. Chemical specification, drawing & editing:Marvin ( functionality approaching that of ChemDraw; good drawing options; can embed into office documents
  9. 9. Enumerate combinatorial librariesSmiLib ( Efficient and flexibleMarvin ( Intuitive but slower
  10. 10. Molecular Structure ConversionMolconverter ( FastOpenBabel ( Excellent functionality but slow
  11. 11. Store / analyze libraries and screening dataScreening Assistant SA2 ( Powerful, enterprise-likesoftware: capable of handling internal data management for serious operations
  12. 12. WWWNeed Caveat:External logged queryKnowledgebases = disclosure!
  13. 13. Chemical Data / Meta DataChemSpider ( structure, literature, suppliersPubChem ( structure, screening dataSureChem ( patent searches
  14. 14. ADME/Tox profiling; target identificationPASS ( only offered online, free, and surprisingly accurate predictions on 300+ endpoints
  15. 15. ADME profilingiLab2 ( good range of ADME endpoints, online only, one compound at a time
  16. 16. Target DiscoveryNeed Structure-Based DesignModeling,Informatics SAR, ADME, Tox, PK
  17. 17. Molecular Structure Prediction / CharacterizationAvogadro ( Great builder; good graphics; built in molecular mechanics; hooks to free quantum codes
  18. 18. Molecular Structure Prediction / CharacterizationVMD ( Good graphics; excellent analytical tools; hooks to NAMD (molecular dynamics)
  19. 19. Molecular Structure Prediction / CharacterizationPyMol ( Great graphics; Decent builder, good analytical tools
  20. 20. Protein Structure PredictionSwissModel ( good control, must have close homologModeller ( use this for optimal control and efficient relaxation
  21. 21. Structure Based DesignPyRx / AutoDock ( easy to use; good predictionsSurflex ( fast; accurate; no free interface
  22. 22. QSAR: DescriptorsCDK ( Good descriptor selection, easy to useSA2 ( Better descriptor selection, but harder to navigate
  23. 23. QSAR: modelingBuildQSAR ( Fast, flexible, easy to use
  24. 24. Toxicology profilingToxTree ( fast, easy to use, clear logic, good array of toxicological endpoints
  25. 25. Synthesis & AssayProcurement Development Target Discovery Structure-Based DesignInformationFlow SAR, ADME, Tox, PK
  26. 26. Workflows (i.e., seamless process integration)
  27. 27. Thats enough for now ..... Thank you!Any questions?