This document discusses applying bio-inspired computing techniques to problems in cheminformatics. It begins with introductions to cheminformatics and bio-inspired computing. Popular bio-inspired algorithms like ant colony optimization are explained. The document outlines applications of bio-inspired approaches to tasks in cheminformatics like classification, clustering, and feature selection. It concludes by noting potential applications in drug discovery and design.
This document discusses de novo drug design, which aims to design novel drug molecules from scratch computationally. It describes the basic steps of de novo design programs, including analyzing the target protein's active site, building molecules, and evaluating candidates. The key goals are to design molecules that fit the active site and form favorable interactions. Constraints like hydrogen bonding and hydrophobic regions are extracted from the target structure to guide molecule generation and scoring. The end goal is to produce new molecular scaffolds that can inspire medicinal chemistry efforts.
Domainex has contributed to three clinical candidates through its drug discovery programs for clients. It uses a variety of technologies like combinatorial domain hunting, LeadBuilder for virtual screening, and integrated medicinal and computational chemistry. Domainex has a highly experienced team of drug discovery scientists and has successfully delivered ion-channel blockers, kinase inhibitors, and anti-thrombotics into clinical trials for clients. It provides concise drug discovery services from hit identification to candidate selection through its expertise in computational chemistry, library synthesis, and medicinal chemistry.
1. Structure-based drug design relies on knowledge of the three-dimensional structure of the biological target obtained through methods such as x-ray crystallography. Candidate drugs that are predicted to bind with high affinity and selectivity to the target can be designed.
2. Structure-based drug design approaches include receptor-based drug design, which involves "building" ligands within the constraints of the binding pocket, and ligand-based drug design.
3. De novo drug design is a receptor-based approach that uses the target's 3D structure to design new molecules without existing leads. It involves building ligands that complement the active site properties through manual or automated methods.
The document describes the development and refinement of a quantitative structure-activity relationship (QSAR) model to predict the biological activity of pyranenamine compounds. It discusses 5 stages of synthesizing analogs and developing the QSAR equation based on substituents. Anomalies identified were used to refine the model terms. The final optimized QSAR equation considered parameters like hydrophilicity, hydrogen bonding, resonance effects, and steric hindrance to identify a hypothetical compound over 1000 times more active than the lead compound.
The document discusses the process of preparing a chemical database for virtual screening or compound acquisition. It begins with assembling collections from in-house and external databases. The collection is then cleaned by removing invalid structures and standardizing structure representations. Property filtering is used to focus on lead-like compounds. Known active molecules are searched for structural similarity. Alternative structures like stereoisomers are explored. Representatives are selected from clustered structures using descriptors and similarity metrics. 3D structures are generated and a final list of compounds is assembled for screening, with some random additions, completing the preparation.
The Basic of Molecular Dynamics SimulationSyed Lokman
Molecular dynamics simulation is a computational method that analyzes the physical movements of atoms and molecules over time. It works by calculating the acceleration, position, and velocity of atoms in a system using Newton's laws of motion. The forces between atoms are determined from interatomic potential functions, and initial atom velocities are assigned randomly based on temperature using the Boltzmann distribution. The simulation is iterated in small time steps to track how atom positions, velocities, and accelerations change over time. This provides insights into molecular structure, function, and interactions at the atomic scale.
Presentation delivered at Lehigh University (Bethlehem, PA) on Friday, April 26, 2019.
This presentation begins with discussing the history of the cheminformatics field. In addition, it also discusses a question "what makes cheminformatics different from bioinformatics?" (by comparing the ways in which molecules are described and compared in the two fields).
This document discusses de novo drug design, which aims to design novel drug molecules from scratch computationally. It describes the basic steps of de novo design programs, including analyzing the target protein's active site, building molecules, and evaluating candidates. The key goals are to design molecules that fit the active site and form favorable interactions. Constraints like hydrogen bonding and hydrophobic regions are extracted from the target structure to guide molecule generation and scoring. The end goal is to produce new molecular scaffolds that can inspire medicinal chemistry efforts.
Domainex has contributed to three clinical candidates through its drug discovery programs for clients. It uses a variety of technologies like combinatorial domain hunting, LeadBuilder for virtual screening, and integrated medicinal and computational chemistry. Domainex has a highly experienced team of drug discovery scientists and has successfully delivered ion-channel blockers, kinase inhibitors, and anti-thrombotics into clinical trials for clients. It provides concise drug discovery services from hit identification to candidate selection through its expertise in computational chemistry, library synthesis, and medicinal chemistry.
1. Structure-based drug design relies on knowledge of the three-dimensional structure of the biological target obtained through methods such as x-ray crystallography. Candidate drugs that are predicted to bind with high affinity and selectivity to the target can be designed.
2. Structure-based drug design approaches include receptor-based drug design, which involves "building" ligands within the constraints of the binding pocket, and ligand-based drug design.
3. De novo drug design is a receptor-based approach that uses the target's 3D structure to design new molecules without existing leads. It involves building ligands that complement the active site properties through manual or automated methods.
The document describes the development and refinement of a quantitative structure-activity relationship (QSAR) model to predict the biological activity of pyranenamine compounds. It discusses 5 stages of synthesizing analogs and developing the QSAR equation based on substituents. Anomalies identified were used to refine the model terms. The final optimized QSAR equation considered parameters like hydrophilicity, hydrogen bonding, resonance effects, and steric hindrance to identify a hypothetical compound over 1000 times more active than the lead compound.
The document discusses the process of preparing a chemical database for virtual screening or compound acquisition. It begins with assembling collections from in-house and external databases. The collection is then cleaned by removing invalid structures and standardizing structure representations. Property filtering is used to focus on lead-like compounds. Known active molecules are searched for structural similarity. Alternative structures like stereoisomers are explored. Representatives are selected from clustered structures using descriptors and similarity metrics. 3D structures are generated and a final list of compounds is assembled for screening, with some random additions, completing the preparation.
The Basic of Molecular Dynamics SimulationSyed Lokman
Molecular dynamics simulation is a computational method that analyzes the physical movements of atoms and molecules over time. It works by calculating the acceleration, position, and velocity of atoms in a system using Newton's laws of motion. The forces between atoms are determined from interatomic potential functions, and initial atom velocities are assigned randomly based on temperature using the Boltzmann distribution. The simulation is iterated in small time steps to track how atom positions, velocities, and accelerations change over time. This provides insights into molecular structure, function, and interactions at the atomic scale.
Presentation delivered at Lehigh University (Bethlehem, PA) on Friday, April 26, 2019.
This presentation begins with discussing the history of the cheminformatics field. In addition, it also discusses a question "what makes cheminformatics different from bioinformatics?" (by comparing the ways in which molecules are described and compared in the two fields).
WE THE STUDENT OF PHARMACEUTICAL CHEMISTRY FROM GURUNANAK COLLEGE OF PHARMACY HAS PRESENTED QSRR, TO MAKE READERS EASILY AVAILABLE, A COMPLETE TOPIC OF MPHARM 1ST YEAR WHICH WILL MAKE THEIR STUDY AND TO COLLECT DATA MORE EASILY AT A PLACE.
This document provides an overview of the lead development and drug discovery process. It discusses lead compounds, lead identification, criteria for leads, sources of leads from microorganisms, animals, and plants. The stages of drug discovery are described including drug development, phases of drug discovery from Phase 0 to Phase IV, and advantages of drug development such as tax incentives and enhanced patent protection.
This document provides an overview of computer aided drug design (CADD). It discusses how CADD uses computational methods like molecular docking, virtual screening, and quantitative structure-activity relationships (QSAR) to aid in the drug design process. Molecular docking uses software to predict how drug molecules bind to biological targets. QSAR analyzes relationships between chemical structure and biological activity. CADD aims to identify and optimize lead drug compounds in silico to reduce costs and save on experimental trials in drug development.
Computational Drug Discovery: Machine Learning for Making Sense of Big Data i...Chanin Nantasenamat
In this lecture, I provide an overview on how computers can be instrumental in drug discovery efforts. Topics covered includes: big data as a result of omics effort; bioinformatics; cheminformatics; biological space; chemical space; how computers particularly machine learning (and data science) can be applied in the context of drug discovery.
A video of this lecture is also provided on the "Data Professor" YouTube channel available at http://bit.ly/dataprofessor
If you are fascinated about data science, it would mean the world to me if you would consider subscribing to this channel (by clicking the link below):
http://bit.ly/dataprofessor
Molecular docking is a computational method that predicts the preferred orientation of one molecule to another when bound and forming a stable complex. It involves finding the best match between two molecules and can be used for drug design and development by predicting the binding affinity between potential drug candidates and their protein targets. Common molecular docking approaches include shape complementarity, which describes interacting molecules as complementary surfaces, and simulation methods, which simulate the actual docking process and calculate interaction energies between molecules. Popular molecular docking software includes AutoDock, FlexX, and GOLD.
The document provides an overview of protein-ligand docking, which is a computational method used in structure-based drug design to predict how small molecules bind to proteins. It discusses key components of docking software including search algorithms that generate poses of ligands in the binding site and scoring functions that calculate binding affinity scores. The document also touches on uses of docking like virtual screening and pose prediction, as well as considerations like flexible docking and handling protein conformations.
This document summarizes various virtual screening techniques used in drug discovery. It discusses ligand-based methods like similarity searching using 2D and 3D fingerprints, pharmacophore mapping. It also discusses structure-based methods like protein-ligand docking to predict binding poses and scores. Hybrid methods combining different techniques are also used. The document provides an overview of key virtual screening methods and their applications to enrich hit rates and select compounds for further testing from large libraries in an efficient manner during the drug discovery process.
This document provides an overview of the history and methods of drug discovery, including traditional and computer-aided approaches. It discusses the traditional drug discovery life cycle from hit identification through random screening and the use of natural products and synthetic chemicals. It then introduces computer-aided drug design (CADD) and describes how it can be used throughout the drug discovery process, including structure-based design, ligand-based design, and de novo design to speed up screening and enable more rational drug design. It also lists some advantages of CADD over traditional methods and examples of drugs successfully developed using these approaches.
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS santosh Kumbhar
Molecular docking is a computational method used in structure-based drug design to predict how biological macromolecules interact with other molecules. It attempts to predict the preferred orientation of one molecule to another when bound to each other to form a stable complex. Docking is useful for predicting the binding orientation of small molecule drug candidates to their protein targets in order to predict their interaction and to design effective inhibitors. There are various types of docking software available that implement different algorithms to predict the binding orientation and affinity between molecules rapidly and accurately to help identify potential lead compounds for drug development. Molecular docking has contributed to the discovery of several new drug classes and is playing an increasingly important role in modern computer-aided drug design and virtual
Molecular docking is the process of placing molecules in configurations to interact with a receptor. There are two main types - rigid docking treats the receptor and ligand as rigid, while flexible docking allows rotation of one molecule. The key stages of molecular docking are receptor and ligand preparation, docking, and evaluation to identify ligands that bind well and predict binding affinity.
In silico drug design/Molecular dockingKannan Iyanar
This document discusses rational drug design using computational methods. It begins by explaining how drugs work by binding to biological targets like proteins. It then discusses the need for new drugs to treat new diseases or improve current treatments. The document outlines several methods for screening and designing new drugs, including studying natural products, making modifications, and rational drug design based on understanding the molecular disease process. It describes using the 3D structure of protein targets and molecular docking to design ligands that selectively bind targets. The goals of drug design are to find molecules that effectively bind targets while also having suitable absorption, distribution, metabolism, excretion and toxicity properties. Computational methods can help streamline the drug discovery process.
Molecular docking by harendra ...power point presentationHarendra Bisht
Molecular docking is a computational method used to predict how a small molecule, like a drug, binds to a larger target molecule, like a protein. It works by fitting the structures of the two molecules together to find the highest affinity binding mode. The docking process involves defining the active site on the target protein, generating possible positions for the small molecule to bind, scoring the interactions between them, and identifying the best binding pose. Docking can help researchers design new drugs that effectively interact with protein targets.
Molecular descriptors are numerical values that characterize molecular properties and structures. They can represent physicochemical properties or values derived from algorithmic techniques applied to molecular structures. Descriptors vary in complexity and computational requirements. Some are based on experimental data while others are algorithmic constructs. Two-dimensional (2D) descriptors are calculated from 2D structures and include counts, physicochemical properties, and topological indices. Three-dimensional (3D) descriptors encode spatial relationships and include fragment screens and pharmacophore keys.
This document discusses structure-based and ligand-based drug design approaches. Structure-based design uses the 3D structure of biological targets to dock potential drug molecules. Ligand-based design analyzes similar molecules that bind to the target to derive pharmacophore models or quantitative structure-activity relationships (QSAR) to predict new candidates. Specific structure-based methods covered include docking tools like AutoDock and CDOCKER, and accounting for protein and complex flexibility. Ligand-based methods discussed are QSAR techniques like Comparative Molecular Field Analysis (CoMSIA) and Field Analysis (CoMFA). In conclusion, computational approaches like these are valuable for drug discovery by facilitating the identification and testing of new ligand
This document provides an overview of molecular dynamics (MD) simulation, which calculates the time-dependent behavior of biological molecules. MD simulation can provide detailed information on protein fluctuations and conformational changes. It is used to study protein stability, folding, molecular recognition and other biological processes. The document discusses how MD simulations are set up and run, including using force fields to calculate molecular interactions and numerical integration algorithms to solve equations of motion. It also covers statistical mechanics approaches for relating atomic-level simulation data to macroscopic properties.
drug like property concepts in pharmaceutical designDeepak Rohilla
The document discusses key drug-like property concepts in pharmaceutical design, including solubility, permeability, metabolism, and transporters. It outlines common issues seen in drug development candidates related to these properties. Methods to improve properties like structural modification, prodrug approaches, and formulation development are presented. The importance of considering drug-like properties early in drug discovery is emphasized to help identify potential issues and guide optimization of candidates.
Molecular docking is a method for predicting how two molecules, such as a ligand and its protein target, will interact and fit together in three dimensions. Docking has become an important tool in drug discovery for identifying potential binding conformations between drug candidates and protein targets. The key steps in a typical docking workflow involve selecting the receptor and ligand molecules, then using software to computationally predict the orientation of binding and evaluate the fit through scoring functions. Popular molecular docking software packages include AutoDock, GOLD, and Glide. Applications of docking include virtual screening in drug discovery and lead optimization.
The screening of chemical libraries with traditional methods, such as high-throughput screening (HTS), is expensive and time consuming. Quantitative structure–activity relation (QSAR) modeling is an alternative method that can assist in the selection of lead molecules by using the information from
reference active and inactive compounds. This approach requires good molecular descriptors that are representative of the molecular features responsible for the relevant molecular activity.
computer aided drug designing and molecular modellingnehla313
This document discusses computer-aided drug design (CADD) and molecular modeling. It provides a brief history of drug discovery and describes the traditional drug design lifecycle versus the modern CADD approach. The key principles of CADD include molecular mechanics, which uses classical mechanics to model molecule geometry, and quantum mechanics, which is based on solving Schrodinger's equation. The document also lists several common software tools used for tasks like molecular visualization, docking, descriptor calculation, and general molecular modeling libraries. Advantages of CADD are highlighted as reduced time, cost and manpower requirements compared to traditional drug discovery methods.
Nature-Inspired Optimization Algorithms Xin-She Yang
This document discusses nature-inspired optimization algorithms. It begins with an overview of the essence of optimization algorithms and their goal of moving to better solutions. It then discusses some issues with traditional algorithms and how nature-inspired algorithms aim to address these. Several nature-inspired algorithms are described in detail, including particle swarm optimization, firefly algorithm, cuckoo search, and bat algorithm. These are inspired by behaviors in swarms, fireflies, cuckoos, and bats respectively. Examples of applications to engineering design problems are also provided.
Metaheuristic Algorithms: A Critical AnalysisXin-She Yang
The document discusses metaheuristic algorithms and their application to optimization problems. It provides an overview of several nature-inspired algorithms including particle swarm optimization, firefly algorithm, harmony search, and cuckoo search. It describes how these algorithms were inspired by natural phenomena like swarming behavior, flashing fireflies, and bird breeding. The document also discusses applications of these algorithms to engineering design problems like pressure vessel design and gear box design optimization.
WE THE STUDENT OF PHARMACEUTICAL CHEMISTRY FROM GURUNANAK COLLEGE OF PHARMACY HAS PRESENTED QSRR, TO MAKE READERS EASILY AVAILABLE, A COMPLETE TOPIC OF MPHARM 1ST YEAR WHICH WILL MAKE THEIR STUDY AND TO COLLECT DATA MORE EASILY AT A PLACE.
This document provides an overview of the lead development and drug discovery process. It discusses lead compounds, lead identification, criteria for leads, sources of leads from microorganisms, animals, and plants. The stages of drug discovery are described including drug development, phases of drug discovery from Phase 0 to Phase IV, and advantages of drug development such as tax incentives and enhanced patent protection.
This document provides an overview of computer aided drug design (CADD). It discusses how CADD uses computational methods like molecular docking, virtual screening, and quantitative structure-activity relationships (QSAR) to aid in the drug design process. Molecular docking uses software to predict how drug molecules bind to biological targets. QSAR analyzes relationships between chemical structure and biological activity. CADD aims to identify and optimize lead drug compounds in silico to reduce costs and save on experimental trials in drug development.
Computational Drug Discovery: Machine Learning for Making Sense of Big Data i...Chanin Nantasenamat
In this lecture, I provide an overview on how computers can be instrumental in drug discovery efforts. Topics covered includes: big data as a result of omics effort; bioinformatics; cheminformatics; biological space; chemical space; how computers particularly machine learning (and data science) can be applied in the context of drug discovery.
A video of this lecture is also provided on the "Data Professor" YouTube channel available at http://bit.ly/dataprofessor
If you are fascinated about data science, it would mean the world to me if you would consider subscribing to this channel (by clicking the link below):
http://bit.ly/dataprofessor
Molecular docking is a computational method that predicts the preferred orientation of one molecule to another when bound and forming a stable complex. It involves finding the best match between two molecules and can be used for drug design and development by predicting the binding affinity between potential drug candidates and their protein targets. Common molecular docking approaches include shape complementarity, which describes interacting molecules as complementary surfaces, and simulation methods, which simulate the actual docking process and calculate interaction energies between molecules. Popular molecular docking software includes AutoDock, FlexX, and GOLD.
The document provides an overview of protein-ligand docking, which is a computational method used in structure-based drug design to predict how small molecules bind to proteins. It discusses key components of docking software including search algorithms that generate poses of ligands in the binding site and scoring functions that calculate binding affinity scores. The document also touches on uses of docking like virtual screening and pose prediction, as well as considerations like flexible docking and handling protein conformations.
This document summarizes various virtual screening techniques used in drug discovery. It discusses ligand-based methods like similarity searching using 2D and 3D fingerprints, pharmacophore mapping. It also discusses structure-based methods like protein-ligand docking to predict binding poses and scores. Hybrid methods combining different techniques are also used. The document provides an overview of key virtual screening methods and their applications to enrich hit rates and select compounds for further testing from large libraries in an efficient manner during the drug discovery process.
This document provides an overview of the history and methods of drug discovery, including traditional and computer-aided approaches. It discusses the traditional drug discovery life cycle from hit identification through random screening and the use of natural products and synthetic chemicals. It then introduces computer-aided drug design (CADD) and describes how it can be used throughout the drug discovery process, including structure-based design, ligand-based design, and de novo design to speed up screening and enable more rational drug design. It also lists some advantages of CADD over traditional methods and examples of drugs successfully developed using these approaches.
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS santosh Kumbhar
Molecular docking is a computational method used in structure-based drug design to predict how biological macromolecules interact with other molecules. It attempts to predict the preferred orientation of one molecule to another when bound to each other to form a stable complex. Docking is useful for predicting the binding orientation of small molecule drug candidates to their protein targets in order to predict their interaction and to design effective inhibitors. There are various types of docking software available that implement different algorithms to predict the binding orientation and affinity between molecules rapidly and accurately to help identify potential lead compounds for drug development. Molecular docking has contributed to the discovery of several new drug classes and is playing an increasingly important role in modern computer-aided drug design and virtual
Molecular docking is the process of placing molecules in configurations to interact with a receptor. There are two main types - rigid docking treats the receptor and ligand as rigid, while flexible docking allows rotation of one molecule. The key stages of molecular docking are receptor and ligand preparation, docking, and evaluation to identify ligands that bind well and predict binding affinity.
In silico drug design/Molecular dockingKannan Iyanar
This document discusses rational drug design using computational methods. It begins by explaining how drugs work by binding to biological targets like proteins. It then discusses the need for new drugs to treat new diseases or improve current treatments. The document outlines several methods for screening and designing new drugs, including studying natural products, making modifications, and rational drug design based on understanding the molecular disease process. It describes using the 3D structure of protein targets and molecular docking to design ligands that selectively bind targets. The goals of drug design are to find molecules that effectively bind targets while also having suitable absorption, distribution, metabolism, excretion and toxicity properties. Computational methods can help streamline the drug discovery process.
Molecular docking by harendra ...power point presentationHarendra Bisht
Molecular docking is a computational method used to predict how a small molecule, like a drug, binds to a larger target molecule, like a protein. It works by fitting the structures of the two molecules together to find the highest affinity binding mode. The docking process involves defining the active site on the target protein, generating possible positions for the small molecule to bind, scoring the interactions between them, and identifying the best binding pose. Docking can help researchers design new drugs that effectively interact with protein targets.
Molecular descriptors are numerical values that characterize molecular properties and structures. They can represent physicochemical properties or values derived from algorithmic techniques applied to molecular structures. Descriptors vary in complexity and computational requirements. Some are based on experimental data while others are algorithmic constructs. Two-dimensional (2D) descriptors are calculated from 2D structures and include counts, physicochemical properties, and topological indices. Three-dimensional (3D) descriptors encode spatial relationships and include fragment screens and pharmacophore keys.
This document discusses structure-based and ligand-based drug design approaches. Structure-based design uses the 3D structure of biological targets to dock potential drug molecules. Ligand-based design analyzes similar molecules that bind to the target to derive pharmacophore models or quantitative structure-activity relationships (QSAR) to predict new candidates. Specific structure-based methods covered include docking tools like AutoDock and CDOCKER, and accounting for protein and complex flexibility. Ligand-based methods discussed are QSAR techniques like Comparative Molecular Field Analysis (CoMSIA) and Field Analysis (CoMFA). In conclusion, computational approaches like these are valuable for drug discovery by facilitating the identification and testing of new ligand
This document provides an overview of molecular dynamics (MD) simulation, which calculates the time-dependent behavior of biological molecules. MD simulation can provide detailed information on protein fluctuations and conformational changes. It is used to study protein stability, folding, molecular recognition and other biological processes. The document discusses how MD simulations are set up and run, including using force fields to calculate molecular interactions and numerical integration algorithms to solve equations of motion. It also covers statistical mechanics approaches for relating atomic-level simulation data to macroscopic properties.
drug like property concepts in pharmaceutical designDeepak Rohilla
The document discusses key drug-like property concepts in pharmaceutical design, including solubility, permeability, metabolism, and transporters. It outlines common issues seen in drug development candidates related to these properties. Methods to improve properties like structural modification, prodrug approaches, and formulation development are presented. The importance of considering drug-like properties early in drug discovery is emphasized to help identify potential issues and guide optimization of candidates.
Molecular docking is a method for predicting how two molecules, such as a ligand and its protein target, will interact and fit together in three dimensions. Docking has become an important tool in drug discovery for identifying potential binding conformations between drug candidates and protein targets. The key steps in a typical docking workflow involve selecting the receptor and ligand molecules, then using software to computationally predict the orientation of binding and evaluate the fit through scoring functions. Popular molecular docking software packages include AutoDock, GOLD, and Glide. Applications of docking include virtual screening in drug discovery and lead optimization.
The screening of chemical libraries with traditional methods, such as high-throughput screening (HTS), is expensive and time consuming. Quantitative structure–activity relation (QSAR) modeling is an alternative method that can assist in the selection of lead molecules by using the information from
reference active and inactive compounds. This approach requires good molecular descriptors that are representative of the molecular features responsible for the relevant molecular activity.
computer aided drug designing and molecular modellingnehla313
This document discusses computer-aided drug design (CADD) and molecular modeling. It provides a brief history of drug discovery and describes the traditional drug design lifecycle versus the modern CADD approach. The key principles of CADD include molecular mechanics, which uses classical mechanics to model molecule geometry, and quantum mechanics, which is based on solving Schrodinger's equation. The document also lists several common software tools used for tasks like molecular visualization, docking, descriptor calculation, and general molecular modeling libraries. Advantages of CADD are highlighted as reduced time, cost and manpower requirements compared to traditional drug discovery methods.
Nature-Inspired Optimization Algorithms Xin-She Yang
This document discusses nature-inspired optimization algorithms. It begins with an overview of the essence of optimization algorithms and their goal of moving to better solutions. It then discusses some issues with traditional algorithms and how nature-inspired algorithms aim to address these. Several nature-inspired algorithms are described in detail, including particle swarm optimization, firefly algorithm, cuckoo search, and bat algorithm. These are inspired by behaviors in swarms, fireflies, cuckoos, and bats respectively. Examples of applications to engineering design problems are also provided.
Metaheuristic Algorithms: A Critical AnalysisXin-She Yang
The document discusses metaheuristic algorithms and their application to optimization problems. It provides an overview of several nature-inspired algorithms including particle swarm optimization, firefly algorithm, harmony search, and cuckoo search. It describes how these algorithms were inspired by natural phenomena like swarming behavior, flashing fireflies, and bird breeding. The document also discusses applications of these algorithms to engineering design problems like pressure vessel design and gear box design optimization.
Chemoinformatics—an introduction for computer scientistsunyil96
Chemoinformatics is an interdisciplinary field that combines expertise from chemistry, biology, physics, and computer science. It aims to discover novel chemical entities that can be developed into new medical treatments. The field uses computational methods and tools to analyze large collections of molecules in order to facilitate drug discovery. This involves tasks like selecting compounds for screening libraries, analyzing results from high-throughput screening to identify hit compounds, and optimizing leads into drug candidates. While the field has existed for decades, it was only recently termed "chemoinformatics" and has grown significantly with the ability to now synthesize and test huge numbers of compounds computationally.
Canonicalized systematic nomenclature in cheminformaticsJeremy Yang
This document discusses canonicalization in chemoinformatics and new canonicalization tools from OpenEye. It reviews existing canonicalization methods like the Morgan algorithm and describes how OpenEye has implemented and expanded on these methods to canonicalize molecular structures, tautomers, and pKa states. OpenEye tools like OEChem and QuacPac can generate canonical SMILES, connection tables, and representations of different chemical forms and standard file formats.
This document presents a comparative study of nature-inspired algorithms, including the Firefly Algorithm and Particle Swarm Optimization, for numerical optimization. It describes implementing these algorithms to solve benchmark problems like the Michalewicz function and the Travelling Salesman Problem. The document finds that Particle Swarm Optimization often outperforms genetic algorithms, while the Firefly Algorithm is superior to PSO in efficiency and success rate. It concludes that the Firefly Algorithm is a potentially powerful tool for solving NP-hard problems but could be improved through reducing randomness over time.
A Discrete Firefly Algorithm for the Multi-Objective Hybrid Flowshop Scheduli...Xin-She Yang
The document describes a discrete firefly algorithm proposed to solve hybrid flowshop scheduling problems with two objectives: minimizing makespan and mean flow time. Hybrid flowshop scheduling problems involve scheduling jobs through multiple stages with parallel machines in some stages, and are known to be NP-hard. The proposed discrete firefly algorithm adapts the continuous firefly algorithm to the discrete problem by using a smallest position value rule to map continuous firefly positions to discrete job permutations. Computational experiments show the proposed algorithm outperforms other metaheuristics for hybrid flowshop scheduling problems.
A.E. Eiben's presentation slides on Methodological Issues in Bio-inspired Computing or How to Get a PhD in …? From Bionetics 2011. Sponsored by the Awareness Initiative.
Applications and Analysis of Bio-Inspired Eagle Strategy for Engineering Opti...Xin-She Yang
This document discusses applying an eagle strategy inspired by nature to engineering optimization problems. The eagle strategy uses a two-stage approach combining global exploration with local exploitation. Global exploration uses Lèvy flights for random walks to diversify solutions. Promising solutions are then locally optimized using an efficient local search algorithm like particle swarm optimization. The document analyzes random walk models like Lèvy flights and how they can maintain diversity in swarm intelligence algorithms. It applies the eagle strategy to four engineering design problems, finding Lèvy flights can effectively reduce computational efforts.
Bio-Inspired Computation: Success and Challenges of IJBICXin-She Yang
This document summarizes the success of the International Journal of Bio-Inspired Computation (IJBIC) over its first 5 years of publication from 2009 to 2014. It discusses how IJBIC has become a leading journal in the field by publishing high-quality research on new algorithms, improvements to existing algorithms, and applications. Some of the key areas of research published in IJBIC include particle swarm optimization, ant colony optimization, firefly algorithm, bat algorithm, and cuckoo search. While applications are diverse, the document highlights examples in engineering, data mining, and network optimization. It concludes by discussing ongoing challenges like developing more theoretical analysis, solving larger-scale problems, and achieving true intelligence in algorithms.
This document discusses chemoinformatics and its role in distance education. It defines chemoinformatics as the application of informatics methods to solve chemical problems, involving the design, creation, organization, management, analysis and use of chemical information. The document then outlines proposed courses for a distance learning program in chemoinformatics, including introductory courses covering fundamental concepts as well as more advanced topics involving programming, databases, and molecular modeling. It concludes by discussing how chemoinformatics can help improve distance education programs in chemistry fields.
Cuckoo search is an optimization algorithm inspired by cuckoos that lay eggs in other birds' nests. It was developed in 2009 by Xin-she Yang and Suash Deb. In cuckoo search, each cuckoo lays one egg at a time in a randomly chosen nest, and the best nests with high quality eggs are carried over to the next generation. A fraction of worse nests are abandoned and replaced with new nests. The algorithm finds the best solutions through iterations until a stop criterion is reached.
Chemoinformatics and information managementDuncan Hull
Chemoinformatics involves the management and analysis of chemical structure data to help accelerate the drug discovery process. It uses computer representations of molecules and applies techniques like database searching, fingerprinting, and molecular modeling to efficiently screen large numbers of chemical structures. This helps identify potential drug leads and reject non-drug candidates more quickly compared to traditional sequential drug screening. Key applications of chemoinformatics include structure and substructure searching of databases, molecular similarity analysis and virtual screening to predict molecular properties and activity.
Cuckoo search is an optimization algorithm inspired by cuckoos that lay eggs in other birds' nests. It works by representing each potential solution as an "egg" in a nest, with the aim of replacing poor solutions with new, potentially better ones. There are three main rules: each cuckoo lays one egg at a time in a randomly chosen nest; the best nests carrying high-quality eggs carry over to the next generation; and some host birds can detect alien eggs and abandon the nest, requiring the cuckoo to lay again in a new nest. The algorithm uses random walks to explore the search space and find optimal solutions. It is simple to implement compared to other metaheuristic algorithms and has been successfully applied
The cuckoo search algorithm is a recently developed meta-heuristic optimization algorithm, which is suitable for solving optimization problems. Cuckoo search is a nature-inspired metaheuristic algorithm, based on the brood parasitism of some cuckoo species, along with Levy flights random walks
This document provides an overview of a course on bio-inspired robotics. The course will cover topics like bio-inspired sensors, actuators, locomotion, and controllers. It will focus on applying insights from biology to engineering problems and robotics. Students will complete a project building a 3D model of an animal in a simulator and developing locomotion algorithms inspired by the animal. The grade will be based 50% on a final exam and 50% on the project. Required textbooks and a list of lectures on various bio-inspired topics are also included.
The document summarizes two nature-inspired metaheuristic algorithms: the Cuckoo Search algorithm and the Firefly algorithm.
The Cuckoo Search algorithm is based on the brood parasitism of some cuckoo species. It lays its eggs in the nests of other host birds. The algorithm uses Lévy flights for generating new solutions and considers the best solutions for the next generation.
The Firefly algorithm is based on the flashing patterns of fireflies to attract mates. It considers attractiveness that decreases with distance and movement of fireflies towards more attractive ones. The pseudo codes of both algorithms are provided along with some example applications.
This document discusses various bio-inspired algorithms including evolutionary algorithms, swarm algorithms, immune algorithms, cultural algorithms, neural algorithms, and provides examples of their applications. It summarizes genetic algorithms and differential evolution algorithms. It also lists some popular libraries for implementing these algorithms in Python and R and provides examples.
Cheminformatics is the application of computer science to solve chemical problems. It involves acquiring chemical data through experiments or simulations, managing the information in databases, and analyzing the data. Key aspects of cheminformatics include computer-assisted synthesis design, representing chemical structures digitally, and using mathematical models to analyze chemical data. Cheminformatics plays an important role in drug discovery by aiding processes like target identification, lead discovery, and molecular modeling.
Introduction
What is cheminformatics?
Why do we have to use informatics methods in chemistry?
Is it cheminformatics or chemoinformatics?
Emergence of cheminformations
Three major aspects of cheminformatics
Basics of cheminformatics
Topological representations
Tools used for cheminformatics
Application of cheminformatics
Role of cheminformatics in morden drug discovery
Conclusion
Bibliography
Computer Aided Drug Design uses computational methods to help streamline the drug discovery process. Key steps include identifying drug targets, generating molecular structures, evaluating interactions between potential drug compounds and targets through docking simulations, and developing quantitative structure-activity relationship (QSAR) models to predict compound activity and guide synthetic efforts. The overall goal is to reduce the cost and time needed to develop new pharmaceutical agents in a rational, mechanism-based manner.
Quantum Mechanics in Molecular modelingAkshay Kank
This slides gives you the information related to computer aided drug design and its application in drug discovery. Also you learn the Quantum mechanics related to the molecular mechanics. Theory related to molecular modeling and how the molecular modeling helps in drug discovery.
This document discusses nature-inspired optimization techniques and their applications. It provides an overview of problems in real-world optimization that involve multiple conflicting objectives. Nature provides inspiration for algorithms that can solve complex problems with simple rules, as seen in animals. Examples of nature-inspired algorithms discussed include firefly algorithm, particle swarm optimization, ant colony optimization, cuckoo search, and others. These algorithms have applications in fields like engineering, cheminformatics, bioinformatics, and more.
This document provides an overview of structural biology. It discusses the importance of determining the 3D structures of biological molecules like proteins to understand their functions. Key techniques in structural biology include X-ray crystallography, NMR spectroscopy, and cryo-electron microscopy. Recent advancements like improved cryo-EM resolution and hybrid methods integrating multiple techniques have provided more detailed structural insights. The integration of structural biology with other omics fields is also advancing our understanding of biological systems.
PREDICTION OF ANTIMICROBIAL PEPTIDES USING MACHINE LEARNING METHODSBilal Nizami
Increasing resistance toward the conventional antibiotics has become a global concern. Antimicrobial peptides (AMPs) are potential alternatives for conventional antibiotics. Due to cost related reasons in designing and synthesis of AMPs. Machine learning based prediction tools are indispensable.
Algorithmic approach to computational biology using graphsS P Sajjan
Computational biology uses algorithms and graph theory to model and analyze bio-molecular networks. It aims to extract knowledge from large datasets to identify drug targets and gene/protein functions. Challenges include modeling interactions between the millions of genes and proteins as well as overcoming computational limitations to simulate whole cellular systems. Graph theory techniques are applied to model networks, measure node centrality, and mine pathways from molecular data.
Numerical taxonomy is a system of grouping species using numerical methods based on their character states. It involves assigning numerical values to characteristics, estimating resemblance between organisms, and performing cluster analysis to group similar organisms. The basic components of numerical taxonomy are operational taxonomic units, unit characters, and estimating resemblance between units. Hierarchical clustering is used to group units into a tree-like structure of clusters. While numerical taxonomy provides a standardized, quantitative approach, character selection can impact results if not done adequately.
The document describes genetic algorithms and their implementation. It begins by defining genetic algorithms as search techniques inspired by biological evolution that maintain a population of candidate solutions. It then provides an overview of genetic algorithms and their typical application to discrete optimization problems. The document proceeds to describe the main components of implementing a genetic algorithm - encoding, selection, crossover, and mutation. It explains each component in detail and provides examples. Finally, it outlines the specific implementation steps and components like population initialization, fitness function, selection, crossover, and mutation used in the scheduling problem the genetic algorithm is being applied to.
1 -val_gillet_-_ligand-based_and_structure-based_virtual_screeningDeependra Ban
The document discusses ligand-based and structure-based virtual screening techniques. It begins with an introduction to the speaker's background and expertise in chemoinformatics. It then provides an overview of the drug discovery challenge and how virtual screening can help address it. The remainder of the document focuses on explaining different virtual screening methods, including ligand-based approaches like similarity searching, pharmacophore mapping, and machine learning methods, as well as structure-based protein-ligand docking. Limitations of methods are discussed along with ways to improve performance, such as using multiple active structures, data fusion techniques, and accounting for conformational flexibility.
This document discusses descriptive versus mechanistic modeling approaches in drug discovery. It provides examples of descriptive modeling, which aims to describe data patterns without understanding the underlying mechanisms, and mechanistic modeling, which works with domain experts to translate scientific knowledge into mathematical representations of the data-generating processes. The document presents tumor growth curve analysis as an example where mechanistic models like Richards and Gompertz curves can incorporate understandings of competing catabolic and anabolic processes to better capture the fundamental characteristics of growth.
P Systems Model Optimisation by Means of Evolutionary Based Search ...Natalio Krasnogor
This document discusses using evolutionary algorithms to optimize parameters in P systems, which are computational models of biological cells. Four test cases of increasing difficulty are used to compare different algorithms. The results show that genetic algorithms, differential evolution, and opposition-based differential evolution perform better for problems with fewer parameters, while variable neighbourhood search algorithms perform better for the largest problem with 38 parameters. This is because the evolutionary algorithms are less efficient at optimizing large populations within the limited evaluation budget, whereas variable neighbourhood search focuses on a single solution.
Cheminformatics combines chemistry, computer science, and information science to study large amounts of chemical information, mostly with computer assistance. It encompasses the design, creation, organization, storage, retrieval, analysis, and use of chemical data. Cheminformatics has various applications including drug discovery. It uses tools like databases, machine learning, molecular properties predictions, and information analysis to help identify new drug leads. Future trends include increased data integration, computer-assisted synthesis design, and expanded use of cheminformatics methods in theoretical chemistry and protein studies. Cheminformatics plays an important role in modern drug development.
This document discusses cheminformatics and its applications. Cheminformatics combines chemistry and computer science to store and analyze chemical data for applications like drug discovery. It encompasses designing, organizing, analyzing and visualizing chemical information. Key topics covered include molecular representations, chemical databases, similarity searching, machine learning methods, and tools for molecular docking and drug discovery.
Chemoinformatics has developed over the past 50+ years to help analyze and utilize the large amount of chemical and biological data generated in drug discovery. Early work in the 1960s involved representing molecules as graphs to enable substructure searching of chemical databases. Key developments included the establishment of chemical literature databases and journals, as well as early computer systems for searching chemical structures and reactions. Methods from fields like artificial intelligence were also applied to problems like structure elucidation. Overall, chemoinformatics aims to transform chemical data into knowledge to help make better, faster decisions in drug lead identification and optimization.
Introduction to Systemics with focus on Systems BiologyMrinal Vashisth
The core content discusses the terminology used in Systems Sciences, the systems thinking/approach or Systemics. Focus is kept on Systems Biology for the most part of the presentations where it is compared with other disciplines and examples of Systems Biology approach and challenges of systems science are also discussed.
The sad thing about uploading this to Slide Share is that animations don't work.
Cheminformatics, concept by kk sahu sirKAUSHAL SAHU
INTRODUCTION
THE NEED FOR CHEMOINFORMATICS
CHEMOINFORMATICS AND DRUG DISCOVERY
HISTORICAL EVOLUTION
BASIC CONCEPTS
Chemistry Space
Molecular Descriptors
High-Throughput Screening
The Similar-Structure, Similar-Property Principle
Graph theory and Chemoinformatics
CHEMOINFORMATICS TASKS
MOLECULAR REPRESENTATIONS
Topological Representations
Geometrical Representations
TYPES OF MOLECULAR DESCRIPTORS
IN SILICO DE NOVO MOLECULAR DESIGN
FREE CHEMISTRY DATABASE
FUTURE
CONCLUSION
REFERENCE
The document discusses statistical modeling in the pharmaceutical industry. It notes that statistical modeling is one approach used to address challenges related to cost, time, and quality in drug discovery and development. The goals of statistical models include understanding the mechanisms by which data is generated and extracting information from data. Statistical modeling aims to translate known properties and hypotheses into mathematical representations. This allows for simplified descriptions of experiments and mechanisms. The identification of appropriate models requires thorough investigation. Descriptive models aim to describe data structures while mechanistic models seek to understand underlying phenomena. Statistical modeling has advantages of reducing costs and time while improving success rates.
Similar to Bio inspiring computing and its application in cheminformatics (20)
Essentials of Automations: The Art of Triggers and Actions in FMESafe Software
In this second installment of our Essentials of Automations webinar series, we’ll explore the landscape of triggers and actions, guiding you through the nuances of authoring and adapting workspaces for seamless automations. Gain an understanding of the full spectrum of triggers and actions available in FME, empowering you to enhance your workspaces for efficient automation.
We’ll kick things off by showcasing the most commonly used event-based triggers, introducing you to various automation workflows like manual triggers, schedules, directory watchers, and more. Plus, see how these elements play out in real scenarios.
Whether you’re tweaking your current setup or building from the ground up, this session will arm you with the tools and insights needed to transform your FME usage into a powerhouse of productivity. Join us to discover effective strategies that simplify complex processes, enhancing your productivity and transforming your data management practices with FME. Let’s turn complexity into clarity and make your workspaces work wonders!
Removing Uninteresting Bytes in Software FuzzingAftab Hussain
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux libraries -- Libxml's xmllint, a tool for parsing xml documents, and Binutil's readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format). Our preliminary results show that AFL+DIAR does not only discover new paths more quickly but also achieves higher coverage overall. This work thus showcases how starting with lean and optimized seeds can lead to faster, more comprehensive fuzzing campaigns -- and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
Driving Business Innovation: Latest Generative AI Advancements & Success StorySafe Software
Are you ready to revolutionize how you handle data? Join us for a webinar where we’ll bring you up to speed with the latest advancements in Generative AI technology and discover how leveraging FME with tools from giants like Google Gemini, Amazon, and Microsoft OpenAI can supercharge your workflow efficiency.
During the hour, we’ll take you through:
Guest Speaker Segment with Hannah Barrington: Dive into the world of dynamic real estate marketing with Hannah, the Marketing Manager at Workspace Group. Hear firsthand how their team generates engaging descriptions for thousands of office units by integrating diverse data sources—from PDF floorplans to web pages—using FME transformers, like OpenAIVisionConnector and AnthropicVisionConnector. This use case will show you how GenAI can streamline content creation for marketing across the board.
Ollama Use Case: Learn how Scenario Specialist Dmitri Bagh has utilized Ollama within FME to input data, create custom models, and enhance security protocols. This segment will include demos to illustrate the full capabilities of FME in AI-driven processes.
Custom AI Models: Discover how to leverage FME to build personalized AI models using your data. Whether it’s populating a model with local data for added security or integrating public AI tools, find out how FME facilitates a versatile and secure approach to AI.
We’ll wrap up with a live Q&A session where you can engage with our experts on your specific use cases, and learn more about optimizing your data workflows with AI.
This webinar is ideal for professionals seeking to harness the power of AI within their data management systems while ensuring high levels of customization and security. Whether you're a novice or an expert, gain actionable insights and strategies to elevate your data processes. Join us to see how FME and AI can revolutionize how you work with data!
Fueling AI with Great Data with Airbyte WebinarZilliz
This talk will focus on how to collect data from a variety of sources, leveraging this data for RAG and other GenAI use cases, and finally charting your course to productionalization.
Threats to mobile devices are more prevalent and increasing in scope and complexity. Users of mobile devices desire to take full advantage of the features
available on those devices, but many of the features provide convenience and capability but sacrifice security. This best practices guide outlines steps the users can take to better protect personal devices and information.
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc
How does your privacy program stack up against your peers? What challenges are privacy teams tackling and prioritizing in 2024?
In the fifth annual Global Privacy Benchmarks Survey, we asked over 1,800 global privacy professionals and business executives to share their perspectives on the current state of privacy inside and outside of their organizations. This year’s report focused on emerging areas of importance for privacy and compliance professionals, including considerations and implications of Artificial Intelligence (AI) technologies, building brand trust, and different approaches for achieving higher privacy competence scores.
See how organizational priorities and strategic approaches to data security and privacy are evolving around the globe.
This webinar will review:
- The top 10 privacy insights from the fifth annual Global Privacy Benchmarks Survey
- The top challenges for privacy leaders, practitioners, and organizations in 2024
- Key themes to consider in developing and maintaining your privacy program
GraphRAG for Life Science to increase LLM accuracyTomaz Bratanic
GraphRAG for life science domain, where you retriever information from biomedical knowledge graphs using LLMs to increase the accuracy and performance of generated answers
Climate Impact of Software Testing at Nordic Testing DaysKari Kakkonen
My slides at Nordic Testing Days 6.6.2024
Climate impact / sustainability of software testing discussed on the talk. ICT and testing must carry their part of global responsibility to help with the climat warming. We can minimize the carbon footprint but we can also have a carbon handprint, a positive impact on the climate. Quality characteristics can be added with sustainability, and then measured continuously. Test environments can be used less, and in smaller scale and on demand. Test techniques can be used in optimizing or minimizing number of tests. Test automation can be used to speed up testing.
Your One-Stop Shop for Python Success: Top 10 US Python Development Providersakankshawande
Simplify your search for a reliable Python development partner! This list presents the top 10 trusted US providers offering comprehensive Python development services, ensuring your project's success from conception to completion.
For the full video of this presentation, please visit: https://www.edge-ai-vision.com/2024/06/building-and-scaling-ai-applications-with-the-nx-ai-manager-a-presentation-from-network-optix/
Robin van Emden, Senior Director of Data Science at Network Optix, presents the “Building and Scaling AI Applications with the Nx AI Manager,” tutorial at the May 2024 Embedded Vision Summit.
In this presentation, van Emden covers the basics of scaling edge AI solutions using the Nx tool kit. He emphasizes the process of developing AI models and deploying them globally. He also showcases the conversion of AI models and the creation of effective edge AI pipelines, with a focus on pre-processing, model conversion, selecting the appropriate inference engine for the target hardware and post-processing.
van Emden shows how Nx can simplify the developer’s life and facilitate a rapid transition from concept to production-ready applications.He provides valuable insights into developing scalable and efficient edge AI solutions, with a strong focus on practical implementation.
How to Get CNIC Information System with Paksim Ga.pptxdanishmna97
Pakdata Cf is a groundbreaking system designed to streamline and facilitate access to CNIC information. This innovative platform leverages advanced technology to provide users with efficient and secure access to their CNIC details.
“An Outlook of the Ongoing and Future Relationship between Blockchain Technologies and Process-aware Information Systems.” Invited talk at the joint workshop on Blockchain for Information Systems (BC4IS) and Blockchain for Trusted Data Sharing (B4TDS), co-located with with the 36th International Conference on Advanced Information Systems Engineering (CAiSE), 3 June 2024, Limassol, Cyprus.
Infrastructure Challenges in Scaling RAG with Custom AI modelsZilliz
Building Retrieval-Augmented Generation (RAG) systems with open-source and custom AI models is a complex task. This talk explores the challenges in productionizing RAG systems, including retrieval performance, response synthesis, and evaluation. We’ll discuss how to leverage open-source models like text embeddings, language models, and custom fine-tuned models to enhance RAG performance. Additionally, we’ll cover how BentoML can help orchestrate and scale these AI components efficiently, ensuring seamless deployment and management of RAG systems in the cloud.
Full-RAG: A modern architecture for hyper-personalizationZilliz
Mike Del Balso, CEO & Co-Founder at Tecton, presents "Full RAG," a novel approach to AI recommendation systems, aiming to push beyond the limitations of traditional models through a deep integration of contextual insights and real-time data, leveraging the Retrieval-Augmented Generation architecture. This talk will outline Full RAG's potential to significantly enhance personalization, address engineering challenges such as data management and model training, and introduce data enrichment with reranking as a key solution. Attendees will gain crucial insights into the importance of hyperpersonalization in AI, the capabilities of Full RAG for advanced personalization, and strategies for managing complex data integrations for deploying cutting-edge AI solutions.
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slackshyamraj55
Discover the seamless integration of RPA (Robotic Process Automation), COMPOSER, and APM with AWS IDP enhanced with Slack notifications. Explore how these technologies converge to streamline workflows, optimize performance, and ensure secure access, all while leveraging the power of AWS IDP and real-time communication via Slack notifications.
Things to Consider When Choosing a Website Developer for your Website | FODUUFODUU
Choosing the right website developer is crucial for your business. This article covers essential factors to consider, including experience, portfolio, technical skills, communication, pricing, reputation & reviews, cost and budget considerations and post-launch support. Make an informed decision to ensure your website meets your business goals.
Things to Consider When Choosing a Website Developer for your Website | FODUU
Bio inspiring computing and its application in cheminformatics
1. Bio-inspired Computing and its
Application in Cheminformatics
B Y
A B D E L A Z I M G A L A L H U S S I E N
D E M O N S T R A T O R A T F A C U L T Y O F S C I E N C E , F A Y O U M U N I V E R S I T Y
Professor Mohamed Amin
Faculty of Science
Menofiya University
Professor Aboul Ella Hassanien
Faculty of Computers and Information
Cairo University
Supervisoion by
2. Agenda
Cheminformatics
• Introduction.
• Representation.
• Molecular descriptors.
Bio-Inspiring
• Problems
• Algorithms
• Ant Colony Optimization
Bioinspiring and
Cheminformatics
• Classification
• Clustering
• Feature Selection
Application
• Drug Discovery
• Drug Design
3. Cheminformatics
Chemoinformatics is concerned with the
application of computational methods to
tackle chemical problems, with particular
emphasis on the manipulation of chemical
structural information.
The term was introduced in the late 1990s.
there is not even any universal agreement on
the correct spelling:
Cheminformatics.
chemical informatics.
Chemiinformatics.
Chemoinformatics.
4. Cheminformatics
• Cheminformatics is the use of computer and
informational techniques applied to a range of
problems in the field of Chemistry.
• Cheminformatics strategies are useful in drug
discovery and other efforts where large numbers
of compounds are being evaluated for specific
properties.
• Cheminformatics is also known as
multidisciplinary science as it combines
Chemistry, Biology, Mathematics, Biochemistry,
Statistics and informatics.
5. Problems in Cheminformatics
• Storing data generated through experiments or from molecular
simulation Retrieval of chemical
• Structures from chemical database (Software libraries).
• Prediction of physical, chemical and biological properties of chemical
compounds.
• Elucidation of the structure of a compound based on spectroscopic
data.
• Structure, Substructure, Similarity and diversity searching from
chemical database.
• Docking - Interaction between two macromolecules.
• Drug Discovery
• Molecular Science, Materials Science, Food Science (nutraceuticals),
Atmospheric chemistry, Polymer chemistry, Textile Industry,
Combinatorial organic synthesis (COS).
7. Representation of Chemical Structures
•Chemical structures are usually stored
in a computer as molecular graphs.
Graph theory is a well-established area
of mathematics that has found
application not just in chemistry but in
many other areas, such as computer
science.
nodes = atoms
edges = bonds
The nodes and edges may have
properties associated with them.
SMILES
Connection Table
8. Connection Table
The simplest type of connection table consists of two sections:
A) List of the atomic numbers of the atoms in the molecule
B) List of the bonds, specified as pairs of bonded atoms.
hydrogen atoms may be implied in which case the connection
table is hydrogen suppressed.
9. SMILES
• SMILES stands for Simplified Molecular
Input Line Entry Specification.
• In SMILES, atoms are represented by
their atomic symbol.
• Upper case symbols are used for
aliphatic atoms and lower case for
aromatic atoms.
• Double bonds are written using “=”
and triple bonds using “#”
10. Morgan algorithm
• There may be many different ways to construct the connection table or
the SMILES string for a given molecule.
• each atom is assigned a connectivity value equal to the number of
connected atoms. In the second and subsequent iterations a new
connectivity value is calculated.
11. Screening Methods
• Molecule screens are often
implemented using binary string
representations of the molecules
and the query substructure called
bitstrings. Bitstrings consist of a
sequence of “0”s and “1s”. They
are the “natural currency” of
computers and so can be
compared and manipulated very
rapidly, especially if held in the
computer’s memory. A “1” in a
bitstring usually indicates the
presence of a particular structural
feature and a “0” its absence.
12. Structure Searching
• Graph theoretic methods can be used to perform substructure searching,
which is equivalent to determining whether one graph is entirely
contained within another, a problem known as subgraph isomorphism.
13. Molecular Descriptors
• The manipulation and analysis of chemical structural information is made possible through
the use of molecular descriptors.
• These are numerical values that characterise properties of molecules.
• The molecular descriptor is the final result of a logic and mathematical procedure which
transforms chemical information encoded within a symbolic representation of a molecule
into an useful number or the result of some standardized experiment.
• Examples:
• The descriptors fall into Four classes .
Topological.
Geometrical.
Electronic .
Hybrid or 3D Descriptors.
15. Computational Models
• Most molecular discoveries today are the result of an iterative, three-
phase cycle of design, synthesis and test. Analysis of the results from one
iteration provides information and knowledge that enables the next cycle
to be initiated and further improvements to be achieved.
• A common feature of this analysis stage is the construction of some form
of model which enables the observed activity or properties to be related
to the molecular structure.
• Examples:
Quantitative Structure-Activity Relationships (QSARs)
Quantitative Structure–Property Relationships (QSPRs)
16. Quantitative Structure-Activity Relationships
(QSARs)
QSAR is a mathematical relationship between a biological activity of a
molecular system and its geometric and chemical characteristics.
A general formula for a quantitative structure-activity relationship
(QSAR) can be given by the following:
activity = f (molecular or fragmental properties)
QSAR attempts to find consistent relationship between biological activity
and molecular properties, so that these “rules” can be used to evaluate
the activity of new compounds.
19. Agenda
Cheminformatics
• Introduction.
• Representation.
• Molecular descriptors.
Bio-Inspiring
• Problems
• Algorithms
• Ant Colony Optimization
Bioinspiring and
Cheminformatics
• Classification
• Clustering
• Feature Selection
Application
• Drug Discovery
• Drug Design
Thesis statement
• what’s I aim to achieve
20. Bio-Inspired Computing
Finding the best solution
increasingly becomes very difficult
to identify, if not impossible, due to
the very large and dynamic scope of
solutions and complexity of
computations. Often, the optimal
solution for such a NP hard problem
is a point in the n-dimensional
hyperspace and identifying the
solution is computationally very
expensive or even not feasible in
limited time.
21. Bio-Inspired Computing
21
• The computing inspired from biology is a field of study
based on the social behavior of animals, insects and other
living organisms, including also connectionism and
emergence.
• Bio-inspired computing uses computers to model nature and
the study of nature to improve the usage of computers.
Biological
computation
Artificial
Intelligence
Bio-inspired
computing
23. Motivation
Dealing too complex problems
Incapable to solve by human proposed solution
Absence of complete mathematical model
Existing of similar problem in nature
Adaptation
Self-organization
Communication
Optimization
24. Bio-inspired computing Methods:
24
Some areas of bio-inspired computing are:
• neural networks
• genetic algorithm
• particle swarm
• ant colony optimization
• artificial bee colony
• bacterial foraging
• cuckoo search
• Firefly
• leaping frog
• bat algorithm
• flower pollination
• artificial plant optimization
25. Swarm Intelligence
• The SI-based algorithms belong to a wider class of the algorithms, called
the bio-inspired algorithms.
• we can observe that SI-based ⊂ bio-inspired ⊂ nature-inspired.
26. Swarm Intelligence
• Population of simple agents
• Decentralized
• Self-Organized
• No or local communication
• Example
Ant/Bee colonies
Bird flocking
Fish schooling
27. Ant Colony Optimization
• mimic the foraging behavior of
social ants.
• Ants primarily use pheromone as
a chemical messenger.
• pheromone concentration can be
considered as the indicator of
quality solutions to a problem of
interest.
• The movement of an ant is
controlled by pheromone, which
will evaporate over time.
• the probability of ants at a
particular node i to choose the
route from node i to node j is
given by
28. Agenda
Cheminformatics
• Introduction.
• Representation.
• Molecular descriptors.
BioInspiring
• Cheminformatics
• Molecular Descriptors
• Similarity
Bioinspiring and
Cheminformatics
• Classification
• Clustering
• Feature Selection
Application
• Drug Discovery
• Drug Design
Thesis statement
• what’s I aim to achieve
29. Bio-Inspiring in Cheminformatics
Bio-Inspiring has many application in the field of Cheminformatics:
Classification: is a general process related to categorization, the process
in which molecules are differentiated and understood.
Clustering: is the task of grouping a set of objects in such a way that
objects in such a way that objects in the same group (called a cluster)
are more similar to each other than those in other groups (clusters).
Feature Selection: is a process that chooses an optimal subset of
features according to a certain criterion.
30. Classification
• In machine learning and statistics, classification is the problem of
identifying to which of a set of categories (sub-populations) a
new observation belongs, on the basis of a training set of data containing
observations (or instances) whose category membership is known.
31. Clustering
• Clustering is the process of partitioning
a usually large dataset into groups (or
clusters), according to a similarity (or
dissimilarity) measure.
• If we assume that we have a dataset X,
defined as X = x1, x2, x3, . . ., which
consists of all the data that we want to
place into clusters, then we define a
clustering of X in m clusters C1, ..., Cm,
in such a way that the following
conditions apply:
32. Feature Selection
• Why we need FS?
To improve performance (in terms of speed, predictive power,
simplicity of the model).
to visualize the data for model selection.
To reduce dimensionality and remove noise.
• Prespectives:
– searching for the best subset of features.
– criteria for evaluating different subsets.
– principle for selection, adding, removing or changing new features
during the search.
33. Agenda
Cheminformatics
• Introduction.
• Representation.
• Molecular descriptors.
BioInspiring
• Cheminformatics
• Molecular Descriptors
• Similarity
Bioinspiring and
Cheminformatics
• Classification
• Clustering
• Feature Selection
Application
• Drug Discovery
• Drug Design
Thesis statement
• what’s I aim to achieve
34. Application (I) Drug Design
• Drug design, often referred
to as rational drug design or
simply rational design, is the
inventive process of finding
new medications based on
the knowledge of a biological
target.
• The drug is most commonly
an organic small molecule
that activates or inhibits the
function of a biomolecule
such as a protein.
37. Literature Review
• Joerg Kurt Wegner, Aaron Sterling, Rajarshi Guha, Andreas Bender in their
survey “ Cheminformatics ” introduce a comprehensive introduction to the
field of cheminformatics and Roberto Todeschini and Viviana Consonni in
their book molecular descriptors combine a huge number of descriptors. All
new descriptors, QSAR approaches and chemometric strategies proposed
since 2000 have been included in this handbook.
• Aboul Ella Hassnien and Eid Elamry introduce “Swarm Intelligence Methods
and Concepts ”.
38. Literature Review
Gerald M. Maggiora and Veerabahu Shanmugasundaram in the
“Molecular Similarity Measures ” introduce a survey on getting
similarity between 2 graph and they try to solve Maximum subgraph
matching.
Arpan Kumar Kar introduce a bio-inspired review .
39. Thesis Statement
Title:
Bio-Inspiring Computing and its Application in Cheminformatics
Aim:
Try to cluster Molecular using spectral clustering.
Try to find similarity between molecules.
40. References
1. Andrew R. Leach and Valerie J. Gillet, “An Introduction to Chemoinformatics” Springer 2007.
2. Roberto Todeschini and Viviana Consonni ,“Molecular Descriptors for Cheminformatics” ,WILEY-VCH
May,2009.
3. Christina Chrysouli, Anastasios Tefa, “Spectral clustering and semi-supervised learning using evolvingsimilarity
graphs”, Applied Soft Computing,
4. U. Luxburg, A tutorial on spectral clustering, Stat. Comput. 17 (4) (2007)395–416
5. R. Dutt , A. K. Madan , “Predicting biological activity: Computational approach using novel distance based
molecular descriptors”, Computers in Biology and Medicine,2012.
6. Yang, X.S., Cui, Z.,Xias, R., Gandomi, A.H. and Karamanoglu, M. eds., 2013. Swarm intelligence and bio-inspired
computation: theory and applications. Newnes
7. Kar, Arpan Kumar. "Bio inspired computing–A review of algorithms and scope of applications." Expert Systems
with Applications 59 (2016): 20-32.
8. Emmert-Streib, Frank, Matthias Dehmer and Yongtang Shi. “Fifty years of graph matching, network alignment and
network comparison.” Inf. Sci. 346-347 (2016): 180-197.
9. Oduguwa, Abiola, Ashutosh Tiwari, Rajkumar Roy, and Conrad Bessant. "An overview of soft computing
techniques used in the drug discovery process." In Applied Soft Computing Technologies: The Challenge of
Complexity, pp. 465-480. Springer Berlin Heidelberg, 2006.
10. Maggiora, G.M. and Shanmugasundaram, V., 2004. Molecular similarity measures. Chemoinformatics: Concepts,
Methods, and Tools for Drug Discovery, pp.1-50.
A dedicated website or other application which enables users to communicate with each other by posting information, comments, messages, images, etc.
A social network is a social structure made of nodes (individuals or organizations) and edges that connect nodes in various relationships like friendship, kinship, etc.
Note that hydrogen atoms are often omitted.
The nodes and edges may have properties associated with them. For example, the atomic number or atom type may be associated with each node and the bond order with each edge.
A graph represents the topology of a molecule only, that is, the way the nodes (or atoms) are connected.
It can be represented by a graph or adjacency matrix
It can be represented by a graph or adjacency matrix
Perhaps the simplest descriptors are based on simple counts of features such as hydrogen bond donors, hydrogen bond acceptors, ring systems (including aromatic rings), rotatable bonds and molecular weight. Many of these features can be defined as substructures or molecular fragments and so their frequency of occurrence can be readily calculated from a 2D connection table using the techniques developed for substructure search.
Many different molecular descriptors have been described and used for a wide variety of purposes. They vary in the complexity of the information they encode and in the time required to calculate them. In general, the computational requirements increase with the level of discrimination that is achieved. For example, the molecular weight does not convey much about a molecule’s properties but it is very rapid to compute. By contrast, descriptors that are based on quantum mechanics may provide accurate representations of properties, but they are much more time consuming to compute. Some descriptors have an experimental counterpart (e.g. the octanol–water partition coefficient)
CD in Mono-Dimensional SN
CD in Multi-Dimensional SN
Multi-Dimensional Networks
network has multiple types of interactions between actors of the same type. Each dimension represents one type of activity between users. p-dimensional network is represented as
Multidimensional network can represent multiple types of interactions (activities) between one type of entities. Each type of interaction can be represented by one dimension
E.g. at YouTube, two users can be connected through friendship connection, email communications, subscription/Fans, chatter in comments, etc.
Communities are also called groups cohesive subgroups modules cluster in different context
CD in Mono-Dimensional SN
CD in Multi-Dimensional SN
CD in Mono-Dimensional SN
CD in Multi-Dimensional SN
How we might extend methods presented so far to handle this heterogeneity.
How to use Map Reduce capabilities in order to improve efficiency and increase scalability of community detection algorithms.