The document describes an upcoming workshop on chemical artificial intelligence (ChemAI) taking place on November 17, 2023. It includes an agenda with topics such as learning the biochemical language with AI, natural language processing for biomolecules, and applications of chemical language modeling including de novo drug design. The presenter is F. Grisoni from Eindhoven University of Technology who will be discussing using AI to decipher the language of life at the molecular level and learning this language to enable applications in drug discovery.
This document summarizes a presentation about using machine learning for computational chemistry. It discusses how machine learning and computational chemistry are deeply connected, with machine learning serving as a new tool for computational chemistry. The presentation outlines how machine learning can help accelerate drug discovery and materials design for applications in health and sustainability by generating new molecules and predicting chemical reactions.
Στο παρόν φυλλάδιο μελετάμε ένα από τα πιο κοινά προβλήματα που εμφανίζονται στα μαθηματικά,
τα γραμμικά συστήματα. Παρουσιάζουμε το γενικό πρόβλημα της επίλυσης ενός
συστήματος με m εξισώσεις και n αγνώστους και την πιο γνωστή μέθοδο επίλυσης, την μέθοδο απαλοιφής του Gauss, χωρίς να δίνουμε
άλλους τρόπους επίλυσης που χρησιμοποιούν επαυξημένους πίνακες και ορίζουσες.
Gentle Introduction to Dirichlet ProcessesYap Wooi Hen
This document provides an introduction to Dirichlet processes. It begins by motivating the need for nonparametric clustering when the number of clusters in the data is unknown. It then provides an overview of Dirichlet processes and discusses them from multiple perspectives, including samples from a Dirichlet process, the Chinese restaurant process representation, stick breaking construction, and formal definition. It also covers Dirichlet process mixtures and common inference techniques like Markov chain Monte Carlo and variational inference.
Introduction to Model-Based Machine LearningDaniel Emaasit
The field of machine learning has seen the development of thousands of learning algorithms. Typically, scientists choose from these algorithms to solve specific problems. Their choices often being limited by their familiarity with these algorithms. In this classical/traditional framework of machine learning, scientists are constrained to making some assumptions so as to use an existing algorithm. This is in contrast to the model-based machine learning approach which seeks to create a bespoke solution tailored to each new problem.
WiDS Alexandria, Egypt workshop in topological data analysis (Python and R code available on request), covering persistent homology, the Mapper algorithm, and discrete Ricci curvature. Examples include text data and social network data.
This document summarizes a presentation about using machine learning for computational chemistry. It discusses how machine learning and computational chemistry are deeply connected, with machine learning serving as a new tool for computational chemistry. The presentation outlines how machine learning can help accelerate drug discovery and materials design for applications in health and sustainability by generating new molecules and predicting chemical reactions.
Στο παρόν φυλλάδιο μελετάμε ένα από τα πιο κοινά προβλήματα που εμφανίζονται στα μαθηματικά,
τα γραμμικά συστήματα. Παρουσιάζουμε το γενικό πρόβλημα της επίλυσης ενός
συστήματος με m εξισώσεις και n αγνώστους και την πιο γνωστή μέθοδο επίλυσης, την μέθοδο απαλοιφής του Gauss, χωρίς να δίνουμε
άλλους τρόπους επίλυσης που χρησιμοποιούν επαυξημένους πίνακες και ορίζουσες.
Gentle Introduction to Dirichlet ProcessesYap Wooi Hen
This document provides an introduction to Dirichlet processes. It begins by motivating the need for nonparametric clustering when the number of clusters in the data is unknown. It then provides an overview of Dirichlet processes and discusses them from multiple perspectives, including samples from a Dirichlet process, the Chinese restaurant process representation, stick breaking construction, and formal definition. It also covers Dirichlet process mixtures and common inference techniques like Markov chain Monte Carlo and variational inference.
Introduction to Model-Based Machine LearningDaniel Emaasit
The field of machine learning has seen the development of thousands of learning algorithms. Typically, scientists choose from these algorithms to solve specific problems. Their choices often being limited by their familiarity with these algorithms. In this classical/traditional framework of machine learning, scientists are constrained to making some assumptions so as to use an existing algorithm. This is in contrast to the model-based machine learning approach which seeks to create a bespoke solution tailored to each new problem.
WiDS Alexandria, Egypt workshop in topological data analysis (Python and R code available on request), covering persistent homology, the Mapper algorithm, and discrete Ricci curvature. Examples include text data and social network data.
1) Εισαγωγικοί Ορισμοί
1.1) Κατευθυνόνομενο Γράφημα
1.2) Μονοπάτια
1.3) Κύκλοι
1.4) Έσω και Έξω Βαθμός Κορυφής
1.5) Απομονωμένη Κορυφή
1.6) Πλήρες Γράφημα
2) Η Γλώσσα των Κατευθυνόμενων Γραφημάτων
2.1) Η Γλώσσα των Κατευθυνόμενων Γραφημάτων
2.2) Ερμηνείες στην Γλώσσα των Κατευθυνόμενων Γραφημάτων
3) Ασκήσεις στην Γλώσσα των Κατευθυνόμενων Γραφημάτων
3.1) Μετάφραση στα Ελληνικά
3.2) Μετάφραση στα Κατηγορηματικά
3.3) Εύρεση Αλήθειας Προτάσεων
3.4) Εύρεση Ερμηνείας που ικανοποιεί δεδομένη πρόταση
3.5) Συντομογραφίες στην Γλώσσα των Κατευθυνόμενων Γραφημάτων
Ασκήσεις
This document provides information about plagiarism and how to avoid it. It begins with some ground rules for the session and then defines plagiarism. It identifies different types of plagiarism such as copying work word for word or paraphrasing without citation. The document explains why plagiarism should be avoided, such as it violating policies. It offers strategies for avoiding plagiarism, such as using own ideas, learning how to cite sources, and understanding voice in assignments. Software for checking plagiarism is also introduced.
Curso sobre biofabricação de tecidos do Núcleo de Tecnologias Tridimensionais (NT3D) do Centro de Tecnologia da Informação Renato Archer. Os assuntos abordados incluem os seguintes tópicos:
•Conceitos da bioimpressão e biofabricação de tecidos;
•Engenharia tecidual;
•Tecnologias envolvidas;
•O papel da tecnologia da informação na bioimpressão de tecidos;
•Projetos desenvolvidos no Brasil e no mundo sobre bioimpressão de tecidos.
Synthetic biology is an emerging scientific field that combines engineering and biology to design and construct novel biological systems or redesign existing natural biological systems. The document provides a brief history of synthetic biology from 1960 to 2013, highlighting key developments such as the first synthetic genetic circuits in 2000-2003 and the engineering of metabolic pathways. It also discusses topics such as standard biological parts, modeling and design techniques, applications in health, energy and environment, as well as potential risks that need consideration with the further development of the field.
ReComp: optimising the re-execution of analytics pipelines in response to cha...Paolo Missier
Paolo Missier presented on optimizing the re-execution of analytics pipelines in response to changes in input data. The talk discussed using provenance to selectively re-run parts of workflows impacted by changes. ProvONE combines process structure and runtime provenance to enable granular re-execution. The ReComp framework detects and quantifies data changes, estimates impact, and selectively re-executes relevant sub-processes to optimize re-running workflows in response to evolving data.
computer aided detection of pulmonary nodules in ct scansWookjin Choi
The document discusses computer aided detection of pulmonary nodules in CT scans. It introduces lung cancer as a major health problem and describes how detecting nodules early can improve survival rates. It then provides an overview of pulmonary nodule detection CAD systems, describing their general structure and evaluating various approaches in the literature. Key contributions are genetic programming and shape-based classifiers and a hierarchical block analysis method that achieved high performance on a publicly available lung image database.
Multi-k-mer de novo transcriptome assembly and assembly of assemblies using 4...Jennifer Shelton
Jennifer Shelton KSU
Multi-k-mer de novo transcriptome assembly and assembly of assemblies using 454 and illumina data.
http://bioinformaticsk-state.blogspot.com/
http://bioinformatics.k-state.edu/index.html
The document discusses the field of bioinformatics, which involves applying computational techniques and building tools to solve biological problems, such as analyzing genetic sequences and modeling molecular structures. It outlines several applications of bioinformatics, including in medicine for disease research and drug design, as well as in agriculture and animal health. The emergence of bioinformatics is attributed to the convergence of rapid growth in fields like biotechnology and information technology.
We know that we are in an AI take-off, what is new is that we are in a math take-off. A math take-off is using math as a formal language, beyond the human-facing math-as-math use case, for AI to interface with the computational infrastructure. The message of generative AI and LLMs (large language models like GPT) is not that they speak natural language to humans, but that they speak formal languages (programmatic code, mathematics, physics) to the computational infrastructure, implying the ability to create a much larger problem-solving apparatus for humanity-benefitting applications in biology, energy, and space science, however not without risk.
Friday, October 15th, 2021, Sapporo, Hokkaido, Japan.
Hokkaido University ICReDD - Faculty of Medicine Joint Symposium
https://www.icredd.hokudai.ac.jp/event/5993
ICReDD (Institute for Chemical Reaction Design and Discovery)
https://www.icredd.hokudai.ac.jp
Genetic Algorithm for optimization on IRIS Dataset REPORT pdfSunil Rajput
Apply the Genetic Algorithm for optimization on a dataset obtained from UCI ML repository.
For Example: IRIS Dataset
Genetic Algorithm Optimization, Iris Dataset, Machine Learning, Python.
This document describes a classroom exercise where students developed deep neural networks to model and predict adsorption equilibrium data. The exercise introduced students to artificial intelligence and deep learning concepts. Students used MATLAB to create neural networks that modeled adsorption of acids by activated carbon at different temperatures, comparing results to theoretical models. The goals were to teach AI methodology, increase coding skills, and show neural networks can accurately model complex chemical engineering processes. Feedback confirmed students gained knowledge of machine learning terms and abilities to develop simple or sophisticated neural networks for modeling unit operations.
The adaptive mechanisms include the following AI paradigms that exhibit an ability to learn or adapt to new environments:
Swarm Intelligence (SI),
Artificial Neural Networks (ANN),
Evolutionary Computation (EC),
Artificial Immune Systems (AIS), and
Fuzzy Systems (FS).
1) Εισαγωγικοί Ορισμοί
1.1) Κατευθυνόνομενο Γράφημα
1.2) Μονοπάτια
1.3) Κύκλοι
1.4) Έσω και Έξω Βαθμός Κορυφής
1.5) Απομονωμένη Κορυφή
1.6) Πλήρες Γράφημα
2) Η Γλώσσα των Κατευθυνόμενων Γραφημάτων
2.1) Η Γλώσσα των Κατευθυνόμενων Γραφημάτων
2.2) Ερμηνείες στην Γλώσσα των Κατευθυνόμενων Γραφημάτων
3) Ασκήσεις στην Γλώσσα των Κατευθυνόμενων Γραφημάτων
3.1) Μετάφραση στα Ελληνικά
3.2) Μετάφραση στα Κατηγορηματικά
3.3) Εύρεση Αλήθειας Προτάσεων
3.4) Εύρεση Ερμηνείας που ικανοποιεί δεδομένη πρόταση
3.5) Συντομογραφίες στην Γλώσσα των Κατευθυνόμενων Γραφημάτων
Ασκήσεις
This document provides information about plagiarism and how to avoid it. It begins with some ground rules for the session and then defines plagiarism. It identifies different types of plagiarism such as copying work word for word or paraphrasing without citation. The document explains why plagiarism should be avoided, such as it violating policies. It offers strategies for avoiding plagiarism, such as using own ideas, learning how to cite sources, and understanding voice in assignments. Software for checking plagiarism is also introduced.
Curso sobre biofabricação de tecidos do Núcleo de Tecnologias Tridimensionais (NT3D) do Centro de Tecnologia da Informação Renato Archer. Os assuntos abordados incluem os seguintes tópicos:
•Conceitos da bioimpressão e biofabricação de tecidos;
•Engenharia tecidual;
•Tecnologias envolvidas;
•O papel da tecnologia da informação na bioimpressão de tecidos;
•Projetos desenvolvidos no Brasil e no mundo sobre bioimpressão de tecidos.
Synthetic biology is an emerging scientific field that combines engineering and biology to design and construct novel biological systems or redesign existing natural biological systems. The document provides a brief history of synthetic biology from 1960 to 2013, highlighting key developments such as the first synthetic genetic circuits in 2000-2003 and the engineering of metabolic pathways. It also discusses topics such as standard biological parts, modeling and design techniques, applications in health, energy and environment, as well as potential risks that need consideration with the further development of the field.
ReComp: optimising the re-execution of analytics pipelines in response to cha...Paolo Missier
Paolo Missier presented on optimizing the re-execution of analytics pipelines in response to changes in input data. The talk discussed using provenance to selectively re-run parts of workflows impacted by changes. ProvONE combines process structure and runtime provenance to enable granular re-execution. The ReComp framework detects and quantifies data changes, estimates impact, and selectively re-executes relevant sub-processes to optimize re-running workflows in response to evolving data.
computer aided detection of pulmonary nodules in ct scansWookjin Choi
The document discusses computer aided detection of pulmonary nodules in CT scans. It introduces lung cancer as a major health problem and describes how detecting nodules early can improve survival rates. It then provides an overview of pulmonary nodule detection CAD systems, describing their general structure and evaluating various approaches in the literature. Key contributions are genetic programming and shape-based classifiers and a hierarchical block analysis method that achieved high performance on a publicly available lung image database.
Multi-k-mer de novo transcriptome assembly and assembly of assemblies using 4...Jennifer Shelton
Jennifer Shelton KSU
Multi-k-mer de novo transcriptome assembly and assembly of assemblies using 454 and illumina data.
http://bioinformaticsk-state.blogspot.com/
http://bioinformatics.k-state.edu/index.html
The document discusses the field of bioinformatics, which involves applying computational techniques and building tools to solve biological problems, such as analyzing genetic sequences and modeling molecular structures. It outlines several applications of bioinformatics, including in medicine for disease research and drug design, as well as in agriculture and animal health. The emergence of bioinformatics is attributed to the convergence of rapid growth in fields like biotechnology and information technology.
We know that we are in an AI take-off, what is new is that we are in a math take-off. A math take-off is using math as a formal language, beyond the human-facing math-as-math use case, for AI to interface with the computational infrastructure. The message of generative AI and LLMs (large language models like GPT) is not that they speak natural language to humans, but that they speak formal languages (programmatic code, mathematics, physics) to the computational infrastructure, implying the ability to create a much larger problem-solving apparatus for humanity-benefitting applications in biology, energy, and space science, however not without risk.
Friday, October 15th, 2021, Sapporo, Hokkaido, Japan.
Hokkaido University ICReDD - Faculty of Medicine Joint Symposium
https://www.icredd.hokudai.ac.jp/event/5993
ICReDD (Institute for Chemical Reaction Design and Discovery)
https://www.icredd.hokudai.ac.jp
Genetic Algorithm for optimization on IRIS Dataset REPORT pdfSunil Rajput
Apply the Genetic Algorithm for optimization on a dataset obtained from UCI ML repository.
For Example: IRIS Dataset
Genetic Algorithm Optimization, Iris Dataset, Machine Learning, Python.
This document describes a classroom exercise where students developed deep neural networks to model and predict adsorption equilibrium data. The exercise introduced students to artificial intelligence and deep learning concepts. Students used MATLAB to create neural networks that modeled adsorption of acids by activated carbon at different temperatures, comparing results to theoretical models. The goals were to teach AI methodology, increase coding skills, and show neural networks can accurately model complex chemical engineering processes. Feedback confirmed students gained knowledge of machine learning terms and abilities to develop simple or sophisticated neural networks for modeling unit operations.
The adaptive mechanisms include the following AI paradigms that exhibit an ability to learn or adapt to new environments:
Swarm Intelligence (SI),
Artificial Neural Networks (ANN),
Evolutionary Computation (EC),
Artificial Immune Systems (AIS), and
Fuzzy Systems (FS).
The document discusses the growth of data-intensive science and the need for new computing infrastructures to manage the large amounts of data being produced. It covers three perspectives on infrastructure: grid computing which enables sharing of distributed resources over the internet, data centers which provide integrated storage and computing services, and e-science which combines grids, collaboration tools, and data analysis services. Examples are given of different scientific domains using these infrastructures.
Applications of Computer Science in Environmental ModelsIJLT EMAS
Computation is now regarded as an equal and
indispensable partner, along with theory and experiment, in the
advance of scientific knowledge and engineering practice.
Numerical simulation enables the study of complex systems and
natural phenomena that would be too expensive or dangerous, or
even impossible, to study by direct experimentation. The quest
for ever higher levels of detail and realism in such simulations
requires enormous computational capacity, and has provided the
impetus for dramatic breakthroughs in computer algorithms and
architectures. Due to these advances, computational scientists
and engineers can now solve large-scale problems that were once
thought intractable. Computational science and engineering
(CSE) is a rapidly growing multidisciplinary area with
connections to the sciences, engineering, and mathematics and
computer science. CSE focuses on the development of problemsolving
methodologies and robust tools for the solution of
scientific and engineering problems. We believe that CSE will
play an important if not dominating role for the future of the
scientific discovery process and engineering design. The
computation science is now being used widely for environmental
engineering calculations. The behavior of environmental
engineering systems and processes can be studied with the help
of computation science and understanding as well as better
solutions to environmental engineering problems can be
obtained.
Mapping Genotype to Phenotype using Attribute Grammar, Laura Adammadalladam
Defense -- thesis: “Mapping Genotype to Phenotype using Attribute Grammar.”
PhD degree in Genetics, Bioinformatics and Computational Biology (GBCB) in the tracks of Computer Science, Mathematics and Life Sciences.
ReComp and P4@NU: Reproducible Data Science for HealthPaolo Missier
brief overview of the ReComp project (http://recomp.org.uk) on Selective recurring re-computation of complex analytics, and a brief outlook for the P4@NU project on seeking digital biomarkers for age-0related metabolic diseases
The document discusses advance techniques of computational intelligence for biomedical image analysis. It provides an overview of computational intelligence, which involves adaptive mechanisms like artificial neural networks, evolutionary computation, fuzzy systems, and swarm intelligence. These techniques exhibit an ability to learn or adapt to new environments. The document also discusses deep learning techniques like convolutional neural networks and recurrent neural networks that are widely used for tasks like image classification.
Future Directions in Engineering BiologyIlya Klabukov
This document summarizes the key discussions and major points from a two-day workshop on future directions in engineering biology. The workshop brought together thought leaders from universities and industry to discuss major challenges and opportunities in the field. Key points included: (1) a need for standardized biological data and models to reconstruct networks from genomic data; (2) emerging strategies to produce diverse products using synthetic biology and systems biology; and (3) major technical challenges such as understanding biological networks, creating knowledge from data, and developing new tools for design and characterization. Advances in areas like biomanufacturing, biomedicine, energy and the environment were also discussed.
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...University of Maribor
Slides from:
11th International Conference on Electrical, Electronics and Computer Engineering (IcETRAN), Niš, 3-6 June 2024
Track: Artificial Intelligence
https://www.etran.rs/2024/en/home-english/
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxMAGOTI ERNEST
Although Artemia has been known to man for centuries, its use as a food for the culture of larval organisms apparently began only in the 1930s, when several investigators found that it made an excellent food for newly hatched fish larvae (Litvinenko et al., 2023). As aquaculture developed in the 1960s and ‘70s, the use of Artemia also became more widespread, due both to its convenience and to its nutritional value for larval organisms (Arenas-Pardo et al., 2024). The fact that Artemia dormant cysts can be stored for long periods in cans, and then used as an off-the-shelf food requiring only 24 h of incubation makes them the most convenient, least labor-intensive, live food available for aquaculture (Sorgeloos & Roubach, 2021). The nutritional value of Artemia, especially for marine organisms, is not constant, but varies both geographically and temporally. During the last decade, however, both the causes of Artemia nutritional variability and methods to improve poorquality Artemia have been identified (Loufi et al., 2024).
Brine shrimp (Artemia spp.) are used in marine aquaculture worldwide. Annually, more than 2,000 metric tons of dry cysts are used for cultivation of fish, crustacean, and shellfish larva. Brine shrimp are important to aquaculture because newly hatched brine shrimp nauplii (larvae) provide a food source for many fish fry (Mozanzadeh et al., 2021). Culture and harvesting of brine shrimp eggs represents another aspect of the aquaculture industry. Nauplii and metanauplii of Artemia, commonly known as brine shrimp, play a crucial role in aquaculture due to their nutritional value and suitability as live feed for many aquatic species, particularly in larval stages (Sorgeloos & Roubach, 2021).
The binding of cosmological structures by massless topological defectsSérgio Sacani
Assuming spherical symmetry and weak field, it is shown that if one solves the Poisson equation or the Einstein field
equations sourced by a topological defect, i.e. a singularity of a very specific form, the result is a localized gravitational
field capable of driving flat rotation (i.e. Keplerian circular orbits at a constant speed for all radii) of test masses on a thin
spherical shell without any underlying mass. Moreover, a large-scale structure which exploits this solution by assembling
concentrically a number of such topological defects can establish a flat stellar or galactic rotation curve, and can also deflect
light in the same manner as an equipotential (isothermal) sphere. Thus, the need for dark matter or modified gravity theory is
mitigated, at least in part.
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...AbdullaAlAsif1
The pygmy halfbeak Dermogenys colletei, is known for its viviparous nature, this presents an intriguing case of relatively low fecundity, raising questions about potential compensatory reproductive strategies employed by this species. Our study delves into the examination of fecundity and the Gonadosomatic Index (GSI) in the Pygmy Halfbeak, D. colletei (Meisner, 2001), an intriguing viviparous fish indigenous to Sarawak, Borneo. We hypothesize that the Pygmy halfbeak, D. colletei, may exhibit unique reproductive adaptations to offset its low fecundity, thus enhancing its survival and fitness. To address this, we conducted a comprehensive study utilizing 28 mature female specimens of D. colletei, carefully measuring fecundity and GSI to shed light on the reproductive adaptations of this species. Our findings reveal that D. colletei indeed exhibits low fecundity, with a mean of 16.76 ± 2.01, and a mean GSI of 12.83 ± 1.27, providing crucial insights into the reproductive mechanisms at play in this species. These results underscore the existence of unique reproductive strategies in D. colletei, enabling its adaptation and persistence in Borneo's diverse aquatic ecosystems, and call for further ecological research to elucidate these mechanisms. This study lends to a better understanding of viviparous fish in Borneo and contributes to the broader field of aquatic ecology, enhancing our knowledge of species adaptations to unique ecological challenges.
Phenomics assisted breeding in crop improvementIshaGoswami9
As the population is increasing and will reach about 9 billion upto 2050. Also due to climate change, it is difficult to meet the food requirement of such a large population. Facing the challenges presented by resource shortages, climate
change, and increasing global population, crop yield and quality need to be improved in a sustainable way over the coming decades. Genetic improvement by breeding is the best way to increase crop productivity. With the rapid progression of functional
genomics, an increasing number of crop genomes have been sequenced and dozens of genes influencing key agronomic traits have been identified. However, current genome sequence information has not been adequately exploited for understanding
the complex characteristics of multiple gene, owing to a lack of crop phenotypic data. Efficient, automatic, and accurate technologies and platforms that can capture phenotypic data that can
be linked to genomics information for crop improvement at all growth stages have become as important as genotyping. Thus,
high-throughput phenotyping has become the major bottleneck restricting crop breeding. Plant phenomics has been defined as the high-throughput, accurate acquisition and analysis of multi-dimensional phenotypes
during crop growing stages at the organism level, including the cell, tissue, organ, individual plant, plot, and field levels. With the rapid development of novel sensors, imaging technology,
and analysis methods, numerous infrastructure platforms have been developed for phenotyping.
ESR spectroscopy in liquid food and beverages.pptxPRIYANKA PATEL
With increasing population, people need to rely on packaged food stuffs. Packaging of food materials requires the preservation of food. There are various methods for the treatment of food to preserve them and irradiation treatment of food is one of them. It is the most common and the most harmless method for the food preservation as it does not alter the necessary micronutrients of food materials. Although irradiated food doesn’t cause any harm to the human health but still the quality assessment of food is required to provide consumers with necessary information about the food. ESR spectroscopy is the most sophisticated way to investigate the quality of the food and the free radicals induced during the processing of the food. ESR spin trapping technique is useful for the detection of highly unstable radicals in the food. The antioxidant capability of liquid food and beverages in mainly performed by spin trapping technique.
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills MN
Travis Hills of Minnesota developed a method to convert waste into high-value dry fertilizer, significantly enriching soil quality. By providing farmers with a valuable resource derived from waste, Travis Hills helps enhance farm profitability while promoting environmental stewardship. Travis Hills' sustainable practices lead to cost savings and increased revenue for farmers by improving resource efficiency and reducing waste.
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Ana Luísa Pinho
Functional Magnetic Resonance Imaging (fMRI) provides means to characterize brain activations in response to behavior. However, cognitive neuroscience has been limited to group-level effects referring to the performance of specific tasks. To obtain the functional profile of elementary cognitive mechanisms, the combination of brain responses to many tasks is required. Yet, to date, both structural atlases and parcellation-based activations do not fully account for cognitive function and still present several limitations. Further, they do not adapt overall to individual characteristics. In this talk, I will give an account of deep-behavioral phenotyping strategies, namely data-driven methods in large task-fMRI datasets, to optimize functional brain-data collection and improve inference of effects-of-interest related to mental processes. Key to this approach is the employment of fast multi-functional paradigms rich on features that can be well parametrized and, consequently, facilitate the creation of psycho-physiological constructs to be modelled with imaging data. Particular emphasis will be given to music stimuli when studying high-order cognitive mechanisms, due to their ecological nature and quality to enable complex behavior compounded by discrete entities. I will also discuss how deep-behavioral phenotyping and individualized models applied to neuroimaging data can better account for the subject-specific organization of domain-general cognitive systems in the human brain. Finally, the accumulation of functional brain signatures brings the possibility to clarify relationships among tasks and create a univocal link between brain systems and mental functions through: (1) the development of ontologies proposing an organization of cognitive processes; and (2) brain-network taxonomies describing functional specialization. To this end, tools to improve commensurability in cognitive science are necessary, such as public repositories, ontology-based platforms and automated meta-analysis tools. I will thus discuss some brain-atlasing resources currently under development, and their applicability in cognitive as well as clinical neuroscience.
ANAMOLOUS SECONDARY GROWTH IN DICOT ROOTS.pptxRASHMI M G
Abnormal or anomalous secondary growth in plants. It defines secondary growth as an increase in plant girth due to vascular cambium or cork cambium. Anomalous secondary growth does not follow the normal pattern of a single vascular cambium producing xylem internally and phloem externally.
When I was asked to give a companion lecture in support of ‘The Philosophy of Science’ (https://shorturl.at/4pUXz) I decided not to walk through the detail of the many methodologies in order of use. Instead, I chose to employ a long standing, and ongoing, scientific development as an exemplar. And so, I chose the ever evolving story of Thermodynamics as a scientific investigation at its best.
Conducted over a period of >200 years, Thermodynamics R&D, and application, benefitted from the highest levels of professionalism, collaboration, and technical thoroughness. New layers of application, methodology, and practice were made possible by the progressive advance of technology. In turn, this has seen measurement and modelling accuracy continually improved at a micro and macro level.
Perhaps most importantly, Thermodynamics rapidly became a primary tool in the advance of applied science/engineering/technology, spanning micro-tech, to aerospace and cosmology. I can think of no better a story to illustrate the breadth of scientific methodologies and applications at their best.
BREEDING METHODS FOR DISEASE RESISTANCE.pptxRASHMI M G
Plant breeding for disease resistance is a strategy to reduce crop losses caused by disease. Plants have an innate immune system that allows them to recognize pathogens and provide resistance. However, breeding for long-lasting resistance often involves combining multiple resistance genes
1. From Words to Wonders:
Language Models for Life
Sciences
Room L1.02
Robots Unleashed: The Rise of AI-
Driven Chemical Discovery
Room L1.01
16 November 2023
2. f.grisoni@tue.nl
Learning the biochemical language with AI
A drug discovery tale
ChemAI workshop | Nov 17, 2023
f.grisoni@tue.nl
F. Grisoni | Assistant Professor
Institute for Complex Molecular Systems (ICMS)
Department of Biomedical Engineering, Eindhoven University of Technology (TU/e)
3. 3 | F. Grisoni |
The language of life
ChemAI workshop | Nov 17, 2023
DNA Proteins Chemical signals
“The general goal of linguistics […] addresses the same problems facing molecular biologists.”1
1Bralley P (1996). An introduction to molecular linguistics. BioScience, 46, 146.
4. 4 | F. Grisoni |
Deciphering the language of life
ChemAI workshop | Nov 17, 2023
Image from ancestry.com
I running am I am running
I am running
for president
I am running
a marathon
• Syntax: set of rules that dictate how
sentences or expressions should be
structured.
• Semantics: meaning conveyed by the
elements and structures of a language.
5. 5 | F. Grisoni |
Deciphering the language of life
ChemAI workshop | Nov 17, 2023
• Syntax: set of rules that dictate how
sentences or expressions should be
structured.
• Semantics: meaning conveyed by the
elements and structures of a language.
Image from ancestry.com
DNA
RNA
Protein
Codons
Codons
6. 6 | F. Grisoni |
Deciphering the language of life
ChemAI workshop | Nov 17, 2023
How can we learn the biomolecular language with AI?
What can we do with it?
7. 7 | F. Grisoni |
Natural language processing
ChemAI workshop | Nov 17, 2023
8. 8 | F. Grisoni |
The vastness of the chemical universe
ChemAI workshop | Nov 17, 2023
Chemical Universe1,2
1060
104
Known
small molecule drugs
Cells in a human body
1013 – 1014
108 – 109
Stars in the Milky Way
1Ertl (2002) Journal of Chemical Information and Computer Sciences 43, 374.
2Walters et al. (1998). Drug Discovery Today 3, 160.
9. 9 | F. Grisoni |
C
C
C
C1
C
C
C
C
C
C
C
C
C OH
O
=
=
CC(C)CC1=CC=C(C=C1)C(C)C(=O)O
G E
Chemical language models (CLMs)
ChemAI workshop | Nov 17, 2023
• “Syntax”
• “Semantics”
1Hochreiter S, Schmidhuber J (1997). Neural computation 9, 1735.
Segler MH, Kogej T, Tyrchan C, Waller MP (2018). ACS Central Science 4,120.
G
C
C c C
…
c 1 E
Recurrent neural network with long short-term memory1
10. 10 | F. Grisoni |
G
O
= 1
…
= C E
O
Recurrent neural network with long short-term memory1
Chemical language models (CLMs)
ChemAI workshop | Nov 17, 2023
1Hochreiter S, Schmidhuber J (1997). Neural computation 9, 1735.
Segler MH, Kogej T, Tyrchan C, Waller MP (2018). ACS Central Science 4,120.
CC(C)CC1=CC=C(C=C1)C(C)C(=O)O
G E
• “Syntax”
• “Semantics”
C
C
C
C1
C
C
C
C
C
C
C
C
C OH
O
=
=
11. 11 | F. Grisoni |
Fine-tuning
Transfer learning
ChemAI workshop | Nov 17, 2023
Pretraining
Generic model
Focused model
Valid ≥ 90%
Novel ≥ 90%
300k bioactive molecules
Segler MH, Kogej T, Tyrchan C, Waller MP (2018). ACS Central Science 4,120.
Merk D, Friedrich L, Grisoni F, Schneider G (2018) Mol. Inf. 37, 1700153.
25 RXR and
PPAR modulators
13. 13 | F. Grisoni |
Applications of chemical language modelling
ChemAI workshop | Nov 17, 2023
Bidirectional molecule generation1
E....O)CCCGC=C(C....E
C
C
C
C1
C
C
C
C
C
C
C
C
C OH
O
=
=
1Grisoni F, Moret M, Lingwood R, Schneider G (2020). J. Chem. Inf. Mod. 60, 1175.
2Grisoni F, Huisman BH, Button AL, et al. (2021). Science Advances 7, eabg3338.
3Moret M, Helmstädter M, Grisoni F et al.(2021). Angewandte Chemie 60, 19477.
Automated design-make-test2
Natural product-inspired design3
14. 14 | F. Grisoni |
‘One-shot’ de novo design of Nurr1 agonists
ChemAI workshop | Nov 17, 2023
Ballarotto M et al. (2023). J. Med. Chem. 66, 12.
300k bioactive molecules
Generic model Potent agonist
Weak agonists
EC50 = 0.07±0.02 µM EC50 = 2.1±0.6 uM
2 novel Nurr1 agonists
D. Merk
(@LMU)
15. 15 | F. Grisoni |
Moret M, Pachon I, Cotos L et al. (2023). Nat. Comms 14, 114.
From language processing to chemistry and back
ChemAI workshop | Nov 17, 2023
ELECTRA pretraining
CN1CC=CC1=O
CC1CC=CC1=F
Corruption
M. Moret
(@ETH)
18
N
N N
N
Br
O
NH2
O
22
N
N N
N
OH
Cl
NH2
Cl
Repression of PI3K-AKT signalling in tumour cells
16. 16 | F. Grisoni |
S4 for de novo drug design
IPM Colloquium 2023
Özçelik R, de Ruiter S, Grisoni F (2023). ChemRxiv.
1Gu A, Goel K and Re C (2022). ICLR.
R. Özçelik
S. de Ruiter
Structured State-Space Sequence (S4) models1
17. 17 | F. Grisoni |
Other biomolecular languages
ChemAI workshop | Nov 17, 2023
Small molecules Peptides and proteins
Syntax
Semantics
“I like pears and apples, I do not
like oranges”
“I pears, I oranges and like do
apples like not”
Alphabetic syntax Symbolic syntax
CC(C)CC1=CC=C(C=C1)C(C)C(=O)O LTKAKLKILNCLHDG
18. 18 | F. Grisoni |
Other biomolecular languages
ChemAI workshop | Nov 17, 2023
ChemMedChem 13, 2018 - Front Cover
V G S A
1Grisoni F, Neuhaus, CS, Gabernet G et al. (2018) ChemMedChem 13, 1300.
300,000 bioactive
molecules
25 in house ACPs
Pre-training Focused model
100k virtual
peptides
Generic model
1000 sequences (12)
Generating anticancer peptides (ACPs)1
19. 19 | F. Grisoni |
ID Sequence EC50 [μm] HC50 [μm]
1 KLWKKIEKLIKKLLTSIR 47±3 236±13
2 YIWARAERVWLWWGKFLSL 56±3 -
3 ELAKKLTKLKRQLHRIW - -
4 DLFKQLQRLFLGILYCLYKIW 47±4 132±16
5 KLIDQWKKVLYHVE - -
6 AIKKFGPLAKIVAKV 95±4 -
7 RWNGRIIKGFYNLVKIWKDLKG 42±4 89±6
8 KVWKIKKNIRRLLHGIKRGWKG 34±4 -
9 GFWARIGKVFAAVKNL 101±4 -
10 AFLYRLTRQIRPWWRWLYKW 45.5±0.8 34±5
11 RIWGKHSRYIKIVKRLIQ 50±10 -
12 QIWHKIRKLWQIIKDGF 16.1±0.3 23±5
In vitro activity on cancer cells (MCF7) and human erythrocytes
Other biomolecular languages
ChemAI workshop | Nov 17, 2023
ChemMedChem 13, 2018 - Front Cover
Generating anticancer peptides (ACPs)1
1Grisoni F, Neuhaus, CS, Gabernet G et al. (2018) ChemMedChem 13, 1300.
20. 20 | F. Grisoni |
Other biomolecular languages
ChemAI workshop | Nov 17, 2023
ChemMedChem 13, 2018 - Front Cover
Generating anticancer peptides (ACPs)1
Y. Nana Teukam
Enzyme design
21. 21 | F. Grisoni |
Acknowledgements
ChemAI workshop | Nov 17, 2023
f.grisoni@tue.nl
Rıza Özçelik
Sarah de Ruiter
Yves Nana Teukam (w/ IBM)
Derek van Tilborg
Emanuele Criscuolo
Helena Brinkmann
Luke Rossen
Cristina Izquierdo (w/ Albertazzi)
Laura van Weesep
Inge Groffen Gisbert Schneider
Michael Moret
Lukas Friedrich
Berend H. Huisman
Daniel Merk
Moritz Helmstädter
Marco Ballarotto
Matteo Manica
Teodoro Laino
@fra_grisoni
@molecularML
22. NVIDIA BioNeMo
Foundry to Build Generative AI for Drug Discovery
Dr. David Ruau, Head of strategic Alliances Drug Discovery, EMEA
ChemAI, Nov 16
23. NVIDIA AI Foundations
Cloud Services to Create and Run Custom Generative AI Models
NeMo BioNeMo Picasso
NVIDIA AI Foundations
NVIDIA DGX Cloud
NVIDIA AI Enterprise
24. Each Enterprise Needs Its Own AI
As-a-Service Public Cloud Private Cloud Edge
Operationalize and Inference at Scale
Train New Model
with Your Data
Optimize a Model You’ve
Already Trained
Customize a Foundation
Model with Your Data
25. NVIDIA Clara
$1.5T Industry |$500B R&D Spend |10+ Years to Bring a Drug to Market
FLARE
Federated Learning
MONAI
Imaging AI
PARABRICKS
Genomics
BIONEMO
Biology Gen AI & LLMs
NEMO
Generative AI & LLMs
TARGET PRE-CLINICAL
LEAD CLINICAL
OPTIMIZE COMMERCIAL
NVIDIA DGX Cloud
Chips, Systems, Networking, Data Center Scale
Pre-Trained Models
Accelerated Training
Optimized Inference
Cloud Services & APIs
NVIDIA AI
Frameworks, Infrastructure, SDKs, Toolkits, Libraries
26. CONTROLLED
GENERATION
GENERATE FUNCTIONAL
PROTEINS
GENERATE
MOLECULES
PREDICT GENE
EXPRESSION
PREDICT COMPLEX
STRUCTURES
PREDICT VIRUS
EVOLUTION
Generative AI is Turning Biology From Science to Engineering
Explosion of Biomolecular Gen AI Research |Joint NVIDIA Biomolecular Gen AI Research
Source: arXiv.org Q-bio: AI, ML, DL, NN
200
400
600
800
1000
0
2012 2014 2016 2018 2020 2022
ESM
AlphaFold
CASP13
AlphaFold2
CASP14
ESM2
EquiFold
DiffDock
OpenFold
ProteinMPNN
ProtGPT2
…
AI
Biology
arXiv
Papers
DNABERT
27. Generative AI Accelerates Early Drug Discovery
3 Years Faster |100s of Millions Cheaper
Source: arXiv.org Q-bio: AI, ML, DL, NN
200
400
600
800
1000
0
2012 2014 2016 2018 2020 2022
ESM
AlphaFold
CASP13
AlphaFold2
CASP14
ESM2
EquiFold
DiffDock
GenSLMs
ProteinMPNN
…
DNABERT
TARGET LEAD OPTIMIZATION
Early
Discovery
~$500M
4.5Yrs
Traditional
Early Discovery
$2M
1.5Yrs
Generative AI
Early Discovery
3x Faster
200x Cheaper
28. BioNeMo is a Cloud Managed Service
Customize and Run Generative AI for Computer Aided Drug Discovery
Your Data
Your Model
Inference
Your App
Optimize
Train
Pre-Trained
Models
BioNeMo
Fine-Tune
AlphaFold2
OpenFold
ESMFold
MoFlow
MegaMolBART
DiffDock
ESM1nv
ESM2
ProGPT2
NVIDIA DGX Cloud
Your Model
29. 9 SOTA Models are Optimized for Drug Discovery Applications
Quick and effortless path to scale, speed, and experimentation
ProtGPT2
Protein Generation
Sequence
Generation
Amino Acid
Sequence
Protein
Structure
Amino Acid
Sequence
ESMFold
OpenFold
AlphaFold2
Protein Structure Prediction
Learned
Embeddings
Amino Acid
Sequence
Docked
Structures
Structures DiffDock
Molecular Docking
Molecule
Generation
Molecule
SMILES
MoFlow
MegaMolBART
Molecular Generation
ESM1nv
ESM2
Protein Learned Sequence & Structure
NVIDIA DGX Cloud
Molecular Learned Representation
Molecule
SMILES
MegaMolBART Learned
Embeddings
30. BioNeMo Inference Service in EA2
A suite of AI computer aided drug discovery models |Optimized for scale, speed and cost
NVIDIA BioNeMo
API Endpoints
NVIDIA DGX
Cloud
NVIDIA BioNeMo
Web Interface
Easy API Integration
Streamline application development and
eliminate management of infrastructure
with and easy-to-use API endpoints.
Interactive Web Experimentation
Instantly bring your own data to
experience the precision and speed of
Gen AI for drug discovery applications
SOTA Models
Suite of state-of-the-art generative models
across the drug discovery process from initial
design to lead optimization
Optimized Model Deployment
Designed for scale and optimized
for the quickest inference time,
reducing deployment costs.
AlphaFold OpenFold ESMFold DiffDock
ProGPT2 MegaMolBART
MoFlow ESM2
ESM1nv
Structure Prediction Pose Prediction
Biomolecular Generation Property Prediction
Target
Discovery
Lead
Discovery
Virtual
Screening
Lead
Optimization
31. Models Accessible on an Easy-to-Use Graphic User Interface
Interactive Inference |Visualization |Experimentation
ESM Fold
3D Protein Structure Prediction
32. Models Accessible on an Easy-to-Use Graphic User Interface
Interactive Inference |Visualization |Experimentation
MegaMolBART
Molecular Generation
33. Models Accessible on an Easy-to-Use Graphic User Interface
Interactive Inference |Visualization |Experimentation
DiffDock
Molecular Docking
34. BioNeMo Training Service in Beta
Fast and easy Gen AI training for drug discovery |Unleash drug discovery data potential
Data Loaders
SMILES, Proteins
Pre-Training Fine-Tuning Advanced Monitoring
Foundation
Model
Customized
Model
Task Specific
Model
AI DRUG DISCOVERY
APPLICATIONS
OpenFold
MMB
ESM1
ProT5
BioNeMo
Pre-Trained Models
NVIDIA DGX
Cloud
NVIDIA DGX
On-Prem
NVIDIA BASE
COMMAND PLATFORM
Flexible Training Workflows
Workflows to support from scratch large
scale pre-training, pre-trained model fine-
tuning and task-tuning on your own data
Enterprise Support
NVIDIA AI Enterprise and experts
by your side to keep projects on
track
Simple Data Loading
Automatic download and preprocess of Uniref
(proteins) and Zinc (molecules), supports
SMILES and protein sequence data loading
Optimized Scaling Recipes
Accelerated training throughput
with model and data parallel
training across 1,000s of nodes
35. Customers Accelerating Drug Discovery
BioNeMo helping to customize and run generative AI for drug discovery
Instadeep Nucleotide Transformer
500M -> 2.5B Parameter Model
SOTA 15 of 18 Benchmarks
175B Nucleotide Multi Species Sequences
Supercomputing Scale - 16 DGX Cloud
Evozyne ProT-VAE
BioNeMo Training Service
Protein Transformer Variational Autoencoder
Functional Protein Design
Experimentally Validated
Amgen BioNeMo DGX Cloud
5 Proprietary Antibody Language Models
3x Speedup – 3 Months to 4 Weeks
Up to 100x Post Training Analysis
Optimized OpenFold Service, 20x per Prediction
36. Generative AI Speeds Biologics Drug Discovery
Challenge
Traditional biologics discovery is a costly
process, and sparse data make predictions
even more challenging.
Amgen wanted to accelerate biologics
discovery by using AI models to propose
and evaluate designs for candidate drugs.
Required powerful multi-node
infrastructure to accelerate training of
large protein models with extensive data.
Solution
Trained large language models (LLMs) on Amgen’s
proprietary data to help predict properties of proteins
and develop biologics with enhanced properties.
Leveraged NVIDIA DGX Cloud and BioNeMo for
training and fine-tuning of protein LLMs and NVIDIA
RAPIDS for faster post-training analysis.
BioNemo on DGX Cloud, a turnkey solution enabled
Amgen to get up and running quickly, moving from
initial login to training large models in just a few days.
NVIDIA DGX Cloud
AI-training-as-a-service solution
Faster protein
structure prediction
20sec/
structure
100x
<1month
Faster post-training
analysis
From onboarding to
first pretrained
protein LLM
“Easeof multi-node training and the ability to use larger
batch sizes within DGX Cloud enabled us to achieve our
three-month objectives in just four weeks..”
- Chris James Langmead, Director of Digital
Biologics Discovery, Amgen
NVIDIA Base Command
Platform
for workflow management
NVIDIA AI Enterprise
RAPIDS for data post-processing
NVIDIA BioNeMo
For training and inferencing
37. Next Steps
• Try MegaMolBART on NVIDIA LaunchPad
• Sign up for Early Access for BioNeMo
• Register for no-cost, 2 week POC on NVIDIA DGX SuperCloud
Contact your account representative