Perturbing The Interactome: Multi-Omics And Personalized Methods For Network ...Marc Santolini
In this talk, I will describe several recently developed methods to study disease perturbations through the lens of network science. First I will present evidence that one can accurately predict perturbation patterns from the topology of biological networks, even when lacking measurements on the kinetic parameters governing the dynamics of these interactions. Using 87 biochemical networks with experimentally measured kinetic parameters, we show that a knowledge of the network topology offers 65% to 80% accuracy in predicting the impact of perturbations. In other words, we can use the increasingly accurate topological models to approximate perturbation patterns, bypassing expensive kinetic constant measurement. These results open new avenues in modeling drug action, and in identifying drug targets relying on the human interactome only.
Then, I will present a novel approach to identify the collective impact of miRNAs in disease. Instead of focusing on the magnitude of miRNA differential expression, here we address the secondary consequences for the interactome. We developed the Impact of Differential Expression Across Layers (IDEAL), a network-based algorithm to prioritize disease-relevant miRNAs based on the central role of their targets in the molecular interactome. This method was used in the context of asthmatic Th2 inflammation and identified five Th2-related miRNAs (mir27b, mir206, mir106b, mir203, and mir23b) whose antagonization led to a sharp reduction of the Th2 phenotype. This result offers novel approaches for therapeutic interventions.
Finally, I will present an investigation of the personalized gene expression responses when inducing hypertrophy and heart failure in 100+ strains of genetically distinct mice from the Hybrid Mouse Diversity Panel (HMDP). I will show that genes whose expression change significantly correlates with the severity of the disease are either up- or down-regulated across strains, and therefore missed by traditional population-wide analyses of differential gene expression. These uncovered personalised genes are enriched in human cardiac disease genes and form a dense co-regulated module strongly interacting with the cardiac hypertrophic signaling network in the human interactome, the set of molecular interactions in the cell. We validate our approach by showing that the knockdown of Hes1, predicted as a strong candidate, induces a dramatic reduction of hypertrophy by 80-90% in neonatal rat ventricular myocytes, demonstrating that individualized approaches are crucial to identify genes underlying complex diseases as well as to develop personalized therapies.
Perturbing The Interactome: Multi-Omics And Personalized Methods For Network ...Marc Santolini
In this talk, I will describe several recently developed methods to study disease perturbations through the lens of network science. First I will present evidence that one can accurately predict perturbation patterns from the topology of biological networks, even when lacking measurements on the kinetic parameters governing the dynamics of these interactions. Using 87 biochemical networks with experimentally measured kinetic parameters, we show that a knowledge of the network topology offers 65% to 80% accuracy in predicting the impact of perturbations. In other words, we can use the increasingly accurate topological models to approximate perturbation patterns, bypassing expensive kinetic constant measurement. These results open new avenues in modeling drug action, and in identifying drug targets relying on the human interactome only.
Then, I will present a novel approach to identify the collective impact of miRNAs in disease. Instead of focusing on the magnitude of miRNA differential expression, here we address the secondary consequences for the interactome. We developed the Impact of Differential Expression Across Layers (IDEAL), a network-based algorithm to prioritize disease-relevant miRNAs based on the central role of their targets in the molecular interactome. This method was used in the context of asthmatic Th2 inflammation and identified five Th2-related miRNAs (mir27b, mir206, mir106b, mir203, and mir23b) whose antagonization led to a sharp reduction of the Th2 phenotype. This result offers novel approaches for therapeutic interventions.
Finally, I will present an investigation of the personalized gene expression responses when inducing hypertrophy and heart failure in 100+ strains of genetically distinct mice from the Hybrid Mouse Diversity Panel (HMDP). I will show that genes whose expression change significantly correlates with the severity of the disease are either up- or down-regulated across strains, and therefore missed by traditional population-wide analyses of differential gene expression. These uncovered personalised genes are enriched in human cardiac disease genes and form a dense co-regulated module strongly interacting with the cardiac hypertrophic signaling network in the human interactome, the set of molecular interactions in the cell. We validate our approach by showing that the knockdown of Hes1, predicted as a strong candidate, induces a dramatic reduction of hypertrophy by 80-90% in neonatal rat ventricular myocytes, demonstrating that individualized approaches are crucial to identify genes underlying complex diseases as well as to develop personalized therapies.
We know that we are in an AI take-off, what is new is that we are in a math take-off. A math take-off is using math as a formal language, beyond the human-facing math-as-math use case, for AI to interface with the computational infrastructure. The message of generative AI and LLMs (large language models like GPT) is not that they speak natural language to humans, but that they speak formal languages (programmatic code, mathematics, physics) to the computational infrastructure, implying the ability to create a much larger problem-solving apparatus for humanity-benefitting applications in biology, energy, and space science, however not without risk.
Multi-trait modeling in polygenic scores
2022.03.02 Bioinformatics seminar at University of Osaka, Japan
複数の表現型を考慮したポリジェニック・スコア解析
2022.03.02 バイオインフォマティクスセミナー @ 大阪大学
Exploiting NLP for Digital Disease InformaticsNigel Collier
Accurate and timely collection of facts from a range of text sources is crucial for supporting the work of experts in detecting and understanding highly complex diseases. In this talk I illustrate several applications using techniques that baseline Natural Language Processing (NLP) pipelines against human-curated biomedical gold standards. (1) In the BioCaster project , high throughput text mining on multilingual news was employed to map infectious disease outbreaks. In order to detect norm violations we show the effectiveness of a range of time series analysis algorithms evaluated against ProMED-mail; (2)In the PhenoMiner project, using an ensemble approach together with SVM learn-to-rank, we show how named entity recognition can achieve improved levels of performance for biomedical concepts. We show however that performance still remains fragile when adapting to new disease domains; (3) Finally, I will discuss how in the SIPHS project we are building concept recognition systems based on deep learning to understand the ‘voice of the patient’ in social media messages.
Четыре величайших предвидения Кольцова:
1. Вопрос о форме (морфе) белковой молекулы лежит в основе всей проблемы физико-химической природы морфологии организмов.
2. Хромосомы – это огромные белковые молекулы, в которых радикалы распределены в определенном для каждого вида порядке.
3. Сложность такой молекулы столь велика, что ее копия не может создаваться в клетке заново.
Она возникает только при наличии в клетке уже готовой молекулы – затравки.
Omnis molecula ex molecula
4. Радикалы хромосомной молекулы занимают в ней совершенно определенное место, и малейшие химические изменения в этих радикалах, должны являться источником новых мутаций.
Научно слабо обоснованные попытки практического использования плохо воспроизводимых результатов плохо спланированных исследований генетических «ассоциаций» для выявления и отбора (евгенической селекции) потенциальных элитных атлетов, как минимум, преждевременны.
Пока что нельзя исключить, что такие действия принесут больше вреда, чем пользы.
Дай бог, если их последствия будут нейтральными.
More Related Content
Similar to Format for the population data in forensic genetics ppt
We know that we are in an AI take-off, what is new is that we are in a math take-off. A math take-off is using math as a formal language, beyond the human-facing math-as-math use case, for AI to interface with the computational infrastructure. The message of generative AI and LLMs (large language models like GPT) is not that they speak natural language to humans, but that they speak formal languages (programmatic code, mathematics, physics) to the computational infrastructure, implying the ability to create a much larger problem-solving apparatus for humanity-benefitting applications in biology, energy, and space science, however not without risk.
Multi-trait modeling in polygenic scores
2022.03.02 Bioinformatics seminar at University of Osaka, Japan
複数の表現型を考慮したポリジェニック・スコア解析
2022.03.02 バイオインフォマティクスセミナー @ 大阪大学
Exploiting NLP for Digital Disease InformaticsNigel Collier
Accurate and timely collection of facts from a range of text sources is crucial for supporting the work of experts in detecting and understanding highly complex diseases. In this talk I illustrate several applications using techniques that baseline Natural Language Processing (NLP) pipelines against human-curated biomedical gold standards. (1) In the BioCaster project , high throughput text mining on multilingual news was employed to map infectious disease outbreaks. In order to detect norm violations we show the effectiveness of a range of time series analysis algorithms evaluated against ProMED-mail; (2)In the PhenoMiner project, using an ensemble approach together with SVM learn-to-rank, we show how named entity recognition can achieve improved levels of performance for biomedical concepts. We show however that performance still remains fragile when adapting to new disease domains; (3) Finally, I will discuss how in the SIPHS project we are building concept recognition systems based on deep learning to understand the ‘voice of the patient’ in social media messages.
Четыре величайших предвидения Кольцова:
1. Вопрос о форме (морфе) белковой молекулы лежит в основе всей проблемы физико-химической природы морфологии организмов.
2. Хромосомы – это огромные белковые молекулы, в которых радикалы распределены в определенном для каждого вида порядке.
3. Сложность такой молекулы столь велика, что ее копия не может создаваться в клетке заново.
Она возникает только при наличии в клетке уже готовой молекулы – затравки.
Omnis molecula ex molecula
4. Радикалы хромосомной молекулы занимают в ней совершенно определенное место, и малейшие химические изменения в этих радикалах, должны являться источником новых мутаций.
Научно слабо обоснованные попытки практического использования плохо воспроизводимых результатов плохо спланированных исследований генетических «ассоциаций» для выявления и отбора (евгенической селекции) потенциальных элитных атлетов, как минимум, преждевременны.
Пока что нельзя исключить, что такие действия принесут больше вреда, чем пользы.
Дай бог, если их последствия будут нейтральными.
1. Масштабные атомно-молекулярные модели – эффективный инструмент обучения и исследований.
2. Устанавливая структуру биомолекулы, желательно задумываться о ее биологическом предназначении, о ее биологической функции.
3. Реакции ДНК с различными агентами являются топохимическими и/или контекст-зависимыми.
4. Без подтверждения структуры с помощью масштабных молекулярных моделей всяческие гипотетические построения почти наверняка обречены на провал.
5. Наука редко отвечает на вопрос «почему?», чаще только на вопрос «как?».
6. Современная структурная химия исключительно плодотворно объясняет и предсказывает геометрию молекул.
7. Таутомерный и ионизационный механизмы возникновения замен пар оснований – идеи, которые трудно доказать.
8. «Конформационный» механизм неправильного спаривания оснований представляется наиболее правдоподобным.
9. Никакими силами и никогда мы не сможем избавиться от радиоактивных изотопов в природе.
10. Фотодимеризация тимина – яркая демонстрация топохимической природы реакций мутагена с ДНК.
11. Озоновый слой в стратосфере действительно надежно защищает нас от вредного мутагенно-канцерогенного действия коротковолнового УФ-излучения.
12. Основной (мажорный) продукт реакции агента с ДНК далеко не всегда является основной причиной его мутагенности. Чем мягче мутаген, тем больше его видовая и локусная специфичности.
13. Делетогены и амплигены заслуживают пристального внимания.
14. Анализ и сравнение мутационных спектров – продуктивный подход к выявлению сходства и различий между мутагенами.
Bad reproducibility of experimental results becomes a systemic problem in biomedicine. One of the main reason of this is inadequate statistical analysis. Statistical analysis should be comprehensive harmonizing statistical evidences and predictions as well as frequentist and Bayesian approaches. It is insufficient to carry out the null hypothesis significance testing (NHST) reporting P-values. Statistical significance doesn’t mean clinical importance.
Effect size with confidence and prediction intervals should be reported. Experiments an/or observations should be repeated many-many times and their agreement should be investigated.
The best way is to repeat the experiments independently in different laboratories (in different countries).
Evolutionary medical genomics, whether we realize it or not, is the foundation of genetics of predispositions whose main goal should not be personalized prediction of disease risk, but to develop strategies for its treatment and prevention on the basis of the knowledge of its genetic, evolutionary history and molecular mechanisms.
Пока мы не выявим большинство полиморфизмов и не разберемся в их предназначении, до тех пор клиническое применение генетического тестирования будет оставаться малоинформативным и потому явно преждевременным.
Надо наложить мораторий на поспешные клинические (и иные практические) применения результатов генетики предрасположенностей.
Надо приостановить деятельность фирм и фирмочек, занимающихся гаданием на генной гуще и/или составлением генетических гороскопов.
Reproducibility and predictivity in the genetics of predispositions ppt 2013Nikita Khromov-Borisov
Poor reproducibility and low predictive values of the results in the genetics of predispositions become a systemic problem. Results of the
statistical quality control of genetic tests in the study should be supported with not only integral indices such as odds ratios (OR), but with the
post-test (posterior) predictive probabilities (PPV and NPV) and likelihood ratios (LR[+] and LR[-]). Usefulness of predictiveness graphs
for visualization of the relationships between the prevalence as a pretest (prior) probability of disease and predictive values PPV and NPV as
posttest (posterior) probabilities is demonstrated. Predictive capabilities of widely used genetic, observational, instrumental and immunological
diagnostic tests are discussed. Several examples of such tests are presented and it is shown that despite of their high statistical significance they
are not able to provide clinically important association between the disease and biomarker. The predictive power of the vast majority of genetic
markers (given very wide confidence intervals due to small sample sizes) differs little from the population prevalence of the disease. Extremely
rare the odds ratios in the studies on the genetics of dispositions exceed practically critical OR = 5. As a result, in most cases, recommendations
of medical geneticists are based on clinically negligible (though statistically significant) recognizablity and predictability of genetic markers.
«Nothing in biomedicine makes sense except in the light of evolution». From this point of view, genetics of predispositions has to answer
two basic questions:
1. Is the natural genetic polymorphism identified with modern genomics proved to be the result of neutral evolution or whether it is an
aggravated genetic (mutation) load determining the susceptibility to common diseases, which inexplicably has not been culled by the natural
selection well-timed?
2. Are effects of different predisposing alleles synergistic or at least additive when combined in a single genotype, or they are mutually
neutralized?
Evolutionary and population arguments help to understand that the «genetics of predispositions» studies natural balanced genetic
polymorphism, i.e. not newly formed alterations of genes (mutations), but alleles passed natural selection and fixed in human populations. Not
anomalies, not pathological or pathogenic variants of the genome are investigated, but infinite number of its natural, «normal» variants.
Thus the answers on the above two questions are:
1. Evolutionary medical genomics testifies that the vast majority of polymorphic variants of genes (alleles) that are observed in the
genomes of modern human populations are selectively neutral.
2. In many cases, the effects of various predisposing alleles are mutually neutralized through the mechanisms of opposite (antagonistic)
pleiotropy and homeostasis.
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...University of Maribor
Slides from:
11th International Conference on Electrical, Electronics and Computer Engineering (IcETRAN), Niš, 3-6 June 2024
Track: Artificial Intelligence
https://www.etran.rs/2024/en/home-english/
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...University of Maribor
Slides from talk:
Aleš Zamuda: Remote Sensing and Computational, Evolutionary, Supercomputing, and Intelligent Systems.
11th International Conference on Electrical, Electronics and Computer Engineering (IcETRAN), Niš, 3-6 June 2024
Inter-Society Networking Panel GRSS/MTT-S/CIS Panel Session: Promoting Connection and Cooperation
https://www.etran.rs/2024/en/home-english/
hematic appreciation test is a psychological assessment tool used to measure an individual's appreciation and understanding of specific themes or topics. This test helps to evaluate an individual's ability to connect different ideas and concepts within a given theme, as well as their overall comprehension and interpretation skills. The results of the test can provide valuable insights into an individual's cognitive abilities, creativity, and critical thinking skills
This presentation explores a brief idea about the structural and functional attributes of nucleotides, the structure and function of genetic materials along with the impact of UV rays and pH upon them.
Professional air quality monitoring systems provide immediate, on-site data for analysis, compliance, and decision-making.
Monitor common gases, weather parameters, particulates.
Toxic effects of heavy metals : Lead and Arsenicsanjana502982
Heavy metals are naturally occuring metallic chemical elements that have relatively high density, and are toxic at even low concentrations. All toxic metals are termed as heavy metals irrespective of their atomic mass and density, eg. arsenic, lead, mercury, cadmium, thallium, chromium, etc.
The ability to recreate computational results with minimal effort and actionable metrics provides a solid foundation for scientific research and software development. When people can replicate an analysis at the touch of a button using open-source software, open data, and methods to assess and compare proposals, it significantly eases verification of results, engagement with a diverse range of contributors, and progress. However, we have yet to fully achieve this; there are still many sociotechnical frictions.
Inspired by David Donoho's vision, this talk aims to revisit the three crucial pillars of frictionless reproducibility (data sharing, code sharing, and competitive challenges) with the perspective of deep software variability.
Our observation is that multiple layers — hardware, operating systems, third-party libraries, software versions, input data, compile-time options, and parameters — are subject to variability that exacerbates frictions but is also essential for achieving robust, generalizable results and fostering innovation. I will first review the literature, providing evidence of how the complex variability interactions across these layers affect qualitative and quantitative software properties, thereby complicating the reproduction and replication of scientific studies in various fields.
I will then present some software engineering and AI techniques that can support the strategic exploration of variability spaces. These include the use of abstractions and models (e.g., feature models), sampling strategies (e.g., uniform, random), cost-effective measurements (e.g., incremental build of software configurations), and dimensionality reduction methods (e.g., transfer learning, feature selection, software debloating).
I will finally argue that deep variability is both the problem and solution of frictionless reproducibility, calling the software science community to develop new methods and tools to manage variability and foster reproducibility in software systems.
Exposé invité Journées Nationales du GDR GPL 2024
Seminar of U.V. Spectroscopy by SAMIR PANDASAMIR PANDA
Spectroscopy is a branch of science dealing the study of interaction of electromagnetic radiation with matter.
Ultraviolet-visible spectroscopy refers to absorption spectroscopy or reflect spectroscopy in the UV-VIS spectral region.
Ultraviolet-visible spectroscopy is an analytical method that can measure the amount of light received by the analyte.
Format for the population data in forensic genetics ppt
1. PROPOSALS FOR THE FORMAT
FOR POPULATION DATA BASES
AND THEIR ANALYSIS
A. G. Smolyanitsky1, N. N. Khromov-Borisov1, G. B.A. G. Smolyanitsky1, N. N. Khromov-Borisov1, G. B.
Lazzarotto2 and T. B. L. Kist2
1Forensic Medicine Bureau of Leningrad District, Saint
Petersburg, Russia
2Institute of Biosciences, Federal University of Rio Grande do
Sul, Porto Alegre, Brazil
Andrew.Smolyanitsky@yandex.ru
Nikita.KhromovBorisov@gmail.com
Gustavo.Lazzarotto@terra.com.br
Kist@molgen.mpg.de
2. DNA-PCR Data Banks
DNA-PCR Databank: http://www.uni-
duesseldorf.de/WWW/MedFak/Serology/database.
html
DB on Nuclear DNADB on Nuclear DNA
http://www.ertzaintza.net/cgi-bin/
db2www.exe/adn.d2w/INPUT?IDIOMA=INGLES
World population data
J. Forensic Sci. 45 (1) 118-146 (2000)
CODIS STR loci data
J. Forensic Sci. 46 (3) 453-489 (2001)
3. Precision and accuracy
Sometime inaccurate calculation or
presentation of relative allele
frequencies are observedfrequencies are observed
Precision up to three significant
digits appear to be not sufficient
4. Round-off
Sometimes the sum of the frequencies is not
equal to unit due to low precision or round-off
errors, such as, e.g., 0.879 or 1.123
Sometime it is difficult to round-off correctly
the recalculated absolute frequencies, such as,
e.g., 18.51 or 75.48
As a result their sum may be odd or not equal to
the published value
5. Uncertainties
Some data sets appear to be completely
identical
Such duplications may result from the
fact that they are reproduced infact that they are reproduced in
different publications
SANCT software permits to identify
them in very large DB automatically
6. Independence
Some data sets seems to be non-
independent: preliminary data
published earlier are then combined
with the new data in subsequentwith the new data in subsequent
publications
SANCT software facilitates their
detection
7. Collapsability
Sometime rare alleles are combined with
the nearest ones, e.g., 14+15+16
SANCT puts this manipulation on the solidSANCT puts this manipulation on the solid
statistical ground:
Categories (both, alleles and/or samples)
are combined (collapsed) not arbitrarily,
but those which are statistically
homogeneous, e.g., 14+21
8. Precision
Compute relative frequencies with at least
four or even more significant digits (GDA)
Check the equality of their sum to unit:
Sum (pi)=1.0000
Check the “re-computability” of the initial
absolute counts:
Sum (pi ×N)=N
9. Show individual genotypes
when feasible
ID Locus A Locus B Locus Z
Xx-xxx 3.2/7 --/-- 6/6Xx-xxx
1
3.2/7
3207
--/--
0000
6/6
0606
Yy-yyy
2
6/14
0614
17/18
1718
9/9.3
0093
FSTAT is able to detect 0093 as an error
12. Show absolute counts
Present genotype counts in form of
triangle matrix.
Such presentation visualizes theSuch presentation visualizes the
“saturation” of the data and permits to
present important information on the
partial fixation indices in compact form
on the same matrix.
13. Template for genotype and allele counts,
partial fixation indices and relative allele
frequencies
Locus: GC n = 196
Allele A B C fii Ni pi
A 25 0.06 0.08 0.08 131 0.3308A 25 0.06 0.08 0.08 131 0.3308
B 14 2 0.06 -0.03 45 0.1136
C 67 27 63 0.04 220 0.5556
Total 0.044 396 1.0000
GDA software provides computing fii
14. Availability
“Open and show all your data”,
visualization and “statistification”
or GSP (Good Statistics Practice)or GSP (Good Statistics Practice)
must be the main principles in data
basing.
Make all your data available to the
users preferably online or under
request from the authors.