NS-CUK Journal club: H.E.Lee, Review on " A biomedical knowledge graph-based method for drug–drug interactions prediction through combining local and global features with deep neural networks", Bioinformatics 2022
1) The document proposes a deep learning framework called DeepLGF to predict drug-drug interactions by combining local and global feature extraction from biomedical knowledge graphs.
2) DeepLGF uses graph neural networks and knowledge graph embedding methods to extract local drug features from chemical structures and biological functions, and global features from the relationships between drugs and other biological entities.
3) Experimental results on prediction tasks using several drug interaction datasets demonstrate that DeepLGF outperforms other state-of-the-art models and has promising applications in drug development and clinical use.
NS-CUK Seminar: J.H.Lee, Review on "Graph Propagation Transformer for Graph R...ssuser4b1f48
More Related Content
Similar to NS-CUK Journal club: H.E.Lee, Review on " A biomedical knowledge graph-based method for drug–drug interactions prediction through combining local and global features with deep neural networks", Bioinformatics 2022
Using open bioactivity data for developing machine-learning prediction models...Sunghwan Kim
Similar to NS-CUK Journal club: H.E.Lee, Review on " A biomedical knowledge graph-based method for drug–drug interactions prediction through combining local and global features with deep neural networks", Bioinformatics 2022 (20)
NS-CUK Journal club: H.E.Lee, Review on " A biomedical knowledge graph-based method for drug–drug interactions prediction through combining local and global features with deep neural networks", Bioinformatics 2022
1. Hyo Eun Lee
Network Science Lab
Dept. of Biotechnology
The Catholic University of Korea
E-mail: gydnsml@gmail.com
2023.08.15
Bioinformatics 2022
2. 1
Introduction
• Previous Research
• Purpose and Goal
Method
• Representation of drug CS
• Method of global feature extracting
• BF feature learning module
• Strategies of information integration
Experimental
• Dataset
• Baseline models and Evaluation criteria
Results
Conclusion
3. 2
1. Introduction
Previous Research
• Combining drugs can lead to synergistic therapeutic effects
, but can also have side effects such as patient death due to potential interactions
• Traditional clinical experimentation methods are time-consuming and cumbersome
• DDI prediction models need to evolve to simplify the detection process and reduce
experimental complexity
4. 3
1. Introduction
Previous Research
• Previous DDI prediction methods can be separated into
heterogeneous network-based method and homogeneous network-based method
• A homogeneous network-based method considers only one type of node and edge
, which limits its inductive capability and prediction performance
• The heterogeneous network-based method overcomes the limitations of homogeneous
networks by considering different types of nodes and edges
• Recently, KG was also proposed to use data integration
and federation techniques to represent complex relationships between objects
• However, focusing on just simple data fusion, triple truths, can limit inductive capability
5. 4
1. Introduction
Purpose and Goal
• Proposing a deep learning framework called DeepLGF
to predict drug-drug interactions (DDIs) by aggregating local-global multi-information
based on biomedical knowledge graph (BKG)
• Propose an embedding method that includes chemical structure (CS) and
biological function (BF) as well as interactions with other biological entities
• Constructed a drug-receptor graph to fully train the BFs
and used BFGNN to generate drug embedding vectors
• Integrate multiple information without losing the ability to induce
, but make full use of the unique characteristics of BKGs as local information
7. 6
2. Method
Representation of drug CS
• Extract features embedded in the drug structure using a method similar to the text embedding
method
• Propose a PB-DBOW method similar to PV-DBOW to predict words within a document
embedding vector.
8. 7
2. Method
Representation of drug CS
• PB-DBOW is based on skip-grams to compute the embedding vector through the weight matrix
of the center word
ℎ𝑐 = 𝑤𝑇
𝑥𝑐
• The loss function maximizes the probability of occurrence across the N context words for each
center word
𝐸 = log 𝑝 𝑣𝑜 𝑣𝑐 = 𝑢𝑜
𝑇ℎ𝑐 − log
𝑖∈𝑁
exp 𝑢𝑖
𝑇
ℎ𝑐
• Similarly, the method proposed in this paper learns to maximize the probability of a center word
occurring in sentences
𝐸 = log 𝑝 𝑣𝑜 𝑠𝑒𝑛𝑡𝑐 = 𝑢𝑜
𝑇
ℎ𝑐 − log
𝑖∈𝑁
exp 𝑢𝑖
𝑇
ℎ𝑐
9. 8
2. Method
Method of global feature extracting
• Using ComplEX, favorable for capturing antisymmetric relationships
, to obtain feature embedding vectors for associations between biological entities using BKGs
• Global representations can be learned through direct connections or potential flows
between nodes
10. 9
2. Method
Method of global feature extracting
• BKG Definitions
𝐺𝐾𝐺 = ℎ, 𝑟, 𝑡 ℎ, 𝑡 𝜖 𝐸, 𝑟 𝜖 𝑅 }
• Calculate the probability that an interaction exists between two objects
using relation adjacent matrices using a scoring function and a sigmoid function
𝑃 𝑌ℎ𝑖,𝑟𝑖,𝑡𝑖
= 1 = 𝜎(𝜙(𝑟𝑖, ℎ𝑖, 𝑡𝑖; 𝛩))
𝜙 𝑟𝑖, ℎ𝑖, 𝑡𝑖; 𝛩 = 𝑅𝑒
𝑘=1
𝐾
𝑤𝑟𝑘𝑒ℎ𝑘𝑒𝑡𝑘
𝐿𝑜𝑠𝑠 = min 𝑖
𝑁
log 1 + exp −𝑌ℎ𝑖,𝑟𝑖,𝑡𝑖
𝜙 𝑟𝑖, ℎ𝑖, 𝑡𝑖; 𝛩 + 𝜆||𝛩||2
2
11. 10
2. Method
BF feature learning module
• Propose BFGNN to leverage BF information to provide local details
and improve feature learning representation
• Different datasets have different amounts of BF information, so you need to build different
networks to extract features
15. 14
3. Experimental
Data set
• DrugBank
: Online database, comprehensive resource containing chemical, pharmacological and
pharmaceutical data
• SMILES - used as drug CS information
• Biological heterogeneous associations
- extract BF information, including protein targets, enzymes, transporters, and carriers
• DRKG
: Comprehensive BKG used for DDI prediction with BKG and for drug discovery and drug
repurposing
• used to extract information about GI, drug-drug interactions, drug-target interactions
, and other biomedical entities and relationships
17. 16
3. Experimental
Data set
• Since different heterogeneous networks need to be constructed depending on the BF
information, the dataset is divided into four main categories.
18. 17
3. Experimental
Data set
• Paper tests performance on three prediction tasks
• 𝑇𝑘𝑘 : randomly select pairwise drugs
• 𝑇𝑘𝑛 : predicting interactions between known drugs
and new drugs
• 𝑇𝑛𝑛 : predicting interactions between new drugs
33. 32
5. Conclusion
Conclusion
• A deep learning framework called DeepLGF is proposed for predicting potential drug-drug
interactions (DDIs) by fully utilizing local-global information.
• The framework combines natural language processing, graph neural networks, and
knowledge graph embedding methods to extract local and global features from biomedical
knowledge graphs.
• The authors also design four aggregation methods to fuse local-global features and
achieve advanced embedding of drugs.
• The performance of DeepLGF is evaluated based on several prediction tasks and datasets,
and the results show that DeepLGF outperforms state-of-the-art models.
• DeepLGF performs remarkably well and can be applied to a variety of prediction tasks,
making it promising for predicting potential DDIs in drug development and clinical
applications, this paper concludes.
Editor's Notes
이 방법은 단일 관점 정보만 사용하고 bkg는 고려하지 않지만, 기존 접근 방법과 달리 상세하고 포괄적인 약물 측면의 정보 흐름을 제공 할 수 있다.
Re함수와 transE에 대한 언급 필요
5-fold CV 사용하여 데이터를 90 10으로 나누어 진행
DS1과 DS3는 제안된 모델의 안정성과 견고성을 평가 → 새 데이터 세트에 대한 모델의 일반화 가능성