Gene-Disease Association Prediction Using Network Centrality Measures

Identifying Gene-Disease Associations

• Importance of understanding the role of genetics in diseases
• Difficulty of determining disease-related genes manually
• Usefulness of predicting good candidate genes
Motivation

The most central genes in an interaction network for a disease are likely to be related to the disease.
Hypothesis

1. Collect data.
2. Make interaction network.
3. Evaluate centrality of genes.
4. Verify results of the centralities.
Experiment Steps

1. Collect known disease-related genes (seed genes).
• OMIM
2. Normalize gene names.
• HGNC database
3. Collect potential interaction sentences.
• PMC Open Access corpus
1. Collect data
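
The name-normalization step can be sketched as a lookup from aliases to approved symbols. This is a minimal illustration only: the alias table below is a tiny hand-written stand-in for an HGNC export, and `normalize_gene` is a hypothetical helper, not part of the original work. Uppercasing every name is a simplification, since a few approved symbols contain lowercase letters.

```python
# Toy stand-in for an HGNC alias table (assumption: tiny, hand-written subset).
APPROVED_SYMBOLS = {"TP53", "PTEN", "AR"}
ALIAS_TO_SYMBOL = {"P53": "TP53", "MMAC1": "PTEN"}

def normalize_gene(name):
    """Map a gene mention to an approved symbol, or None if it is unknown."""
    key = name.strip().upper()  # simplification: ignore case and padding
    if key in APPROVED_SYMBOLS:
        return key
    return ALIAS_TO_SYMBOL.get(key)

mentions = ["p53", " pten ", "AR", "notagene"]
normalized = [normalize_gene(m) for m in mentions]
```

Unknown mentions map to `None`, so they can be dropped before sentences are collected.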

1. Make a list of interaction words.
2. Select sentences that contain a seed gene, other genes, and interaction words.
3. Classify sentences based on dependency parsing and an SVM.
4. Link two genes that are related.
2. Make interaction network
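
The network-building steps above can be sketched as follows. Everything here is illustrative: the interaction-word list, the example sentences, and the gene sets are made up for the sketch, and `is_interaction` is a placeholder that accepts every pair where the original work uses dependency parsing and an SVM. Linking every gene pair in an accepted sentence is also an assumption.

```python
import re
from itertools import combinations

# Hypothetical interaction-word list; the real list is not given in the slides.
INTERACTION_WORDS = {"activates", "inhibits", "binds", "phosphorylates"}

def genes_in(sentence, all_genes):
    """Return the known gene symbols mentioned in a sentence."""
    return set(re.findall(r"[A-Za-z0-9]+", sentence)) & all_genes

def is_candidate(sentence, seed_genes, all_genes):
    """Step 2: a seed gene, at least one other gene, and an interaction word."""
    genes = genes_in(sentence, all_genes)
    words = {w.lower() for w in re.findall(r"[A-Za-z]+", sentence)}
    return bool(genes & seed_genes) and len(genes) >= 2 and bool(words & INTERACTION_WORDS)

def is_interaction(sentence, gene_a, gene_b):
    """Step 3 placeholder for the dependency-parsing + SVM classifier."""
    return True

def build_network(sentences, seed_genes, all_genes):
    """Step 4: link gene pairs from accepted sentences (undirected adjacency sets)."""
    network = {}
    for sentence in sentences:
        if not is_candidate(sentence, seed_genes, all_genes):
            continue
        for a, b in combinations(sorted(genes_in(sentence, all_genes)), 2):
            if is_interaction(sentence, a, b):
                network.setdefault(a, set()).add(b)
                network.setdefault(b, set()).add(a)
    return network

# Toy input, invented for illustration.
sentences = [
    "AR activates KLK3 in prostate cells.",
    "TP53 and MDM2 were measured.",
    "PTEN inhibits AKT1 signaling.",
]
all_genes = {"AR", "KLK3", "TP53", "MDM2", "PTEN", "AKT1"}
network = build_network(sentences, seed_genes={"AR", "PTEN"}, all_genes=all_genes)
```

The second sentence is skipped because it contains neither a seed gene nor an interaction word.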

• Degree Centrality: count of neighbors
• Eigenvector Centrality: proportional to the sum of the centralities of its neighbors
• Closeness Centrality: reciprocal of the sum of the distances from it to other nodes
• Betweenness Centrality: sum, over pairs of other nodes, of the fraction of shortest paths that pass through it
3. Evaluate centrality of genes
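
The four measures can be sketched in plain Python on an undirected, unweighted graph stored as a dict of neighbor sets. This is a sketch under assumptions, not the original implementation: closeness is scaled by the number of reachable nodes, eigenvector centrality uses power iteration on A + I (which keeps the same eigenvectors but avoids oscillation on bipartite graphs), and betweenness follows Brandes' algorithm without normalization. The four-node star graph is a made-up example.

```python
from collections import deque

def degree_centrality(graph):
    return {n: len(nbs) for n, nbs in graph.items()}

def closeness_centrality(graph):
    result = {}
    for source in graph:
        dist, queue = {source: 0}, deque([source])
        while queue:  # breadth-first search for shortest-path lengths
            v = queue.popleft()
            for w in graph[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
        total = sum(dist.values())
        result[source] = (len(dist) - 1) / total if total else 0.0
    return result

def eigenvector_centrality(graph, iterations=100):
    score = {n: 1.0 for n in graph}
    for _ in range(iterations):  # power iteration on A + I
        new = {n: score[n] + sum(score[w] for w in graph[n]) for n in graph}
        norm = sum(x * x for x in new.values()) ** 0.5 or 1.0
        score = {n: x / norm for n, x in new.items()}
    return score

def betweenness_centrality(graph):
    bc = {n: 0.0 for n in graph}
    for s in graph:
        order, preds = [], {n: [] for n in graph}
        sigma = {n: 0 for n in graph}  # number of shortest paths from s
        sigma[s] = 1
        dist, queue = {s: 0}, deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in graph[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = {n: 0.0 for n in graph}
        for w in reversed(order):  # accumulate dependencies, farthest first
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                bc[w] += delta[w]
    return {n: x / 2 for n, x in bc.items()}  # undirected: each pair seen twice

# Toy star graph: B is the hub, A, C and D are leaves.
star = {"A": {"B"}, "B": {"A", "C", "D"}, "C": {"B"}, "D": {"B"}}
degree = degree_centrality(star)
closeness = closeness_centrality(star)
eigenvector = eigenvector_centrality(star)
betweenness = betweenness_centrality(star)
```

On the star graph all four measures rank the hub `B` first, which is the behavior the hypothesis relies on.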

• Collect confirmed data for evaluation: Prostate Gene Database (PGDB), PubMed, KEGG.
• Set a baseline: appearance count together with seed genes.
4. Verify results of the centralities
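
One way to compare a centrality ranking with the baseline is precision at k against the confirmed gene set. The slides do not name the evaluation metric, so precision at k is an assumption here, and all scores, counts, and gene labels below are invented placeholders rather than results.

```python
def precision_at_k(scores, confirmed, k, exclude=()):
    """Fraction of the k top-scoring genes (seed genes excluded) that are confirmed."""
    candidates = [g for g in scores if g not in exclude]
    # Sort by descending score; break ties by name so the ranking is deterministic.
    top = sorted(candidates, key=lambda g: (-scores[g], g))[:k]
    return sum(g in confirmed for g in top) / len(top) if top else 0.0

# Invented placeholder values, for illustration only.
centrality_scores = {"g1": 0.9, "g2": 0.5, "g3": 0.4, "g4": 0.1}
baseline_counts = {"g1": 3, "g2": 7, "g3": 1, "g4": 5}  # appearances with seed genes
confirmed = {"g1", "g3"}

centrality_precision = precision_at_k(centrality_scores, confirmed, k=2)
baseline_precision = precision_at_k(baseline_counts, confirmed, k=2)
```

Passing the seed genes as `exclude` keeps already-known genes from inflating the score.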
