29. GeneBank
Cluster SWISS-PROT
Super Computer
Hardware Database Biomedical text analysis
Information
Retrieval
Genomic Variation
Research Sequence alignment
Algorithm
Machine Learning
Classfication
Personalized
Clustering
Pattern recognition
Agent
Medicine
Information filtering
Monitoring agent
Personal Genome
IT
52. ...ATTAGGACCAATAAGTCT...
Person 1 +
...ATTAGGAGCAATAAGTCT...
...ATTAGGAGCAATAACTCT...
Person 2 -
...ATTAGGAGCAATAAGTCT...
0.5M SNP, cohort of 5K individuals, 10k random data set
Text
Single locus Pair of locii
(+,-) (+,-)
Cost = 0.5M x 5k x 10k Cost = 1/2 x 0.5M x 0.5M x 5k x 10k
1 day 120 yr
Computational problem