Slideshow transcript
Slide 7: X X ∩Y Y p(x, y) I(X; Y ) = p(x, y) log p(x)p(y) y∈Y x∈X |X ∩ Y | min(|X|, |Y |)
Slide 8: $ curl \"http://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi? db=pubmed&term=CDK2\" <?xml version=\"1.0\"?> <!DOCTYPE eSearchResult PUBLIC \"-//NLM//DTD eSearchResult, 11 May 2002//EN\" \"http://www.ncbi.nlm.nih.gov/entrez/query/DTD/ eSearch_020511.dtd\"> <eSearchResult> <Count>3778</Count> <RetMax>20</RetMax> <RetStart>0</RetStart> <IdList> <Id>17904841</Id> <Id>17904366</Id> <Id>17893107</Id> () </eSearchResult>
Slide 9: $ curl \"http://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi? db=pubmed&term=CDK6\" () <eSearchResult> <Count>740</Count> <RetMax>20</RetMax> <RetStart>0</RetStart> () </eSearchResult> $ curl \"http://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi? db=pubmed&term=CDK2+CDK6\" () <eSearchResult> <Count>321</Count> <RetMax>20</RetMax> <RetStart>0</RetStart> () </eSearchResult>
Slide 10: 321 |X ∩ Y | = min(|X|, |Y |) min(3778, 740) 321 = = 0.438 740
Slide 13: $ ruby simpson.rb CDK2 CDK6 CDK2 CDK6 3778 742 321 0.432614555256065
Slide 14: #!/usr/bin/env ruby require 'rexml/document' require 'open-uri' def count(gene) fp = open(\"http://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi? db=pubmed&term=\"+gene) source = fp.read fp.close doc = REXML::Document.new source return doc.elements['/eSearchResult/Count'].text.to_i end def simpson(gene1_count, gene2_count, gene12_count) if gene1_count <= 0 || gene2_count <= 0 return nil elsif gene1_count < gene2_count return gene12_count.to_f / gene1_count.to_f end return gene12_count.to_f / gene2_count.to_f end
Slide 15: def main(gene1,gene2) gene1_count = count(gene1) gene2_count = count(gene2) gene12_count = count(gene1 + \"+\" + gene2) s = simpson(gene1_count, gene2_count, gene12_count) puts [gene1, gene2, gene1_count, gene2_count, gene12_count, s].join (\" \") end main(ARGV[0],ARGV[1])



Add a comment on Slide 1
If you have a SlideShare account, login to comment; else you can comment as a guest- Favorites & Groups
Showing 1-50 of 0 (more)