SlideShare a Scribd company logo
1 of 26
DB→RDF →SPARQL
                      (@yayamamo)

2011.7.23
Allie (   )



              Allie
danmachold
✤


                              (PubMed       2000            / )

✤




✤


    PC: personal computer, primary care, prostate cancer, ...
    Alcoholic chronic pancreatitis: ACP, alcoholic CP, ...


                                                                  Year
✤


                              (PubMed       2000            / )

✤




✤


    PC: personal computer, primary care, prostate cancer, ...
    Alcoholic chronic pancreatitis: ACP, alcoholic CP, ...


                                                                  Year
complementary DNA -
       cDNA
2000
 polymerase chain reaction
           PCR
Fourier transform infrared
           FTIR
scanning electron microscopy
            SEM




     MEDLINE
                               1000
Allie

✤


    mammalian target of rapamycin /

✤


    Neoplasms /        ,

✤


    mTOR - (1995) Mechanism of action of rapamycin: new insights into the
    regulation of G1-phase progression in eukaryotic cells.

✤                                     (       )
    mTOR - PI3K, AMPK, S6K1, ...
http://allie.dbcls.jp/
✤   ftp

    ✤




✤   API

    ✤   SOAP / REST
Allie   ?
RDF
 SPARQL

http://data.allie.dbcls.jp/sparql/
SPARQL
—
PairCluster
   ShortForm LongForm
     SPF     specific pathogen-free
                                       appearsIn      PubMedIDList

              contains
                                                      CoocurringShort
                                       cooccursWith
PairList                                                 FormList
   Pair
  ShortForm LongForm
     SPF     specific pathogen-free    inResearch
                                         AreaOf
                                                      ResearchArea
   Pair
  ShortForm LongForm
     spf     specified pathogen free
                                       frequency
Domain             Object property                 Range

   Pair                appearsIn                PubMedList

PairCluster             contains                   PairList

   Pair               cooccursWith           CooccurringShortFormList


   Pair             hasLongFormOf                LongForm

PairCluster   hasLongFormRepresentationOf        LongForm

                     hasMemberOf

   Pair             hasShortFormOf               ShortForm

PairCluster   hasShortFormRepresentationOf       ShortForm

   Pair            inResearchAreaOf            ResearchArea
9000


    11G        (            )
    443M           (gziped)

     134   (            )
—
    1400   (                )
Virtuoso vs. OWLIM-SE

OpenLink Virtuoso version 06.01.3127, Standard Edition
OWLIM-SE 4.0 / OpenRDF Workbench 2.4.0


                             Virtuoso             OWLIM-SE
                             50                    3
SPARQL
SPARQL                            ○                      ?
PREFIX   rdfs:<http://www.w3.org/2000/01/rdf-schema#>
PREFIX   rdf:<http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX   xsd:<http://www.w3.org/2001/XMLSchema#>
PREFIX   allie:<http://purl.org/allie/ontology/201102#>

select distinct ?F ?lfr ?L ?lf where {
  [] a allie:PairCluster;
    allie:contains [
      allie:hasMemberOf [
        allie:hasLongFormOf [
          rdfs:label ?lf;
          allie:frequency ?lfr ];
    ]];
    allie:frequency ?F;
    allie:hasShortFormRepresentationOf [ rdfs:label "SPF"@EN ];
    allie:hasLongFormRepresentationOf [ rdfs:label ?L ].
  FILTER ( lang(?L) = "en" && lang(?lf) = "en" )
}
ORDER BY DESC(xsd:integer(?F)) DESC(xsd:integer(?lfr))
PREFIX   rdfs:<http://www.w3.org/2000/01/rdf-schema#>
PREFIX   rdf:<http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX   xsd:<http://www.w3.org/2001/XMLSchema#>
PREFIX   allie: <http://purl.org/allie/ontology/201102#>

 select ?SF ?LF ?freq where {
   [] a allie:PairCluster;
     allie:frequency ?freq;
     allie:contains [
       allie:hasMemberOf [
         allie:hasShortFormOf [
           rdfs:label ?SF ];
         allie:hasLongFormOf [
           rdfs:label ?LF ];
     ]].
    FILTER (xsd:integer(?freq) > 10
            && !regex(?SF,"^s","i")
            && regex(?LF, "^specific", "i"))
 }
ORDER BY DESC (xsd:integer(?freq))
SPARQL?




✤




✤   Virtuoso

✤
✤   DBCLS




    ✤




    ✤
第2回LinkedData勉強会@yayamamo

More Related Content

Similar to 第2回LinkedData勉強会@yayamamo

The introduction of supernova system: a vector system for single-cell labelin...
The introduction of supernova system: a vector system for single-cell labelin...The introduction of supernova system: a vector system for single-cell labelin...
The introduction of supernova system: a vector system for single-cell labelin...Div. of Neurogenet., NIG
 
Querying the Semantic Web with SPARQL
Querying the Semantic Web with SPARQLQuerying the Semantic Web with SPARQL
Querying the Semantic Web with SPARQLEmanuele Della Valle
 
Linked Data for integrating life-science databases
Linked Data for integrating life-science databasesLinked Data for integrating life-science databases
Linked Data for integrating life-science databasesShuichi Kawashima
 
2009 Dils Flyweb
2009 Dils Flyweb2009 Dils Flyweb
2009 Dils FlywebJun Zhao
 
Bioinfomatics Presentation
Bioinfomatics PresentationBioinfomatics Presentation
Bioinfomatics PresentationZhenhong Bao
 
Dgaston dec-06-2012
Dgaston dec-06-2012Dgaston dec-06-2012
Dgaston dec-06-2012Dan Gaston
 
Fruit breedomics workshop wp6 from marker assisted breeding to genomics assis...
Fruit breedomics workshop wp6 from marker assisted breeding to genomics assis...Fruit breedomics workshop wp6 from marker assisted breeding to genomics assis...
Fruit breedomics workshop wp6 from marker assisted breeding to genomics assis...fruitbreedomics
 
SureChEMBL and Open PHACTS
SureChEMBL and Open PHACTSSureChEMBL and Open PHACTS
SureChEMBL and Open PHACTSGeorge Papadatos
 
Apollo Exercises Kansas State University 2015
Apollo Exercises Kansas State University 2015Apollo Exercises Kansas State University 2015
Apollo Exercises Kansas State University 2015Monica Munoz-Torres
 
Annotating nc-RNAs with Rfam
Annotating nc-RNAs with RfamAnnotating nc-RNAs with Rfam
Annotating nc-RNAs with RfamLuca Cozzuto
 
RNA-Seq transcriptome analysis of Gonium pectorale cell cycle.
RNA-Seq transcriptome analysis of Gonium pectorale cell cycle.RNA-Seq transcriptome analysis of Gonium pectorale cell cycle.
RNA-Seq transcriptome analysis of Gonium pectorale cell cycle.Jennifer Shelton
 
RNA-Seq transcriptome analysis of Gonium pectorale cell cycle
RNA-Seq transcriptome analysis of Gonium pectorale cell cycleRNA-Seq transcriptome analysis of Gonium pectorale cell cycle
RNA-Seq transcriptome analysis of Gonium pectorale cell cycleJennifer Shelton
 
2010 03 Lodoxf Openflydata
2010 03 Lodoxf Openflydata2010 03 Lodoxf Openflydata
2010 03 Lodoxf OpenflydataJun Zhao
 
2009 0807 Lod Gmod
2009 0807 Lod Gmod2009 0807 Lod Gmod
2009 0807 Lod GmodJun Zhao
 
An Overview to Protein bioinformatics
An Overview to Protein bioinformaticsAn Overview to Protein bioinformatics
An Overview to Protein bioinformaticsJoel Ricci-López
 

Similar to 第2回LinkedData勉強会@yayamamo (20)

The introduction of supernova system: a vector system for single-cell labelin...
The introduction of supernova system: a vector system for single-cell labelin...The introduction of supernova system: a vector system for single-cell labelin...
The introduction of supernova system: a vector system for single-cell labelin...
 
Querying the Semantic Web with SPARQL
Querying the Semantic Web with SPARQLQuerying the Semantic Web with SPARQL
Querying the Semantic Web with SPARQL
 
Crispr/cas9 101
Crispr/cas9 101Crispr/cas9 101
Crispr/cas9 101
 
Linked Data for integrating life-science databases
Linked Data for integrating life-science databasesLinked Data for integrating life-science databases
Linked Data for integrating life-science databases
 
2009 Dils Flyweb
2009 Dils Flyweb2009 Dils Flyweb
2009 Dils Flyweb
 
Bioinfomatics Presentation
Bioinfomatics PresentationBioinfomatics Presentation
Bioinfomatics Presentation
 
Building Data
Building DataBuilding Data
Building Data
 
Dgaston dec-06-2012
Dgaston dec-06-2012Dgaston dec-06-2012
Dgaston dec-06-2012
 
Fruit breedomics workshop wp6 from marker assisted breeding to genomics assis...
Fruit breedomics workshop wp6 from marker assisted breeding to genomics assis...Fruit breedomics workshop wp6 from marker assisted breeding to genomics assis...
Fruit breedomics workshop wp6 from marker assisted breeding to genomics assis...
 
SureChEMBL and Open PHACTS
SureChEMBL and Open PHACTSSureChEMBL and Open PHACTS
SureChEMBL and Open PHACTS
 
Apollo Exercises Kansas State University 2015
Apollo Exercises Kansas State University 2015Apollo Exercises Kansas State University 2015
Apollo Exercises Kansas State University 2015
 
Ismb2009
Ismb2009Ismb2009
Ismb2009
 
Annotating nc-RNAs with Rfam
Annotating nc-RNAs with RfamAnnotating nc-RNAs with Rfam
Annotating nc-RNAs with Rfam
 
BioSD Tutorial 2014 Editition
BioSD Tutorial 2014 EdititionBioSD Tutorial 2014 Editition
BioSD Tutorial 2014 Editition
 
RNA-Seq transcriptome analysis of Gonium pectorale cell cycle.
RNA-Seq transcriptome analysis of Gonium pectorale cell cycle.RNA-Seq transcriptome analysis of Gonium pectorale cell cycle.
RNA-Seq transcriptome analysis of Gonium pectorale cell cycle.
 
20140710 6 c_mason_ercc2.0_workshop
20140710 6 c_mason_ercc2.0_workshop20140710 6 c_mason_ercc2.0_workshop
20140710 6 c_mason_ercc2.0_workshop
 
RNA-Seq transcriptome analysis of Gonium pectorale cell cycle
RNA-Seq transcriptome analysis of Gonium pectorale cell cycleRNA-Seq transcriptome analysis of Gonium pectorale cell cycle
RNA-Seq transcriptome analysis of Gonium pectorale cell cycle
 
2010 03 Lodoxf Openflydata
2010 03 Lodoxf Openflydata2010 03 Lodoxf Openflydata
2010 03 Lodoxf Openflydata
 
2009 0807 Lod Gmod
2009 0807 Lod Gmod2009 0807 Lod Gmod
2009 0807 Lod Gmod
 
An Overview to Protein bioinformatics
An Overview to Protein bioinformaticsAn Overview to Protein bioinformatics
An Overview to Protein bioinformatics
 

More from yayamamo @ DBCLS Kashiwanoha

JSAI 2015 1G5-1 生命科学分野の日本語言語資源の整備と日本語コンテンツへのリンク
JSAI 2015 1G5-1 生命科学分野の日本語言語資源の整備と日本語コンテンツへのリンクJSAI 2015 1G5-1 生命科学分野の日本語言語資源の整備と日本語コンテンツへのリンク
JSAI 2015 1G5-1 生命科学分野の日本語言語資源の整備と日本語コンテンツへのリンクyayamamo @ DBCLS Kashiwanoha
 
SIG-SWO-A1402-09:SPINを用いたトリプルストアの性能評価システム
SIG-SWO-A1402-09:SPINを用いたトリプルストアの性能評価システムSIG-SWO-A1402-09:SPINを用いたトリプルストアの性能評価システム
SIG-SWO-A1402-09:SPINを用いたトリプルストアの性能評価システムyayamamo @ DBCLS Kashiwanoha
 
Towards Database Integration Through RDF & Linked Data
Towards Database Integration Through RDF & Linked DataTowards Database Integration Through RDF & Linked Data
Towards Database Integration Through RDF & Linked Datayayamamo @ DBCLS Kashiwanoha
 

More from yayamamo @ DBCLS Kashiwanoha (20)

D2RQ Mapper
D2RQ MapperD2RQ Mapper
D2RQ Mapper
 
Qaシステム解説
Qaシステム解説Qaシステム解説
Qaシステム解説
 
AJACS54 PubMed Allie inMeXes Colil
AJACS54 PubMed Allie inMeXes ColilAJACS54 PubMed Allie inMeXes Colil
AJACS54 PubMed Allie inMeXes Colil
 
JSAI 2015 1G5-1 生命科学分野の日本語言語資源の整備と日本語コンテンツへのリンク
JSAI 2015 1G5-1 生命科学分野の日本語言語資源の整備と日本語コンテンツへのリンクJSAI 2015 1G5-1 生命科学分野の日本語言語資源の整備と日本語コンテンツへのリンク
JSAI 2015 1G5-1 生命科学分野の日本語言語資源の整備と日本語コンテンツへのリンク
 
SWAT4LS 2014 SLIDE by Yamamoto
SWAT4LS 2014 SLIDE by YamamotoSWAT4LS 2014 SLIDE by Yamamoto
SWAT4LS 2014 SLIDE by Yamamoto
 
SIG-SWO-A1402-09:SPINを用いたトリプルストアの性能評価システム
SIG-SWO-A1402-09:SPINを用いたトリプルストアの性能評価システムSIG-SWO-A1402-09:SPINを用いたトリプルストアの性能評価システム
SIG-SWO-A1402-09:SPINを用いたトリプルストアの性能評価システム
 
トーゴーの日2014ポスター
トーゴーの日2014ポスタートーゴーの日2014ポスター
トーゴーの日2014ポスター
 
Made in "Jimoto"
Made in "Jimoto"Made in "Jimoto"
Made in "Jimoto"
 
第7回 Linked Data 勉強会 @yayamamo
第7回 Linked Data 勉強会 @yayamamo第7回 Linked Data 勉強会 @yayamamo
第7回 Linked Data 勉強会 @yayamamo
 
R intro
R introR intro
R intro
 
Linked Open Data
Linked Open DataLinked Open Data
Linked Open Data
 
Swc2013 yamamoto
Swc2013 yamamotoSwc2013 yamamoto
Swc2013 yamamoto
 
Ontology howto
Ontology howtoOntology howto
Ontology howto
 
第52回生命科学夏の学校
第52回生命科学夏の学校第52回生命科学夏の学校
第52回生命科学夏の学校
 
Ajacs33 文献の検索とその整理方法
Ajacs33 文献の検索とその整理方法Ajacs33 文献の検索とその整理方法
Ajacs33 文献の検索とその整理方法
 
第5回LinkedData勉強会@yayamamo
第5回LinkedData勉強会@yayamamo第5回LinkedData勉強会@yayamamo
第5回LinkedData勉強会@yayamamo
 
Towards Database Integration Through RDF & Linked Data
Towards Database Integration Through RDF & Linked DataTowards Database Integration Through RDF & Linked Data
Towards Database Integration Through RDF & Linked Data
 
LOD challenge day 2011 LT
LOD challenge day 2011 LTLOD challenge day 2011 LT
LOD challenge day 2011 LT
 
Ajacs27 TogoDoc, inMeXes, Allie
Ajacs27 TogoDoc, inMeXes, AllieAjacs27 TogoDoc, inMeXes, Allie
Ajacs27 TogoDoc, inMeXes, Allie
 
SADI practice
SADI practiceSADI practice
SADI practice
 

第2回LinkedData勉強会@yayamamo

  • 1. DB→RDF →SPARQL (@yayamamo) 2011.7.23
  • 2.
  • 3. Allie ( ) Allie
  • 5. (PubMed 2000 / ) ✤ ✤ PC: personal computer, primary care, prostate cancer, ... Alcoholic chronic pancreatitis: ACP, alcoholic CP, ... Year
  • 6. (PubMed 2000 / ) ✤ ✤ PC: personal computer, primary care, prostate cancer, ... Alcoholic chronic pancreatitis: ACP, alcoholic CP, ... Year
  • 8. 2000 polymerase chain reaction PCR Fourier transform infrared FTIR scanning electron microscopy SEM MEDLINE 1000
  • 9. Allie ✤ mammalian target of rapamycin / ✤ Neoplasms / , ✤ mTOR - (1995) Mechanism of action of rapamycin: new insights into the regulation of G1-phase progression in eukaryotic cells. ✤ ( ) mTOR - PI3K, AMPK, S6K1, ...
  • 11. ftp ✤ ✤ API ✤ SOAP / REST
  • 12.
  • 13. Allie ?
  • 16. — PairCluster ShortForm LongForm SPF specific pathogen-free appearsIn PubMedIDList contains CoocurringShort cooccursWith PairList FormList Pair ShortForm LongForm SPF specific pathogen-free inResearch AreaOf ResearchArea Pair ShortForm LongForm spf specified pathogen free frequency
  • 17.
  • 18.
  • 19. Domain Object property Range Pair appearsIn PubMedList PairCluster contains PairList Pair cooccursWith CooccurringShortFormList Pair hasLongFormOf LongForm PairCluster hasLongFormRepresentationOf LongForm hasMemberOf Pair hasShortFormOf ShortForm PairCluster hasShortFormRepresentationOf ShortForm Pair inResearchAreaOf ResearchArea
  • 20. 9000 11G ( ) 443M (gziped) 134 ( ) — 1400 ( )
  • 21. Virtuoso vs. OWLIM-SE OpenLink Virtuoso version 06.01.3127, Standard Edition OWLIM-SE 4.0 / OpenRDF Workbench 2.4.0 Virtuoso OWLIM-SE 50 3 SPARQL SPARQL ○ ?
  • 22. PREFIX rdfs:<http://www.w3.org/2000/01/rdf-schema#> PREFIX rdf:<http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX xsd:<http://www.w3.org/2001/XMLSchema#> PREFIX allie:<http://purl.org/allie/ontology/201102#> select distinct ?F ?lfr ?L ?lf where { [] a allie:PairCluster; allie:contains [ allie:hasMemberOf [ allie:hasLongFormOf [ rdfs:label ?lf; allie:frequency ?lfr ]; ]]; allie:frequency ?F; allie:hasShortFormRepresentationOf [ rdfs:label "SPF"@EN ]; allie:hasLongFormRepresentationOf [ rdfs:label ?L ]. FILTER ( lang(?L) = "en" && lang(?lf) = "en" ) } ORDER BY DESC(xsd:integer(?F)) DESC(xsd:integer(?lfr))
  • 23. PREFIX rdfs:<http://www.w3.org/2000/01/rdf-schema#> PREFIX rdf:<http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX xsd:<http://www.w3.org/2001/XMLSchema#> PREFIX allie: <http://purl.org/allie/ontology/201102#> select ?SF ?LF ?freq where { [] a allie:PairCluster; allie:frequency ?freq; allie:contains [ allie:hasMemberOf [ allie:hasShortFormOf [ rdfs:label ?SF ]; allie:hasLongFormOf [ rdfs:label ?LF ]; ]]. FILTER (xsd:integer(?freq) > 10 && !regex(?SF,"^s","i") && regex(?LF, "^specific", "i")) } ORDER BY DESC (xsd:integer(?freq))
  • 24. SPARQL? ✤ ✤ Virtuoso ✤
  • 25. DBCLS ✤ ✤

Editor's Notes

  1. \n
  2. \n
  3. \n
  4. \n
  5. \n
  6. \n
  7. \n
  8. \n
  9. \n
  10. \n
  11. \n
  12. \n
  13. \n
  14. \n
  15. \n
  16. \n
  17. \n
  18. \n
  19. \n
  20. \n
  21. \n
  22. \n
  23. \n
  24. \n
  25. \n