Study Support and Integration of Cultural Information Resources with Linked Data

A museum collection search system called Linked Open Data for Academia (LODAC) Museum has been developed using Linked Data. LODAC Museum identifies and associates artists, artworks, and museum information from several different museums to provide integrated data, which are published as Linked Data through a SPARQL endpoint.
(These slides were used at Culture and Computing 2011.)

Transcript

  • 1. Study Support and Integration of Cultural Information Resources with Linked Data. Tetsuro KAMURA (Doctoral Student, School of Multidisciplinary Sciences, Department of Informatics, The Graduate University for Advanced Studies (SOKENDAI)); Fumihiro KATO (National Institute of Informatics); Toru TAKAHASHI (ATR-Promotions, Inc.); Hiroshi UEDA (ATR-Promotions, Inc.); Ikki OHMUKAI (National Institute of Informatics); Hideaki TAKEDA (National Institute of Informatics). Sunday, December 4, 2011
  • 2. Agenda
      Introduction: Next Generation Web; Linked Data
      Approach: Gathering; Standardization; Integration & Association; Publishing & Sharing
      Applications: Yokohama Art Spot; Photo BURARI (LODAC version)
  • 3. Introduction
  • 4. Many Japanese museums have already built online databases and digitized their collections.
  • 5. Many Japanese museums have already built online databases and digitized their collections. Each museum has developed a separate collection management system with its own metadata, so it is difficult to retrieve relevant information by searching across multiple museums.
  • 6. What if we could search and use information from multiple sources on the Web? [Diagram: museums publish museum data, GIS sources publish GIS and facilities data, and local sources publish event data. Once published and shared, the data can be combined (×) into an original museum web service, a recommendation service, and a local-events application.] Using open data, we can find new frontiers and new studies, create web services, and so on.
  • 7. The museum field in particular: we take on the challenge of solving problems in the Japanese arts and culture fields with next-generation Web technology.
  • 8. A growing new way to distribute information on the Web
  • 9. Next Generation of Information Distribution. Existing Web = Web of Documents, e.g. information in PDF, HTML, or image formats. The data are already processed: to use them, you have to extract them from PDF files or strip the tags from HTML.
  • 10. Next Generation of Information Distribution. Existing Web = Web of Documents, e.g. information in PDF, HTML, or image formats. The data are already processed: to use them, you have to extract them from PDF files or strip the tags from HTML. Next-generation Web = Web of Data, e.g. SPARQL endpoints and RDF-format data. You refer to open data directly and can use the raw data immediately. The platform is called...
  • 11.
  • 12. Linked Data
  • 13. Basic Structure of Linked Data
  • 14. Basic Structure of Linked Data. An address describes some piece of information: http://lod.ac/id/359
  • 15. Basic Structure of Linked Data. An address describes some piece of information. When you access the URL http://lod.ac/id/359, you can look up the string "KANZAN". What does it mean?
  • 16. Basic Structure of Linked Data. http://lod.ac/id/359, creator's name, "KANZAN": we understand that http://lod.ac/id/359 describes a creator whose name is KANZAN.
  • 17. Basic Structure of Linked Data. Subject: http://lod.ac/id/359. Predicate: creator's name. Object: "KANZAN". We understand that http://lod.ac/id/359 describes a creator whose name is KANZAN. Linked Data consists of these three parts. This structure is called the RDF (Resource Description Framework) model.
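As a minimal sketch of the subject/predicate/object structure on slide 17, the statement can be held as a plain Python tuple and written out as one N-Triples line. Using foaf:name for the informal "creator's name" arc is an assumption borrowed from the later slides, and `Triple` and `to_ntriples` are hypothetical helper names, not part of LODAC.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    """One RDF statement: subject, predicate, object."""
    subject: str    # URI of the resource being described
    predicate: str  # URI of the property (the labeled arc)
    obj: str        # here always a plain string literal


# Assumption: foaf:name stands in for the slide's "creator's name" arc.
FOAF_NAME = "http://xmlns.com/foaf/0.1/name"

kanzan = Triple("http://lod.ac/id/359", FOAF_NAME, "KANZAN")


def to_ntriples(t: Triple) -> str:
    """Serialize a triple with a string-literal object as one N-Triples line."""
    escaped = t.obj.replace("\\", "\\\\").replace('"', '\\"')
    return f'<{t.subject}> <{t.predicate}> "{escaped}" .'


print(to_ntriples(kanzan))
```

The sketch only covers string-literal objects; an object that is itself a URI would be written in angle brackets instead of quotes.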
  • 18. Linking Data. A link node contains links to other information; a string node represents literal information (string, number, date).
      http://lod.ac/id/359, creator's name: "KANZAN"
      http://lod.ac/id/359, link to artwork: http://lod.ac/id/20029
      http://lod.ac/id/359, link to creator's reference: http://lod.ac/ref/359
      http://lod.ac/id/20029, title of artwork: "Autumn among Tre..." @en, "秋の木の間" @ja
      http://lod.ac/id/20029, collected: http://...
      http://lod.ac/ref/359, job: Japanese-style painter
      http://lod.ac/ref/359, was born in: 1873
  • 19. Linked Data represents information as a directed graph with labeled nodes and arcs. [Diagram: the same graph, with the title "Autumn among Trees" fully visible and the museum node http://lod.ac/id/912 coming into view.]
  • 20. Linked Data represents information as a directed graph with labeled nodes and arcs. [Diagram: the same graph, extended with the museum.]
      http://lod.ac/id/912 (museum), link to artwork: http://lod.ac/id/16510, http://lod.ac/id/17327, http://lod.ac/id/17412
      http://lod.ac/id/912, link to facilities reference: http://lod.ac/ref/912
      http://lod.ac/ref/912, phone number: 03-5777-8600
      http://lod.ac/ref/912, museum name: The Tokyo National Modern Museum
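The labeled directed graph can be sketched with an adjacency list. The edges below are transcribed from slides 18 to 20, with the slides' informal arc labels rather than real vocabulary URIs; `build_graph` and `reachable_literals` are hypothetical names used only for illustration.

```python
from collections import defaultdict

# (subject, arc label, object) edges taken from the slides. The labels are the
# slides' informal wording, not actual property URIs.
EDGES = [
    ("http://lod.ac/id/359", "creator's name", "KANZAN"),
    ("http://lod.ac/id/359", "link to artwork", "http://lod.ac/id/20029"),
    ("http://lod.ac/id/359", "link to creator's reference", "http://lod.ac/ref/359"),
    ("http://lod.ac/id/20029", "title of artwork", "Autumn among Trees"),
    ("http://lod.ac/ref/359", "was born in", "1873"),
]


def build_graph(edges):
    """Map each subject node to its outgoing (label, target) arcs."""
    graph = defaultdict(list)
    for subject, label, target in edges:
        graph[subject].append((label, target))
    return graph


def reachable_literals(graph, start):
    """Follow link nodes from `start` and collect (label, value) of string nodes.

    Simplification: a target counts as a link node only if it has outgoing
    arcs in this graph; everything else is treated as a string node.
    """
    seen, stack, found = set(), [start], []
    while stack:
        node = stack.pop()
        if node in seen:
            continue
        seen.add(node)
        for label, target in graph.get(node, []):
            if target in graph:
                stack.append(target)
            else:
                found.append((label, target))
    return sorted(found)


graph = build_graph(EDGES)
for label, value in reachable_literals(graph, "http://lod.ac/id/359"):
    print(f"{label}: {value}")
```

Starting from the creator node, the traversal reaches the artwork title and the birth year without the caller knowing in advance which resource holds them, which is the point the slides make about following links.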
  • 21. When a user wants to look up data: Current Web vs. Linked Data
  • 22. When a user wants to look up data: Current Web vs. Linked Data. Current Web (distributed information): the user has to search several websites and extract the data every time, sending a query to each site and processing and converting each result. Irritated user.
  • 23. When a user wants to look up data: Current Web vs. Linked Data. Current Web (distributed information): the user has to search several websites and extract the data every time, sending a query to each site and processing and converting each result. Irritated user. Linked Data (integrated information): the user sends one query to the integrated data. Happy user.
  • 24.
  • 25. LODAC Museum: Power for Arts & Culture with Linked Data. We applied Linked Data to Japanese museum information.
  • 26. Approach
  • 27. Gathering data: museums. Data used: Artwork. Amount of data per source:
      Catalog of the collections of 3 National Art Museums: 25,180
      National Museum of Western Art: 4,373
      Kyoto National Museum: 5,819
      Nara National Museum: 431
      Fukushima Pref. Art Museum: 20
      Tochigi Pref. Art Museum: 32
      Akita Pref. Art Museum: 22
      Iwate Pref. Art Museum: 1,588
      Tokushima Pref. Art Museum: 18,482
      Yamanashi Pref. Art Museum: 5,416
      Kagawa Higashiyama Kaii Setouchi Art Museum: 5,416
      Yokohama Art Museum: 6,286
      Note: this use of the data has not been officially authorized.
  • 28. Relevant sources (source, data used, amount of data): Database for National Treasure & Important Cultural Artwork 915 Property of National Designated Cultural Heritage Online Facilities 648 DBpedia Japanese (referred to DBpedia) Wikipedia - Geographical GIS data National and Regional Planning Bureau Facilities Artwork 266 Creator 3,800 The Japanese Art Thesaurus Association for Arts 1,332 Facilities 289 Overall 109,382. These sources cover a wide range of content types as already structured concepts, and contain metadata such as creator name, work title, era, owner, current location, and facilities.
  • 29. Scraping and processing sources. Museum websites (HTML, Perl, PHP), relevant source websites (HTML, Perl), and the Japanese Art Thesaurus (MS Excel sheets) are processed to extract their content as raw data.
  • 30. Standardization of data. The raw data are reorganized into common metadata: dc:title, crm:P45_consistOf, skos:prefLabel, ..., lodac:era. Current organizing policies: use existing metadata vocabularies (using strings as data only); define our own metadata.
  • 31. Prefix and metadata name:
      crm: CIDOC-CRM
      dc11: Dublin Core 1.1
      dc: DCMI Terms
      skos: Simple Knowledge Organization System
      rdfs: Resource Description Framework Schema
      foaf: Friend of a Friend
      rda2: Resource Description and Access
      lodac: LODAC Project
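A prefix table like this is typically used to expand short names such as dc:title into full property URIs. The sketch below assumes the namespace URIs that appear in the RDF/XML on slide 37, plus the standard Dublin Core 1.1 namespace for dc11; the slides give no namespace URIs for crm and rda2, so those prefixes are deliberately left out. `expand` is a hypothetical helper.

```python
# Namespace URIs for dc, foaf, lodac, rdfs and skos are the ones declared in
# the RDF/XML on slide 37. dc11 is the standard Dublin Core 1.1 namespace.
# crm and rda2 are omitted because the slides do not state their URIs.
NAMESPACES = {
    "dc": "http://purl.org/dc/terms/",
    "dc11": "http://purl.org/dc/elements/1.1/",
    "foaf": "http://xmlns.com/foaf/0.1/",
    "lodac": "http://lod.ac/ns/lodac#",
    "rdfs": "http://www.w3.org/2000/01/rdf-schema#",
    "skos": "http://www.w3.org/2004/02/skos/core#",
}


def expand(qname: str) -> str:
    """Expand a prefixed name such as 'dc:title' into a full URI."""
    prefix, sep, local = qname.partition(":")
    if not sep or prefix not in NAMESPACES:
        raise KeyError(f"unknown prefix in {qname!r}")
    return NAMESPACES[prefix] + local


print(expand("dc:title"))
print(expand("lodac:era"))
```

Raising an error for an unknown prefix, rather than guessing, keeps a missing vocabulary from silently producing wrong property URIs.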
  • 32. Integrating data. Each Ref-resource (a creator's reference) is RDF generated from the raw data of one source. The ID-resource (the creator's integrated information) is generated as RDF and assigned a LODAC ID, and it links to each Ref-resource with dc:references.
  • 33. Integrating a creator's information. [Diagram] The integrated creator resource (LODAC ID-resource) has foaf:name "SHIMOMURA, Kanzan"@en and "下村観山"@ja. It links by dc:references to DBpedia (Wikipedia) as an external link and to a LODAC Ref-resource, which carries foaf:name, crm:P98I_was_born 1873, and dc:source pointing to the Japanese Art Thesaurus. Through lodac:creates it links to the artwork's LODAC ID-resource, which has dc:title "木の間の秋"@ja and "Autumn Among Trees"@en and links by dc:references to a LODAC Ref-resource carrying dc:title, dc:created 1907, and dc:source pointing to the National Museum of Modern Art as an external link.
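The ID-resource / Ref-resource pattern can be sketched as a function that emits the triples for one integrated resource. The properties foaf:name and dc:references follow the slides; the function name `integrate` and the tuple representation of language-tagged names are illustrative assumptions, not the project's actual code.

```python
DC = "http://purl.org/dc/terms/"
FOAF = "http://xmlns.com/foaf/0.1/"


def integrate(id_uri, names, ref_uris):
    """Return triples for one ID-resource that points at its Ref-resources.

    names:    list of (text, language tag) pairs for foaf:name
    ref_uris: URIs of Ref-resources or external links for dc:references
    """
    triples = [(id_uri, FOAF + "name", (text, lang)) for text, lang in names]
    triples += [(id_uri, DC + "references", ref) for ref in ref_uris]
    return triples


creator = integrate(
    "http://lod.ac/id/359",
    [("下村観山", "ja")],
    ["http://lod.ac/ref/359"],
)
for triple in creator:
    print(triple)
```

Keeping the per-source descriptions in separate Ref-resources and only linking to them means the integrated resource never has to overwrite what an individual source says.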
  • 34. Associating data. Creators and artworks are associated with a string-matching method between (A) the Japanese Art Thesaurus, 1,332 creators, and (B) all artworks, 61,861 titles. Matching key: A. creator of artwork; B. creator's name.
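The slide says only that a string match method is used. The sketch below shows one way such matching could work: exact comparison after a light normalization (Unicode NFKC, whitespace removal, case folding). The normalization step and the names `normalize` and `associate` are assumptions, not the authors' documented procedure.

```python
import unicodedata


def normalize(name: str) -> str:
    """NFKC-normalize, remove whitespace and case-fold so variants compare equal."""
    folded = unicodedata.normalize("NFKC", name)
    return "".join(folded.split()).casefold()


def associate(creators, artworks):
    """Link creators to artworks whose creator string matches exactly.

    creators: {creator_uri: creator name}
    artworks: list of (artwork_uri, creator string found in the artwork record)
    Returns a list of (creator_uri, artwork_uri) pairs.
    """
    index = {}
    for uri, name in creators.items():
        index.setdefault(normalize(name), []).append(uri)
    links = []
    for artwork_uri, creator_string in artworks:
        for creator_uri in index.get(normalize(creator_string), []):
            links.append((creator_uri, artwork_uri))
    return links


links = associate(
    {"http://lod.ac/id/359": "下村観山"},
    [("http://lod.ac/id/20029", "下村 観山")],
)
print(links)
```

Exact matching is cheap and precise but misses spelling variants and cannot separate different people with the same name.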
  • 35. Integration results (integrated item; sources A and B with amount of data; amount of integrated data):
      Facilities: A. Japanese Art Thesaurus, 648; B. Cultural Heritage Online, 915; integrated: 77
      Title of important cultural properties: A. Japanese Art Thesaurus (artwork), 3,800; B. DB for National Treasure (artwork), 10,115; integrated: 74
      Creator information and work title: A. Japanese Art Thesaurus (creator), 1,332; B. All artworks (work title string), 61,861; integrated: 15,020
      Creator name: A. Japanese Art Thesaurus (creator), 1,332; B. All artwork titles (using creator name), 61,861; integrated: 615
  • 36. Publishing & Sharing. We build a Linked Data infrastructure for museum information.
  • 37. Publish data as RDF. The ID-resource URI (its own address) is http://lod.ac/id/359, the Ref-resource URI is http://lod.ac/ref/359, and the external link points to DBpedia Japanese.
      <?xml version="1.0" encoding="UTF-8"?>
      <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
               xmlns:ns0="http://purl.org/dc/terms/"
               xmlns:ns1="http://xmlns.com/foaf/0.1/"
               xmlns:ns2="http://lod.ac/ns/lodac#"
               xmlns:ns3="http://www.w3.org/2000/01/rdf-schema#"
               xmlns:ns4="http://www.w3.org/2004/02/skos/core#">
        <rdf:Description rdf:about="http://lod.ac/id/359.json">
          <ns0:title>JSON representation for http://lod.ac/id/359</ns0:title>
          <ns1:primaryTopic>
            <ns1:Person rdf:about="http://lod.ac/id/359">
              <ns2:creates rdf:resource="http://lod.ac/id/20029"/>
              <ns0:references rdf:resource="http://dbpedia.jp/resource/%E4%B8%8B%E6%9D%91%E8%A6%B3%E5%B1%B1"/>
              <ns0:references rdf:resource="http://lod.ac/ref/359"/>
              <ns3:label xml:lang="ja">下村観山</ns3:label>
              <ns4:prefLabel xml:lang="ja">下村観山</ns4:prefLabel>
              <ns1:name xml:lang="ja">下村観山</ns1:name>
            </ns1:Person>
          </ns1:primaryTopic>
        </rdf:Description>
      </rdf:RDF>
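To show how a client might read such a document, here is a sketch that extracts the person's URI, preferred label, and dc:references links with Python's standard `xml.etree.ElementTree`. It works on a shortened copy of the slide's RDF/XML and relies on this particular nesting; it is not a general RDF/XML parser, and `describe_person` is a hypothetical helper.

```python
import xml.etree.ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
DC = "http://purl.org/dc/terms/"
FOAF = "http://xmlns.com/foaf/0.1/"
SKOS = "http://www.w3.org/2004/02/skos/core#"

# Shortened version of the RDF/XML shown on slide 37.
SAMPLE = """<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:ns0="http://purl.org/dc/terms/"
         xmlns:ns1="http://xmlns.com/foaf/0.1/"
         xmlns:ns4="http://www.w3.org/2004/02/skos/core#">
  <ns1:Person rdf:about="http://lod.ac/id/359">
    <ns0:references rdf:resource="http://lod.ac/ref/359"/>
    <ns4:prefLabel xml:lang="ja">下村観山</ns4:prefLabel>
  </ns1:Person>
</rdf:RDF>"""


def describe_person(rdf_xml: str) -> dict:
    """Pull URI, preferred label and dc:references out of a foaf:Person element."""
    root = ET.fromstring(rdf_xml)
    person = root.find(f".//{{{FOAF}}}Person")
    return {
        "uri": person.get(f"{{{RDF}}}about"),
        "label": person.findtext(f"{{{SKOS}}}prefLabel"),
        "references": [
            element.get(f"{{{RDF}}}resource")
            for element in person.findall(f"{{{DC}}}references")
        ],
    }


print(describe_person(SAMPLE))
```

Matching on full namespace URIs rather than on the generated prefixes (ns0, ns1, ...) matters here, because a serializer is free to choose different prefixes each time.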
  • 38. SPARQL Query. The SPARQL query language is widely used for querying RDF data. Example question: how many duplicate titles are there? The WHERE clause pulls the artwork resources out of the RDF dataset; the SELECT clause counts duplicate work titles in the pulled data.
  • 39. SPARQL Query: the top 20 duplicate titles. The SPARQL query language is widely used for querying RDF data. Example question: how many duplicate titles are there? The WHERE clause pulls the artwork resources out of the RDF dataset; the SELECT clause counts duplicate work titles in the pulled data.
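The query text itself is not in the transcript. As a hedged sketch, a SPARQL 1.1 query for the top 20 duplicate titles could look like the string below, assuming titles are stored with dc:title from DCMI Terms. The function `top_duplicate_titles` is a hypothetical local equivalent of the same counting logic, so the idea can be tried without an endpoint.

```python
from collections import Counter

# Assumption: works carry their title as dc:title (DCMI Terms). The actual
# graph pattern used on the slide is not recoverable from the transcript.
DUPLICATE_TITLES_QUERY = """
PREFIX dc: <http://purl.org/dc/terms/>
SELECT ?title (COUNT(?work) AS ?n)
WHERE { ?work dc:title ?title . }
GROUP BY ?title
HAVING (COUNT(?work) > 1)
ORDER BY DESC(?n)
LIMIT 20
"""


def top_duplicate_titles(pairs, limit=20):
    """Count titles shared by more than one work.

    pairs: iterable of (work_uri, title). Returns (title, count) tuples,
    most frequent first, ties broken alphabetically.
    """
    counts = Counter(title for _, title in set(pairs))
    duplicates = [(title, n) for title, n in counts.items() if n > 1]
    return sorted(duplicates, key=lambda item: (-item[1], item[0]))[:limit]
```

The WHERE pattern selects the resources and the aggregate in SELECT does the counting, which mirrors the two steps the slide describes.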
  • 40. Analysis of the techniques and media of the artworks
  • 41. Applications
  • 42. YOKOHAMA Art Spot. Data: facilities, local GIS, artwork.
  • 43. Photo BURARI (LODAC version), (C) ATR-Promotions, Inc. GIS and facilities information is obtained through SPARQL.
  • 44. Summary
      Organizing: we integrated distributed information as Linked Data. As a result, approximately 11 million pieces of information are available on a common platform.
      Publishing: we published the RDF data on the LODAC Museum website. Everybody can use them for free!
      Using: currently, two applications use LODAC Museum's data. We are considering further ways to use these resources. (We plan to use them for study purposes.)
  • 45. http://lod.ac LODAC Project: Linked Open Data for Academia