Making topic maps from Subject Headings for liking and organizing information 2010-4-15 Motomu Naito Center for Integrated...
Table of Contents <ul><li>1. Back ground </li></ul><ul><li>2. Purpose </li></ul><ul><li>3. Subject Headings </li></ul><ul>...
1 . Background: Area Study and Area Informatics <ul><li>This activity is a part of activities of Area Informatics in Cente...
Model of Area Informatics Source: Shoichiro Hara, TMJP2010
2. Purpose <ul><li>To make good system for linking and organizing Area Studies related information </li></ul><ul><li>・ Mak...
3. Subject Headings <ul><li>What is Subject Headings: </li></ul><ul><li>Wikipedia redirects “Subject Headings” to “Index t...
3.1  NDLSH <ul><li>・  NDLSH: National Diet Library Subject Headings, in Japan </li></ul><ul><li>・ We are making topic map ...
NDLSH Ontology <ul><li>Ontology graph of NDLSH topic map </li></ul>
NDLSH topic map application <ul><li>Screen shots of the application </li></ul>
3.2  BSH <ul><li>・  BSH : Basic Subject Headings, Japan Library Association </li></ul><ul><li>・  We are making topic map f...
BSH Ontology <ul><li>Ontology graph of BSH topic map </li></ul>
BSH topic map application <ul><li>Screen shots of the application </li></ul>
3.3  LCSH <ul><li>・  LCSH : Library of Congress Subject Headings in US </li></ul><ul><li>・  We are making topic map from L...
LCSH Ontology <ul><li>Ontology graph of LCSH topic map </li></ul>
LCSH topic map application <ul><li>Screen shots of the application </li></ul>
4.  Practical use of Subject Headings Subject Headings can be used as Organized PSI to organize, control and link informat...
Example 1: Organizing Wikipedia ・ Organizing Wikipedia according to SH ・ Available links to Wikipedia (NDLSH: 12051, BSH: ...
Organizing Wikipedia Beer Hop Malt Wines and Spirits Liquor Amenities of life Wine Whiskey Fruit liquor Brandy Barley Beer...
Organizing Wikipedia We can easily generate Wikipedia’s address “ http://ja.wikipedia.org/wiki/”  +  “ ビール”  (SH)
Example 2: Enrich our own subjects Sometimes SH doesn’t have enough subjects or vocabulary though it is very hard to gathe...
Example 3: Mapping between multi-language If each language is mapped to LCSH, multi-language mapping will be achieved NDLS...
Mapping between multi-language Link from NDLSH to LCSH  (USE-UF relation between NDLSH and LCSH) LCSH
Example 4: Web service for providing Subject Headings Ontopia -  Navigator Framework  -  Query engine Topic Maps  Web Appl...
5.  Demo I will do short demo if I have enough time
6. Conclusion ・  CIAS has already stored huge amount of information to organize  ・  Many well organized knowledge has alre...
7. Future work ・  To prompt NDL and JLA to expose their SH with IRI ・  Continue to try to achieve multi-language mapping  ...
ありがとう ございました。 Tusen Takk!
Upcoming SlideShare
Loading in …5
×

Making topic maps from Subject Headings for linking and organizing

3,974 views

Published on

Published in: Technology

Making topic maps from Subject Headings for linking and organizing

  1. 1. Making topic maps from Subject Headings for liking and organizing information 2010-4-15 Motomu Naito Center for Integrated Area Studies (CIAS) Kyoto University [email_address] Ψ http://psi.ontopedia.net/Motomu_Naito http://www.cias.kyoto-u.ac.jp/english/CIAS/
  2. 2. Table of Contents <ul><li>1. Back ground </li></ul><ul><li>2. Purpose </li></ul><ul><li>3. Subject Headings </li></ul><ul><li>3 .1  NDLSH </li></ul><ul><li>3 .2  BSH </li></ul><ul><li>3 .3  LCSH </li></ul><ul><li>4. Practical use of Subject Headings </li></ul><ul><li>5. Demo </li></ul><ul><li>6. Conclusion </li></ul><ul><li>7. Future work </li></ul>
  3. 3. 1 . Background: Area Study and Area Informatics <ul><li>This activity is a part of activities of Area Informatics in Center for Integrated Area Study (CIAS) in Kyoto university </li></ul><ul><li>Area Study is an Interdisciplinary Science </li></ul><ul><ul><li>Understanding/comparing areas comprehensively </li></ul></ul><ul><ul><li>Diverse languages/subjects/disciplines/methodologies: </li></ul></ul><ul><ul><ul><li>history, literature, religions, politics, economics, ethnology, folklore, agriculture, environment, etc. </li></ul></ul></ul><ul><li>Area I nformatics </li></ul><ul><ul><li>Informatics paradigm in area studies </li></ul></ul><ul><ul><li>Focusing on quantitative analysis </li></ul></ul><ul><ul><ul><li>Objective, comparative and reproducible approaches </li></ul></ul></ul><ul><ul><ul><li>Spatiotemporal attributes of events </li></ul></ul></ul><ul><ul><li>Knowledge discovery supports </li></ul></ul><ul><ul><ul><li>Integration of d isciplines </li></ul></ul></ul><ul><ul><ul><li>Creation of hypotheses </li></ul></ul></ul>Source: Shoichiro Hara, TMJP2010, http://www.knowledge-synergy.com/events/documents/TMJP2010-hara.pdf
  4. 4. Model of Area Informatics Source: Shoichiro Hara, TMJP2010
  5. 5. 2. Purpose <ul><li>To make good system for linking and organizing Area Studies related information </li></ul><ul><li>・ Making and maintaining well organized knowledge is very hard and time consuming work </li></ul><ul><li>・ We have already had well organized knowledge and we can use those knowledge </li></ul><ul><li>・ We are focusing attention on Subject Headings and thesauri </li></ul><ul><li>  - NDLSH, BSH, LCSH, JST thesaurus, etc. </li></ul><ul><li>・ We are making topic maps and PSI from those knowledge, to make good use of those knowledge for linking and organizing Area Studies information </li></ul>
  6. 6. 3. Subject Headings <ul><li>What is Subject Headings: </li></ul><ul><li>Wikipedia redirects “Subject Headings” to “Index term” and define the term as </li></ul><ul><li>“ An index term, subject term, subject heading, or descriptor, in information retrieval, is a term that captures the essence of the topic of a document. Index terms make up a controlled vocabulary for use in bibliographic records. </li></ul><ul><li>(http://en.wikipedia.org/wiki/Index_term) </li></ul><ul><li>・ We are working on the following SH at the moment </li></ul><ul><li>- NDLSH, BSH and LCSH </li></ul><ul><li>・ Probably we can find much more SH in each country </li></ul><ul><li>- Norwegian SH, Finnish SH, German SH, Thai SH, etc. </li></ul>
  7. 7. 3.1  NDLSH <ul><li>・ NDLSH: National Diet Library Subject Headings, in Japan </li></ul><ul><li>・ We are making topic map from NDLSH 2008 Version </li></ul><ul><li>  - Subject Headings : 17,953 </li></ul><ul><li>  - Subject Headings + Reference words : 47,816 (47,377) </li></ul><ul><li>  - BT-NT relation : 13,220     RT relation : 9,738 </li></ul><ul><li>- USE-UF relation with LCSH: 11,663 </li></ul><ul><li>・ Conversion from SH to Topic Map </li></ul><ul><li>- Subject Headings -> Topics </li></ul><ul><li>- BT-NT, RT, USE-UF relation -> Associations </li></ul><ul><li>- USE-UF, SA relation, Scope note, reading, … -> Occurrences </li></ul><ul><li>・ SHs have each own ID that can be used as PSI (e.g. 00574308) </li></ul><ul><li>e.g. http://www.ndl.go.jp/psi/ndlsh/heading-00560674 </li></ul><ul><li>・ If NDLSH shares PSI with LCSH, it can be merged with LCSH </li></ul><ul><li>・ NDLSH will be exposed on the Web soon, I hope </li></ul>
  8. 8. NDLSH Ontology <ul><li>Ontology graph of NDLSH topic map </li></ul>
  9. 9. NDLSH topic map application <ul><li>Screen shots of the application </li></ul>
  10. 10. 3.2  BSH <ul><li>・ BSH : Basic Subject Headings, Japan Library Association </li></ul><ul><li>・ We are making topic map from BSH4   </li></ul><ul><li>  - Subject Headings : 7,847 (8,036), Reference words : 2,873 (2,892) </li></ul><ul><li>  - Descriptive reference words : 93, Particular : 169 </li></ul><ul><li>  - BT-NT relation : 8,454, RT relation : 213 </li></ul><ul><li>   USE-UF relation : 3,065, Top Term : 10,064 </li></ul><ul><li>・ Conversion from SH to Topic Map </li></ul><ul><li>- Subject Headings, Reference words, Top terms, etc. -> Topics </li></ul><ul><li>- BT-NT, RT, USE-UF relation -> Associations </li></ul><ul><li>- Scope note, reading, NDC8, NDC9, etc. -> Occurrences </li></ul><ul><li>・ SHs have each own ID that can be used as PSI (e.g. BSH400894400) </li></ul><ul><li>e.g. http://www.jla.or.jp/psi/bsh/heading-BSH400894400 </li></ul><ul><li>・ If BSH shares PSI with LCSH, it can be merged with LCSH </li></ul><ul><li>・ BSH has not exposed on the Web yet </li></ul>
  11. 11. BSH Ontology <ul><li>Ontology graph of BSH topic map </li></ul>
  12. 12. BSH topic map application <ul><li>Screen shots of the application </li></ul>
  13. 13. 3.3  LCSH <ul><li>・ LCSH : Library of Congress Subject Headings in US </li></ul><ul><li>・ We are making topic map from LCSH </li></ul><ul><li>- We downloaded it from “http://id.loc.gov/authorities/” </li></ul><ul><li>- Subject Headings : 372, 399 </li></ul><ul><li>- BT-NT, RT, inScheme, closeMatch and sameAs relation: uncountable (because, we have not finished full conversion) </li></ul><ul><li>・ RDF (SKOS) to Topic Maps using Omnigator </li></ul><ul><li>- SH (core:Concept, core:ConceptScheme) etc. -> Topics </li></ul><ul><li>- BT-NT, RT, inScheme, closeMatch, sameAs relation -> Associations </li></ul><ul><li>- scopeNote, created, modified, comment etc. -> Occurrences </li></ul><ul><li>・ SHs have each own identifiers as URI that can be used as PSIs </li></ul><ul><li>(e.g. http://id.loc.gov/authorities/sh85000002#concept) </li></ul><ul><li>・ LCSH has already exposed on the Web in consideration of Linked data </li></ul>
  14. 14. LCSH Ontology <ul><li>Ontology graph of LCSH topic map </li></ul>
  15. 15. LCSH topic map application <ul><li>Screen shots of the application </li></ul>
  16. 16. 4. Practical use of Subject Headings Subject Headings can be used as Organized PSI to organize, control and link information ・ According to SH, organizing internal and external information ・ In order to enrich our own subjects and vocabularies, merging them with SH ・ Multilanguage mapping using LCSH as a core system ・ SH providing web service using TMRAP or SDShare protocol ・ To link SH with words within our tools such as Hu-Time ・ To use SH for thesaurus query
  17. 17. Example 1: Organizing Wikipedia ・ Organizing Wikipedia according to SH ・ Available links to Wikipedia (NDLSH: 12051, BSH: 6086) Articles of Wikipedia NDLSH, BSH or LCSH
  18. 18. Organizing Wikipedia Beer Hop Malt Wines and Spirits Liquor Amenities of life Wine Whiskey Fruit liquor Brandy Barley Beer Distilled liquor The world around “Beer” in NDLSH
  19. 19. Organizing Wikipedia We can easily generate Wikipedia’s address “ http://ja.wikipedia.org/wiki/” + “ ビール” (SH)
  20. 20. Example 2: Enrich our own subjects Sometimes SH doesn’t have enough subjects or vocabulary though it is very hard to gather enough subjects from scratch by ourselves By merging our own subjects with SH we can get enriched subjects SH Our own subjects merge Beer Ale Beer Lager Bock Pilsner IPA Barley Wine
  21. 21. Example 3: Mapping between multi-language If each language is mapped to LCSH, multi-language mapping will be achieved NDLSH or BSH (Japanese) LCSH (English) merge merge merge merge Norwegian SH (Norwegian) e.g. Japanese Norwegian mapping via LSCH (English) ビール Beer Øl
  22. 22. Mapping between multi-language Link from NDLSH to LCSH (USE-UF relation between NDLSH and LCSH) LCSH
  23. 23. Example 4: Web service for providing Subject Headings Ontopia - Navigator Framework - Query engine Topic Maps Web Application - JSP Page Topic Map SH Topic Map Ontopia - Navigator Framework - Query engine Topic Maps Web Application - JSP Page Client SH providing Web service “ Term or Subject” “ Subject” topic Request SH Return SH related TM fragments SH related information Subject Headings providing web service using TMRAP or SDShare Information from client’s Web application
  24. 24. 5. Demo I will do short demo if I have enough time
  25. 25. 6. Conclusion ・ CIAS has already stored huge amount of information to organize ・ Many well organized knowledge has already existed and we are trying to use them to organize information ・ We are focusing attention on Subject Headings and thesauri such as NDLSH, BSH, LCSH, etc. ・ We are making topic maps and their web application from them ・ Topic maps can inherit Subject Headings and their relationships such as BT-NT, RT and USE-UF naturally ・ According to the relationships, information can be linked and organized ・ there are many practical way to use them ・ By providing Subject Headings as topic maps and PSI for use in the context of Linked Topic Maps, they will become powerful elements and they will be used in many way
  26. 26. 7. Future work ・ To prompt NDL and JLA to expose their SH with IRI ・ Continue to try to achieve multi-language mapping using SH ・ Continue to try to realize the web service for providing SH ・ Continue to try to merge our domain subjects with SH ・ Continue to try to find out good ways to link SH with information resources
  27. 27. ありがとう ございました。 Tusen Takk!

×