This document provides a summary of linked data principles and examples. It discusses how linked data can help computers understand web data by structuring it using common standards like URIs, HTTP, RDF, and SPARQL. The key principles of linked data are explained, including using URIs to identify things, including useful information at those URIs, and linking to other URIs to discover more things. Examples of linked data applications in domains like academia, libraries, government, and media are also provided. The document concludes by discussing how linked data works technically using structured data, graphs, and W3C web standards.
LOD , Linked Open Data 에 대한 소개 자료 입니다. LOD는 공공 데이터를 제공, 공유, 재활용하기 위한 또 하나의 방법이며 오픈 데이터(Open Data) 를 위한 하나의 방법으로 웹을 기반으로 데이터를 공유하여 재활용하고자 방법이며 기술이고 데이터입니다.
Open Archives Initiatives For Metadata HarvestingNikesh Narayanan
The Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) provides a simple but effective mechanism for metadata harvesting. It allows service providers to aggregate content from data providers to build value-added services. The OAI-PMH uses HTTP and XML to share metadata in any agreed format, with Dublin Core as a baseline. It defines a set of verbs and standards for harvesting metadata from repositories in a consistent way. This interoperability has helped surface resources and build services across independently developed digital libraries.
This document provides a summary of linked data principles and examples. It discusses how linked data can help computers understand web data by structuring it using common standards like URIs, HTTP, RDF, and SPARQL. The key principles of linked data are explained, including using URIs to identify things, including useful information at those URIs, and linking to other URIs to discover more things. Examples of linked data applications in domains like academia, libraries, government, and media are also provided. The document concludes by discussing how linked data works technically using structured data, graphs, and W3C web standards.
LOD , Linked Open Data 에 대한 소개 자료 입니다. LOD는 공공 데이터를 제공, 공유, 재활용하기 위한 또 하나의 방법이며 오픈 데이터(Open Data) 를 위한 하나의 방법으로 웹을 기반으로 데이터를 공유하여 재활용하고자 방법이며 기술이고 데이터입니다.
Open Archives Initiatives For Metadata HarvestingNikesh Narayanan
The Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) provides a simple but effective mechanism for metadata harvesting. It allows service providers to aggregate content from data providers to build value-added services. The OAI-PMH uses HTTP and XML to share metadata in any agreed format, with Dublin Core as a baseline. It defines a set of verbs and standards for harvesting metadata from repositories in a consistent way. This interoperability has helped surface resources and build services across independently developed digital libraries.
Introdução às ontologias por Gisele Dziekaniak EDUTEC FURG 2009Gisele Dziekaniak
O documento introduz o conceito de ontologias, explicando onde atuam, o que são, para que servem e seus principais elementos. Também discute a diferença entre tesauros e ontologias, e a aplicação das ontologias na Biblioteconomia.
The document provides an introduction to Dublin Core metadata, including:
1) Dublin Core is a set of metadata standards including 15 simple elements and over 50 qualified elements for describing resources.
2) Dublin Core metadata can be used to improve resource discovery and is recommended for metadata harvesting and the semantic web.
3) Custom mappings can be made from other metadata standards like LOM to the Dublin Core Abstract Model to make metadata interoperable.
Overview of oss(open source software library) and its pros and consYuga Priya Satheesh
This document discusses the pros and cons of using open source software in libraries. It begins with an introduction to open source software and licenses. Popular open source software used in libraries are then outlined, including integrated library management systems (ILMS) like Koha and Evergreen, digital library software like DSpace and Greenstone, content management systems like Drupal and Joomla, and learning management systems like Moodle. The benefits of open source software for libraries are lower costs, easier licensing, better quality control through community involvement, customizability, and ample support resources. Potential drawbacks include less personal support, lack of training and expertise, need for technical skills to maintain the software, and lack of scalability in some cases. The
O documento discute a evolução dos padrões e normas para catalogação de recursos em meio digital. Apresenta os principais conceitos da catalogação como descrição bibliográfica, pontos de acesso e normas internacionais como ISBD e RDA. Também aborda temas como web semântica, dados ligados e a influência destes conceitos no desenvolvimento das normas de catalogação.
Choosing the Right Graph Database to Succeed in Your ProjectOntotext
The document discusses choosing the right graph database for projects. It describes Ontotext, a provider of graph database and semantic technology products. It outlines use cases for graph databases in areas like knowledge graphs, content management, and recommendations. The document then examines Ontotext's GraphDB semantic graph database product and how it can address key use cases. It provides guidance on choosing a GraphDB option based on project stage from learning to production.
DESIDOC is the Defence Scientific Information and Documentation Centre established in 1970 in Delhi, India. It operates under the Defence Research and Development Organization (DRDO) to disseminate science and technology information on cutting edge defence technologies. DESIDOC's vision is to be a centre of excellence for knowledge management in DRDO. It provides library resources and access to databases for DRDO headquarters and laboratories. DESIDOC conducts training programs and publishes various periodicals related to defence research.
LOD , Linked Open Data 에 대해 구축 절차 및 도구, 사례에 대한 자료 입니다. LOD는 공공 데이터를 제공, 공유, 재활용하기 위한 또 하나의 방법이며 오픈 데이터(Open Data) 를 위한 하나의 방법으로 웹을 기반으로 데이터를 공유하여 재활용하고자 방법이며 기술이고 데이터입니다.
Knowledge Organisation Systems in Digital Libraries: A Comparative StudyBhojaraju Gunjal
The document presents a study that compares the different Knowledge Organization Systems (KOS) used in major digital libraries. It finds that while traditional libraries used standardized systems like classification schemes, digital libraries employ various KOS tools including thesauri, ontologies, and subject headings. The study analyzes the specific KOS used in different digital libraries and summarizes the current state of KOS in these libraries.
This document discusses Neo4j and its applications in bioinformatics. It describes Bio4j, an open source bioinformatics graph database built using Neo4j that integrates data from sources like Uniprot, NCBI taxonomy, Gene Ontology, and more. Bio4j models biological data as nodes and relationships in a graph structure rather than tables. This allows for more flexible querying and knowledge integration. The document provides examples of how Bio4j can be accessed through its Java API, Cypher query language, Gremlin traversal language, and REST API. It also describes some tools and visualizations for exploring and analyzing Bio4j data.
Introduction to digital libraries - definitions, examples, concepts and trend...Olaf Janssen
This presentation gives an introduction to digital libraries.
It first explores different defintions of the phrase "Digital Library".
It then looks at 11 real life examples of digital library websites (slides 44-112), including Europeana, Google Books, Flickr the Commons, Delpher, Wikisource, The Memory of the Netherlands and Project Gutenberg. Each of these DLs is assessed against five different criteria (concepts, properties)
- Content/User experience
- Cultural heritage domain (libraries, archives, museums, AV-institutions)
- Controlled / run by
- Content providing parties
- User involvement
Many references are made to Web2.0-concepts from Tim O'Reilly's article http://www.oreilly.com/pub/a/web2/archive/what-is-web-20.html
From these 11x5 = 55 datapoints 6 trend plots are drawn (slides 116-166) to show "what is hot" and "what is not" in the current DL-landscape. Key slide summarizing this = no 168
Finally, some strategies for content & brand distribution of DLs are being discussed (SEO, Wikipedia, social & ego networks) , as well as some financial trends in DLs
This presentation was given by Olaf Janssen (National Library of the Netherlands - KB) as a lecture for students of the master's course "The Library" at Leiden University, most recently on 3-11-2016.
The International Nuclear Information System (INIS) was established in 1970 by the International Atomic Energy Agency to collect, preserve, and disseminate bibliographic data and full texts on the world's nuclear literature and technology. INIS maintains a database containing over 3.3 million indexed references in English and operates a repository of bibliographic references and full text documents. It provides eLearning courses and training seminars to help member states develop their nuclear information capacities. INIS has 132 member states and 24 international organizations that cooperate to further its mission.
RDA is a set of guidelines for cataloging digital resources that is based on FRBR and FRAD models. It addresses shortcomings of AACR2 for describing online resources. The RDA Toolkit provides the full RDA instructions and tools like mappings, workflows and an element set to support efficient RDA implementation. It is maintained by the RDA Steering Committee and aims to produce robust data that clearly defines relationships for discovery of resources in libraries, archives and other cultural heritage organizations.
DSpace is an open source repository software platform designed for academic and research institutions to capture, store, distribute and preserve digital materials. It provides tools to organize content such as articles, reports, datasets and multimedia into an institutional repository that is accessible over time. DSpace uses Dublin Core metadata standards and has customizable workflows, user interfaces and technological features like support for different file formats and persistent identifiers. It also implements the OAI-PMH protocol for metadata harvesting and interoperability with other repositories. DSpace is widely used with a large user community and support from the nonprofit organization DuraSpace.
FRBR stands for Functional Requirements for Bibliographic Records.
Functional Requirements for Bibliographic Records is a conceptual entity-relationship model developed by the International Federation of Library Associations and Institutions (IFLA).
A conceptual entity relationship model that relates user tasks of retrieval and access in online library catalogs and bibliographic databases from a user’s perspective.
A new conceptual model for bibliographic universe with a strong users focus .
The purpose of this entity relationship analysis was to discover the logical nature of bibliographic data in terms of entity, attributes and relationship.
The Semantic Web Health Care and Life Sciences Interest Group aims to develop and promote the use of Semantic Web technologies in healthcare, life sciences, clinical research, and translational medicine. These domains would greatly benefit from improved data interoperability enabled by Semantic Web technologies. The group will continue developing high-level and architectural vocabularies, implement proof-of-concept demonstrations, and disseminate information about their work to accelerate adoption of these technologies.
Introdução às ontologias por Gisele Dziekaniak EDUTEC FURG 2009Gisele Dziekaniak
O documento introduz o conceito de ontologias, explicando onde atuam, o que são, para que servem e seus principais elementos. Também discute a diferença entre tesauros e ontologias, e a aplicação das ontologias na Biblioteconomia.
The document provides an introduction to Dublin Core metadata, including:
1) Dublin Core is a set of metadata standards including 15 simple elements and over 50 qualified elements for describing resources.
2) Dublin Core metadata can be used to improve resource discovery and is recommended for metadata harvesting and the semantic web.
3) Custom mappings can be made from other metadata standards like LOM to the Dublin Core Abstract Model to make metadata interoperable.
Overview of oss(open source software library) and its pros and consYuga Priya Satheesh
This document discusses the pros and cons of using open source software in libraries. It begins with an introduction to open source software and licenses. Popular open source software used in libraries are then outlined, including integrated library management systems (ILMS) like Koha and Evergreen, digital library software like DSpace and Greenstone, content management systems like Drupal and Joomla, and learning management systems like Moodle. The benefits of open source software for libraries are lower costs, easier licensing, better quality control through community involvement, customizability, and ample support resources. Potential drawbacks include less personal support, lack of training and expertise, need for technical skills to maintain the software, and lack of scalability in some cases. The
O documento discute a evolução dos padrões e normas para catalogação de recursos em meio digital. Apresenta os principais conceitos da catalogação como descrição bibliográfica, pontos de acesso e normas internacionais como ISBD e RDA. Também aborda temas como web semântica, dados ligados e a influência destes conceitos no desenvolvimento das normas de catalogação.
Choosing the Right Graph Database to Succeed in Your ProjectOntotext
The document discusses choosing the right graph database for projects. It describes Ontotext, a provider of graph database and semantic technology products. It outlines use cases for graph databases in areas like knowledge graphs, content management, and recommendations. The document then examines Ontotext's GraphDB semantic graph database product and how it can address key use cases. It provides guidance on choosing a GraphDB option based on project stage from learning to production.
DESIDOC is the Defence Scientific Information and Documentation Centre established in 1970 in Delhi, India. It operates under the Defence Research and Development Organization (DRDO) to disseminate science and technology information on cutting edge defence technologies. DESIDOC's vision is to be a centre of excellence for knowledge management in DRDO. It provides library resources and access to databases for DRDO headquarters and laboratories. DESIDOC conducts training programs and publishes various periodicals related to defence research.
LOD , Linked Open Data 에 대해 구축 절차 및 도구, 사례에 대한 자료 입니다. LOD는 공공 데이터를 제공, 공유, 재활용하기 위한 또 하나의 방법이며 오픈 데이터(Open Data) 를 위한 하나의 방법으로 웹을 기반으로 데이터를 공유하여 재활용하고자 방법이며 기술이고 데이터입니다.
Knowledge Organisation Systems in Digital Libraries: A Comparative StudyBhojaraju Gunjal
The document presents a study that compares the different Knowledge Organization Systems (KOS) used in major digital libraries. It finds that while traditional libraries used standardized systems like classification schemes, digital libraries employ various KOS tools including thesauri, ontologies, and subject headings. The study analyzes the specific KOS used in different digital libraries and summarizes the current state of KOS in these libraries.
This document discusses Neo4j and its applications in bioinformatics. It describes Bio4j, an open source bioinformatics graph database built using Neo4j that integrates data from sources like Uniprot, NCBI taxonomy, Gene Ontology, and more. Bio4j models biological data as nodes and relationships in a graph structure rather than tables. This allows for more flexible querying and knowledge integration. The document provides examples of how Bio4j can be accessed through its Java API, Cypher query language, Gremlin traversal language, and REST API. It also describes some tools and visualizations for exploring and analyzing Bio4j data.
Introduction to digital libraries - definitions, examples, concepts and trend...Olaf Janssen
This presentation gives an introduction to digital libraries.
It first explores different defintions of the phrase "Digital Library".
It then looks at 11 real life examples of digital library websites (slides 44-112), including Europeana, Google Books, Flickr the Commons, Delpher, Wikisource, The Memory of the Netherlands and Project Gutenberg. Each of these DLs is assessed against five different criteria (concepts, properties)
- Content/User experience
- Cultural heritage domain (libraries, archives, museums, AV-institutions)
- Controlled / run by
- Content providing parties
- User involvement
Many references are made to Web2.0-concepts from Tim O'Reilly's article http://www.oreilly.com/pub/a/web2/archive/what-is-web-20.html
From these 11x5 = 55 datapoints 6 trend plots are drawn (slides 116-166) to show "what is hot" and "what is not" in the current DL-landscape. Key slide summarizing this = no 168
Finally, some strategies for content & brand distribution of DLs are being discussed (SEO, Wikipedia, social & ego networks) , as well as some financial trends in DLs
This presentation was given by Olaf Janssen (National Library of the Netherlands - KB) as a lecture for students of the master's course "The Library" at Leiden University, most recently on 3-11-2016.
The International Nuclear Information System (INIS) was established in 1970 by the International Atomic Energy Agency to collect, preserve, and disseminate bibliographic data and full texts on the world's nuclear literature and technology. INIS maintains a database containing over 3.3 million indexed references in English and operates a repository of bibliographic references and full text documents. It provides eLearning courses and training seminars to help member states develop their nuclear information capacities. INIS has 132 member states and 24 international organizations that cooperate to further its mission.
RDA is a set of guidelines for cataloging digital resources that is based on FRBR and FRAD models. It addresses shortcomings of AACR2 for describing online resources. The RDA Toolkit provides the full RDA instructions and tools like mappings, workflows and an element set to support efficient RDA implementation. It is maintained by the RDA Steering Committee and aims to produce robust data that clearly defines relationships for discovery of resources in libraries, archives and other cultural heritage organizations.
DSpace is an open source repository software platform designed for academic and research institutions to capture, store, distribute and preserve digital materials. It provides tools to organize content such as articles, reports, datasets and multimedia into an institutional repository that is accessible over time. DSpace uses Dublin Core metadata standards and has customizable workflows, user interfaces and technological features like support for different file formats and persistent identifiers. It also implements the OAI-PMH protocol for metadata harvesting and interoperability with other repositories. DSpace is widely used with a large user community and support from the nonprofit organization DuraSpace.
FRBR stands for Functional Requirements for Bibliographic Records.
Functional Requirements for Bibliographic Records is a conceptual entity-relationship model developed by the International Federation of Library Associations and Institutions (IFLA).
A conceptual entity relationship model that relates user tasks of retrieval and access in online library catalogs and bibliographic databases from a user’s perspective.
A new conceptual model for bibliographic universe with a strong users focus .
The purpose of this entity relationship analysis was to discover the logical nature of bibliographic data in terms of entity, attributes and relationship.
The Semantic Web Health Care and Life Sciences Interest Group aims to develop and promote the use of Semantic Web technologies in healthcare, life sciences, clinical research, and translational medicine. These domains would greatly benefit from improved data interoperability enabled by Semantic Web technologies. The group will continue developing high-level and architectural vocabularies, implement proof-of-concept demonstrations, and disseminate information about their work to accelerate adoption of these technologies.
This document provides an overview of linked data and semantic web technology. It discusses key concepts like URIs, RDF, SPARQL, and OWL. URIs are used to identify things on the web so they can be referred to and looked up. RDF is a general method for conceptual description or modeling of information using subject-predicate-object triples. SPARQL is a query language for retrieving and manipulating semantic data in RDF format. OWL builds on RDF and RDF Schema to provide additional vocabulary for describing properties and classes of semantic web resources.
This document discusses open data and linked data. It provides definitions and background from experts such as Tim Berners-Lee. Open data is defined as data that can be freely used, modified, and shared by anyone subject to preserving provenance and openness. Linked data follows principles like using URIs to identify things and providing useful information about those URIs to interconnect data on the web. The document outlines standards, licenses, and formats that help make data more open and linked to realize its economic and social benefits.
This paper, first, brings to light some features of social ntworking, introducing the concept of inter-subjectivity, theory of distributed cognition and principle of emergence, also mentioning the concept of information fluency for library communities. Secondly, this paper briefly reviews current library applications of social networking in the world level as well as status in Korea, such as twitter (Micro-blogging/ Presence updates), delicious (Web Resources Sharing), librarything (Cataloguing thru Social Networking: social cataloging web application for storing and sharing personal library catalogs and book lists) and library applications of some mash ups. Widgets, Libraries on FriendFeed and Google Profiles of libraries are also mentioned. Third, open source software platforms are also briefly reviewed in terms of library use. In this, a new paradigm shift of information organization in library field is mentioned: attempts are being made to move from a web of documents to a web of data. Popular Rdf Vocabularies are also briefly introduced. In this, FRBR vocabularies are specially emphasized. Since these are relatively not known to the specialists in other areas. FRBR can easily be implemented as an RDF vocabulary, that could be used to create a universal Linked Data library network. Some library related Linked Data projects are also briefed. Some notions of semantic interoperability are also briefed. Lastly,proposed models for Library apllications of social networking are suggested. Some implications of the use of library applications of social networking are also briefed.
KERIS 학술정보 글로벌 동향 2019년 Vol.10
인공지능 기술이 학술연구, 의료 등 전문분야를 넘어 일상생활로 파고들면서 인공지능의 오류에 대한 우려 또한 함께 증가하고 있음. 흔히 인공지능 블랙 박스로 부르는 이런 문제점들과 이를 해결하기 위한 대안으로 설명 가능한 인공지능에 대해 살펴보고 학술연구, 도서관에서의 활용성을 검토함
KERIS 학술정보 글로벌 동향 2019년 Vol.2
대학도서관은 학술 연구자라는 중요한 목표 이용자와 결과물에 대한 서비스 지원을 위해 노력해 옴. 오픈 사이언스는 새로운 열린 학술 연구 생태계를 대표하는 용어이며, 관련 도구는 학술 출판 생애주기 전체를 지원함. 데이터 중심의 개방 지향적 정보화 환경에서 도서관은 이전보다 적극적으로 학술 커뮤니케이션 전반에 대한 이해와 지원을 위한 서비스 시대에 대응해야함
The document discusses the National Digital Library of Korea (Dibrary) and its strategy to adopt linked data practices to make its resources available on the semantic web. It introduces Dibrary's branding, vision, establishment background, and main projects including building an integrated IT infrastructure and management system. Dibrary aims to provide an integrated search portal and digital space to offer access to cultural, academic and government information resources.
1. 2014 Linked Open Data Annual Conference
LOD 해외 사례
따
라
가
지
만
말
고
2014. 1. 24.
박진호(jino.kor@gmail.com)
성균관대학교 문헌정보학과 DataLab
2014.1.24. 국립중앙도서관 국제회의장
2. 이 자료는 네이버에서 제공한 나눔글꼴이 적용되어 있습니다
http://hangeul.naver.com/font
This work is licensed under the Creative Commons 저작자표시-비영리-변경금지
2.0 대한민국 License. To view a copy of this license, visit
http://creativecommons.org/licenses/by-nc-nd/2.0/kr/ or send a letter to
Creative Commons, 444 Castro Street, Suite 900, Mountain View, California,
94041, USA.
3. 사람간의 소통이 가능한 이유
같은 언어 사용
같은 개념 인지
같은 문화 공유
구문(Syntax), 의미(Semantics), 화용
(Pragmatics)
2
5. Semantic Web(Linked Data)
같은 언어 사용
같은 개념 인지
구문(Syntax), 의미(Semantics)
•
•
표준적인 구문을 사용하고,
개념, 대상을 정의하고 체계화하고(구조
화), 비교/분류하기(동일한 것, 비슷한 것,
다른 것, 어딘가에 속하고, 무언가를 가지
고 있고 등)
4
6. Linked Open Data
• 웹에서 보다 잘 소통하기 위한 방식 :
-
기본적으로 정보문제(Information needs) 해결을 위한 활동
-
웹은 문제해결이 가능한 거대한 데이터베이스(documents 중심)
-
기존의 소통방식도 좋으나 보다 정확하고 명확한 이해(machine
processable)가 가능하도록 하는 것이 필요
-
웹을 구조화시키고, 소통하기 위한 기본 원칙(LOD원칙)을 준수하고,
공통적으로(사람과 기계, 기계와 기계) 이해가 가능한 구조와 언어로
(RDF/OWL) 누구나 활용할 수 있도록(Open) 하기
5
8. 누군가는 원칙과 표준을 만들어가고
누군가는 데이터 플랫폼을 구축하고
에
누군가는 데이터를 출판하고(Publishing)
누군가는 이 데이터들을 활용하고
언젠가는 LOD를 위한, LOD를 활용한 새로
운
서비스들이 만들어질 것이고
7
9. 넓어지고 많아지면
무엇이 여기를 달리는 것이 가장 효과적인가?
무엇이 여기를 달리게 할 것인가?
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data
data data로 만들어진
data
data
data
data
data
data
data를 연료로쓰는
data
data
data
data
data
이 도로는 어떻게 만들어야하는가?
이 도로에 들어가는 data는 어떻게 만들고 활용할 수 있는가?
8
11. 사례1 : DBpedia
• 위키피디아(Wikipedia, http://www.wikipedia.org)로부터 구조화된정
보를 추출하고 웹에서 이 정보들을 활용 가능하도록 함
- 약 400만 개의 개념(things)이 기술 : 약 80만 건이 사람에 대한 것, 60
만 건이 장소(인구정보를 포함하고 있는 약 40만 건 포함), 약 37만 건
이 저작물(11만 건의 음악앨범, 7만 건의 영화, 2만 건의 비디오 게임
포함), 20만 건이 기관(약 5만 건의 회사 정보와 4만 건의 교육기관 포
함), 22만 건의 종 정보와 약 5천 건의 질병관련 정보
- 영어 외에 DBpedia는 119개 언어로 구성된 정보를 제공함. 이는 약
2,400백만 건의 개념(things)들을 기술하고 있는데 이 중 약 16백만건
이 영어버전의 DBpedia와 연결되어 있음
10
15. 사례2 : BBC
• 미디어 분야 중 가장 적극적으로 LOD를 직접 활용하고 업무 영역까
지 확장하여 실제 이용자 서비스에 도입
- BBC가 구축, 관리하는 데이터에 대한 Linked Data 발행자이자,
MusicB_rainz, DBPedia, Wikipedia 데이터와의 연결을 지향하는 데이
터 소비자
- BBC가 생산하는 막대한 정보들은 상호간의 공유와 연계, 협업 없이 해
당 채널별 지역별로 분리되어, 데이터 중복, 데이터 관리/운영 어려움
(재정적어려움 포함) 초래
- 이러한 문제를 극복하기 위해 Linked Data 도입
14
16. 사례2 : BBC
• 2007년 10월 BBC Programmes 서비스
- 웹에서 BBC 프로그램에 대한 영구적인 접근과 정보활용 보장
- BBC 프로그램으로 8개의 BBC TV 채널, 10개의 주요 라디오 방송국,
스코틀랜드, 북아일랜드, 웨일즈를 포함하는 6개 방송국의 모든 프로
그램 데이터에 대한 접근
- 이를 구현하기 위해 프로그램 메타데이터 간의 공유와 활용을 위해
Linked Data 선택(BBC 웹사이트 15개 원칙)
- 특정 사람부터 특정 프로그램까지 모든 것을 식별할 수 있도록 하고,
이를 RDF를 활용하여 기계가독형으로 표현
15
17. 사례2 : BBC
The BBC’s 15 Web principles
• BBC 2.0 프로젝트의 일환
1. Build web products that meet audience needs
2. The very best websites do one thing really, really well 3. Do not attempt
to do everything yourselves
4. Fall forward, fast
5. Treat the entire web as a creative canvas
6. The web is a conversation. Join in 7. Any website is only as good as its
worst page
8. Make sure all your content can be linked to, forever.
9. Remember your granny won’t ever use “Second Life”
10. Maximise routes to content
11. Consistent design and navigation needn’t mean one-size-fits-all
12. Accessibility is not an optional extra
13. Let people paste your content on the walls of their virtual homes
14. Link to discussions on the web, don’t host them
15. Personalisation should be unobtrusive, elegant and transparent
16
18. 사례2 : BBC
• BBC Linked Data 플랫폼은 DSP(Dynamic Semantic Publishing)가
자연스럽게 진화한 형태로 BBC의 모든 콘텐츠에 태깅을 허용한다
는 생각에 기초
-
기존의 BBC 콘텐츠 관리 시스템은 개방되고 연결된 구조의 웹에서 재
활용되고 서비스되는데 적합하지 않음
-
모든 BBC 저작물이 데이터 뒤에 숨겨져 있는 의미를 이해할 수 있도
록 하고 “things”에 대한 검색이 가능하도록 하는 API를 제공하는 것임
-
현재 음악, 스포츠(축구와 올림픽), 정치, 학습 분야에 적용 향후 확장
고려
17
21. 사례2 : BBC
• 2012년부터 BBC News를 위한 시맨틱 프로토타이핑 플랫폼
BBC News Juicer 시작
The News Juicer
1
2
3
4
5
6
Grab
BBC News & Sp
ort Articles
Extract Concep
ts
Match to DBpe
dia
Annotate Articl
e
Push to Triplest
ore
Expose
via
API
20
22. 사례3 : OCLC WorldCat
• OCLC WorldCat.org
- Schema.org의 용어집을 활용하여 서지 정보에 대한 구조화된 데이터
를 제공
- 웹 검색엔진(서비스)에서 WorldCat 서지 정보에 대한 접근성을 강화하
는 효과를 갖는데, 이는 서지 정보 기술을 위한 OCLC의 1세대 링크드
데이터모델
21
25. 사례3 : OCLC
대상
특징
Virtual International Au
•
thority File
•
(VIAF)
Dewey Deciaml
Classification
FAST : Facet
Application of
Subject
URL : http://viaf.org
제공형태 : HTML, RDF/XML
•
•
URL : http://dewey.info
제공형태 : HTML/RDFa, RDF/XML, Turtle, JSON SPARQL
•
•
URL : id.worldcat.org/fast
제공형태 : HTML, RDF/XML - Download
24
26. 사례4 : Linked Open Vocabulary
• LOV(Linked Open Vocabularies, http://lov.okfn.org)
-
링크드 데이터 웹의 성장으로 수많은 RDFS/OWL 기반의 용어집들이
존재하며, 용어집은 SKOS, DC, FRBR과 같은 표준뿐만 아니라 FOAF,
Event Ontology와 같이 사실상 표준들인 것들도 있음
- 현재 수많은 용어집들이 새로운 메타데이터를 선언하고 상호 연결되
어 웹에 출간되면서 새로운 용어집 생태계를 이루고 있음
- LOV는 어휘 생태계에 쉽게 접근할 수 있는 방법을 제공함
25
29. 사례4 : GeoNames
• GeoNmaes 온톨로지는 웹에 URI를 활용해서 약 1,000만 건의 지리
적위치 정보를 제공하고 있으며 이중 약 800만 건은 지리와 관련된
다양한 특징적인 정보를 포함하고 있음
- 인구정보를 포함한 280만 건의 지리정보, 이명표기를 갖고 있는 550만
건의 지리정보 등
- GeoNames는 다양한 언어, 고도, 인구와 다양한 이명표기 등에 대한
데이터를 통합한 데이터로, 모든 위도, 경도 좌표는 WGS84(World
Geodetic System 1984)표준을 따르고 있음
28
31. 한 번 더 생각해볼 문제
• 어디에서 출발할 것인가?
- BBC의 경우처럼 현재 해결해야할 문제가 있는가?
- 단지, 내 데이터를 발행하는 것이 중요한가?
- 기존에 연결되어 있는 기관(웹사이트, 데이터베이스 등)과 함께 출발해
보는 것은 어떠한가?
- 꼭, RDF로 개방해야하는가?
30
32. 플랫폼은 누가, 어떻게 만들고 있고
데이터는 누가, 어떻게 만들고 있고
서비스는 누가 어떻게 만들고 있는가?
나의 문제 상황은 무엇인고, 내가 만들어야 하는 것은 플랫폼인가, 데이터인가, 서비스인
가?
31
33. 참고자료
•
•
•
•
박진호 (2013, 7월). 도서관은 웹에 존재하는가? : 링크드데이터, 글로벌 데이터베이스.
국가전자도서관 세미나. 경기도 : 국가기록원
박진호. (2013). 도서관 데이터의 링크드 데이터(Linked Data) 변환과
인터링킹(interlinking)을 통한 정보연계 확장성에 관한 연구 : 국립중앙도서관 서지,
주제명, 저자명 데이터를 중심으로. 석사학위논문, 성균관대학교, 서울.
박진호 (2013, 12월). 국내외 정보서비스 기관의 LOD 구축 동향, KISIT 세미나. 서울 :
한국과학기술정보연구원
Godby, J. Carol. (2013, June). The Relationship between BIBFRAME and OCLC’s Linked-Data
Model of Bibliographic Description: A Working Paper. OCLC Working Paper. Retrieved from
http://oclc.org/content/dam/research/publications/library/2013/2013-05.pdf
이미지 출처
http://www.flickr.com/photos/sergemelki/8156333460/
32