Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web (Barbara TIllett)

1,362 views

Published on

Chcete vědět víc? Mnoho dalších prezentací, videí z konferencí, fotografií i jiných dokumentů je k dispozici v institucionálním repozitáři NTK: http://repozitar.techlib.cz

Would you like to know more? Find presentations, reports, conference videos, photos and much more in our institutional repository at: http://repozitar.techlib.cz/?ln=en

Published in: Technology, Education
0 Comments
2 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
1,362
On SlideShare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
13
Comments
0
Likes
2
Embeds 0
No embeds

No notes for slide
  • Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web Abstract: Efforts have been underway for several years to lay some of the building blocks for linked data, cloud computing, and foundations for the Semantic Web. How are libraries contributing? Some of the projects that the Library of Congress has contributed to are described: the Virtual International Authority File, the id.loc.gov site for posting LC’s own controlled vocabularies, like the Library of Congress Subject Headings (LCSH) in SKOS (Simple Knowledge Organization Schema) format; and the controlled vocabularies prepared for the new cataloging code, RDA: Resource Description and Access. Credits: Some of the slides are from Ed O’Neill (on the VIAF algorithms), Thom Hickey (on VIAF statistics), and Karen Smith-Yoshimura (on VIAF images for Chekov ) from OCLC Research – with their permission. Today I’d like to share with you some things we are doing now - that work now on the Web - and that we hope will also be even more useful in the future as systems are designed to take better advantage of the rich data that libraries have provided for centuries through our bibliographic and authority records. We can repackage that data so it can be used in creative ways to better serve our users. Let me start with linked data on the Web.
  • “ Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/” Also see: http://richard.cyganiak.de/2007/10/lod/ (Modified notes from Karen Smith-Yoshimura) DBpedia  is a community effort to extract structured information from Wikipedia and to make this information available on the Web. DBpedia allows you to ask sophisticated queries against Wikipedia, and to link other data sets on the Web to Wikipedia data. The idea is that this will make it easier for the amazing amount of information in Wikipedia to be used in new and interesting ways, and  that it might inspire new mechanisms for navigating, linking and improving the encyclopaedia itself as well as the linked data, including connections to library data – <click> here showing the link to the database at the National Library of Sweden. This image shows the links as of last July and there are now many more. According to their online site as of September 2010, the DBpedia knowledge base describes more than 2.9 million things , including at least 282,000 persons, 339,000 places (including 241,000 populated places), 88,000 music albums, 44,000 films, 15,000 video games, 119,000 organizations (including 20,000 companies and 29,000 educational institutions), 130,000 species and 4400 diseases. The DBpedia knowledge base has labels and abstracts for these things in  at least 91 different languages; 807,000 links to images and 3,840,000 links to external Web pages; 4,878,100 external links into other RDF datasets, 415,000 Wikipedia categories, and 75,000 YAGO categories.
  • Information systems and content, like digital resources, images on Flickr, and our bibliographic and authority data may all be freely accessible on the Web or available for some nominal fee. It’s now part of the Internet cloud computing environment that we have today with Amazon, Google, and other systems where the elements that describe our resources are available to libraries and users everywhere in the world – not just on an institution’s computer, but shared and available to everyone through the Internet. The data comes from publishers, from the creators of the resources, from trusted libraries and other institutions, and can be augmented by further descriptive information from anyone who wants to help. All of the data about information resources in our bibliographic universe of things we have in our library collections and in archives will be accessible by any user anywhere at anytime.
  • Bibliographic data and digital resources are on the Web now and we’ve started adding the controlled vocabularies to help identify resources – such as the controlled values for naming the types of content (like sound, text, still images, and so on), types of carriers (like a film reel, a computer disc, a volume), and other elements in RDA that have controlled lists of values. Some of the RDA vocabularies are already being registered on the Web and can be used to present displays and show pathways to related resources. So today, I’d like to show you three services or building blocks for this linked Web environment that the Library of Congress is involved in with other libraries: <click> VIAF ( Virtual International Authority File) <click> LCSH – Library of Congress Subject Headings <click> And the controlled vocabularies from the new cataloging code, RDA: Resource Description and Access
  • VIAF, the Virtual International Authority File, was established to provide a free service on the Web to help share the authority data created by libraries around the world. Libraries can use it to help reduce the costs of cataloging, because authority control is one of the most expensive aspects of cataloging operations. VIAF also was designed to make authority control easier on an international scale, where libraries could help each other maintain the data. And longer term, we hope the VIAF data can be used through linked data services to enable displaying our bibliographic data in the form, language, and script that end users want.
  • For example, if a user gets information about Hiroshige Utagawa in Japanese while searching on the Web, but they have set up their profile to see English in our Latin script – a system application could use VIAF to help display the desired script – for searching further in France or in the US, and could move through the Internet to linked available works by or about this artist.
  • The concept of a shared authority file has been discussed within IFLA since the 1970’s as part of IFLA’s goal of “universal bibliographic control”: single heading for each person, corporate body, etc. to be used by everyone in the world. What’s wrong with that idea if you are interested in Hiroshige’s works and the heading is only to be presented in Japanese? Also, the technology wasn’t available in the 1970’s to realize international sharing of authority data. We didn’t have automated system capabilities to support catalogers doing authority work and to do as much of it as possible “automatically.” So, IFLA moved away from the principle of a single heading in response to users’ needs for language and scripts and to accommodate different cataloging codes. Parallel to IFLA’s efforts, the initial VIAF project in 2003 was a partnership between the Library of Congress, the Deutsche Nationalbibliothek (the national library of Germany), and OCLC. Various possible models were considered including those used by other authority file projects: Project LEAF, InterParty Project, and Project AUTHOR, but we decided to use a centralized model with links to different national authority files. OCLC developed algorithms to use information in the bibliographic records and information in authority records to match names in the two authority files. By 2007 the Bibliotheque nationale de France joined the partners.
  • Now, VIAF has 18 participants. Israel is contributing 4 separate authority files for different scripts (Latin script, Hebrew, Cyrillic, and Arabic), so there are 21 authority files with 18.4 million name records in about 14.5 million clusters.
  • In no particular order, these are the current participants. Notice “Library of Congress/NACO” includes the British Library, the National Library of New Zealand, and all of our other NACO partners around the world, so there are actually more institutions represented than just those listed as the participants. The most recent addition are the authority records from RERO in Switzerland. The National Institute for Informatics in Japan, the National Diet Library of Japan, the Russian State Library, as well as the National Library of Slovenia and a consortium in Belgium are also in progress.
  • From Thom Hickey’s blog Sept. 25, 2009: VIAF  (the Virtual International Authority File) is now available as linked data. Linked data means: URIs for everything - universal resource identifiers HTTP 303 redirects for URIs representing the personae our metadata is about  [ 303 See Other (since HTTP/1.1)] HTTP content negotiation for different data formats An RDF view of the data [The Resource Description Framework (RDF) is a language for representing information about resources in the World Wide Web] A rich a set of internal and external links in our data. The scripts are in Unicode and the data can be submitted as either UNIMARC, MARC 21, or MARCXML. We also learned from Thom Hickey that the usage of VIAF tripled this past year with thousands of visits daily and millions of visits by machines.
  • For every personal name used in either a 100 or a 700 field, a derived authority record is created. In addition to the name, the derived authority record includes a coded summary of the material published. A bibliographic record with multiple personal names will generate multiple derived authority records. All of the derived authority records for a particular person will be clustered with the authority record for the individual. The contents of all the derived authority records for the individual are added to form the enhanced authority record.
  • Two types of information are collected from the bibliographic record associated with the personal names. First, information specific to a resource is collected such as the title and LC Control number. Second, information that may apply to multiple works by an author is obtained. The latter information may help to match authors when specific title matches are not available. The OCLC Algorithms capture and label such data as … (walk through slide)
  • The Derived Authority Record moves the focus of the information in the bibliographic record from the manifestation to the person. This derived record is for Mitch Leigh who wrote the music. A similar derived authority record would also be created for Joe Darion - another composer, and the arrangers Roland Barrett & Alan Keown. Local fields (9xx) are used to store the information derived from the bibliographic record. All of the text (titles, publishers, etc.) is normalized using the NACO normalization rules. All upper case characters are changed to lower case, most diacritics are removed, and most punctuation is dropped. Subjects are entered using the number associated with the subject groups from the National Shelf List. This converts the LC classification into approximately 500 broad subject areas. The decade of publication is used for the publication dates.
  • To build the Enhanced Authority Record, the original Personal Name Authority Record is combined with all of the available derived authority records for the author. Repeated information is counted to increase its relevance in the matching analysis. The $9 subfield shows the frequency of the attribute. For example, it shows that two (in this case, all) of his publications are in English.
  • There is a lot of information that you can mine from the bibliographic and authority data – like the things shown here. Hickey
  • This is the URL for VIAF, and I encourage you all to take a look. How many of you are already using it?
  • Here’s what you see when you go to that Web site.
  • There is a Google- type search box – you have some drop down boxes to select whether you’d like to search by name or by title, another drop down box to select a particular authority file of all of VIAF, and a third search box to type in the name or title you want to search. As you begin typing, the system has a drop-down box of suggested names in the system that match what you are typing. Let’s select Cervantes, and you then retrieve (next screen)
  • The list of the authorized forms of names found in VIAF – attached to each is a little flag for the participating institutions and a sample title on the right to assure you have the one you were looking for. We can scroll further down the screen.
  • And we have the ability to scroll through various cover art associated with the publications by Cervantes.
  • Further down is the complete listing of the preferred forms of name – with the contributing institution’s flag.
  • If you don’t recognize which institution the flag belongs to, you can click on the flag in the circle to the right and up will pop the name as well as the record number for Cervantes in their authority file.
  • Further down the screen we have selected titles.
  • A bit further we have countries of publication –remember from the mined data we captured how many times a particular country was found in the bibliographic records associated with the name – here it is used to show relative volume of publications coming from each of the countries. Isn’t this a lot more interesting than seeing the MARC coded country codes? VIAF shows some ideas of how we can more creatively re-use the data we already have in our records.
  • This is a timeline of publication dates for Cervantes’ books. Each bar on the timeline is a decade.
  • and when you scroll over the bar, you get statistics about how many publications are in that decade as represented in the contributing databases.
  • VIAF also displays information about the person – here showing that Cervantes was a male, nationality – Spain, and VIAF shows the primary language used by this person as seen in various manifestations of his works. And there are some links to Wikipedia and to WorldCat Identities (since it is a product associated with OCLC) and there is the possibility to have other links in the future.
  • VIAF also has the MARC 21, UNIMARC, and RDF formatted “enhanced authority records” for each person.
  • Here’s what the data looks like as RDF (Resource Description Framework).
  • The Policy & Standards Division (PSD) at LC has had several questions on whether catalogers (LC and PCC – Program for Cooperative Cataloging) should be required to search VIAF for non-US authors BEFORE establishing these in the LC/NAF. Currently PCC and LC catalogers are not required to search VIAF before creating a Name Authority record to contribute to NACO, but are encouraged to do so. This may change when VIAF moves out of the beta stage - catalogers certainly are encouraged to use the VIAF to resolve conflicts. We have found that searching VIAF is an efficient mechanism for searching authority files, and LC is encouraging our catalogers and other to use VIAF in order to provide more consistency in authorized access points.
  • See slide – we are not including topical terms, because experience in trying to map vocabularies across thesauri in several projects has shown it is not worth doing – and there are other ways to offer users suggestions from multiple thesauri that are linked to the same bibliographic data – such as through faceted systems. However, LC is making its subject authority data available freely on the Web.
  • We are doing that through the use of SKOS – Simple Knowledge Organization System. In particular we have posted the Library of Congress Subject Headings in SKOS Format on the Web. SKOS is (see slide)
  • See slide
  • See slide
  • See slide
  • See slide
  • See slide
  • See slide
  • Here is the Web site for our “Authorities & Vocabularies” – also known as id.loc.gov
  • Here’s the page you see to search LCSH – notice there is also a tab <click> to enable anyone to suggest subject terminology or alert LC to any problems.
  • Notice when I searched for animated films, I get back 3 possiblities –each with a different LC Control number – the LCCNs – sh (LCSH – topical heading) sj (Children’s Subject Headings) g/f (Genre/Form)
  • See slide
  • Note each element in the MARC subject heading record is displayed in a more user-friendly form as portions of this textual display – again converting the MARC records to display data.
  • We also offer the ability to visualize the LCSH terms using AquaBrowser – Blue = narrower terms Green = broader terms Pink = related terms and you can click on these and move them around to explore the relationships.
  • Our contacts (see slide)
  • See slide
  • The last controlled vocabulary I’ll talk about today, is the various terms used in descriptive cataloging and found in the new cataloging code RDA: Resource Description and Access. This new cataloguing code replaces AACR2 and has been developed as a Web product – the publishers call the cataloging code plus the complementary tools, the “RDA Toolkit.” RDA was developed with the Web in mind. It gives instructions for identifying all of the things in our bibliographic universe, to use as we describe the things in our collections, the library collections, the associated persons, families, corporate bodies, concepts, objects, events, and places. The elements and sometimes specific values are included as controlled vocabularies, like the list of names of languages – English, French, Spanish, etc. (for us in the US, we take the list of languages from the MARC format), or the list of types of carriers, like computer discs, audio files, and so on.
  • The controlled vocabularies and the list of elements included in RDA are posted on a Web registry run by the Open Metadata Registry – as shown here.
  • Here’s what the registry looks like when you search the basic elements – things like the carrier type. These basic elements in RDA are the various identifying characteristics covered in the cataloging instructions, and
  • for each element, there is a URI – Universal Resource Identifier, which can be used in descriptions instead of the text string. This is helpful for displaying the language and script a user wants to see – because the URI number can be converted into a display of a text string in English or German, or whatever language. For now it’s only English and German. Spanish and French are coming soon. The Deutsche Nationalbibliothek is providing the German equivalent terms for the RDA element sets registered on the Web, and the same URI identifying Carrier type can be displayed in German to suit the application or user profile.
  • Going the next step beyond just identifying the element, to identifying the specific values or types or terms that can be used when identifying that element – for example, here we see the first page of terms that are carrier types – the green checkmarks are the broad categories and more specific terms are below each. In addition to the RDA registry, we will get the MARC list of languages from the id.loc.gov “Authorities & Vocabularies” from the Library of Congress.
  • The vocabulary for the controlled names of languages, used in RDA to identify the expression, can be displayed using a linked URI from data in the bibliographic or authority records – and depending on the user’s view – it can be displayed in English as shown here, or any other language, such as
  • in Spanish. By having the elements identified and the various language equivalents registered, the displays can be appropriate to the context – a user’s profile can alert the system which language/script should be displayed. I look forward to new systems that will help users navigate the library collections of the world as part of searching the Web. RDA is another of the building blocks to provide the identifying characteristics of the things in our bibliographic universe and to provide the relationships among those things so we can offer displays and pathways for users to explore our collections and the resources available to them worldwide -- so they can find what they need in global Web-based systems – in whatever language or script they want.
  • Google and Yahoo work in a cloud computing environment with linked data and a global perspective, and now libraries are finally getting into the act. I hope to see really creative services emerge that take advantage of the building blocks libraries provide through our controlled vocabularies and bibliographic data, so users can connect to our collections. Thank you for your attention.
  • Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web (Barbara TIllett)

    1. 1. Building Blocks for the Future: Making Controlled Vocabularies Available for the Semantic Web Dr. Barbara B. Tillett Chief, Policy & Standards Division Library of Congress For ELAG, May 2011
    2. 2. DBpedia Linked Data National Library of Sweden LCSH VIAF
    3. 3. Internet “ Cloud” Databases, Repositories Web front end Services 3
    4. 4. Internet “Cloud” Web front end Services VIAF Databases, Repositories LCSH 4
    5. 5. VIAF Objectives <ul><li>Facilitate exposure of authority data </li></ul><ul><li>Reduce cataloging costs </li></ul><ul><li>Simplify authority control (creation and maintenance) internationally </li></ul><ul><li>Provide authority data in form, language, and script users want </li></ul>
    6. 6. VIAF 6 歌 川 , 広重 2 世 1826-1869 ‎   Utagawa, Hiroshige, 1826?-1869
    7. 7. VIAF: The Virtual International Authority File <ul><li>Original VIAF partners </li></ul><ul><li>Library of Congress (LC) </li></ul><ul><li>Deutsche Nationalbibliothek (DNB) </li></ul><ul><li>Bibliothèque nationale de France (BnF) </li></ul><ul><li>OCLC - host </li></ul><ul><li>Virtually combining the name authority files of all institutions into a single name authority service. </li></ul><ul><li>http:// viaf.org / </li></ul>
    8. 8. Virtual International Authority File <ul><li>Matches names across 21 authority files of 18 institutions </li></ul><ul><ul><li>18.4 million name records </li></ul></ul><ul><ul><li>14.5 million clusters </li></ul></ul>Based on KSY Cooperative Identities Hub, CEAL 2010-03
    9. 9. <ul><li>  Library of Congress/ NACO </li></ul><ul><li>Deutsche Nationalbibliothek </li></ul><ul><li>  Bibliothèque nationale de France </li></ul><ul><li>National Library of Australia </li></ul><ul><li>  National Library of the Czech Republic </li></ul><ul><li>  Bibliotheca Alexandrina (Egypt) </li></ul><ul><li>  Getty Research Institute </li></ul><ul><li>National Library of Israel </li></ul><ul><li>  Istituto Centrale per il Catalogo Unico (Italy) </li></ul><ul><li>  Biblioteca National de Portugal </li></ul><ul><li>  Biblioteca Nacional de España </li></ul><ul><li>  National Library of Sweden </li></ul><ul><li>  Swiss National Library </li></ul><ul><li>  Vatican Library </li></ul><ul><li>  NUKAT Center (Poland) </li></ul><ul><li>  Library and Archives Canada </li></ul><ul><li>  National Széchényi Library (Hungary) </li></ul><ul><li>RERO (Switzerland) </li></ul>
    10. 10. Current Status <ul><li>Available as linked data with URIs (Universal Resource Identifiers) </li></ul><ul><li>Unicode throughout </li></ul><ul><li>MARC 21, UNIMARC, and RDF supported </li></ul><ul><li>Usage tripled this last year </li></ul><ul><ul><li>Thousands of visits daily </li></ul></ul>
    11. 11. Enhancing the Authorities Bibliographic Record Derived Authority Authority Record Enhanced Authority 11
    12. 12. Mining the Bibliographic Record LDR 00638ncm a22002057a 450 1 5773347 5 19960820101947.4 8 960815s1965 oruuua n eng 10 $a 96753638 040 $a DLC $c DLC 019 $a 17706440 020 $c $2.95 028 22 $a 48418 $b Matrix Publ. Co. 045 2 $b d198006 $b d198007 048 $b va01 $b ve01 $a ka01 050 00 $a M1258 $b .L 100 1 $a Leigh, Mitch, $d 1928- 245 14 $a The man of La Mancha / $c by Mitch Leigh & Joe Darion; arr. By Roland Barrett & Alan Keown. 260 $a Springfield, OR : $b Matrix Publ. Co., $c c1965. 300 $a 1 score (16 p.) ; $c 18 x 27 cm. 500 $a Brief record. 650 0 $a Musicals $x Excerpts. 600 10 $a Leigh, Mitch $x Musical settings. 700 1 $a Darion, Joe. Title Date of Publication Authors LC Control Number LC Classification Material Type Publisher Place of Publication Language Usage
    13. 13. Derived Authority Record <ul><li>00505cz a2200157n 450 </li></ul><ul><li>0 1 xlc 1 </li></ul><ul><li>1 3 OCoLC </li></ul><ul><li>2 5 19880921165012.4 </li></ul><ul><li>3 8 880831n|acannaab|n aaa c </li></ul><ul><li>4 040 $a OCoLC $b eng $c OCoLC $f viaf </li></ul><ul><li>5 100 1 $a Leigh, Mitch. </li></ul><ul><li>6 903 $a 88030979 </li></ul><ul><li>7 910 14 $a the man of la mancha </li></ul><ul><li>8 921 $a matrix publ co </li></ul><ul><li>9 922 $a oru </li></ul><ul><li>10 930 $a mitch leigh </li></ul><ul><li>11 940 $a eng </li></ul><ul><li>12 942 $a 234 </li></ul><ul><li>13 943 $a 196x </li></ul><ul><li>14 944 $a cm </li></ul><ul><li>15 950 1 $a darian, joe $d 1928- </li></ul>All text is normalized Subjects are grouped into broad subject areas Material type is coded Publication date is by decade Coauthor
    14. 14. Enhanced Authority Record 00505cz a2200157n 450 0 1 oca01144962 1 5 19880921165012.4 2 8 840702n| acannaab| |n aaa ||| 3 10 $a n 88090379 4 40 $a DLC $c DLC $d DLC 5 100 1 $a Leigh, Mitch, $d 1928- 6 670 $a the man of la mancha , c1966: $b t.p. (Mitch Leigh) 7 903 $a 84758340 $9 1 8 903 $a 93710923 $9 1 9 910 11 $a impossible dream $9 1 10 910 11 $a century library of music and sound by mitch leigh $9 1 11 921 $a matrix publ co $9 1 12 921 $a kapp $9 2 13 922 $a oru $9 2 14 930 $a mitch leigh $9 1 15 940 $a eng $9 2 16 942 $a 234 $9 2 17 943 $a 196x $9 1 18 943 $a 197x $9 1 19 944 $a cm $9 2 20 950 11 $a darian, joe $d 1928- $9 1 21 950 11 $a wasserman, dale $9 1
    15. 15. Information in Bibliographic Records <ul><li>He writes music </li></ul><ul><li>His primary subject area is music </li></ul><ul><li>He was published in the 1960s and 1970s by Matrix Publ. Co. in Oregon and Kapp in New York </li></ul><ul><li>Worked with Joe Darion and Dale Wasserman </li></ul><ul><li>Mitch Leigh is the only name he has used on his publications </li></ul><ul><li>Etc. </li></ul>
    16. 16. http:// www.viaf.org Hosted by
    17. 17. viaf.org
    18. 18. Cervantes Saavedra, Miguel de 1547 Cervantes de Salazar, Francisco, ca. 1514 Cervantes, 1823-1898 Cervantes Juan, 1395-1458 Cervantes, Ignacio, 1847-1905 Cervantes, Juan de, 1382-1453 Cervantès, François, 1959- Cervani, Giulio, 1919- Cervantes, María Antonieta Cervantes de Haro, fl. 1908-193- As viewed Nov. 1, 2010 cer
    19. 19. Cervantes
    20. 20. Cervantes
    21. 21. Cervantes Preferred Forms
    22. 22. Cervantes
    23. 23. Cervantes
    24. 24. Cervantes
    25. 25. Cervantes
    26. 26. Cervantes
    27. 27. Cervantes
    28. 28. MARC 21 Cervantes
    29. 29. RDF Cervantes
    30. 30. VIAF and Catalogers <ul><li>Use as a reference tool: </li></ul><ul><ul><li>To resolve conflicts, questionable dates, forms of name, etc. </li></ul></ul><ul><li>Cite as source in 670 $a, for example: </li></ul><ul><ul><li>BNF in VIAF, date searched </li></ul></ul><ul><ul><li>Nat. Lib. of Australia in VIAF, date searched </li></ul></ul><ul><ul><li>LAC in VIAF, date searched </li></ul></ul>
    31. 31. Next steps for VIAF <ul><li>Better searching </li></ul><ul><li>More “Linked data” </li></ul><ul><ul><li>Related persons as in WorldCat Identities, Wikipedia, etc. </li></ul></ul><ul><li>Participants beyond libraries </li></ul><ul><ul><li>Rights management agencies, Publishers </li></ul></ul><ul><ul><li>Museums, Archives </li></ul></ul><ul><li>More name types </li></ul><ul><ul><li>Corporate and Family names </li></ul></ul><ul><ul><li>Uniform titles </li></ul></ul><ul><ul><li>Geographic names </li></ul></ul><ul><ul><li>… not topical terms </li></ul></ul>
    32. 32. SKOS <ul><li>Simple Knowledge Organization System </li></ul><ul><ul><li>“ Provides a model for expressing the basic structure and content of concept schemes such as thesauri, classification schemes, subject heading lists, taxonomies, folksonomies, and other similar types of controlled vocabulary”— SKOS Primer </li></ul></ul>
    33. 33. SKOS <ul><li>Based on the Resource Description Framework (RDF) </li></ul><ul><ul><li>Resources can be exchanged between software applications and published on the Web </li></ul></ul><ul><ul><li>Interconnects data on the Web, helping create the Semantic Web </li></ul></ul>
    34. 34. id.loc.gov/authorities <ul><li>“Authorities & Vocabularies” from the Library of Congress </li></ul><ul><li>Intent: To provide human and programmatic access to commonly found standards and vocabularies developed by LC </li></ul>
    35. 35. “Authorities & Vocabularies” <ul><li>LCSH was the first offering </li></ul><ul><ul><li>Subject headings </li></ul></ul><ul><ul><li>Genre/form headings </li></ul></ul><ul><ul><li>Children’s subject headings </li></ul></ul><ul><ul><li>Subdivision records </li></ul></ul><ul><ul><li>Validation records </li></ul></ul><ul><li>Provides links from LCSH headings to RAMEAU headings </li></ul><ul><ul><li>Exploring Répertoire de vedettes-matière (RVM) and others </li></ul></ul>
    36. 36. “Authorities & Vocabularies” <ul><li>Also includes: </li></ul><ul><ul><li>Thesaurus for Graphic Materials ( TGM ) </li></ul></ul><ul><ul><li>MARC geographic area codes </li></ul></ul><ul><ul><li>MARC language codes </li></ul></ul><ul><ul><li>MARC relator codes </li></ul></ul><ul><ul><li>Preservation Events … etc. </li></ul></ul>
    37. 37. “Authorities & Vocabularies” <ul><li>Benefits </li></ul><ul><ul><li>Servers can download entire controlled vocabularies and the values within them, in multiple formats </li></ul></ul><ul><ul><li>Available for free on the Web </li></ul></ul>
    38. 38. “Authorities & Vocabularies” <ul><li>Human end-users can </li></ul><ul><ul><li>Search and view individual headings and data elements </li></ul></ul><ul><ul><ul><li>Details of the record </li></ul></ul></ul><ul><ul><ul><li>Visualization </li></ul></ul></ul><ul><ul><li>Suggest additions, changes </li></ul></ul>
    39. 39.
    40. 40.
    41. 41.
    42. 42. <ul><li>URI for specific LCSH records/ concepts: </li></ul><ul><li>id.loc.gov/authorities/[LCCN] </li></ul><ul><li>id.loc.gov/authorities/sh8508803 </li></ul>“Authorities & Vocabularies”
    43. 43.
    44. 44.
    45. 45. <ul><li>Contact information </li></ul><ul><ul><li>Content of site: </li></ul></ul><ul><li>Libby Dechman, edec@loc.gov </li></ul><ul><ul><li>Technical questions: </li></ul></ul><ul><li>Larry Dixson, ldix@loc.gov </li></ul>“Authorities & Vocabularies”
    46. 46. <ul><li>A comment form and discussion list are available at </li></ul>“Authorities & Vocabularies” http://id.loc.gov/authorities/contact.html
    47. 47. RDA Controlled Vocabularies - Registries <ul><li>Free on the Web at Open Metadata Registry </li></ul>http://metadataregistry.org/schema/list.html
    48. 48. http://metadataregistry.org/rdabrowse.htm
    49. 49. Carrier type
    50. 50. URI
    51. 51. RDA Carrier Types URI
    52. 52. RDA Linked Data Don Quixote Madrid, 1979 English Spanish French German Cervantes Library of Congress Copy 1 Green leather binding Exemplary novels Wasserman The Man of La Mancha Text Movies … Derivative works Subject created created created
    53. 53. RDA Linked Terms for Languages Don Quijote Madrid, 1979 Inglés Español Francés Alemán Cervantes Library of Congress Copia 1 Encuadernación en piel color verde Novelas Ejemplares Wasserman The Man of La Mancha Texto Películas … Obras derivadas Materias
    54. 54. Internet “Cloud” Web front end Services VIAF Databases, Repositories LCSH

    ×