Script away!!
APIs, XSLT, and linked data sets for creating and
enriching bibliographic data
Lucas Mak | Devin Higgins | Autumn Faulkner | Joshua Barton
Michigan State University Libraries
Three Projects:
• Geographic Area Code Enrichment
• Applying XSLT
• Rovi Music Collection
• Metadata enhancement via API
• Google Books Dataset
• Applying Python, leveraging APIs
PART 1: Geographic Area Code Enrichment
https://www.loc.gov/marc/geoareas/
Geographic Area Code Enrichment
• Objective
• Area studies librarians want to analyze collections in their respective areas by Geographic Area Codes (GAC) in catalog records
• Problems
• Catalog records lacking GACs
• GACs in a catalog record may not be comprehensive
• Until 2010, the Library of Congress (LC) assigned a maximum of three GACs to any one bibliographic record
Geographic Area Codes
• Library of Congress Subject Heading Manual
• Appendix E governs the assignment of GAC
• Heading is tagged 651 or contains a geographic subdivision ($z)
• Location of individual named entities
• Events, exhibitions, movements, etc.
• And many more …
Solution
• Inserting GAC based on subject headings
• MARC 651$a
• Geographic subdivisions ($z)
• Government bodies subordinate to a jurisdictional heading (610 10$a)
• Geographic qualifier of conference headings (610$c & 611$c )
• Exclusions
• Ethnic groups, nationalities, civilizations, etc.
• False match possibility & matching efficiency
• Non-jurisdictional corporate bodies (610 20)
• Difficult to determine location of the corporate bodies
Conversion Table
• Source of data
• id.loc.gov
• LC has published the GAC data as linked data (http://id.loc.gov/vocabulary/geographicAreas)
• Bulk download available (http://id.loc.gov/download/)
• RDFXML, n-triple, and Turtle
• Data conversion
• RDFXML file into XML tables
• Name-Code conversion table
• United States → n-us---
• Deprecated-Current Code conversion table
• e.g. a-hk--- → a-cc-hk
Default Processing Logic
• The first geographic subdivision ($z) is matched as is
• $z Ohio $z Cleveland → matches “Ohio” against the conversion table
• MARC 651$a, 610 10$a, 610$c, and 611$c
• No qualifier (e.g. Ohio): the heading is matched as is
• With qualifier (e.g. Cleveland (Ohio) or Cleveland, Ohio): the qualifier is extracted for matching
• Output
• Converts deprecated GACs into current GACs (e.g. a-hk--- → a-cc-hk)
• Keeps existing GACs in 043 if current
• Deduplicates newly generated GACs against the existing 043
• Only outputs unique GACs
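The default logic above can be sketched in a few lines. The project itself used XSLT; the Python below only illustrates the matching steps, and the table excerpts, function names, and input shapes are assumptions made for the example. It also assumes terminal punctuation has already been stripped from the headings.

```python
import re

# Hypothetical excerpts of the two lookup tables; the real tables are
# derived from the id.loc.gov bulk download.
NAME_TO_CODE = {"Ohio": "n-us-oh", "United States": "n-us---"}
DEPRECATED_TO_CURRENT = {"a-hk---": "a-cc-hk"}

# A trailing parenthetical qualifier, e.g. "(Ohio)" in "Cleveland (Ohio)".
QUALIFIER = re.compile(r"\(([^)]+)\)\s*$")


def place_to_match(heading):
    """Return the part of a 651$a / 610 10$a / 61X$c value to look up."""
    heading = heading.strip()
    match = QUALIFIER.search(heading)
    if match:                       # "Cleveland (Ohio)" -> "Ohio"
        return match.group(1).strip()
    if "," in heading:              # "Cleveland, Ohio" -> "Ohio"
        return heading.rsplit(",", 1)[1].strip()
    return heading                  # "Ohio" -> "Ohio"


def gacs_for_record(existing_043, first_z_subdivisions, headings):
    """Combine existing and newly derived GACs into a unique, ordered list."""
    # Keep existing codes, converting deprecated ones to current ones.
    codes = [DEPRECATED_TO_CURRENT.get(code, code) for code in existing_043]
    # The first $z of each subject heading is matched as is.
    for name in first_z_subdivisions:
        code = NAME_TO_CODE.get(name.strip())
        if code:
            codes.append(code)
    # Other headings are matched on their qualifier when they have one.
    for heading in headings:
        code = NAME_TO_CODE.get(place_to_match(heading))
        if code:
            codes.append(code)
    # dict.fromkeys removes duplicates while preserving order.
    return list(dict.fromkeys(codes))


result = gacs_for_record(
    existing_043=["a-hk---", "n-us-oh"],
    first_z_subdivisions=["Ohio"],
    headings=["Cleveland (Ohio)", "Cleveland, Ohio", "United States"],
)
print(result)
```

Headings that find no match in the table simply contribute nothing, which mirrors the conservative approach of excluding cases where a false match is likely.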
Issues in Name-Code Conversion
• Special Patterns in Geographic qualifiers
• Abbreviated state/provincial names (RDA Appendix B.11)
• Vancouver (B.C.) in 651$a, “Portland, Me.” in 611$c
• Multiple country/state/provincial names
• Cumberland River (Ky. and Tenn.)
• Type of place/jurisdiction
• Chignik Lagoon (Alaska : Bay), Addison (Ohio : Township)
• Intermediate place name
• Albany (Berks County, Pa.)
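One way to picture how these special patterns might be normalized before lookup is sketched below. This is an illustration in Python rather than the project's XSLT; the function name and the small abbreviation table are assumptions, and a real table would need to cover all of the abbreviations in use.

```python
# Hypothetical excerpt of an abbreviation-to-full-name table.
ABBREVIATIONS = {
    "B.C.": "British Columbia",
    "Me.": "Maine",
    "Ky.": "Kentucky",
    "Tenn.": "Tennessee",
    "Pa.": "Pennsylvania",
}


def qualifier_candidates(qualifier):
    """Turn one geographic qualifier into a list of names to look up."""
    # Type of place/jurisdiction: "Alaska : Bay" -> "Alaska"
    text = qualifier.split(" : ", 1)[0]
    # Intermediate place name: "Berks County, Pa." -> "Pa."
    text = text.rsplit(",", 1)[-1].strip()
    # Multiple names: "Ky. and Tenn." -> ["Ky.", "Tenn."]
    parts = [part.strip() for part in text.split(" and ")]
    # Abbreviated names: "Pa." -> "Pennsylvania"
    return [ABBREVIATIONS.get(part, part) for part in parts]


for sample in ["B.C.", "Ky. and Tenn.", "Alaska : Bay", "Berks County, Pa."]:
    print(sample, "->", qualifier_candidates(sample))
```

Taking only the last comma-separated segment is a simplification that suits the U.S. and Canadian examples here; it would need adjusting for countries whose GACs go below the country level.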
Issues in Name-Code Conversion
• Inconsistent Practices
• Australia
• Has codes down to State level, e.g. Victoria (u-at-vi)
• MARC 651 has states as geographic qualifier, e.g. Sydney (N.S.W.)
• Geographic subdivision ($z) has Australia as the 1st $z, followed by the local place name with the state name as geographic qualifier in the 2nd $z
• $z Australia $z Sydney (N.S.W.)
• PRC China
• Has codes down to Province or Municipal level, e.g. Shanxi Sheng (a-cc-sh), Beijing (a-cc-pe)
• MARC 651 has
• Provinces & country as qualifier, e.g. Taiyuan (Shanxi Sheng, China)
• Country as qualifier, e.g. Beijing (China)
• Geographic subdivision ($z)
• $z China $z Taiyuan (Shanxi Sheng)
• $z China $z Beijing
Issues in Name-Code Conversion
• Malaysia (SHM H810)
• Only has code down to Country level (a-my---)
• MARC 651 has
• Provinces & country as qualifier, e.g. Kuching (Sarawak, Malaysia)
• Geographic subdivision ($z)
• $z Malaysia $z Kuching (Sarawak)
• Korea (South) vs Korea (North)
• Qualifier is dropped when qualifying a local place name (LC-PCC PS 16.2.2.4)
• Seoul (Korea) -- $z Korea (South) $z Seoul
• P'yŏngyang (Korea) -- $z Korea (North) $z P'yŏngyang
• Needs an exhaustive list of Korean place names for matching
• From LCNAF & LCSH files
XSLT Process
[Diagram: MARC Records → XSLT Process → MARC Records with GACs. Lookup tables used by the XSLT process: Abbrev. → Full; Name → Code; Korean Place Names; Deprecated → Current Code]
PART 2: Rovi Music Collection
Metadata Enhancement by API
• Rovi Music Collection
• Spans mid-1980s to 2014
• American and some international markets
• 681,000 CDs
• Rovi Music Collection:
• Increased physical music holdings by over 42 times
• Very basic metadata included UPC
• Required automation
Phased Cataloging Process
• Phase 1 – Local Holdings Lookup
[Diagram: UPCs → HTTP Query → MSU OPAC XML Server; If Found → Item records for Rovi Holdings → MSU OPAC]
Phased Cataloging Process
• Phase 2 – Locating Copy Records
[Diagram: Remaining UPCs from Phase 1 → SRU Query; If Found → Download Copy Records → Sierra API]
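As an illustration of the Phase 2 lookup, an SRU searchRetrieve request for one UPC might be assembled as below. The slides do not name the SRU server or its index names, so the base URL and the `upc` index are placeholders; the request parameters themselves are the standard SRU ones.

```python
from urllib.parse import urlencode


def sru_search_url(base_url, upc, index="upc", max_records=1):
    """Build an SRU searchRetrieve URL for a single UPC.

    `base_url` and `index` are placeholders: each SRU server documents
    its own endpoint and the CQL index to use for UPC searching.
    """
    params = {
        "operation": "searchRetrieve",
        "version": "1.1",
        "query": f'{index}="{upc}"',        # CQL query
        "maximumRecords": str(max_records),
        "recordSchema": "marcxml",
    }
    return f"{base_url}?{urlencode(params)}"


url = sru_search_url("https://sru.example.org/catalog", "012345678905")
print(url)
```

A response with zero records would leave the UPC in the "remaining" pool for Phase 3.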
Phased Cataloging Process
• Phase 3 – Brief Record Generation (Music)
[Diagram: Remaining UPCs from Phase 2 + Metadata from Donor → Brief Records → Sierra]
Limitations of brief records
• Rovi data do not differentiate personal name from corporate name
• Broad genre terms mapped from Rovi proprietary terms
• For classical music, only performers are listed
Remedies
• Record Enhancement
[Diagram: Phase 3 Records → Enhanced Records]
Discogs & MusicBrainz
• Discogs.com
• Crowdsourced music database with more than 7.4 million entries
• Users contribute entries for sound recordings
• Controlled list of “Style” terms
• MusicBrainz.org
• Open content music database
• Entries are maintained by volunteer editors
• Differentiation between personal name & corporate body name
• Links to external services, e.g. VIAF, Wikidata, Discogs.com, etc.
• Uncontrolled keywords for genre
Application Programming Interface (API)
• “A software tool…which performs a particular computational function…APIs act
as building blocks allowing software developers to create new applications
without having to code every function from scratch.”*
* Daniel Chandler and Rod Munday, “Application Programming Interface” in A Dictionary of Media and Communication (Oxford University Press, 2011)
[Diagram: a Query is sent through the API to the Database, which returns a Result]
Discogs & MusicBrainz API details
• MusicBrainz.org
• Non-commercial use of the web service is free
• Data in MusicBrainz Database is licensed under CC0
• Query result available in XML and JSON (beta) formats
• Searchable by UPC and many other typical data points
• Documentation https://musicbrainz.org/doc/Development
• Discogs.com
• Data is licensed under the “CC0 No Rights Reserved” license
• Query result available in JSON format only
• Not searchable by UPC though may be available in the returned JSON
• Documentation https://www.discogs.com/developers/
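A UPC search against the MusicBrainz web service could look roughly like the sketch below, which only builds the request and does not send it. The function name and the User-Agent string are invented for the example; the `barcode` search field and the `fmt=json` parameter belong to the MusicBrainz release search, but the current documentation should be checked before relying on them.

```python
from urllib.parse import urlencode
from urllib.request import Request

MUSICBRAINZ_RELEASE_SEARCH = "https://musicbrainz.org/ws/2/release/"


def musicbrainz_barcode_request(upc, user_agent):
    """Build (but do not send) a release search request for one UPC."""
    params = urlencode({"query": f"barcode:{upc}", "fmt": "json"})
    # MusicBrainz asks clients to identify themselves with a
    # meaningful User-Agent header.
    return Request(
        f"{MUSICBRAINZ_RELEASE_SEARCH}?{params}",
        headers={"User-Agent": user_agent},
    )


req = musicbrainz_barcode_request(
    "012345678905",
    "example-catalog-script/0.1 (cataloger@example.org)",  # placeholder
)
print(req.full_url)
```

Passing `req` to `urllib.request.urlopen` would perform the lookup; the matching release can then be followed to its Discogs link when one is present.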
Sample MusicBrainz.org XML Result
Sample Discogs.com JSON Result
Mapping from Discogs.com terms to LC terms
Authorized Access Point Lookup
[Diagram: Brief Records → MARC from MusicBrainz → follows link to Discogs → MARC from Discogs → Authorized Access Point Lookup → Enhanced Records]
Outcomes
• Benefits of the process
• More granular genre terms from Discogs.com
• Possible authorized forms of name from LC
• Correct tagging (700 vs 710) of names
• Limitations
• 1 query/sec. allowed in both APIs
• “503 Service unavailable” HTTP error
• Hard to dedup lists of names from two sources
• UPC lookup failure in MusicBrainz.org
• Failure to retrieve record from Discogs.com even if record is available
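The rate limit and the intermittent 503 errors suggest a throttled loop with a small number of retries. The following is a sketch under assumptions, not the project's script: `fetch` is any caller-supplied function that takes a URL and raises `urllib.error.HTTPError` on failure, and the retry count and back-off are arbitrary choices.

```python
import time
from urllib.error import HTTPError


def fetch_politely(urls, fetch, min_interval=1.0, max_retries=3,
                   sleep=time.sleep, clock=time.monotonic):
    """Call fetch(url) for each URL, at most one request per interval.

    A 503 response is retried up to max_retries times with a growing
    pause; a URL that still fails is recorded as None. Any other HTTP
    error is re-raised.
    """
    results = {}
    last_request = None
    for url in urls:
        for attempt in range(max_retries + 1):
            # Wait until at least min_interval has passed since the
            # previous request.
            if last_request is not None:
                wait = min_interval - (clock() - last_request)
                if wait > 0:
                    sleep(wait)
            last_request = clock()
            try:
                results[url] = fetch(url)
                break
            except HTTPError as err:
                if err.code != 503:
                    raise
                if attempt == max_retries:
                    results[url] = None
                else:
                    sleep(min_interval * (attempt + 1))  # simple back-off
    return results
```

The `sleep` and `clock` parameters exist only so the loop can be exercised without real waiting, for example by passing `sleep=lambda seconds: None`.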
PART 3: Google Books Dataset
Google Dataset at MSU
• All public domain, Google-digitized books:
• OCR text (not page-images)
• 3 million volumes
• 3 TB zipped text
• 12 GB MarcXML metadata (aka catalog records)
• Remotely synced with HathiTrust
Full-text not intended for reading/public display, but for “Non-consumptive research” (aka Text mining)
Accessible → Usable
• Stored in a “Pairtree” directory structure:
• Unique ID = miua.0048030.1838.001
• Path to Item = /miua/pairtree_root/00/48/03/0,/18/38/,0/01/
• Not intuitive for human access but adds stability to file system and quick access for machines
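The mapping from ID to path shown above can be reproduced with a short function. This sketch covers only what the example needs: it treats the text before the first period as the namespace, applies Pairtree's single-character substitutions, and splits the result into two-character directory names. The hex-encoding step that Pairtree defines for other special characters is left out, and the function name is an assumption.

```python
def pairtree_path(item_id):
    """Map an ID such as 'miua.0048030.1838.001' to its Pairtree path."""
    namespace, local_id = item_id.split(".", 1)
    # Pairtree single-character substitutions: / -> =, : -> +, . -> ,
    cleaned = local_id.translate(str.maketrans("/:.", "=+,"))
    # Split the cleaned ID into two-character directory names.
    segments = [cleaned[i:i + 2] for i in range(0, len(cleaned), 2)]
    return "/" + "/".join([namespace, "pairtree_root", *segments]) + "/"


print(pairtree_path("miua.0048030.1838.001"))
# /miua/pairtree_root/00/48/03/0,/18/38/,0/01/
```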
“Workset” Creation
• Subset of the larger dataset built around specific features:
• Publication Date
• Language
• Author
• Literary Form (poetry, fiction, etc.)
• Bibliographic Level (monograph, serials)
• Content Type (text, map, musical score, etc.)
• Nature of Contents (theses, catalogs, etc.)
Sample Workset Query
• “19th-Century French Fiction”
• Publication Date: Between 1800 and 1899
• Language: French
• Literary Form: Fiction+Novel+Short Stories
• Bibliographic Level: Monograph [maybe]
• Search Results: 966 volumes
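Since selected fields were indexed in Solr, a workset like this one could plausibly be expressed as a set of Solr filter queries. The field names below (`pub_date`, `language`, `literary_form`) are invented for the example and are not taken from the project's schema; `q`, `fq`, and `rows` are standard Solr request parameters.

```python
from urllib.parse import urlencode


def workset_query(start_year, end_year, language, literary_forms, rows=0):
    """Build a Solr query string for a date/language/form workset."""
    forms = " OR ".join(f'"{form}"' for form in literary_forms)
    params = [
        ("q", "*:*"),                                        # match everything
        ("fq", f"pub_date:[{start_year} TO {end_year}]"),    # inclusive range
        ("fq", f'language:"{language}"'),
        ("fq", f"literary_form:({forms})"),
        ("rows", str(rows)),                                 # 0 = count only
    ]
    return urlencode(params)


query_string = workset_query(
    1800, 1899, "French", ["Fiction", "Novel", "Short Stories"]
)
print(query_string)
```

Using separate `fq` clauses keeps each feature independent, so adding or dropping a criterion such as bibliographic level does not affect the others.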
Results Page
• Download full results or random sample
• Download text, bibliographic and/or technical metadata
• Download zipped or unzipped volumes
• Download ID list for use with HathiTrust API (or to make email request)
Working with Metadata
• Python scripts to parse MarcXML metadata
• “Streaming” parser because metadata files were too large to hold in memory
• Stored all MARC data in relational database (MySQL)
• Additionally, processed selected fields to index using Solr
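A streaming parse of MarcXML can be sketched with the standard library's `iterparse`, which hands over each element as soon as it is complete instead of loading the whole file. This is an illustration, not the project's script; the function names and the dictionary shape are assumptions, and the database and Solr steps are omitted.

```python
import io
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip an XML namespace, e.g. '{http://...}record' -> 'record'."""
    return tag.rsplit("}", 1)[-1]


def stream_records(source):
    """Yield one dict per <record>: {tag: [[(code, value), ...], ...]}."""
    for _event, elem in ET.iterparse(source, events=("end",)):
        if local_name(elem.tag) != "record":
            continue
        fields = {}
        for datafield in elem:
            if local_name(datafield.tag) != "datafield":
                continue
            subfields = [
                (sub.get("code"), sub.text or "")
                for sub in datafield
                if local_name(sub.tag) == "subfield"
            ]
            fields.setdefault(datafield.get("tag"), []).append(subfields)
        yield fields
        elem.clear()  # drop the record's children to limit memory use


sample = io.BytesIO(
    b'<collection><record><datafield tag="100">'
    b'<subfield code="a">Emerson, Ralph Waldo,</subfield>'
    b'<subfield code="d">1803-1882.</subfield>'
    b"</datafield></record></collection>"
)
for record in stream_records(sample):
    print(record)
```

Each yielded record could then be written to a database row or an index document before the next one is read.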
Limitations of Current Tool
• Subsetting by bibliographic data only
• Not able to answer:
• How would I gather works by 16th-century women? By 19th-century men?
• Works by displaced/exiled authors during WWII
Linked Data
<subject><predicate><object>
<uri:Hamlet><uri:hasAuthor><uri:Shakespeare>
<uri:Shakespeare><uri:Birthplace><uri:Stratford-Upon-Avon>
<uri:Stratford-Upon-Avon><uri:Country><uri:United_Kingdom>
<uri:United_Kingdom><uri:JanHighC><6.9 degrees>
Semantic connections that allow reasoning:
High temperature in January of the country that the author of Hamlet was born in...
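The chain of reasoning above amounts to following one predicate after another through the triples. A toy version, using the slide's own example values and a plain dictionary in place of a real triple store, might look like this (the function name is an assumption):

```python
# The four example triples from the slide, as (subject, predicate, object).
TRIPLES = [
    ("Hamlet", "hasAuthor", "Shakespeare"),
    ("Shakespeare", "Birthplace", "Stratford-Upon-Avon"),
    ("Stratford-Upon-Avon", "Country", "United_Kingdom"),
    ("United_Kingdom", "JanHighC", "6.9 degrees"),
]

# Index the triples by (subject, predicate) for direct lookup.
INDEX = {(s, p): o for s, p, o in TRIPLES}


def follow(subject, predicates):
    """Start at subject and follow each predicate in turn."""
    node = subject
    for predicate in predicates:
        node = INDEX[(node, predicate)]  # KeyError if the link is missing
    return node


answer = follow("Hamlet", ["hasAuthor", "Birthplace", "Country", "JanHighC"])
print(answer)  # 6.9 degrees
```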
<datafield tag="100">
<subfield code="a">Emerson, Ralph Waldo,</subfield>
<subfield code="d">1803-1882.</subfield>
</datafield>
<datafield tag="245">
<subfield code="a">Representative men :</subfield>
<subfield code="b">seven lectures.</subfield>
</datafield>
<datafield tag="650">
<subfield code="a">English language</subfield>
<subfield code="x">Rhetoric.</subfield>
</datafield>
Book Metadata → Linked Data
MarcXML Data
http://viaf.org/viaf/27079964/
• Language: EN - English
• Nationality: US - United States
• Gender: Male
http://dbpedia.org/resource/Ralph_Waldo_Emerson
• Philosophical School: Transcendentalism
• Influenced by: Hegel, Montaigne, Kant...
• Notable Idea: Over-Soul, Self-Reliance
• Influenced: Musil, Thoreau, Proust...
• Founder of: The Atlantic
• Subjects: American diarists, American Unitarians, 1803 Births, Mystics...
• Birthplace: Massachusetts; Boston
• Death Place: Concord, Massachusetts
Linked Data
• Compiling URIs via:
• WorldCat Identities API (author last name and OCLC number search)
• LC Linked Data Service (author name match on authoritative name)
• Query of multiple sources to check results.
Linked Data Implementation Scenario 1
• Store and index data points locally
• Store URIs and retrieved contextual data as text
• Fast search & retrieval
• Regular refresh of data required to capture new/updated data points
Linked Data Implementation Scenario 2
• Store harvested URIs locally
• Retrieve data points from remote data stores using harvested URIs
• Most up-to-date data
• Have to overcome system performance and result normalization issues
Provide Author Context
• Display information about author on the fly following author search
• Using stored URI, query DBpedia for author context
• Show thumbnail, etc. to user
• Include link to Wikipedia and other info stores
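A lookup of this kind could be phrased as a SPARQL query against the public DBpedia endpoint, as sketched below. The choice of properties (`dbo:thumbnail`, `dbo:abstract`, `foaf:isPrimaryTopicOf`), the function name, and the `format` parameter are assumptions for the example rather than details from the project; the code only builds the request URL.

```python
from urllib.parse import urlencode

DBPEDIA_ENDPOINT = "https://dbpedia.org/sparql"


def author_context_query(author_uri):
    """Return a SPARQL query for a thumbnail, abstract and Wikipedia link."""
    return f"""
PREFIX dbo: <http://dbpedia.org/ontology/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?thumbnail ?abstract ?wikipedia WHERE {{
  OPTIONAL {{ <{author_uri}> dbo:thumbnail ?thumbnail . }}
  OPTIONAL {{ <{author_uri}> dbo:abstract ?abstract .
             FILTER (lang(?abstract) = "en") }}
  OPTIONAL {{ <{author_uri}> foaf:isPrimaryTopicOf ?wikipedia . }}
}} LIMIT 1
""".strip()


query = author_context_query("http://dbpedia.org/resource/Ralph_Waldo_Emerson")
url = DBPEDIA_ENDPOINT + "?" + urlencode(
    {"query": query, "format": "application/sparql-results+json"}
)
print(query)
```

Because each pattern is optional, an author with no thumbnail still returns whatever other context is available.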
Wrap Up
• These projects draw on different but complementary skillsets
• Use similar data sources
• For example: LC Linked Data Service
• Resulting expertise informs other projects, which involve some of the same key players
Wrap Up
• Example future projects:
• Linked Data Cross-Divisional Team
• Experimenting with transforming MARC bibliographic data to BIBFRAME
• Linked Data enrichment in digital collections
• API development to harvest third-party metadata for partial cataloging automation
Thank you!
Questions?
Lucas Mak – makw@msu.edu
Devin Higgins – higgi135@msu.edu
Autumn Faulkner – autumn@msu.edu
Joshua Barton – bartonjp@msu.edu

 
Ship[w]right[e]s? : the challenges of cataloguing reports from scientific exp...
Ship[w]right[e]s? : the challenges of cataloguing reports from scientific exp...Ship[w]right[e]s? : the challenges of cataloguing reports from scientific exp...
Ship[w]right[e]s? : the challenges of cataloguing reports from scientific exp...
 
BFI Reuben Library : an RDA implementation story / Anastasia Kerameos (BFI Re...
BFI Reuben Library : an RDA implementation story / Anastasia Kerameos (BFI Re...BFI Reuben Library : an RDA implementation story / Anastasia Kerameos (BFI Re...
BFI Reuben Library : an RDA implementation story / Anastasia Kerameos (BFI Re...
 
RDA implementation at the British Library / Thurstan Young (British Library)
RDA implementation at the British Library / Thurstan Young (British Library)RDA implementation at the British Library / Thurstan Young (British Library)
RDA implementation at the British Library / Thurstan Young (British Library)
 
Community forward : developing descriptive cataloguing of rare materials (RDA...
Community forward : developing descriptive cataloguing of rare materials (RDA...Community forward : developing descriptive cataloguing of rare materials (RDA...
Community forward : developing descriptive cataloguing of rare materials (RDA...
 
The West Midlands Evidence Repository (WMER) : a regional collaboration proje...
The West Midlands Evidence Repository (WMER) : a regional collaboration proje...The West Midlands Evidence Repository (WMER) : a regional collaboration proje...
The West Midlands Evidence Repository (WMER) : a regional collaboration proje...
 
Authority of assertion in repository contributions to the PID graph / George ...
Authority of assertion in repository contributions to the PID graph / George ...Authority of assertion in repository contributions to the PID graph / George ...
Authority of assertion in repository contributions to the PID graph / George ...
 

Script away!!: APIs, XSLT, and linked data sets for creating and enriching bibliographic data / Lucas Mak, Devin Higgins, Autumn Faulkner, Joshua Barton

  • 1. Script away!! APIs, XSLT, and linked data sets for creating and enriching bibliographic data Lucas Mak | Devin Higgins | Autumn Faulkner | Joshua Barton Michigan State University Libraries
  • 2. Three Projects: • Geographic Area Code Enrichment • Applying XSLT • Rovi Music Collection • Metadata enhancement via API • Google Books Dataset • Applying Python, leveraging APIs
  • 3. PART 1: Geographic Area Code Enrichment https://www.loc.gov/marc/geoareas/
  • 4. Geographic Area Code Enrichment • Objective • Area studies librarians want to analyze collections in their respective areas by Geographic Area Codes (GAC) in catalog records • Problems • Catalog records lacking GAC • GACs contained in catalog record may not be comprehensive • Until 2010, the Library of Congress (LC) assigned a maximum of three GACs to any one bibliographic record
  • 5. Geographic Area Codes • Library of Congress Subject Heading Manual • Appendix E governs the assignment of GAC • Heading is tagged 651 or contains a geographic subdivision ($z) • Location of individual named entities • Events, exhibitions, movements, etc. • And many more …
  • 6. Solution • Inserting GAC based on subject headings • MARC 651$a • Geographic subdivisions ($z) • Government bodies subordinate to a jurisdictional heading (610 10$a) • Geographic qualifier of conference headings (610$c & 611$c ) • Exclusions • Ethnic groups, nationalities, civilizations, etc. • False match possibility & matching efficiency • Non-jurisdictional corporate bodies (610 20) • Difficult to determine location of the corporate bodies
  • 7. Conversion Table • Source of data • id.loc.gov • LC has published the GAC data as linked data (http://id.loc.gov/vocabulary/geographicAreas) • Bulk download available (http://id.loc.gov/download/) • RDF/XML, N-Triples, and Turtle • Data conversion • RDF/XML file into XML tables • Name-Code conversion table • United States → n-us--- • Deprecated-Current Code conversion table • e.g. a-hk--- → a-cc-hk
  • 9. • Obsolete code conversion table
  • 10. Default Processing Logic • 1st geographic subdivisions ($z) matches as is • $z Ohio $z Cleveland → Matches “Ohio” to the conversion table • MARC 651$a, 610 10$a, 610$c, and 611$c • Does not have qualifier (e.g. Ohio), matches as is • Has qualifier (e.g. Cleveland (Ohio) or Cleveland, Ohio), extract qualifier for matching • Output • Converts deprecated GACs into current GACs (e.g. a-hk--- → a-cc-hk) • Keeps existing GACs in 043 if current • Dedups newly generated GACs against the existing 043 • Only outputs unique GACs
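The default logic on this slide can be sketched in a few lines. The project itself used XSLT; the Python below is only an illustration, and the two dictionaries are tiny hypothetical excerpts standing in for the conversion tables built from id.loc.gov. The function names `match_term` and `enrich_043` are assumptions made for this sketch.

```python
import re

# Hypothetical excerpts of the two conversion tables.
NAME_TO_CODE = {"Ohio": "n-us-oh", "United States": "n-us---"}
DEPRECATED_TO_CURRENT = {"a-hk---": "a-cc-hk"}

# "Cleveland (Ohio)" style: place name followed by a parenthetical qualifier.
QUALIFIER = re.compile(r"^[^(]+\((?P<qual>[^)]+)\)\s*$")


def match_term(heading):
    """Return the string to look up: the qualifier if present, else the heading."""
    heading = heading.strip()
    m = QUALIFIER.match(heading)
    if m:
        return m.group("qual").strip()
    if "," in heading:  # "Cleveland, Ohio" style
        return heading.rsplit(",", 1)[1].strip()
    return heading


def enrich_043(existing, first_z, headings):
    """Merge existing 043 codes with codes derived from subject headings.

    existing: GACs already in the 043 field
    first_z:  first $z of each subject heading, matched as is
    headings: 651$a, 610 10$a, 610$c and 611$c values
    """
    out = []
    # Keep existing codes, replacing deprecated ones with the current code.
    for code in existing:
        code = DEPRECATED_TO_CURRENT.get(code, code)
        if code not in out:
            out.append(code)
    # Add newly generated codes, skipping anything already present.
    for term in list(first_z) + [match_term(h) for h in headings]:
        code = NAME_TO_CODE.get(term)
        if code and code not in out:
            out.append(code)
    return out


print(enrich_043(["a-hk---"], ["Ohio"], ["Cleveland (Ohio)", "Cleveland, Ohio"]))
```

Names that are not in the table are silently skipped here; a production run would probably log them for review.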
  • 11. Issues in Name-Code Conversion • Special Patterns in Geographic qualifiers • Abbreviated state/provincial names (RDA Appendix B.11) • Vancouver (B.C.) in 651$a, “Portland, Me.” in 611$c • Multiple country/state/provincial names • Cumberland River (Ky. and Tenn.) • Type of place/jurisdiction • Chignik Lagoon (Alaska : Bay), Addison (Ohio : Township) • Intermediate place name • Albany (Berks County, Pa.)
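A rough Python sketch of how the four qualifier patterns above might be normalised before the table lookup. The abbreviation table is a hypothetical excerpt, the function name `jurisdictions` is an assumption, and the original work handled these cases in XSLT rather than Python.

```python
# Hypothetical excerpt of an abbreviation-to-full-name table.
ABBREVIATIONS = {
    "B.C.": "British Columbia",
    "Me.": "Maine",
    "Ky.": "Kentucky",
    "Tenn.": "Tennessee",
    "Pa.": "Pennsylvania",
}


def jurisdictions(qualifier):
    """Reduce a geographic qualifier to a list of full jurisdiction names."""
    # "Alaska : Bay" -> "Alaska": drop the type of place/jurisdiction.
    base = qualifier.split(" : ", 1)[0]
    names = []
    # "Ky. and Tenn." -> two names. Caveat: this naive split would also
    # break a single name that itself contains " and ".
    for part in base.split(" and "):
        # "Berks County, Pa." -> "Pa.": skip the intermediate place name.
        last = part.rsplit(",", 1)[-1].strip()
        # Expand the abbreviation if it is known, otherwise keep as is.
        names.append(ABBREVIATIONS.get(last, last))
    return names


for q in ["B.C.", "Ky. and Tenn.", "Alaska : Bay", "Berks County, Pa."]:
    print(q, "->", jurisdictions(q))
```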
  • 13. Issues in Name-Code Conversion • Inconsistent Practices • Australia • Has codes down to State level, e.g. Victoria (u-at-vi) • MARC 651 has states as geographic qualifier, e.g. Sydney (N.S.W.) • Geographic subdivision ($z) has Australia as the 1st $z and followed by local place name with state name as geographic qualifier in the 2nd $z • $z Australia $z Sydney (N.S.W.) • PRC China • Has codes down to Province or Municipal level, e.g. Shanxi Sheng (a-cc-sh), Beijing (a-cc-pe) • MARC 651 has • Provinces & country as qualifier, e.g. Taiyuan (Shanxi Sheng, China) • Country as qualifier, e.g. Beijing (China) • Geographic subdivision ($z) • $z China $z Taiyuan (Shanxi Sheng) • $z China $z Beijing
  • 14. Issues in Name-Code Conversion • Malaysia (SHM H810) • Only has code down to Country level (a-my---) • MARC 651 has • Provinces & country as qualifier, e.g. Kuching (Sarawak, Malaysia) • Geographic subdivision ($z) • $z Malaysia $z Kuching (Sarawak) • Korea (South) vs Korea (North) • Qualifier is dropped when qualifying a local place name (LC-PCC PS 16.2.2.4) • Seoul (Korea) -- $z Korea (South) $z Seoul • P'yŏngyang (Korea) -- $z Korea (North) $z P'yŏngyang • Needs an exhaustive list of Korean place names for matching • From LCNAF & LCSH files
  • 16. Workflow diagram: MARC Records → XSLT Process → MARC Records with GACs • Lookup tables shown: Abbrev. → Full Name → Code • Korean Place Names • Deprecated → Current Code
  • 17. PART 2: Rovi Music Collection
  • 18. Metadata Enhancement by API • Rovi Music Collection • Spans mid-1980s to 2014 • American and some international markets • 681,000 CDs
  • 19. • Rovi Music Collection: • Increased physical music holdings by over 42 times • Very basic metadata included UPC • Required automation
  • 20. Phased Cataloging Process • Phase 1 – Local Holdings Lookup • Diagram labels: UPCs; HTTP Query; MSU OPAC XML Server; MSU OPAC; If Found; Item records for Rovi Holdings
  • 21. Phased Cataloging Process • Phase 2 – Locating Copy Records • Diagram labels: Remaining UPCs from Phase 1; SRU Query; If Found; Download Copy Records; Sierra API
  • 22. Phased Cataloging Process • Phase 3 – Brief Record Generation (Music) • Diagram labels: Remaining UPCs from Phase 2; Metadata from Donor; Brief Records; Sierra
  • 23. Limitations of brief records • Rovi data do not differentiate personal name from corporate name • Broad genre terms mapped from Rovi proprietary terms • For classical music, only performers are listed
  • 24. Remedies • Record Enhancement Phase 3 Records Enhanced Records
  • 25. Discogs & MusicBrainz • Discogs.com • Crowdsourced music database with more than 7.4 million entries • Users contribute entries for sound recordings • Controlled list of “Style” terms • MusicBrainz.org • Open content music database • Entries are maintained by volunteer editors • Differentiation between personal name & corporate body name • Links to external services, e.g. VIAF, Wikidata, Discogs.com, etc. • Uncontrolled keywords for genre
  • 26. Application Programming Interface (API) • “A software tool…which performs a particular computational function…APIs act as building blocks allowing software developers to create new applications without having to code every function from scratch.”* * Daniel Chandler and Rod Munday, “Application Programming Interface” in A Dictionary of Media and Communication (Oxford University Press, 2011) • Diagram labels: Query; API; Database; Result
  • 27. Discogs & MusicBrainz API details • MusicBrainz.org • Non-commercial use of the web service is free • Data in MusicBrainz Database is licensed under CC0 • Query result available in XML and JSON (beta) formats • Searchable by UPC and many other typical data points • Documentation https://musicbrainz.org/doc/Development • Discogs.com • Data is licensed under the “CC0 No Rights Reserved” license • Query result available in JSON format only • Not searchable by UPC though may be available in the returned JSON • Documentation https://www.discogs.com/developers/
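As an illustration of the UPC lookup, the sketch below builds a MusicBrainz release search URL using the `barcode` search field and pulls release IDs out of a JSON response. The UPC, the sample response, and the User-Agent string are made up for this example; the presenters' actual scripts are not shown in the slides.

```python
import json
import urllib.request
from urllib.parse import urlencode

MB_RELEASE_SEARCH = "https://musicbrainz.org/ws/2/release/"


def barcode_search_url(upc):
    """Build a release search URL that filters on the barcode (UPC) field."""
    return MB_RELEASE_SEARCH + "?" + urlencode({"query": f"barcode:{upc}", "fmt": "json"})


def fetch(url, user_agent="ExampleCatalogScript/0.1 (admin@example.org)"):
    """Fetch a URL. MusicBrainz asks clients to send a meaningful User-Agent;
    the value here is a placeholder."""
    req = urllib.request.Request(url, headers={"User-Agent": user_agent})
    with urllib.request.urlopen(req) as resp:
        return resp.read().decode("utf-8")


def release_ids(response_text):
    """Return (MusicBrainz ID, title) pairs from a release search response."""
    data = json.loads(response_text)
    return [(r["id"], r.get("title", "")) for r in data.get("releases", [])]


# Offline demonstration with a hypothetical, heavily trimmed response body.
sample = '{"releases": [{"id": "hypothetical-mbid", "title": "Example Album"}]}'
print(barcode_search_url("0123456789012"))
print(release_ids(sample))
```

An empty `releases` list corresponds to the "UPC lookup failure" case, in which the brief record would simply stay unenhanced.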
  • 30. Mapping from Discogs.com terms to LC terms
  • 33. Enhanced Records Brief Records MARC from MusicBrainz Follows link to Discogs MARC from Discogs Authorized Access Point Lookup
  • 34. Outcomes • Benefits of the process • More granular genre terms from Discogs.com • Possible authorized forms of name from LC • Correct tagging (700 vs 710) of names • Limitations • 1 query/sec. allowed in both APIs • “503 Service unavailable” HTTP error • Hard to dedup lists of names from two sources • UPC lookup failure in MusicBrainz.org • Failure to retrieve record from Discogs.com even if record is available
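The one-query-per-second limit and the intermittent 503 errors suggest a throttled loop with retries. The following is a minimal sketch of that idea, not the presenters' code: `polite_fetch` and `ServiceUnavailable` are names invented here, and the caller supplies the function that performs the HTTP request.

```python
import time


class ServiceUnavailable(Exception):
    """Raised by the caller-supplied fetch function on an HTTP 503."""


def polite_fetch(fetch, urls, min_interval=1.0, max_retries=3,
                 sleep=time.sleep, clock=time.monotonic):
    """Call fetch(url) for each URL, at most once per min_interval seconds.

    A 503 is retried up to max_retries times with a growing pause;
    a URL that still fails is recorded as None.
    """
    results = {}
    last_call = None
    for url in urls:
        for attempt in range(max_retries + 1):
            # Wait out the remainder of the interval since the last request.
            if last_call is not None:
                wait = min_interval - (clock() - last_call)
                if wait > 0:
                    sleep(wait)
            last_call = clock()
            try:
                results[url] = fetch(url)
                break
            except ServiceUnavailable:
                if attempt == max_retries:
                    results[url] = None  # give up on this URL
                else:
                    sleep(min_interval * 2 ** attempt)  # back off, then retry
    return results
```

Passing `sleep` and `clock` as parameters keeps the function testable without real delays.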
  • 35. PART 3: Google Books Dataset
  • 36. Google Dataset at MSU • All public domain, Google-digitized books: • OCR text (not page-images) • 3 million volumes • 3 TB zipped text • 12 GB MarcXML metadata (aka catalog records) • Remotely synced with HathiTrust
  • 37. Full-text not intended for reading/public display, but for “Non-consumptive research” (aka Text mining)
  • 41. Accessible → Usable • Stored in a “Pairtree” directory structure: • Unique ID = miua.0048030.1838.001 • Path to Item = • /miua/pairtree_root/00/48/03/0,/18/38/,0/01/ • Not intuitive for human access but adds stability to file system and quick access for machines
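The mapping from ID to path can be reproduced with a short function. This sketch assumes the namespace is everything before the first dot and applies only the single-character substitutions of the Pairtree convention (`/` to `=`, `:` to `+`, `.` to `,`); the hex-encoding step for other special characters is omitted. The function name is an invention for this example.

```python
def pairtree_path(item_id):
    """Convert an ID such as 'miua.0048030.1838.001' to its Pairtree directory."""
    # Assumption: the namespace is the part before the first dot.
    namespace, local_id = item_id.split(".", 1)
    # Replace characters that are awkward in file paths.
    cleaned = local_id.translate(str.maketrans({"/": "=", ":": "+", ".": ","}))
    # Split the cleaned ID into two-character directory names.
    pairs = [cleaned[i:i + 2] for i in range(0, len(cleaned), 2)]
    return "/" + namespace + "/pairtree_root/" + "/".join(pairs) + "/"


print(pairtree_path("miua.0048030.1838.001"))
# -> /miua/pairtree_root/00/48/03/0,/18/38/,0/01/
```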
  • 42. “Workset” Creation • Subset of the larger dataset built around specific features: • Publication Date • Language • Author • Literary Form (poetry, fiction, etc.) • Bibliographic Level (monograph, serials) • Content Type (text, map, musical score, etc.) • Nature of Contents (theses, catalogs, etc.)
  • 43. Sample Workset Query • “19th-Century French Fiction” • Publication Date: Between 1800 and 1899 • Language: French • Literary Form: Fiction+Novel+Short Stories • Bibliographic Level: Monograph [maybe] • Search Results: 966 volumes
  • 46.  Download full results or random sample  Download text, bibliographic and/or technical metadata  Download zipped or unzipped volumes  Download ID list for use with HathiTrust API (or to make email request)
  • 47. Working with Metadata • Python scripts to parse MarcXML metadata • “Streaming” parser because metadata files were too large to hold in memory • Stored all MARC data in relational database (MySQL) • Additionally, processed selected fields to index using Solr
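A streaming parse of this kind can be approximated with the standard library's `iterparse`, which hands over one record at a time instead of loading the whole file. The sketch assumes the records use the MARC21 slim namespace; `iter_records` and the returned dictionary layout are illustrative choices, and the database and Solr steps are left out.

```python
import io
import xml.etree.ElementTree as ET

MARC_NS = "{http://www.loc.gov/MARC21/slim}"  # assumed namespace of the files


def iter_records(source):
    """Yield one dict per MARC record: tag -> list of [(code, value), ...]."""
    for _event, elem in ET.iterparse(source, events=("end",)):
        if elem.tag != MARC_NS + "record":
            continue
        fields = {}
        for df in elem.iter(MARC_NS + "datafield"):
            subfields = [(sf.get("code"), sf.text or "")
                         for sf in df.iter(MARC_NS + "subfield")]
            fields.setdefault(df.get("tag"), []).append(subfields)
        yield fields
        # Drop the record's children so memory use stays roughly flat.
        elem.clear()


# Small in-memory example; a real run would pass a file path or file object.
sample = io.BytesIO(
    b'<collection xmlns="http://www.loc.gov/MARC21/slim"><record>'
    b'<datafield tag="100" ind1="1" ind2=" ">'
    b'<subfield code="a">Emerson, Ralph Waldo,</subfield>'
    b'<subfield code="d">1803-1882.</subfield>'
    b'</datafield></record></collection>'
)
for record in iter_records(sample):
    print(record["100"])
```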
  • 48. Limitations of Current Tool • Subsetting by bibliographic data only • Not able to answer: • How would I gather works by 16th-century women? By 19th-century men? • Works by displaced/exiled authors during WWII
  • 50. Book Metadata → Linked Data • MarcXML Data: <datafield tag="100"> <subfield code="a">Emerson, Ralph Waldo,</subfield> <subfield code="d">1803-1882.</subfield> </datafield> <datafield tag="245"> <subfield code="a">Representative men :</subfield> <subfield code="b">seven lectures.</subfield> </datafield> <datafield tag="650"> <subfield code="a">English language</subfield> <subfield code="x">Rhetoric.</subfield> </datafield> • http://viaf.org/viaf/27079964/ • Language: EN - English • Nationality: US - United States • Gender: Male • http://dbpedia.org/resource/Ralph_Waldo_Emerson • Philosophical School: Transcendentalism • Influenced by: Hegel, Montaigne, Kant... • Notable Idea: Over-Soul, Self-Reliance • Influenced: Musil, Thoreau, Proust... • Founder of: The Atlantic • Subjects: American diarists, American Unitarians, 1803 Births, Mystics... • Birthplace: Massachusetts, Boston • Death Place: Concord, Massachusetts
  • 51. Linked Data • Compiling URIs via: • WorldCat Identities API (author last name and OCLC number search) • LC Linked Data Service (author name match on authoritative name) • Query of multiple sources to check results.
  • 52. Linked Data Implementation Scenario 1 • Store and index data points locally • Store URIs and retrieved contextual data as text • Fast search & retrieval • Regular refresh of data required to capture new/updated data points
  • 53. Linked Data Implementation Scenario 2 • Store harvested URIs locally • Retrieve data points from remote data stores using harvested URIs • Most up-to-date data • Have to overcome system performance and result normalization issues
  • 54. Provide Author Context • Display information about author on the fly following author search • Using stored URI, query dbpedia for author context • Show thumbnail, etc. to user. • Include link to Wikipedia and other info stores
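One way the on-the-fly lookup could work is a SPARQL query against DBpedia's public endpoint, keyed on the stored URI. The sketch below only builds the query and request URL; the choice of `dbo:abstract` and `dbo:thumbnail` as the properties to retrieve, and the function names, are assumptions rather than details from the slides.

```python
from urllib.parse import urlencode

DBPEDIA_SPARQL = "https://dbpedia.org/sparql"


def author_context_query(resource_uri):
    """Build a SPARQL query for an English abstract and a thumbnail image."""
    # Reject URIs that could break out of the <...> delimiters.
    if any(ch in resource_uri for ch in "<> \n\t\""):
        raise ValueError("unexpected character in URI")
    return (
        "PREFIX dbo: <http://dbpedia.org/ontology/>\n"
        "SELECT ?abstract ?thumbnail WHERE {\n"
        f"  OPTIONAL {{ <{resource_uri}> dbo:abstract ?abstract . "
        "FILTER (lang(?abstract) = 'en') }\n"
        f"  OPTIONAL {{ <{resource_uri}> dbo:thumbnail ?thumbnail }}\n"
        "}"
    )


def author_context_url(resource_uri):
    """Return the GET URL that asks the endpoint for JSON results."""
    params = {
        "query": author_context_query(resource_uri),
        "format": "application/sparql-results+json",
    }
    return DBPEDIA_SPARQL + "?" + urlencode(params)


print(author_context_query("http://dbpedia.org/resource/Ralph_Waldo_Emerson"))
```

Because the lookup happens at display time, this corresponds to Scenario 2: the data is current, but the page depends on the remote endpoint responding quickly.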
  • 55. Wrap Up • These projects draw on different but complementary skillsets • Use similar data sources • For example: LC Linked Data Service • Resulting expertise informs other projects, which involve some of the same key players
  • 56. Wrap Up • Example future projects: • Linked Data Cross-Divisional Team • Experimenting with transforming MARC bibliographic data to BIBFRAME • Linked Data enrichment in digital collections • API development to harvest third-party metadata for partial cataloging automation
  • 57. Thank you! Questions? Lucas Mak – makw@msu.edu Devin Higgins – higgi135@msu.edu Autumn Faulkner – autumn@msu.edu Joshua Barton – bartonjp@msu.edu