SlideShare a Scribd company logo
1 of 33
A demonstration of transparent and scalable OpenURL quality metrics for use in promoting metadata consistency across content providers Adam Chandler Cornell University Library Cornell University Library, Metadata Working Group Forum 16 October 2009
OpenURL model
OpenURL model cont.  incoming OpenURL http://linkresolver.library.cornell.edu:4550/resserv?&url_ver=z39.88-2004&url_ctx_fmt=info:ofi/fmt:kev:mtx:ctx&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=item-level+usage+statistics+a+review+of+current+practices+and+recommendations+for+normalization+and+exchange&rft.auinit=c&rft.aulast=merk&rft.date=2009&rft.epage=162&rft.genre=article&rft.issn=0737-8831&rft.issue=1&rft.place=bingley&rft.pub=emerald+group+publishing+limited&rft.spage=151&rft.stitle=libr+hi+tech&rft.title=library+hi+tech&rft.volume=27&rfr_id=info:sid/www.isinet.com:wok:wos&rft.au=scholze,+f&rft.au=windisch,+n&rft_id=info:doi/10.1108%2f07378830910942991/ in our knowledge base? title: Library hi tech     issn: 0737-8831   start date: 19970101    end date:  link-to syntax for Emerald http://www.emeraldinsight.com/rpsv/cgi-bin/cgi?body=linker&reqidx=#@ISSN-HYPHEN#(#@DATE#)#@VOLUME#:#@ISSUE#L.#@SPAGE#
OpenURL is pervasive Cornell link resolver alone: July 1, 2008 – June 30, 2009: 402,000 OpenURL service requests. 402,000 * 123(ARL libraries) = 49 million
Cornell’s top 10 OpenURL sources Web of Knowledge WorldCat Local Google Scholar Webfeat (our “Find Articles” service) EBSCOHost OCLC FirstSearch SilverPlatter Weill Cornell Medical Center SciFinder Scholar  PubMed
example OpenURL http://linkresolver.library.cornell.edu:4550/resserv?&url_ver=z39.88-2004&url_ctx_fmt=info:ofi/fmt:kev:mtx:ctx&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=item-level+usage+statistics+a+review+of+current+practices+and+recommendations+for+normalization+and+exchange&rft.auinit=c&rft.aulast=merk&rft.date=2009&rft.epage=162&rft.genre=article&rft.issn=0737-8831&rft.issue=1&rft.place=bingley&rft.pub=emerald+group+publishing+limited&rft.spage=151&rft.stitle=libr+hi+tech&rft.title=library+hi+tech&rft.volume=27&rfr_id=info:sid/www.isinet.com:wok:wos&rft.au=scholze,+f&rft.au=windisch,+n&rft_id=info:doi/10.1108%2f07378830910942991/
example OpenURL (1) http://linkresolver.library.cornell.edu:4550/resserv?&url_ver=z39.88-2004 &url_ctx_fmt=info:ofi/fmt:kev:mtx:ctx &rft_val_fmt=info:ofi/fmt:kev:mtx:journal &rft.atitle=item-level+usage+statistics+a+review+of+current+practices+and+recommendations+for+normalization+and+exchange &rft.auinit=c &rft.aulast=merk &rft.date=2009 &rft.epage=162 &rft.genre=article &rft.issn=0737-8831
example OpenURL (2) &rft.issue=1 &rft.place=bingley &rft.pub=emerald+group+publishing+limited &rft.spage=151 &rft.stitle=libr+hi+tech &rft.title=library+hi+tech &rft.volume=27 &rfr_id=info:sid/www.isinet.com:wok:wos &rft.au=scholze,+f &rft.au=windisch,+n &rft_id=info:doi/10.1108%2f07378830910942991/
 … but quality of experience is difficult to benchmark Wrong start end date in the local library's holdings knowledge base (see NISO KBART) Semantically inaccurate metadata from the OpenURL origin (wrong ISSN, for example)  Wrong link-to syntax in link resolver Fragile handling of incoming links by content provider
 … but quality of experience is difficult to benchmark Inaccurate or missing Crossref DOI URL (sometimes the DOI registration process is out of sync with the mounting of articles) Subscription errors (especially with the start of a new calendar year) Syntactically incorrect or missing metadata from the OpenURL origin
Literature review I can identify no systematic study designed and carried out to benchmark the quality of linking. The OpenURL standard was introduced some ten years ago.
Wakimoto, Walker, and Dabbour (2006) Main finding: Users just expect full-text. When they do not get it they are disappointed. Jina Choi Wakimoto, David S. Walker, and Katherine S. Dabbour (2006). "The Myths and Realities of SFX in Academic Libraries." The Journal of Academic Librarianship 32 (2): 127–136
Wakimoto, Walker, and Dabbour (2006) "Where does SFX start and where does it end? If an SFX request does not result in a full-text link, does the problem lie with the source database’s metadata, the construction of the OpenURL request, the SFX KnowledgeBase, the SFX software, the resulting target resource, or even the local library’s collection development plan?" (p. 134) Jina Choi Wakimoto, David S. Walker, and Katherine S. Dabbour (2006). "The Myths and Realities of SFX in Academic Libraries." The Journal of Academic Librarianship 32 (2): 127–136
Blake and Knudson (2002) “Increased awareness of bibliographic/citation standards by authors. Increased submission of publications with bibliographical references reflecting the accepted standards.” Blake, Miriam E. and Frances L. Knudson. "Metadata and Reference Linking." Library Collections, Acquisitions & Technical Services 26 (3), (2002): 230.
Blake and Knudson (2002) “Increased outreach by librarians to authors emphasizing and promoting the importance of citation standards for electronic document retrieval.” Blake, Miriam E. and Frances L. Knudson. "Metadata and Reference Linking." Library Collections, Acquisitions & Technical Services 26 (3), (2002): 230.
Blake and Knudson (2002) “Increased communication between primary publishers and secondary publishers. Metadata corrections and updates need to be better coordinated.” (NISO KBART role) Blake, Miriam E. and Frances L. Knudson. "Metadata and Reference Linking." Library Collections, Acquisitions & Technical Services 26 (3), (2002): 230.
Blake and Knudson (2002) “Increased consistency in metadata within a single database and across databases. This would result in a higher success rate of linking and would allow the algorithms to be simpler. Simpler algorithms are easier to maintain and modify.” Blake, Miriam E. and Frances L. Knudson. "Metadata and Reference Linking." Library Collections, Acquisitions & Technical Services 26 (3), (2002): 230.
Hughes (2004) Hughes describes an initiative of the Open Language Archives Community (OLAC), a consortium of linguistic data archives, to create an infrastructure to support metadata quality assessment within a specialized Open Archives Initiative (OAI) community.  . Baden Hughes, Metadata Quality Evaluation: Experience from the Open Language Archives Community. 7th International Conference on Asian Digital Libraries, ICADL 2004, Shanghai, China, December 13-17, 2004. Proceedings, pp 320-329.
Hughes (2004) Metadata quality should be evaluated on a per record and per collection basis and assessed against the baseline of broader community practice. Metadata quality requires both structural and semantic validation.  . Baden Hughes, Metadata Quality Evaluation: Experience from the Open Language Archives Community. 7th International Conference on Asian Digital Libraries, ICADL 2004, Shanghai, China, December 13-17, 2004. Proceedings, pp 320-329.
Hughes (2004) Goals:  establish a baseline against which future instances can be compared;  provide assistance to data providers;  evaluate a set of domain-grounded controlled vocabularies. . Baden Hughes, Metadata Quality Evaluation: Experience from the Open Language Archives Community. 7th International Conference on Asian Digital Libraries, ICADL 2004, Shanghai, China, December 13-17, 2004. Proceedings, pp 320-329.
Hughes’ approach Each metadata record score from 0 - 10.  There are two parts, a "Code Existence Score and an Element Absence Penalty," with weighting.  The Code Existence Score is specific to the OLAC communities use of Dublin Core extensions.  The Element Absence Penalty is based on the premise that the usefullness of a given metadata decreases in the absence of core metadata fields.  The absence of a core element results in a negative 0.2 penalty. Baden Hughes, Metadata Quality Evaluation: Experience from the Open Language Archives Community. 7th International Conference on Asian Digital Libraries, ICADL 2004, Shanghai, China, December 13-17, 2004. Proceedings, pp 320-329.
Hughes’ approach From this simple approach, an array of metrics are derived:   archive diversity;  metadata quality;  core elements per record;  core element usage;  code usage;  code and element usage;  star rating. From these metrics a score is computed for each metadata record, each archive, and the community as a whole. Baden Hughes, Metadata Quality Evaluation: Experience from the Open Language Archives Community. 7th International Conference on Asian Digital Libraries, ICADL 2004, Shanghai, China, December 13-17, 2004. Proceedings, pp 320-329.
Mellon funded planning grant for L'Année philologique  1. Canonical Citation Linking: http://cwkb.org In collaboration with Eric Rebillard, Professor, Classics and History, and David Ruddy, Cornell University Library 2. OpenURL Quality Is it possible to build a tool for evaluating the quality of OpenURLs from a content provider?
Key findings from 2008 Mellon OpenURL quality investigation Hughes’ approach to metadata evaluation is excellent  scaffolding  to help build a model for OpenURL metadata evaluation, but it does not match the problem exactly.
Constant: Core elements used by content providers in their link-to targets title - 64% spage - 64% volume - 61% issue - 60% date - 48% aulast - 47% issn - 35% atitle - 35% DOI - 14% ISBN – 5% Based on an analysis of link-tos in the Cornell instance of the III WebBridge link resolver product.
Variable: Frequency of element string patterns for all sources
aulast  First author's family name. This may be more than one word. In many citations, the author's family name is recorded first and is followed by a comma, e.g. Smith, Fred James is recorded as "aulast=smith"
aulast   if ($e =~ /aulast/) {       $patterns{$neworigin}{$newsid}{$e}++;       if ($elementhash{$e} =~ /^[A-Za-z]+$/) { $patterns{$neworigin}{$newsid}{"aulast_simple"}++; } elsif ($elementhash{$e} =~ /^[A-Za-z]+, .+$/) { $patterns{$neworigin}{$newsid}{"aulast_comma"}++; } elsif ($elementhash{$e} =~ /^[A-Z][a-z]+( [A-Z])+$/) { $patterns{$neworigin}{$newsid}{"aulast_simpleplusinitial"}++;} else { $patterns{$neworigin}{$newsid}{"aulast_other"}++; }     }
aulast_other examples Ryan S Miller Louise D Bryant DAVID J MCKENZIE %C4%90okovi%C4%87 Indu B Ahluwalia Carreras-Sangr%c3%a0 Bautista-Casta%C3%B1o O%27Shea Melissa Ventura Marra Guan XueYing%3B Yu Nan%3B ShangguanXiaoXia
spage First page number of a start/end (spage-epage) pair. Note that pages are not always numeric.
spage      if ($e =~ /spage/) {       $patterns{$neworigin}{$newsid}{$e}++;       if ($elementhash{$e} =~ /^+$/) { $patterns{$neworigin}{$newsid}{"spage_number"}++; } elsif ($elementhash{$e} =~ /^+-+$/) { $patterns{$neworigin}{$newsid}{"spage_number_number"}++; } elsif ($elementhash{$e} =~ /[A-Za-z].+/) { $patterns{$neworigin}{$newsid}{"spage_string_w_number"}++; } else { $patterns{$neworigin}{$newsid}{"spage_other"}++; }     }
spage_other examples 1033 (6 pages) 85(19) 575 (11 pages) 283...290 PHYS GLRM 58,+VI
date The publication date of the item or bundle encoded in the "Complete date" variant of ISO8601 (see http://www.w3.org/TR/NOTE-datetime). This format is YYYYMM- DD where YYYY is the four-digit year, MM is the month of the year between 01 (January) and 12 (December), and DD is the day of the month between 01 and 28 or 29 or 30 or 31, depending on length of the month and whether it is a leap year.

More Related Content

What's hot

Open Annotation Collaboration Introduction
Open Annotation Collaboration IntroductionOpen Annotation Collaboration Introduction
Open Annotation Collaboration IntroductionTimothy Cole
 
Data Designed for Discovery
Data Designed for DiscoveryData Designed for Discovery
Data Designed for DiscoveryOCLC
 
LoCloud - D1.3 Content and Metadata Analysis
LoCloud - D1.3 Content and Metadata AnalysisLoCloud - D1.3 Content and Metadata Analysis
LoCloud - D1.3 Content and Metadata Analysislocloud
 
A Linked Data Prototype for the Union Catalog of Digital Archives Taiwan
A Linked Data Prototype for the Union Catalog of Digital Archives TaiwanA Linked Data Prototype for the Union Catalog of Digital Archives Taiwan
A Linked Data Prototype for the Union Catalog of Digital Archives Taiwanandrea huang
 
DBpedia Archive using Memento, Triple Pattern Fragments, and HDT
DBpedia Archive using Memento, Triple Pattern Fragments, and HDTDBpedia Archive using Memento, Triple Pattern Fragments, and HDT
DBpedia Archive using Memento, Triple Pattern Fragments, and HDTHerbert Van de Sompel
 
FAIR Signposting: A KISS Approach to a Burning Issue
FAIR Signposting: A KISS Approach to a Burning IssueFAIR Signposting: A KISS Approach to a Burning Issue
FAIR Signposting: A KISS Approach to a Burning IssueHerbert Van de Sompel
 
Visualising the Australian open data and research data landscape
Visualising the Australian open data and research data landscapeVisualising the Australian open data and research data landscape
Visualising the Australian open data and research data landscapeJonathan Yu
 
TDWG VoMaG Vocabulary management workflow, 2013-10-31
TDWG VoMaG Vocabulary management workflow, 2013-10-31TDWG VoMaG Vocabulary management workflow, 2013-10-31
TDWG VoMaG Vocabulary management workflow, 2013-10-31Dag Endresen
 

What's hot (10)

Open Annotation Collaboration Introduction
Open Annotation Collaboration IntroductionOpen Annotation Collaboration Introduction
Open Annotation Collaboration Introduction
 
A Clean Slate?
A Clean Slate?A Clean Slate?
A Clean Slate?
 
Data Designed for Discovery
Data Designed for DiscoveryData Designed for Discovery
Data Designed for Discovery
 
LoCloud - D1.3 Content and Metadata Analysis
LoCloud - D1.3 Content and Metadata AnalysisLoCloud - D1.3 Content and Metadata Analysis
LoCloud - D1.3 Content and Metadata Analysis
 
A Linked Data Prototype for the Union Catalog of Digital Archives Taiwan
A Linked Data Prototype for the Union Catalog of Digital Archives TaiwanA Linked Data Prototype for the Union Catalog of Digital Archives Taiwan
A Linked Data Prototype for the Union Catalog of Digital Archives Taiwan
 
DBpedia Archive using Memento, Triple Pattern Fragments, and HDT
DBpedia Archive using Memento, Triple Pattern Fragments, and HDTDBpedia Archive using Memento, Triple Pattern Fragments, and HDT
DBpedia Archive using Memento, Triple Pattern Fragments, and HDT
 
FAIR Signposting: A KISS Approach to a Burning Issue
FAIR Signposting: A KISS Approach to a Burning IssueFAIR Signposting: A KISS Approach to a Burning Issue
FAIR Signposting: A KISS Approach to a Burning Issue
 
Visualising the Australian open data and research data landscape
Visualising the Australian open data and research data landscapeVisualising the Australian open data and research data landscape
Visualising the Australian open data and research data landscape
 
TDWG VoMaG Vocabulary management workflow, 2013-10-31
TDWG VoMaG Vocabulary management workflow, 2013-10-31TDWG VoMaG Vocabulary management workflow, 2013-10-31
TDWG VoMaG Vocabulary management workflow, 2013-10-31
 
Creating Pockets of Persistence
Creating Pockets of PersistenceCreating Pockets of Persistence
Creating Pockets of Persistence
 

Viewers also liked

How does your media product represent particular social
How does your media product represent particular socialHow does your media product represent particular social
How does your media product represent particular sociallucymcdonnell5
 
Quesitonaire pie
Quesitonaire pieQuesitonaire pie
Quesitonaire piehalyma120
 
Five Key Principles to Embrace & Engage the Multi-Channel Consumer
Five Key Principles to Embrace & Engage the Multi-Channel ConsumerFive Key Principles to Embrace & Engage the Multi-Channel Consumer
Five Key Principles to Embrace & Engage the Multi-Channel ConsumerAdroit Digital
 
You Are My All in All
You Are My All in AllYou Are My All in All
You Are My All in Allladybag
 
BIBINGEORGETHOMAS.docx
BIBINGEORGETHOMAS.docxBIBINGEORGETHOMAS.docx
BIBINGEORGETHOMAS.docxBibin Thomas
 
Print Ad Marketing Plan
Print Ad Marketing PlanPrint Ad Marketing Plan
Print Ad Marketing Planabcd3
 
Газета Мала Батьківщина №1 (2012)
Газета Мала Батьківщина №1 (2012)Газета Мала Батьківщина №1 (2012)
Газета Мала Батьківщина №1 (2012)Sergii Illiukhin
 
少子化對大學校院的衝擊與因應之道~黃聰亮教授
少子化對大學校院的衝擊與因應之道~黃聰亮教授 少子化對大學校院的衝擊與因應之道~黃聰亮教授
少子化對大學校院的衝擊與因應之道~黃聰亮教授 vincent8899
 
考試沒教的事
考試沒教的事考試沒教的事
考試沒教的事ADAN CHEN
 
3.àrees funcionals decisions financeres
3.àrees funcionals   decisions financeres3.àrees funcionals   decisions financeres
3.àrees funcionals decisions financeresddaude
 
Conclusion ecc 2012 marc pattinson
Conclusion ecc 2012 marc pattinsonConclusion ecc 2012 marc pattinson
Conclusion ecc 2012 marc pattinsonClusterExcellence
 
Dispositivos de multimedia
Dispositivos de multimediaDispositivos de multimedia
Dispositivos de multimediasashiaisela
 
Synthesis multimedia learning
Synthesis multimedia learningSynthesis multimedia learning
Synthesis multimedia learningkylealee
 
US Energy Consumption by State as of 2005
US Energy Consumption by State  as of 2005US Energy Consumption by State  as of 2005
US Energy Consumption by State as of 2005Bruce LaCour
 

Viewers also liked (20)

Día de san valentín
Día de san valentínDía de san valentín
Día de san valentín
 
How does your media product represent particular social
How does your media product represent particular socialHow does your media product represent particular social
How does your media product represent particular social
 
Quesitonaire pie
Quesitonaire pieQuesitonaire pie
Quesitonaire pie
 
Five Key Principles to Embrace & Engage the Multi-Channel Consumer
Five Key Principles to Embrace & Engage the Multi-Channel ConsumerFive Key Principles to Embrace & Engage the Multi-Channel Consumer
Five Key Principles to Embrace & Engage the Multi-Channel Consumer
 
Self talk
Self talkSelf talk
Self talk
 
You Are My All in All
You Are My All in AllYou Are My All in All
You Are My All in All
 
BIBINGEORGETHOMAS.docx
BIBINGEORGETHOMAS.docxBIBINGEORGETHOMAS.docx
BIBINGEORGETHOMAS.docx
 
Print Ad Marketing Plan
Print Ad Marketing PlanPrint Ad Marketing Plan
Print Ad Marketing Plan
 
Газета Мала Батьківщина №1 (2012)
Газета Мала Батьківщина №1 (2012)Газета Мала Батьківщина №1 (2012)
Газета Мала Батьківщина №1 (2012)
 
T. suchman plenary friday & saturday building a culture
T. suchman plenary friday & saturday building a cultureT. suchman plenary friday & saturday building a culture
T. suchman plenary friday & saturday building a culture
 
Roses
RosesRoses
Roses
 
少子化對大學校院的衝擊與因應之道~黃聰亮教授
少子化對大學校院的衝擊與因應之道~黃聰亮教授 少子化對大學校院的衝擊與因應之道~黃聰亮教授
少子化對大學校院的衝擊與因應之道~黃聰亮教授
 
Bautista - PICARD 2011 Presentation
Bautista - PICARD 2011 PresentationBautista - PICARD 2011 Presentation
Bautista - PICARD 2011 Presentation
 
UIC Thesis Cancare
UIC Thesis CancareUIC Thesis Cancare
UIC Thesis Cancare
 
考試沒教的事
考試沒教的事考試沒教的事
考試沒教的事
 
3.àrees funcionals decisions financeres
3.àrees funcionals   decisions financeres3.àrees funcionals   decisions financeres
3.àrees funcionals decisions financeres
 
Conclusion ecc 2012 marc pattinson
Conclusion ecc 2012 marc pattinsonConclusion ecc 2012 marc pattinson
Conclusion ecc 2012 marc pattinson
 
Dispositivos de multimedia
Dispositivos de multimediaDispositivos de multimedia
Dispositivos de multimedia
 
Synthesis multimedia learning
Synthesis multimedia learningSynthesis multimedia learning
Synthesis multimedia learning
 
US Energy Consumption by State as of 2005
US Energy Consumption by State  as of 2005US Energy Consumption by State  as of 2005
US Energy Consumption by State as of 2005
 

Similar to A demonstration of transparent and scalable OpenURL quality metrics for use in promoting metadata consistency across content providers

The Rhetoric of Research Objects
The Rhetoric of Research ObjectsThe Rhetoric of Research Objects
The Rhetoric of Research ObjectsCarole Goble
 
VRA_2015_CatalogingRoundup_Seneff
VRA_2015_CatalogingRoundup_SeneffVRA_2015_CatalogingRoundup_Seneff
VRA_2015_CatalogingRoundup_SeneffHeather Seneff
 
Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Lucy McKenna
 
Authors' and Publications' Citations knowledge base
Authors' and Publications' Citations knowledge base Authors' and Publications' Citations knowledge base
Authors' and Publications' Citations knowledge base Leila Zemmouchi-Ghomari
 
Current metadata landscape in the library world Getaneh Alemu
Current metadata landscape in the library world Getaneh AlemuCurrent metadata landscape in the library world Getaneh Alemu
Current metadata landscape in the library world Getaneh AlemuGetaneh Alemu
 
Closing the scientific literature access gap with CORE - how to gain free acc...
Closing the scientific literature access gap with CORE - how to gain free acc...Closing the scientific literature access gap with CORE - how to gain free acc...
Closing the scientific literature access gap with CORE - how to gain free acc...Nancy Pontika
 
Ifla swsig meeting - Puerto Rico - 20110817
Ifla swsig meeting - Puerto Rico - 20110817Ifla swsig meeting - Puerto Rico - 20110817
Ifla swsig meeting - Puerto Rico - 20110817Figoblog
 
re3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositoriesre3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data RepositoriesHeinz Pampel
 
Descubrimiento, entrega de información y gestión: tendencias actuales de las ...
Descubrimiento, entrega de información y gestión: tendencias actuales de las ...Descubrimiento, entrega de información y gestión: tendencias actuales de las ...
Descubrimiento, entrega de información y gestión: tendencias actuales de las ...innovatics
 
OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...
OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...
OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...Open Science Fair
 
HIBERLINK: Reference Rot and Linked Data: Threat and Remedy
HIBERLINK: Reference Rot and Linked Data: Threat and RemedyHIBERLINK: Reference Rot and Linked Data: Threat and Remedy
HIBERLINK: Reference Rot and Linked Data: Threat and RemedyPRELIDA Project
 
RDA implementation: the new cataloguing standard in Europe - Dilyana Ducheva
 RDA implementation: the new cataloguing standard in Europe - Dilyana Ducheva RDA implementation: the new cataloguing standard in Europe - Dilyana Ducheva
RDA implementation: the new cataloguing standard in Europe - Dilyana DuchevaLISDISConference
 
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...Foundations to Actions: Extending Innovations to Digital Libraries in Partner...
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...Trish Rose-Sandler
 
eResources in Academic Libraries
eResources in Academic LibrarieseResources in Academic Libraries
eResources in Academic Librariesottumtk
 

Similar to A demonstration of transparent and scalable OpenURL quality metrics for use in promoting metadata consistency across content providers (20)

The Rhetoric of Research Objects
The Rhetoric of Research ObjectsThe Rhetoric of Research Objects
The Rhetoric of Research Objects
 
Ji cv6n2
Ji cv6n2Ji cv6n2
Ji cv6n2
 
VRA_2015_CatalogingRoundup_Seneff
VRA_2015_CatalogingRoundup_SeneffVRA_2015_CatalogingRoundup_Seneff
VRA_2015_CatalogingRoundup_Seneff
 
Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...
 
"In the Early Days of a Better Nation": Enhancing the power of metadata today...
"In the Early Days of a Better Nation": Enhancing the power of metadata today..."In the Early Days of a Better Nation": Enhancing the power of metadata today...
"In the Early Days of a Better Nation": Enhancing the power of metadata today...
 
Authors' and Publications' Citations knowledge base
Authors' and Publications' Citations knowledge base Authors' and Publications' Citations knowledge base
Authors' and Publications' Citations knowledge base
 
FAIRy Stories
FAIRy StoriesFAIRy Stories
FAIRy Stories
 
Current metadata landscape in the library world Getaneh Alemu
Current metadata landscape in the library world Getaneh AlemuCurrent metadata landscape in the library world Getaneh Alemu
Current metadata landscape in the library world Getaneh Alemu
 
Closing the scientific literature access gap with CORE - how to gain free acc...
Closing the scientific literature access gap with CORE - how to gain free acc...Closing the scientific literature access gap with CORE - how to gain free acc...
Closing the scientific literature access gap with CORE - how to gain free acc...
 
2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery
 
Ifla swsig meeting - Puerto Rico - 20110817
Ifla swsig meeting - Puerto Rico - 20110817Ifla swsig meeting - Puerto Rico - 20110817
Ifla swsig meeting - Puerto Rico - 20110817
 
re3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositoriesre3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositories
 
Scholze imcw 2014-11-25
Scholze imcw 2014-11-25Scholze imcw 2014-11-25
Scholze imcw 2014-11-25
 
Full Erdmann Ruttenberg Community Approaches to Open Data at Scale
Full Erdmann Ruttenberg Community Approaches to Open Data at ScaleFull Erdmann Ruttenberg Community Approaches to Open Data at Scale
Full Erdmann Ruttenberg Community Approaches to Open Data at Scale
 
Descubrimiento, entrega de información y gestión: tendencias actuales de las ...
Descubrimiento, entrega de información y gestión: tendencias actuales de las ...Descubrimiento, entrega de información y gestión: tendencias actuales de las ...
Descubrimiento, entrega de información y gestión: tendencias actuales de las ...
 
OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...
OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...
OSFair2017 Workshop | Building a global knowledge commons - ramping up reposi...
 
HIBERLINK: Reference Rot and Linked Data: Threat and Remedy
HIBERLINK: Reference Rot and Linked Data: Threat and RemedyHIBERLINK: Reference Rot and Linked Data: Threat and Remedy
HIBERLINK: Reference Rot and Linked Data: Threat and Remedy
 
RDA implementation: the new cataloguing standard in Europe - Dilyana Ducheva
 RDA implementation: the new cataloguing standard in Europe - Dilyana Ducheva RDA implementation: the new cataloguing standard in Europe - Dilyana Ducheva
RDA implementation: the new cataloguing standard in Europe - Dilyana Ducheva
 
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...Foundations to Actions: Extending Innovations to Digital Libraries in Partner...
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...
 
eResources in Academic Libraries
eResources in Academic LibrarieseResources in Academic Libraries
eResources in Academic Libraries
 

Recently uploaded

How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 

Recently uploaded (20)

Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 

A demonstration of transparent and scalable OpenURL quality metrics for use in promoting metadata consistency across content providers

  • 1. A demonstration of transparent and scalable OpenURL quality metrics for use in promoting metadata consistency across content providers Adam Chandler Cornell University Library Cornell University Library, Metadata Working Group Forum 16 October 2009
  • 3. OpenURL model cont. incoming OpenURL http://linkresolver.library.cornell.edu:4550/resserv?&url_ver=z39.88-2004&url_ctx_fmt=info:ofi/fmt:kev:mtx:ctx&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=item-level+usage+statistics+a+review+of+current+practices+and+recommendations+for+normalization+and+exchange&rft.auinit=c&rft.aulast=merk&rft.date=2009&rft.epage=162&rft.genre=article&rft.issn=0737-8831&rft.issue=1&rft.place=bingley&rft.pub=emerald+group+publishing+limited&rft.spage=151&rft.stitle=libr+hi+tech&rft.title=library+hi+tech&rft.volume=27&rfr_id=info:sid/www.isinet.com:wok:wos&rft.au=scholze,+f&rft.au=windisch,+n&rft_id=info:doi/10.1108%2f07378830910942991/ in our knowledge base? title: Library hi tech issn: 0737-8831 start date: 19970101 end date: link-to syntax for Emerald http://www.emeraldinsight.com/rpsv/cgi-bin/cgi?body=linker&reqidx=#@ISSN-HYPHEN#(#@DATE#)#@VOLUME#:#@ISSUE#L.#@SPAGE#
  • 4. OpenURL is pervasive Cornell link resolver alone: July 1, 2008 – June 30, 2009: 402,000 OpenURL service requests. 402,000 * 123(ARL libraries) = 49 million
  • 5. Cornell’s top 10 OpenURL sources Web of Knowledge WorldCat Local Google Scholar Webfeat (our “Find Articles” service) EBSCOHost OCLC FirstSearch SilverPlatter Weill Cornell Medical Center SciFinder Scholar PubMed
  • 7. example OpenURL (1) http://linkresolver.library.cornell.edu:4550/resserv?&url_ver=z39.88-2004 &url_ctx_fmt=info:ofi/fmt:kev:mtx:ctx &rft_val_fmt=info:ofi/fmt:kev:mtx:journal &rft.atitle=item-level+usage+statistics+a+review+of+current+practices+and+recommendations+for+normalization+and+exchange &rft.auinit=c &rft.aulast=merk &rft.date=2009 &rft.epage=162 &rft.genre=article &rft.issn=0737-8831
  • 8. example OpenURL (2) &rft.issue=1 &rft.place=bingley &rft.pub=emerald+group+publishing+limited &rft.spage=151 &rft.stitle=libr+hi+tech &rft.title=library+hi+tech &rft.volume=27 &rfr_id=info:sid/www.isinet.com:wok:wos &rft.au=scholze,+f &rft.au=windisch,+n &rft_id=info:doi/10.1108%2f07378830910942991/
  • 9. … but quality of experience is difficult to benchmark Wrong start end date in the local library's holdings knowledge base (see NISO KBART) Semantically inaccurate metadata from the OpenURL origin (wrong ISSN, for example) Wrong link-to syntax in link resolver Fragile handling of incoming links by content provider
  • 10. … but quality of experience is difficult to benchmark Inaccurate or missing Crossref DOI URL (sometimes the DOI registration process is out of sync with the mounting of articles) Subscription errors (especially with the start of a new calendar year) Syntactically incorrect or missing metadata from the OpenURL origin
  • 11. Literature review I can identify no systematic study designed and carried out to benchmark the quality of linking. The OpenURL standard was introduced some ten years ago.
  • 12. Wakimoto, Walker, and Dabbour (2006) Main finding: Users just expect full-text. When they do not get it they are disappointed. Jina Choi Wakimoto, David S. Walker, and Katherine S. Dabbour (2006). "The Myths and Realities of SFX in Academic Libraries." The Journal of Academic Librarianship 32 (2): 127–136
  • 13. Wakimoto, Walker, and Dabbour (2006) "Where does SFX start and where does it end? If an SFX request does not result in a full-text link, does the problem lie with the source database’s metadata, the construction of the OpenURL request, the SFX KnowledgeBase, the SFX software, the resulting target resource, or even the local library’s collection development plan?" (p. 134) Jina Choi Wakimoto, David S. Walker, and Katherine S. Dabbour (2006). "The Myths and Realities of SFX in Academic Libraries." The Journal of Academic Librarianship 32 (2): 127–136
  • 14. Blake and Knudson (2002) “Increased awareness of bibliographic/citation standards by authors. Increased submission of publications with bibliographical references reflecting the accepted standards.” Blake, Miriam E. and Frances L. Knudson. "Metadata and Reference Linking." Library Collections, Acquisitions & Technical Services 26 (3), (2002): 230.
  • 15. Blake and Knudson (2002) “Increased outreach by librarians to authors emphasizing and promoting the importance of citation standards for electronic document retrieval.” Blake, Miriam E. and Frances L. Knudson. "Metadata and Reference Linking." Library Collections, Acquisitions & Technical Services 26 (3), (2002): 230.
  • 16. Blake and Knudson (2002) “Increased communication between primary publishers and secondary publishers. Metadata corrections and updates need to be better coordinated.” (NISO KBART role) Blake, Miriam E. and Frances L. Knudson. "Metadata and Reference Linking." Library Collections, Acquisitions & Technical Services 26 (3), (2002): 230.
  • 17. Blake and Knudson (2002) “Increased consistency in metadata within a single database and across databases. This would result in a higher success rate of linking and would allow the algorithms to be simpler. Simpler algorithms are easier to maintain and modify.” Blake, Miriam E. and Frances L. Knudson. "Metadata and Reference Linking." Library Collections, Acquisitions & Technical Services 26 (3), (2002): 230.
  • 18. Hughes (2004) Hughes describes an initiative of the Open Language Archives Community (OLAC), a consortium of linguistic data archives, to create an infrastructure to support metadata quality assessment within a specialized Open Archives Initiative (OAI) community. . Baden Hughes, Metadata Quality Evaluation: Experience from the Open Language Archives Community. 7th International Conference on Asian Digital Libraries, ICADL 2004, Shanghai, China, December 13-17, 2004. Proceedings, pp 320-329.
  • 19. Hughes (2004) Metadata quality should be evaluated on a per record and per collection basis and assessed against the baseline of broader community practice. Metadata quality requires both structural and semantic validation. . Baden Hughes, Metadata Quality Evaluation: Experience from the Open Language Archives Community. 7th International Conference on Asian Digital Libraries, ICADL 2004, Shanghai, China, December 13-17, 2004. Proceedings, pp 320-329.
  • 20. Hughes (2004) Goals: establish a baseline against which future instances can be compared; provide assistance to data providers; evaluate a set of domain-grounded controlled vocabularies. . Baden Hughes, Metadata Quality Evaluation: Experience from the Open Language Archives Community. 7th International Conference on Asian Digital Libraries, ICADL 2004, Shanghai, China, December 13-17, 2004. Proceedings, pp 320-329.
  • 21. Hughes’ approach Each metadata record score from 0 - 10. There are two parts, a "Code Existence Score and an Element Absence Penalty," with weighting. The Code Existence Score is specific to the OLAC communities use of Dublin Core extensions. The Element Absence Penalty is based on the premise that the usefullness of a given metadata decreases in the absence of core metadata fields. The absence of a core element results in a negative 0.2 penalty. Baden Hughes, Metadata Quality Evaluation: Experience from the Open Language Archives Community. 7th International Conference on Asian Digital Libraries, ICADL 2004, Shanghai, China, December 13-17, 2004. Proceedings, pp 320-329.
  • 22. Hughes’ approach From this simple approach, an array of metrics are derived: archive diversity; metadata quality; core elements per record; core element usage; code usage; code and element usage; star rating. From these metrics a score is computed for each metadata record, each archive, and the community as a whole. Baden Hughes, Metadata Quality Evaluation: Experience from the Open Language Archives Community. 7th International Conference on Asian Digital Libraries, ICADL 2004, Shanghai, China, December 13-17, 2004. Proceedings, pp 320-329.
  • 23. Mellon funded planning grant for L'Année philologique 1. Canonical Citation Linking: http://cwkb.org In collaboration with Eric Rebillard, Professor, Classics and History, and David Ruddy, Cornell University Library 2. OpenURL Quality Is it possible to build a tool for evaluating the quality of OpenURLs from a content provider?
  • 24. Key findings from 2008 Mellon OpenURL quality investigation Hughes’ approach to metadata evaluation is excellent scaffolding to help build a model for OpenURL metadata evaluation, but it does not match the problem exactly.
  • 25. Constant: Core elements used by content providers in their link-to targets title - 64% spage - 64% volume - 61% issue - 60% date - 48% aulast - 47% issn - 35% atitle - 35% DOI - 14% ISBN – 5% Based on an analysis of link-tos in the Cornell instance of the III WebBridge link resolver product.
  • 26. Variable: Frequency of element string patterns for all sources
  • 27. aulast First author's family name. This may be more than one word. In many citations, the author's family name is recorded first and is followed by a comma, e.g. Smith, Fred James is recorded as "aulast=smith"
  • 28. aulast if ($e =~ /aulast/) { $patterns{$neworigin}{$newsid}{$e}++; if ($elementhash{$e} =~ /^[A-Za-z]+$/) { $patterns{$neworigin}{$newsid}{"aulast_simple"}++; } elsif ($elementhash{$e} =~ /^[A-Za-z]+, .+$/) { $patterns{$neworigin}{$newsid}{"aulast_comma"}++; } elsif ($elementhash{$e} =~ /^[A-Z][a-z]+( [A-Z])+$/) { $patterns{$neworigin}{$newsid}{"aulast_simpleplusinitial"}++;} else { $patterns{$neworigin}{$newsid}{"aulast_other"}++; } }
  • 29. aulast_other examples Ryan S Miller Louise D Bryant DAVID J MCKENZIE %C4%90okovi%C4%87 Indu B Ahluwalia Carreras-Sangr%c3%a0 Bautista-Casta%C3%B1o O%27Shea Melissa Ventura Marra Guan XueYing%3B Yu Nan%3B ShangguanXiaoXia
  • 30. spage First page number of a start/end (spage-epage) pair. Note that pages are not always numeric.
  • 31. spage if ($e =~ /spage/) { $patterns{$neworigin}{$newsid}{$e}++; if ($elementhash{$e} =~ /^+$/) { $patterns{$neworigin}{$newsid}{"spage_number"}++; } elsif ($elementhash{$e} =~ /^+-+$/) { $patterns{$neworigin}{$newsid}{"spage_number_number"}++; } elsif ($elementhash{$e} =~ /[A-Za-z].+/) { $patterns{$neworigin}{$newsid}{"spage_string_w_number"}++; } else { $patterns{$neworigin}{$newsid}{"spage_other"}++; } }
  • 32. spage_other examples 1033 (6 pages) 85(19) 575 (11 pages) 283...290 PHYS GLRM 58,+VI
  • 33. date The publication date of the item or bundle encoded in the "Complete date" variant of ISO8601 (see http://www.w3.org/TR/NOTE-datetime). This format is YYYYMM- DD where YYYY is the four-digit year, MM is the month of the year between 01 (January) and 12 (December), and DD is the day of the month between 01 and 28 or 29 or 30 or 31, depending on length of the month and whether it is a leap year.
  • 34. date if ($e =~ /date/) { $patterns{$neworigin}{$newsid}{$e}++; if ($elementhash{$e} =~ /^{4}$/) { $patterns{$neworigin}{$newsid}{"date_dddd"}++; } elsif ($elementhash{$e} =~ /^{4}-{2}$/) { $patterns{$neworigin}{$newsid}{"date_dddd-dd"}++; } elsif ($elementhash{$e} =~ /^{4}-{2}-{2}$/) { $patterns{$neworigin}{$newsid}{"date_dddd-dd-dd"}++; } elsif ($elementhash{$e} =~ /^{4}-{4}$/) { $patterns{$neworigin}{$newsid}{"date_dddd-dddd"}++; } elsif ($elementhash{$e} =~ /^{8}$/) { $patterns{$neworigin}{$newsid}{"date_dddddddd"}++; } else {$patterns{$neworigin}{$newsid}{"date_dateother"}++; } }
  • 35. date_other examples 1956 July %7E1994 June 5%2C 2002 JUN 30 05 2006%282007%29 1922,+April+25th %5B%5B1943-06-19%5D%5D
  • 36. issn International Standard Serials Number (ISSN). The issn may contain a hyphen, e.g. "1041-5653"
  • 37. issn if ($e =~ /issn/) { $patterns{$neworigin}{$newsid}{$e}++; if ($elementhash{$e} =~ /^{4}-{3}./) { $patterns{$neworigin}{$newsid}{"issn_number_number"}++; } elsif ($elementhash{$e} =~ /^{7}./) { $patterns{$neworigin}{$newsid}{"issn_number"}++; } else { $patterns{$neworigin}{$newsid}{"issn_other"}++; } }
  • 38. issn_other examples 0065-2598%28print%29 0018-5345+%28ISSN+print%29 ISSN ISBN 0-9525091-5-6. 0021-8375%28print%29%7C1439-0361%28electronic%29 1471-2164+%28ISSN+online%29 0191-8699%3B0191-8699 0741-8329 (Print)%3B NLM Unique Journal Identifier%3A 8502311
  • 39. How often out of 402,000 Cornell OpenURLs?
  • 40. flat file output logsourceyear quarter origin sid metric count cornell 2009 Q1 csacsa:commabs-set-c atitle 154 cornell 2009 Q1 csacsa:commabs-set-c atitle_colon 101 cornell 2009 Q1 csacsa:commabs-set-c atitle_other 53 cornell 2009 Q1 csacsa:commabs-set-c aulast 159 cornell 2009 Q1 csacsa:commabs-set-c aulast_other 4 cornell 2009 Q1 csacsa:commabs-set-c aulast_simple 155 cornell 2009 Q1 csacsa:commabs-set-c date 159 cornell 2009 Q1 csacsa:commabs-set-c date_dddd 110 cornell 2009 Q1 csacsa:commabs-set-c date_dddd-dd 49 cornell 2009 Q1 csacsa:commabs-set-c isbn 6 cornell 2009 Q1 csacsa:commabs-set-c isbn_10 6 cornell 2009 Q1 csacsa:commabs-set-c issn 135 cornell 2009 Q1 csacsa:commabs-set-c issn_number-number 135 cornell 2009 Q1 csacsa:commabs-set-c issue 136 cornell 2009 Q1 csacsa:commabs-set-c issue_number 132 cornell 2009 Q1 csacsa:commabs-set-c issue_number_dash_number2 cornell 2009 Q1 csacsa:commabs-set-c issue_other 2 cornell 2009 Q1 csacsa:commabs-set-c spage 153 cornell 2009 Q1 csacsa:commabs-set-c spage_number 153 cornell 2009 Q1 csacsa:commabs-set-c title 160 cornell 2009 Q1 csacsa:commabs-set-c total 160 cornell 2009 Q1 csacsa:commabs-set-c volume 139 cornell 2009 Q1 csacsa:commabs-set-c volume_number 139
  • 42. Next steps create a NISO structure to wrap around the metrics: “NISO OpenURL Quality Index” add non-Cornell data from libraries and link resolver vendors (model is agnostic to source) confirm and publicize key elements used by target syntaxes can the quality of the global OpenURL network be modeled mathematically?
  • 43. How to stay in the loop http://openurlquality.blogspot.com/ Adam ChandlerDatabase Management and Electronic Resources Research LibrarianCentral Library OperationsCornell University Librarytel: 607-255-5760email: alc28@cornell.edu