This document discusses the Biodiversity Heritage Library's (BHL) efforts to create a citation repository that would allow users to search and access articles from the BHL. It provides a brief history of related projects like CiteBank. It describes the current capabilities and limitations of accessing citations and full text articles through the BHL. It outlines the next steps needed to fully integrate citations and articles into the BHL by expanding the data model, developing interfaces for adding metadata, and changing how citations and articles are displayed. The goal is to support the Global Names Architecture by facilitating access to taxonomic literature.
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Biodiversity Heritage Library and bibliographic citations: towards new user services in support of a Global Names Architecture
1. The Biodiversity Heritage Library and bibliographic citations:
towards new user services in support of a Global Names
Architecture
Trish Rose-Sandler, William Ulate
Missouri Botanical Garden
2012 Annual TDWG Meeting
Beijing, China
Oct.21,2012
5. Genesis: “BHL Article Repository”
• Idea first introduced at TDWG 2008, Fremantle
(by BHL, many have discussed for years)
• YouTube for biodiversity articles
• Needed (need) a way to access articles in BHL
– “BHL has no articles.”
– BHL has hundreds of thousands of articles but you
can’t search for them via author, article title search
– Can find via “article coordinates” using BHL’s UI &
OpenURL resolver: Journal / Volume / Start Page / Year
6. CiteBank
• Objectives
– Create a repository for community-vetted
taxonomic bibliographies.
– Ability to ingest, display, download, and index
articles so that the BHL can operate as an article
repository.
– Provide links to content published online through
other repositories.
• Launched on December 6th 2010
9. Software Tools
Specimen Open Access
Databases Digital Libraries
Nomenclators
Commercial
Aggregators
Open Access
Publishers
Indices
International Collaborative Projects
10. Lessons Learned
• Biblio/Drupal data model insufficient for mass of data
envisioned for all biodiversity, too flat and difficult to
expand in collaboration with Biblio development
community
• Data providers want their content findable and
managed in the Biodiversity Heritage Library, not a
system alongside BHL
• Maintaining two platforms for biodiversity literature
threatens sustainability of the literature resources over
the longer term
13. Support citation reconciliation
.
.Linneaus, C. Species Plantarum, vol. 2 p. 971. 1753
.Linné, Carl von. Sp. Pl. Vol. 2 Page 971. 1753
.
Caroli Linnaei, Species Plantarum exhibentes plantas rite cognitas, ad genera
.relatas, cum Differentis Specificis, Nominibus Trivialibus,2:971. 1753 Selectis,
Locis Natalibus, secundum SYSTEMA SEXUALE digestas..
Synonymis
.L. Sp. Pl. 2: 971. 1753
.
Zea mays
14. Where are we?
• Articles
– Extend BHL data model to store article metadata
– Build process to harvest data from BioStor
• Create user interfaces for adding article metadata and
associated files
– Define functional requirements as improvements to
Drupal-based Citebank
– Define process flow for adding article metadata and
associated files
– Implement UI changes
• Change BHL UI to accommodate article search
• Change BHL UI to accommodate article display (TOC)
15. What is needed for a citation repository?
Admin. Interface
– IMPORT AND MAPPING TOOL
• Preview/Accept/Reject/Undo/Report on Import
• No standard schema, MODS or Bibtex
• Drag & drop GUI or mapped source and target field config.
– USER MANAGEMENT
• Self-Registration
• Admin. Approval & Deletion
• User Roles Assignment
– GLOBAL UPDATES
16. What is needed for a citation repository?
General User Interface
– IMPORT
• Upload/Preview/Accept/Reject/Undo/Report on Import
– CREATE CITATION
• By filling a Form, via BibTex
– BROWSE
• Faceted: title,author,subject, year, contributor, my citations
17. What is needed for a citation repository?
• CITATION TYPES
– Journal Article, Book Chapter, Conference Proceedings,
Conference Paper, Thesis, Government Report, Note, etc.
• OAI HARVESTING
– Harvest and serve data through OAI-PMH
• SPECIFICATIONS FOR DATA PROVIDERS PAGE
• CONTRIBUTORS PAGE
– Recognize ALL contributions
• REPORTING
– Statistics Page by Citation and Publication type
– Recent/Latest Uploads
18. Where are we going?
• Integrate BHL’s Services with ZooBank
• Authoritative list of titles in common use for
nomenclatural acts (“TL3”)
• Harvest relevant content from Mendeley
• Integrate services and interfaces with the GNUB
data model
• Interoperate with citation parsing tools & services
19. 2012 Annual TDWG Meeting
Beijing, China
October 21st, 2012
Thank you
Trish.Rose-Sandler@mobot.org William.Ulate@mobot.org
BHL Data Analyst Global BHL Project Manager
Art of Life Project Coordinator BHL Technical Director
Missouri Botanical Garden Missouri Botanical Garden
Editor's Notes
The data model and user interface for the Biodiversity Heritage Library (BHL) portal at http://www.biodiversitylibrary.org/ was originally designed to accommodate books and journals found in botanical garden libraries and natural history museums. As the size and reputation of the BHL grew, there were many publishers and individuals who wanted to contribute to the BHL but their content consisted of publication types at more granular levels, such as articles, book chapters, and dissertations. In order to ingest and serve these materials, in early 2011, BHL launched a separate portal called Citebank hosted at citebank.org. Currently, Citebank contains over 180,000 citations linked to content files, either hosted at citebank.org or hosted externally. While feedback on Citebank has been positive, users indicated a desire to combine both the services of the BHL portal and the services of the Citebank portal into a single interface in order to enable a unified search for all biodiversity literature. To respond to these needs, the BHL has begun expansion of its data model in the BHL portal to accommodate articles, book chapters, treatments and other segment-like material so that they can be searched alongside its traditional book and journal content. Parallel to this activity the NSF-funded Global Names Architecture (GNA) Project has enlisted Citebank to fulfill the role of a global biodiversity repository for bibliographic citations. In support of this, Citebank will provide a key functional component to the GNA - that of reconciliation services for citations. Once reconciled, citations can be linked either to scanned page images in the BHL, or to PDFs uploaded by users. If neither exists, citations can point to other digital representations online. Experience with Citebank has resulted in many lessons learned about working with diverse publication types; data formats; and contributors with varying levels of technical competencies. Those lessons were incorporated into a functional requirements document that is being used to inform development of the BHL data model. This talk will outline the functional requirements needed for a global citation repository for biodiversity and how those requirements will better serve the needs of the biodiversity community.
[PortalUser Interface]
[Book Viewer Interface]
[Citebank homepage]
[Citebank homepage]
[Citebank stats]
[World in which CiteBank lives]
[Citations in BHL and Sustainability Considerations]