SlideShare a Scribd company logo
1 of 73
Download to read offline
Changing Data
Implementing Primo for the Tri-Universities Group (TUG)
Presentation at ELUNA
May, 2009
Alison Hitchens
Cataloguing & Metadata Librarian
Outline
 Background
 Loading data into Primo
 Normalization
 Testing
Where are we?
 Formed in 1995
 Shared resources and collaboration including:
 Shared storage facility
 Shared integrated library system (ILS)
 Reciprocal borrowing
 Document delivery
 Statistics portal
 Shared databases
 Collaborative functional committees
 Shared ILS and Catalogue
 TRELLIS (Voyager)
 No significant changes to interface in 10 years
 Search is limited to catalogue data
 One place to search
 Potential to include a variety of datasets:
 Library catalogue (currently loaded into Primo)
 Articles
 GIS information
 Our Ontario image bank
 Local repositories
 Deep search
Primo Advantages
 User-friendly interface
 XML compliant
 Avoids duplication of search results
 Groups together different editions of the
same work (FRBR)
 Interoperability with existing tools
Primo Advantages
The Primo Team
The Primo Team
 Team created in late January 2008
 Training held in late March 2008
 Primo Alpha launched to staff in July 2008
 Primo Beta launched to TUG community in
November 2008
 Goal: make Primo the primary search tool in
late May 2009
Phase Two
 Usability testing
 Naming & branding
 New data sources
 New books list
 Fine-tuning functionality
 Deep search
The Primo Team
Loading Data Into Primo
MARC
Loading Data Into Primo
MARC
MARC
XML
Extract
MARC XML
MARC XML
Loading Data Into Primo
MARC
MARC
XML
PNX (Primo
Normalized XML)
Extract
Normalization
PNX: Display area
PNX: Search area
PNX: Deduplication area
Loading Data Into Primo
MARC
MARC
XML
PNX (Primo
Normalized XML)
Deduplication, FRBR,
Didumean, Indexing
Front End
(user interface)
Extract
Normalization
Deduplication
Deduplication
Normalization: what is it?
 Massaging data
 Rules that tell the program how to get from MARC
XML to Primo Normalized XML (PNX)
 Filter that distributes the incoming data and places it in
different sections
 What MARC tags hold the title
 What MARC codes show the format
 What data should be included in searches
 What data should be available for display
 Transformation rules
 How that data should be formatted (dates, punctuation,
capitalization, etc.)
Normalization: what is it?
 Customization
 Fixing “bad” data
 Complex changes
 Consortial issues
 Lessons learned the hard way
Customization
 Search fields
 Created call number search
 Augmented title search with contents note (505 tag)
 Display fields
 Added subject tag used for slide collection subjects (654 tag)
 Added explanatory text in front of analytical titles
 FRBR
 Excluded “selections”
 Facets
 Used location names as collection facets
Adding contents note to title
search
Adding contents note to title
search
Adding contents note to title
search
Fixing “bad” data
 Old records lacking proper indicators
 Main author (100 tag) with invalid indicators (1st
indicator blank or |)
 Old records lacking subfield coding
 Uniform title (240 tag) missing subfields ($k)
 ISBNS with hyphens
 008 with invalid data in first 6 characters
 Blanks or letters instead of record creation date
008 Workaround
008 Workaround
008 Workaround
008 Workaround
Complex changes
 Tweaking delivery of online journals
 Delivery using SFX
 Exclude serials that no longer have an online holding
but record still coded as online
 Exclude government serials
 Exclude public microdata files
 Exclude databases (integrating resources)
Tweaking delivery of online
journals
Tweaking delivery of online
journals
Tweaking delivery of online
journals
Consortial issues
 Restricting online resources to individual institutions
 Which URL should be presented?
 Should restrictions be presented?
 Coping with shared locations
 e.g. GWINTER = Internet resource shared by Guelph and
Waterloo but not Wilfrid Laurier
 Instead of 2 separate locations UGINTER and UWINTER
 Creating search scopes for colleges and campuses
 e.g. ability to limit search to Architecture materials
Restricting Online Resources
Restricting Online Resources
Restricting Online Resources
 Problem 1:
 Which link belongs to which institution?
 Otherwise will simply present the first URL in the
record
 Need to add $$I based on ownership
 Location code isn’t extracted with the 856
 Problem 2:
 Restricting the each link to each institution
 Otherwise will give Online access message to users
who do not have access
 Need to add restricted delivery scope
Specify Institution for Online
Resources
Restricting Online Resources
Restricting Online Resources
Restricting Online Resources
Limiting searches to a location
Limiting searches to a location
Limiting searches to a location
Lessons learned the hard way
 If checking that a tag
exists, need to also
include subfields
Lessons learned the hard way
 If writing more than one
value, need more than
one rule
Result: $$I GUELPH
Lessons learned the hard way
Match current, match any?
Problem: includes title
from 245, author from 100,
publisher from 260 etc.
Match current, match any?
Match any
If any of the 880 tags have $6 505 then copy the 880
tag as is.
This means that if any of the 880s tag meet this
requirement, it will copy all of the 880 tags.
Match current: just analyse them one at a time and
only copy the one that meets the condition
Testing
 Staging database
 What am I testing?
 Testing sample records
 Testing the process
 Random testing
Test specific changes
 Changes to normalization rules
 Changes to front end display
 Changes to tables (e.g. new location codes)
 New release enhancements/bug fixes
Look for:
 What you were expecting
 Note any surprises!
Staging database
 Holds 200,000 records
 Random sample of our collection
 100 titles from each location code
 Random sample proportionate to records held by
each institution
 Combination of old pre-TRELLIS records and
newly created records
 Shakespeare call number range to test grouping
of editions (FRBR)
Test records
 100 records for repeated testing in front end
 Brief records (acquisitions, e-reserves, CODOC)
 Different formats (micro, music, video, electronic)
 Things to test holdings info (acc. material, multi-volume,
multiple items, multiple locations)
 Foreign language materials
 Duplicates
 Editions
 10 records
 For immediate test of normalization rules in back office
Test specific functionality
 Fulfillment cycle
 When the user finds the item that he wants, can
he actually get the item based on:
 The information presented in the results screen
 The information presented in the full display
 The linking provided
 The information presented in the holdings display
Testing: online resources
 What am I testing?
(what do I want to happen)
 Is online availability showing correctly in relation
to the user?
 Online access
 Online access is restricted
 Physical resource
Testing: online resources
 What am I testing?
(what do I want to happen)
 Does user receive a relevant link?
 SFX delivery
 Direct to resource
 Link appropriate to institution
Testing: online resources
 What am I testing?
(what do I want to happen)
 Can user view alternate/multiple links in the full
record display?
Testing: Online resources
 Variables for sample records
 E-journals, e-books, e-data, databases
 SFX delivery, online delivery
 Multi-volume sets
 Restricted to one institution
 Different institutions, different providers
 Online for one institution, physical for another
Testing: online resources
 Instructions for testers:
 Each tester should check the list of records and
verify that there is an online link
 For each record test that the link label is correct:
 Available online: check against TRELLIS to verify that
your institution has an online holding
 Online access is restricted: check against TRELLIS to
verify that your institution does NOT have an online
holding
 Link takes you to the correct place
Testing: online resources
 Random sampling
 Each tester should also do a search on a subject
of their choice and verify links using the first page
of results
Testing environment
 Test in all views
 Waterloo, Laurier, Guelph
 Test in different IP ranges
 Test off campus
General testing
Testing: feedback from users
Overall, I am VERY
impressed with Primo. It
is far more functional in
many ways.
When I find an online
journal, the Click here
for access link does
not work
I wonder if the alerts by
email are working?
Let me login using
my UWDIR id
Changing Data
 Thank you!
Alison Hitchens
Cataloguing & Metadata Librarian
University of Waterloo Library
ahitchen@library.uwaterloo.ca
http://www.lib.uwaterloo.ca

More Related Content

What's hot

Advances in Scientific Workflow Environments
Advances in Scientific Workflow EnvironmentsAdvances in Scientific Workflow Environments
Advances in Scientific Workflow EnvironmentsCarole Goble
 
Big Data Analytics course: Named Entities and Deep Learning for NLP
Big Data Analytics course: Named Entities and Deep Learning for NLPBig Data Analytics course: Named Entities and Deep Learning for NLP
Big Data Analytics course: Named Entities and Deep Learning for NLPChristian Morbidoni
 
Information Skills: 2. Information Hunting (Natural Sciences, Bangor Universi...
Information Skills: 2. Information Hunting (Natural Sciences, Bangor Universi...Information Skills: 2. Information Hunting (Natural Sciences, Bangor Universi...
Information Skills: 2. Information Hunting (Natural Sciences, Bangor Universi...Vashti Zarach
 
Sherborn: Lyal - Digitising legacy taxonomic literature: processes, products ...
Sherborn: Lyal - Digitising legacy taxonomic literature: processes, products ...Sherborn: Lyal - Digitising legacy taxonomic literature: processes, products ...
Sherborn: Lyal - Digitising legacy taxonomic literature: processes, products ...ICZN
 
Bio ontologies and semantic technologies
Bio ontologies and semantic technologiesBio ontologies and semantic technologies
Bio ontologies and semantic technologiesProf. Wim Van Criekinge
 
2019 02 12_biological_databases_part1_v_upload
2019 02 12_biological_databases_part1_v_upload2019 02 12_biological_databases_part1_v_upload
2019 02 12_biological_databases_part1_v_uploadProf. Wim Van Criekinge
 
Reproducibility, Research Objects and Reality, Leiden 2016
Reproducibility, Research Objects and Reality, Leiden 2016Reproducibility, Research Objects and Reality, Leiden 2016
Reproducibility, Research Objects and Reality, Leiden 2016Carole Goble
 
SciFinder and its utility in Drug discovery
SciFinder and its utility in Drug discoverySciFinder and its utility in Drug discovery
SciFinder and its utility in Drug discoveryAlichy Sowmya
 
2019 03 05_biological_databases_part3_v_upload
2019 03 05_biological_databases_part3_v_upload2019 03 05_biological_databases_part3_v_upload
2019 03 05_biological_databases_part3_v_uploadProf. Wim Van Criekinge
 
Advanced searching on EBSCOhost to support systematic reviews
Advanced searching on EBSCOhost to support systematic reviewsAdvanced searching on EBSCOhost to support systematic reviews
Advanced searching on EBSCOhost to support systematic reviewsrosie.dunne
 
Working with data.open.ac.uk, the Linked Data Platform of the Open University
Working with data.open.ac.uk, the Linked Data Platform of the Open UniversityWorking with data.open.ac.uk, the Linked Data Platform of the Open University
Working with data.open.ac.uk, the Linked Data Platform of the Open UniversityMathieu d'Aquin
 
2019 03 05_biological_databases_part5_v_upload
2019 03 05_biological_databases_part5_v_upload2019 03 05_biological_databases_part5_v_upload
2019 03 05_biological_databases_part5_v_uploadProf. Wim Van Criekinge
 
LUCERO - Building the Open University Web of Linked Data
LUCERO - Building the Open University Web of Linked DataLUCERO - Building the Open University Web of Linked Data
LUCERO - Building the Open University Web of Linked DataMathieu d'Aquin
 
Experience from 10 months of University Linked Data
Experience from 10 months of University Linked Data Experience from 10 months of University Linked Data
Experience from 10 months of University Linked Data Mathieu d'Aquin
 
Connecting the dots: drug information and Linked Data
Connecting the dots: drug information and Linked DataConnecting the dots: drug information and Linked Data
Connecting the dots: drug information and Linked DataTomasz Adamusiak
 
Metadata Selection (revised)
Metadata Selection (revised)Metadata Selection (revised)
Metadata Selection (revised)Jill Strass
 

What's hot (20)

Advances in Scientific Workflow Environments
Advances in Scientific Workflow EnvironmentsAdvances in Scientific Workflow Environments
Advances in Scientific Workflow Environments
 
ANT1CAG: support session (library help)
ANT1CAG: support session (library help)ANT1CAG: support session (library help)
ANT1CAG: support session (library help)
 
Open Annotation Model
Open Annotation ModelOpen Annotation Model
Open Annotation Model
 
Big Data Analytics course: Named Entities and Deep Learning for NLP
Big Data Analytics course: Named Entities and Deep Learning for NLPBig Data Analytics course: Named Entities and Deep Learning for NLP
Big Data Analytics course: Named Entities and Deep Learning for NLP
 
Information Skills: 2. Information Hunting (Natural Sciences, Bangor Universi...
Information Skills: 2. Information Hunting (Natural Sciences, Bangor Universi...Information Skills: 2. Information Hunting (Natural Sciences, Bangor Universi...
Information Skills: 2. Information Hunting (Natural Sciences, Bangor Universi...
 
Sherborn: Lyal - Digitising legacy taxonomic literature: processes, products ...
Sherborn: Lyal - Digitising legacy taxonomic literature: processes, products ...Sherborn: Lyal - Digitising legacy taxonomic literature: processes, products ...
Sherborn: Lyal - Digitising legacy taxonomic literature: processes, products ...
 
Bio ontologies and semantic technologies
Bio ontologies and semantic technologiesBio ontologies and semantic technologies
Bio ontologies and semantic technologies
 
2019 02 12_biological_databases_part1_v_upload
2019 02 12_biological_databases_part1_v_upload2019 02 12_biological_databases_part1_v_upload
2019 02 12_biological_databases_part1_v_upload
 
Reproducibility, Research Objects and Reality, Leiden 2016
Reproducibility, Research Objects and Reality, Leiden 2016Reproducibility, Research Objects and Reality, Leiden 2016
Reproducibility, Research Objects and Reality, Leiden 2016
 
2020 02 11_biological_databases_part1
2020 02 11_biological_databases_part12020 02 11_biological_databases_part1
2020 02 11_biological_databases_part1
 
SciFinder and its utility in Drug discovery
SciFinder and its utility in Drug discoverySciFinder and its utility in Drug discovery
SciFinder and its utility in Drug discovery
 
2019 03 05_biological_databases_part3_v_upload
2019 03 05_biological_databases_part3_v_upload2019 03 05_biological_databases_part3_v_upload
2019 03 05_biological_databases_part3_v_upload
 
Scopus
ScopusScopus
Scopus
 
Advanced searching on EBSCOhost to support systematic reviews
Advanced searching on EBSCOhost to support systematic reviewsAdvanced searching on EBSCOhost to support systematic reviews
Advanced searching on EBSCOhost to support systematic reviews
 
Working with data.open.ac.uk, the Linked Data Platform of the Open University
Working with data.open.ac.uk, the Linked Data Platform of the Open UniversityWorking with data.open.ac.uk, the Linked Data Platform of the Open University
Working with data.open.ac.uk, the Linked Data Platform of the Open University
 
2019 03 05_biological_databases_part5_v_upload
2019 03 05_biological_databases_part5_v_upload2019 03 05_biological_databases_part5_v_upload
2019 03 05_biological_databases_part5_v_upload
 
LUCERO - Building the Open University Web of Linked Data
LUCERO - Building the Open University Web of Linked DataLUCERO - Building the Open University Web of Linked Data
LUCERO - Building the Open University Web of Linked Data
 
Experience from 10 months of University Linked Data
Experience from 10 months of University Linked Data Experience from 10 months of University Linked Data
Experience from 10 months of University Linked Data
 
Connecting the dots: drug information and Linked Data
Connecting the dots: drug information and Linked DataConnecting the dots: drug information and Linked Data
Connecting the dots: drug information and Linked Data
 
Metadata Selection (revised)
Metadata Selection (revised)Metadata Selection (revised)
Metadata Selection (revised)
 

Similar to Changing Data: Implementing Primo for the Tri University Group of Libraries (2009)

Faceted search using Solr and Ontopia
Faceted search using Solr and OntopiaFaceted search using Solr and Ontopia
Faceted search using Solr and OntopiaGeir Ove Grønmo
 
Inteligent Catalogue Final
Inteligent Catalogue FinalInteligent Catalogue Final
Inteligent Catalogue Finalguestcaef1d
 
Taxonomies in Search
Taxonomies in SearchTaxonomies in Search
Taxonomies in SearchTSoholt
 
Web Scale Discovery Services: Google like search experience
Web Scale Discovery Services: Google like search experienceWeb Scale Discovery Services: Google like search experience
Web Scale Discovery Services: Google like search experienceNikesh Narayanan
 
New member
New member New member
New member Crossref
 
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and VocabulariesHaystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and VocabulariesMax Irwin
 
Ebsco discovery2012 to 2014
Ebsco discovery2012 to 2014Ebsco discovery2012 to 2014
Ebsco discovery2012 to 2014H Anil Kumar
 
Synchronicity: Just-In-Time Discovery of Lost Web Pages
Synchronicity: Just-In-Time Discovery of Lost Web PagesSynchronicity: Just-In-Time Discovery of Lost Web Pages
Synchronicity: Just-In-Time Discovery of Lost Web PagesMichael Nelson
 
Can personalised be upscaled?
Can personalised be upscaled?Can personalised be upscaled?
Can personalised be upscaled?Tim Wales
 
New member webinar 052418
New member webinar 052418New member webinar 052418
New member webinar 052418Crossref
 
Harvesting From Many Silos at Web-scale Makes E-content Truly Discoverable
Harvesting From Many Silos at Web-scale Makes E-content Truly  DiscoverableHarvesting From Many Silos at Web-scale Makes E-content Truly  Discoverable
Harvesting From Many Silos at Web-scale Makes E-content Truly DiscoverableElectronic Resources & Libraries
 
The Internet
The InternetThe Internet
The Internetmscuttle
 
Erl10 web scale-gb-sg
Erl10 web scale-gb-sgErl10 web scale-gb-sg
Erl10 web scale-gb-sgGeorge Boston
 
Social Web 2.0 Class Week 8: Social Metadata, Ratings, Social Tagging
Social Web 2.0 Class Week 8: Social Metadata, Ratings, Social TaggingSocial Web 2.0 Class Week 8: Social Metadata, Ratings, Social Tagging
Social Web 2.0 Class Week 8: Social Metadata, Ratings, Social TaggingShelly D. Farnham, Ph.D.
 
Encore Presentation - ACRL/NEC ITIG Annual Meeting
Encore Presentation - ACRL/NEC ITIG Annual MeetingEncore Presentation - ACRL/NEC ITIG Annual Meeting
Encore Presentation - ACRL/NEC ITIG Annual MeetingLaura Kohl
 

Similar to Changing Data: Implementing Primo for the Tri University Group of Libraries (2009) (20)

Searching techniques
Searching techniquesSearching techniques
Searching techniques
 
Searching techniques
Searching techniquesSearching techniques
Searching techniques
 
Faceted search using Solr and Ontopia
Faceted search using Solr and OntopiaFaceted search using Solr and Ontopia
Faceted search using Solr and Ontopia
 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
 
From federated to aggregated search
From federated to aggregated searchFrom federated to aggregated search
From federated to aggregated search
 
Inteligent Catalogue Final
Inteligent Catalogue FinalInteligent Catalogue Final
Inteligent Catalogue Final
 
Taxonomies in Search
Taxonomies in SearchTaxonomies in Search
Taxonomies in Search
 
Web Scale Discovery Services: Google like search experience
Web Scale Discovery Services: Google like search experienceWeb Scale Discovery Services: Google like search experience
Web Scale Discovery Services: Google like search experience
 
New member
New member New member
New member
 
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and VocabulariesHaystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
 
Ebsco discovery2012 to 2014
Ebsco discovery2012 to 2014Ebsco discovery2012 to 2014
Ebsco discovery2012 to 2014
 
Synchronicity: Just-In-Time Discovery of Lost Web Pages
Synchronicity: Just-In-Time Discovery of Lost Web PagesSynchronicity: Just-In-Time Discovery of Lost Web Pages
Synchronicity: Just-In-Time Discovery of Lost Web Pages
 
Can personalised be upscaled?
Can personalised be upscaled?Can personalised be upscaled?
Can personalised be upscaled?
 
New member webinar 052418
New member webinar 052418New member webinar 052418
New member webinar 052418
 
Harvesting From Many Silos at Web-scale Makes E-content Truly Discoverable
Harvesting From Many Silos at Web-scale Makes E-content Truly  DiscoverableHarvesting From Many Silos at Web-scale Makes E-content Truly  Discoverable
Harvesting From Many Silos at Web-scale Makes E-content Truly Discoverable
 
The Internet
The InternetThe Internet
The Internet
 
Erl10 web scale-gb-sg
Erl10 web scale-gb-sgErl10 web scale-gb-sg
Erl10 web scale-gb-sg
 
Social Web 2.0 Class Week 8: Social Metadata, Ratings, Social Tagging
Social Web 2.0 Class Week 8: Social Metadata, Ratings, Social TaggingSocial Web 2.0 Class Week 8: Social Metadata, Ratings, Social Tagging
Social Web 2.0 Class Week 8: Social Metadata, Ratings, Social Tagging
 
EDS for IFLA
EDS for IFLAEDS for IFLA
EDS for IFLA
 
Encore Presentation - ACRL/NEC ITIG Annual Meeting
Encore Presentation - ACRL/NEC ITIG Annual MeetingEncore Presentation - ACRL/NEC ITIG Annual Meeting
Encore Presentation - ACRL/NEC ITIG Annual Meeting
 

More from Alison Hitchens

Dewey Update: What's New with the DDC? (2010)
Dewey Update: What's New with the DDC? (2010)Dewey Update: What's New with the DDC? (2010)
Dewey Update: What's New with the DDC? (2010)Alison Hitchens
 
RDA 101: an introduction to RDA (2012)
RDA 101: an introduction to RDA (2012)RDA 101: an introduction to RDA (2012)
RDA 101: an introduction to RDA (2012)Alison Hitchens
 
Primo Reporting: Using 3rd Party Software to Create Primo Reports & Analyze P...
Primo Reporting: Using 3rd Party Software to Create Primo Reports & Analyze P...Primo Reporting: Using 3rd Party Software to Create Primo Reports & Analyze P...
Primo Reporting: Using 3rd Party Software to Create Primo Reports & Analyze P...Alison Hitchens
 
Primo Central Trial, Usability Testing, and Implementation Options (2012)
Primo Central Trial, Usability Testing, and Implementation Options (2012)Primo Central Trial, Usability Testing, and Implementation Options (2012)
Primo Central Trial, Usability Testing, and Implementation Options (2012)Alison Hitchens
 
Primo at TUG: Using Primo in a Consortial Environment (2013)
Primo at TUG: Using Primo in a Consortial Environment (2013)Primo at TUG: Using Primo in a Consortial Environment (2013)
Primo at TUG: Using Primo in a Consortial Environment (2013)Alison Hitchens
 
Trouble-shooting Tips for Primo (2013)
Trouble-shooting Tips for Primo (2013)Trouble-shooting Tips for Primo (2013)
Trouble-shooting Tips for Primo (2013)Alison Hitchens
 
What is #LODLAM?! (revised January 2015)
What is #LODLAM?! (revised January 2015)What is #LODLAM?! (revised January 2015)
What is #LODLAM?! (revised January 2015)Alison Hitchens
 
Getting "good" e-theses MARC records from DSpace
Getting "good" e-theses MARC records from DSpaceGetting "good" e-theses MARC records from DSpace
Getting "good" e-theses MARC records from DSpaceAlison Hitchens
 
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...Alison Hitchens
 
OPAC Via Primo (OvP): Sorting Out What is Primo and What is the ILS
OPAC Via Primo (OvP): Sorting Out What is Primo and What is the ILSOPAC Via Primo (OvP): Sorting Out What is Primo and What is the ILS
OPAC Via Primo (OvP): Sorting Out What is Primo and What is the ILSAlison Hitchens
 
Making PowerPoint accessible
Making PowerPoint accessibleMaking PowerPoint accessible
Making PowerPoint accessibleAlison Hitchens
 
Linked open data and libraries
Linked open data and librariesLinked open data and libraries
Linked open data and librariesAlison Hitchens
 
MOAR RDA For Systems Folks
MOAR RDA For Systems FolksMOAR RDA For Systems Folks
MOAR RDA For Systems FolksAlison Hitchens
 

More from Alison Hitchens (15)

Dewey Update: What's New with the DDC? (2010)
Dewey Update: What's New with the DDC? (2010)Dewey Update: What's New with the DDC? (2010)
Dewey Update: What's New with the DDC? (2010)
 
RDA 101: an introduction to RDA (2012)
RDA 101: an introduction to RDA (2012)RDA 101: an introduction to RDA (2012)
RDA 101: an introduction to RDA (2012)
 
Primo Reporting: Using 3rd Party Software to Create Primo Reports & Analyze P...
Primo Reporting: Using 3rd Party Software to Create Primo Reports & Analyze P...Primo Reporting: Using 3rd Party Software to Create Primo Reports & Analyze P...
Primo Reporting: Using 3rd Party Software to Create Primo Reports & Analyze P...
 
Primo Central Trial, Usability Testing, and Implementation Options (2012)
Primo Central Trial, Usability Testing, and Implementation Options (2012)Primo Central Trial, Usability Testing, and Implementation Options (2012)
Primo Central Trial, Usability Testing, and Implementation Options (2012)
 
Primo at TUG: Using Primo in a Consortial Environment (2013)
Primo at TUG: Using Primo in a Consortial Environment (2013)Primo at TUG: Using Primo in a Consortial Environment (2013)
Primo at TUG: Using Primo in a Consortial Environment (2013)
 
Trouble-shooting Tips for Primo (2013)
Trouble-shooting Tips for Primo (2013)Trouble-shooting Tips for Primo (2013)
Trouble-shooting Tips for Primo (2013)
 
What is #LODLAM?! (revised January 2015)
What is #LODLAM?! (revised January 2015)What is #LODLAM?! (revised January 2015)
What is #LODLAM?! (revised January 2015)
 
Getting "good" e-theses MARC records from DSpace
Getting "good" e-theses MARC records from DSpaceGetting "good" e-theses MARC records from DSpace
Getting "good" e-theses MARC records from DSpace
 
RDA for Public Services
RDA for Public ServicesRDA for Public Services
RDA for Public Services
 
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
 
OPAC Via Primo (OvP): Sorting Out What is Primo and What is the ILS
OPAC Via Primo (OvP): Sorting Out What is Primo and What is the ILSOPAC Via Primo (OvP): Sorting Out What is Primo and What is the ILS
OPAC Via Primo (OvP): Sorting Out What is Primo and What is the ILS
 
Making PowerPoint accessible
Making PowerPoint accessibleMaking PowerPoint accessible
Making PowerPoint accessible
 
Linked open data and libraries
Linked open data and librariesLinked open data and libraries
Linked open data and libraries
 
Introducing linked data
Introducing linked dataIntroducing linked data
Introducing linked data
 
MOAR RDA For Systems Folks
MOAR RDA For Systems FolksMOAR RDA For Systems Folks
MOAR RDA For Systems Folks
 

Recently uploaded

Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
Planning a health career 4th Quarter.pptx
Planning a health career 4th Quarter.pptxPlanning a health career 4th Quarter.pptx
Planning a health career 4th Quarter.pptxLigayaBacuel1
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceSamikshaHamane
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxChelloAnnAsuncion2
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfphamnguyenenglishnb
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Mark Reed
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementmkooblal
 

Recently uploaded (20)

Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
Planning a health career 4th Quarter.pptx
Planning a health career 4th Quarter.pptxPlanning a health career 4th Quarter.pptx
Planning a health career 4th Quarter.pptx
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
 
OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...
 
Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of management
 

Changing Data: Implementing Primo for the Tri University Group of Libraries (2009)

  • 1. Changing Data Implementing Primo for the Tri-Universities Group (TUG) Presentation at ELUNA May, 2009 Alison Hitchens Cataloguing & Metadata Librarian
  • 2. Outline  Background  Loading data into Primo  Normalization  Testing
  • 4.  Formed in 1995  Shared resources and collaboration including:  Shared storage facility  Shared integrated library system (ILS)  Reciprocal borrowing  Document delivery  Statistics portal  Shared databases  Collaborative functional committees
  • 5.  Shared ILS and Catalogue  TRELLIS (Voyager)  No significant changes to interface in 10 years  Search is limited to catalogue data
  • 6.
  • 7.
  • 8.  One place to search  Potential to include a variety of datasets:  Library catalogue (currently loaded into Primo)  Articles  GIS information  Our Ontario image bank  Local repositories  Deep search Primo Advantages
  • 9.  User-friendly interface  XML compliant  Avoids duplication of search results  Groups together different editions of the same work (FRBR)  Interoperability with existing tools Primo Advantages
  • 10.
  • 11.
  • 13. The Primo Team  Team created in late January 2008  Training held in late March 2008  Primo Alpha launched to staff in July 2008  Primo Beta launched to TUG community in November 2008  Goal: make Primo the primary search tool in late May 2009
  • 14. Phase Two  Usability testing  Naming & branding  New data sources  New books list  Fine-tuning functionality  Deep search
  • 16. Loading Data Into Primo MARC
  • 17.
  • 18. Loading Data Into Primo MARC MARC XML Extract
  • 21. Loading Data Into Primo MARC MARC XML PNX (Primo Normalized XML) Extract Normalization
  • 25. Loading Data Into Primo MARC MARC XML PNX (Primo Normalized XML) Deduplication, FRBR, Didumean, Indexing Front End (user interface) Extract Normalization
  • 28. Normalization: what is it?  Massaging data  Rules that tell the program how to get from MARC XML to Primo Normalized XML (PNX)  Filter that distributes the incoming data and places it in different sections  What MARC tags hold the title  What MARC codes show the format  What data should be included in searches  What data should be available for display  Transformation rules  How that data should be formatted (dates, punctuation, capitalization, etc.)
  • 29. Normalization: what is it?  Customization  Fixing “bad” data  Complex changes  Consortial issues  Lessons learned the hard way
  • 30. Customization  Search fields  Created call number search  Augmented title search with contents note (505 tag)  Display fields  Added subject tag used for slide collection subjects (654 tag)  Added explanatory text in front of analytical titles  FRBR  Excluded “selections”  Facets  Used location names as collection facets
  • 31. Adding contents note to title search
  • 32. Adding contents note to title search
  • 33. Adding contents note to title search
  • 34. Fixing “bad” data  Old records lacking proper indicators  Main author (100 tag) with invalid indicators (1st indicator blank or |)  Old records lacking subfield coding  Uniform title (240 tag) missing subfields ($k)  ISBNS with hyphens  008 with invalid data in first 6 characters  Blanks or letters instead of record creation date
  • 39. Complex changes  Tweaking delivery of online journals  Delivery using SFX  Exclude serials that no longer have an online holding but record still coded as online  Exclude government serials  Exclude public microdata files  Exclude databases (integrating resources)
  • 40. Tweaking delivery of online journals
  • 41. Tweaking delivery of online journals
  • 42. Tweaking delivery of online journals
  • 43. Consortial issues  Restricting online resources to individual institutions  Which URL should be presented?  Should restrictions be presented?  Coping with shared locations  e.g. GWINTER = Internet resource shared by Guelph and Waterloo but not Wilfrid Laurier  Instead of 2 separate locations UGINTER and UWINTER  Creating search scopes for colleges and campuses  e.g. ability to limit search to Architecture materials
  • 46. Restricting Online Resources  Problem 1:  Which link belongs to which institution?  Otherwise will simply present the first URL in the record  Need to add $$I based on ownership  Location code isn’t extracted with the 856  Problem 2:  Restricting the each link to each institution  Otherwise will give Online access message to users who do not have access  Need to add restricted delivery scope
  • 47. Specify Institution for Online Resources
  • 51. Limiting searches to a location
  • 52. Limiting searches to a location
  • 53. Limiting searches to a location
  • 54. Lessons learned the hard way  If checking that a tag exists, need to also include subfields
  • 55. Lessons learned the hard way  If writing more than one value, need more than one rule Result: $$I GUELPH
  • 57. Match current, match any? Problem: includes title from 245, author from 100, publisher from 260 etc.
  • 58. Match current, match any? Match any If any of the 880 tags have $6 505 then copy the 880 tag as is. This means that if any of the 880s tag meet this requirement, it will copy all of the 880 tags. Match current: just analyse them one at a time and only copy the one that meets the condition
  • 59. Testing  Staging database  What am I testing?  Testing sample records  Testing the process  Random testing
  • 60. Test specific changes  Changes to normalization rules  Changes to front end display  Changes to tables (e.g. new location codes)  New release enhancements/bug fixes Look for:  What you were expecting  Note any surprises!
  • 61. Staging database  Holds 200,000 records  Random sample of our collection  100 titles from each location code  Random sample proportionate to records held by each institution  Combination of old pre-TRELLIS records and newly created records  Shakespeare call number range to test grouping of editions (FRBR)
  • 62. Test records  100 records for repeated testing in front end  Brief records (acquisitions, e-reserves, CODOC)  Different formats (micro, music, video, electronic)  Things to test holdings info (acc. material, multi-volume, multiple items, multiple locations)  Foreign language materials  Duplicates  Editions  10 records  For immediate test of normalization rules in back office
  • 63. Test specific functionality  Fulfillment cycle  When the user finds the item that he wants, can he actually get the item based on:  The information presented in the results screen  The information presented in the full display  The linking provided  The information presented in the holdings display
  • 64. Testing: online resources  What am I testing? (what do I want to happen)  Is online availability showing correctly in relation to the user?  Online access  Online access is restricted  Physical resource
  • 65. Testing: online resources  What am I testing? (what do I want to happen)  Does user receive a relevant link?  SFX delivery  Direct to resource  Link appropriate to institution
  • 66. Testing: online resources  What am I testing? (what do I want to happen)  Can user view alternate/multiple links in the full record display?
  • 67. Testing: Online resources  Variables for sample records  E-journals, e-books, e-data, databases  SFX delivery, online delivery  Multi-volume sets  Restricted to one institution  Different institutions, different providers  Online for one institution, physical for another
  • 68. Testing: online resources  Instructions for testers:  Each tester should check the list of records and verify that there is an online link  For each record test that the link label is correct:  Available online: check against TRELLIS to verify that your institution has an online holding  Online access is restricted: check against TRELLIS to verify that your institution does NOT have an online holding  Link takes you to the correct place
  • 69. Testing: online resources  Random sampling  Each tester should also do a search on a subject of their choice and verify links using the first page of results
  • 70. Testing environment  Test in all views  Waterloo, Laurier, Guelph  Test in different IP ranges  Test off campus
  • 72. Testing: feedback from users Overall, I am VERY impressed with Primo. It is far more functional in many ways. When I find an online journal, the Click here for access link does not work I wonder if the alerts by email are working? Let me login using my UWDIR id
  • 73. Changing Data  Thank you! Alison Hitchens Cataloguing & Metadata Librarian University of Waterloo Library ahitchen@library.uwaterloo.ca http://www.lib.uwaterloo.ca