OCLC Research Library Partnership
Work-In-Progress webinar
3 December 2015
A Close Look at the Four Million
Archival MARC Records in WorldCat
Jackie Dooley
Program Officer
OCLC Research
OVERVIEW
• Research Objective
• Some Initial Questions
• Scope of the Dataset
• Key Findings
• Data Analysis
• Tentative Recommendations
• What’s Next?
RESEARCH OBJECTIVE
Research Objective
Establish a detailed profile of MARC data element
occurrences in archival catalog records, providing a
view of 30+ years of practice.
• Reveal variations in descriptive practice across formats
• Characterize practice before MARC usage diminishes
• Debunk any inaccurate assumptions
• Suggest changes to descriptive practice
• Enable analysis of implications for discovery
Take note! I studied field occurrences, not content.
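A minimal sketch of what a field-occurrence profile might look like in practice. It is not the study's actual tooling. The record shape (a dict with a `fields` mapping from MARC tag to a list of occurrences) and the name `occurrence_profile` are assumptions made for illustration. The deck does not say whether its means are computed over all records or only over records that contain the field; this sketch averages over all records.

```python
def occurrence_profile(records, tag):
    """Return (share of records containing `tag`, mean occurrences per record).

    `records` is a hypothetical in-memory structure: each record is a dict
    whose "fields" entry maps a MARC tag to a list of field occurrences.
    The mean is taken over ALL records (an assumption, see the lead-in).
    """
    counts = [len(r["fields"].get(tag, [])) for r in records]
    total = len(counts)
    if total == 0:
        return 0.0, 0.0
    share = sum(1 for c in counts if c > 0) / total
    mean = sum(counts) / total
    return share, mean


# Illustrative data only; these are not records from the dataset.
sample = [
    {"fields": {"650": [{"a": "Railroads"}, {"a": "Bridges"}]}},
    {"fields": {"520": [{"a": "Family letters."}]}},
    {"fields": {"650": [{"a": "Farming"}]}},
    {"fields": {"650": [{"a": "Mining"}], "655": [{"a": "Photographs"}]}},
]

share_650, mean_650 = occurrence_profile(sample, "650")
```

Only the presence and count of a tag are inspected, never the text inside it, which matches the "occurrences, not content" scope stated above.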
SOME INITIAL QUESTIONS
Some Initial Questions
• What is “archival material”?
• Is archival use of MARC accurate and fulfilling its potential?
• How does archival description differ across types of material?
• Are archival materials usually described as collections?
• Does the archival control byte capture all archival descriptions?
• How often is DACS specified as the content standard?
• To what extent have DACS minimum requirements been met?
• Bonus question: What implications for next-gen cataloging do
the data suggest?
SCOPE OF THE DATASET
Archival records filtered from WorldCat
• OCLC’s WorldCat database of 340+ million records
filtered to extract “archival” records
– Currently 4 million, about 1% of WorldCat
– Scope expanded two years ago to add more types of material
• Brief version of the filter specs
– “Unpublished” materials in any format
– Under “archival control”
– Held by a single institution
– Excludes published materials
Spoiler alert: It’s not perfect.
Same dataset as ArchiveGrid
• Only one library holding symbol is attached (to eliminate non-unique items or collections)
• The MARC Leader has one or more of the following:
– Leader byte 06 (record type) has the value d (manuscript music), f (manuscript
cartographic), g (projected graphics), i (nonmusic recording), j (music recording), k
(visual), p (mixed), r (realia), or t (textual manuscript). [does this include all the new
ones?]
– Leader byte 06 has the value "a" (language material) and Leader byte 07
(bibliographic level) has the value "c" (collection).
– Leader byte 08 has the value "a" (archival control).
• Field 260 subfields "a" and "b" are not present (to filter out published works)
• "Bibliography" does not occur at the beginning of any MARC subject heading
subfield "a" or "v" (to filter out published works).
• Field 502 is not present (to filter out theses and dissertations).
• Records with material type "book" or "serial" that have no value in fields 008 or 006
“Nature of Contents” bytes (to eliminate theses, reference works, and other non-archival
materials).
The full filter specs: http://beta.worldcat.org/archivegrid/about/
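The filter conditions listed on this slide can be sketched roughly as follows. This is an illustration under stated assumptions, not OCLC's implementation. The record shape (`leader`, `holdings`, `fields`), the helper `make_leader`, and the function names are all hypothetical. Two readings are assumed: a record is rejected if field 260 carries either $a or $b, and "subject heading" means any 6xx field. The final bullet about "Nature of Contents" bytes is left out because the slide does not state the rule precisely enough to code.

```python
# Leader/06 values named on the slide as archival record types.
ARCHIVAL_RECORD_TYPES = set("dfgijkprt")


def make_leader(record_type, bib_level, control=" "):
    """Build a 24-character MARC leader for examples (hypothetical helper)."""
    return "00000n" + record_type + bib_level + control + " 2200000   4500"


def leader_qualifies(leader):
    """True if the leader meets at least one of the three leader conditions."""
    record_type, bib_level, control = leader[6], leader[7], leader[8]
    return (
        record_type in ARCHIVAL_RECORD_TYPES
        or (record_type == "a" and bib_level == "c")
        or control == "a"
    )


def looks_archival(record):
    """Apply the slide's filter conditions to one record (sketch only)."""
    fields = record["fields"]
    if len(record["holdings"]) != 1:       # exactly one holding symbol
        return False
    if not leader_qualifies(record["leader"]):
        return False
    for occ in fields.get("260", []):      # assumption: either subfield rejects
        if "a" in occ or "b" in occ:
            return False
    if "502" in fields:                    # theses and dissertations
        return False
    for tag, occurrences in fields.items():
        if not tag.startswith("6"):        # assumption: subject headings are 6xx
            continue
        for occ in occurrences:
            for code in ("a", "v"):
                if occ.get(code, "").startswith("Bibliography"):
                    return False
    return True


collection = {
    "leader": make_leader("p", "c", "a"),
    "holdings": ["ABC"],
    "fields": {"245": [{"a": "Papers"}], "520": [{"a": "Family letters."}]},
}
published = {
    "leader": make_leader("t", "m"),
    "holdings": ["ABC"],
    "fields": {"260": [{"a": "New York", "b": "Example Press"}]},
}
```

Here `looks_archival(collection)` is true and `looks_archival(published)` is false; a record with only a date in 260 $c would still pass under the reading assumed above.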
KEY FINDINGS
Key Findings
• Record type (Leader 06) sometimes used incorrectly
– Mixed materials, computer files, web sites (aka Integrating Resources)
• Cataloging practices reveal format-specific silos
– Record type, archival control, descriptive rules, note fields, use of
topical subject field (650) for genre/form terms (655)
• Records describing single items greatly predominate for all
record types except Mixed Materials
– … and 25% of Mixed Materials records describe a single item
• Format-specific notes (5xx) underutilized
– 506, 511, 520, 524, 545, 546, 555, 561 …
– 500 is most-used note for maps, recordings, scores, text, visual
Key Findings, cont.
• Archival control (Leader 08) specified in 28% of records
– 40% of Mixed Materials records
• Archival descriptive standards (040 $e) specified in 20% of
records
– appm, dacs, gihc
– 61% of records specify AACR2, 1.5% RDA
• One-third of records link (856) to digital content
– Digital objects or finding aids
DATA ANALYSIS
1. Full data
2. Visual materials
3. Mixed materials
4. Textual materials
5. Recordings
6. Scores
7. Maps
8. Other formats
1. Full data (4 million records)
• 88% are visual, mixed, or textual materials
• 39% describe collections, 51% single items
– “Component” levels are little used
– Records for collections are mostly Mixed Materials
• 28% of records specify archival control (Leader 08)
• 20% specify use of archival cataloging rules (040 $e)
• Creator names (1xx and 7xx) indexed in 86%
• Subject terms (6xx) indexed in 84%
• Link (856) to digital content in 33%
– Digital objects or finding aids
Percent of records by type of material (Leader 06)
• Visual: 36.8%
• Mixed: 31.6%
• Text: 20.1%
• Recording: 8.0%
• Score: 2.9%
• All other formats: 0.6%
Number of records by bibliographic level (Leader 07)
[Bar chart: number of records (0 to 1,200,000) for Visual, Mixed, Text, Recording, Score, and Other formats, broken down by Collection (c), Subunit (d), Monograph/Item (m), and Other levels]
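A breakdown by bibliographic level like the one charted here could be tallied directly from the leader. The sketch below is an assumption about how such a count might be done, not the method used for the study. The function name `tally_levels` and the sample leaders are hypothetical. The level labels follow the chart legend, and every Leader 07 value other than c, d, or m is grouped as "Other levels".

```python
from collections import Counter

# Labels follow the chart legend; other Leader/07 values fall into one bucket.
LEVEL_LABELS = {"c": "Collection", "d": "Subunit", "m": "Monograph/Item"}


def tally_levels(leaders):
    """Count records by (Leader/06 record type, bibliographic level label)."""
    tally = Counter()
    for leader in leaders:
        record_type = leader[6]
        level = LEVEL_LABELS.get(leader[7], "Other levels")
        tally[(record_type, level)] += 1
    return tally


# Illustrative 24-character leaders only; not taken from the dataset.
sample_leaders = [
    "00000npc a2200000   4500",  # mixed materials, collection
    "00000npc a2200000   4500",  # mixed materials, collection
    "00000nkm  2200000   4500",  # 2-D graphic, single item
    "00000nda  2200000   4500",  # manuscript score, component part
]

levels = tally_levels(sample_leaders)
```

Keying on the raw Leader 06 byte keeps the sketch neutral about how record types are grouped into the broader categories used in the chart.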
Subject and genre/form index terms
2. Visual Materials
• 1.5 million records (36% of total)
– 2-D graphics (30% of all records)
– Projected graphics (film, video, slides: 6% of all records)
– Small number of kits and 3-D artifacts
• Coded data
– 76% describe items, 15% collections
– Less than 10% specify archival control (Leader 08)
– 1% specify use of gihc
– Coded physical characteristics (007) in 57%
• Most-used notes
– General note (500) in 77% of records
– Summary (520) in 68%
– Conditions governing use/reproduction (540) in 57%
2. Visual Materials, cont.
• Primary creator (1xx) in 51% of all records
• Secondary creator (7xx) in about 31%
• Personal name subject (600) in 32%; mean of 1.1 per
record
• Topical subject (650) in 68%; mean of 4.2
• Geographic subject (651) in 38%; mean of 1.5
• Genre/form (655) in 81%; mean of 1.5
• Link to digital content (856) in 48%
3. Mixed Materials
• 1.3 million records (31% of all records)
• Coded data
– 75% describe collections, 25% items
– 40% specify archival control (Leader 08)
– 40% specify use of appm or dacs
• 10% have no title in 245 $a ($k usually included)
• Organization/arrangement (351) in 12%
• Most-used notes
– Summary (520) in 75% of records
– General note (500) in 44%
– Restrictions on access (506) in 37%
– Biographical/historical (545) in 27%
– No other 5xx used in more than 30%
3. Mixed Materials, cont.
• Personal author (100) is primary creator in 40%
• Corporate author (110) is primary creator in 21%
• Secondary creators (7xx) in about 20%
• Personal name subject (600) in 34%; mean of 1.5 per record
• Topical subject (650) in 45%; mean of 3.0
• Geographic subject (651) in 40%; mean of 1.3
• Genre/form (655) in 65%; mean of 1.3
• Link to digital content (856) in 34%
3. Mixed Materials, cont.
Presence of DACS (2004- ) single-level required minimum
elements (Mixed Materials records only)
• Reference code: stored in local database
• Name/location of repository: stored in MARC holdings record
• Title: 100% of records
• Date(s): 52% in 245 $f, 21% in 260 $c
• Extent (300): 78%
• Creator(s), if known (1xx): 61%
• Scope/content (520): 75%
• Conditions governing access (506): 37%
• Languages/scripts of the material (546): 13%
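The element-to-field mapping on this slide can be turned into a simple presence check. The following is a minimal sketch that assumes a record is held as a hypothetical `fields` dict mapping each MARC tag to a list of subfield dicts. The names `dacs_minimum_report`, `has_tag`, and `has_subfield` are illustrative. Reference code and repository are skipped because the slide notes that they are stored outside the bibliographic record. As in the study, only field presence is checked, not the content.

```python
def has_tag(fields, tag):
    """True if the record has at least one occurrence of `tag`."""
    return bool(fields.get(tag))


def has_subfield(fields, tag, code):
    """True if any occurrence of `tag` carries subfield `code`."""
    return any(code in occ for occ in fields.get(tag, []))


def dacs_minimum_report(fields):
    """Map each DACS single-level minimum element to a presence flag."""
    return {
        "title": has_tag(fields, "245"),
        "date": has_subfield(fields, "245", "f") or has_subfield(fields, "260", "c"),
        "extent": has_tag(fields, "300"),
        "creator": any(t.startswith("1") and occs for t, occs in fields.items()),
        "scope_content": has_tag(fields, "520"),
        "access_conditions": has_tag(fields, "506"),
        "language": has_tag(fields, "546"),
    }


# Illustrative record only; not taken from the dataset.
example_fields = {
    "100": [{"a": "Doe, Jane"}],
    "245": [{"a": "Jane Doe papers", "f": "1901-1950"}],
    "300": [{"a": "3 linear feet"}],
    "520": [{"a": "Correspondence and diaries."}],
}

report = dacs_minimum_report(example_fields)
missing = sorted(name for name, present in report.items() if not present)
```

For the example record, `missing` lists the access and language elements, the two elements that the slide shows are least often present.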
3. Mixed Materials, cont.
Note fields used in >10% of records
Field   % of records   Note
500     44%            General note
506     37%            Restrictions on access
520     75%            Summary
524     15%            Preferred citation
540     31%            Terms governing use/reproduction
541     18%            Source of acquisition
545     27%            Biographical/Historical note
546     13%            Language
555     21%            Finding aid
Key (shading bands): 5-25%, 26-50%, 51-90%, 91-100%
4. Textual materials
• 809,000 records (20% of all records)
– Collections of printed materials (4% of all records)
– Textual manuscripts (21% of all records)
• Coded data
– 66% describe collections, 29% items
– 16% specify archival control (Leader 08)
– 17% specify use of appm or dacs
• Most-used notes
– Summary (520) in 75%
– General note (500) in 54%
– Restrictions on access (506) in 37%
4. Textual materials, cont.
• Primary author (mostly 100) in 77% of records
• Secondary author (7xx) in about 50%
• Personal name subject (600) in 30%; mean of 0.9 per
record
• Topical subject (650) in 47%; mean of 1.7
• Geographic subject (651) in 29%; mean of 0.8
• Genre/form (655) in 35%; mean of 0.7
• Link to digital content (856) in 5%
5. Recordings
• 322,000 records (8% of all records)
– Music (5% of all records), nonmusic (3%)
• Coded data
– 95% describe items
– 3% specify archival control (Leader 08)
– Coded physical characteristics (007) in 78%
• Most-used notes
– General note (500) in 68% of records
– Date/time/place of event (518) in 49%
– Participant/performer (511) in 33%
5. Recordings, cont.
• Primary creator (1xx) in 75% of records
• Secondary creator (7xx) in 100%
• Topical subject (650) in 66%; mean of 5.2 per record
• Geographic subject (651) in 22%; mean of 0.9
• Genre/form term (655) in 25%; mean of 1.2
• Link to digital content (856) in 3%
6. Scores
• 117,000 records (3% of all records)
– Mostly manuscript scores (3% of all records), a few printed scores
• Coded data
– 77% describe items, 14% components
– 3% specify archival control (Leader 08)
• Uniform title (240) in 41%
• Most-used notes
– General note (500) in 96% of records
– Little use of any other 5xx’s
6. Scores, cont.
• Primary creator (1xx) in 90% of records
• Secondary creator (7xx) in ca. 50%
• Topical subject (650) in 96% of records; mean of 2.4
• Genre/form (655) in 34%; often in 650 instead
– 650s will gradually move to 655
• Link to digital content (856) in 25%
7. Maps
• 22,000 records (0.6% of all records)
– Mostly manuscript maps, a few printed maps
• Coded data
– 95% describe items
– Coded physical characteristics (007) in 65% of records
– 4% specify archival control (Leader 08)
– Hierarchical geographic area code (043) in 80%
– Geographic classification code (052) in 66%
• Cartographic mathematical data (255) in 92%
• Most-used notes
– General note (500) in 96%
– Little use of any other 5xx’s
7. Maps, cont.
• Primary creator (1xx) in 53% of records
• Secondary creator (7xx) in 50%
• Topical subject (650) in 68%; mean of 2.8 per record
• Geographic subject (651) in 83%; mean of 2.7
• Genre/form (655) in 84%; mean of 1.8
• Link to digital content (856) in 14%
8. Other formats
• Dataset also includes a few records for:
– Computer files (1,275)
• Most should instead use record type for nature of content
– Web sites (146)
• Record type used for these is Integrating Resources
• Thousands of others use another record type, e.g. Mixed Materials
– Serials (109)
• Included only because archival control (Leader 08) is
specified
WHAT’S NEXT?
My Questions for You
• Which of the findings are significant enough to
warrant changes in practice?
• Do the data debunk any assumptions?
• Would you tweak the specs of our filter?
• What other questions should I be asking?
• … And what are the implications for next-generation cataloging?
Tentative Recommendations
• Consider eliminating some little-used note fields from MARC
• Educate archival community about accurate use of record
types and why consistency matters
• Promote DACS single-level minimum required elements
• Promote value of collection-level records to special
materials communities
• Consider doing some automated data remediation
– Sample possibilities: add missing language notes, “no restrictions”
notes, country codes, titles in 245 $a
• What else? What would help you in your work?
Next Steps
• Publish OCLC Research report early in 2016
• Prepare a second paper on implications for discovery,
comparing MARC and EAD data (Bron et al. in Code{4}Lib,
2013)
• Possible future projects
– Study data content
– Selective data remediation
• Enhance generic titles (e.g., Papers, Records)
• Add missing language notes (field 546)
– Descriptive practice for web archiving
• What research might you take on?
Please send feedback!
Jackie Dooley
Program Officer, OCLC Research
dooleyj@oclc.org
@minniedw

More Related Content

What's hot

On the Way to a Holding Ontology
On the Way to a Holding OntologyOn the Way to a Holding Ontology
On the Way to a Holding Ontology
Jakob .
 
RDA in the wilder world: workshop on serials
RDA in the wilder world: workshop on serialsRDA in the wilder world: workshop on serials
RDA in the wilder world: workshop on serials
ISSN International Centre
 
Datalift a-catalyser-for-the-web-of-data-fosdem-05-02-2011
Datalift a-catalyser-for-the-web-of-data-fosdem-05-02-2011Datalift a-catalyser-for-the-web-of-data-fosdem-05-02-2011
Datalift a-catalyser-for-the-web-of-data-fosdem-05-02-2011
François Scharffe
 
Rs detective 2nd_fri
Rs detective 2nd_friRs detective 2nd_fri
Rs detective 2nd_fri
LYRASIS_PRODEV
 
Methodology for Linguistic Linked Open Data generation. The Apertium RDF case
Methodology for Linguistic Linked Open Data generation. The Apertium RDF caseMethodology for Linguistic Linked Open Data generation. The Apertium RDF case
Methodology for Linguistic Linked Open Data generation. The Apertium RDF case
Jorge Gracia
 
Semantic Pipes and Semantic Mashups
Semantic Pipes and Semantic MashupsSemantic Pipes and Semantic Mashups
Semantic Pipes and Semantic Mashups
giurca
 
Network discovery - Inside out by Aakash Goel
Network discovery - Inside out by Aakash GoelNetwork discovery - Inside out by Aakash Goel
Network discovery - Inside out by Aakash Goel
OWASP Delhi
 
Trying SPARQL Anything with MEI
Trying SPARQL Anything with MEITrying SPARQL Anything with MEI
Trying SPARQL Anything with MEI
Enrico Daga
 
The SPARQL Anything project
The SPARQL Anything projectThe SPARQL Anything project
The SPARQL Anything project
Enrico Daga
 
Knowledge graph construction with a façade - The SPARQL Anything Project
Knowledge graph construction with a façade - The SPARQL Anything ProjectKnowledge graph construction with a façade - The SPARQL Anything Project
Knowledge graph construction with a façade - The SPARQL Anything Project
Enrico Daga
 
Presentation shexer
Presentation shexerPresentation shexer
Presentation shexer
Daniel Fernández Álvarez
 
Rda and new research potentials, agata kawalec
Rda and new research potentials, agata kawalecRda and new research potentials, agata kawalec
Rda and new research potentials, agata kawalec
Richard.Sapon-White
 
Using Public RDF Resources in Neo4j
Using Public RDF Resources in Neo4jUsing Public RDF Resources in Neo4j
Using Public RDF Resources in Neo4j
Neo4j
 
Rich Data? Poor Data? Depends on...
Rich Data? Poor Data? Depends on...Rich Data? Poor Data? Depends on...
Rich Data? Poor Data? Depends on...
Lars G. Svensson
 
SPARQL in the Semantic Web
SPARQL in the Semantic WebSPARQL in the Semantic Web
SPARQL in the Semantic Web
Jan Beeck
 
Migrating data to a new LMS: challenges, opportunities and lessons / Penny Do...
Migrating data to a new LMS: challenges, opportunities and lessons / Penny Do...Migrating data to a new LMS: challenges, opportunities and lessons / Penny Do...
Migrating data to a new LMS: challenges, opportunities and lessons / Penny Do...
CILIP MDG
 
20140521 sem-tech-biz-guest-lecture
20140521 sem-tech-biz-guest-lecture20140521 sem-tech-biz-guest-lecture
20140521 sem-tech-biz-guest-lecture
Vladimir Alexiev, PhD, PMP
 
SHACL: Shaping the Big Ball of Data Mud
SHACL: Shaping the Big Ball of Data MudSHACL: Shaping the Big Ball of Data Mud
SHACL: Shaping the Big Ball of Data Mud
Richard Cyganiak
 
DBpedia Citation Challenge. (Not only) Polish Citations in Wikipedia: analysi...
DBpedia Citation Challenge. (Not only) Polish Citations in Wikipedia: analysi...DBpedia Citation Challenge. (Not only) Polish Citations in Wikipedia: analysi...
DBpedia Citation Challenge. (Not only) Polish Citations in Wikipedia: analysi...
Krzysztof Wecel
 

What's hot (19)

On the Way to a Holding Ontology
On the Way to a Holding OntologyOn the Way to a Holding Ontology
On the Way to a Holding Ontology
 
RDA in the wilder world: workshop on serials
RDA in the wilder world: workshop on serialsRDA in the wilder world: workshop on serials
RDA in the wilder world: workshop on serials
 
Datalift a-catalyser-for-the-web-of-data-fosdem-05-02-2011
Datalift a-catalyser-for-the-web-of-data-fosdem-05-02-2011Datalift a-catalyser-for-the-web-of-data-fosdem-05-02-2011
Datalift a-catalyser-for-the-web-of-data-fosdem-05-02-2011
 
Rs detective 2nd_fri
Rs detective 2nd_friRs detective 2nd_fri
Rs detective 2nd_fri
 
Methodology for Linguistic Linked Open Data generation. The Apertium RDF case
Methodology for Linguistic Linked Open Data generation. The Apertium RDF caseMethodology for Linguistic Linked Open Data generation. The Apertium RDF case
Methodology for Linguistic Linked Open Data generation. The Apertium RDF case
 
Semantic Pipes and Semantic Mashups
Semantic Pipes and Semantic MashupsSemantic Pipes and Semantic Mashups
Semantic Pipes and Semantic Mashups
 
Network discovery - Inside out by Aakash Goel
Network discovery - Inside out by Aakash GoelNetwork discovery - Inside out by Aakash Goel
Network discovery - Inside out by Aakash Goel
 
Trying SPARQL Anything with MEI
Trying SPARQL Anything with MEITrying SPARQL Anything with MEI
Trying SPARQL Anything with MEI
 
The SPARQL Anything project
The SPARQL Anything projectThe SPARQL Anything project
The SPARQL Anything project
 
Knowledge graph construction with a façade - The SPARQL Anything Project
Knowledge graph construction with a façade - The SPARQL Anything ProjectKnowledge graph construction with a façade - The SPARQL Anything Project
Knowledge graph construction with a façade - The SPARQL Anything Project
 
Presentation shexer
Presentation shexerPresentation shexer
Presentation shexer
 
Rda and new research potentials, agata kawalec
Rda and new research potentials, agata kawalecRda and new research potentials, agata kawalec
Rda and new research potentials, agata kawalec
 
Using Public RDF Resources in Neo4j
Using Public RDF Resources in Neo4jUsing Public RDF Resources in Neo4j
Using Public RDF Resources in Neo4j
 
Rich Data? Poor Data? Depends on...
Rich Data? Poor Data? Depends on...Rich Data? Poor Data? Depends on...
Rich Data? Poor Data? Depends on...
 
SPARQL in the Semantic Web
SPARQL in the Semantic WebSPARQL in the Semantic Web
SPARQL in the Semantic Web
 
Migrating data to a new LMS: challenges, opportunities and lessons / Penny Do...
Migrating data to a new LMS: challenges, opportunities and lessons / Penny Do...Migrating data to a new LMS: challenges, opportunities and lessons / Penny Do...
Migrating data to a new LMS: challenges, opportunities and lessons / Penny Do...
 
20140521 sem-tech-biz-guest-lecture
20140521 sem-tech-biz-guest-lecture20140521 sem-tech-biz-guest-lecture
20140521 sem-tech-biz-guest-lecture
 
SHACL: Shaping the Big Ball of Data Mud
SHACL: Shaping the Big Ball of Data MudSHACL: Shaping the Big Ball of Data Mud
SHACL: Shaping the Big Ball of Data Mud
 
DBpedia Citation Challenge. (Not only) Polish Citations in Wikipedia: analysi...
DBpedia Citation Challenge. (Not only) Polish Citations in Wikipedia: analysi...DBpedia Citation Challenge. (Not only) Polish Citations in Wikipedia: analysi...
DBpedia Citation Challenge. (Not only) Polish Citations in Wikipedia: analysi...
 

Viewers also liked

تحميل البرمجيات الحرة
تحميل البرمجيات الحرةتحميل البرمجيات الحرة
تحميل البرمجيات الحرة
Aaban Hayy
 
Штрих Ру - рубашки поло и бейсболки
Штрих Ру - рубашки поло и бейсболкиШтрих Ру - рубашки поло и бейсболки
Штрих Ру - рубашки поло и бейсболки
Виктор Курсалин
 
Action plan
Action planAction plan
Action plan
asmediaf12
 
Controle acces biometrique
Controle acces biometriqueControle acces biometrique
Controle acces biometriquefrvoya
 
Google Panda - how to recover from the penalty box
Google Panda - how to recover from the penalty boxGoogle Panda - how to recover from the penalty box
Google Panda - how to recover from the penalty box
MarketingNomads.com
 
102c 1
102c 1102c 1
102c 1
ziyuniu102d
 
newtwork opnet app project
newtwork opnet app project newtwork opnet app project
newtwork opnet app project
Mohamed Elagnaf
 
Brand developement
Brand developementBrand developement
Brand developement
ziyuniu102d
 
Khung chuong trinh fb update 20-3
Khung chuong trinh fb update 20-3Khung chuong trinh fb update 20-3
Khung chuong trinh fb update 20-3thanhechip99
 
Sumanta Kumar Sahu -Project report
Sumanta Kumar Sahu -Project reportSumanta Kumar Sahu -Project report
Sumanta Kumar Sahu -Project report
Sumanta Kumar Sahu
 
الهيكل التنظيمى لكلية الحقوق جامعة بنها - 2015
الهيكل التنظيمى لكلية الحقوق جامعة بنها - 2015الهيكل التنظيمى لكلية الحقوق جامعة بنها - 2015
الهيكل التنظيمى لكلية الحقوق جامعة بنها - 2015
Hassan Ibrahim
 
Reações de Subst. Nucleofïlicas em Compostos Aromáticos
Reações de Subst. Nucleofïlicas em Compostos AromáticosReações de Subst. Nucleofïlicas em Compostos Aromáticos
Reações de Subst. Nucleofïlicas em Compostos Aromáticos
José Nunes da Silva Jr.
 

Viewers also liked (14)

تحميل البرمجيات الحرة
تحميل البرمجيات الحرةتحميل البرمجيات الحرة
تحميل البرمجيات الحرة
 
Штрих Ру - рубашки поло и бейсболки
Штрих Ру - рубашки поло и бейсболкиШтрих Ру - рубашки поло и бейсболки
Штрих Ру - рубашки поло и бейсболки
 
Action plan
Action planAction plan
Action plan
 
Controle acces biometrique
Controle acces biometriqueControle acces biometrique
Controle acces biometrique
 
Google Panda - how to recover from the penalty box
Google Panda - how to recover from the penalty boxGoogle Panda - how to recover from the penalty box
Google Panda - how to recover from the penalty box
 
Ya get me!
Ya get me!Ya get me!
Ya get me!
 
102c 1
102c 1102c 1
102c 1
 
newtwork opnet app project
newtwork opnet app project newtwork opnet app project
newtwork opnet app project
 
Brand developement
Brand developementBrand developement
Brand developement
 
Khung chuong trinh fb update 20-3
Khung chuong trinh fb update 20-3Khung chuong trinh fb update 20-3
Khung chuong trinh fb update 20-3
 
Sumanta Kumar Sahu -Project report
Sumanta Kumar Sahu -Project reportSumanta Kumar Sahu -Project report
Sumanta Kumar Sahu -Project report
 
الهيكل التنظيمى لكلية الحقوق جامعة بنها - 2015
الهيكل التنظيمى لكلية الحقوق جامعة بنها - 2015الهيكل التنظيمى لكلية الحقوق جامعة بنها - 2015
الهيكل التنظيمى لكلية الحقوق جامعة بنها - 2015
 
Reações de Subst. Nucleofïlicas em Compostos Aromáticos
Reações de Subst. Nucleofïlicas em Compostos AromáticosReações de Subst. Nucleofïlicas em Compostos Aromáticos
Reações de Subst. Nucleofïlicas em Compostos Aromáticos
 
Leadership
LeadershipLeadership
Leadership
 

Similar to A Close Look at the Four Million Archival MARC Records in WorldCat

MARC-y MARC and the Coding Bunch
MARC-y MARC and the Coding BunchMARC-y MARC and the Coding Bunch
MARC-y MARC and the Coding Bunch
Andrea Payant
 
Cataloging Basics Webinar (NEKLS)
Cataloging Basics Webinar (NEKLS)Cataloging Basics Webinar (NEKLS)
Cataloging Basics Webinar (NEKLS)
Heather Braum
 
Presentation FAIRsFAIR workshop (April 2020)
Presentation FAIRsFAIR workshop (April 2020)Presentation FAIRsFAIR workshop (April 2020)
Presentation FAIRsFAIR workshop (April 2020)
INRAE (MISTEA) and University of Montpellier (LIRMM)
 
Archives' User Studies & Archival WorldCat Records
Archives' User Studies & Archival WorldCat RecordsArchives' User Studies & Archival WorldCat Records
Archives' User Studies & Archival WorldCat Records
OCLC Research
 
FAIR data requires FAIR ontologies, how do we do?
FAIR data requires FAIR ontologies, how do we do?FAIR data requires FAIR ontologies, how do we do?
FAIR data requires FAIR ontologies, how do we do?
INRAE (MISTEA) and University of Montpellier (LIRMM)
 
Yang hofmann-next generationcatalogforenug
Yang hofmann-next generationcatalogforenugYang hofmann-next generationcatalogforenug
Yang hofmann-next generationcatalogforenug
ENUG
 
Kampmeier ecn 2012
Kampmeier ecn 2012Kampmeier ecn 2012
Kampmeier ecn 2012
ECNOfficer
 
Just digitise it - Daniel Wilksch of the Public Records Office Victoria
Just digitise it - Daniel Wilksch of the Public Records Office VictoriaJust digitise it - Daniel Wilksch of the Public Records Office Victoria
Just digitise it - Daniel Wilksch of the Public Records Office Victoria
National Library of Australia
 
On Your MARC, Get Set, Code!
On Your MARC, Get Set, Code!On Your MARC, Get Set, Code!
On Your MARC, Get Set, Code!
Andrea Payant
 
Ontology Design Patterns for Linked Data Tutorial at ISWC2016 - Introduction
Ontology Design Patterns for Linked Data Tutorial at ISWC2016 - IntroductionOntology Design Patterns for Linked Data Tutorial at ISWC2016 - Introduction
Ontology Design Patterns for Linked Data Tutorial at ISWC2016 - Introduction
Aldo Gangemi
 
Academic Writing and Research Data Management
Academic Writing and Research Data ManagementAcademic Writing and Research Data Management
Academic Writing and Research Data Management
CESSDA Training
 
Tillett, Hillmann, and Moen, "Bibliographic Control Alphabet Soup: AACR to R...
Tillett, Hillmann, and Moen, "Bibliographic Control Alphabet Soup:  AACR to R...Tillett, Hillmann, and Moen, "Bibliographic Control Alphabet Soup:  AACR to R...
Tillett, Hillmann, and Moen, "Bibliographic Control Alphabet Soup: AACR to R...
National Information Standards Organization (NISO)
 
Assessing Uniqueness in the System-wide Book Collection
Assessing Uniqueness in the System-wide Book CollectionAssessing Uniqueness in the System-wide Book Collection
Assessing Uniqueness in the System-wide Book Collection
Constance Malpas
 
Semantics-enhanced Geoscience Interoperability, Analytics, and Applications
Semantics-enhanced Geoscience Interoperability, Analytics, and ApplicationsSemantics-enhanced Geoscience Interoperability, Analytics, and Applications
Semantics-enhanced Geoscience Interoperability, Analytics, and Applications
Artificial Intelligence Institute at UofSC
 
Peer Council 2017 OCLC Update
Peer Council 2017 OCLC UpdatePeer Council 2017 OCLC Update
Peer Council 2017 OCLC Update
WiLS
 
Albert Merono-Penuela: Understanding Change in Versioned Web-Knowledge Organi...
Albert Merono-Penuela: Understanding Change in Versioned Web-Knowledge Organi...Albert Merono-Penuela: Understanding Change in Versioned Web-Knowledge Organi...
Albert Merono-Penuela: Understanding Change in Versioned Web-Knowledge Organi...
COST Action TD1210
 
Lis60002 dunhammcgurrjuly2008
Lis60002 dunhammcgurrjuly2008Lis60002 dunhammcgurrjuly2008
Lis60002 dunhammcgurrjuly2008
Barbara Dunham
 
Redescription Mining
Redescription MiningRedescription Mining
Redescription Mining
Peter Molnar
 
IT "The Power That Influence The World"
IT "The Power That Influence The World"IT "The Power That Influence The World"
IT "The Power That Influence The World"
USA Discussion Group
 
Taming the Wilde
Taming the WildeTaming the Wilde
Taming the Wilde
Charleston Conference
 

Similar to A Close Look at the Four Million Archival MARC Records in WorldCat (20)

MARC-y MARC and the Coding Bunch
MARC-y MARC and the Coding BunchMARC-y MARC and the Coding Bunch
MARC-y MARC and the Coding Bunch
 
Cataloging Basics Webinar (NEKLS)
Cataloging Basics Webinar (NEKLS)Cataloging Basics Webinar (NEKLS)
Cataloging Basics Webinar (NEKLS)
 
Presentation FAIRsFAIR workshop (April 2020)
Presentation FAIRsFAIR workshop (April 2020)Presentation FAIRsFAIR workshop (April 2020)
Presentation FAIRsFAIR workshop (April 2020)
 
Archives' User Studies & Archival WorldCat Records
Archives' User Studies & Archival WorldCat RecordsArchives' User Studies & Archival WorldCat Records
Archives' User Studies & Archival WorldCat Records
 
FAIR data requires FAIR ontologies, how do we do?
FAIR data requires FAIR ontologies, how do we do?FAIR data requires FAIR ontologies, how do we do?
FAIR data requires FAIR ontologies, how do we do?
 
Yang hofmann-next generationcatalogforenug
Yang hofmann-next generationcatalogforenugYang hofmann-next generationcatalogforenug
Yang hofmann-next generationcatalogforenug
 
Kampmeier ecn 2012
Kampmeier ecn 2012Kampmeier ecn 2012
Kampmeier ecn 2012
 
Just digitise it - Daniel Wilksch of the Public Records Office Victoria
Just digitise it - Daniel Wilksch of the Public Records Office VictoriaJust digitise it - Daniel Wilksch of the Public Records Office Victoria
Just digitise it - Daniel Wilksch of the Public Records Office Victoria
 
On Your MARC, Get Set, Code!
On Your MARC, Get Set, Code!On Your MARC, Get Set, Code!
On Your MARC, Get Set, Code!
 
Ontology Design Patterns for Linked Data Tutorial at ISWC2016 - Introduction
Ontology Design Patterns for Linked Data Tutorial at ISWC2016 - IntroductionOntology Design Patterns for Linked Data Tutorial at ISWC2016 - Introduction
Ontology Design Patterns for Linked Data Tutorial at ISWC2016 - Introduction
 
Academic Writing and Research Data Management
Academic Writing and Research Data ManagementAcademic Writing and Research Data Management
Academic Writing and Research Data Management
 
Tillett, Hillmann, and Moen, "Bibliographic Control Alphabet Soup: AACR to R...
Tillett, Hillmann, and Moen, "Bibliographic Control Alphabet Soup:  AACR to R...Tillett, Hillmann, and Moen, "Bibliographic Control Alphabet Soup:  AACR to R...
Tillett, Hillmann, and Moen, "Bibliographic Control Alphabet Soup: AACR to R...
 
Assessing Uniqueness in the System-wide Book Collection
Assessing Uniqueness in the System-wide Book CollectionAssessing Uniqueness in the System-wide Book Collection
Assessing Uniqueness in the System-wide Book Collection
 
Semantics-enhanced Geoscience Interoperability, Analytics, and Applications
Semantics-enhanced Geoscience Interoperability, Analytics, and ApplicationsSemantics-enhanced Geoscience Interoperability, Analytics, and Applications
Semantics-enhanced Geoscience Interoperability, Analytics, and Applications
 
Peer Council 2017 OCLC Update
Peer Council 2017 OCLC UpdatePeer Council 2017 OCLC Update
Peer Council 2017 OCLC Update
 
Albert Merono-Penuela: Understanding Change in Versioned Web-Knowledge Organi...
Albert Merono-Penuela: Understanding Change in Versioned Web-Knowledge Organi...Albert Merono-Penuela: Understanding Change in Versioned Web-Knowledge Organi...
Albert Merono-Penuela: Understanding Change in Versioned Web-Knowledge Organi...
 
Lis60002 dunhammcgurrjuly2008
Lis60002 dunhammcgurrjuly2008Lis60002 dunhammcgurrjuly2008
Lis60002 dunhammcgurrjuly2008
 
Redescription Mining
Redescription MiningRedescription Mining
Redescription Mining
 
IT "The Power That Influence The World"
IT "The Power That Influence The World"IT "The Power That Influence The World"
IT "The Power That Influence The World"
 
Taming the Wilde
Taming the WildeTaming the Wilde
Taming the Wilde
 


A Close Look at the Four Million Archival MARC Records in WorldCat

  • 1. OCLC Research Library Partnership Work-In-Progress webinar 3 December 2015 A Close Look at the Four Million Archival MARC Records in WorldCat Jackie Dooley Program Officer OCLC Research
  • 2. OVERVIEW • Research Objective • Some Initial Questions • Scope of the Dataset • Key Findings • Data Analysis • Tentative Recommendations • What’s Next?
  • 4. Research Objective Establish a detailed profile of MARC data element occurrences in archival catalog records, providing a view of 30+ years of practice. • Reveal variations in descriptive practice across formats • Characterize practice before MARC usage diminishes • Debunk any inaccurate assumptions • Suggest changes to descriptive practice • Enable analysis of implications for discovery Take note! I studied field occurrences, not content.
  • 6. Some Initial Questions • What is “archival material”? • Is archival use of MARC accurate and fulfilling its potential? • How does archival description differ across types of material? • Are archival materials usually described as collections? • Does the archival control byte capture all archival descriptions? • How often is DACS specified as the content standard? • To what extent have DACS minimum requirements been met? • Bonus question: What implications for next-gen cataloging do the data suggest?
  • 7. SCOPE OF THE DATASET
  • 8. Archival records filtered from WorldCat • OCLC’s WorldCat database of 340+ million records filtered to extract “archival” records – Currently 4 million, about 1% of WorldCat – Scope expanded two years ago to add more types of material • Brief version of the filter specs – “Unpublished” materials in any format – Under “archival control” – Held by a single institution – Excludes published materials Spoiler alert: It’s not perfect.
  • 9. Same dataset as ArchiveGrid • Only one library holding symbol is attached (to eliminate non-unique items or collections) • The MARC Leader has one or more of the following: – Leader byte 06 (record type) has the value d (manuscript music), f (manuscript cartographic), g (projected graphics), i (nonmusic recording), j (music recording), k (visual), p (mixed), r (realia), or t (textual manuscript). [does this include all the new ones?] – Leader byte 06 has the value "a" (language material) and Leader byte 07 (bibliographic level) has the value "c" (collection). – Leader byte 08 has the value "a" (archival control). • Field 260 subfields "a" and "b" are not present (to filter out published works) • "Bibliography" does not occur at the beginning of any MARC subject heading subfield "a" or "v" (to filter out published works). • Field 502 is not present (to filter out theses and dissertations). • Records with material type "book" or "serial" must have no value in fields 008 or 006 “Nature of Contents” bytes (to eliminate theses, reference works, and other non-archival materials). The full filter specs: http://beta.worldcat.org/archivegrid/about/
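The leader and field tests on the filter slide can be sketched in code. What follows is a minimal illustration in Python, not OCLC's actual filter. The record shape (a dict with `leader`, `holdings`, and `fields`) and the helper names are hypothetical. The reading of the 260 rule (reject when either $a or $b is present) is an assumption. The 008/006 "Nature of Contents" test is left out.

```python
# Hypothetical record shape for illustration only (not a real library's API):
# {"leader": str (24 chars), "holdings": [str], "fields": [(tag, {code: value})]}

ARCHIVAL_RECORD_TYPES = set("dfgijkprt")  # Leader/06 values named on the slide


def make_leader(rec_type, bib_level, control=" "):
    """Build a 24-character leader with the three bytes the filter inspects."""
    return "00000n" + rec_type + bib_level + control + " " * 15


def leader_matches(leader):
    """True if at least one of the three leader conditions holds."""
    rec_type, bib_level, control = leader[6], leader[7], leader[8]
    return (
        rec_type in ARCHIVAL_RECORD_TYPES
        or (rec_type == "a" and bib_level == "c")
        or control == "a"
    )


def looks_archival(record):
    """Apply a simplified subset of the filter to one record."""
    if len(record["holdings"]) != 1:
        return False
    if not leader_matches(record["leader"]):
        return False
    for tag, subfields in record["fields"]:
        # Assumption: a 260 with either $a or $b signals a published work.
        if tag == "260" and ("a" in subfields or "b" in subfields):
            return False
        # A 502 (dissertation note) marks theses and dissertations.
        if tag == "502":
            return False
        # Subject headings (6xx) whose $a or $v begins with "Bibliography".
        if tag.startswith("6"):
            for code in ("a", "v"):
                if subfields.get(code, "").startswith("Bibliography"):
                    return False
    return True


papers = {
    "leader": make_leader("p", "c", "a"),
    "holdings": ["XYZ"],
    "fields": [("245", {"a": "Papers"}), ("650", {"a": "Farming"})],
}
thesis = {
    "leader": make_leader("t", "m"),
    "holdings": ["XYZ"],
    "fields": [("245", {"a": "A study"}), ("502", {"a": "Thesis"})],
}
print(looks_archival(papers), looks_archival(thesis))
```

The three leader conditions are combined with "or", matching the slide's "one or more of the following", while the field-level tests each act as a veto.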
  • 11. Key Findings • Record type (Leader 06) sometimes used incorrectly – Mixed materials, computer files, web sites (aka Integrating Resources) • Cataloging practices reveal format-specific silos – Record type, archival control, descriptive rules, note fields, use of topical subject field (650) for genre/form terms (655) • Records describing single items greatly predominate for all record types except Mixed Materials – … and 25% of Mixed Materials records describe a single item • Format-specific notes (5xx) underutilized – 506, 511, 520, 524, 545, 546, 555, 561 … – 500 is most-used note for maps, recordings, scores, text, visual
  • 12. Key Findings, cont. • Archival control (Leader 08) specified in 28% of records – 40% of Mixed Materials records • Archival descriptive standards (040 $e) specified in 20% of records – appm, dacs, gihc – 61% of records specify AACR2, 1.5% RDA • One-third of records link (856) to digital content – Digital objects or finding aids
  • 13. DATA ANALYSIS 1. Full data 2. Visual materials 3. Mixed materials 4. Textual materials 5. Recordings 6. Scores 7. Maps 8. Other formats
  • 14. 1. Full data (4 million records) • 88% are visual, mixed, or textual materials • 39% describe collections, 51% single items – “Component” levels are little used – Records for collections are mostly Mixed Materials • 28% of records specify archival control (Leader 08) • 20% specify use of archival cataloging rules (040 $e) • Creator names (1xx and 7xx) indexed in 86% • Subject terms (6xx) indexed in 84% • Link (856) to digital content in 33% – Digital objects or finding aids
  • 15. Percent of records by type of material (Leader 06): Visual 36.8%, Mixed 31.6%, Text 20.1%, Recording 8.0%, Score 2.9%, All other formats 0.6%
  • 16. Number of records by bibliographic level (Leader 07) [Bar chart: record counts from 0 to 1,200,000 for Visual, Mixed, Text, Recording, Score, and Other formats, broken down by Collection (c), Subunit (d), Monograph/Item (m), and Other levels]
  • 17. Subject and genre/form index terms
  • 18. 2. Visual Materials • 1.5 million records (36% of total) – 2-D graphics (30% of all records) – Projected graphics (film, video, slides: 6% of all records) – Small number of kits and 3-D artifacts • Coded data – 76% describe items, 15% collections – Less than 10% specify archival control (Leader 08) – 1% specify use of gihc – Coded physical characteristics (007) in 57% • Most-used notes – General note (500) in 77% of records – Summary (520) in 68% – Conditions governing use/reproduction (540) in 57%
  • 19. 2. Visual Materials, cont. • Primary creator (1xx) in 51% of all records • Secondary creator (7xx) in about 31% • Personal name subject (600) in 32%; mean of 1.1 per record • Topical subject (650) in 68%; mean of 4.2 • Geographic subject (651) in 38%; mean of 1.5 • Genre/form (655) in 81%; mean of 1.5 • Link to digital content (856) in 48%
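Figures such as "Topical subject (650) in 68%; mean of 4.2" combine two measures: the share of records that contain a field at all, and the average number of occurrences. A minimal Python sketch of both follows. It assumes each record is reduced to a list of its field tags (a hypothetical simplification) and that the mean is taken over all records, not only those containing the field. The slides do not state which denominator was used.

```python
def field_profile(records, tag_prefix):
    """Return (percent of records with a matching field, mean matches per record).

    records: list of records, each a list of MARC tag strings.
    tag_prefix: a full tag ("650") or a leading digit for a block ("6" for 6xx).
    """
    counts = [
        sum(1 for tag in rec if tag.startswith(tag_prefix)) for rec in records
    ]
    if not counts:
        return 0.0, 0.0
    pct = 100.0 * sum(1 for c in counts if c > 0) / len(counts)
    mean = sum(counts) / len(counts)  # assumption: averaged over all records
    return pct, mean


# Invented sample data, for illustration only.
sample = [
    ["100", "245", "650", "650", "655"],
    ["245", "500", "650"],
    ["110", "245", "520"],
    ["245", "600", "650", "650", "650"],
]

print(field_profile(sample, "650"))  # topical subjects
print(field_profile(sample, "1"))    # any 1xx primary creator
```

Because only tags are counted, this matches the study's stated scope: field occurrences, not field content.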
  • 20. 3. Mixed Materials • 1.3 million records (31% of all records) • Coded data – 75% describe collections, 25% items – 40% specify archival control (Leader 08) – 40% specify use of appm or dacs • 10% have no title in 245 $a ($k usually included) • Organization/arrangement (351) in 12% • Most-used notes • Summary (520) in 75% of records • General note (500) in 44% • Restrictions on access (506) in 37% • Biographical/historical (545) in 27% • No other 5xx used in more than 30%
  • 21. 3. Mixed Materials, cont. • Personal author (100) is primary creator in 40% • Corporate author (110) is primary creator in 21% • Secondary creators (7xx) in about 20% • Personal name subject (600) in 34%; mean of 1.5 per record • Topical subject (650) in 45%; mean of 3.0 • Geographic subject (651) in 40%; mean of 1.3 • Genre/form (655) in 65%; mean of 1.3 • Link to digital content (856) in 34%
  • 22. 3. Mixed Materials, cont. Presence of DACS (2004- ) single-level required minimum elements (Mixed Materials records only) • Reference code: stored in local database • Name/location of repository: stored in MARC holdings record • Title: 100% of records • Date(s): 52% in 245 $f, 21% in 260 $c • Extent (300): 78% • Creator(s), if known (1xx): 61% • Scope/content (520): 75% • Conditions governing access (506): 37% • Languages/scripts of the material (546): 13%
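The mapping from DACS single-level minimum elements to MARC fields on the slide above can be turned into a simple presence check. The sketch below is illustrative only. The record shape (a list of `(tag, subfields)` pairs) and the function name are hypothetical. Reference code and repository name are omitted because the slide says they are stored outside the bibliographic record.

```python
def dacs_elements_present(fields):
    """Report which DACS minimum elements have a corresponding MARC field.

    fields: list of (tag, {subfield_code: value}) pairs for one record.
    """
    tags = {tag for tag, _ in fields}

    def has_subfield(tag, code):
        return any(t == tag and code in subs for t, subs in fields)

    return {
        "title": "245" in tags,
        # The slide reports dates in either 245 $f or 260 $c.
        "date": has_subfield("245", "f") or has_subfield("260", "c"),
        "extent": "300" in tags,
        "creator": any(t.startswith("1") for t in tags),  # any 1xx
        "scope_content": "520" in tags,
        "access_conditions": "506" in tags,
        "language": "546" in tags,
    }


# Invented sample record, for illustration only.
record = [
    ("100", {"a": "Doe, Jane"}),
    ("245", {"a": "Papers", "f": "1900-1950"}),
    ("300", {"a": "3 linear feet"}),
    ("520", {"a": "Correspondence and diaries."}),
]

result = dacs_elements_present(record)
missing = sorted(name for name, present in result.items() if not present)
print(missing)
```

Run across a whole dataset, the per-element counts would yield percentages of the kind reported on the slide.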
  • 23. 3. Mixed Materials, cont. Note fields used in >10% of records • 500 General note: 44% • 506 Restrictions on access: 37% • 520 Summary: 75% • 524 Preferred citation: 15% • 540 Terms governing use/reproduction: 31% • 541 Source of acquisition: 18% • 545 Biographical/Historical note: 27% • 546 Language: 13% • 555 Finding aid: 21% (Key to the slide's shading bands: 5-25%, 26-50%, 51-90%, 91-100%)
  • 24. 4. Textual materials • 809,000 records (20% of all records) – Collections of printed materials (4% of all records) – Textual manuscripts (21% of all records) • Coded data – 66% describe collections, 29% items – 16% specify archival control (Leader 08) – 17% specify use of appm or dacs • Most-used notes – Summary (520) in 75% – General note (500) in 54% – Restrictions on access (506) in 37%
  • 25. 4. Textual materials, cont. • Primary author (mostly 100) in 77% of records • Secondary author (7xx) in about 50% • Personal name subject (600) in 30%; mean of 0.9 per record • Topical subject (650) in 47%; mean of 1.7 • Geographic subject (651) in 29%; mean of 0.8 • Genre/form (655) in 35%; mean of 0.7 • Link to digital content (856) in 5%
  • 26. 5. Recordings • 322,000 records (8% of all records) – Music (5% of all records), nonmusic (3%) • Coded data – 95% describe items – 3% specify archival control (Leader 08) – Coded physical characteristics (007) in 78% • Most-used notes – General note (500) in 68% of records – Date/time/place of event (518) in 49% – Participant/performer (511) in 33%
  • 27. 5. Recordings, cont. • Primary creator (1xx) in 75% of records • Secondary creator (7xx) in 100% • Topical subject (650) in 66%; mean of 5.2 per record • Geographic subject (651) in 22%; mean of 0.9 • Genre/form term (655) in 25%; mean of 1.2 • Link to digital content (856) in 3%
  • 28. 6. Scores • 117,000 records (3% of all records) – Mostly manuscript scores (3% of all records), a few printed scores • Coded data – 77% describe items, 14% components – 3% specify archival control (Leader 08) • Uniform title (240) in 41% • Most-used notes – General note (500) in 96% of records – Little use of any other 5xx’s
  • 29. 6. Scores, cont. • Primary creator (1xx) in 90% of records • Secondary creator (7xx) in ca. 50% • Topical subject (650) in 96% of records; mean of 2.4 • Genre/form (655) in 34%; often in 650 instead – 650s will gradually move to 655 • Link to digital content (856) in 25%
  • 30. 7. Maps • 22,000 records (0.6% of all records) – Mostly manuscript maps, a few printed maps • Coded data – 95% describe items – Coded physical characteristics (007) in 65% of records – 4% specify archival control (Leader 08) – Hierarchical geographic area code (043) in 80% – Geographic classification code (052) in 66% • Cartographic mathematical data (255) in 92% • Most-used notes – General note (500) in 96% – Little use of any other 5xx’s
  • 31. 7. Maps, cont. • Primary creator (1xx) in 53% of records • Secondary creator (7xx) in 50% • Topical subject (650) in 68%; mean of 2.8 per record • Geographic subject (651) in 83%; mean of 2.7 • Genre/form (655) in 84%; mean of 1.8 • Link to digital content (856) in 14%
  • 32. Other formats • Dataset also includes a few records for: – Computer files (1,275) • Most should instead use record type for nature of content – Web sites (146) • Record type used for these is Integrating Resources • Thousands of others use another record type, e.g. Mixed Materials – Serials (109) • Included only because archival control (Leader 08) is specified
  • 34. My Questions for You • Which of the findings are significant enough to warrant changes in practice? • Do the data debunk any assumptions? • Would you tweak the specs of our filter? • What other questions should I be asking? • … And what are the implications for next-generation cataloging?
  • 35. Tentative Recommendations • Consider eliminating some little-used note fields from MARC • Educate archival community about accurate use of record types and why consistency matters • Promote DACS single-level minimum required elements • Promote value of collection-level records to special materials communities • Consider doing some automated data remediation – Sample possibilities: add missing language notes, “no restrictions” notes, country codes, titles in 245 $a • What else? What would help you in your work?
  • 36. Next Steps • Publish OCLC Research report early in 2016 • Prepare a second paper on implications for discovery, comparing MARC and EAD data (Bron et al. in Code{4}Lib, 2013) • Possible future projects – Study data content – Selective data remediation • Enhance generic titles (e.g., Papers, Records) • Add missing language notes (field 546) – Descriptive practice for web archiving • What research might you take on?
  • 37. Please send feedback! Jackie Dooley Program Officer, OCLC Research dooleyj@oclc.org @minniedw OCLC Research Library Partnership Work-in-progress webinar 3 December 2015