SlideShare a Scribd company logo
1 of 31
YAMZ: better, faster, cheaper
vocabulary standardization
John Kunze
California Digital Library
2
The metadata mess
• Why does metadata interoperability (MI) seem to
fail each time a new initiative addresses it?
• Why does each attempt not even come close to
delivering MI?
• Why are these failures so expensive?
2
3
Traditional metadata standards have failed
• Thinking outside the box to stop failing.
• YAMZ is Yet Another Metadata Zoo; it is not Yet
Another Metadata Standard.
• Instead it is a dictionary of terms, some fixed and
others still evolving.
• Terms meant to be selectively referenced by future
standards, but are otherwise decoupled from them.
• Each term has a unique persistent identifier that tracks
it from evolving to mature stability to deprecated.
3
4
Session structure
• Introduction to YAMZ
• Four disciplinary uses
• Trying it out
• Invitation for feedback, discussion, and
participation.
4
5
Metadata (un)happiness
• Are you happy with your metadata?
• Are you happy with others’ metadata?
• Are you and your meta/cataloguer happy with day-
to-day experience complying with your chosen
standard? Have you asked them what they think?
• Which standard(s) do you use? Are you compliant?
5
6
Metadata in theory vs practice
The façade
• We use standard X, anyone using X can work with us.
Behind the façade (see Roy Tennant’s “Bitter Harvest”)
• We use standard X with local modifications.
• Our mods evolve and depend on the specific
collection, so no one using X can work with us.
• Few people know what our local mods are.
6
7
Option 1: lobby for changes in X
• Use formal commenting mechanism
• Wait 2-5 years for revision to appear, during which
• a small number of busy experts evaluate
• in largely closed discussions
• no testing (since there’s no implementation yet)
• What happens to legacy metadata every time a 30 –
year-old standard does have a new release?
7
8
Option 2: Semantic Web with RDF
• Spend a few years modeling all your present and
future assets in RDF
• Reference, for better or worse, existing terms from
existing vocabularies
• Pro: get unique, unambiguous concept identifiers
• Con: expensive, and no one uses RDF except libraries
8
9
Option 3: think locally, act locally
If your organization doesn't have much time or staff
• This is the common case
• Document your local mods to Standard X
• Effectively, this is a secret metadata standard
9
10
Option 4: think globally, act globally
if your organization does have the time and staff
• Create your own standard or profile
• Create your own committees and work with your
own partner organizations
• Import snapshots of other vocabularies
• A missing terms to your liking
• Publish your terms and definitions
10
11
(New) Option 5: think locally, act globally
At a minimum, use YAMZ to
• get a persistent identifier for your term
• use it so everyone knows what you mean by it
Everything else is gravy
• track comments, upvotes and downvotes
• notice what related terms others are using
Otherwise, we’re stuck with the current blizzard of
cross-walking between hundreds of vocabularies...
11
The Metadata Universe
Jenn Riley,
IU
The Metadata Universe
Jenn Riley,
IU
The Metadata Universe
Jenn Riley,
IU
The Metadata Universe
Jenn Riley,
IU
The Metadata Universe
Jenn Riley,
IU
17
Let’s do something different
• Instead of yet another ontology, how about a
dictionary?
• … a dictionary that tracks terms over time?
• … a dictionary whose terms standards will reference?
17
18
Summarizing key desiderata
• Natural language strings with persistent concept
identifiers
• Avoiding largely closed discussion
• Support for testing and rapid prototyping
• Support for unambiguous term referencing, where
• some terms may change
• other terms may not change
• Ability to add missing terms
• Publishing your own terms
• Dealing with historical terminology
18
19
An alternate metadata universe
• Vision: one dictionary, one namespace
• All research domains, any part of “metadata speech”
• Names, values, units, relationships, ...
19
SimonRobertson@flickr
20
YAMZ.net (Yet Another Metadata Zoo)
20
21
Crowdsourced, but with voting
21
vernacular
canonical
deprecated
3 classes
of term
 all terms are born here
 these don’t evolve
 so terms never go away
Each term gets a unique persistent id. Example:
identifier: http://n2t.net/ark:/99152/h1193
term: oba
definition: other (origin: from Tagalog)
22
Reputation-based voting resists “gaming”
• Meritocracy: strong terms rise, weak terms decline
• Lessons from StackOverflow, Internet standards, and
Wikipedia processes
22
Karunakar Rayker @flickr
23
24
YAMZ usage patterns
24
Search for
terms
(words and
definitions)
find a term you love
great – use it
find a term you kind of love try it out, comment,
engage with author
no workable term found instantly enter own term
and watch for comments
find a word you love “I want that word!”, so
enter a competing term
but a definition you hate
26
Example term in group
26
27
Term tag in YAMZ
27
28
Discipline-specific subsets in YAMZ
• Global Cryosphere Watch (GCW)
• Citizen Science (Sloan)
• DesignSafe (UTA)
• Persistence statements (CDL, UCLA, TACC)
28
29
People
• Vision: Jane Greenberg, John Kunze (NSF DataONE)
• Nassib Nassar, Angela Murillo, Greg Janee, et al.
• First implementation: Chris Patton (summer intern)
• Other interns: Dillon Arevalo, Manoj Tuguru
• LEADS fellow 2018: Mark Phillips
• LEADS fellow 2019: Bridget Disney, Hanlin Zhang
• LEADS fellow 2020: Chris Rauch
29
30
Trying it out
• Try out YAMZ at yamz.net ( caveat: it’s read-only
since login is not working!  )
• browse
• search
30
31
Getting involved
• Newest code at github.com/metadata-
research/yamz
31

More Related Content

Similar to YAMZ Metadata Vocabulary Builder

RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...ASIS&T
 
An introduction to Metadata Application Profiles
An introduction to Metadata Application ProfilesAn introduction to Metadata Application Profiles
An introduction to Metadata Application Profileskcoylenet
 
Clean code presentation
Clean code presentationClean code presentation
Clean code presentationBhavin Gandhi
 
Immersive Recommendation
Immersive RecommendationImmersive Recommendation
Immersive Recommendation承剛 謝
 
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013Digital Methods Initiative
 
YAMZ: a cross-domain crowd-sourced metadata vocabulary
YAMZ: a cross-domain crowd-sourced metadata vocabularyYAMZ: a cross-domain crowd-sourced metadata vocabulary
YAMZ: a cross-domain crowd-sourced metadata vocabularyJohn Kunze
 
Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by S...
Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by S...Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by S...
Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by S...Lucidworks
 
Detecting Good Practices and Pitfalls when Publishing Vocabularies on the Web
Detecting Good Practices and Pitfalls when Publishing Vocabularies on the Web Detecting Good Practices and Pitfalls when Publishing Vocabularies on the Web
Detecting Good Practices and Pitfalls when Publishing Vocabularies on the Web María Poveda Villalón
 
Linked Open Data Alignment and Enrichment Using Bootstrapping Based Techniques
Linked Open Data Alignment and Enrichment Using Bootstrapping Based TechniquesLinked Open Data Alignment and Enrichment Using Bootstrapping Based Techniques
Linked Open Data Alignment and Enrichment Using Bootstrapping Based TechniquesPrateek Jain
 
Créer une communauté open source: pourquoi ? comment ?
Créer une communauté open source: pourquoi ? comment ?Créer une communauté open source: pourquoi ? comment ?
Créer une communauté open source: pourquoi ? comment ?Stefane Fermigier
 
Moving from a Locally-Developed Data Model to a Standard Conceptual Model
Moving from a Locally-Developed Data Model to a Standard Conceptual ModelMoving from a Locally-Developed Data Model to a Standard Conceptual Model
Moving from a Locally-Developed Data Model to a Standard Conceptual ModelJenn Riley
 
Improving Library Resource Discovery
Improving Library Resource DiscoveryImproving Library Resource Discovery
Improving Library Resource DiscoveryDanya Leebaw
 
3-27-12 Preservation & Archiving Highlights from ADR - Presentation Slides
3-27-12 Preservation & Archiving Highlights from ADR - Presentation Slides3-27-12 Preservation & Archiving Highlights from ADR - Presentation Slides
3-27-12 Preservation & Archiving Highlights from ADR - Presentation SlidesDuraSpace
 
Dice.com Bay Area Search - Beyond Learning to Rank Talk
Dice.com Bay Area Search - Beyond Learning to Rank TalkDice.com Bay Area Search - Beyond Learning to Rank Talk
Dice.com Bay Area Search - Beyond Learning to Rank TalkSimon Hughes
 
Levine-Clark, Michael, and Barbara Kawecki, "Best Practices for Demand-Driven...
Levine-Clark, Michael, and Barbara Kawecki, "Best Practices for Demand-Driven...Levine-Clark, Michael, and Barbara Kawecki, "Best Practices for Demand-Driven...
Levine-Clark, Michael, and Barbara Kawecki, "Best Practices for Demand-Driven...Michael Levine-Clark
 
2012.10 - Workshop on Semantic Statistics - 1
2012.10 - Workshop on Semantic Statistics - 12012.10 - Workshop on Semantic Statistics - 1
2012.10 - Workshop on Semantic Statistics - 1Dr.-Ing. Thomas Hartmann
 
Data Management for Undergraduate Research
Data Management for Undergraduate ResearchData Management for Undergraduate Research
Data Management for Undergraduate ResearchRebekah Cummings
 
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8Access Innovations, Inc.
 

Similar to YAMZ Metadata Vocabulary Builder (20)

RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
 
An introduction to Metadata Application Profiles
An introduction to Metadata Application ProfilesAn introduction to Metadata Application Profiles
An introduction to Metadata Application Profiles
 
Clean code presentation
Clean code presentationClean code presentation
Clean code presentation
 
Immersive Recommendation
Immersive RecommendationImmersive Recommendation
Immersive Recommendation
 
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
 
YAMZ: a cross-domain crowd-sourced metadata vocabulary
YAMZ: a cross-domain crowd-sourced metadata vocabularyYAMZ: a cross-domain crowd-sourced metadata vocabulary
YAMZ: a cross-domain crowd-sourced metadata vocabulary
 
Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by S...
Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by S...Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by S...
Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by S...
 
Detecting Good Practices and Pitfalls when Publishing Vocabularies on the Web
Detecting Good Practices and Pitfalls when Publishing Vocabularies on the Web Detecting Good Practices and Pitfalls when Publishing Vocabularies on the Web
Detecting Good Practices and Pitfalls when Publishing Vocabularies on the Web
 
Linked Open Data Alignment and Enrichment Using Bootstrapping Based Techniques
Linked Open Data Alignment and Enrichment Using Bootstrapping Based TechniquesLinked Open Data Alignment and Enrichment Using Bootstrapping Based Techniques
Linked Open Data Alignment and Enrichment Using Bootstrapping Based Techniques
 
Créer une communauté open source: pourquoi ? comment ?
Créer une communauté open source: pourquoi ? comment ?Créer une communauté open source: pourquoi ? comment ?
Créer une communauté open source: pourquoi ? comment ?
 
Moving from a Locally-Developed Data Model to a Standard Conceptual Model
Moving from a Locally-Developed Data Model to a Standard Conceptual ModelMoving from a Locally-Developed Data Model to a Standard Conceptual Model
Moving from a Locally-Developed Data Model to a Standard Conceptual Model
 
Improving Library Resource Discovery
Improving Library Resource DiscoveryImproving Library Resource Discovery
Improving Library Resource Discovery
 
3-27-12 Preservation & Archiving Highlights from ADR - Presentation Slides
3-27-12 Preservation & Archiving Highlights from ADR - Presentation Slides3-27-12 Preservation & Archiving Highlights from ADR - Presentation Slides
3-27-12 Preservation & Archiving Highlights from ADR - Presentation Slides
 
Management de communaute
Management de communauteManagement de communaute
Management de communaute
 
Dice.com Bay Area Search - Beyond Learning to Rank Talk
Dice.com Bay Area Search - Beyond Learning to Rank TalkDice.com Bay Area Search - Beyond Learning to Rank Talk
Dice.com Bay Area Search - Beyond Learning to Rank Talk
 
Levine-Clark, Michael, and Barbara Kawecki, "Best Practices for Demand-Driven...
Levine-Clark, Michael, and Barbara Kawecki, "Best Practices for Demand-Driven...Levine-Clark, Michael, and Barbara Kawecki, "Best Practices for Demand-Driven...
Levine-Clark, Michael, and Barbara Kawecki, "Best Practices for Demand-Driven...
 
Introduction to Text Mining
Introduction to Text MiningIntroduction to Text Mining
Introduction to Text Mining
 
2012.10 - Workshop on Semantic Statistics - 1
2012.10 - Workshop on Semantic Statistics - 12012.10 - Workshop on Semantic Statistics - 1
2012.10 - Workshop on Semantic Statistics - 1
 
Data Management for Undergraduate Research
Data Management for Undergraduate ResearchData Management for Undergraduate Research
Data Management for Undergraduate Research
 
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
ISO 25964-1Working Group ISO/TC 46/SC 9/WG 8
 

More from John Kunze

The ARK Alliance: 20 years, 850 institutions, 8.2 billion persistent identifi...
The ARK Alliance: 20 years, 850 institutions, 8.2 billion persistent identifi...The ARK Alliance: 20 years, 850 institutions, 8.2 billion persistent identifi...
The ARK Alliance: 20 years, 850 institutions, 8.2 billion persistent identifi...John Kunze
 
EZID and N2T at CDL
EZID and N2T at CDLEZID and N2T at CDL
EZID and N2T at CDLJohn Kunze
 
YAMZ.net: better, faster, cheaper taxonomy building
YAMZ.net:  better, faster, cheaper taxonomy buildingYAMZ.net:  better, faster, cheaper taxonomy building
YAMZ.net: better, faster, cheaper taxonomy buildingJohn Kunze
 
A Vocabulary for Persistence
A Vocabulary for PersistenceA Vocabulary for Persistence
A Vocabulary for PersistenceJohn Kunze
 
Identifiers obey Resolvers not Schemes
Identifiers obey Resolvers not SchemesIdentifiers obey Resolvers not Schemes
Identifiers obey Resolvers not SchemesJohn Kunze
 
Names, Things, and Open Identifier Infrastructure: N2T and ARKs
Names, Things, and Open Identifier Infrastructure: N2T and ARKsNames, Things, and Open Identifier Infrastructure: N2T and ARKs
Names, Things, and Open Identifier Infrastructure: N2T and ARKsJohn Kunze
 
ARK identifiers: lessons learnt at BnF: paths forward
ARK identifiers: lessons learnt at BnF: paths forwardARK identifiers: lessons learnt at BnF: paths forward
ARK identifiers: lessons learnt at BnF: paths forwardJohn Kunze
 
DataONE Preservation and Metadata Working Group Report 2014
DataONE Preservation and Metadata Working Group Report 2014DataONE Preservation and Metadata Working Group Report 2014
DataONE Preservation and Metadata Working Group Report 2014John Kunze
 
Selected Bash shell tricks from Camp CDL breakout group
Selected Bash shell tricks from Camp CDL breakout groupSelected Bash shell tricks from Camp CDL breakout group
Selected Bash shell tricks from Camp CDL breakout groupJohn Kunze
 
Annotating Research Datasets
Annotating Research DatasetsAnnotating Research Datasets
Annotating Research DatasetsJohn Kunze
 
The Data Management Ecosystem
The Data Management EcosystemThe Data Management Ecosystem
The Data Management EcosystemJohn Kunze
 
Library Tools Supporting Data-Rich Research
Library Tools Supporting Data-Rich ResearchLibrary Tools Supporting Data-Rich Research
Library Tools Supporting Data-Rich ResearchJohn Kunze
 
Big Data's Long Tail
Big Data's Long TailBig Data's Long Tail
Big Data's Long TailJohn Kunze
 
Scalable Identifiers for Natural History Collections
Scalable Identifiers for Natural History CollectionsScalable Identifiers for Natural History Collections
Scalable Identifiers for Natural History CollectionsJohn Kunze
 
Future-Proofing the Web: What We Can Do Today
Future-Proofing the Web: What We Can Do TodayFuture-Proofing the Web: What We Can Do Today
Future-Proofing the Web: What We Can Do TodayJohn Kunze
 
Supporting Data-Rich Research on Many Fronts
Supporting Data-Rich Research on Many FrontsSupporting Data-Rich Research on Many Fronts
Supporting Data-Rich Research on Many FrontsJohn Kunze
 
The ARK Identifier Scheme at Ten Years Old
The ARK Identifier Scheme at Ten Years OldThe ARK Identifier Scheme at Ten Years Old
The ARK Identifier Scheme at Ten Years OldJohn Kunze
 
New Metaphors: Data Papers and Data Citations
New Metaphors: Data Papers and Data CitationsNew Metaphors: Data Papers and Data Citations
New Metaphors: Data Papers and Data CitationsJohn Kunze
 
Pairtrees for object storage
Pairtrees for object storagePairtrees for object storage
Pairtrees for object storageJohn Kunze
 

More from John Kunze (20)

The ARK Alliance: 20 years, 850 institutions, 8.2 billion persistent identifi...
The ARK Alliance: 20 years, 850 institutions, 8.2 billion persistent identifi...The ARK Alliance: 20 years, 850 institutions, 8.2 billion persistent identifi...
The ARK Alliance: 20 years, 850 institutions, 8.2 billion persistent identifi...
 
EZID and N2T at CDL
EZID and N2T at CDLEZID and N2T at CDL
EZID and N2T at CDL
 
YAMZ.net: better, faster, cheaper taxonomy building
YAMZ.net:  better, faster, cheaper taxonomy buildingYAMZ.net:  better, faster, cheaper taxonomy building
YAMZ.net: better, faster, cheaper taxonomy building
 
A Vocabulary for Persistence
A Vocabulary for PersistenceA Vocabulary for Persistence
A Vocabulary for Persistence
 
Identifiers obey Resolvers not Schemes
Identifiers obey Resolvers not SchemesIdentifiers obey Resolvers not Schemes
Identifiers obey Resolvers not Schemes
 
Names, Things, and Open Identifier Infrastructure: N2T and ARKs
Names, Things, and Open Identifier Infrastructure: N2T and ARKsNames, Things, and Open Identifier Infrastructure: N2T and ARKs
Names, Things, and Open Identifier Infrastructure: N2T and ARKs
 
ARK identifiers: lessons learnt at BnF: paths forward
ARK identifiers: lessons learnt at BnF: paths forwardARK identifiers: lessons learnt at BnF: paths forward
ARK identifiers: lessons learnt at BnF: paths forward
 
DataONE Preservation and Metadata Working Group Report 2014
DataONE Preservation and Metadata Working Group Report 2014DataONE Preservation and Metadata Working Group Report 2014
DataONE Preservation and Metadata Working Group Report 2014
 
Selected Bash shell tricks from Camp CDL breakout group
Selected Bash shell tricks from Camp CDL breakout groupSelected Bash shell tricks from Camp CDL breakout group
Selected Bash shell tricks from Camp CDL breakout group
 
Annotating Research Datasets
Annotating Research DatasetsAnnotating Research Datasets
Annotating Research Datasets
 
The Data Management Ecosystem
The Data Management EcosystemThe Data Management Ecosystem
The Data Management Ecosystem
 
Library Tools Supporting Data-Rich Research
Library Tools Supporting Data-Rich ResearchLibrary Tools Supporting Data-Rich Research
Library Tools Supporting Data-Rich Research
 
Big Data's Long Tail
Big Data's Long TailBig Data's Long Tail
Big Data's Long Tail
 
Pamwg 2012ahm
Pamwg 2012ahmPamwg 2012ahm
Pamwg 2012ahm
 
Scalable Identifiers for Natural History Collections
Scalable Identifiers for Natural History CollectionsScalable Identifiers for Natural History Collections
Scalable Identifiers for Natural History Collections
 
Future-Proofing the Web: What We Can Do Today
Future-Proofing the Web: What We Can Do TodayFuture-Proofing the Web: What We Can Do Today
Future-Proofing the Web: What We Can Do Today
 
Supporting Data-Rich Research on Many Fronts
Supporting Data-Rich Research on Many FrontsSupporting Data-Rich Research on Many Fronts
Supporting Data-Rich Research on Many Fronts
 
The ARK Identifier Scheme at Ten Years Old
The ARK Identifier Scheme at Ten Years OldThe ARK Identifier Scheme at Ten Years Old
The ARK Identifier Scheme at Ten Years Old
 
New Metaphors: Data Papers and Data Citations
New Metaphors: Data Papers and Data CitationsNew Metaphors: Data Papers and Data Citations
New Metaphors: Data Papers and Data Citations
 
Pairtrees for object storage
Pairtrees for object storagePairtrees for object storage
Pairtrees for object storage
 

Recently uploaded

Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Call Now ☎ 8264348440 !! Call Girls in Sarai Rohilla Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Sarai Rohilla Escort Service Delhi N.C.R.Call Now ☎ 8264348440 !! Call Girls in Sarai Rohilla Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Sarai Rohilla Escort Service Delhi N.C.R.soniya singh
 
INDIVIDUAL ASSIGNMENT #3 CBG, PRESENTATION.
INDIVIDUAL ASSIGNMENT #3 CBG, PRESENTATION.INDIVIDUAL ASSIGNMENT #3 CBG, PRESENTATION.
INDIVIDUAL ASSIGNMENT #3 CBG, PRESENTATION.CarlotaBedoya1
 
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.soniya singh
 
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting High Prof...
VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting  High Prof...VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting  High Prof...
VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting High Prof...singhpriety023
 
Top Rated Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
Top Rated  Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...Top Rated  Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
Top Rated Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...Call Girls in Nagpur High Profile
 
Hot Service (+9316020077 ) Goa Call Girls Real Photos and Genuine Service
Hot Service (+9316020077 ) Goa  Call Girls Real Photos and Genuine ServiceHot Service (+9316020077 ) Goa  Call Girls Real Photos and Genuine Service
Hot Service (+9316020077 ) Goa Call Girls Real Photos and Genuine Servicesexy call girls service in goa
 
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...tanu pandey
 
How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)Damian Radcliffe
 
Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service AvailableCall Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service AvailableSeo
 
Moving Beyond Twitter/X and Facebook - Social Media for local news providers
Moving Beyond Twitter/X and Facebook - Social Media for local news providersMoving Beyond Twitter/X and Facebook - Social Media for local news providers
Moving Beyond Twitter/X and Facebook - Social Media for local news providersDamian Radcliffe
 
On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024APNIC
 
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445ruhi
 
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxAWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxellan12
 
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$kojalkojal131
 
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 

Recently uploaded (20)

Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Call Now ☎ 8264348440 !! Call Girls in Sarai Rohilla Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Sarai Rohilla Escort Service Delhi N.C.R.Call Now ☎ 8264348440 !! Call Girls in Sarai Rohilla Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Sarai Rohilla Escort Service Delhi N.C.R.
 
INDIVIDUAL ASSIGNMENT #3 CBG, PRESENTATION.
INDIVIDUAL ASSIGNMENT #3 CBG, PRESENTATION.INDIVIDUAL ASSIGNMENT #3 CBG, PRESENTATION.
INDIVIDUAL ASSIGNMENT #3 CBG, PRESENTATION.
 
VVVIP Call Girls In Connaught Place ➡️ Delhi ➡️ 9999965857 🚀 No Advance 24HRS...
VVVIP Call Girls In Connaught Place ➡️ Delhi ➡️ 9999965857 🚀 No Advance 24HRS...VVVIP Call Girls In Connaught Place ➡️ Delhi ➡️ 9999965857 🚀 No Advance 24HRS...
VVVIP Call Girls In Connaught Place ➡️ Delhi ➡️ 9999965857 🚀 No Advance 24HRS...
 
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Shahpur Jat Escort Service Delhi N.C.R.
 
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
Call Girls In Model Towh Delhi 💯Call Us 🔝8264348440🔝
 
VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting High Prof...
VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting  High Prof...VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting  High Prof...
VIP Model Call Girls Hadapsar ( Pune ) Call ON 9905417584 Starting High Prof...
 
Top Rated Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
Top Rated  Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...Top Rated  Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
Top Rated Pune Call Girls Daund ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
 
@9999965857 🫦 Sexy Desi Call Girls Laxmi Nagar 💓 High Profile Escorts Delhi 🫶
@9999965857 🫦 Sexy Desi Call Girls Laxmi Nagar 💓 High Profile Escorts Delhi 🫶@9999965857 🫦 Sexy Desi Call Girls Laxmi Nagar 💓 High Profile Escorts Delhi 🫶
@9999965857 🫦 Sexy Desi Call Girls Laxmi Nagar 💓 High Profile Escorts Delhi 🫶
 
Hot Service (+9316020077 ) Goa Call Girls Real Photos and Genuine Service
Hot Service (+9316020077 ) Goa  Call Girls Real Photos and Genuine ServiceHot Service (+9316020077 ) Goa  Call Girls Real Photos and Genuine Service
Hot Service (+9316020077 ) Goa Call Girls Real Photos and Genuine Service
 
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...Pune Airport ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready...
Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...
 
How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)
 
Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service AvailableCall Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service Available
 
Rohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No AdvanceRohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
 
Moving Beyond Twitter/X and Facebook - Social Media for local news providers
Moving Beyond Twitter/X and Facebook - Social Media for local news providersMoving Beyond Twitter/X and Facebook - Social Media for local news providers
Moving Beyond Twitter/X and Facebook - Social Media for local news providers
 
On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024On Starlink, presented by Geoff Huston at NZNOG 2024
On Starlink, presented by Geoff Huston at NZNOG 2024
 
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445
 
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxAWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
 
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
Call Girls Dubai Prolapsed O525547819 Call Girls In Dubai Princes$
 
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 

YAMZ Metadata Vocabulary Builder

  • 1. YAMZ: better, faster, cheaper vocabulary standardization John Kunze California Digital Library
  • 2. 2 The metadata mess • Why does metadata interoperability (MI) seem to fail each time a new initiative addresses it? • Why does each attempt not even come close to delivering MI? • Why are these failures so expensive? 2
  • 3. 3 Traditional metadata standards have failed • Thinking outside the box to stop failing. • YAMZ is Yet Another Metadata Zoo; it is not Yet Another Metadata Standard. • Instead it is a dictionary of terms, some fixed and others still evolving. • Terms meant to be selectively referenced by future standards, but are otherwise decoupled from them. • Each term has a unique persistent identifier that tracks it from evolving to mature stability to deprecated. 3
  • 4. 4 Session structure • Introduction to YAMZ • Four disciplinary uses • Trying it out • Invitation for feedback, discussion, and participation. 4
  • 5. 5 Metadata (un)happiness • Are you happy with your metadata? • Are you happy with others’ metadata? • Are you and your meta/cataloguer happy with day- to-day experience complying with your chosen standard? Have you asked them what they think? • Which standard(s) do you use? Are you compliant? 5
  • 6. 6 Metadata in theory vs practice The façade • We use standard X, anyone using X can work with us. Behind the façade (see Roy Tennant’s “Bitter Harvest”) • We use standard X with local modifications. • Our mods evolve and depend on the specific collection, so no one using X can work with us. • Few people know what our local mods are. 6
  • 7. 7 Option 1: lobby for changes in X • Use formal commenting mechanism • Wait 2-5 years for revision to appear, during which • a small number of busy experts evaluate • in largely closed discussions • no testing (since there’s no implementation yet) • What happens to legacy metadata every time a 30 – year-old standard does have a new release? 7
  • 8. 8 Option 2: Semantic Web with RDF • Spend a few years modeling all your present and future assets in RDF • Reference, for better or worse, existing terms from existing vocabularies • Pro: get unique, unambiguous concept identifiers • Con: expensive, and no one uses RDF except libraries 8
  • 9. 9 Option 3: think locally, act locally If your organization doesn't have much time or staff • This is the common case • Document your local mods to Standard X • Effectively, this is a secret metadata standard 9
  • 10. 10 Option 4: think globally, act globally if your organization does have the time and staff • Create your own standard or profile • Create your own committees and work with your own partner organizations • Import snapshots of other vocabularies • A missing terms to your liking • Publish your terms and definitions 10
  • 11. 11 (New) Option 5: think locally, act globally At a minimum, use YAMZ to • get a persistent identifier for your term • use it so everyone knows what you mean by it Everything else is gravy • track comments, upvotes and downvotes • notice what related terms others are using Otherwise, we’re stuck with the current blizzard of cross-walking between hundreds of vocabularies... 11
  • 17. 17 Let’s do something different • Instead of yet another ontology, how about a dictionary? • … a dictionary that tracks terms over time? • … a dictionary whose terms standards will reference? 17
  • 18. 18 Summarizing key desiderata • Natural language strings with persistent concept identifiers • Avoiding largely closed discussion • Support for testing and rapid prototyping • Support for unambiguous term referencing, where • some terms may change • other terms may not change • Ability to add missing terms • Publishing your own terms • Dealing with historical terminology 18
  • 19. 19 An alternate metadata universe • Vision: one dictionary, one namespace • All research domains, any part of “metadata speech” • Names, values, units, relationships, ... 19 SimonRobertson@flickr
  • 20. 20 YAMZ.net (Yet Another Metadata Zoo) 20
  • 21. 21 Crowdsourced, but with voting 21 vernacular canonical deprecated 3 classes of term  all terms are born here  these don’t evolve  so terms never go away Each term gets a unique persistent id. Example: identifier: http://n2t.net/ark:/99152/h1193 term: oba definition: other (origin: from Tagalog)
  • 22. 22 Reputation-based voting resists “gaming” • Meritocracy: strong terms rise, weak terms decline • Lessons from StackOverflow, Internet standards, and Wikipedia processes 22 Karunakar Rayker @flickr
  • 23. 23
  • 24. 24 YAMZ usage patterns 24 Search for terms (words and definitions) find a term you love great – use it find a term you kind of love try it out, comment, engage with author no workable term found instantly enter own term and watch for comments find a word you love “I want that word!”, so enter a competing term but a definition you hate
  • 25.
  • 26. 26 Example term in group 26
  • 27. 27 Term tag in YAMZ 27
  • 28. 28 Discipline-specific subsets in YAMZ • Global Cryosphere Watch (GCW) • Citizen Science (Sloan) • DesignSafe (UTA) • Persistence statements (CDL, UCLA, TACC) 28
  • 29. 29 People • Vision: Jane Greenberg, John Kunze (NSF DataONE) • Nassib Nassar, Angela Murillo, Greg Janee, et al. • First implementation: Chris Patton (summer intern) • Other interns: Dillon Arevalo, Manoj Tuguru • LEADS fellow 2018: Mark Phillips • LEADS fellow 2019: Bridget Disney, Hanlin Zhang • LEADS fellow 2020: Chris Rauch 29
  • 30. 30 Trying it out • Try out YAMZ at yamz.net ( caveat: it’s read-only since login is not working!  ) • browse • search 30
  • 31. 31 Getting involved • Newest code at github.com/metadata- research/yamz 31

Editor's Notes

  1. So what is YAMZ about?
  2. Here’s a one-slide summary of YAMZ. The traditional process of achieving metadata standards has failed, and I know what I’m talking about because of DC, BagIt, Z39.50, URLs, and ARKs. We must think outside the box or we will keep failing. YAMZ is not Yet Another Metadata Standard, but something different. Instead it is a dictionary of terms, some fixed and others still evolving, that are meant to be selectively referenced by future standards. Terms are otherwise decoupled from standards that reference them. Each term is a kind of nano-specification with a unique persistent identifier that tracks the term from evolving to mature to deprecated.
  3. In this session we will introduce YAMZ, offer a limited demo and invite feedback, discussion, and participation. It will also describe four separate disciplines that have used it to assist in finding the balance between local descriptive needs and the desire for MI.
  4. Taking the temperature of metadata contentment. Are you happy with your metadata and how it plays with the metadata of others? Are you happy with your ability to find useful, relevant stuff based on search of other people’s metadata? Are you and your cataloguer happy with day-to-day experience complying with your chosen standard? Have you asked your metalogers what they think? Which metadata standard do you use? Do you know if you're in compliance?
  5. // The façade - We use standard X, so everyone using X works with us. We don’t pretend to interoperate with other standards. Behind the façade - We use standard X with local mods tailored to our needs. Our mods change over time and depend on the collection, so no one using X works with us, even between our own collections. We don't interoperate with anyone or anything external. Few people outside or even inside our organization know what our local mods are. Don't believe me? Try the simplest, and oldest of all metadata ontologies, Dublin Core, and see Roy Tennant’s “Bitter Harvest” article on the interoperability problems with DC from multiple institutions. It gets worse the more complex the ontology.
  6. Option 1: try to change a traditional std X that you’re using - use commenting mechanism, along with many other orgs - wait 2-5 years for revision to appear, during which time ** a small number of busy experts evaluate and try to resolve all comments based on ** largely closed discussions and ** no testing ** what about managing changes in terminology over time? (eg, half of the LEADS project we heard about this morning)
  7. Option 2: spend a few years modeling all your present and future assets in RDF, and plan to reference terms from existing vocabularies made unique with namespace designations in front of them - ** benefit: unique, unambiguous concept identifiers - problem: expensive, and no one uses RDF except libraries
  8. Option 3: think locally, act locally if your org doesn't have much time and staff, ** create local doc describing your mdata, eg, Std X with the following mods - very common: effectively, this is a secret metadata std
  9. Option 4: think globally, act globally if your org does have the time and staff, try the more expensive route: - create your own std or profile - create your own committees - work with your own partner orgs Usually this means ** importing a snapshot of other vocabularies (which will now evolve separately), then ** add missing terms to your liking and ** publish a document listing your terms and definitions ** and this takes us to the present situation…
  10. // Whew. As we've seen from all the work being done with historical terms in existing archives, this complicate snapshot hides an even larger legacy problem, which is that traditional metadata standards don't have unambiguous ways of referencing terms, either in the present or over time as terms evolve. ** - Even the experts sitting on the stds committees are frustrated with the traditional approach. That's a lot of crummy news, and Jane and I know it well, having both served as experts in the development of Dublin Core, PREMIS, and other ontologies. Finally we had a chance to do something different, and we did. Instead of an ontology, we built a dictionary. - One thing remains the same: ontologies can be created as before, but we encourage terms to reference definitions from a dictionary. This decouples definitions from all the rules about mandatory vs optional vs conditional, and allows disciplines to more easily share terms.
  11. // Looking back at key points just made, let's summarize them: - natural language strings without persistent concept identifiers - avoid largely closed discussion (exception: public comment periods) ( except: mention Ted Haberman's thing) support testing and rapid prototyping Support unambiguous referencing of terms from elsewhere, where -- some terms may change -- other terms may not change - add missing terms - publish your own terms - dealing with historical terminology
  12. An alternative vision One dictionary, one namespace All research domains, any part of “metadata speech” Names, values, units, relationships, ... Instead the typical practice of banishing all words that don’t fit in your ontology, leave them in.
  13. So we created this metadictionary called yamz (yet another metadata zoo). We needed modifications, from proper paging through search, to logging in with your own ORCID id, to term import and export.
  14. Depending on your risk tolerance, you may cite a balance of stable (canonical) terms and evolving (vernacular) terms, such as those controlled by you or your community. A term is a combination of a label and a definition.
  15. Learn from wikipedia, internet-drafts/RFCs, StackOverflow, and American Heritage Dictionary
  16. Variation: find a word you love, a definition you’re fine with, but you want to add a new, non-competing definition, eg, foo.1, foo.2, foo.3
  17. Very simple (maybe too simple).
  18. Example of a term and definition [] you can add a comment [] at different times [] supporting a conversation [] and you can tag things if you want to restrict your view to just, say, your working group’s tags, and not see the rest of the dictionary Note that you get automatic emails for any terms that you’re “watching”, to notify you when someone comments on your terms. By default, you’re watching any terms you own.