Twenty Years of Metadata:
                              Lessons from the
                       First Two Decades of the Web
                                    Stuart Weibel
                            University of Tsukuba Visiting Scholar
                                         May 13, 2011




Friday, May 13, 2011                                                 1
Outline

                        The Context

                        Dublin Core in the Metadata Matrix

                        What we did right

                        The major impediments

                        A few words about models

                        What about the future?
                                      Image: Carved figures (Morikawa Toen), Tokyo National Museum




The Context

               When I started working at OCLC in 1985:

                       I was 4 years away from my first email address

                       A PC hard drive wasn’t large enough to store a
                       single high resolution digital image.
                       (which was ok, because…)

                       Cameras still used film

                       Cell phones were suitcase-sized
                                                                             Image: me… circa 1994


                       MARC Cataloging stood alone as the discovery tool for intellectual assets of
                       libraries

                       No end-user access to the global library catalogs

And now?
               A cell phone has more computing power than the Space Shuttle

               An iPod will hold WorldCat

               Bandwidth is more important than computing power

               The library is still mostly mired in MARC

               There are many metadata standards (mostly struggling for traction)

               People (mostly) find things with Google

               but….


Metadata is more than just search
                          Metadata-dependent actions
                                    Describe
                                      Access
                                 Encode/Render
                                     Preserve
                               Rights Management
                                   Administer
                       “Bind” digital pages in digital books


50 years of Metadata
               Timeline from the 1960s to the 2000s, in the order shown on the slide:

               MARC standards (library metadata)

               OCLC founded (shared library cataloging)

               ARPANET Operational - forerunner of the Internet

               Networking diffuses throughout academia

               my first email address

               The Web begins... FRBR work begins

               First Dublin Core Workshop

               DCMI established

               Google is founded

               First Dublin Core Conference (Tokyo)

               WorldCat introduced

               RDA introduced

The confusion: How bad is it?




            “This visual map of the metadata landscape is intended to assist
         planners with the selection and implementation of metadata standards.”
      http://www.dlib.indiana.edu/~jenlrile/metadatamap/

Jenn Riley’s Metadata Map
               105 standards

               30 most common across the top (3 predate the Web)

               some share common models… most do not

               much overlap

               many work together

               Who among us can choose rationally from the array of
               standards, platforms, technologies?

               Will the results have any reasonable expectation of
               interoperability?

The real world is not standards-centric

    Metadata-dependent actions              Standard                            Information Entities (ex.)
    Describe                                MARC, DC, MODS, RDA, LCSH, MeSH…    Agents (persons, corporate entities, devices)
    Access                                  HTTP, FTP…                          Events
    Encode/render                           RDF, media-type dependent (many)    Time intervals or eras
    Preserve                                PREMIS                              Concepts
    Rights Management                       CC licenses, eCommerce systems      Collections
    Administer                              METS, MARC…                         Media-types
    “Bind” digital pages in digital books   METS, eBook standards               Structured data type


The map is much more complicated
               “This visual map of the metadata landscape is intended to assist
            planners with the selection and implementation of metadata standards.”




               “selection and implementation of metadata standards requires a clear
                 understanding of the information entities, the standards, and the
                        functional requirements of the system under design”
                                                                Image: Kyoto horizon from above the Tenryu-ji Temple




Dublin Core in the metadata matrix
               The first metadata standard for
               the Web

               General and cross-disciplinary

               Simple starting place, but
               extensible

               International and multilingual

               Consensus-driven (bottom-up,
               rather than top-down)
                                                 Image: Jomon Pottery, Tokyo National Museum




Things we did right
               We didn’t call it ‘cataloging’ (Web, not libraries)

               A hybrid of technical engineering
               and social engineering

               International - Major events on
               5 continents, element definitions
               in 20+ languages (maintained in
               Tsukuba)

               Separated syntax and semantics

               Built a community of practice

               About the right level of complexity for a core element set
                                                                            Image: Harajuku train station platform, Tokyo




Impediments that tripped us up
                 Too many syntaxes to support
                 (HTML, XML, RDF/XML)

                 No common data model
                 but we tried hard:
                 data model group,
                 architecture group,
                 abstract model,
                 Singapore Framework...

                 Without a data model, the story we told was not consistent: confusion resulted

                 Without a data model, details of implementation become arbitrary (and less
                 interoperable)
                                                                               Image: Netsuke, Tokyo National Museum




Data Modeling: what is it?
               An entity-relationship model defines the important concepts or things
               (entities), and the relationships among them

               A model is a model, not reality

               Designed to solve a problem,
               not to emulate the real world

               The complexity of the model
               should be mapped to the
               problem, not to reality

               Identifying the right level of abstraction is an art
                                                                          Image: Edo Museum




Data Modeling: why is it necessary?
               Without a shared
               understanding of the
               important entities, and the
               relationships among them,
               systems will not
               interoperate easily

               Cross-walks become
               necessary: clumsy,
               inaccurate, inefficient
                                             Image: Changing rail car ‘bogeys’ on the China/Mongolia border
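
To make the cost of a cross-walk concrete, here is a minimal Python sketch of one, assuming a hypothetical detailed record format. The source field names (main_author, added_author, subtitle, author_email) and the mapping table are illustrative assumptions, not a published cross-walk; only the target names (creator, title, date) are Dublin Core elements.

```python
# Hypothetical cross-walk from a detailed record to simple Dublin Core.
# Source field names are illustrative assumptions, not a real schema.
CROSSWALK = {
    "main_author": "creator",
    "added_author": "creator",    # main/added distinction is lost
    "title": "title",
    "subtitle": "title",          # subtitle is folded into title
    "publication_year": "date",
}


def to_dublin_core(record):
    """Map a detailed record to Dublin Core.

    Returns (dc_record, dropped_fields), where dropped_fields lists the
    source fields that have no target element in the cross-walk.
    """
    dc = {}
    dropped = []
    for field_name, value in record.items():
        target = CROSSWALK.get(field_name)
        if target is None:
            dropped.append(field_name)  # no equivalent: information is lost
            continue
        dc.setdefault(target, []).append(value)
    return dc, dropped


detailed = {
    "main_author": "Weibel, Stuart",
    "title": "Twenty Years of Metadata",
    "subtitle": "Lessons from the First Two Decades of the Web",
    "publication_year": "2011",
    "author_email": "stuart.weibel@gmail.com",
}

dc_record, dropped = to_dublin_core(detailed)
print(dc_record)
print(dropped)
```

In this sketch the title and subtitle end up as two undifferentiated title values, and the email address is dropped entirely. The mapping runs, but it cannot be reversed, which is one sense in which cross-walks are inaccurate.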



An example of modeling mismatch
           Citation information
                               Date
                                Title
                             Author
                          Affiliation
                       Email address


          - Which of the attributes are Dublin Core?
          - Is “email address” an attribute of the resource, or the person?
          - Should there be a distinction between Title and Subtitle?
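
The questions above can be explored with a small Python sketch that models the same citation two ways: as a flat record, and as two related entities. The class names Person and Resource, the field names, and the date format are assumptions made for illustration; the set of fifteen element names is the Dublin Core Metadata Element Set.

```python
from dataclasses import dataclass, field

# The fifteen elements of the Dublin Core Metadata Element Set.
DC_ELEMENTS = {
    "title", "creator", "subject", "description", "publisher",
    "contributor", "date", "type", "format", "identifier",
    "source", "language", "relation", "coverage", "rights",
}

# Flat model: every citation attribute hangs off the resource.
flat_citation = {
    "date": "2011-05-13",
    "title": "Twenty Years of Metadata",
    "author": "Stuart Weibel",
    "affiliation": "University of Tsukuba",
    "email": "stuart.weibel@gmail.com",
}

# Attribute names that are not Dublin Core element names.
# ("author" corresponds in meaning to the element "creator".)
not_dc = sorted(k for k in flat_citation if k not in DC_ELEMENTS)


# Entity model (hypothetical): the person is a separate entity, so
# affiliation and email describe the person, not the resource.
@dataclass
class Person:
    name: str
    affiliation: str = ""
    email: str = ""


@dataclass
class Resource:
    title: str
    date: str
    creators: list = field(default_factory=list)


author = Person("Stuart Weibel", "University of Tsukuba",
                "stuart.weibel@gmail.com")
talk = Resource("Twenty Years of Metadata", "2011-05-13", [author])

print(not_dc)
print(talk.creators[0].email)
```

In the flat record the email address looks like a property of the talk; in the entity model it is reached through the creator. Two systems that make different choices here will need a cross-walk to exchange data.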

Is Dublin Core well-matched to the problem of bibliographic description?

               It is too simple to capture the precision of detailed
               bibliographic description

               BUT… It is good enough for many purposes, including the
               description of most simple internet resources

               The trade-off between perfect matching of model and
               problem, and simplicity of use is always a compromise

               DC was intended for general resource description, not to
               replace MARC


The problem with models
               Matching the complexity of models to a diverse and evolving
               problem is challenging, and full of compromises

                       too much complexity
                       leads to failure
                       (creeping elegance)

                       too little complexity
                       leads to failure
                       (insufficient richness
                       to solve the problem)

               HOW DO YOU KNOW WHEN IT IS RIGHT?
                                                            Image: figures from a model in the Kyushu National Museum




Conceptual Models in the Library World

    Model                         Description
    FRBR and FRAD                 The dominant models for bibliographic and authority data
    OAIS                          Reference model for an Open Archival Information System
    CIDOC CRM                     Conceptual Reference Model for cultural heritage documentation
    Dublin Core Abstract Model    Largely unintelligible data model for Dublin Core instance data
    Singapore Framework           A vague framework describing levels of metadata interoperability


The Next Chapters in the Web Metadata story...

               ...are being written in the W3C Incubator Group on Library Linked Data
               (http://www.w3.org/2005/Incubator/lld/)

               Many questions:

                       Will the data be open?

                       Who will maintain it?

                       Is semantic web infrastructure stable?

                       Can existing metadata be integrated
                       seamlessly into the web?

                       Can a model be agreed upon?

                       Will we ever have interoperability across domain silos?
                                                                                 Image: Stone Monk in the Nezu Museum Garden




stuart.weibel@gmail.com


          http://weibel-lines.typepad.com


          @stuartweibel on twitter


          stuartweibel on Facebook

                       all photographs by the author
                                                       Image: Lantern overlooking the Irises in the Nezu Museum Garden




Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
Class 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfClass 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdf
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
 

Twenty Years of Metadata Lessons

  • 1. Twenty Years of Metadata: Lessons from the First Two Decades of the Web. Stuart Weibel, University of Tsukuba Visiting Scholar. Friday, May 13, 2011.
  • 2. Outline: The Context; Dublin Core in the Metadata Matrix; What we did right; The major impediments; A few words about models; What about the future? Image: Carved figures (Morikawa Toen), Tokyo National Museum.
  • 3. The Context. When I started working at OCLC in 1985: I was 4 years away from my first email address; a PC hard drive wasn’t large enough to store a single high-resolution digital image (which was OK, because…) cameras still used film; cell phones were suitcase-sized; MARC cataloging stood alone as the discovery tool for the intellectual assets of libraries; there was no end-user access to the global library catalogs. Photo caption: me… circa 1994.
  • 4. And now? A cell phone has more computing power than the Space Shuttle; an iPod will hold WorldCat; bandwidth is more important than computing power; the library is still mostly mired in MARC; there are many metadata standards (mostly struggling for traction); people (mostly) find things with Google, but…
  • 5. Metadata is more than just search. Metadata-dependent actions: Describe; Access; Encode/Render; Preserve; Rights Management; Administer; “Bind” digital pages in digital books.
  • 6. 50 years of Metadata: MARC standards (library metadata); OCLC founded (shared library cataloging); ARPANET operational, forerunner of the Internet; networking diffuses throughout academia; the Web begins...; FRBR work begins; first Dublin Core Workshop; DCMI established; Google is founded; first Dublin Core Conference (Tokyo); my first email address; WorldCat introduced; RDA introduced. Timeline axis: 1960s, 1970s, 1980s, 1990s, 2000s.
  • 7. The confusion: How bad is it? “This visual map of the metadata landscape is intended to assist planners with the selection and implementation of metadata standards.” http://www.dlib.indiana.edu/~jenlrile/metadatamap/
  • 8. Jenn Riley’s Metadata Map: 105 standards; 30 most common across the top (3 predate the Web); some share common models… most do not; much overlap; many work together. Who among us can choose rationally from the array of standards, platforms, technologies? Will the results have any reasonable expectation of interoperability?
  • 9. The real world is not standards-centric. Metadata-dependent actions and their standards: Describe (MARC, DC, MODS, RDA, LCSH, MeSH…); Access (HTTP, FTP…); Encode/render (RDF, media-type dependent (many)); Preserve (PREMIS); Rights Management (CC licenses, eCommerce systems); Administer (METS, MARC…); “Bind” digital pages in digital books (METS, eBook standards). Information entities (examples): Agents (persons, corporate entities, devices); Events; Time intervals or eras; Concepts; Collections; Media-types; Structured data type.
  • 10. The map is much more complicated. Original: “This visual map of the metadata landscape is intended to assist planners with the selection and implementation of metadata standards.” Revised: “selection and implementation of metadata standards requires a clear understanding of the information entities, the standards, and the functional requirements of the system under design”. Image: Kyoto horizon from above the Tenryu-ji Temple.
  • 11. Dublin Core in the metadata matrix: the first metadata standard for the Web; general and cross-disciplinary; simple starting place, but extensible; international and multilingual; consensus-driven (bottom-up, rather than top-down). Image: Jomon Pottery, Tokyo National Museum.
  • 12. Things we did right: we didn’t call it ‘cataloging’ (Web, not libraries); a hybrid of technical engineering and social engineering; international (major events on 5 continents, element definitions in 20+ languages, maintained in Tsukuba); separated syntax and semantics; built a community of practice; about the right level of complexity for a core element set. Image: Harajuku train station platform, Tokyo.
  • 13. Impediments that tripped us up: too many syntaxes to support (HTML, XML, RDF/XML); no common data model, but we tried hard (data model group, architecture group, abstract model, Singapore Framework...); without a data model, the story we told was not consistent, and confusion resulted; without a data model, details of implementation become arbitrary (and less interoperable). Image: Netsuke, Tokyo National Museum.
  • 14. Data Modeling: what is it? An entity-relationship model defines the important concepts or things (entities), and the relationships among them. A model is a model, not reality: it is designed to solve a problem, not to emulate the real world. The complexity of the model should be mapped to the problem, not to reality. Identifying the right level of abstraction is an art. Image: Edo Museum.
  • 15. Data Modeling: why is it necessary? Without a shared understanding of the important entities, and the relationships among them, systems will not interoperate easily. Cross-walks become necessary: clumsy, inaccurate, inefficient. Image: Changing rail car ‘bogies’ on the China/Mongolia border.
  • 16. An example of modeling mismatch. Citation information: Date, Title, Author, Affiliation, Email address. Which of the attributes are Dublin Core? Is “email address” an attribute of the resource, or the person? Should there be a distinction between Title and Subtitle?
  • 17. Is Dublin Core well-matched to the problem of bibliographic description? It is too simple to capture the precision of detailed bibliographic description, BUT… it is good enough for many purposes, including the description of most simple internet resources. The trade-off between perfect matching of model and problem, and simplicity of use, is always a compromise. DC was intended for general resource description, not to replace MARC.
  • 18. The problem with models: matching the complexity of models to a diverse and evolving problem is challenging, and full of compromises. Too much complexity leads to failure (creeping elegance); too little complexity leads to failure (insufficient richness to solve the problem). HOW DO YOU KNOW WHEN IT IS RIGHT? Image: figures from a model in the Kyushu National Museum.
  • 19. Conceptual Models in the Library World. FRBR and FRAD: the dominant models for bibliographic and authority data. OAIS: reference model for Open Archive Information Systems. CIDOC CRM: Conceptual Reference Model for cultural heritage documentation. Dublin Core Abstract Model: largely unintelligible data model for Dublin Core instance data. Singapore Framework: a vague framework describing levels of metadata interoperability.
  • 20. The Next Chapters in the Web Metadata story... are being written in the W3C Incubator Group on Library Linked Data (http://www.w3.org/2005/Incubator/lld/). Many questions: Will the data be open? Who will maintain it? Is semantic web infrastructure stable? Can existing metadata be integrated seamlessly into the web? Can a model be agreed upon? Will we ever have interoperability across domain silos? Image: Stone Monk in the Nezu Museum Garden.
  • 21. stuart.weibel@gmail.com; http://weibel-lines.typepad.com; @stuartweibel on Twitter; stuartweibel on Facebook. All photographs by the author. Image: Lantern overlooking the Irises in the Nezu Museum Garden.
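
The modeling mismatch raised on slide 16 can be sketched in code. This is a minimal illustration and not something from the talk: the class names (`FlatCitation`, `Person`, `Resource`) and the function `to_simple_dc` are hypothetical, and only `title`, `creator`, and `date` correspond to Dublin Core element names. It contrasts a flat record, where it is ambiguous whether the email address describes the resource or the author, with a small entity-relationship model that attaches it to the person, and then shows why a crosswalk to a simpler element set loses information.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class FlatCitation:
    """Flat record: every attribute hangs off the resource,
    so 'email' looks like a property of the document itself."""
    title: str
    author: str
    affiliation: str
    email: str
    date: str


@dataclass
class Person:
    """Agent entity: affiliation and email describe the person."""
    name: str
    affiliation: str
    email: str


@dataclass
class Resource:
    """Resource entity, related to zero or more creator agents."""
    title: str
    date: str
    creators: List[Person] = field(default_factory=list)


def to_simple_dc(resource: Resource) -> Dict[str, List[str]]:
    """Hypothetical crosswalk to simple element/value pairs.

    Lossy by design: affiliation and email have no slot in this
    flat target, so they are dropped.
    """
    return {
        "title": [resource.title],
        "creator": [p.name for p in resource.creators],
        "date": [resource.date],
    }


# Example data taken from the title and contact slides of this deck.
author = Person(
    name="Stuart Weibel",
    affiliation="University of Tsukuba",
    email="stuart.weibel@gmail.com",
)
talk = Resource(
    title="Twenty Years of Metadata",
    date="2011-05-13",
    creators=[author],
)
record = to_simple_dc(talk)
```

In the entity-relationship version the question "whose email address is it?" has a single answer, while the crosswalk output keeps only what the simpler target can express. That is one way to read the "clumsy, inaccurate, inefficient" remark on slide 15.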