SlideShare a Scribd company logo
1 of 30
EDUG 2012
Symposium

26 April 2012    DDC metadata
Boston Spa, UK




                 Michael Panzer
                 Assistant Editor, DDC
                 OCLC
                 panzerm@oclc.org
Types of DDC data



 - Usually, Dewey numbers provide metadata for describing
   other resources
    - DDC as value vocabulary for metadata element sets

 - Instead, the following focuses on cases where Dewey
   numbers and DDC editions are the resources described

 - Two levels of DDC metadata
    - Number-level metadata (focus on bibliographic records)

    - Edition-level metadata (focus on classification records)
DDC metadata



 Metadata about

 - Dewey numbers (082, 083, 085 fields in MARC
   Bibliographic)
    - Provenance of machine-generated classication data

    - Dewey number components in linked 085 fields

 - Dewey editions (084, 686 fields in MARC Classification)
    - Interplay between class- and edition-level metadata rendered
      in MARC Classification format
Agenda



 Scenario                  Context
 1. Provenance of          - Proposal for MARBI;
    machine-generated        (metadata) provenance
    data                     initiatives at W3C / DCMI

 2. Edition-level metadata - Relationship between
                             translations and other
                             ―versions‖

 3. Metadata about Dewey - Enhancing Dewey
    number components      numbers for retrieval
MARBI proposal


 - Drafted over the last two months in cooperation with colleagues
   from DNB and LC
 - To be presented at MARBI meeting at ALA Annual Conference
   2012
 - Two options
    - Option 1: Addresses the immediate needs of documenting
      information about machine generation of classification data
        - Defines additional subfields in 082, 083, 084

    - Option 2: Proposes a more general way of dealing with metadata
      provenance
        - Applicable to all MARC variable fields (in principle)
        - Heeds the distinction between provenance in general and metadata
          provenance in particular
Option 1


 Defined for 082, 083, 084
 $i - Method of assignment designator
         Fully machine-generated (m)
         Not fully machine-generated (x)
 $u - Process of assignment
         May contain a URI, a process name, or some other
         description of process designated in $i
 $1 - Confidence value
         Confidence of the assigning agency in relation to the
         process described in $u. Contains value from the interval
         [0,1]
 $q – Assigning agency (already defined)
Examples



 DDC 23 number assigned by LC using AutoDewey. The
  AutoDewey process involves machine assistance followed
  by intellectual review:

  082 00 $a829/.3$223$ix$uautodewey$11



 Fictitious example of DDC 22 number assigned by OCLC in a
   fully automated way using information in Classify:

  082 04 $a394.12$222$im$uclassify$10.5$qOCoLC
Option 2


 883 - Data provenance (R)
 First Indicator: Method of assignment
   # - No information provided
   0 – Fully machine-generated
   1 – Not fully machine-generated
 $d - Date on which the linked field was generated
 $u - Process used to generate linked field
 $q - Agency using the process/activity to generate the linked field
 $1 - Confidence value
 $x - Ending date of validity
 $0 - Authority record control number or standard number
 $8 - Field link and sequence number (with new field link type ―p – Data
      provenance‖)
Examples



 082   00 $81p$a829/.3$223

 883   1# $81p$uautodewey$d20120407$qDLC$11



 082   04 $81p$a394.12$222$qOCoLC

 883   0# $81p$uclassify$d20120407$qOCoLC$10.5
Examples (2)



 082   04 $81p$a004$222/ger$qNO-OsNB

 883   0# $81p$udeweyclassifierv0.1$d20120101
       $x20141231$qNO-OsNB$10.25
       $0(DE-101)040268942



 082   04 $81p$a004$222/ger$qDE-101

 883   0# $81p$uparallelrecordcopy$d20120101
       $x20141231$qNO-OsNB
Agenda



 Scenario                  Context
 1. Provenance of          - Proposal for MARBI;
    machine-generated        (metadata) provenance
    data                     initiatives at W3C / DCMI

 2. Edition-level metadata - Relationship between
                             translations and other
                             ―versions‖

 3. Metadata about Dewey - Enhancing Dewey
    number components      numbers for retrieval
Edition-level metadata



 - Edition registry: capturing information about editions and
   translations in a centralized manner outside of MARC
   records
 - Storing additional metadata about editions/translations in
   MARC records
    - Better management of translation data and other versions

 - MARC does not offer edition-level records
    - Data info has to be carried in individual records, even when
      it applies to the whole edition

 - Relevant fields:     084 - Classification Scheme and Edition
                        686 - Relationship to Source Note
DDC translations:
Anatomy of an edition
                                           German              Italian
                                           DDC 22              DDC 22
                                                                                 Swedish
                 French
                                                                                  Mixed
                 DDC 22
                                                                                 DDC 22

 Afrikaans
 Arabic
                                                                                                 English
 Chinese
                                                                                                 French
 French               DDC                           DDC 22                      DDC Sach-
                    Summaries                                                    Gruppen         Italian
 German                                                                         (German)
 Norwegian                                                                                       Rhaeto-
 Portuguese                                                                                      Romansch
 Russian                                                               200
                                 Guide                               Religion
 Scots Gaelic                   (French)                              Class

 Spanish
 Swedish
                                                     A14

                                                                                    Vietnamese
                French
                                                                                        A14
                 A14

                                Hebrew                               Spanish
                                 A14                                   A14
                                                     Italian
                                                       A14
Types of editions



 - Related to an edition, with relationships not captured at
   record level

  Examples: sdnb, DDC Summaries, Guide

                            versus

 - Related to an edition, with relationships captured at
   record level

  Examples: 200 Religion, translations, A15engind
Tracking edition-to-edition relationships


 Translation of standard edition
         084 1# $a ddc $c 15 $e ind
                                                           Source edition
                                               084 1# $a ddc $c 15 $e eng


 Authorized derivative version of standard edition
         084 8# $a ddc $c 22sdnb $d 22 $e ger
                                                           Source edition
                                               084 0# $a ddc $c 22 $e eng
 - Not explicitly full or abridged; ―8‖ is used for value of first indicator
 - $n should be automatically populated with relevant information
   about the changes regarding the source edition.
Tracking record-to-record relationships



                1. Record has been modified

 Translation of standard edition
  084 1# $a ddc $c 15 $e ind
  686 3# $i modified



                                                Source record
                                   084 1# $a ddc $c 15 $e eng
Tracking record-to-record relationships (2)



            2. Record was created for translation

 Translation of standard edition
  084 1# $a ddc $c 15 $e ind
  686 1# $b 305.899



                                               Source record
                                             [does not exist]
Tracking record-to-record relationships (3)



     3. Unmodified record from different source edition

 Translation of standard edition
  084 1# $a ddc $c 15 $e ind
  686 0# $2 23



                                                Source record
                                   084 0# $a ddc $c 23 $e eng
Agenda



 Scenario                  Context
 1. Provenance of          - Proposal for MARBI;
    machine-generated        (metadata) provenance
    data                     initiatives at W3C / DCMI

 2. Edition-level metadata - Relationship between
                             translations and other
                             ―versions‖

 3. Metadata about Dewey - Enhancing Dewey
    number components      numbers for retrieval
085 - Synthesized Classification Number
Components


 - 085 fields provide information about components of Dewey
   numbers in linked 082 or 083 fields

 - Mirror 765 fields in MARC Classification format

 - Vital for faceted retrieval driven by Dewey numbers
    - Further enhancements possible by utilizing mappings of
      Dewey numbers that occur prominently as components, e.g,
      geographic data, time periods

 - Definition of new indexes is a requirement for retrieval
   use for WoldCat data
Exploiting Dewey facets in WorldCat



 Das Highlander-Kochbuch

 082 04 $8 1x $a 641.594115 $q DE-101 $2 22/ger

 085 ## $8 1x $b 641.59

 085 ## $8 1x $z 2 $s 4115



 641.593-.599 Cooking characteristic of specific continents,
              countries, localities

 T2—4115       Highland
Proposed new indexes (083 fields)



 ―Dewey additional‖ index

 da index:    Add $z and $c ($y) to elements already in dd
              index

 Pattern:     [z--]a[-c][:a[-c]]
Proposed new indexes (085 fields)



 ―Dewey components‖ index

 dc index:    Index $s and $t concatenated with full
              address

 Pattern:     [z--]rs|w[-c][:t]



 ―Dewey synthesized‖ index

 ds index:    Index all components

 Pattern:     [z--]a|b|rs|u|w[-c][:a|b|t|u|v[-c]]
Proposed new indexes (082/083/085 fields)



 ―Dewey general‖ index

 dg index:    Index all elements in Dewey numbers

 Pattern:     Combine dd, da, and ds indexes
Example: History of Cologne during WWII


Built number: 943.55140864

 9             History & geography

+ T2—435514 Cologne

+ 943.0864     Period of World War II, 1939-1945




082 00 $8 1x $a 943/.55140864 $2 22

085 0# $8 1x $b 9 $a 930 $c 990 $z 2 $s 435514 $u 943.5514

085 0# $8 1x $b 943.5514 $a 930 $c 990 $v 01 $c 09 $f 0 $r 943.0 $s 864
  $u 943.55140864
Access points / findability


082 00 $8 1x $a 943/.55140864 $2 22

085 0# $8 1x $b 9 $a 930 $c 990 $z 2 $s 435514 $u 943.5514

085 0# $8 1x $b 943.5514 $a 930 $c 990 $v 01 $c 09 $f 0 $r 943.0 $s 864
  $u 943.55140864

   Components in dc index:

     2--435514

     943.0864

   Synthesized components in ds index:

     2--435514, 9, 930-990, 930-990:01-09, 943.0864,
     943.5514, 943.55140864
Scenarios / Use cases



 - Components / facets can be varied independently of each
   other
    - Allows for expanding, but also "morphing" the query by
      changing individual components

 - Integration of mapped vocabularies into Dewey-driven
   discovery process
    - Using terms that have been mapped to any number
      components

 - Usage of local hierarchies of number components instead
   of just the hierarchical relationships of the base number
Example: Dewey-driven discovery
 Number components + mapped GeoNames

 394.120954
                                                            Neighboring countries:
 394.12 + T2—54                                                  China           T2—51
                                                                 Pakistan        T2—5491
                                                                 Bangladesh      T2—5492
                                             gn:neighbour        Nepal           T2—5496
                                                                 Bhutan          T2—5498
       394.125 Meals                                             Myanmar         T2—591
                            notational


               structural         394.120954

395.54 Table manners
                                structural



394.13 Drinking of alcoholic
       beverages
                          394.13 + T2—51
Thank You!

Questions? Comments? Ideas?
Some useful links

DDC 23                http://www.oclc.org/us/en/dewey/versions/print/default.htm


Abridged Edition 15   http://www.oclc.org/us/en/dewey/versions/abridged/default.htm


WebDewey 2.0          http://dewey.org/webdewey


dewey.info            http://dewey.info


Dewey webinars &      http://www.oclc.org/us/en/dewey/news/conferences/default.htm
presentations

025.431:              http://ddc.typepad.com
The Dewey blog
Classify              http://classify.oclc.org/classify2/


Questions?            dewey@loc.gov (Dewey Editorial Office)
                      dewey@oclc.org (Licensing, group purchases, LIS program)

More Related Content

Viewers also liked

Nowiny Gliwickie tabletowo
Nowiny Gliwickie tabletowoNowiny Gliwickie tabletowo
Nowiny Gliwickie tabletowombuksa
 
How To Use Social Media
How To Use Social MediaHow To Use Social Media
How To Use Social Mediaartsubsites
 
H&M Social Media Strategy
H&M Social Media StrategyH&M Social Media Strategy
H&M Social Media StrategyWendeeMeyers
 
Nature of businees among african and asian owned business 1
Nature of businees among african and asian owned business 1Nature of businees among african and asian owned business 1
Nature of businees among african and asian owned business 1John Johari
 
M athematics 1
M athematics 1M athematics 1
M athematics 1dinalyn01
 
Marketing Automation 2014 in 15 numbers
Marketing Automation 2014 in 15 numbersMarketing Automation 2014 in 15 numbers
Marketing Automation 2014 in 15 numbersiPresso
 

Viewers also liked (13)

Nowiny Gliwickie tabletowo
Nowiny Gliwickie tabletowoNowiny Gliwickie tabletowo
Nowiny Gliwickie tabletowo
 
Question one
Question oneQuestion one
Question one
 
How To Use Social Media
How To Use Social MediaHow To Use Social Media
How To Use Social Media
 
H&M Social Media Strategy
H&M Social Media StrategyH&M Social Media Strategy
H&M Social Media Strategy
 
Audience feedback
Audience feedback Audience feedback
Audience feedback
 
Nature of businees among african and asian owned business 1
Nature of businees among african and asian owned business 1Nature of businees among african and asian owned business 1
Nature of businees among african and asian owned business 1
 
Tips exam ptd
Tips exam ptdTips exam ptd
Tips exam ptd
 
Depression informe
Depression informeDepression informe
Depression informe
 
My school 1
My school 1My school 1
My school 1
 
M athematics 1
M athematics 1M athematics 1
M athematics 1
 
Tips exam ptd 2012
Tips exam ptd 2012Tips exam ptd 2012
Tips exam ptd 2012
 
Marketing Automation 2014 in 15 numbers
Marketing Automation 2014 in 15 numbersMarketing Automation 2014 in 15 numbers
Marketing Automation 2014 in 15 numbers
 
Presentatie speedart
Presentatie speedartPresentatie speedart
Presentatie speedart
 

Recently uploaded

Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Mark Reed
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
Q4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxQ4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxnelietumpap1
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceSamikshaHamane
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxChelloAnnAsuncion2
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Celine George
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfSpandanaRallapalli
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 

Recently uploaded (20)

Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
Q4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxQ4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptx
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdf
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 

DDC metadata

  • 1. EDUG 2012 Symposium 26 April 2012 DDC metadata Boston Spa, UK Michael Panzer Assistant Editor, DDC OCLC panzerm@oclc.org
  • 2. Types of DDC data - Usually, Dewey numbers provide metadata for describing other resources - DDC as value vocabulary for metadata element sets - Instead, the following focuses on cases where Dewey numbers and DDC editions are the resources described - Two levels of DDC metadata - Number-level metadata (focus on bibliographic records) - Edition-level metadata (focus on classification records)
  • 3. DDC metadata Metadata about - Dewey numbers (082, 083, 085 fields in MARC Bibliographic) - Provenance of machine-generated classication data - Dewey number components in linked 085 fields - Dewey editions (084, 686 fields in MARC Classification) - Interplay between class- and edition-level metadata rendered in MARC Classification format
  • 4. Agenda Scenario Context 1. Provenance of - Proposal for MARBI; machine-generated (metadata) provenance data initiatives at W3C / DCMI 2. Edition-level metadata - Relationship between translations and other ―versions‖ 3. Metadata about Dewey - Enhancing Dewey number components numbers for retrieval
  • 5. MARBI proposal - Drafted over the last two months in cooperation with colleagues from DNB and LC - To be presented at MARBI meeting at ALA Annual Conference 2012 - Two options - Option 1: Addresses the immediate needs of documenting information about machine generation of classification data - Defines additional subfields in 082, 083, 084 - Option 2: Proposes a more general way of dealing with metadata provenance - Applicable to all MARC variable fields (in principle) - Heeds the distinction between provenance in general and metadata provenance in particular
  • 6. Option 1 Defined for 082, 083, 084 $i - Method of assignment designator Fully machine-generated (m) Not fully machine-generated (x) $u - Process of assignment May contain a URI, a process name, or some other description of process designated in $i $1 - Confidence value Confidence of the assigning agency in relation to the process described in $u. Contains value from the interval [0,1] $q – Assigning agency (already defined)
  • 7. Examples DDC 23 number assigned by LC using AutoDewey. The AutoDewey process involves machine assistance followed by intellectual review: 082 00 $a829/.3$223$ix$uautodewey$11 Fictitious example of DDC 22 number assigned by OCLC in a fully automated way using information in Classify: 082 04 $a394.12$222$im$uclassify$10.5$qOCoLC
  • 8. Option 2 883 - Data provenance (R) First Indicator: Method of assignment # - No information provided 0 – Fully machine-generated 1 – Not fully machine-generated $d - Date on which the linked field was generated $u - Process used to generate linked field $q - Agency using the process/activity to generate the linked field $1 - Confidence value $x - Ending date of validity $0 - Authority record control number or standard number $8 - Field link and sequence number (with new field link type ―p – Data provenance‖)
  • 9. Examples 082 00 $81p$a829/.3$223 883 1# $81p$uautodewey$d20120407$qDLC$11 082 04 $81p$a394.12$222$qOCoLC 883 0# $81p$uclassify$d20120407$qOCoLC$10.5
  • 10. Examples (2) 082 04 $81p$a004$222/ger$qNO-OsNB 883 0# $81p$udeweyclassifierv0.1$d20120101 $x20141231$qNO-OsNB$10.25 $0(DE-101)040268942 082 04 $81p$a004$222/ger$qDE-101 883 0# $81p$uparallelrecordcopy$d20120101 $x20141231$qNO-OsNB
  • 11. Agenda Scenario Context 1. Provenance of - Proposal for MARBI; machine-generated (metadata) provenance data initiatives at W3C / DCMI 2. Edition-level metadata - Relationship between translations and other ―versions‖ 3. Metadata about Dewey - Enhancing Dewey number components numbers for retrieval
  • 12. Edition-level metadata - Edition registry: capturing information about editions and translations in a centralized manner outside of MARC records - Storing additional metadata about editions/translations in MARC records - Better management of translation data and other versions - MARC does not offer edition-level records - Data info has to be carried in individual records, even when it applies to the whole edition - Relevant fields: 084 - Classification Scheme and Edition 686 - Relationship to Source Note
  • 13. DDC translations: Anatomy of an edition German Italian DDC 22 DDC 22 Swedish French Mixed DDC 22 DDC 22 Afrikaans Arabic English Chinese French French DDC DDC 22 DDC Sach- Summaries Gruppen Italian German (German) Norwegian Rhaeto- Portuguese Romansch Russian 200 Guide Religion Scots Gaelic (French) Class Spanish Swedish A14 Vietnamese French A14 A14 Hebrew Spanish A14 A14 Italian A14
  • 14. Types of editions - Related to an edition, with relationships not captured at record level Examples: sdnb, DDC Summaries, Guide versus - Related to an edition, with relationships captured at record level Examples: 200 Religion, translations, A15engind
  • 15. Tracking edition-to-edition relationships Translation of standard edition 084 1# $a ddc $c 15 $e ind Source edition 084 1# $a ddc $c 15 $e eng Authorized derivative version of standard edition 084 8# $a ddc $c 22sdnb $d 22 $e ger Source edition 084 0# $a ddc $c 22 $e eng - Not explicitly full or abridged; ―8‖ is used for value of first indicator - $n should be automatically populated with relevant information about the changes regarding the source edition.
  • 16. Tracking record-to-record relationships 1. Record has been modified Translation of standard edition 084 1# $a ddc $c 15 $e ind 686 3# $i modified Source record 084 1# $a ddc $c 15 $e eng
  • 17. Tracking record-to-record relationships (2) 2. Record was created for translation Translation of standard edition 084 1# $a ddc $c 15 $e ind 686 1# $b 305.899 Source record [does not exist]
  • 18. Tracking record-to-record relationships (3) 3. Unmodified record from different source edition Translation of standard edition 084 1# $a ddc $c 15 $e ind 686 0# $2 23 Source record 084 0# $a ddc $c 23 $e eng
  • 19. Agenda Scenario Context 1. Provenance of - Proposal for MARBI; machine-generated (metadata) provenance data initiatives at W3C / DCMI 2. Edition-level metadata - Relationship between translations and other ―versions‖ 3. Metadata about Dewey - Enhancing Dewey number components numbers for retrieval
  • 20. 085 - Synthesized Classification Number Components - 085 fields provide information about components of Dewey numbers in linked 082 or 083 fields - Mirror 765 fields in MARC Classification format - Vital for faceted retrieval driven by Dewey numbers - Further enhancements possible by utilizing mappings of Dewey numbers that occur prominently as components, e.g, geographic data, time periods - Definition of new indexes is a requirement for retrieval use for WoldCat data
  • 21. Exploiting Dewey facets in WorldCat Das Highlander-Kochbuch 082 04 $8 1x $a 641.594115 $q DE-101 $2 22/ger 085 ## $8 1x $b 641.59 085 ## $8 1x $z 2 $s 4115 641.593-.599 Cooking characteristic of specific continents, countries, localities T2—4115 Highland
  • 22. Proposed new indexes (083 fields) ―Dewey additional‖ index da index: Add $z and $c ($y) to elements already in dd index Pattern: [z--]a[-c][:a[-c]]
  • 23. Proposed new indexes (085 fields) ―Dewey components‖ index dc index: Index $s and $t concatenated with full address Pattern: [z--]rs|w[-c][:t] ―Dewey synthesized‖ index ds index: Index all components Pattern: [z--]a|b|rs|u|w[-c][:a|b|t|u|v[-c]]
  • 24. Proposed new indexes (082/083/085 fields) ―Dewey general‖ index dg index: Index all elements in Dewey numbers Pattern: Combine dd, da, and ds indexes
  • 25. Example: History of Cologne during WWII Built number: 943.55140864 9 History & geography + T2—435514 Cologne + 943.0864 Period of World War II, 1939-1945 082 00 $8 1x $a 943/.55140864 $2 22 085 0# $8 1x $b 9 $a 930 $c 990 $z 2 $s 435514 $u 943.5514 085 0# $8 1x $b 943.5514 $a 930 $c 990 $v 01 $c 09 $f 0 $r 943.0 $s 864 $u 943.55140864
  • 26. Access points / findability 082 00 $8 1x $a 943/.55140864 $2 22 085 0# $8 1x $b 9 $a 930 $c 990 $z 2 $s 435514 $u 943.5514 085 0# $8 1x $b 943.5514 $a 930 $c 990 $v 01 $c 09 $f 0 $r 943.0 $s 864 $u 943.55140864 Components in dc index: 2--435514 943.0864 Synthesized components in ds index: 2--435514, 9, 930-990, 930-990:01-09, 943.0864, 943.5514, 943.55140864
  • 27. Scenarios / Use cases - Components / facets can be varied independently of each other - Allows for expanding, but also "morphing" the query by changing individual components - Integration of mapped vocabularies into Dewey-driven discovery process - Using terms that have been mapped to any number components - Usage of local hierarchies of number components instead of just the hierarchical relationships of the base number
  • 28. Example: Dewey-driven discovery Number components + mapped GeoNames 394.120954 Neighboring countries: 394.12 + T2—54 China T2—51 Pakistan T2—5491 Bangladesh T2—5492 gn:neighbour Nepal T2—5496 Bhutan T2—5498 394.125 Meals Myanmar T2—591 notational structural 394.120954 395.54 Table manners structural 394.13 Drinking of alcoholic beverages 394.13 + T2—51
  • 30. Some useful links DDC 23 http://www.oclc.org/us/en/dewey/versions/print/default.htm Abridged Edition 15 http://www.oclc.org/us/en/dewey/versions/abridged/default.htm WebDewey 2.0 http://dewey.org/webdewey dewey.info http://dewey.info Dewey webinars & http://www.oclc.org/us/en/dewey/news/conferences/default.htm presentations 025.431: http://ddc.typepad.com The Dewey blog Classify http://classify.oclc.org/classify2/ Questions? dewey@loc.gov (Dewey Editorial Office) dewey@oclc.org (Licensing, group purchases, LIS program)

Editor's Notes

  1. 1. How to provide provenance metadata for machine-generated Dewey numbers and other pieces of classification information, and how to effectively express data provenance in the context of a MARC record.2. How metadata about Dewey editions can be used to establish relationships between translations and other versions of the classification.3. How metadata about Dewey number components provided by 085 fields can enhance the use of Dewey numbers in information discovery.
  2. Note: $n should be automatically populated with relevant information about the changes regarding the source edition. A possible place to store $n on an edition level is the edition registry
  3. Why go through all this trouble indexing subfields of number components?