Dragging old data forward:
finding yourself an RDA Helper
Terry Reese, Gray Family Chair for Innovative Library Services
Email: terry.reese@oregonstate.edu
Vehicle for Research -- MarcEdit

• MarcEdit
   • http://people.oregonstate.edu/~reeset/marcedit




1
January 28, 2013
http://tardthegrumpycat.tumblr.com/page/2




Common Questions I hear

• What about the GMD?
• We code all our data in RDA; how do we deal with other
  people's?
• What do we do with bulk data loads? Vendor data?
• Do we care about Legacy Data?
• My library has been encoding records with RDA fields for over
  a year and now they are incomplete. I have thousands – what
  can I do?
• WHAT ABOUT THE GMD?


So what is the RDA Helper?

• It’s a proof of concept to demonstrate that:

   1.      Most current RDA fields can be derived from existing data
   2.      Migration paths for legacy/bulk data can and should exist
   3.      Abbreviation expansion maybe isn’t as straightforward as we would
           like
   4.      GMD data can be automatically generated from existing RDA data
   5.      Vehicle for experimentation
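
Point 4 can be illustrated with a small sketch. The lookup below is an assumption made for illustration, not the RDA Helper's actual rule set: it maps RDA media type (337 $a) and carrier type (338 $a) terms to a legacy AACR2-style GMD for 245 $h. The function name and the handful of rules are hypothetical.

```python
# Hypothetical, deliberately incomplete mapping from RDA 337/338 terms
# to a legacy GMD. These rules are illustrative assumptions only.
GMD_BY_CARRIER = {
    "online resource": "electronic resource",
    "videodisc": "videorecording",
    "videocassette": "videorecording",
    "audio disc": "sound recording",
    "microfiche": "microform",
}

GMD_BY_MEDIA = {
    "computer": "electronic resource",
    "video": "videorecording",
    "audio": "sound recording",
    "microform": "microform",
}


def gmd_from_rda(media_type, carrier_type):
    """Return a bracketed GMD for 245 $h, or None when no rule applies."""
    # Prefer the more specific carrier term, then fall back to media type.
    gmd = GMD_BY_CARRIER.get(carrier_type.strip().lower())
    if gmd is None:
        gmd = GMD_BY_MEDIA.get(media_type.strip().lower())
    return f"[{gmd}]" if gmd else None


print(gmd_from_rda("computer", "online resource"))
print(gmd_from_rda("unmediated", "volume"))
```

Returning None for unmapped combinations (such as a printed volume) keeps the sketch from inventing a GMD where legacy practice would not have supplied one.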




Scope of the project

• RDA Helper has been limited to looking at practical
  implementation of RDA elements in MARC
   • Looking specifically at:
       •   336/337/338 field groups
       •   344/345/346/347 field groups
       •   380/381 field groups
       •   Evaluating the 260
       •   Processing Abbreviation Expansion
       •   GMD processing


• Determine how easy 3rd-party development/engagement with
  the RDA standard/metadata community will be going forward.
http://talkingleadership.wordpress.com/2012/05/01/building-a-feedback-relationship/



Hitting a brick wall




                   http://www.flickr.com/photos/camknows/8374910613/


Mining the Data

• Does the data already exist in MARC records?
   • Yes and no – while much of the data can be extrapolated, the generation of
     many new RDA specific fields requires evaluation of multiple data points.

• The most important data points?
   • LDR/007/008 – with these three data points, you can generate most RDA
     specific field data.
   • GMD
   • 856
   • 300
   • 130
   • 240
   • 730
   • 740
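
As a rough sketch of how the LDR and 007 can drive field generation, the snippet below derives 336 (content type) and 337 (media type) values from LDR/06 and 007/00. The tables cover only a few common codes, the function name is hypothetical, and the 008 and the 338 (which needs further 007 positions) are left out. It shows the general idea under those assumptions, not the RDA Helper's own logic.

```python
# LDR/06 (type of record) -> RDA content type term and code (336 $a, $b).
# Partial table for illustration; real records need many more cases.
CONTENT_BY_LDR06 = {
    "a": ("text", "txt"),
    "c": ("notated music", "ntm"),
    "e": ("cartographic image", "cri"),
    "g": ("two-dimensional moving image", "tdi"),
    "i": ("spoken word", "spw"),
    "j": ("performed music", "prm"),
}

# 007/00 (category of material) -> RDA media type term and code (337 $a, $b).
MEDIA_BY_007 = {
    "c": ("computer", "c"),
    "h": ("microform", "h"),
    "s": ("audio", "s"),
    "v": ("video", "v"),
}


def rda_fields(leader, field_007=""):
    """Return (tag, term, code, source) tuples derivable from LDR and 007."""
    fields = []
    content = CONTENT_BY_LDR06.get(leader[6:7])
    if content:
        fields.append(("336", content[0], content[1], "rdacontent"))
    media = MEDIA_BY_007.get(field_007[:1])
    if media:
        fields.append(("337", media[0], media[1], "rdamedia"))
    return fields


# A musical sound recording: LDR/06 = "j", 007/00 = "s".
for field in rda_fields("00000njm a2200000 a 4500", "sd fsngnnmmned"):
    print(field)
```

When a code is missing from the tables the function simply emits nothing for that field, which mirrors the "yes and no" answer above: some records will need other data points before a value can be supplied.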


Mining the Data

• Abbreviation Expansion is challenging
   • Real-world data is simply real-world crazy

   • Simple Example:
           =300    $a1 v.
           =300    $a1 vol.
           =300    $aOne v.
           =300    $a1 vols.
           =300    $aV.
           =300    $av.
           =300    $a12 v.
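
A minimal sketch of one way to normalize the variants above, assuming a single regular expression and a naive singular/plural rule. The function name and the rule for a missing count are assumptions; this is not how the RDA Helper itself is implemented.

```python
import re

# Matches "v.", "vol." and "vols." (any case) at a word boundary.
_VOLUME_ABBREV = re.compile(r"\b(v|vol|vols)\.", re.IGNORECASE)
_SINGULAR_COUNTS = {"1", "one"}


def expand_volumes(extent):
    """Expand volume abbreviations in a 300 $a extent statement."""

    def replace(match):
        # Look at the token just before the abbreviation to pick the number.
        preceding = extent[:match.start()].split()
        count = preceding[-1].lower() if preceding else ""
        # Assumption: with no count at all ("v."), fall back to the singular.
        singular = not count or count in _SINGULAR_COUNTS
        word = "volume" if singular else "volumes"
        # Preserve a leading capital, as in "V.".
        return word.capitalize() if match.group(1)[0].isupper() else word

    return _VOLUME_ABBREV.sub(replace, extent)


for value in ["1 v.", "1 vol.", "One v.", "1 vols.", "V.", "v.", "12 v."]:
    print(f"{value!r} -> {expand_volumes(value)!r}")
```

Even this small example has to guess: "v." with no count could describe one volume or many, and "1 vols." is silently corrected to the singular. Judgment calls like these are why expansion is less straightforward than a lookup list suggests.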




So how does this thing work?

• RDA Helper
   • http://www.youtube.com/watch?v=cqLMPp9vZVM&feature=player_embedded




So why create something like this at all?

• Admittedly, most of the promise behind RDA isn’t going to be
  found in these first baby steps in MARC, but…
   • To demonstrate that much of this initial work can be done automagically
     and that much of the data in our existing hybrid environments can be
     moved forward.
   • To provide a testable implementation for catalogers who are still
     uncomfortable with what these changes mean.
   • To support public libraries, many of which use ILS systems that rely
     on data that is going away, like the GMD, to create more user-friendly
     interfaces.
   • To support vendors that provide MARC records and offer a simplified
     path for moving their data forward.

Going forward




           http://www.flickr.com/photos/jannem/2079422115/sizes/z/in/photostream/

Thank you

Contact Information:
Terry Reese
Email: terry.reese@oregonstate.edu
Work: 541.737.6384

Getting MarcEdit:
http://people.oregonstate.edu/~reeset/marcedit





Editor's Notes

  1. I’ve found over the past couple years giving workshops on metadata processing, that talking about RDA is like talking about Religion and Politics. It can really bring out the crazy.
  2. I wish I was kidding about the GMD
  3. Experimentation – treating specific fields as objects for purposes of validation.
  4. RDA Helper was designed for practical usage. Now, there are a lot of concepts related to RDA that exist outside of MARC. The RDA Helper is definitely concerned with how these concepts are expressed in MARC.
  5. OSU gives me a lot of indirect support when it comes to my work around MarcEdit. Because of that – I usually find that I spend close to 2-3k a year to access ISO standards documents. These are international standards documents and as a developer, I don’t like it, but I think of it as the cost of doing business. However, I was unprepared to have to do the same to access what should be an open library standard. The library community is going to have to deal with RDA in some form – but I do worry that this specification will be dead on arrival for communities outside the library if we insist on keeping it behind a paywall.
  6. Is the data already there? You can use other data elements, but as you move down the tree, the ability to extrapolate data correctly becomes more difficult.
  7. You can use the expansion lists as a guide, but in testing, people create their own abbreviations, they are applied unevenly,
  8. OSU is in this boat: our primary cataloger is on sabbatical and our technicians haven’t been formally trained. This tool gives them the ability to process records and start seeing what the data might look like.