SlideShare a Scribd company logo
MarcEdit: Doing more, but
faster
Terry Reese
Gray Family Chair for Innovative Library Services
Terry.reese@oregonstate.edu
• Making your metadata work for you
• Finding ways to use MarcEdit to merge and manipulate existing
metadata in various formats
• i.e., working with XML formats, delimited formats, Excel, Access
• Dealing with data in multiple charactersets as we transition to a
Unicode world
• Learning how to automate repetitive tasks, and understand what
editing functions are available to you
• Leveraging webservices like OCLC WorldCat to provide automatic
classifications
METADATA MANIPULATION
MARC Tools Portal
Marc Tools
• Built-in functions
• MarcBreaker – Tool used to convert MARC records to the
MarcEdit mnemonic format
• MarcMaker – Tool used to convert MarcEdit mnemonic format to
MARC
• MARC=>MARC21XML – converts MARC to MARC21XML
• Automatically converts data from MARC-8 to UTF8
• MARC21XML=>MARC – converts MARC21XML to MARC
• Doesn’t automatically convert data from UTF8 to MARC8 – will leave
data in UTF8
MARC Character Conversions
• Supports moving between
any known Windows
Characterset and MARC8.
• Can be run from the
Breaker/Maker – or as its
own standalone utility
MARCSplit/MARCJoin
• Utility used for
splitting large MARC
record sets into
smaller files
• Utility used for
joining large
sets of MARC
data to a single
file
Batch Record Processor
• Allows MarcEdit to process
“lots” of files.
• Files can be processed
against an entire folder’s
contents or by file type
• Can utilize any built-in or
derived XML Function
transformation
MarcEdit and bad records
• Two MARC breaking algorithms
• Strict MARC algorithm
• Loose breaking algorithm
• Loose algorithm can heal MARC records (sometimes)
• Structural errors
• Missing field or record markers
Delimited text translator
• Delimited Text Translator
• Translates Tab, comma, pipe, Excel (Office 2000-2007), Access
(Office 2000-2007) files into MARC
• Can save translation maps
• Can create constant data
Delimited text translator Options
• Wizard-like interface
• Supports Unicode data (in excel or delimited file)
• Joining (relating) fields
• Editing global 008/LDR
Delimited Text Translator: Mapping
format
• Map to: Field + subfield
• Indicators: Indicator values
• Term Punct.: Trailing
punctuation
• Arguments – Joining
defined items (select and
right click on items)
• Ability to save templates
Common Joining techniques
• When would I mark a field as repeatable?
• By default, when the Delimited Text translator encounters two
like subfields on the same field, it creates a new field. For
example:
column 1: This is a note
column 2: This is a note 2
if I mapped column 1 500$a and column 2 to 500$a, by default,
MarcEdit would generate the following output:
=500 $aThis is a note
=500 $aThis is a note 2
• However….
Common Joining techniques
• When would I mark a field as repeatable?
• If I need to have multiple, like subfields on the same field, for
example, like a subject field – we would mark the field as
repeatable:
column 1: Geology
column 2: Oregon
column 3: Corvallis
If these fields were not marked as repeatable, the output would
look like:
=650 0$aGeology$zOregon
=650 0$zCorvallis
However, if these fields were marked as repeatable, the output
would look like:
=650 0$aGeology$zOregon$zCorvallis
MARC Conversions
MarcEdit Crosswalking model
Finding and Contributing
Crosswalks
• In MarcEdit 5.6, an option was added to allow users to search
for crosswalks
• Currently, these are crosswalks I or LC have created
• Hopefully, community members will submit crosswalks for
inclusion into the registry
MarcEdit: Crosswalks for everyone
Harvesting Metadata
• MarcEdit includes a
builtin OAI harvester
• Allows for direct
XML=>MARC
translations
• Allows for custom
modification of XSLT
translation tables.
Harvesting Metadata
• Required data
• Host name: i.e., http://ir.library.oregonstate.edu/request/oai
• Metadata Type
• Natively supports MARCXML, Dublin Core, OAIMARC and MODS
• Options to support conditional harvests, raw data harvests, and
resumptive harvests.
RECORD EDITING
MarcEditor
MarcEditor Properties
• Templates
• Fonts
• Encodings
• Preview Settings
Configuring New Paging
• Set in the Options dialog
Paging Example
• If you load the full file, or turn the preview mode off
Editing MARC
• MarcEditor
• Supports a number of global editing functions:
• Edit Subsets of records
• Find/Replace functionality
• Globally Add/Delete MARC fields
• Globally Edit Subfield data
• Conditionally add/remove field data
• Globally Edit Indicator data
• Globally Swap field data
• Record Deduplication
• Record Sorting
• Call Number Generator
• Macros
Editing MARC – Find/Replace
• Works like a normal
Find/Replace in most
Textpad utilities.
• Unlike most Textpads,
Replace supports UTF-8
(when working with UTF-
8 files) and regular
expressions.
Editing MARC – Find All
• Find all function was
designed for use with the
Paging mode
• Allows users to find any
text across all pages
• Generates a jump list that
can be used to find
individual records for edit
Jump to
• Jump to…record:
• Allows you to jump to any records
• Jump to…page:
• Allows you to jump to any page
Editing MARC – Global
Add/Delete Field
• Globally add fields to all MARC records
• Allows users to set insertion position.
• Globally delete fields
• Allows global delete
• Allows conditional delete
• Supports Regular Expressions
Editing MARC – Modifying
subfield data
• Allows for the modification of variable MARC
field subfield data (MARC fields >10)
• Allows for the modification of control field data
by position or range of positions
• Allows users to prepend and append data to
subfields.
• Allows users to change subfield tagging.
Editing MARC – Modifying
subfield data
• Allows users to insert new subfields and define subfield
placement.
• Allows users to move field data from one field to another.
• Supports:
• UTF-8 with UTF-8 files
• Regular Expressions
• Adding new subfields.
Editing MARC – Modifying subfield data
Editing MARC – Swapping
Fields
• Swap parts of MARC
Fields or entire MARC
fields
• Define field, indicator
and subfields to move.
• Can move field data and
delete the original field
or clone the field data
and move the clone to
the new location.
• Can add data to an
existing field.
Character Conversions within
the MarcEditor
• MarcEditor allows users to
convert character data
between different
charactersets.
Fixing Boo-boos
• MarcEdit’s Special Undo
• Allows you to step back one global change.
Sorting Fields
• MarcEdit provides multiple
sorting types:
• Control Number
• Sorts record position within the file
• Title
• Sorts record position within the file
• Author
• Sorts record position within the file
• Call Number
• Sorts record position within the file
• 0xx Fields
• Sorts the 0xx fields within individual
records (does *not* change record
position within a file)
• All Fields
• Sorts all fields within individual
records (does *not* change record
position within a file)
• Custom Sort
• Sorts all defined fields within
individual records (does *not*
change record position within a file)
Record Deduplication
• MarcEdit provides a
simple dedup tool that
can:
• Dedup on a defined
control field (any field)
• Dedup on a transaction
field (or using an additional
transaction field)
• Output
• Removes all duplications
and saves the duplications
to a file
• Prints just unique items
within the file (i.e., those
without a duplicate pair)
Field Counts
• Field Count
• Provides a quick count
of fields
• Report of subfields
used within a
particular field
• Detailed reports of all
fields/subfields used
within a fileset.
Material Type Report
• Material Type Report
• Reports number of
records by material
type
• Breaks down material
type by sub-types
• Utilizes the Leader,
008 and GMD to
determine format
types
Task Automation Tool
• Stacking Operations
• Task automation provides a way for non-programmers to create
defined task lists that can then be executed automatically
• The different between a task and a macro is that MarcEdit tasks
essentially function like the user was calling specific functions
within MarcEdit.
• Anything that you can do in the MarcEditor, you can automate as
a task.
Task Automation
• Managing Tasks
• Task management
works like macro
management
• You can
• Create new tasks
• Clone tasks
• Rename tasks
• Delete tasks
• Edit tasks
Task Automation Demo
• Additional Information:
• Youtube:
• Introduction to task automation: http://www.youtube.com/watch?
v=gmqTGfTubU4
• Introduction to new task automation functions:
http://www.youtube.com/watch?v=fnorN0MFFN0
• MarcEdit can leverage OCLC WorldCat to generate call
numbers automatically for files
• Fields used:
• 001
• 010$a$z
• 020$a$z
• 022$a$z
• 024$a$z
• 1xx$a
• 776$w$z
OCLC Classify Service
OCLC Classify Service
FUTURE DEVELOPMENT
MarcEdit 5.9+
• AACR2->RDA macros
• Low-hanging conversions to support batch data processing
• Merge Record Enhancements
• Adding more data points and customized merge fields
• More Automation support
• Ability to turn Edit shortcuts into Automation tasks
• Batch OAI Harvesting
• Create jobs that you can schedule and have automatically run for you
• Batch Set Holdings
• Using either crappy z39.50 or OCLC’s yet to be publically released API
for holdings settings.
Getting Help
• Call/write me:
• terry.reese@oregonstate.edu
• Ask the list:
• MarcEdit ListServ
• http://listserv.gmu.edu/cgi-bin/wa?A0=marcedit-l
Questions

More Related Content

What's hot

MarcEdit Shelter-In-Place Webinar 4: Merging, Clustering, and Integrations…oh...
MarcEdit Shelter-In-Place Webinar 4: Merging, Clustering, and Integrations…oh...MarcEdit Shelter-In-Place Webinar 4: Merging, Clustering, and Integrations…oh...
MarcEdit Shelter-In-Place Webinar 4: Merging, Clustering, and Integrations…oh...
Terry Reese
 
Make MarcEdit Work For You: OLC Technical Services Retreat
Make MarcEdit Work For You: OLC Technical Services RetreatMake MarcEdit Work For You: OLC Technical Services Retreat
Make MarcEdit Work For You: OLC Technical Services Retreat
Terry Reese
 
MarcEdit Shelter-In-Place Webinar 5: Working with MarcEdit's Linked Data Fram...
MarcEdit Shelter-In-Place Webinar 5: Working with MarcEdit's Linked Data Fram...MarcEdit Shelter-In-Place Webinar 5: Working with MarcEdit's Linked Data Fram...
MarcEdit Shelter-In-Place Webinar 5: Working with MarcEdit's Linked Data Fram...
Terry Reese
 
Fitting MarcEdit into the library software ecosystem
Fitting MarcEdit into the library software ecosystemFitting MarcEdit into the library software ecosystem
Fitting MarcEdit into the library software ecosystem
Terry Reese
 
Reimagining Serials handout: BIBFRAME Exercise
Reimagining Serials handout: BIBFRAME ExerciseReimagining Serials handout: BIBFRAME Exercise
Reimagining Serials handout: BIBFRAME Exercise
NASIG
 
Data base connectivity and flex grid in vb
Data base connectivity and flex grid in vbData base connectivity and flex grid in vb
Data base connectivity and flex grid in vbAmandeep Kaur
 
vishual basic data base Pankaj
vishual basic data base Pankajvishual basic data base Pankaj
vishual basic data base Pankaj
Rai Saheb Bhanwar Singh College Nasrullaganj
 
External Tables - not just loading a csv file
External Tables - not just loading a csv fileExternal Tables - not just loading a csv file
External Tables - not just loading a csv file
Kim Berg Hansen
 
Data Connection using ADO DC
Data Connection using ADO DCData Connection using ADO DC
Data Connection using ADO DC
Purbanjali Das
 
Find your way in Graph labyrinths
Find your way in Graph labyrinthsFind your way in Graph labyrinths
Find your way in Graph labyrinthsDaniel Camarda
 
Oracle 10g Forms Lesson 8
Oracle 10g Forms Lesson 8Oracle 10g Forms Lesson 8
Oracle 10g Forms Lesson 8
KAMA3
 
Visual basic databases
Visual basic databasesVisual basic databases
Visual basic databases
Speed Cyber Cafe
 
Deletion of a Record from a File - K Karun
Deletion of a Record from a File - K KarunDeletion of a Record from a File - K Karun
Deletion of a Record from a File - K Karun
Dipayan Sarkar
 
Libreoffice conference 2014 Easy hacks to improve writer - ooxml interoperabi...
Libreoffice conference 2014 Easy hacks to improve writer - ooxml interoperabi...Libreoffice conference 2014 Easy hacks to improve writer - ooxml interoperabi...
Libreoffice conference 2014 Easy hacks to improve writer - ooxml interoperabi...
Sushil Shinde
 

What's hot (17)

MarcEdit Shelter-In-Place Webinar 4: Merging, Clustering, and Integrations…oh...
MarcEdit Shelter-In-Place Webinar 4: Merging, Clustering, and Integrations…oh...MarcEdit Shelter-In-Place Webinar 4: Merging, Clustering, and Integrations…oh...
MarcEdit Shelter-In-Place Webinar 4: Merging, Clustering, and Integrations…oh...
 
Make MarcEdit Work For You: OLC Technical Services Retreat
Make MarcEdit Work For You: OLC Technical Services RetreatMake MarcEdit Work For You: OLC Technical Services Retreat
Make MarcEdit Work For You: OLC Technical Services Retreat
 
MarcEdit Shelter-In-Place Webinar 5: Working with MarcEdit's Linked Data Fram...
MarcEdit Shelter-In-Place Webinar 5: Working with MarcEdit's Linked Data Fram...MarcEdit Shelter-In-Place Webinar 5: Working with MarcEdit's Linked Data Fram...
MarcEdit Shelter-In-Place Webinar 5: Working with MarcEdit's Linked Data Fram...
 
Fitting MarcEdit into the library software ecosystem
Fitting MarcEdit into the library software ecosystemFitting MarcEdit into the library software ecosystem
Fitting MarcEdit into the library software ecosystem
 
Reimagining Serials handout: BIBFRAME Exercise
Reimagining Serials handout: BIBFRAME ExerciseReimagining Serials handout: BIBFRAME Exercise
Reimagining Serials handout: BIBFRAME Exercise
 
Data base connectivity and flex grid in vb
Data base connectivity and flex grid in vbData base connectivity and flex grid in vb
Data base connectivity and flex grid in vb
 
vishual basic data base Pankaj
vishual basic data base Pankajvishual basic data base Pankaj
vishual basic data base Pankaj
 
External Tables - not just loading a csv file
External Tables - not just loading a csv fileExternal Tables - not just loading a csv file
External Tables - not just loading a csv file
 
Data Connection using ADO DC
Data Connection using ADO DCData Connection using ADO DC
Data Connection using ADO DC
 
Find your way in Graph labyrinths
Find your way in Graph labyrinthsFind your way in Graph labyrinths
Find your way in Graph labyrinths
 
Cdi implementation
Cdi implementationCdi implementation
Cdi implementation
 
Oracle 10g Forms Lesson 8
Oracle 10g Forms Lesson 8Oracle 10g Forms Lesson 8
Oracle 10g Forms Lesson 8
 
Visual basic databases
Visual basic databasesVisual basic databases
Visual basic databases
 
Deletion of a Record from a File - K Karun
Deletion of a Record from a File - K KarunDeletion of a Record from a File - K Karun
Deletion of a Record from a File - K Karun
 
Les04
Les04Les04
Les04
 
Data loader.ppt
Data loader.pptData loader.ppt
Data loader.ppt
 
Libreoffice conference 2014 Easy hacks to improve writer - ooxml interoperabi...
Libreoffice conference 2014 Easy hacks to improve writer - ooxml interoperabi...Libreoffice conference 2014 Easy hacks to improve writer - ooxml interoperabi...
Libreoffice conference 2014 Easy hacks to improve writer - ooxml interoperabi...
 

Similar to Marc edit and_nonmarc_data (1)

marcedit_discussion.pdf
marcedit_discussion.pdfmarcedit_discussion.pdf
marcedit_discussion.pdf
HOANGNGOCTUAN2
 
OLC Batch Edit Panel: Specialized Tools for Batch Editing MarcEdit
OLC Batch Edit Panel: Specialized Tools for Batch Editing MarcEditOLC Batch Edit Panel: Specialized Tools for Batch Editing MarcEdit
OLC Batch Edit Panel: Specialized Tools for Batch Editing MarcEdit
Terry Reese
 
Making RDA Easy(er) with MarcEdit
Making RDA Easy(er) with MarcEditMaking RDA Easy(er) with MarcEdit
Making RDA Easy(er) with MarcEdit
Terry Reese
 
Java development with the dynamo framework
Java development with the dynamo frameworkJava development with the dynamo framework
Java development with the dynamo framework
Patrick Deenen
 
MarcEdit for Everyone with Katie Dunn
MarcEdit for Everyone with Katie DunnMarcEdit for Everyone with Katie Dunn
MarcEdit for Everyone with Katie Dunn
WiLS
 
MarcEdit for Illinois Library Association conference 2011
MarcEdit for Illinois Library Association conference 2011MarcEdit for Illinois Library Association conference 2011
MarcEdit for Illinois Library Association conference 2011
GailBordenTech
 
Ils on a shoe string budget
Ils on a shoe string budgetIls on a shoe string budget
Ils on a shoe string budget
Jolene81
 
MarcEdit Tools and Workflows
MarcEdit Tools and WorkflowsMarcEdit Tools and Workflows
MarcEdit Tools and Workflows
WiLS
 
Informatica Online Training
Informatica Online TrainingInformatica Online Training
Informatica Online Training
Rao Rao
 
Getting started with CATIA V5 Macros
Getting started with CATIA V5 MacrosGetting started with CATIA V5 Macros
Getting started with CATIA V5 Macros
Emmett Ross
 
new_informatica_1
new_informatica_1new_informatica_1
new_informatica_1rao dpr
 
Spring data presentation
Spring data presentationSpring data presentation
Spring data presentation
Oleksii Usyk
 
Query editor for multi databases
Query editor for multi databasesQuery editor for multi databases
Query editor for multi databases
Aarthi Raghavendra
 
MariaDB ColumnStore
MariaDB ColumnStoreMariaDB ColumnStore
MariaDB ColumnStore
MariaDB plc
 
Think Like Spark: Some Spark Concepts and a Use Case
Think Like Spark: Some Spark Concepts and a Use CaseThink Like Spark: Some Spark Concepts and a Use Case
Think Like Spark: Some Spark Concepts and a Use Case
Rachel Warren
 
44CON London 2015 - Reverse engineering and exploiting font rasterizers: the ...
44CON London 2015 - Reverse engineering and exploiting font rasterizers: the ...44CON London 2015 - Reverse engineering and exploiting font rasterizers: the ...
44CON London 2015 - Reverse engineering and exploiting font rasterizers: the ...
44CON
 
Using the Archivists' Toolkit: Hands-on practice and related tools
Using the Archivists' Toolkit: Hands-on practice and related toolsUsing the Archivists' Toolkit: Hands-on practice and related tools
Using the Archivists' Toolkit: Hands-on practice and related tools
Audra Eagle Yun
 
Lavacon 2011: Managing Translations in Frame DITA without a CMS
Lavacon 2011: Managing Translations in Frame DITA without a CMSLavacon 2011: Managing Translations in Frame DITA without a CMS
Lavacon 2011: Managing Translations in Frame DITA without a CMSClearPath, LLC
 

Similar to Marc edit and_nonmarc_data (1) (20)

marcedit_discussion.pdf
marcedit_discussion.pdfmarcedit_discussion.pdf
marcedit_discussion.pdf
 
OLC Batch Edit Panel: Specialized Tools for Batch Editing MarcEdit
OLC Batch Edit Panel: Specialized Tools for Batch Editing MarcEditOLC Batch Edit Panel: Specialized Tools for Batch Editing MarcEdit
OLC Batch Edit Panel: Specialized Tools for Batch Editing MarcEdit
 
Making RDA Easy(er) with MarcEdit
Making RDA Easy(er) with MarcEditMaking RDA Easy(er) with MarcEdit
Making RDA Easy(er) with MarcEdit
 
Introduction to birt
Introduction to birtIntroduction to birt
Introduction to birt
 
Java development with the dynamo framework
Java development with the dynamo frameworkJava development with the dynamo framework
Java development with the dynamo framework
 
MarcEdit for Everyone with Katie Dunn
MarcEdit for Everyone with Katie DunnMarcEdit for Everyone with Katie Dunn
MarcEdit for Everyone with Katie Dunn
 
MarcEdit for Illinois Library Association conference 2011
MarcEdit for Illinois Library Association conference 2011MarcEdit for Illinois Library Association conference 2011
MarcEdit for Illinois Library Association conference 2011
 
Ssn0020 ssis 2012 for beginners
Ssn0020   ssis 2012 for beginnersSsn0020   ssis 2012 for beginners
Ssn0020 ssis 2012 for beginners
 
Ils on a shoe string budget
Ils on a shoe string budgetIls on a shoe string budget
Ils on a shoe string budget
 
MarcEdit Tools and Workflows
MarcEdit Tools and WorkflowsMarcEdit Tools and Workflows
MarcEdit Tools and Workflows
 
Informatica Online Training
Informatica Online TrainingInformatica Online Training
Informatica Online Training
 
Getting started with CATIA V5 Macros
Getting started with CATIA V5 MacrosGetting started with CATIA V5 Macros
Getting started with CATIA V5 Macros
 
new_informatica_1
new_informatica_1new_informatica_1
new_informatica_1
 
Spring data presentation
Spring data presentationSpring data presentation
Spring data presentation
 
Query editor for multi databases
Query editor for multi databasesQuery editor for multi databases
Query editor for multi databases
 
MariaDB ColumnStore
MariaDB ColumnStoreMariaDB ColumnStore
MariaDB ColumnStore
 
Think Like Spark: Some Spark Concepts and a Use Case
Think Like Spark: Some Spark Concepts and a Use CaseThink Like Spark: Some Spark Concepts and a Use Case
Think Like Spark: Some Spark Concepts and a Use Case
 
44CON London 2015 - Reverse engineering and exploiting font rasterizers: the ...
44CON London 2015 - Reverse engineering and exploiting font rasterizers: the ...44CON London 2015 - Reverse engineering and exploiting font rasterizers: the ...
44CON London 2015 - Reverse engineering and exploiting font rasterizers: the ...
 
Using the Archivists' Toolkit: Hands-on practice and related tools
Using the Archivists' Toolkit: Hands-on practice and related toolsUsing the Archivists' Toolkit: Hands-on practice and related tools
Using the Archivists' Toolkit: Hands-on practice and related tools
 
Lavacon 2011: Managing Translations in Frame DITA without a CMS
Lavacon 2011: Managing Translations in Frame DITA without a CMSLavacon 2011: Managing Translations in Frame DITA without a CMS
Lavacon 2011: Managing Translations in Frame DITA without a CMS
 

Recently uploaded

Presentatie 8. Joost van der Linde & Daniel Anderton - Eliq 28 mei 2024
Presentatie 8. Joost van der Linde & Daniel Anderton - Eliq 28 mei 2024Presentatie 8. Joost van der Linde & Daniel Anderton - Eliq 28 mei 2024
Presentatie 8. Joost van der Linde & Daniel Anderton - Eliq 28 mei 2024
Dutch Power
 
Bonzo subscription_hjjjjjjjj5hhhhhhh_2024.pdf
Bonzo subscription_hjjjjjjjj5hhhhhhh_2024.pdfBonzo subscription_hjjjjjjjj5hhhhhhh_2024.pdf
Bonzo subscription_hjjjjjjjj5hhhhhhh_2024.pdf
khadija278284
 
Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024
Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024
Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024
Dutch Power
 
somanykidsbutsofewfathers-140705000023-phpapp02.pptx
somanykidsbutsofewfathers-140705000023-phpapp02.pptxsomanykidsbutsofewfathers-140705000023-phpapp02.pptx
somanykidsbutsofewfathers-140705000023-phpapp02.pptx
Howard Spence
 
Collapsing Narratives: Exploring Non-Linearity • a micro report by Rosie Wells
Collapsing Narratives: Exploring Non-Linearity • a micro report by Rosie WellsCollapsing Narratives: Exploring Non-Linearity • a micro report by Rosie Wells
Collapsing Narratives: Exploring Non-Linearity • a micro report by Rosie Wells
Rosie Wells
 
Announcement of 18th IEEE International Conference on Software Testing, Verif...
Announcement of 18th IEEE International Conference on Software Testing, Verif...Announcement of 18th IEEE International Conference on Software Testing, Verif...
Announcement of 18th IEEE International Conference on Software Testing, Verif...
Sebastiano Panichella
 
Obesity causes and management and associated medical conditions
Obesity causes and management and associated medical conditionsObesity causes and management and associated medical conditions
Obesity causes and management and associated medical conditions
Faculty of Medicine And Health Sciences
 
Burning Issue Presentation By Kenmaryon.pdf
Burning Issue Presentation By Kenmaryon.pdfBurning Issue Presentation By Kenmaryon.pdf
Burning Issue Presentation By Kenmaryon.pdf
kkirkland2
 
2024-05-30_meetup_devops_aix-marseille.pdf
2024-05-30_meetup_devops_aix-marseille.pdf2024-05-30_meetup_devops_aix-marseille.pdf
2024-05-30_meetup_devops_aix-marseille.pdf
Frederic Leger
 
International Workshop on Artificial Intelligence in Software Testing
International Workshop on Artificial Intelligence in Software TestingInternational Workshop on Artificial Intelligence in Software Testing
International Workshop on Artificial Intelligence in Software Testing
Sebastiano Panichella
 
Gregory Harris - Cycle 2 - Civics Presentation
Gregory Harris - Cycle 2 - Civics PresentationGregory Harris - Cycle 2 - Civics Presentation
Gregory Harris - Cycle 2 - Civics Presentation
gharris9
 
Supercharge your AI - SSP Industry Breakout Session 2024-v2_1.pdf
Supercharge your AI - SSP Industry Breakout Session 2024-v2_1.pdfSupercharge your AI - SSP Industry Breakout Session 2024-v2_1.pdf
Supercharge your AI - SSP Industry Breakout Session 2024-v2_1.pdf
Access Innovations, Inc.
 
Doctoral Symposium at the 17th IEEE International Conference on Software Test...
Doctoral Symposium at the 17th IEEE International Conference on Software Test...Doctoral Symposium at the 17th IEEE International Conference on Software Test...
Doctoral Symposium at the 17th IEEE International Conference on Software Test...
Sebastiano Panichella
 
Tom tresser burning issue.pptx My Burning issue
Tom tresser burning issue.pptx My Burning issueTom tresser burning issue.pptx My Burning issue
Tom tresser burning issue.pptx My Burning issue
amekonnen
 
Mastering the Concepts Tested in the Databricks Certified Data Engineer Assoc...
Mastering the Concepts Tested in the Databricks Certified Data Engineer Assoc...Mastering the Concepts Tested in the Databricks Certified Data Engineer Assoc...
Mastering the Concepts Tested in the Databricks Certified Data Engineer Assoc...
SkillCertProExams
 
Media as a Mind Controlling Strategy In Old and Modern Era
Media as a Mind Controlling Strategy In Old and Modern EraMedia as a Mind Controlling Strategy In Old and Modern Era
Media as a Mind Controlling Strategy In Old and Modern Era
faizulhassanfaiz1670
 
Gregory Harris' Civics Presentation.pptx
Gregory Harris' Civics Presentation.pptxGregory Harris' Civics Presentation.pptx
Gregory Harris' Civics Presentation.pptx
gharris9
 
María Carolina Martínez - eCommerce Day Colombia 2024
María Carolina Martínez - eCommerce Day Colombia 2024María Carolina Martínez - eCommerce Day Colombia 2024
María Carolina Martínez - eCommerce Day Colombia 2024
eCommerce Institute
 
AWANG ANIQKMALBIN AWANG TAJUDIN B22080004 ASSIGNMENT 2 MPU3193 PHILOSOPHY AND...
AWANG ANIQKMALBIN AWANG TAJUDIN B22080004 ASSIGNMENT 2 MPU3193 PHILOSOPHY AND...AWANG ANIQKMALBIN AWANG TAJUDIN B22080004 ASSIGNMENT 2 MPU3193 PHILOSOPHY AND...
AWANG ANIQKMALBIN AWANG TAJUDIN B22080004 ASSIGNMENT 2 MPU3193 PHILOSOPHY AND...
AwangAniqkmals
 

Recently uploaded (19)

Presentatie 8. Joost van der Linde & Daniel Anderton - Eliq 28 mei 2024
Presentatie 8. Joost van der Linde & Daniel Anderton - Eliq 28 mei 2024Presentatie 8. Joost van der Linde & Daniel Anderton - Eliq 28 mei 2024
Presentatie 8. Joost van der Linde & Daniel Anderton - Eliq 28 mei 2024
 
Bonzo subscription_hjjjjjjjj5hhhhhhh_2024.pdf
Bonzo subscription_hjjjjjjjj5hhhhhhh_2024.pdfBonzo subscription_hjjjjjjjj5hhhhhhh_2024.pdf
Bonzo subscription_hjjjjjjjj5hhhhhhh_2024.pdf
 
Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024
Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024
Presentatie 4. Jochen Cremer - TU Delft 28 mei 2024
 
somanykidsbutsofewfathers-140705000023-phpapp02.pptx
somanykidsbutsofewfathers-140705000023-phpapp02.pptxsomanykidsbutsofewfathers-140705000023-phpapp02.pptx
somanykidsbutsofewfathers-140705000023-phpapp02.pptx
 
Collapsing Narratives: Exploring Non-Linearity • a micro report by Rosie Wells
Collapsing Narratives: Exploring Non-Linearity • a micro report by Rosie WellsCollapsing Narratives: Exploring Non-Linearity • a micro report by Rosie Wells
Collapsing Narratives: Exploring Non-Linearity • a micro report by Rosie Wells
 
Announcement of 18th IEEE International Conference on Software Testing, Verif...
Announcement of 18th IEEE International Conference on Software Testing, Verif...Announcement of 18th IEEE International Conference on Software Testing, Verif...
Announcement of 18th IEEE International Conference on Software Testing, Verif...
 
Obesity causes and management and associated medical conditions
Obesity causes and management and associated medical conditionsObesity causes and management and associated medical conditions
Obesity causes and management and associated medical conditions
 
Burning Issue Presentation By Kenmaryon.pdf
Burning Issue Presentation By Kenmaryon.pdfBurning Issue Presentation By Kenmaryon.pdf
Burning Issue Presentation By Kenmaryon.pdf
 
2024-05-30_meetup_devops_aix-marseille.pdf
2024-05-30_meetup_devops_aix-marseille.pdf2024-05-30_meetup_devops_aix-marseille.pdf
2024-05-30_meetup_devops_aix-marseille.pdf
 
International Workshop on Artificial Intelligence in Software Testing
International Workshop on Artificial Intelligence in Software TestingInternational Workshop on Artificial Intelligence in Software Testing
International Workshop on Artificial Intelligence in Software Testing
 
Gregory Harris - Cycle 2 - Civics Presentation
Gregory Harris - Cycle 2 - Civics PresentationGregory Harris - Cycle 2 - Civics Presentation
Gregory Harris - Cycle 2 - Civics Presentation
 
Supercharge your AI - SSP Industry Breakout Session 2024-v2_1.pdf
Supercharge your AI - SSP Industry Breakout Session 2024-v2_1.pdfSupercharge your AI - SSP Industry Breakout Session 2024-v2_1.pdf
Supercharge your AI - SSP Industry Breakout Session 2024-v2_1.pdf
 
Doctoral Symposium at the 17th IEEE International Conference on Software Test...
Doctoral Symposium at the 17th IEEE International Conference on Software Test...Doctoral Symposium at the 17th IEEE International Conference on Software Test...
Doctoral Symposium at the 17th IEEE International Conference on Software Test...
 
Tom tresser burning issue.pptx My Burning issue
Tom tresser burning issue.pptx My Burning issueTom tresser burning issue.pptx My Burning issue
Tom tresser burning issue.pptx My Burning issue
 
Mastering the Concepts Tested in the Databricks Certified Data Engineer Assoc...
Mastering the Concepts Tested in the Databricks Certified Data Engineer Assoc...Mastering the Concepts Tested in the Databricks Certified Data Engineer Assoc...
Mastering the Concepts Tested in the Databricks Certified Data Engineer Assoc...
 
Media as a Mind Controlling Strategy In Old and Modern Era
Media as a Mind Controlling Strategy In Old and Modern EraMedia as a Mind Controlling Strategy In Old and Modern Era
Media as a Mind Controlling Strategy In Old and Modern Era
 
Gregory Harris' Civics Presentation.pptx
Gregory Harris' Civics Presentation.pptxGregory Harris' Civics Presentation.pptx
Gregory Harris' Civics Presentation.pptx
 
María Carolina Martínez - eCommerce Day Colombia 2024
María Carolina Martínez - eCommerce Day Colombia 2024María Carolina Martínez - eCommerce Day Colombia 2024
María Carolina Martínez - eCommerce Day Colombia 2024
 
AWANG ANIQKMALBIN AWANG TAJUDIN B22080004 ASSIGNMENT 2 MPU3193 PHILOSOPHY AND...
AWANG ANIQKMALBIN AWANG TAJUDIN B22080004 ASSIGNMENT 2 MPU3193 PHILOSOPHY AND...AWANG ANIQKMALBIN AWANG TAJUDIN B22080004 ASSIGNMENT 2 MPU3193 PHILOSOPHY AND...
AWANG ANIQKMALBIN AWANG TAJUDIN B22080004 ASSIGNMENT 2 MPU3193 PHILOSOPHY AND...
 

Marc edit and_nonmarc_data (1)

  • 1. MarcEdit: Doing more, but faster Terry Reese Gray Family Chair for Innovative Library Services Terry.reese@oregonstate.edu
  • 2. • Making your metadata work for you • Finding ways to use MarcEdit to merge and manipulate existing metadata in various formats • i.e., working with XML formats, delimited formats, Excel, Access • Dealing with data in multiple charactersets as we transition to a Unicode world • Learning how to automate repetitive tasks, and understand what editing functions are available to you • Leveraging webservices like OCLC WorldCat to provide automatic classifications
  • 5. Marc Tools • Built-in functions • MarcBreaker – Tool used to convert MARC records to the MarcEdit mnemonic format • MarcMaker – Tool used to convert MarcEdit mnemonic format to MARC • MARC=>MARC21XML – converts MARC to MARC21XML • Automatically converts data from MARC-8 to UTF8 • MARC21XML=>MARC – converts MARC21XML to MARC • Doesn’t automatically convert data from UTF8 to MARC8 – will leave data in UTF8
  • 6. MARC Character Conversions • Supports moving between any known Windows Characterset and MARC8. • Can be run from the Breaker/Maker – or as its own standalone utility
  • 7. MARCSplit/MARCJoin • Utility used for splitting large MARC record sets into smaller files • Utility used for joining large sets of MARC data to a single file
  • 8. Batch Record Processor • Allows MarcEdit to process “lots” of files. • Files can be processed against an entire folder’s contents or by file type • Can utilize any built-in or derived XML Function transformation
  • 9. MarcEdit and bad records • Two MARC breaking algorithms • Strict MARC algorithm • Loose breaking algorithm • Loose algorithm can heal MARC records (sometimes) • Structural errors • Missing field or record markers
  • 10. Delimited text translator • Delimited Text Translator • Translates Tab, comma, pipe, Excel (Office 2000-2007), Access (Office 2000-2007) files into MARC • Can save translation maps • Can create constant data
  • 11. Delimited text translator Options • Wizard-like interface • Supports Unicode data (in excel or delimited file) • Joining (relating) fields • Editing global 008/LDR
  • 12. Delimited Text Translator: Mapping format • Map to: Field + subfield • Indicators: Indicator values • Term Punct.: Trailing punctuation • Arguments – Joining defined items (select and right click on items) • Ability to save templates
  • 13. Common Joining techniques • When would I mark a field as repeatable? • By default, when the Delimited Text translator encounters two like subfields on the same field, it creates a new field. For example: column 1: This is a note column 2: This is a note 2 if I mapped column 1 500$a and column 2 to 500$a, by default, MarcEdit would generate the following output: =500 $aThis is a note =500 $aThis is a note 2 • However….
  • 14. Common Joining techniques • When would I mark a field as repeatable? • If I need to have multiple, like subfields on the same field, for example, like a subject field – we would mark the field as repeatable: column 1: Geology column 2: Oregon column 3: Corvallis If these fields were not marked as repeatable, the output would look like: =650 0$aGeology$zOregon =650 0$zCorvallis However, if these fields were marked as repeatable, the output would look like: =650 0$aGeology$zOregon$zCorvallis
  • 17. Finding and Contributing Crosswalks • In MarcEdit 5.6, an option was added to allow users to search for crosswalks • Currently, these are crosswalks I or LC have created • Hopefully, community members will submit crosswalks for inclusion into the registry
  • 19. Harvesting Metadata • MarcEdit includes a builtin OAI harvester • Allows for direct XML=>MARC translations • Allows for custom modification of XSLT translation tables.
  • 20. Harvesting Metadata • Required data • Host name: i.e., http://ir.library.oregonstate.edu/request/oai • Metadata Type • Natively supports MARCXML, Dublin Core, OAIMARC and MODS • Options to support conditional harvests, raw data harvests, and resumptive harvests.
  • 23. MarcEditor Properties • Templates • Fonts • Encodings • Preview Settings
  • 24. Configuring New Paging • Set in the Options dialog
  • 25. Paging Example • If you load the full file, or turn the preview mode off
  • 26. Editing MARC • MarcEditor • Supports a number of global editing functions: • Edit Subsets of records • Find/Replace functionality • Globally Add/Delete MARC fields • Globally Edit Subfield data • Conditionally add/remove field data • Globally Edit Indicator data • Globally Swap field data • Record Deduplication • Record Sorting • Call Number Generator • Macros
  • 27. Editing MARC – Find/Replace • Works like a normal Find/Replace in most Textpad utilities. • Unlike most Textpads, Replace supports UTF-8 (when working with UTF- 8 files) and regular expressions.
  • 28. Editing MARC – Find All • Find all function was designed for use with the Paging mode • Allows users to find any text across all pages • Generates a jump list that can be used to find individual records for edit
  • 29. Jump to • Jump to…record: • Allows you to jump to any records • Jump to…page: • Allows you to jump to any page
  • 30. Editing MARC – Global Add/Delete Field • Globally add fields to all MARC records • Allows users to set insertion position. • Globally delete fields • Allows global delete • Allows conditional delete • Supports Regular Expressions
  • 31. Editing MARC – Modifying subfield data • Allows for the modification of variable MARC field subfield data (MARC fields >10) • Allows for the modification of control field data by position or range of positions • Allows users to prepend and append data to subfields. • Allows users to change subfield tagging.
  • 32. Editing MARC – Modifying subfield data • Allows users to insert new subfields and define subfield placement. • Allows users to move field data from one field to another. • Supports: • UTF-8 with UTF-8 files • Regular Expressions • Adding new subfields.
  • 33. Editing MARC – Modifying subfield data
  • 34. Editing MARC – Swapping Fields • Swap parts of MARC Fields or entire MARC fields • Define field, indicator and subfields to move. • Can move field data and delete the original field or clone the field data and move the clone to the new location. • Can add data to an existing field.
  • 35. Character Conversions within the MarcEditor • MarcEditor allows users to convert character data between different charactersets.
  • 36. Fixing Boo-boos • MarcEdit’s Special Undo • Allows you to step back one global change.
  • 37. Sorting Fields • MarcEdit provides multiple sorting types: • Control Number • Sorts record position within the file • Title • Sorts record position within the file • Author • Sorts record position within the file • Call Number • Sorts record position within the file • 0xx Fields • Sorts the 0xx fields within individual records (does *not* change record position within a file) • All Fields • Sorts all fields within individual records (does *not* change record position within a file) • Custom Sort • Sorts all defined fields within individual records (does *not* change record position within a file)
  • 38. Record Deduplication • MarcEdit provides a simple dedup tool that can: • Dedup on a defined control field (any field) • Dedup on a transaction field (or using an additional transaction field) • Output • Removes all duplications and saves the duplications to a file • Prints just unique items within the file (i.e., those without a duplicate pair)
  • 39. Field Counts • Field Count • Provides a quick count of fields • Report of subfields used within a particular field • Detailed reports of all fields/subfields used within a fileset.
  • 40. Material Type Report • Material Type Report • Reports number of records by material type • Breaks down material type by sub-types • Utilizes the Leader, 008 and GMD to determine format types
  • 41. Task Automation Tool • Stacking Operations • Task automation provides a way for non-programmers to create defined task lists that can then be executed automatically • The different between a task and a macro is that MarcEdit tasks essentially function like the user was calling specific functions within MarcEdit. • Anything that you can do in the MarcEditor, you can automate as a task.
  • 42. Task Automation • Managing Tasks • Task management works like macro management • You can • Create new tasks • Clone tasks • Rename tasks • Delete tasks • Edit tasks
  • 43. Task Automation Demo • Additional Information: • Youtube: • Introduction to task automation: http://www.youtube.com/watch? v=gmqTGfTubU4 • Introduction to new task automation functions: http://www.youtube.com/watch?v=fnorN0MFFN0
  • 44. • MarcEdit can leverage OCLC WorldCat to generate call numbers automatically for files • Fields used: • 001 • 010$a$z • 020$a$z • 022$a$z • 024$a$z • 1xx$a • 776$w$z OCLC Classify Service
  • 47. MarcEdit 5.9+ • AACR2->RDA macros • Low-hanging conversions to support batch data processing • Merge Record Enhancements • Adding more data points and customized merge fields • More Automation support • Ability to turn Edit shortcuts into Automation tasks • Batch OAI Harvesting • Create jobs that you can schedule and have automatically run for you • Batch Set Holdings • Using either crappy z39.50 or OCLC’s yet to be publically released API for holdings settings.
  • 48. Getting Help • Call/write me: • terry.reese@oregonstate.edu • Ask the list: • MarcEdit ListServ • http://listserv.gmu.edu/cgi-bin/wa?A0=marcedit-l

Editor's Notes

  1. This is really the heart of MarcEdit All utilities and functions interact with the MARCEngine in some fashion.
  2. Best way to think of the MarcEditor is like notepad for MARC. It has been designed to work specifically with MARC data.
  3. Replace all works great for handling regular find/replace operations but can also be used to: Change field tags Using regular expressions to move subfield information from on subfield to another Using regular expressions to do complex find/replace operations.
  4. Replace all works great for handling regular find/replace operations but can also be used to: Change field tags Using regular expressions to move subfield information from on subfield to another Using regular expressions to do complex find/replace operations.
  5. The function is primarily useful if you have a field that needs to go into every record. For example, OSU receives aggregator records for EBSCOHost and we insert a text string into every record so that we can easily identify these records using listing tools within our ILS system. Another example: in our ILS system, we use a 949 field to pass command-line options to the MARC loader. When doing database maintenance operations, I can automatically add a single 949 field to all records to define the load table and common arguments to be used when loading the record.