SlideShare a Scribd company logo
1 of 4
Metadata remediation
Marina Georgieva
March 19, 2019
Special Collections and Archives Division Meeting
Updates, procedures, workflow
Extract raw
metadata
Workflow
Analyze
data
Apply best
practices
Prepare
worksheets
REMEDIATE METADATA
Clean up ARKs Enhance
Collaboration with the Metadata Migration Workgroup
Update
documentation
Updates
Completed Next
● Inactive digital collections
○ NV Test Site
○ Menus
○ Showgirls
○ Boomtown
○ ∞
3 projects
6 remediation cycles
41,937 objects
How | Why?
How we remediate? Why is remediation important?
● Supports the DAMS migration project
● High priority for Phase 1 of migration
project
● Clean data is easier to migrate
● Clean data is consistent
● Data is uniform across all collections
Tool: Excel
● Advanced features
○ Formulas
○ Functions
○ VBA code
● Advantages
○ Easy to use
○ Easy to share
○ Easy to learn
○ Powerful for large sets of data

More Related Content

Similar to Metadata Remediation Procedures and Workflow Updates

Counting Unique Users in Real-Time: Here's a Challenge for You!
Counting Unique Users in Real-Time: Here's a Challenge for You!Counting Unique Users in Real-Time: Here's a Challenge for You!
Counting Unique Users in Real-Time: Here's a Challenge for You!DataWorks Summit
 
Data and AI summit: data pipelines observability with open lineage
Data and AI summit: data pipelines observability with open lineageData and AI summit: data pipelines observability with open lineage
Data and AI summit: data pipelines observability with open lineageJulien Le Dem
 
Observability for Data Pipelines With OpenLineage
Observability for Data Pipelines With OpenLineageObservability for Data Pipelines With OpenLineage
Observability for Data Pipelines With OpenLineageDatabricks
 
MongoDB World 2019: Packing Up Your Data and Moving to MongoDB Atlas
MongoDB World 2019: Packing Up Your Data and Moving to MongoDB AtlasMongoDB World 2019: Packing Up Your Data and Moving to MongoDB Atlas
MongoDB World 2019: Packing Up Your Data and Moving to MongoDB AtlasMongoDB
 
(Greach 2015) Decathlon Sport Meeting
(Greach 2015) Decathlon Sport Meeting(Greach 2015) Decathlon Sport Meeting
(Greach 2015) Decathlon Sport MeetingAlonso Torres
 
What's New in Cartegraph
What's New in CartegraphWhat's New in Cartegraph
What's New in CartegraphCartegraph
 
Our journey with druid - from initial research to full production scale
Our journey with druid - from initial research to full production scaleOur journey with druid - from initial research to full production scale
Our journey with druid - from initial research to full production scaleItai Yaffe
 
Production ready big ml workflows from zero to hero daniel marcous @ waze
Production ready big ml workflows from zero to hero daniel marcous @ wazeProduction ready big ml workflows from zero to hero daniel marcous @ waze
Production ready big ml workflows from zero to hero daniel marcous @ wazeIdo Shilon
 
Keepin’ It Real(-Time) With Nadine Farah | Current 2022
Keepin’ It Real(-Time) With Nadine Farah | Current 2022Keepin’ It Real(-Time) With Nadine Farah | Current 2022
Keepin’ It Real(-Time) With Nadine Farah | Current 2022HostedbyConfluent
 
Spring Data Neo4j: Graph Power Your Enterprise Apps
Spring Data Neo4j: Graph Power Your Enterprise AppsSpring Data Neo4j: Graph Power Your Enterprise Apps
Spring Data Neo4j: Graph Power Your Enterprise AppsGraphAware
 
A primer on building real time data-driven products
A primer on building real time data-driven productsA primer on building real time data-driven products
A primer on building real time data-driven productsLars Albertsson
 
Apache Iceberg - A Table Format for Hige Analytic Datasets
Apache Iceberg - A Table Format for Hige Analytic DatasetsApache Iceberg - A Table Format for Hige Analytic Datasets
Apache Iceberg - A Table Format for Hige Analytic DatasetsAlluxio, Inc.
 
MapReduce: Optimizations, Limitations, and Open Issues
MapReduce: Optimizations, Limitations, and Open IssuesMapReduce: Optimizations, Limitations, and Open Issues
MapReduce: Optimizations, Limitations, and Open IssuesVasia Kalavri
 
Lambda architecture @ Indix
Lambda architecture @ IndixLambda architecture @ Indix
Lambda architecture @ IndixRajesh Muppalla
 
Data Engineer's Lunch #85: Designing a Modern Data Stack
Data Engineer's Lunch #85: Designing a Modern Data StackData Engineer's Lunch #85: Designing a Modern Data Stack
Data Engineer's Lunch #85: Designing a Modern Data StackAnant Corporation
 
Webinar slides: DevOps Tutorial: how to automate your database infrastructure
Webinar slides: DevOps Tutorial: how to automate your database infrastructureWebinar slides: DevOps Tutorial: how to automate your database infrastructure
Webinar slides: DevOps Tutorial: how to automate your database infrastructureSeveralnines
 
Networks are like onions: Practical Deep Learning with TensorFlow
Networks are like onions: Practical Deep Learning with TensorFlowNetworks are like onions: Practical Deep Learning with TensorFlow
Networks are like onions: Practical Deep Learning with TensorFlowBarbara Fusinska
 
How to Develop and Operate Cloud Native Data Platforms and Applications
How to Develop and Operate Cloud Native Data Platforms and ApplicationsHow to Develop and Operate Cloud Native Data Platforms and Applications
How to Develop and Operate Cloud Native Data Platforms and ApplicationsAlluxio, Inc.
 
Finding the best solution for Image Processing
Finding the best solution for Image ProcessingFinding the best solution for Image Processing
Finding the best solution for Image ProcessingTech Triveni
 
Data Enginering from Google Data Warehouse
Data Enginering from Google Data WarehouseData Enginering from Google Data Warehouse
Data Enginering from Google Data Warehousearungansi
 

Similar to Metadata Remediation Procedures and Workflow Updates (20)

Counting Unique Users in Real-Time: Here's a Challenge for You!
Counting Unique Users in Real-Time: Here's a Challenge for You!Counting Unique Users in Real-Time: Here's a Challenge for You!
Counting Unique Users in Real-Time: Here's a Challenge for You!
 
Data and AI summit: data pipelines observability with open lineage
Data and AI summit: data pipelines observability with open lineageData and AI summit: data pipelines observability with open lineage
Data and AI summit: data pipelines observability with open lineage
 
Observability for Data Pipelines With OpenLineage
Observability for Data Pipelines With OpenLineageObservability for Data Pipelines With OpenLineage
Observability for Data Pipelines With OpenLineage
 
MongoDB World 2019: Packing Up Your Data and Moving to MongoDB Atlas
MongoDB World 2019: Packing Up Your Data and Moving to MongoDB AtlasMongoDB World 2019: Packing Up Your Data and Moving to MongoDB Atlas
MongoDB World 2019: Packing Up Your Data and Moving to MongoDB Atlas
 
(Greach 2015) Decathlon Sport Meeting
(Greach 2015) Decathlon Sport Meeting(Greach 2015) Decathlon Sport Meeting
(Greach 2015) Decathlon Sport Meeting
 
What's New in Cartegraph
What's New in CartegraphWhat's New in Cartegraph
What's New in Cartegraph
 
Our journey with druid - from initial research to full production scale
Our journey with druid - from initial research to full production scaleOur journey with druid - from initial research to full production scale
Our journey with druid - from initial research to full production scale
 
Production ready big ml workflows from zero to hero daniel marcous @ waze
Production ready big ml workflows from zero to hero daniel marcous @ wazeProduction ready big ml workflows from zero to hero daniel marcous @ waze
Production ready big ml workflows from zero to hero daniel marcous @ waze
 
Keepin’ It Real(-Time) With Nadine Farah | Current 2022
Keepin’ It Real(-Time) With Nadine Farah | Current 2022Keepin’ It Real(-Time) With Nadine Farah | Current 2022
Keepin’ It Real(-Time) With Nadine Farah | Current 2022
 
Spring Data Neo4j: Graph Power Your Enterprise Apps
Spring Data Neo4j: Graph Power Your Enterprise AppsSpring Data Neo4j: Graph Power Your Enterprise Apps
Spring Data Neo4j: Graph Power Your Enterprise Apps
 
A primer on building real time data-driven products
A primer on building real time data-driven productsA primer on building real time data-driven products
A primer on building real time data-driven products
 
Apache Iceberg - A Table Format for Hige Analytic Datasets
Apache Iceberg - A Table Format for Hige Analytic DatasetsApache Iceberg - A Table Format for Hige Analytic Datasets
Apache Iceberg - A Table Format for Hige Analytic Datasets
 
MapReduce: Optimizations, Limitations, and Open Issues
MapReduce: Optimizations, Limitations, and Open IssuesMapReduce: Optimizations, Limitations, and Open Issues
MapReduce: Optimizations, Limitations, and Open Issues
 
Lambda architecture @ Indix
Lambda architecture @ IndixLambda architecture @ Indix
Lambda architecture @ Indix
 
Data Engineer's Lunch #85: Designing a Modern Data Stack
Data Engineer's Lunch #85: Designing a Modern Data StackData Engineer's Lunch #85: Designing a Modern Data Stack
Data Engineer's Lunch #85: Designing a Modern Data Stack
 
Webinar slides: DevOps Tutorial: how to automate your database infrastructure
Webinar slides: DevOps Tutorial: how to automate your database infrastructureWebinar slides: DevOps Tutorial: how to automate your database infrastructure
Webinar slides: DevOps Tutorial: how to automate your database infrastructure
 
Networks are like onions: Practical Deep Learning with TensorFlow
Networks are like onions: Practical Deep Learning with TensorFlowNetworks are like onions: Practical Deep Learning with TensorFlow
Networks are like onions: Practical Deep Learning with TensorFlow
 
How to Develop and Operate Cloud Native Data Platforms and Applications
How to Develop and Operate Cloud Native Data Platforms and ApplicationsHow to Develop and Operate Cloud Native Data Platforms and Applications
How to Develop and Operate Cloud Native Data Platforms and Applications
 
Finding the best solution for Image Processing
Finding the best solution for Image ProcessingFinding the best solution for Image Processing
Finding the best solution for Image Processing
 
Data Enginering from Google Data Warehouse
Data Enginering from Google Data WarehouseData Enginering from Google Data Warehouse
Data Enginering from Google Data Warehouse
 

More from Marina Georgieva

Metadata for compound objects | training
Metadata for compound objects | trainingMetadata for compound objects | training
Metadata for compound objects | trainingMarina Georgieva
 
In-house vs. Outsourced Digitization: similarities, key differences and pitfa...
In-house vs. Outsourced Digitization: similarities, key differences and pitfa...In-house vs. Outsourced Digitization: similarities, key differences and pitfa...
In-house vs. Outsourced Digitization: similarities, key differences and pitfa...Marina Georgieva
 
Metadata: An Overview for Digital Collections
Metadata: An Overview for Digital CollectionsMetadata: An Overview for Digital Collections
Metadata: An Overview for Digital CollectionsMarina Georgieva
 
Creating websites and leading librarians to a new level of project engagement
Creating websites and leading librarians to a new level of project engagementCreating websites and leading librarians to a new level of project engagement
Creating websites and leading librarians to a new level of project engagementMarina Georgieva
 
From Temporary to Transformative: Leveraging Externally-Funded Special Collec...
From Temporary to Transformative: Leveraging Externally-Funded Special Collec...From Temporary to Transformative: Leveraging Externally-Funded Special Collec...
From Temporary to Transformative: Leveraging Externally-Funded Special Collec...Marina Georgieva
 
2018 Professional accomplishments in numbers
2018 Professional accomplishments in numbers2018 Professional accomplishments in numbers
2018 Professional accomplishments in numbersMarina Georgieva
 
Building websites and leading librarians to a new level of project engagement
Building websites and leading librarians to a new level of project engagementBuilding websites and leading librarians to a new level of project engagement
Building websites and leading librarians to a new level of project engagementMarina Georgieva
 
The digital librarian: the liaison between digital collections and digital pr...
The digital librarian: the liaison between digital collections and digital pr...The digital librarian: the liaison between digital collections and digital pr...
The digital librarian: the liaison between digital collections and digital pr...Marina Georgieva
 
Digitization revealed (2018 NLA Annual Conference)
Digitization revealed (2018 NLA Annual Conference)Digitization revealed (2018 NLA Annual Conference)
Digitization revealed (2018 NLA Annual Conference)Marina Georgieva
 
Project Management Poster Handout for ALA Annual 2018 attendees
Project Management Poster Handout for ALA Annual 2018 attendeesProject Management Poster Handout for ALA Annual 2018 attendees
Project Management Poster Handout for ALA Annual 2018 attendeesMarina Georgieva
 
Project Management Poster at ALA Annual 2018
Project Management Poster at ALA Annual 2018Project Management Poster at ALA Annual 2018
Project Management Poster at ALA Annual 2018Marina Georgieva
 
ContentDm Landing pages for Digital Collections
ContentDm Landing pages for Digital CollectionsContentDm Landing pages for Digital Collections
ContentDm Landing pages for Digital CollectionsMarina Georgieva
 
Nevada Digital Newspaper Project at the Clark County Nevada Genealogy Meeting
Nevada Digital Newspaper Project at the Clark County Nevada Genealogy MeetingNevada Digital Newspaper Project at the Clark County Nevada Genealogy Meeting
Nevada Digital Newspaper Project at the Clark County Nevada Genealogy MeetingMarina Georgieva
 
Nevada Digital Newspaper Project and Chronicling America Presentation
Nevada Digital Newspaper Project and Chronicling America PresentationNevada Digital Newspaper Project and Chronicling America Presentation
Nevada Digital Newspaper Project and Chronicling America PresentationMarina Georgieva
 
Nevada Digital Newspaper Project | Upcoming events
Nevada Digital Newspaper Project | Upcoming eventsNevada Digital Newspaper Project | Upcoming events
Nevada Digital Newspaper Project | Upcoming eventsMarina Georgieva
 
Nevada Digital Newspaper Project | New addition to Chronicling America
Nevada Digital Newspaper Project | New addition to Chronicling AmericaNevada Digital Newspaper Project | New addition to Chronicling America
Nevada Digital Newspaper Project | New addition to Chronicling AmericaMarina Georgieva
 
Large-scale digitization plan | UNLV Libraries, Dec 2017
Large-scale digitization plan | UNLV Libraries, Dec 2017Large-scale digitization plan | UNLV Libraries, Dec 2017
Large-scale digitization plan | UNLV Libraries, Dec 2017Marina Georgieva
 
Nevada Digital Newspaper Project | SC Division Meeting Update (Feb 2018)
Nevada Digital Newspaper Project | SC Division Meeting Update (Feb 2018)Nevada Digital Newspaper Project | SC Division Meeting Update (Feb 2018)
Nevada Digital Newspaper Project | SC Division Meeting Update (Feb 2018)Marina Georgieva
 
Inforgraphic: facts about NDNP batch
Inforgraphic: facts about NDNP batchInforgraphic: facts about NDNP batch
Inforgraphic: facts about NDNP batchMarina Georgieva
 
Nevada Digital Newspaper Project Midterm Status
Nevada Digital Newspaper Project Midterm StatusNevada Digital Newspaper Project Midterm Status
Nevada Digital Newspaper Project Midterm StatusMarina Georgieva
 

More from Marina Georgieva (20)

Metadata for compound objects | training
Metadata for compound objects | trainingMetadata for compound objects | training
Metadata for compound objects | training
 
In-house vs. Outsourced Digitization: similarities, key differences and pitfa...
In-house vs. Outsourced Digitization: similarities, key differences and pitfa...In-house vs. Outsourced Digitization: similarities, key differences and pitfa...
In-house vs. Outsourced Digitization: similarities, key differences and pitfa...
 
Metadata: An Overview for Digital Collections
Metadata: An Overview for Digital CollectionsMetadata: An Overview for Digital Collections
Metadata: An Overview for Digital Collections
 
Creating websites and leading librarians to a new level of project engagement
Creating websites and leading librarians to a new level of project engagementCreating websites and leading librarians to a new level of project engagement
Creating websites and leading librarians to a new level of project engagement
 
From Temporary to Transformative: Leveraging Externally-Funded Special Collec...
From Temporary to Transformative: Leveraging Externally-Funded Special Collec...From Temporary to Transformative: Leveraging Externally-Funded Special Collec...
From Temporary to Transformative: Leveraging Externally-Funded Special Collec...
 
2018 Professional accomplishments in numbers
2018 Professional accomplishments in numbers2018 Professional accomplishments in numbers
2018 Professional accomplishments in numbers
 
Building websites and leading librarians to a new level of project engagement
Building websites and leading librarians to a new level of project engagementBuilding websites and leading librarians to a new level of project engagement
Building websites and leading librarians to a new level of project engagement
 
The digital librarian: the liaison between digital collections and digital pr...
The digital librarian: the liaison between digital collections and digital pr...The digital librarian: the liaison between digital collections and digital pr...
The digital librarian: the liaison between digital collections and digital pr...
 
Digitization revealed (2018 NLA Annual Conference)
Digitization revealed (2018 NLA Annual Conference)Digitization revealed (2018 NLA Annual Conference)
Digitization revealed (2018 NLA Annual Conference)
 
Project Management Poster Handout for ALA Annual 2018 attendees
Project Management Poster Handout for ALA Annual 2018 attendeesProject Management Poster Handout for ALA Annual 2018 attendees
Project Management Poster Handout for ALA Annual 2018 attendees
 
Project Management Poster at ALA Annual 2018
Project Management Poster at ALA Annual 2018Project Management Poster at ALA Annual 2018
Project Management Poster at ALA Annual 2018
 
ContentDm Landing pages for Digital Collections
ContentDm Landing pages for Digital CollectionsContentDm Landing pages for Digital Collections
ContentDm Landing pages for Digital Collections
 
Nevada Digital Newspaper Project at the Clark County Nevada Genealogy Meeting
Nevada Digital Newspaper Project at the Clark County Nevada Genealogy MeetingNevada Digital Newspaper Project at the Clark County Nevada Genealogy Meeting
Nevada Digital Newspaper Project at the Clark County Nevada Genealogy Meeting
 
Nevada Digital Newspaper Project and Chronicling America Presentation
Nevada Digital Newspaper Project and Chronicling America PresentationNevada Digital Newspaper Project and Chronicling America Presentation
Nevada Digital Newspaper Project and Chronicling America Presentation
 
Nevada Digital Newspaper Project | Upcoming events
Nevada Digital Newspaper Project | Upcoming eventsNevada Digital Newspaper Project | Upcoming events
Nevada Digital Newspaper Project | Upcoming events
 
Nevada Digital Newspaper Project | New addition to Chronicling America
Nevada Digital Newspaper Project | New addition to Chronicling AmericaNevada Digital Newspaper Project | New addition to Chronicling America
Nevada Digital Newspaper Project | New addition to Chronicling America
 
Large-scale digitization plan | UNLV Libraries, Dec 2017
Large-scale digitization plan | UNLV Libraries, Dec 2017Large-scale digitization plan | UNLV Libraries, Dec 2017
Large-scale digitization plan | UNLV Libraries, Dec 2017
 
Nevada Digital Newspaper Project | SC Division Meeting Update (Feb 2018)
Nevada Digital Newspaper Project | SC Division Meeting Update (Feb 2018)Nevada Digital Newspaper Project | SC Division Meeting Update (Feb 2018)
Nevada Digital Newspaper Project | SC Division Meeting Update (Feb 2018)
 
Inforgraphic: facts about NDNP batch
Inforgraphic: facts about NDNP batchInforgraphic: facts about NDNP batch
Inforgraphic: facts about NDNP batch
 
Nevada Digital Newspaper Project Midterm Status
Nevada Digital Newspaper Project Midterm StatusNevada Digital Newspaper Project Midterm Status
Nevada Digital Newspaper Project Midterm Status
 

Recently uploaded

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 

Recently uploaded (20)

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 

Metadata Remediation Procedures and Workflow Updates

  • 1. Metadata remediation Marina Georgieva March 19, 2019 Special Collections and Archives Division Meeting Updates, procedures, workflow
  • 2. Extract raw metadata Workflow Analyze data Apply best practices Prepare worksheets REMEDIATE METADATA Clean up ARKs Enhance Collaboration with the Metadata Migration Workgroup Update documentation
  • 3. Updates Completed Next ● Inactive digital collections ○ NV Test Site ○ Menus ○ Showgirls ○ Boomtown ○ ∞ 3 projects 6 remediation cycles 41,937 objects
  • 4. How | Why? How we remediate? Why is remediation important? ● Supports the DAMS migration project ● High priority for Phase 1 of migration project ● Clean data is easier to migrate ● Clean data is consistent ● Data is uniform across all collections Tool: Excel ● Advanced features ○ Formulas ○ Functions ○ VBA code ● Advantages ○ Easy to use ○ Easy to share ○ Easy to learn ○ Powerful for large sets of data

Editor's Notes

  1. Let me introduce you to the metadata remediation workflow and share some project updates, and explain why metadata remediation is one of the high priority projects in Digital Collections this year. Darnelle is taking the lead on this project and I am honored to work with him. This brief talk introduces the metadata remediation workflow and updates from my perspective.
  2. Extract raw metadata - exporting original metadata from ContentDm in a text file and importing it in Excel spreadsheet Analyze data - find abnormalities or patterns; think of ways to incorporate best practices from past projects or ways to twist best practices to accommodate the new project peculiarities Best practices include (1) check if data imported well - no errors in any cells (2) check for duplicates prior to data clean up (3) look for inconsistencies across the fields (4) review old metadata profile fields and make decisions how to map values to new profile to capture all legacy metadata (5) version control Prepare worksheets includes creating Excel spreadsheets for remediating complex fields like subject, name authorities, spatial fields. REMEDIATION process includes assigning ARKs (persistent identifiers) to all digital objects at a parent level, cleaning up messy data to improve consistency across all collections, and enhancing certain fields by adding metadata or extracting metadata and moving it to a more appropriate field (for example moving dates from description to date field to enable faceting) Updating documentation includes updating all shared spreadsheets and documents what was completed, future considerations and future projects, recommendations and updating other Migration Workgroup members what been done. Meanwhile, the remediation process doesn’t happen on its own - it’s rather a collaboration that stretches out of Digital Collections to WADS and Technical Services. I’d like to emphasize its a great deal of searching for metadata and file discrepancies in the Vault as well as inaccuracies in the metadata and the Finding aids and correcting those as part of a bigger effort to migrate clean data in the new DAMS.
  3. During the testing and learning phase of the remediation work, I noticed that data is unique and it’s hard to draw conclusions and create a standard remediation procedure. Darnelle trained me and showed me several ways to accomplish the same result, but often I had to go beyond and twist the procedures to learn a new method or to accommodate collection with very peculiar unstandardized data. The good thing is that after many trials and errors, and several remediations of the same data sets, some patterns started to emerge and I’m able to continue the remediation work with more structured approach that I can apply to particular fields. Sadly, due to the unique content of the collections I can’t apply the same strategies on a collection level, but rather on a metadata field level. So, I can say I developed best practices for my own workflow to make my work more efficient.