Reusing Collection Metadata as Data

Itza Carbajal
Itza CarbajalLatin America Metadata Librarian
Reusing Collection Metadata as Data
Mapping the Spanish Mission Landscape Workshop
March 2, 2019 | University of Texas at Austin
Presentation by: Itza Carbajal, Latin American Metadata Librarian
who creates metadata?
● WHO DOESN’T is the real question
● Individuals
○ Tagging of photos, file naming, project contributions
● Information Science professionals (librarians, archivists, database managers, etc)
○ Cataloging book records
○ Access mechanisms such as finding aids, online repositories, CMS
○ Databases
● Mixed media creators
○ Film production, photography, software developers, music producers
● Publishing
○ Publication agencies, writers working with digital materials, illustrators
why is metadata created?
● Identifying
● Managing
● Searching
● Analyzing
● Designing
what type of metadata is typically captured?
Administrative
Metadata used in managing and administering collections and information
resources
Descriptive
Metadata used to identify and describe collections and related information
resources
Technical
Metadata related to how a system functions or metadata behaves
re-purpose metadata for digital scholarship
● Classroom Instruction
○ Discovery and deep group discussions
● Layered Analysis
○ Geographic Information systems
● In depth searchability
○ Transcription
capturing metadata
Scribe an open source framework for community transcription
built by NYPL Labs in collaboration with Zooniverse
Scraper gets data out of web pages and into spreadsheets
Optical Character Recognition (OCR) technologies -
including programs like Google Drive, Tesseract or Adobe
Acrobat that can detect text to make it searchable/readable
*Rate of accuracy varies and access to affordable software not consistent
accessing existing metadata
Digital Public Library of America (DPLA) open API enables people to use
millions of records describing cultural heritage resources held by
institutions across the US.
Flickr has over 5 billion photos with valuable metadata such as tags,
geolocation, and Exif data
The Europeana provides access to over 50 million digitised items – books,
music, artworks and more from thousands of European archives, libraries
and museums
HathiTrust Digital Library has more than 2 million volumes are in the
public domain and freely viewable on the Web
analyzing metadata
Map Warper built by NYPL Labs is a tool suite used to align (or "rectify")
historical maps to the digital maps of today.
Gephi an open-source software for network visualization and analysis of
data sets to summarize their main characteristics, often with visual
methods.
MALLET is a Java-based package for statistical natural language
processing, document classification, clustering, topic modeling,
information extraction, and other machine learning applications to text.
manipulating Metadata
OpenRefine - clean up messy or inconsistent data
Data Wrangler - used to merge, delete, autofill, filling in missing
data or incorporating data from another source, and move
information in your set.
Data Science Toolkit - set of open-source tools for data
science information transformation needs
thank you.Email questions to: i.carbajal@austin.utexas.edu
1 of 10

Recommended

Hacks, hackers and data journalism by
Hacks, hackers and data journalismHacks, hackers and data journalism
Hacks, hackers and data journalismGlen McGregor
1.2K views42 slides
Poster: Linked Open Data for Cultural Heritage by
Poster: Linked Open Data for Cultural HeritagePoster: Linked Open Data for Cultural Heritage
Poster: Linked Open Data for Cultural HeritageNoreen Whysel
2.3K views1 slide
Cultural Heritage Insitutions and Big Data Collections by
Cultural Heritage Insitutions and Big Data CollectionsCultural Heritage Insitutions and Big Data Collections
Cultural Heritage Insitutions and Big Data Collectionslljohnston
844 views24 slides
LODAC Museum -- Connecting Museums with LOD -- by
LODAC Museum -- Connecting Museums with LOD --LODAC Museum -- Connecting Museums with LOD --
LODAC Museum -- Connecting Museums with LOD --National Institute of Informatics (NII)
1K views22 slides
SemanticWebApp by
SemanticWebAppSemanticWebApp
SemanticWebAppAdela Beres
743 views20 slides
ELAG 2014, Workshop on Electronic Resource Management by
ELAG 2014, Workshop on Electronic Resource ManagementELAG 2014, Workshop on Electronic Resource Management
ELAG 2014, Workshop on Electronic Resource ManagementLydiaU
523 views15 slides

More Related Content

What's hot

EurnewsLDN_Toine_Pieters by
EurnewsLDN_Toine_PietersEurnewsLDN_Toine_Pieters
EurnewsLDN_Toine_PietersEuropeana Newspapers
302 views12 slides
Convergence and Interoperability (IFLA 2011) by
Convergence and Interoperability (IFLA 2011)Convergence and Interoperability (IFLA 2011)
Convergence and Interoperability (IFLA 2011)Figoblog
1.2K views15 slides
BHL-Europe for sherborn 2011 - henning scholz by
BHL-Europe for sherborn 2011 - henning scholzBHL-Europe for sherborn 2011 - henning scholz
BHL-Europe for sherborn 2011 - henning scholzcoelatura
511 views28 slides
Sherborn: Scholz - BHL-Europe: Tools and Services for Legacy Taxonomic Litera... by
Sherborn: Scholz - BHL-Europe: Tools and Services for Legacy Taxonomic Litera...Sherborn: Scholz - BHL-Europe: Tools and Services for Legacy Taxonomic Litera...
Sherborn: Scholz - BHL-Europe: Tools and Services for Legacy Taxonomic Litera...ICZN
1K views28 slides
Geographic information systems (gis) for libraries by
Geographic information systems (gis) for librariesGeographic information systems (gis) for libraries
Geographic information systems (gis) for librariesSeti Keshmiripour
1K views22 slides
agINFRA – a multilingual infrastructure for information on agricultural innov... by
agINFRA – a multilingual infrastructure for information on agricultural innov...agINFRA – a multilingual infrastructure for information on agricultural innov...
agINFRA – a multilingual infrastructure for information on agricultural innov...AIMS (Agricultural Information Management Standards)
984 views36 slides

What's hot(20)

Convergence and Interoperability (IFLA 2011) by Figoblog
Convergence and Interoperability (IFLA 2011)Convergence and Interoperability (IFLA 2011)
Convergence and Interoperability (IFLA 2011)
Figoblog1.2K views
BHL-Europe for sherborn 2011 - henning scholz by coelatura
BHL-Europe for sherborn 2011 - henning scholzBHL-Europe for sherborn 2011 - henning scholz
BHL-Europe for sherborn 2011 - henning scholz
coelatura511 views
Sherborn: Scholz - BHL-Europe: Tools and Services for Legacy Taxonomic Litera... by ICZN
Sherborn: Scholz - BHL-Europe: Tools and Services for Legacy Taxonomic Litera...Sherborn: Scholz - BHL-Europe: Tools and Services for Legacy Taxonomic Litera...
Sherborn: Scholz - BHL-Europe: Tools and Services for Legacy Taxonomic Litera...
ICZN1K views
Geographic information systems (gis) for libraries by Seti Keshmiripour
Geographic information systems (gis) for librariesGeographic information systems (gis) for libraries
Geographic information systems (gis) for libraries
Trellis by alana420
TrellisTrellis
Trellis
alana420279 views
" Overview of the Metadata in the new CountrySTAT platform " by FAO
" Overview of the Metadata in the new CountrySTAT platform "" Overview of the Metadata in the new CountrySTAT platform "
" Overview of the Metadata in the new CountrySTAT platform "
FAO217 views
Trellis_animation by alana420
Trellis_animationTrellis_animation
Trellis_animation
alana420156 views
De- and Reassembling Data Infrastructures by cgrltz
De- and Reassembling Data InfrastructuresDe- and Reassembling Data Infrastructures
De- and Reassembling Data Infrastructures
cgrltz201 views
Mapping the European(a) metadata landscape by Sally Chambers
Mapping the European(a) metadata landscapeMapping the European(a) metadata landscape
Mapping the European(a) metadata landscape
Sally Chambers1.3K views
Athena richard zijdeman by CLARIAH
Athena richard zijdemanAthena richard zijdeman
Athena richard zijdeman
CLARIAH443 views
WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA... by Micah Altman
WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...
WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...
Micah Altman2.2K views
IIIF for Index of Christian Art by Jon Stroop
IIIF for Index of Christian ArtIIIF for Index of Christian Art
IIIF for Index of Christian Art
Jon Stroop865 views
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018) by Joachim Neubert
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Joachim Neubert3.5K views
Introduction to Scratchpads & ViBRANT by Edward Baker
Introduction to Scratchpads & ViBRANTIntroduction to Scratchpads & ViBRANT
Introduction to Scratchpads & ViBRANT
Edward Baker475 views

Similar to Reusing Collection Metadata as Data

Neuroscience as networked science by
Neuroscience as networked scienceNeuroscience as networked science
Neuroscience as networked scienceNeuroscience Information Framework
245 views69 slides
Pratt Sils Knowledge Organization Fall 2008 by
Pratt Sils Knowledge Organization Fall 2008Pratt Sils Knowledge Organization Fall 2008
Pratt Sils Knowledge Organization Fall 2008PrattSILS
663 views6 slides
The Future of Metadata Management & Making Library Collections Discoverable o... by
The Future of Metadata Management & Making Library Collections Discoverable o...The Future of Metadata Management & Making Library Collections Discoverable o...
The Future of Metadata Management & Making Library Collections Discoverable o...tfons
370 views45 slides
Cataloging Presentation by
Cataloging PresentationCataloging Presentation
Cataloging PresentationAngela Dresselhaus
429 views21 slides
Ji cv6n1 by
Ji cv6n1Ji cv6n1
Ji cv6n1Gerry McKiernan
576 views17 slides
Open Science by
Open Science Open Science
Open Science Andrea Miller-Nesbitt
1.6K views17 slides

Similar to Reusing Collection Metadata as Data(20)

Pratt Sils Knowledge Organization Fall 2008 by PrattSILS
Pratt Sils Knowledge Organization Fall 2008Pratt Sils Knowledge Organization Fall 2008
Pratt Sils Knowledge Organization Fall 2008
PrattSILS663 views
The Future of Metadata Management & Making Library Collections Discoverable o... by tfons
The Future of Metadata Management & Making Library Collections Discoverable o...The Future of Metadata Management & Making Library Collections Discoverable o...
The Future of Metadata Management & Making Library Collections Discoverable o...
tfons370 views
Integrating Unique Materials into the Global Discovery Network by OCLC Research
Integrating Unique Materials into the Global Discovery NetworkIntegrating Unique Materials into the Global Discovery Network
Integrating Unique Materials into the Global Discovery Network
OCLC Research458 views
1.1 library concepts, terms and systems edited by ChandraSekhar1115
1.1 library concepts, terms and systems edited1.1 library concepts, terms and systems edited
1.1 library concepts, terms and systems edited
Pratt Sils LIS653 4 Fall 2007 by PrattSILS
Pratt Sils LIS653 4 Fall 2007Pratt Sils LIS653 4 Fall 2007
Pratt Sils LIS653 4 Fall 2007
PrattSILS396 views
A Deep Survey of the Digital Resource Landscape: Perspectives from the Neuros... by Maryann Martone
A Deep Survey of the Digital Resource Landscape:Perspectives from the Neuros...A Deep Survey of the Digital Resource Landscape:Perspectives from the Neuros...
A Deep Survey of the Digital Resource Landscape: Perspectives from the Neuros...
Maryann Martone797 views
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums by Jon Voss
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
Jon Voss3.4K views
Databases and Ontologies: Where do we go from here? by Maryann Martone
Databases and Ontologies:  Where do we go from here?Databases and Ontologies:  Where do we go from here?
Databases and Ontologies: Where do we go from here?
Maryann Martone1.2K views
Pratt SILS Knowledge Organization Spring 2011 by PrattSILS
Pratt SILS Knowledge Organization Spring 2011Pratt SILS Knowledge Organization Spring 2011
Pratt SILS Knowledge Organization Spring 2011
PrattSILS801 views
Boundless Opportunity by Rachel Frick
Boundless OpportunityBoundless Opportunity
Boundless Opportunity
Rachel Frick753 views
Data-knowledge transition zones within the biomedical research ecosystem by Maryann Martone
Data-knowledge transition zones within the biomedical research ecosystemData-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystem
Maryann Martone639 views
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012 by lljohnston
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012
lljohnston783 views
Humanities data curation slides by Harriett Green
Humanities data curation slidesHumanities data curation slides
Humanities data curation slides
Harriett Green998 views

More from Itza Carbajal

Post Custodial Metadata Development & Decisions by
Post Custodial Metadata Development & DecisionsPost Custodial Metadata Development & Decisions
Post Custodial Metadata Development & DecisionsItza Carbajal
297 views8 slides
Metadata From the Source: Participatory Metadata Models in Post-Custodial Pro... by
Metadata From the Source: Participatory Metadata Models in Post-Custodial Pro...Metadata From the Source: Participatory Metadata Models in Post-Custodial Pro...
Metadata From the Source: Participatory Metadata Models in Post-Custodial Pro...Itza Carbajal
247 views16 slides
community, communities & archives by
community, communities & archivescommunity, communities & archives
community, communities & archivesItza Carbajal
367 views19 slides
community & archives? by
community & archives?community & archives?
community & archives?Itza Carbajal
169 views17 slides
Post-Custodial Methods in Archival Practice by
Post-Custodial Methods in Archival PracticePost-Custodial Methods in Archival Practice
Post-Custodial Methods in Archival PracticeItza Carbajal
856 views20 slides
Introduction to Linked Data - Part 1 by
Introduction to Linked Data - Part 1Introduction to Linked Data - Part 1
Introduction to Linked Data - Part 1Itza Carbajal
205 views19 slides

More from Itza Carbajal(12)

Post Custodial Metadata Development & Decisions by Itza Carbajal
Post Custodial Metadata Development & DecisionsPost Custodial Metadata Development & Decisions
Post Custodial Metadata Development & Decisions
Itza Carbajal297 views
Metadata From the Source: Participatory Metadata Models in Post-Custodial Pro... by Itza Carbajal
Metadata From the Source: Participatory Metadata Models in Post-Custodial Pro...Metadata From the Source: Participatory Metadata Models in Post-Custodial Pro...
Metadata From the Source: Participatory Metadata Models in Post-Custodial Pro...
Itza Carbajal247 views
community, communities & archives by Itza Carbajal
community, communities & archivescommunity, communities & archives
community, communities & archives
Itza Carbajal367 views
Post-Custodial Methods in Archival Practice by Itza Carbajal
Post-Custodial Methods in Archival PracticePost-Custodial Methods in Archival Practice
Post-Custodial Methods in Archival Practice
Itza Carbajal856 views
Introduction to Linked Data - Part 1 by Itza Carbajal
Introduction to Linked Data - Part 1Introduction to Linked Data - Part 1
Introduction to Linked Data - Part 1
Itza Carbajal205 views
CROSSING BORDERS: Why Archival Science Students Benefit from Interdepartment... by Itza Carbajal
CROSSING BORDERS:  Why Archival Science Students Benefit from Interdepartment...CROSSING BORDERS:  Why Archival Science Students Benefit from Interdepartment...
CROSSING BORDERS: Why Archival Science Students Benefit from Interdepartment...
Itza Carbajal262 views
Creating Knowledges: A Discussion on the Significance of Gloria Anzaldúa and ... by Itza Carbajal
Creating Knowledges: A Discussion on the Significance of Gloria Anzaldúa and ...Creating Knowledges: A Discussion on the Significance of Gloria Anzaldúa and ...
Creating Knowledges: A Discussion on the Significance of Gloria Anzaldúa and ...
Itza Carbajal142 views
Centering Consent: Investigating Archival Donor Relations Practices Panel by Itza Carbajal
Centering Consent: Investigating Archival Donor Relations Practices PanelCentering Consent: Investigating Archival Donor Relations Practices Panel
Centering Consent: Investigating Archival Donor Relations Practices Panel
Itza Carbajal102 views
Radical Shared History Online Portal Work Session by Itza Carbajal
Radical Shared History Online Portal Work SessionRadical Shared History Online Portal Work Session
Radical Shared History Online Portal Work Session
Itza Carbajal235 views
Digital Keepers: Ethics of Saving Online Data About Latin American Social Mo... by Itza Carbajal
Digital Keepers:  Ethics of Saving Online Data About Latin American Social Mo...Digital Keepers:  Ethics of Saving Online Data About Latin American Social Mo...
Digital Keepers: Ethics of Saving Online Data About Latin American Social Mo...
Itza Carbajal187 views
Defining the Archive on Our Terms: A Look at the Esperanza Peace and Justice ... by Itza Carbajal
Defining the Archive on Our Terms: A Look at the Esperanza Peace and Justice ...Defining the Archive on Our Terms: A Look at the Esperanza Peace and Justice ...
Defining the Archive on Our Terms: A Look at the Esperanza Peace and Justice ...
Itza Carbajal377 views

Recently uploaded

PROGRAMME.pdf by
PROGRAMME.pdfPROGRAMME.pdf
PROGRAMME.pdfHiNedHaJar
17 views13 slides
Vikas 500 BIG DATA TECHNOLOGIES LAB.pdf by
Vikas 500 BIG DATA TECHNOLOGIES LAB.pdfVikas 500 BIG DATA TECHNOLOGIES LAB.pdf
Vikas 500 BIG DATA TECHNOLOGIES LAB.pdfvikas12611618
8 views30 slides
Short Story Assignment by Kelly Nguyen by
Short Story Assignment by Kelly NguyenShort Story Assignment by Kelly Nguyen
Short Story Assignment by Kelly Nguyenkellynguyen01
18 views17 slides
How Leaders See Data? (Level 1) by
How Leaders See Data? (Level 1)How Leaders See Data? (Level 1)
How Leaders See Data? (Level 1)Narendra Narendra
13 views76 slides
UNEP FI CRS Climate Risk Results.pptx by
UNEP FI CRS Climate Risk Results.pptxUNEP FI CRS Climate Risk Results.pptx
UNEP FI CRS Climate Risk Results.pptxpekka28
11 views51 slides
Chapter 3b- Process Communication (1) (1)(1) (1).pptx by
Chapter 3b- Process Communication (1) (1)(1) (1).pptxChapter 3b- Process Communication (1) (1)(1) (1).pptx
Chapter 3b- Process Communication (1) (1)(1) (1).pptxayeshabaig2004
5 views30 slides

Recently uploaded(20)

Vikas 500 BIG DATA TECHNOLOGIES LAB.pdf by vikas12611618
Vikas 500 BIG DATA TECHNOLOGIES LAB.pdfVikas 500 BIG DATA TECHNOLOGIES LAB.pdf
Vikas 500 BIG DATA TECHNOLOGIES LAB.pdf
vikas126116188 views
Short Story Assignment by Kelly Nguyen by kellynguyen01
Short Story Assignment by Kelly NguyenShort Story Assignment by Kelly Nguyen
Short Story Assignment by Kelly Nguyen
kellynguyen0118 views
UNEP FI CRS Climate Risk Results.pptx by pekka28
UNEP FI CRS Climate Risk Results.pptxUNEP FI CRS Climate Risk Results.pptx
UNEP FI CRS Climate Risk Results.pptx
pekka2811 views
Chapter 3b- Process Communication (1) (1)(1) (1).pptx by ayeshabaig2004
Chapter 3b- Process Communication (1) (1)(1) (1).pptxChapter 3b- Process Communication (1) (1)(1) (1).pptx
Chapter 3b- Process Communication (1) (1)(1) (1).pptx
ayeshabaig20045 views
Advanced_Recommendation_Systems_Presentation.pptx by neeharikasingh29
Advanced_Recommendation_Systems_Presentation.pptxAdvanced_Recommendation_Systems_Presentation.pptx
Advanced_Recommendation_Systems_Presentation.pptx
3196 The Case of The East River by ErickANDRADE90
3196 The Case of The East River3196 The Case of The East River
3196 The Case of The East River
ErickANDRADE9011 views
Cross-network in Google Analytics 4.pdf by GA4 Tutorials
Cross-network in Google Analytics 4.pdfCross-network in Google Analytics 4.pdf
Cross-network in Google Analytics 4.pdf
GA4 Tutorials6 views
Organic Shopping in Google Analytics 4.pdf by GA4 Tutorials
Organic Shopping in Google Analytics 4.pdfOrganic Shopping in Google Analytics 4.pdf
Organic Shopping in Google Analytics 4.pdf
GA4 Tutorials10 views
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation by DataScienceConferenc1
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation
Understanding Hallucinations in LLMs - 2023 09 29.pptx by Greg Makowski
Understanding Hallucinations in LLMs - 2023 09 29.pptxUnderstanding Hallucinations in LLMs - 2023 09 29.pptx
Understanding Hallucinations in LLMs - 2023 09 29.pptx
Greg Makowski13 views
Introduction to Microsoft Fabric.pdf by ishaniuudeshika
Introduction to Microsoft Fabric.pdfIntroduction to Microsoft Fabric.pdf
Introduction to Microsoft Fabric.pdf
ishaniuudeshika24 views
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx by DataScienceConferenc1
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx
Data structure and algorithm. by Abdul salam
Data structure and algorithm. Data structure and algorithm.
Data structure and algorithm.
Abdul salam 18 views
Survey on Factuality in LLM's.pptx by NeethaSherra1
Survey on Factuality in LLM's.pptxSurvey on Factuality in LLM's.pptx
Survey on Factuality in LLM's.pptx
NeethaSherra15 views
Supercharging your Data with Azure AI Search and Azure OpenAI by Peter Gallagher
Supercharging your Data with Azure AI Search and Azure OpenAISupercharging your Data with Azure AI Search and Azure OpenAI
Supercharging your Data with Azure AI Search and Azure OpenAI
Peter Gallagher37 views
CRIJ4385_Death Penalty_F23.pptx by yvettemm100
CRIJ4385_Death Penalty_F23.pptxCRIJ4385_Death Penalty_F23.pptx
CRIJ4385_Death Penalty_F23.pptx
yvettemm1006 views

Reusing Collection Metadata as Data

  • 1. Reusing Collection Metadata as Data Mapping the Spanish Mission Landscape Workshop March 2, 2019 | University of Texas at Austin Presentation by: Itza Carbajal, Latin American Metadata Librarian
  • 2. who creates metadata? ● WHO DOESN’T is the real question ● Individuals ○ Tagging of photos, file naming, project contributions ● Information Science professionals (librarians, archivists, database managers, etc) ○ Cataloging book records ○ Access mechanisms such as finding aids, online repositories, CMS ○ Databases ● Mixed media creators ○ Film production, photography, software developers, music producers ● Publishing ○ Publication agencies, writers working with digital materials, illustrators
  • 3. why is metadata created? ● Identifying ● Managing ● Searching ● Analyzing ● Designing
  • 4. what type of metadata is typically captured? Administrative Metadata used in managing and administering collections and information resources Descriptive Metadata used to identify and describe collections and related information resources Technical Metadata related to how a system functions or metadata behaves
  • 5. re-purpose metadata for digital scholarship ● Classroom Instruction ○ Discovery and deep group discussions ● Layered Analysis ○ Geographic Information systems ● In depth searchability ○ Transcription
  • 6. capturing metadata Scribe an open source framework for community transcription built by NYPL Labs in collaboration with Zooniverse Scraper gets data out of web pages and into spreadsheets Optical Character Recognition (OCR) technologies - including programs like Google Drive, Tesseract or Adobe Acrobat that can detect text to make it searchable/readable *Rate of accuracy varies and access to affordable software not consistent
  • 7. accessing existing metadata Digital Public Library of America (DPLA) open API enables people to use millions of records describing cultural heritage resources held by institutions across the US. Flickr has over 5 billion photos with valuable metadata such as tags, geolocation, and Exif data The Europeana provides access to over 50 million digitised items – books, music, artworks and more from thousands of European archives, libraries and museums HathiTrust Digital Library has more than 2 million volumes are in the public domain and freely viewable on the Web
  • 8. analyzing metadata Map Warper built by NYPL Labs is a tool suite used to align (or "rectify") historical maps to the digital maps of today. Gephi an open-source software for network visualization and analysis of data sets to summarize their main characteristics, often with visual methods. MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text.
  • 9. manipulating Metadata OpenRefine - clean up messy or inconsistent data Data Wrangler - used to merge, delete, autofill, filling in missing data or incorporating data from another source, and move information in your set. Data Science Toolkit - set of open-source tools for data science information transformation needs
  • 10. thank you.Email questions to: i.carbajal@austin.utexas.edu