SlideShare a Scribd company logo
Reusing Collection Metadata as Data
Mapping the Spanish Mission Landscape Workshop
March 2, 2019 | University of Texas at Austin
Presentation by: Itza Carbajal, Latin American Metadata Librarian
who creates metadata?
● WHO DOESN’T is the real question
● Individuals
○ Tagging of photos, file naming, project contributions
● Information Science professionals (librarians, archivists, database managers, etc)
○ Cataloging book records
○ Access mechanisms such as finding aids, online repositories, CMS
○ Databases
● Mixed media creators
○ Film production, photography, software developers, music producers
● Publishing
○ Publication agencies, writers working with digital materials, illustrators
why is metadata created?
● Identifying
● Managing
● Searching
● Analyzing
● Designing
what type of metadata is typically captured?
Administrative
Metadata used in managing and administering collections and information
resources
Descriptive
Metadata used to identify and describe collections and related information
resources
Technical
Metadata related to how a system functions or metadata behaves
re-purpose metadata for digital scholarship
● Classroom Instruction
○ Discovery and deep group discussions
● Layered Analysis
○ Geographic Information systems
● In depth searchability
○ Transcription
capturing metadata
Scribe an open source framework for community transcription
built by NYPL Labs in collaboration with Zooniverse
Scraper gets data out of web pages and into spreadsheets
Optical Character Recognition (OCR) technologies -
including programs like Google Drive, Tesseract or Adobe
Acrobat that can detect text to make it searchable/readable
*Rate of accuracy varies and access to affordable software not consistent
accessing existing metadata
Digital Public Library of America (DPLA) open API enables people to use
millions of records describing cultural heritage resources held by
institutions across the US.
Flickr has over 5 billion photos with valuable metadata such as tags,
geolocation, and Exif data
The Europeana provides access to over 50 million digitised items – books,
music, artworks and more from thousands of European archives, libraries
and museums
HathiTrust Digital Library has more than 2 million volumes are in the
public domain and freely viewable on the Web
analyzing metadata
Map Warper built by NYPL Labs is a tool suite used to align (or "rectify")
historical maps to the digital maps of today.
Gephi an open-source software for network visualization and analysis of
data sets to summarize their main characteristics, often with visual
methods.
MALLET is a Java-based package for statistical natural language
processing, document classification, clustering, topic modeling,
information extraction, and other machine learning applications to text.
manipulating Metadata
OpenRefine - clean up messy or inconsistent data
Data Wrangler - used to merge, delete, autofill, filling in missing
data or incorporating data from another source, and move
information in your set.
Data Science Toolkit - set of open-source tools for data
science information transformation needs
thank you.Email questions to: i.carbajal@austin.utexas.edu

More Related Content

What's hot

EurnewsLDN_Toine_Pieters
EurnewsLDN_Toine_PietersEurnewsLDN_Toine_Pieters
EurnewsLDN_Toine_Pieters
Europeana Newspapers
 
Convergence and Interoperability (IFLA 2011)
Convergence and Interoperability (IFLA 2011)Convergence and Interoperability (IFLA 2011)
Convergence and Interoperability (IFLA 2011)Figoblog
 
BHL-Europe for sherborn 2011 - henning scholz
BHL-Europe for sherborn 2011 - henning scholzBHL-Europe for sherborn 2011 - henning scholz
BHL-Europe for sherborn 2011 - henning scholzcoelatura
 
Sherborn: Scholz - BHL-Europe: Tools and Services for Legacy Taxonomic Litera...
Sherborn: Scholz - BHL-Europe: Tools and Services for Legacy Taxonomic Litera...Sherborn: Scholz - BHL-Europe: Tools and Services for Legacy Taxonomic Litera...
Sherborn: Scholz - BHL-Europe: Tools and Services for Legacy Taxonomic Litera...
ICZN
 
Geographic information systems (gis) for libraries
Geographic information systems (gis) for librariesGeographic information systems (gis) for libraries
Geographic information systems (gis) for libraries
Seti Keshmiripour
 
" Overview of the Metadata in the new CountrySTAT platform "
" Overview of the Metadata in the new CountrySTAT platform "" Overview of the Metadata in the new CountrySTAT platform "
" Overview of the Metadata in the new CountrySTAT platform "
FAO
 
Trellis_animation
Trellis_animationTrellis_animation
Trellis_animationalana420
 
De- and Reassembling Data Infrastructures
De- and Reassembling Data InfrastructuresDe- and Reassembling Data Infrastructures
De- and Reassembling Data Infrastructures
cgrltz
 
Mapping the European(a) metadata landscape
Mapping the European(a) metadata landscapeMapping the European(a) metadata landscape
Mapping the European(a) metadata landscape
Sally Chambers
 
PerFedPat patent search system
PerFedPat patent search systemPerFedPat patent search system
PerFedPat patent search system
Mike Salampasis
 
Athena richard zijdeman
Athena richard zijdemanAthena richard zijdeman
Athena richard zijdeman
CLARIAH
 
WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...
WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...
WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...
Micah Altman
 
IIIF for Index of Christian Art
IIIF for Index of Christian ArtIIIF for Index of Christian Art
IIIF for Index of Christian ArtJon Stroop
 
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Joachim Neubert
 
Introduction to Scratchpads & ViBRANT
Introduction to Scratchpads & ViBRANTIntroduction to Scratchpads & ViBRANT
Introduction to Scratchpads & ViBRANTEdward Baker
 

What's hot (20)

EurnewsLDN_Toine_Pieters
EurnewsLDN_Toine_PietersEurnewsLDN_Toine_Pieters
EurnewsLDN_Toine_Pieters
 
Convergence and Interoperability (IFLA 2011)
Convergence and Interoperability (IFLA 2011)Convergence and Interoperability (IFLA 2011)
Convergence and Interoperability (IFLA 2011)
 
BHL-Europe for sherborn 2011 - henning scholz
BHL-Europe for sherborn 2011 - henning scholzBHL-Europe for sherborn 2011 - henning scholz
BHL-Europe for sherborn 2011 - henning scholz
 
Sherborn: Scholz - BHL-Europe: Tools and Services for Legacy Taxonomic Litera...
Sherborn: Scholz - BHL-Europe: Tools and Services for Legacy Taxonomic Litera...Sherborn: Scholz - BHL-Europe: Tools and Services for Legacy Taxonomic Litera...
Sherborn: Scholz - BHL-Europe: Tools and Services for Legacy Taxonomic Litera...
 
Geographic information systems (gis) for libraries
Geographic information systems (gis) for librariesGeographic information systems (gis) for libraries
Geographic information systems (gis) for libraries
 
agINFRA – a multilingual infrastructure for information on agricultural innov...
agINFRA – a multilingual infrastructure for information on agricultural innov...agINFRA – a multilingual infrastructure for information on agricultural innov...
agINFRA – a multilingual infrastructure for information on agricultural innov...
 
Trellis
TrellisTrellis
Trellis
 
" Overview of the Metadata in the new CountrySTAT platform "
" Overview of the Metadata in the new CountrySTAT platform "" Overview of the Metadata in the new CountrySTAT platform "
" Overview of the Metadata in the new CountrySTAT platform "
 
Trellis_animation
Trellis_animationTrellis_animation
Trellis_animation
 
Finding Data Sets
Finding Data SetsFinding Data Sets
Finding Data Sets
 
De- and Reassembling Data Infrastructures
De- and Reassembling Data InfrastructuresDe- and Reassembling Data Infrastructures
De- and Reassembling Data Infrastructures
 
Mapping the European(a) metadata landscape
Mapping the European(a) metadata landscapeMapping the European(a) metadata landscape
Mapping the European(a) metadata landscape
 
PerFedPat patent search system
PerFedPat patent search systemPerFedPat patent search system
PerFedPat patent search system
 
Athena richard zijdeman
Athena richard zijdemanAthena richard zijdeman
Athena richard zijdeman
 
WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...
WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...
WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...
 
Curadoria digital e dados abertos conectados
Curadoria digital e dados abertos conectadosCuradoria digital e dados abertos conectados
Curadoria digital e dados abertos conectados
 
IIIF for Index of Christian Art
IIIF for Index of Christian ArtIIIF for Index of Christian Art
IIIF for Index of Christian Art
 
Exploring Linked Data
Exploring Linked DataExploring Linked Data
Exploring Linked Data
 
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
 
Introduction to Scratchpads & ViBRANT
Introduction to Scratchpads & ViBRANTIntroduction to Scratchpads & ViBRANT
Introduction to Scratchpads & ViBRANT
 

Similar to Reusing Collection Metadata as Data

Neuroscience as networked science
Neuroscience as networked scienceNeuroscience as networked science
Neuroscience as networked science
Neuroscience Information Framework
 
Pratt Sils Knowledge Organization Fall 2008
Pratt Sils Knowledge Organization Fall 2008Pratt Sils Knowledge Organization Fall 2008
Pratt Sils Knowledge Organization Fall 2008PrattSILS
 
The Future of Metadata Management & Making Library Collections Discoverable o...
The Future of Metadata Management & Making Library Collections Discoverable o...The Future of Metadata Management & Making Library Collections Discoverable o...
The Future of Metadata Management & Making Library Collections Discoverable o...
tfons
 
Cataloging Presentation
Cataloging PresentationCataloging Presentation
Cataloging Presentation
Angela Dresselhaus
 
Integrating Unique Materials into the Global Discovery Network
Integrating Unique Materials into the Global Discovery NetworkIntegrating Unique Materials into the Global Discovery Network
Integrating Unique Materials into the Global Discovery Network
OCLC Research
 
1.1 library concepts, terms and systems edited
1.1 library concepts, terms and systems edited1.1 library concepts, terms and systems edited
1.1 library concepts, terms and systems edited
ChandraSekhar1115
 
Pratt Sils LIS653 4 Fall 2007
Pratt Sils LIS653 4 Fall 2007Pratt Sils LIS653 4 Fall 2007
Pratt Sils LIS653 4 Fall 2007PrattSILS
 
A Deep Survey of the Digital Resource Landscape: Perspectives from the Neuros...
A Deep Survey of the Digital Resource Landscape:Perspectives from the Neuros...A Deep Survey of the Digital Resource Landscape:Perspectives from the Neuros...
A Deep Survey of the Digital Resource Landscape: Perspectives from the Neuros...
Maryann Martone
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsJon Voss
 
A Deep Survey of the Digital Resource Landscape
A Deep Survey of the Digital Resource LandscapeA Deep Survey of the Digital Resource Landscape
A Deep Survey of the Digital Resource Landscape
Neuroscience Information Framework
 
Databases and Ontologies: Where do we go from here?
Databases and Ontologies:  Where do we go from here?Databases and Ontologies:  Where do we go from here?
Databases and Ontologies: Where do we go from here?
Maryann Martone
 
Pratt SILS Knowledge Organization Spring 2011
Pratt SILS Knowledge Organization Spring 2011Pratt SILS Knowledge Organization Spring 2011
Pratt SILS Knowledge Organization Spring 2011PrattSILS
 
Boundless Opportunity
Boundless OpportunityBoundless Opportunity
Boundless Opportunity
Rachel Frick
 
UKSG 2024 -From algorithms to empowerment:teaching algorithmic literacy (AL) ...
UKSG 2024 -From algorithms to empowerment:teaching algorithmic literacy (AL) ...UKSG 2024 -From algorithms to empowerment:teaching algorithmic literacy (AL) ...
UKSG 2024 -From algorithms to empowerment:teaching algorithmic literacy (AL) ...
UKSG: connecting the knowledge community
 
Data-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystemData-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystem
Maryann Martone
 
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012lljohnston
 
Humanities data curation slides
Humanities data curation slidesHumanities data curation slides
Humanities data curation slides
Harriett Green
 
Toward universal information access on the digital object cloud
Toward universal information access on the digital object cloudToward universal information access on the digital object cloud
Toward universal information access on the digital object cloud
National Institute of Informatics
 

Similar to Reusing Collection Metadata as Data (20)

Neuroscience as networked science
Neuroscience as networked scienceNeuroscience as networked science
Neuroscience as networked science
 
Pratt Sils Knowledge Organization Fall 2008
Pratt Sils Knowledge Organization Fall 2008Pratt Sils Knowledge Organization Fall 2008
Pratt Sils Knowledge Organization Fall 2008
 
The Future of Metadata Management & Making Library Collections Discoverable o...
The Future of Metadata Management & Making Library Collections Discoverable o...The Future of Metadata Management & Making Library Collections Discoverable o...
The Future of Metadata Management & Making Library Collections Discoverable o...
 
Cataloging Presentation
Cataloging PresentationCataloging Presentation
Cataloging Presentation
 
Ji cv6n1
Ji cv6n1Ji cv6n1
Ji cv6n1
 
Open Science
Open Science Open Science
Open Science
 
Integrating Unique Materials into the Global Discovery Network
Integrating Unique Materials into the Global Discovery NetworkIntegrating Unique Materials into the Global Discovery Network
Integrating Unique Materials into the Global Discovery Network
 
1.1 library concepts, terms and systems edited
1.1 library concepts, terms and systems edited1.1 library concepts, terms and systems edited
1.1 library concepts, terms and systems edited
 
Pratt Sils LIS653 4 Fall 2007
Pratt Sils LIS653 4 Fall 2007Pratt Sils LIS653 4 Fall 2007
Pratt Sils LIS653 4 Fall 2007
 
A Deep Survey of the Digital Resource Landscape: Perspectives from the Neuros...
A Deep Survey of the Digital Resource Landscape:Perspectives from the Neuros...A Deep Survey of the Digital Resource Landscape:Perspectives from the Neuros...
A Deep Survey of the Digital Resource Landscape: Perspectives from the Neuros...
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
 
A Deep Survey of the Digital Resource Landscape
A Deep Survey of the Digital Resource LandscapeA Deep Survey of the Digital Resource Landscape
A Deep Survey of the Digital Resource Landscape
 
Databases and Ontologies: Where do we go from here?
Databases and Ontologies:  Where do we go from here?Databases and Ontologies:  Where do we go from here?
Databases and Ontologies: Where do we go from here?
 
Pratt SILS Knowledge Organization Spring 2011
Pratt SILS Knowledge Organization Spring 2011Pratt SILS Knowledge Organization Spring 2011
Pratt SILS Knowledge Organization Spring 2011
 
Boundless Opportunity
Boundless OpportunityBoundless Opportunity
Boundless Opportunity
 
UKSG 2024 -From algorithms to empowerment:teaching algorithmic literacy (AL) ...
UKSG 2024 -From algorithms to empowerment:teaching algorithmic literacy (AL) ...UKSG 2024 -From algorithms to empowerment:teaching algorithmic literacy (AL) ...
UKSG 2024 -From algorithms to empowerment:teaching algorithmic literacy (AL) ...
 
Data-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystemData-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystem
 
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012
 
Humanities data curation slides
Humanities data curation slidesHumanities data curation slides
Humanities data curation slides
 
Toward universal information access on the digital object cloud
Toward universal information access on the digital object cloudToward universal information access on the digital object cloud
Toward universal information access on the digital object cloud
 

More from Itza Carbajal

Post Custodial Metadata Development & Decisions
Post Custodial Metadata Development & DecisionsPost Custodial Metadata Development & Decisions
Post Custodial Metadata Development & Decisions
Itza Carbajal
 
Metadata From the Source: Participatory Metadata Models in Post-Custodial Pro...
Metadata From the Source: Participatory Metadata Models in Post-Custodial Pro...Metadata From the Source: Participatory Metadata Models in Post-Custodial Pro...
Metadata From the Source: Participatory Metadata Models in Post-Custodial Pro...
Itza Carbajal
 
community, communities & archives
community, communities & archivescommunity, communities & archives
community, communities & archives
Itza Carbajal
 
community & archives?
community & archives?community & archives?
community & archives?
Itza Carbajal
 
Post-Custodial Methods in Archival Practice
Post-Custodial Methods in Archival PracticePost-Custodial Methods in Archival Practice
Post-Custodial Methods in Archival Practice
Itza Carbajal
 
Introduction to Linked Data - Part 1
Introduction to Linked Data - Part 1Introduction to Linked Data - Part 1
Introduction to Linked Data - Part 1
Itza Carbajal
 
CROSSING BORDERS: Why Archival Science Students Benefit from Interdepartment...
CROSSING BORDERS:  Why Archival Science Students Benefit from Interdepartment...CROSSING BORDERS:  Why Archival Science Students Benefit from Interdepartment...
CROSSING BORDERS: Why Archival Science Students Benefit from Interdepartment...
Itza Carbajal
 
Creating Knowledges: A Discussion on the Significance of Gloria Anzaldúa and ...
Creating Knowledges: A Discussion on the Significance of Gloria Anzaldúa and ...Creating Knowledges: A Discussion on the Significance of Gloria Anzaldúa and ...
Creating Knowledges: A Discussion on the Significance of Gloria Anzaldúa and ...
Itza Carbajal
 
Centering Consent: Investigating Archival Donor Relations Practices Panel
Centering Consent: Investigating Archival Donor Relations Practices PanelCentering Consent: Investigating Archival Donor Relations Practices Panel
Centering Consent: Investigating Archival Donor Relations Practices Panel
Itza Carbajal
 
Radical Shared History Online Portal Work Session
Radical Shared History Online Portal Work SessionRadical Shared History Online Portal Work Session
Radical Shared History Online Portal Work Session
Itza Carbajal
 
Digital Keepers: Ethics of Saving Online Data About Latin American Social Mo...
Digital Keepers:  Ethics of Saving Online Data About Latin American Social Mo...Digital Keepers:  Ethics of Saving Online Data About Latin American Social Mo...
Digital Keepers: Ethics of Saving Online Data About Latin American Social Mo...
Itza Carbajal
 
Defining the Archive on Our Terms: A Look at the Esperanza Peace and Justice ...
Defining the Archive on Our Terms: A Look at the Esperanza Peace and Justice ...Defining the Archive on Our Terms: A Look at the Esperanza Peace and Justice ...
Defining the Archive on Our Terms: A Look at the Esperanza Peace and Justice ...
Itza Carbajal
 

More from Itza Carbajal (12)

Post Custodial Metadata Development & Decisions
Post Custodial Metadata Development & DecisionsPost Custodial Metadata Development & Decisions
Post Custodial Metadata Development & Decisions
 
Metadata From the Source: Participatory Metadata Models in Post-Custodial Pro...
Metadata From the Source: Participatory Metadata Models in Post-Custodial Pro...Metadata From the Source: Participatory Metadata Models in Post-Custodial Pro...
Metadata From the Source: Participatory Metadata Models in Post-Custodial Pro...
 
community, communities & archives
community, communities & archivescommunity, communities & archives
community, communities & archives
 
community & archives?
community & archives?community & archives?
community & archives?
 
Post-Custodial Methods in Archival Practice
Post-Custodial Methods in Archival PracticePost-Custodial Methods in Archival Practice
Post-Custodial Methods in Archival Practice
 
Introduction to Linked Data - Part 1
Introduction to Linked Data - Part 1Introduction to Linked Data - Part 1
Introduction to Linked Data - Part 1
 
CROSSING BORDERS: Why Archival Science Students Benefit from Interdepartment...
CROSSING BORDERS:  Why Archival Science Students Benefit from Interdepartment...CROSSING BORDERS:  Why Archival Science Students Benefit from Interdepartment...
CROSSING BORDERS: Why Archival Science Students Benefit from Interdepartment...
 
Creating Knowledges: A Discussion on the Significance of Gloria Anzaldúa and ...
Creating Knowledges: A Discussion on the Significance of Gloria Anzaldúa and ...Creating Knowledges: A Discussion on the Significance of Gloria Anzaldúa and ...
Creating Knowledges: A Discussion on the Significance of Gloria Anzaldúa and ...
 
Centering Consent: Investigating Archival Donor Relations Practices Panel
Centering Consent: Investigating Archival Donor Relations Practices PanelCentering Consent: Investigating Archival Donor Relations Practices Panel
Centering Consent: Investigating Archival Donor Relations Practices Panel
 
Radical Shared History Online Portal Work Session
Radical Shared History Online Portal Work SessionRadical Shared History Online Portal Work Session
Radical Shared History Online Portal Work Session
 
Digital Keepers: Ethics of Saving Online Data About Latin American Social Mo...
Digital Keepers:  Ethics of Saving Online Data About Latin American Social Mo...Digital Keepers:  Ethics of Saving Online Data About Latin American Social Mo...
Digital Keepers: Ethics of Saving Online Data About Latin American Social Mo...
 
Defining the Archive on Our Terms: A Look at the Esperanza Peace and Justice ...
Defining the Archive on Our Terms: A Look at the Esperanza Peace and Justice ...Defining the Archive on Our Terms: A Look at the Esperanza Peace and Justice ...
Defining the Archive on Our Terms: A Look at the Esperanza Peace and Justice ...
 

Recently uploaded

Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
AnirbanRoy608946
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
oz8q3jxlp
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
mzpolocfi
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
apvysm8
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfUnleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Enterprise Wired
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
rwarrenll
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 

Recently uploaded (20)

Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
办(uts毕业证书)悉尼科技大学毕业证学历证书原版一模一样
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfUnleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdf
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 

Reusing Collection Metadata as Data

  • 1. Reusing Collection Metadata as Data Mapping the Spanish Mission Landscape Workshop March 2, 2019 | University of Texas at Austin Presentation by: Itza Carbajal, Latin American Metadata Librarian
  • 2. who creates metadata? ● WHO DOESN’T is the real question ● Individuals ○ Tagging of photos, file naming, project contributions ● Information Science professionals (librarians, archivists, database managers, etc) ○ Cataloging book records ○ Access mechanisms such as finding aids, online repositories, CMS ○ Databases ● Mixed media creators ○ Film production, photography, software developers, music producers ● Publishing ○ Publication agencies, writers working with digital materials, illustrators
  • 3. why is metadata created? ● Identifying ● Managing ● Searching ● Analyzing ● Designing
  • 4. what type of metadata is typically captured? Administrative Metadata used in managing and administering collections and information resources Descriptive Metadata used to identify and describe collections and related information resources Technical Metadata related to how a system functions or metadata behaves
  • 5. re-purpose metadata for digital scholarship ● Classroom Instruction ○ Discovery and deep group discussions ● Layered Analysis ○ Geographic Information systems ● In depth searchability ○ Transcription
  • 6. capturing metadata Scribe an open source framework for community transcription built by NYPL Labs in collaboration with Zooniverse Scraper gets data out of web pages and into spreadsheets Optical Character Recognition (OCR) technologies - including programs like Google Drive, Tesseract or Adobe Acrobat that can detect text to make it searchable/readable *Rate of accuracy varies and access to affordable software not consistent
  • 7. accessing existing metadata Digital Public Library of America (DPLA) open API enables people to use millions of records describing cultural heritage resources held by institutions across the US. Flickr has over 5 billion photos with valuable metadata such as tags, geolocation, and Exif data The Europeana provides access to over 50 million digitised items – books, music, artworks and more from thousands of European archives, libraries and museums HathiTrust Digital Library has more than 2 million volumes are in the public domain and freely viewable on the Web
  • 8. analyzing metadata Map Warper built by NYPL Labs is a tool suite used to align (or "rectify") historical maps to the digital maps of today. Gephi an open-source software for network visualization and analysis of data sets to summarize their main characteristics, often with visual methods. MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text.
  • 9. manipulating Metadata OpenRefine - clean up messy or inconsistent data Data Wrangler - used to merge, delete, autofill, filling in missing data or incorporating data from another source, and move information in your set. Data Science Toolkit - set of open-source tools for data science information transformation needs
  • 10. thank you.Email questions to: i.carbajal@austin.utexas.edu