4. OPENREFINE
...a Java-based power tool that allows you
to load data, understand it, clean it up,
reconcile it, and augment it with data
coming from the web.
-https://github.com/OpenRefine/OpenRefine
“
”
7. OPENREFINE
“Completeness: Checking to see what elements/properties/attributes are
present (and how much of it is missing!)
Accuracy: Information is correct and factual (to the best of our abilities)
Conformance to expectations: Information adheres to our expectations
Consistency: Values are consistent within our domain, elements are
represented in a consistent manner”
-Gretchen Gueguen, Digital Public Library of America
9. DATA VALIDATION
A two-step definition for Data Validation:
“An activity aimed at verifying whether the value of a data item comes from the
given (finite or infinite) set of acceptable values.”
-UNECE 2013 Glossary of terms on statistical data editing
“Data validation could be operationally defined as a process which ensures the
correspondence of the final (published) data with a number of quality
characteristics.”
-Simon A., (2013a) Definition of validation levels and other related concepts v01307. Working document
10. DATA VALIDATION
Two paths
to Data
Validation:
Manually check terms against sources
(use when no existing reconciliation service/no
URIs and no. of content is small)
Use reconciliation services to check terms against
sources
(use when services available, especially helpful with
large no. of content!)
12. View of original Art and Architecture terms
View of validated Art and
Architecture terms
DATA VALIDATION
Source: Texas Archival Resources Online (TARO) project, as part of current work duties
14. FUZZY MATCHING
University of Texas. Glee Club
University of Texas. Glee Clubs
University of Texas Glee Club
The University of Texas. Glee Club
The University of Texas. glee club
University of Texas--glee clubs
University of Texas. Glee Club
17. Screenshot of Library of Congress subject headings for
Nuu-Chah-Nulth people and related topics.
18. Screenshot of Getty AAT record for Nuu-Chah-Nulth, with sources
used to make the record highlighted
19. OpenRefine project
Using Recon Service API (GET, POST)
Name Authorities with URIs
Retrieve terms and URIs
PROBLEMATIZING DATA TRANSFORMATION
Where did this information come from?
Are there pieces missing?
Who may not know this is here?
Is this record truly a 1:1 relationship?
Should all of this information be shared?
22. It’s all about ...
Relationships
Img source: Lea L on Unsplash
23. ...it is insufficient to care only for the object, which is
the material expression of a people’s way of life.
Instead, the knowledge itself, including the means
of its making, must be treated with
respect...including the establishment of
fair and just reciprocal relationships between the
holding institutions and the Indigenous peoples
who created the original expressions.
-Littletree, Belarde-Lewis, and Duarte (2020), p. 416
“
”
24. Indian Arts Research Center. 2019. Guidelines for Collaboration (website). Facilitated by Landis Smith,
Cynthia Chavez Lamar, and Brian Vallo. Santa Fe, NM: School for Advanced Research
25. A REVISED WORKFLOW
Metadata project
Name authorities with URIs
Retrieve terms and URIs
Use reconciliation services
Community stakeholders/resources
Consult communities/resources
Employ protocols for info
Img sources: spreadsheet by Yuri Mazursky from the Noun Project; Community by Adrien Coquet from the Noun Project
Share results
27. Not all information can be shared in our collections
systems
Not all knowledge systems/ways of knowing can
be ethically represented in our collections systems
Sometimes collaboration is not possible
(and as such, we should not forge ahead with
related projects without it)
Sometimes, resources do not yet exist
Even so, it is all still worthwhile work :)
Img source: Photo of Tataviam land (Vasquez Rocks) by author, 2020.
29. THANK YOU
The following list contains resources on the tools I have used, literature on reconciliation services, and critical cataloging.
Open Refine
https://openrefine.org/
https://github.com/OpenRefine/OpenRefine
https://www.getty.edu/research/tools/vocabularies/obtain/getty_vocabularies_openrefine_tutorial.pdf
https://guides.library.illinois.edu/openrefine/gettingstarted
https://librarycarpentry.org/lc-open-refine/aio.html
https://mnylc.org/fellows/2017/03/17/using-openrefine-to-reconcile-name-entities/
https://github.com/OpenRefine/OpenRefine/wiki/Clustering-In-Depth
Reconciliation Services
http://refine.codefork.com/reconcile/viafproxy/LC
https://github.com/cmharlow/lc-reconcile/tree/5739c08f0e20f51a3a9a27637b1ec13869709002
https://www.getty.edu/research/tools/vocabularies/obtain/openrefine.html
https://www.howtogeek.com/343877/what-is-an-api/
https://github.com/cmharlow/c4lMDCpres/blob/master/slides/OpenRefineReconSlides.pdf
Critical Cataloging
Changes to Library of Congress Subject Headings Related to Indigenous Peoples
Archives for Black Lives in Philadelphia Anti-Racist Description Resources
Writing About Slavery? This Might Help. P. Gabrielle Forman, et al.
Homosaurus
Disability Language Style Guide
30. THANK YOU
The following list contains resources on the tools I have used, literature on reconciliation services, and critical cataloging.
Critical Cataloging cont.
First Nations, Metis, and Inuit Ontology
Change the Subject Documentary
Referencing Indigenous Knowledge in non-Indigenous sources
"The Right to Know": Decolonizing Native American Archives
Decentering Whiteness in Design History Resources
Citing Indigenous Elders and Knowledge Keepers from UBC
Centering Relationality: A Conceptual Model to Advance Indigenous Knowledge Organization Practices
‘Of course, data can never fully represent reality’: Assessing the Relationship between Indigenous Data and IK, TEK, and TK
Local Contexts Traditional Knowledge Labels
Knowledge Organization from an Indigenous Perspective: The Mashantucket Pequot Thesaurus of American Indian Terminology Project
Chicano Studies Collection thesaurus and The Chicano Database: Re-imagining Data Management as (Un)disciplinarity
Continuing Education workshops, hosted by Maskwacis Cultural College
United States Indigenous Data Sovereignty Network
Land Acknowledgement Info
Eastern Band of Cherokee Indians, EBCI Story Maps Project
Absentee Shawnee , Shawnee Tribe, and Eastern Shawnee Tribe of Oklahoma
Miami Tribe of Oklahoma, Myaamia Center
Peoria Tribe of Oklahoma
Yuchi Tribe -- assimilated into many different tribal groups