This document discusses Wikidata and cultural heritage data. It aims to establish Wikidata as a central hub for cultural heritage data by ingesting related data and enhancing it. Key challenges include getting institutions to provide open data, assisting with data scraping, addressing coverage biases, mapping data models during ingestion, and dealing with incorrect data. Maintaining data quality over time through processes like updating and dispute resolution is also challenging. The document explores how Wikidata can better integrate with other databases and cultural heritage organizations to maximize data sharing and reuse.
1. Wikidata & Heritage Data
Where do we stand? What’s next?
Lausanne, 14 September 2017
Sijie Dai, Captain Alving – Prix de Lausanne 2010. Photo by Inisheer, CC BY-SA (Wikimedia Commons)
Unless otherwise noted, the content of this presentation is made available under the CC BY 4.0 license.
2. ▶ The aim of this project is to coordinate, facilitate and promote
the ingestion of cultural heritage related data into
Wikidata, to facilitate the cleansing and enhancement of this
data and to promote its use across Wikipedia, its sister
projects and beyond.
▶ It is our vision to establish Wikidata as a central hub for data
integration, data enhancement, and data management in
the heritage domain.
Aim and Vision (WikiProject Cultural Heritage)
3. ▶ Establish Wikidata as a database that covers the entire world’s
cultural heritage.
▶ Establish Wikidata as a central hub that interlinks GLAM collections
around the world and provides links to bibliographic, genealogical,
scientific and other collections of information; create the ultimate
authority file.
▶ Foster truly multilingual and global collaboration among people
from various backgrounds.
▶ Leverage synergies between institutions, reduce duplicate work.
▶ Encourage debate in the community by highlighting and
interrogating differences in perspective.
▶ Provide a single source of data for some of the most popular web
sites and apps, including Wikipedia infoboxes and lists.
Vision (Blog posts: Stinson et al. 2016; Thornton / Cochrane 2016; Poulter 2017)
7. ▶ Wikidata needs to be explained to institutions with a view to
obtaining data donations.
• Lack of awareness of the importance of open licenses in
databases
• Fears of loss of control related to publishing data under CC-0
• What can institutions gain from their involvement in Wikidata?
▶ Community members need assistance with scraping data from
websites.
▶ Present coverage is biased; it is highest for Western Europe and
North America; how to get access to data from other world regions?
How To Get Access to Freely Licensed Data?
8. ▶ http://make.opendata.ch/wiki/data:glam_ch
• Personnalités Vaudoises (BCUL)
• Swiss Photography Metadata (Büro für Fotografiegeschichte)
• Artist data from the SIKART Lexicon on art in Switzerland (SIK-ISEA)
• Metadata of the Historical Dictionary of Switzerland (HLS)
• PCP Inventory (Federal Office for Civil Protection)
• Inventory of Historical Monuments (Canton of Zurich)
• Inventory of Historical Monuments (City of Zurich)
• Inventory of classified Gardens and Parks (City of Zurich)
• Art in the Urban Space (City of Zurich)
• Swiss GLAM Inventory (OpenGLAM)
• Inventory of Research Libraries in Switzerland (Swissbib)
• ISplus Swiss (G)LAM Inventory (Swiss National Library)
• Schauspielhaus Zürich Repertoire of Theatre and other Productions, 1938–1968
• Swiss Theatre Metadata (Swiss Theatre Collection)
• Plazi TreatmentBank (repository of the world's species) (Plazi.org)
• Historical Statistics of Switzerland (University of Zurich)
Data Provision – Which Datasets are Useful?
13. ▶ Coping with the Bazaar:
• Sometimes changes to property definitions are too easily made by
volunteers
• There is a rigorous process for creating new properties, but not for
changing definitions of properties or creating new classes
• No master language; how to keep translations of definitions in sync?
• Sometimes different approaches are used to model the same thing.
▶ What are good design principles?
• Re-usability of properties across various domains
• Select high priority areas first, do not try to solve everything overnight for
the entire cultural heritage domain
• …
▶ Finding a balance between:
• The expressive power of an ontology
• Its practicability when it comes to large scale use by many people
• Its queryability (usability from the perspective of data users)
Challenges Related to Ontology Development (2/2)
14. ▶ Mapping Between Data Models
• Getting an overview of appropriate properties and classes can be a
time-consuming exercise.
• Creating new properties requires community agreement and may involve
lengthy discussions and compromises.
• There is still a lot of work to be done in the area of typologies and
thesauri [Example]
▶ Matching Items / Disambiguation
• There are tools like Mix’n’Match and OpenRefine to support this, but it
remains a major challenge, esp. with datasets which haven’t resolved this
issue internally.
▶ Incorrect / Incoherent Data on Wikidata
• Many data ingestion projects require cleaning up existing data first.
▶ Repeated Ingestion / Updates
• How to approach the historicization of data?
• How to set up processes to regularly update data?
Challenges Related to Data Ingestion
N.B.: We are not filling a void or starting from scratch, but contributing to an
existing ecosystem of data, data models, and community members!
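The item-matching problem named on this slide can be illustrated with a minimal sketch. It is not how Mix'n'Match or OpenRefine work internally; it only uses Python's standard `difflib` to score label similarity. The item identifiers, the labels, and the 0.85 threshold are assumptions made for the example, and any real ingest would still need human review of the candidates.

```python
import unicodedata
from difflib import SequenceMatcher


def normalize(label: str) -> str:
    """Lower-case a label, strip accents and collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", label)
    no_accents = "".join(c for c in decomposed if not unicodedata.combining(c))
    return " ".join(no_accents.lower().split())


def match_candidates(source_label, existing_labels, threshold=0.85):
    """Return (item_id, score) pairs scoring at or above threshold, best first.

    existing_labels maps an item identifier to its label. The threshold is
    an illustrative assumption, not a recommended value.
    """
    target = normalize(source_label)
    scored = []
    for item_id, label in existing_labels.items():
        score = SequenceMatcher(None, target, normalize(label)).ratio()
        if score >= threshold:
            scored.append((item_id, round(score, 3)))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical identifiers; real Wikidata item IDs would go here.
existing = {
    "item-1": "Schauspielhaus Zürich",
    "item-2": "Schauspielhaus Bochum",
    "item-3": "Kunsthaus Zürich",
}
candidates = match_candidates("Schauspielhaus Zurich", existing)
print(candidates)
```

Label similarity alone cannot disambiguate namesakes, which is why datasets that have not resolved duplicates internally remain hard to match.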
17. ▶ Establishing and Documenting Data Quality
• Getting rid of duplicates
• Dealing with incorrect and inconsistent data
• How to monitor data quality and data completeness?
▶ Building a Network of Trust
• Linking all statements to a reliable source
• In the future: “Signed Statements”
▶ Data Exchange Between Wikidata and Primary Databases
• Data synchronization: How to keep data mutually up to date?
• How to make it easier for GLAM employees to follow
changes/improvements to their data on Wikidata?
Challenges Related to Data Maintenance
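As a rough illustration of the synchronization question, the sketch below compares two snapshots of the same dataset, for instance an institution's export and the corresponding data retrieved from Wikidata. The snapshot structure (`{record_id: {field: value}}`) and the field names are assumptions for the example; it does not decide which side is correct, it only reports where they differ.

```python
def diff_snapshots(old, new):
    """Compare two {record_id: {field: value}} snapshots.

    Returns the record ids that were added or removed, and for records
    present in both, the fields whose values differ as (old, new) pairs.
    """
    added = sorted(set(new) - set(old))
    removed = sorted(set(old) - set(new))
    changed = {}
    for record_id in sorted(set(old) & set(new)):
        fields = set(old[record_id]) | set(new[record_id])
        delta = {
            field: (old[record_id].get(field), new[record_id].get(field))
            for field in sorted(fields)
            if old[record_id].get(field) != new[record_id].get(field)
        }
        if delta:
            changed[record_id] = delta
    return {"added": added, "removed": removed, "changed": changed}


# Hypothetical records and field names.
primary = {
    "rec-1": {"name": "City Archive", "website": "https://example.org/old"},
    "rec-2": {"name": "Town Museum"},
}
mirror = {
    "rec-1": {"name": "City Archive", "website": "https://example.org/new"},
    "rec-3": {"name": "Village Library"},
}
report = diff_snapshots(primary, mirror)
print(report)
```

A report like this could be the starting point for a dispute-resolution or review step before any value is overwritten on either side.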
18. ▶ Chicken and Egg Problem:
• Data usage drives data quality & completeness
• Data quality & completeness are prerequisites of data use
Challenges Related to Data Use
20. ▶ Linking Wikidata with other databases
• Map existing standards from the GLAM sector to Wikidata
• Merge data imported from Wikipedia with data from reliable databases
▶ In what areas is Wikidata supposed to…
• serve as the master database (referencing sources other than databases)?
• hold data imported from reliable databases?
• link to authoritative databases (without holding the actual data)?
▶ How should GLAMs organize their relationship with Wikidata?
• Provide mutual links?
• Ingest part or all of their data into Wikidata?
• Synchronize part or all of their data with Wikidata?
• Use Wikidata as their main database?
Wikidata and the Wider Data Landscape
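Mapping an existing standard to Wikidata can be thought of as a crosswalk table plus a report of what is left unmapped. The sketch below assumes a source record keyed by Dublin Core style field names; the particular pairing of fields with Wikidata properties (P1476 title, P170 creator, P571 inception) is an illustrative assumption, not a community-agreed mapping.

```python
# Illustrative crosswalk; the field-to-property pairing is an assumption.
FIELD_TO_PROPERTY = {
    "dc:title": "P1476",        # title
    "dc:creator": "P170",       # creator
    "dcterms:created": "P571",  # inception
}


def apply_crosswalk(record, crosswalk):
    """Split a source record into mapped statements and unmapped fields.

    Returns (mapped, unmapped): mapped is {property_id: [values]},
    unmapped is a sorted list of source fields without a target property.
    """
    mapped, unmapped = {}, []
    for field, value in record.items():
        prop = crosswalk.get(field)
        if prop is None:
            unmapped.append(field)
        else:
            mapped.setdefault(prop, []).append(value)
    return mapped, sorted(unmapped)


# Hypothetical source record.
record = {
    "dc:title": "View of the Lake",
    "dc:creator": "Unknown photographer",
    "dc:rights": "CC0",
}
mapped, unmapped = apply_crosswalk(record, FIELD_TO_PROPERTY)
print(mapped)
print(unmapped)
```

The list of unmapped fields is where the questions on this slide become concrete: each one either needs a suitable property, stays in the source database behind a link, or is left out.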
21. ▶ How to improve guidelines, community structures, reporting etc. in
order to be able to involve more GLAM personnel in Wikidata?
▶ How best to foster a shared data modelling practice in various
areas? (Need for more modelling show cases, coordination, etc.)
▶ Need for training and tools (to facilitate the accomplishment of
certain tasks).
▶ The evolving tools landscape constitutes a challenge when
establishing processes and working with guidelines.
▶ https://www.wikidata.org/wiki/Wikidata:WikiProject_Cultural_heritage
▶ Wikidata + GLAM Facebook Group
Community & Collaboration
22. Useful Tools
▶ Example: Tools I used for the ingest of the Swiss GLAM
Inventory:
• Microsoft Excel / OpenOffice Calc
• Wikidata Query Service
• OpenRefine
• Reconcile-csv
• Listeria
• QuickStatements
• Microsoft Word / Excel (mail merge)
• Hatnote: «Listen to Wikipedia»
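To give an idea of how a spreadsheet row ends up in QuickStatements, here is a minimal sketch that turns one record into tab-separated commands in what I understand to be the tool's version 1 syntax (`CREATE`, then `LAST` lines, with `S854` adding a reference URL). The record layout and the helper name are assumptions for the example, and the exact syntax should be checked against the tool's current documentation before use.

```python
def to_quickstatements(record, source_url):
    """Render one record as QuickStatements-style commands (V1 syntax assumed).

    record is {"label": str, "claims": [(property_id, item_id), ...]}.
    Each claim is given the source URL as a reference (S854).
    """
    label = record["label"]
    if '"' in label or "\t" in label:
        # Quoting rules for such labels are not handled in this sketch.
        raise ValueError("label contains a quote or tab character")
    lines = ["CREATE", f'LAST\tLen\t"{label}"']
    for prop, value in record["claims"]:
        lines.append(f'LAST\t{prop}\t{value}\tS854\t"{source_url}"')
    return "\n".join(lines)


# Hypothetical inventory entry: instance of (P31) museum (Q33506),
# country (P17) Switzerland (Q39).
entry = {
    "label": "Example Museum",
    "claims": [("P31", "Q33506"), ("P17", "Q39")],
}
commands = to_quickstatements(entry, "https://example.org/inventory")
print(commands)
```

Items should only be created this way after checking that no matching item already exists; otherwise the ingest adds duplicates that have to be merged later.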
23. ▶ Diff tools to help tracking changes in datasets on Wikidata and to
synchronize with external databases
▶ Statistics tools (data completeness; data use)
▶ Data visualization tools (beyond what the Query service can already
do)
▶ Data tracking tools (data completeness; see how data evolves)
▶ Improved version of the QuickStatements tool (see feature
requests)
▶ Customizable forms for manual data entry
Tools – Wishlist
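A statistics tool for data completeness could start from something as small as the sketch below: given a list of items and the properties one expects them to have, it reports the share of items carrying each property. The item structure and the choice of properties (P625 coordinate location, P856 official website) are assumptions for the example.

```python
def completeness(items, properties):
    """Return {property_id: share of items with a non-empty value}.

    items is a list of {property_id: value} dictionaries. An empty list
    yields 0.0 for every property.
    """
    total = len(items)
    report = {}
    for prop in properties:
        filled = sum(1 for item in items if item.get(prop))
        report[prop] = filled / total if total else 0.0
    return report


# Hypothetical items from an institutional inventory.
inventory = [
    {"P625": "46.52,6.63", "P856": "https://example.org/a"},
    {"P625": "47.37,8.54"},
    {},
]
shares = completeness(inventory, ["P625", "P856"])
print(shares)
```

Running the same report at regular intervals would also show how the data evolves, which is the second wish on this slide.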
24. Thank You for Your Attention!
Contact
Beat Estermann
Bern University of Applied Sciences
beat.estermann@bfh.ch
+41 31 848 34 38