This document summarizes Dr. Stefan Schmunk's presentation on new roles for libraries in relation to digital humanities, research data, and research infrastructures. The presentation discusses how digital humanities projects involving tasks like digital scholarly editions require new skills from libraries, such as expertise in XML encoding, long-term preservation of digital materials, and creation of virtual research environments. It also explores how libraries must adapt to help researchers with the growing importance of research data in the humanities by taking on roles like hosting data repositories, providing data management support and training, and building research data infrastructures.
This presentation was provided by Carolyn Hansen of the University of Cincinnati during the NISO Training Thursday event, Metadata and the IR, held on Thursday, February 23, 2017.
This presentation was provided by Carolyn Hansen of the University of Cincinnati during the NISO Training Thursday event, Metadata and the IR, held on Thursday, February 23, 2017.
Sebastian Colutto (University of Innsbruck, AT): Transkribus. A virtual research environment for the transcription and recognition of historical documents
co:op-READ-Convention Marburg
Technology meets Scholarship, or how Handwritten Text Recognition will Revolutionize Access to Archival Collections.
With a special focus on biographical data in archives
Hessian State Archives Marburg Friedrichsplatz 15, D - 35037 Marburg
19-21 January 2016
Günter Mühlberger (University of Innsbruck, AT): The READ project. Objectives, tasks and partner organisations
co:op-READ-Convention Marburg
Technology meets Scholarship, or how Handwritten Text Recognition will Revolutionize Access to Archival Collections.
With a special focus on biographical data in archives
Hessian State Archives Marburg Friedrichsplatz 15, D - 35037 Marburg
19-21 January 2016
WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...Micah Altman
The WorldMap platform http://worldmap.harvard.edu is the largest open source collaborative mapping system in the world, with over 13,000 map layers contributed by thousands of users from Harvard and around the world. Researchers may upload large spatial datasets to the system, create data-driven visualizations, edit data, and control access. Users may keep their data private, share it in groups, or publish to the world.
The user base is interdisciplinary, including scholars from the humanities, social sciences, sciences, public health, design, planning, etc. All are able to access, view, and use one another’s data, either online, via map services, or by downloading.
Current work is underway to create and maintain a global registry of map services and take us a step closer to one-stop-access for public geospatial data. Another project is working on tools to support the visualization of spatial datasets with over a billion features. Current collaborations are underway with groups inside Harvard, such as Dataverse, HarvardX, and various departments, and with groups outside Harvard, such as Cornell University and the University of Pennsylvania. Major additional contributors to the underlying source code include the WorldBank, the U.S. State Department, and the United Nations.
The source code for the WorldMap platform is available on GitHub https://github.com/cga-harvard/cga-worldmap.
Location: E25-202
Discussant: Ben Lewis is system architect and project manager for WorldMap, an open source infrastructure that supports collaborative research centered on geospatial information. Before joining Harvard, Ben was a project manager with Advanced Technology Solutions of Pennsylvania, where he led the company in adopting platform independent approaches to GIS system development. Ben studied Chinese at the University of Wisconsin and has a Masters in Planning from the University of Pennsylvania. After Penn, Ben helped start the GIS Lab at U.C. Berkeley, founded the GIS group for transportation engineering firm McCormick Taylor, and coordinated the Land Acquisition Mapping System for South Florida Water Management District. Ben is especially interested in technologies that lower the barrier to spatial technology access.
Information Science Brown Bag talks, hosted by the Program on Information Science, consists of regular discussions and brainstorming sessions on all aspects of information science and uses of information science and technology to assess and solve institutional, social and research problems. These are informal talks. Discussions are often inspired by real-world problems being faced by the lead discussant.
This presenations provides an outlook of what we anticipate with the structured data hub: to create linkable datasets, enhance the use of provenance, add quality flags to data, answer new questions and finally, borrow from and provide to public sources such as dbpedia
Basilis Gatos (Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Center for Scientific Research “Demokritos”, GR): Hard Tasks in the Background. Layout analysis
co:op-READ-Convention Marburg
Technology meets Scholarship, or how Handwritten Text Recognition will Revolutionize Access to Archival Collections.
With a special focus on biographical data in archives
Hessian State Archives Marburg Friedrichsplatz 15, D - 35037 Marburg
19-21 January 2016
Wednesday 6 May: Hand me the data! What you should know as a humanities resea...WARCnet
Wednesday 6 May: Hand me the data! What you should know as a humanities researcher before asking for data from a web archive, Ulrich Have, NetLab/DIGHUMLAB, Aarhus University
This presentation was provided by Anne Washington of the University of Houston during the NISO virtual conference, Open Data Projects, held on Wednesday, June 13, 2018.
Towards digitizing scholarly communicationSören Auer
Slides of the VIVO 2016 Conference keynote: Despite the availability of ubiquitous connectivity and information technology, scholarly communication has not changed much in the last hundred years: research findings are still encoded in and decoded from linear, static articles and the possibilities of digitization are rarely used. In this talk, we will discuss strategies for digitizing scholarly communication. This comprises in particular: the use of machine-readable, dynamic content; the description and interlinking of research artifacts using Linked Data; the crowd-sourcing of multilingual
educational and learning content. We discuss the relation of these developments to research information systems and how they could become part of an open ecosystem for scholarly communication.
Linked data for Enterprise Data IntegrationSören Auer
The Web evolves into a Web of Data. In parallel Intranets of large companies will evolve into Data Intranets based on the Linked Data principles. Linked Data has the potential to complement the SOA paradigm with a light-weight, adaptive data integration approach.
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataSören Auer
Over the past 4 years, the Semantic Web activity has gained momentum with the widespread publishing of structured data as RDF. The Linked Data paradigm has therefore evolved from a practical research idea into
a very promising candidate for addressing one of the biggest challenges
of computer science: the exploitation of the Web as a platform for data
and information integration. To translate this initial success into a
world-scale reality, a number of research challenges need to be
addressed: the performance gap between relational and RDF data
management has to be closed, coherence and quality of data published on
the Web have to be improved, provenance and trust on the Linked Data Web
must be established and generally the entrance barrier for data
publishers and users has to be lowered. This tutorial will discuss
approaches for tackling these challenges. As an example of a successful
Linked Data project we will present DBpedia, which leverages Wikipedia
by extracting structured information and by making this information
freely accessible on the Web. The tutorial will also outline some recent advances in DBpedia, such as the mappings Wiki, DBpedia Live as well as
the recently launched DBpedia benchmark.
DSpace for Cultural Heritage: adding support for images visualization,audio/v...Andrea Bollini
Digital Repositories are continuously evolving into platforms aimed at managing, visualizing, curating and preserving a variety of different cultural digital objects together with their relationships
To support interoperability and to allow a broad dissemination and re-use of cultural heritage and research results, we have built two DSpace add-ons to be released as open source, the IIIF (International Image Interoperability Framework) Image Viewer and the audio/video streaming module. The first one manages the complexity of digital objects such as page sequences, chapters and sections, exposing the metadata and the structure with the IIIF presentation API and use the IIIF API to provide fast visualization and low bandwidth use. The streaming module allows to stream audio / video content loaded in the repository using adaptive streaming and the DASH industry standard. Both modules provide a full open source stack or enable the integration with external Images and Media Server.
Managing the relations between digital objects both in a hierarchical or relational way is a key feature, in order to manage every kind of cultural heritage material. Thus we are enhancing the DSpace Data Model in order to provide not only structural metadata management but also the description of relationships within cultural contexts.
Slides presented at OR2017 - Brisbane, Australia
Rebecca Grant - DH research data: identification and challenges (DH2016)dri_ireland
Presentation made by Rebecca Grant as part of the panel session “Digital data sharing: the opportunities and challenges of opening research” at the Digital Humanities conference, Krakow, 15 July 2016. This paper “DH research data: identification and challenges” provided an introduction to concepts of research data in the digital humanities, including accepted definitions of what constitutes research data in a DH context.
Sebastian Colutto (University of Innsbruck, AT): Transkribus. A virtual research environment for the transcription and recognition of historical documents
co:op-READ-Convention Marburg
Technology meets Scholarship, or how Handwritten Text Recognition will Revolutionize Access to Archival Collections.
With a special focus on biographical data in archives
Hessian State Archives Marburg Friedrichsplatz 15, D - 35037 Marburg
19-21 January 2016
Günter Mühlberger (University of Innsbruck, AT): The READ project. Objectives, tasks and partner organisations
co:op-READ-Convention Marburg
Technology meets Scholarship, or how Handwritten Text Recognition will Revolutionize Access to Archival Collections.
With a special focus on biographical data in archives
Hessian State Archives Marburg Friedrichsplatz 15, D - 35037 Marburg
19-21 January 2016
WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...Micah Altman
The WorldMap platform http://worldmap.harvard.edu is the largest open source collaborative mapping system in the world, with over 13,000 map layers contributed by thousands of users from Harvard and around the world. Researchers may upload large spatial datasets to the system, create data-driven visualizations, edit data, and control access. Users may keep their data private, share it in groups, or publish to the world.
The user base is interdisciplinary, including scholars from the humanities, social sciences, sciences, public health, design, planning, etc. All are able to access, view, and use one another’s data, either online, via map services, or by downloading.
Current work is underway to create and maintain a global registry of map services and take us a step closer to one-stop-access for public geospatial data. Another project is working on tools to support the visualization of spatial datasets with over a billion features. Current collaborations are underway with groups inside Harvard, such as Dataverse, HarvardX, and various departments, and with groups outside Harvard, such as Cornell University and the University of Pennsylvania. Major additional contributors to the underlying source code include the WorldBank, the U.S. State Department, and the United Nations.
The source code for the WorldMap platform is available on GitHub https://github.com/cga-harvard/cga-worldmap.
Location: E25-202
Discussant: Ben Lewis is system architect and project manager for WorldMap, an open source infrastructure that supports collaborative research centered on geospatial information. Before joining Harvard, Ben was a project manager with Advanced Technology Solutions of Pennsylvania, where he led the company in adopting platform independent approaches to GIS system development. Ben studied Chinese at the University of Wisconsin and has a Masters in Planning from the University of Pennsylvania. After Penn, Ben helped start the GIS Lab at U.C. Berkeley, founded the GIS group for transportation engineering firm McCormick Taylor, and coordinated the Land Acquisition Mapping System for South Florida Water Management District. Ben is especially interested in technologies that lower the barrier to spatial technology access.
Information Science Brown Bag talks, hosted by the Program on Information Science, consists of regular discussions and brainstorming sessions on all aspects of information science and uses of information science and technology to assess and solve institutional, social and research problems. These are informal talks. Discussions are often inspired by real-world problems being faced by the lead discussant.
This presenations provides an outlook of what we anticipate with the structured data hub: to create linkable datasets, enhance the use of provenance, add quality flags to data, answer new questions and finally, borrow from and provide to public sources such as dbpedia
Basilis Gatos (Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Center for Scientific Research “Demokritos”, GR): Hard Tasks in the Background. Layout analysis
co:op-READ-Convention Marburg
Technology meets Scholarship, or how Handwritten Text Recognition will Revolutionize Access to Archival Collections.
With a special focus on biographical data in archives
Hessian State Archives Marburg Friedrichsplatz 15, D - 35037 Marburg
19-21 January 2016
Wednesday 6 May: Hand me the data! What you should know as a humanities resea...WARCnet
Wednesday 6 May: Hand me the data! What you should know as a humanities researcher before asking for data from a web archive, Ulrich Have, NetLab/DIGHUMLAB, Aarhus University
This presentation was provided by Anne Washington of the University of Houston during the NISO virtual conference, Open Data Projects, held on Wednesday, June 13, 2018.
Towards digitizing scholarly communicationSören Auer
Slides of the VIVO 2016 Conference keynote: Despite the availability of ubiquitous connectivity and information technology, scholarly communication has not changed much in the last hundred years: research findings are still encoded in and decoded from linear, static articles and the possibilities of digitization are rarely used. In this talk, we will discuss strategies for digitizing scholarly communication. This comprises in particular: the use of machine-readable, dynamic content; the description and interlinking of research artifacts using Linked Data; the crowd-sourcing of multilingual
educational and learning content. We discuss the relation of these developments to research information systems and how they could become part of an open ecosystem for scholarly communication.
Linked data for Enterprise Data IntegrationSören Auer
The Web evolves into a Web of Data. In parallel Intranets of large companies will evolve into Data Intranets based on the Linked Data principles. Linked Data has the potential to complement the SOA paradigm with a light-weight, adaptive data integration approach.
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataSören Auer
Over the past 4 years, the Semantic Web activity has gained momentum with the widespread publishing of structured data as RDF. The Linked Data paradigm has therefore evolved from a practical research idea into
a very promising candidate for addressing one of the biggest challenges
of computer science: the exploitation of the Web as a platform for data
and information integration. To translate this initial success into a
world-scale reality, a number of research challenges need to be
addressed: the performance gap between relational and RDF data
management has to be closed, coherence and quality of data published on
the Web have to be improved, provenance and trust on the Linked Data Web
must be established and generally the entrance barrier for data
publishers and users has to be lowered. This tutorial will discuss
approaches for tackling these challenges. As an example of a successful
Linked Data project we will present DBpedia, which leverages Wikipedia
by extracting structured information and by making this information
freely accessible on the Web. The tutorial will also outline some recent advances in DBpedia, such as the mappings Wiki, DBpedia Live as well as
the recently launched DBpedia benchmark.
DSpace for Cultural Heritage: adding support for images visualization,audio/v...Andrea Bollini
Digital Repositories are continuously evolving into platforms aimed at managing, visualizing, curating and preserving a variety of different cultural digital objects together with their relationships
To support interoperability and to allow a broad dissemination and re-use of cultural heritage and research results, we have built two DSpace add-ons to be released as open source, the IIIF (International Image Interoperability Framework) Image Viewer and the audio/video streaming module. The first one manages the complexity of digital objects such as page sequences, chapters and sections, exposing the metadata and the structure with the IIIF presentation API and use the IIIF API to provide fast visualization and low bandwidth use. The streaming module allows to stream audio / video content loaded in the repository using adaptive streaming and the DASH industry standard. Both modules provide a full open source stack or enable the integration with external Images and Media Server.
Managing the relations between digital objects both in a hierarchical or relational way is a key feature, in order to manage every kind of cultural heritage material. Thus we are enhancing the DSpace Data Model in order to provide not only structural metadata management but also the description of relationships within cultural contexts.
Slides presented at OR2017 - Brisbane, Australia
Rebecca Grant - DH research data: identification and challenges (DH2016)dri_ireland
Presentation made by Rebecca Grant as part of the panel session “Digital data sharing: the opportunities and challenges of opening research” at the Digital Humanities conference, Krakow, 15 July 2016. This paper “DH research data: identification and challenges” provided an introduction to concepts of research data in the digital humanities, including accepted definitions of what constitutes research data in a DH context.
A whirlwind introduction to digital humanities for CDP Digital Humanities: Collections & Heritage - current challenges and futures workshop. February 22, 2018 Imperial War Museum
Andrea Scharnhorst (2016) Humanities and ICT. Introduction at the Workshop National Infrastructure, Social Science and Humanities, January 20, 2015, ePlan workshop at NLeSC, Amsterdam.
Bibliotheca Digitalis. Reconstitution of Early Modern Cultural Networks. From Primary Source to Data.
DARIAH / Biblissima Summer School, 4-8 July 2017, Le Mans, France.
5th and last day, July 8th – Digital representation and data accuracy for Humanities.
Humanities at Scale and Dariah-EU.
Nicolas Larrousse – Research officer, TGIR Huma-Num.
Abstract: https://bvh.hypotheses.org/3330#resume-NLarousse
A presentation about DARIAH (Digital Research Infrastructure for the Arts and Humanities) given as a digital humanities (DH) showcase at the LibraryLab of the Faculty of Arts and Philosophy Library, Ghent University on 2 April 2015
This presentation, created by Syed Faiz ul Hassan, explores the profound influence of media on public perception and behavior. It delves into the evolution of media from oral traditions to modern digital and social media platforms. Key topics include the role of media in information propagation, socialization, crisis awareness, globalization, and education. The presentation also examines media influence through agenda setting, propaganda, and manipulative techniques used by advertisers and marketers. Furthermore, it highlights the impact of surveillance enabled by media technologies on personal behavior and preferences. Through this comprehensive overview, the presentation aims to shed light on how media shapes collective consciousness and public opinion.
Acorn Recovery: Restore IT infra within minutesIP ServerOne
Introducing Acorn Recovery as a Service, a simple, fast, and secure managed disaster recovery (DRaaS) by IP ServerOne. A DR solution that helps restore your IT infra within minutes.
0x01 - Newton's Third Law: Static vs. Dynamic AbusersOWASP Beja
f you offer a service on the web, odds are that someone will abuse it. Be it an API, a SaaS, a PaaS, or even a static website, someone somewhere will try to figure out a way to use it to their own needs. In this talk we'll compare measures that are effective against static attackers and how to battle a dynamic attacker who adapts to your counter-measures.
About the Speaker
===============
Diogo Sousa, Engineering Manager @ Canonical
An opinionated individual with an interest in cryptography and its intersection with secure software development.
This presentation by Morris Kleiner (University of Minnesota), was made during the discussion “Competition and Regulation in Professions and Occupations” held at the Working Party No. 2 on Competition and Regulation on 10 June 2024. More papers and presentations on the topic can be found out at oe.cd/crps.
This presentation was uploaded with the author’s consent.
Sharpen existing tools or get a new toolbox? Contemporary cluster initiatives...Orkestra
UIIN Conference, Madrid, 27-29 May 2024
James Wilson, Orkestra and Deusto Business School
Emily Wise, Lund University
Madeline Smith, The Glasgow School of Art
Have you ever wondered how search works while visiting an e-commerce site, internal website, or searching through other types of online resources? Look no further than this informative session on the ways that taxonomies help end-users navigate the internet! Hear from taxonomists and other information professionals who have first-hand experience creating and working with taxonomies that aid in navigation, search, and discovery across a range of disciplines.
Announcement of 18th IEEE International Conference on Software Testing, Verif...
New tasks, new roles: Libraries in the tension between Digital Humanities, Research Data, and Research Infrastructures
1. New tasks, new roles: Libraries in the tension
between Digital Humanities, Research Data,
and Research Infrastructures
Dr. Stefan Schmunk | Jasná, 3rd April 2017
Research & Development
Department (RDD)
@DARIAHde
@StefanSchmunk
2. • Introduction
• Digital Humanities: DARIAH-DE and TextGrid
• Research Data in the Arts and Humanities
• Digital Scholarly Editions as new tasks for Research
Libraries
• Conclusion
03.04.17
Overview – Agenda
3. 03.04.17
Arts and Humanities
& Social Science Computer Science
Libraries & Archives – Information Science
Digital Humanities =
Applied digital enabled research
in the Arts and Humanities
Digital Humanities – an interdisciplinary approach
4. - funding: 2011-2019
- Continous operation after
2019
- funded by
- about 15 m euros
- consortium of 17 partners:
universities, research
institutions, computing
centres, libraries, Academy
of Sciences and
Humanities, SME, NGO
- funding: 2006-2015
(integration in DARIAH-DE)
- funded by
- about 8 m euros
- consortium of 10 partners:
universities, research
institutions, computing
centre, library, Academy of
Sciences and Humanities,
SME
03.04.17
www.de.dariah.eu www.textgrid.de
5. … supports digitally-enabled
research and teaching in the
arts and humanities
- Teaching
- Research
- Research Data
- Technical Infrastructure
… supports the creation of
digital editions in the humanities
and cultural studies
- Tools for text enrichment and
annotation
- Repository for long term
preservation
- Training and education
03.04.17
è Research driven and conceptualized as an architecture of par4cipa4on
6. 03.04.17
Teaching (support, training)
• Workshops on methods,
expert colloquia, Summer Schools
• Coordination of national and
international curricular developments
Research Data (research data
collections, research data management)
• Best practices for metadata,
standardised exchange of data,
ontologies
• Development of generic search,
collection and schema registry
• Development of a tool based
federation architecture for research
data
Research (DH & Information
Science)
• DH methods and practices
• Use Cases: Annotation, Big Data
• Tools and services
• Bibliography Doing Digital Humanities
Technical Infrastructure
(development and provision of
infrastructural services)
• collaborative research environments,
virtual machines, monitoring,
authentication and authorization
infrastructure etc.
DARIAH-DE – four main tasks
7. 03.04.17
Definition Research Data
“By research data in the Arts and Humanities is meant
all those sources and outputs that are based on a
research question, collected either by scholars,
libraries, archives or any other institution, which may
be described, evaluated and/or generated and stored
in machine-readable form for analyzing, archiving
purposes, citability, and for further processing.”
Research Data – Arts and Humanities
8. 03.04.17
• Metadata, bibliographical data, finding aids
• Digital and/or digitized data and/or digital
representation of analog data
• Digital objects (tool generated data)
• Full text, transcripts
• Enriched full text
• Images, movies, music & notes
• Authority files, controlled vocabularies, ontologies
• ...
Research Data – Arts and Humanities
9. 03.04.17
Research Data Collection – epidat
• Data: 30.000 inscriptions of more than 130 Jewish cemeteries in
Germany and the Netherlands (1050-2000)
• Contains: Digitized Data, transcripts,
maps, photographs, and
archival sources
• Methods: Visual Analytics,
Named-Entity-Recognition,
Temporal-Spatial Visualization
Source: http://www.steinheim-institut.de/cgi-bin/epidat?info=edv&art=art6
Example: Epigraphical Database
10. 03.04.17
Photography
Scholarly Edition – synoptical view of inscriptions
Burial site
Symbols
Further information about Epidat:
Thomas Kollatz
kol@steinheim-institut.org
Example: Epigraphical Database
11. 03.04.17
• DARIAH-DE Geo-
Browser
• Visualisation of
data, sets, and
collection
• Temporal-Spatial-
Contextualisation
• Open Source tool
as a service for the
community
Source:
http://www.steinheim-institut.de/cgi-bin/epidat?
info=e4d&lang=de
Epidat: Data-Visualisation
www.geobrowser.de.dariah.eu
12. Research data are representations of observations, objects, or other
entities used as evidence of phenomena for the purposes of research or
scholarship” C.L. Borgman (2015). Big Data, Little Data, No Data: Scholarship in the Networked
World.
New challenges in the Humanities dealing with
Research data concerning:
• Discovering data
• Access to data
• Extended search strategies for data
• Using data tools (and generating data)
• Linking articles with data à enhanced publications
03.04.17
What is the situation for the Digital Humanities?
14. 03.04.17
Repositories for the Arts and Humanities
https://de.dariah.eu/publish https://textgridrep.org
DARIAH-DE Repository TextGrid Repository
Integra4ng exis4ng tools
Analyze and visualize
Data e.g. Voyant Tools
Archive,
store,
preserve,
but also:
explore,
find,
analyze and
visualize
15. • period: 1859 - 1880s
• 67 notebooks
• 64 - 120 leaves each
• < 10.000 pages
Theodor Fontane
(1819 - 1898)
Digital Scholarly Edition
Theodor Fontane‘s Notebooks
03.04.17
https://fontane-nb.dariah.eu/index.html
18. Research
Object
Research Data
observational data,
audio, video,
interviews, statistical
data, ...
Publication
• Documentation of research
data for replication and
detection of fraud
• Sharing and publication of
research data: „old“ data for
new research
• Building up research
infrastructures for scientist
• Keep digital data usable
despite changes in
technology, organisations,
background knowledge, ...
• Supporting researchers
during the project phase
(embedded data managers)
New Tasks for Libraries:
Data, Infrastructures, and Digital Humanities
03.04.17
19. • Scientific Information is more than a journal article or
a book
• Libraries should open their catalogues to any kind of
information
• The catalogue of the future is NOT ONLY a window
to the library‘s holding, but
• A portal in a net of trusted providers of scientific
content
Consequences for Libraries
03.04.17