Be the first to like this
Humanities scholars work with unstructured data: information about human culture stored in books, archival records, audiovisual sources and other carriers of information. Traditionally, the data from these various sources were extracted and processed in the mind of the scholar. With the growing availability of these data in digital form, the tasks of extracting and combining information from various datasets becomes mediated by computational tools. In order to support scholars in working with digital data, a high level of transparency is required: scholars want to know exactly where the data originate, how they have been processed and manipulated, and what this means for their results and interpretation. In this lecture I will discuss our experiences in designing the CLARIAH research infrastructure for media studies research, focusing on the requirements regarding the transparency of data and tools.