Towards a Graph of Ancient World Data & an Ecosystem of Gazetteersaboutgeo
Pelagios 3 is a 2-year project funded by the Andrew W. Mellon Foundation to annotate geographic documents predating 1492 by associating places mentioned in them with entries from various gazetteers. It aims to grow a graph of linked ancient world data by annotating over 39 partners from 6 countries have contributed around 830,000 annotations so far. The project develops tools to annotate documents and link gazetteer entries, and defines profiles for publishing gazetteer links and metadata online to allow cross-searching.
Controlled Vocabularies and Text Mining - Use Cases at the Goettingen State a...Ralf Stockmann
The document discusses several use cases for text mining and controlled vocabularies at the Goettingen State and University Library. It describes a project called eAqua that compares semantic graphs between journal article headings and full texts. It also discusses a project called Europeana 4D that visualizes data from multiple sources on an interactive timeline and map to show connections and relationships. Guidelines are provided for how to build datasets in KML format and contribute them to the Europeana 4D prototype visualization tool.
A presentation to attendees of our Arabic Scientific Manuscripts ground truth for OCR transcription workshop.
For more details see: https://www.eventbrite.co.uk/e/arabic-scientific-manuscripts-transcription-workshop-tickets-43303096728
About the project: http://blogs.bl.uk/digital-scholarship/2018/03/arabic-handwrittten-ocr.html
Lighting presentation for Moving People / Linking Lives at the University of Virginia, March 2015. It discusses EAC-CPF, mapping to RDF linked data, and using xEAC as the publication framework for highly interlinked archival authorities and scholarly prosopographies.
DARIAH Geo-browser: Exploring Data through Time and SpaceMatteo Romanello
This document discusses the DARIAH Geo-browser tool, which allows users to explore datasets with both temporal and spatial information by visualizing the data on a map. The tool is suitable for exploratory research to help users visualize patterns within their data. It can import data in KML format that contains time and place information. As an example, the document demonstrates how the tool can be used to explore publications related to places along the Roman Limes by mapping the publications to the locations and dates. The key benefits highlighted are the ability to conduct exploratory research through data visualization, and the interoperability of the tool through its use of APIs, common identifiers, and ability to import/export different data formats.
EAA 2017 Re-engineering the process: How best to share, connect, re-use & pro...Keith.May
The document discusses re-engineering archaeological processes to better share, connect, and provide access to archaeological information through emerging digital technologies and improved collaboration. It outlines three key areas from the 2015 EAC Agenda: 1) using technology to share information and develop networks; 2) encouraging interdisciplinary cooperation and data sharing; and 3) providing greatest possible public access to digital archaeological resources. It explores potential for open access publishing platforms and analytics research on various heritage data types, and emphasizes conceptualizing common relationships and aligning vocabularies to enable cross-searching of archaeological data from different systems.
New Discovery Tools for Digital Humanities and Spatial Data (Summary of the J...Micah Altman
This document discusses new tools for digital humanities and spatial data. It describes how physical discovery of manuscripts led to new methods of transmission and preservation of information over time. Modern libraries are indexing resources through internal catalogs and digital objects. The text advocates for moving resources on the semantic web using linked open data with RDF to better integrate geographic data and connect projects. The future of catalogs may involve direct access to digital resources through APIs, linked open data, and graph databases to allow deeper analysis of content and spatial indexing of metadata.
Towards a Graph of Ancient World Data & an Ecosystem of Gazetteersaboutgeo
Pelagios 3 is a 2-year project funded by the Andrew W. Mellon Foundation to annotate geographic documents predating 1492 by associating places mentioned in them with entries from various gazetteers. It aims to grow a graph of linked ancient world data by annotating over 39 partners from 6 countries have contributed around 830,000 annotations so far. The project develops tools to annotate documents and link gazetteer entries, and defines profiles for publishing gazetteer links and metadata online to allow cross-searching.
Controlled Vocabularies and Text Mining - Use Cases at the Goettingen State a...Ralf Stockmann
The document discusses several use cases for text mining and controlled vocabularies at the Goettingen State and University Library. It describes a project called eAqua that compares semantic graphs between journal article headings and full texts. It also discusses a project called Europeana 4D that visualizes data from multiple sources on an interactive timeline and map to show connections and relationships. Guidelines are provided for how to build datasets in KML format and contribute them to the Europeana 4D prototype visualization tool.
A presentation to attendees of our Arabic Scientific Manuscripts ground truth for OCR transcription workshop.
For more details see: https://www.eventbrite.co.uk/e/arabic-scientific-manuscripts-transcription-workshop-tickets-43303096728
About the project: http://blogs.bl.uk/digital-scholarship/2018/03/arabic-handwrittten-ocr.html
Lighting presentation for Moving People / Linking Lives at the University of Virginia, March 2015. It discusses EAC-CPF, mapping to RDF linked data, and using xEAC as the publication framework for highly interlinked archival authorities and scholarly prosopographies.
DARIAH Geo-browser: Exploring Data through Time and SpaceMatteo Romanello
This document discusses the DARIAH Geo-browser tool, which allows users to explore datasets with both temporal and spatial information by visualizing the data on a map. The tool is suitable for exploratory research to help users visualize patterns within their data. It can import data in KML format that contains time and place information. As an example, the document demonstrates how the tool can be used to explore publications related to places along the Roman Limes by mapping the publications to the locations and dates. The key benefits highlighted are the ability to conduct exploratory research through data visualization, and the interoperability of the tool through its use of APIs, common identifiers, and ability to import/export different data formats.
EAA 2017 Re-engineering the process: How best to share, connect, re-use & pro...Keith.May
The document discusses re-engineering archaeological processes to better share, connect, and provide access to archaeological information through emerging digital technologies and improved collaboration. It outlines three key areas from the 2015 EAC Agenda: 1) using technology to share information and develop networks; 2) encouraging interdisciplinary cooperation and data sharing; and 3) providing greatest possible public access to digital archaeological resources. It explores potential for open access publishing platforms and analytics research on various heritage data types, and emphasizes conceptualizing common relationships and aligning vocabularies to enable cross-searching of archaeological data from different systems.
New Discovery Tools for Digital Humanities and Spatial Data (Summary of the J...Micah Altman
This document discusses new tools for digital humanities and spatial data. It describes how physical discovery of manuscripts led to new methods of transmission and preservation of information over time. Modern libraries are indexing resources through internal catalogs and digital objects. The text advocates for moving resources on the semantic web using linked open data with RDF to better integrate geographic data and connect projects. The future of catalogs may involve direct access to digital resources through APIs, linked open data, and graph databases to allow deeper analysis of content and spatial indexing of metadata.
WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...Micah Altman
The WorldMap platform http://worldmap.harvard.edu is the largest open source collaborative mapping system in the world, with over 13,000 map layers contributed by thousands of users from Harvard and around the world. Researchers may upload large spatial datasets to the system, create data-driven visualizations, edit data, and control access. Users may keep their data private, share it in groups, or publish to the world.
The user base is interdisciplinary, including scholars from the humanities, social sciences, sciences, public health, design, planning, etc. All are able to access, view, and use one another’s data, either online, via map services, or by downloading.
Current work is underway to create and maintain a global registry of map services and take us a step closer to one-stop-access for public geospatial data. Another project is working on tools to support the visualization of spatial datasets with over a billion features. Current collaborations are underway with groups inside Harvard, such as Dataverse, HarvardX, and various departments, and with groups outside Harvard, such as Cornell University and the University of Pennsylvania. Major additional contributors to the underlying source code include the WorldBank, the U.S. State Department, and the United Nations.
The source code for the WorldMap platform is available on GitHub https://github.com/cga-harvard/cga-worldmap.
Location: E25-202
Discussant: Ben Lewis is system architect and project manager for WorldMap, an open source infrastructure that supports collaborative research centered on geospatial information. Before joining Harvard, Ben was a project manager with Advanced Technology Solutions of Pennsylvania, where he led the company in adopting platform independent approaches to GIS system development. Ben studied Chinese at the University of Wisconsin and has a Masters in Planning from the University of Pennsylvania. After Penn, Ben helped start the GIS Lab at U.C. Berkeley, founded the GIS group for transportation engineering firm McCormick Taylor, and coordinated the Land Acquisition Mapping System for South Florida Water Management District. Ben is especially interested in technologies that lower the barrier to spatial technology access.
Information Science Brown Bag talks, hosted by the Program on Information Science, consists of regular discussions and brainstorming sessions on all aspects of information science and uses of information science and technology to assess and solve institutional, social and research problems. These are informal talks. Discussions are often inspired by real-world problems being faced by the lead discussant.
This document summarizes a meeting between librarians and researchers on sharing research data. It includes presentations on the changing data environment, sharing geographic data, libraries providing infrastructure for research data curation, and the Cambridge context. Attendees discussed making data available and structured online, managing risks like licensing, and the roles of different players in moving from data production to consumption.
What does it take to create a web of government Linked Data? The UK government is finding out. Our story is one of pioneers. You will hear how we are moving out of existing settlements to the wide plains of government data. How we are starting to build the first railroads across this vast territory to open a new lands of opportunity. All the time, of course, having to avoid both outlaws and the Civil War back east.
Providing geospatial information as Linked Open DataPat Kenny
ADAPT is revolutionising the way people can seamlessly interact with digital content, systems and each other and enabling users to achieve unprecedented levels of access and efficiency. - Prof. Declan O'Sullivan, Trinity College Dublin. Address given at Ordnance Survey Ireland GI R&D Initiatives, Tuesday, 22 March 2016, 13:00 to 20:30 (GMT), Maynooth University.
This document discusses using R for spatial data analysis. It begins by outlining the key topics that will be covered, including why R is used for spatial analysis and its relationship to GIS. It then provides background on R, describing it as a free software environment for statistical computing and graphics. The document also defines spatial data and the common types. It notes that R has thousands of packages that add functionality and discusses challenges of storing and displaying spatial data. Finally, it provides an overview of the book's structure, which first presents shared R packages for spatial data and then showcases more specialized spatial analysis techniques.
Pelagios is a project that links ancient places mentioned in texts, inscriptions, archaeological finds, and museum objects through annotations. It involves over 45 partners from 10 countries annotating over 1 million references. The goal is to enable discovery and enrich context by connecting data through common references rather than imposing a common schema. Pelagios develops tools to make annotation easy and provides services like a search API and downloadable geospatial data to explore the links between sources. The project aims to support digital scholarship by building an ecosystem that interconnects people, places, times and more across various projects and resources.
Space, Time and Space-Time. Where, When and How should we use them? Considera...Keith.May
This document discusses considerations for incorporating space, time, and their relationships in archaeological research questions. It presents conceptual models for representing the spatiotemporal relationships between archaeological records, contexts, and events. While tools like Harris matrices and section drawings help visualize some relationships, new ways of explicitly modeling and visualizing the full complexity of spatiotemporal data are still needed. Semantic technologies offer potential for improved conceptualization and representation of archaeological spatiotemporal information.
A Comparative Kalendar - DH2013 Presentationblalbritton
The document proposes building a research tool called SharedKalendar to facilitate discovery and analysis of medieval Books of Hours by extracting structured data like dates and liturgical events from digitized manuscript kalendars. It aims to leverage existing image resources and annotations created with interoperable tools on a shared canvas. A prototype exhibits basic transcription data and links to interactive manuscript views. Challenges include encouraging data creation and addressing varying data availability, but the approach could enable new comparative analyses across distributed collections and repositories.
Integrating archaeological data: The ARIADNE Infrastructure, Achille Felicett...ariadnenetwork
This presentation by Achille Felicetti of PIN (Università degli Studi di Firenze, Prato) on the work by the ARIADNE infrastructure to integrating archaeological data was given as part of a workshop organised by Digital Humanities Austria. The workshop focussed on the pressing question of long-term preservation of digital data from various angles, central being user needs specific to the different fields of the Humanities. Felicetti introduced the ARIADNE research infrastructure, which has been funded by the EC's FP7 programme, to integrate archaeological research datasets from across Europe and support their uses by researchers.
An Overview of the PELAGIOS project (http://pelagios-project.blogspot.com), a digital classics network that aims at interlinking research resources by means of a Linked Open Data approach.
Vocabularies as Linked Data - OUDCE March2014Keith.May
Presentation given as part of OUDCE course in Oxford 04-03-2014 on "Digital Data and Archaeology: Management, Preservation and Publishing.
Acknowledgements to Ceri Binding @Ceribin for many of the slides.
Vocabularies as Linked Data: SENESCHAL & HeritageData.orgKeith.May
This document discusses the SENESCHAL project which converted archaeological controlled vocabularies into Linked Open Data using the SKOS standard. It enabled vocabulary providers like Historic England and RCAHMS to make their thesauri available as Linked Data and facilitated concept searching and browsing. The document outlines how the vocabularies were developed and aligned with legacy data to create unique, persistent concept identifiers and relationships between concepts, datasets, and countries to improve semantic search and data sharing.
De presentatie van Erwin Folmer, tijdens de parallelle sessie 'Randvoorwaarden voor data gedreven beleid en politiek' van het congres 'Data gedreven Beleidsontwikkeling' in Den Haag op 28 november 2017.
SMART-GS: A Tool for Studying Digitized Historical ManuscriptsYuta Hashimoto
SMART-GS is an open source transcription and annotation tool developed at Kyoto University for studying digitized historical manuscripts. It allows users to mark up images, add text transcriptions and annotations, and link related markups. The tool has been used in projects transcribing historical diaries and notebooks. It supports group work through a shared online platform. The developers hope to make SMART-GS a global collaboration platform to support international projects digitizing and analyzing Japanese classical texts.
Geo-linked data: towards deep integration of location in the web of dataAndrew Woolf
This document discusses linking location data on the web. It notes that most data has a spatial or location component, and that linking this location data can provide valuable insights. It provides examples of linked location data from various organizations, including the UK Ordnance Survey and Bureau of Meteorology in Australia. It also describes the benefits of following linked data principles and using standards like GeoSPARQL to query linked geospatial data. Finally, it outlines efforts in Australia to develop best practices for publishing government data as linked open data, including the formation of an inter-agency working group.
WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...Micah Altman
The WorldMap platform http://worldmap.harvard.edu is the largest open source collaborative mapping system in the world, with over 13,000 map layers contributed by thousands of users from Harvard and around the world. Researchers may upload large spatial datasets to the system, create data-driven visualizations, edit data, and control access. Users may keep their data private, share it in groups, or publish to the world.
The user base is interdisciplinary, including scholars from the humanities, social sciences, sciences, public health, design, planning, etc. All are able to access, view, and use one another’s data, either online, via map services, or by downloading.
Current work is underway to create and maintain a global registry of map services and take us a step closer to one-stop-access for public geospatial data. Another project is working on tools to support the visualization of spatial datasets with over a billion features. Current collaborations are underway with groups inside Harvard, such as Dataverse, HarvardX, and various departments, and with groups outside Harvard, such as Cornell University and the University of Pennsylvania. Major additional contributors to the underlying source code include the WorldBank, the U.S. State Department, and the United Nations.
The source code for the WorldMap platform is available on GitHub https://github.com/cga-harvard/cga-worldmap.
Location: E25-202
Discussant: Ben Lewis is system architect and project manager for WorldMap, an open source infrastructure that supports collaborative research centered on geospatial information. Before joining Harvard, Ben was a project manager with Advanced Technology Solutions of Pennsylvania, where he led the company in adopting platform independent approaches to GIS system development. Ben studied Chinese at the University of Wisconsin and has a Masters in Planning from the University of Pennsylvania. After Penn, Ben helped start the GIS Lab at U.C. Berkeley, founded the GIS group for transportation engineering firm McCormick Taylor, and coordinated the Land Acquisition Mapping System for South Florida Water Management District. Ben is especially interested in technologies that lower the barrier to spatial technology access.
Information Science Brown Bag talks, hosted by the Program on Information Science, consists of regular discussions and brainstorming sessions on all aspects of information science and uses of information science and technology to assess and solve institutional, social and research problems. These are informal talks. Discussions are often inspired by real-world problems being faced by the lead discussant.
This document summarizes a meeting between librarians and researchers on sharing research data. It includes presentations on the changing data environment, sharing geographic data, libraries providing infrastructure for research data curation, and the Cambridge context. Attendees discussed making data available and structured online, managing risks like licensing, and the roles of different players in moving from data production to consumption.
What does it take to create a web of government Linked Data? The UK government is finding out. Our story is one of pioneers. You will hear how we are moving out of existing settlements to the wide plains of government data. How we are starting to build the first railroads across this vast territory to open a new lands of opportunity. All the time, of course, having to avoid both outlaws and the Civil War back east.
Providing geospatial information as Linked Open DataPat Kenny
ADAPT is revolutionising the way people can seamlessly interact with digital content, systems and each other and enabling users to achieve unprecedented levels of access and efficiency. - Prof. Declan O'Sullivan, Trinity College Dublin. Address given at Ordnance Survey Ireland GI R&D Initiatives, Tuesday, 22 March 2016, 13:00 to 20:30 (GMT), Maynooth University.
This document discusses using R for spatial data analysis. It begins by outlining the key topics that will be covered, including why R is used for spatial analysis and its relationship to GIS. It then provides background on R, describing it as a free software environment for statistical computing and graphics. The document also defines spatial data and the common types. It notes that R has thousands of packages that add functionality and discusses challenges of storing and displaying spatial data. Finally, it provides an overview of the book's structure, which first presents shared R packages for spatial data and then showcases more specialized spatial analysis techniques.
Pelagios is a project that links ancient places mentioned in texts, inscriptions, archaeological finds, and museum objects through annotations. It involves over 45 partners from 10 countries annotating over 1 million references. The goal is to enable discovery and enrich context by connecting data through common references rather than imposing a common schema. Pelagios develops tools to make annotation easy and provides services like a search API and downloadable geospatial data to explore the links between sources. The project aims to support digital scholarship by building an ecosystem that interconnects people, places, times and more across various projects and resources.
Space, Time and Space-Time. Where, When and How should we use them? Considera...Keith.May
This document discusses considerations for incorporating space, time, and their relationships in archaeological research questions. It presents conceptual models for representing the spatiotemporal relationships between archaeological records, contexts, and events. While tools like Harris matrices and section drawings help visualize some relationships, new ways of explicitly modeling and visualizing the full complexity of spatiotemporal data are still needed. Semantic technologies offer potential for improved conceptualization and representation of archaeological spatiotemporal information.
A Comparative Kalendar - DH2013 Presentationblalbritton
The document proposes building a research tool called SharedKalendar to facilitate discovery and analysis of medieval Books of Hours by extracting structured data like dates and liturgical events from digitized manuscript kalendars. It aims to leverage existing image resources and annotations created with interoperable tools on a shared canvas. A prototype exhibits basic transcription data and links to interactive manuscript views. Challenges include encouraging data creation and addressing varying data availability, but the approach could enable new comparative analyses across distributed collections and repositories.
Integrating archaeological data: The ARIADNE Infrastructure, Achille Felicett...ariadnenetwork
This presentation by Achille Felicetti of PIN (Università degli Studi di Firenze, Prato) on the work by the ARIADNE infrastructure to integrating archaeological data was given as part of a workshop organised by Digital Humanities Austria. The workshop focussed on the pressing question of long-term preservation of digital data from various angles, central being user needs specific to the different fields of the Humanities. Felicetti introduced the ARIADNE research infrastructure, which has been funded by the EC's FP7 programme, to integrate archaeological research datasets from across Europe and support their uses by researchers.
An Overview of the PELAGIOS project (http://pelagios-project.blogspot.com), a digital classics network that aims at interlinking research resources by means of a Linked Open Data approach.
Vocabularies as Linked Data - OUDCE March2014Keith.May
Presentation given as part of OUDCE course in Oxford 04-03-2014 on "Digital Data and Archaeology: Management, Preservation and Publishing.
Acknowledgements to Ceri Binding @Ceribin for many of the slides.
Vocabularies as Linked Data: SENESCHAL & HeritageData.orgKeith.May
This document discusses the SENESCHAL project which converted archaeological controlled vocabularies into Linked Open Data using the SKOS standard. It enabled vocabulary providers like Historic England and RCAHMS to make their thesauri available as Linked Data and facilitated concept searching and browsing. The document outlines how the vocabularies were developed and aligned with legacy data to create unique, persistent concept identifiers and relationships between concepts, datasets, and countries to improve semantic search and data sharing.
De presentatie van Erwin Folmer, tijdens de parallelle sessie 'Randvoorwaarden voor data gedreven beleid en politiek' van het congres 'Data gedreven Beleidsontwikkeling' in Den Haag op 28 november 2017.
SMART-GS: A Tool for Studying Digitized Historical ManuscriptsYuta Hashimoto
SMART-GS is an open source transcription and annotation tool developed at Kyoto University for studying digitized historical manuscripts. It allows users to mark up images, add text transcriptions and annotations, and link related markups. The tool has been used in projects transcribing historical diaries and notebooks. It supports group work through a shared online platform. The developers hope to make SMART-GS a global collaboration platform to support international projects digitizing and analyzing Japanese classical texts.
Geo-linked data: towards deep integration of location in the web of dataAndrew Woolf
This document discusses linking location data on the web. It notes that most data has a spatial or location component, and that linking this location data can provide valuable insights. It provides examples of linked location data from various organizations, including the UK Ordnance Survey and Bureau of Meteorology in Australia. It also describes the benefits of following linked data principles and using standards like GeoSPARQL to query linked geospatial data. Finally, it outlines efforts in Australia to develop best practices for publishing government data as linked open data, including the formation of an inter-agency working group.
WWW09 - Triplify Light-Weight Linked Data Publication from Relational DatabasesSören Auer
Triplify is a tool that publishes semantic data from relational databases on the web as Linked Data. It works by mapping SQL queries to RDF representations. The SQL queries select structured data from databases behind existing web applications. Triplify then converts the query results into RDF triples. This exposes the semantics behind web applications and makes the data accessible to semantic search engines and applications. Triplify aims to overcome the lack of semantic data on the web by leveraging existing relational data sources.
This document discusses linking data openly about the ancient world. It begins by introducing the Linked Ancient World Data Institute and some related projects like Pleiades and Pelagios that link place-based data. It notes that annotating data, rather than unifying models, allows linking different datasets. Examples are given of scholarly projects and museums that are exploring relations between places and data through open linked data approaches. The document suggests this represents a paradigm shift in scholarship as it allows bringing different resources together to explore connections and interpretations in new ways.
Towards a graph of ancient world geographical knowledgeElton Barker
Presentation on three collaborative projects: Hestia (http://hestia.open.ac.uk/), GAP (http://googleancientplaces.wordpress.com/gapvis/), and Pelagios (pelagios-project.blogspot.com)
Towards Semi-Automatic Annotation of Toponyms on Old Mapsaboutgeo
This document discusses a semi-automatic process for detecting toponyms, or place names, on old maps. The approach uses image processing and connected component analysis to determine the location, extent, and orientation of features. It is a 3-step process involving background/foreground separation, feature detection, and feature linking. Some preliminary experiments detecting toponyms on maps from the 15th-18th centuries showed recall rates of around 50% and precision rates of around 31%. Future work aims to develop annotation tools to help index toponyms across collections of early geospatial documents.
Reflections from the Pelagios Commons by Leif Isaksen, Lancaster University. http://commons.pelagios.org. Presentation at the 1st Lancaster Data Conversations 30 January 2017
The document discusses the GeoChronos project, which uses the Elgg platform to facilitate collaboration between earth observation scientists. Key features of the GeoChronos portal include interactive application services that allow scientists to access tools remotely, and spectral libraries that help scientists share and annotate spectral data. Elgg provides profiles, groups, forums and other features, and custom plugins were developed for applications, libraries, and other scientific needs. The portal has been successfully used for collaboration and teaching.
This paper describes the creation of linked data for cultural heritage domain, using semantic technologies. The Gothenburg city museum data are described according to an ontological model combining a series of upper-level and domain specific ontologies, such as PROTON and CIDOC-CRM, triplified and interlinked with data from LOD, e.g. DBpedia. The implementation is done as a reason-able view of the web of data and the data are loaded in OWLIM semantic repositoyr.
The document discusses aggregation as an intervention tactic to improve discoverability of online content. It argues that early web approaches focused on human accessibility but hid complexity, while aggregation can expose relationships and make content more understandable and findable by machines. Done strategically with purposes of engagement, value-adding, and enhancing discoverability through promiscuous metadata, aggregation can help unlock online riches.
This document summarizes a talk about whether repositories are disruptive or disrupted technology. It discusses how repositories have evolved from focusing on individual institutions to being more distributed, collaborative, web-oriented and interoperable. It argues repositories should be integrated into broader information infrastructure and expose digital content for reuse on the web using standards like OAI-ORE. The document also summarizes the history of the Fedora repository project's approaches to interoperability and its plans to adopt more common web APIs and connect backend storage to cloud services. The goal is to make repositories more open and integrated parts of the global networked environment rather than closed local systems.
Repositories are systems mainly used to store and publish academic contents. This presentation discusses why repositories contents should be published as Linked (Open) Data and how repositories can be extended to do so.
Talk at 3th Keystone Training School - Keyword Search in Big Linked Data - Institute for Software Technology and Interactive Systems, TU Wien, Austria, 2017
Pelagios @ the 2019 meeting of the Society for Classical StudiesElton Barker
This document discusses semantic annotation using Recogito and Pelagios. It provides an overview of what Pelagios is for linking data and gazetteer URIs. It describes how Recogito can be used to annotate texts, images, and documents by identifying entities and linking them to authoritative gazetteers. The workshop also showed how annotation allows users to map texts, compare texts, map maps, and link additional data resources to facilitate connecting historical information through shared places.
The common use by archaeologists of ubiquitous technologies such as computers and digital cameras means that archaeological research projects now produce huge amounts of diverse, digital documentation. However, while the technology is available to collect this documentation, we still largely lack community accepted dissemination channels appropriate for such torrents of data. Open Context (http://www.opencontext.org) aims to help fill this gap by providing open access data publication services for archaeology. Open Context has a flexible and generalized technical architecture that can accommodate most archaeological datasets, despite the lack of common recording systems or other documentation standards. Open Context includes a variety of tools to make data dissemination easier and more worthwhile. Authorship is clearly identified through citation tools, a web-based publication systems enables individuals upload their own data for review, and collaboration is facilitated through easy download and other features. While we have demonstrated a potentially valuable approach for data sharing, we face significant challenges in scaling Open Context up for serving large quantities of data from multiple projects.
The document discusses a proposed Grid-based digital library management system. It describes how digital libraries have evolved beyond digitized versions of classical libraries to become complex data management systems. A Grid-enabled digital library would distribute storage, processing, and access across computing resources. This could help address challenges of large volumes of content, concurrent access, and computationally intensive tasks like automatic indexing. The authors outline their ontology-based model and prototype Grid-enabled digital library implementation that distributes search tasks for more efficient processing.
The document discusses the implementation of a digital library management system using grid computing technologies. It proposes mapping digital library functions to grid services to enable distributed storage, parallel processing, and user management capabilities. The authors describe developing an ontology-based model and pilot digital library application. Experimental results show that distributing search tasks across multiple grid nodes can significantly reduce execution time compared to a single node. The document concludes digital libraries must include powerful search tools and grid infrastructures can help implement efficient parallel processing functions.
The document summarizes the European Archival Records and Knowledge (E-ARK) project, which developed an OAIS-compliant system for fast creation, search, and access of archival information packages. It describes the key components and functionality of the E-ARK reference implementation, including tools for ingest, archival storage, data management, access, and data mining of archived content. Current pilots of the E-ARK system are being used by several national archives for large-scale archiving and access of records.
2013 04-29 american art collaborative lod meeting - washington dc - weblecmaj
The document discusses Lec Maj's presentation on linked open data, end-users, and applications at the American Art Collaborative Linked Open Data Meeting. It provides details on Maj's role at the Yale Center for British Art and the projects they have been working on to create linked open data including connecting data between the YCBA and other institutions like the British Museum. It also showcases some open applications that have been created using the YCBA linked open data like an ecard app and an exhibit objects app.
Similar to Linked Geo-Data and Early Geospatial Documents (20)
Vom Wort zum Ort & Wieder Zurück - Pelagios @DHd 2014aboutgeo
Slides for my talk "Vom Wort zu Ort & Wieder Zurück" - Geographische Verknüpfung von altertumswissenschaftlichen Daten im Rahmen von PELAGIOS" for GeoHumanities Workshop @ DHd 2014, Passau, March 25, 2014 (German)
JCDL 2011: Semantically Augmented Annotations in Digitized Map Collectionsaboutgeo
Historic maps are valuable scholarly resources that record information often retained by no other written source. With the YUMA Map Annotation Tool we want to facilitate collaborative annotation for scholars studying historic maps, and allow for semantic augmentation of annotations with structured, contextually relevant information retrieved from Linked Open Data sources. We believe that the integration of Web resource linkage into the scholarly annotation process is not only relevant for collaborative research, but can also be exploited to improve search and retrieval. In this paper, we introduce the COMPASS Experiment, an ongoing crowdsourcing effort in which we are collecting data that can serve as a basis for evaluating our assumption. We discuss the scope and setup of the experiment framework and
report on lessons learned from the data collected so far.
The document discusses using Linked Data and semantic annotation to annotate and geotag digital cultural heritage resources. It describes Europeana and the EuropeanaConnect project, which aims to develop tools like YUMA for media annotation. YUMA allows semantic annotation using Linked Data principles and can georeference maps. This enhances search capabilities and makes the annotations accessible to others.
MagickTiler is a Java tool and library for converting images into tile formats suitable for zoomable web images. It supports various tiling schemes like TMS, Zoomify, Google Maps, and generates tilesets and previews in formats like JPEG and PNG. The tool can be run from the command line to generate tilesets from images.
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc
How does your privacy program stack up against your peers? What challenges are privacy teams tackling and prioritizing in 2024?
In the fifth annual Global Privacy Benchmarks Survey, we asked over 1,800 global privacy professionals and business executives to share their perspectives on the current state of privacy inside and outside of their organizations. This year’s report focused on emerging areas of importance for privacy and compliance professionals, including considerations and implications of Artificial Intelligence (AI) technologies, building brand trust, and different approaches for achieving higher privacy competence scores.
See how organizational priorities and strategic approaches to data security and privacy are evolving around the globe.
This webinar will review:
- The top 10 privacy insights from the fifth annual Global Privacy Benchmarks Survey
- The top challenges for privacy leaders, practitioners, and organizations in 2024
- Key themes to consider in developing and maintaining your privacy program
Monitoring and Managing Anomaly Detection on OpenShift.pdfTosin Akinosho
Monitoring and Managing Anomaly Detection on OpenShift
Overview
Dive into the world of anomaly detection on edge devices with our comprehensive hands-on tutorial. This SlideShare presentation will guide you through the entire process, from data collection and model training to edge deployment and real-time monitoring. Perfect for those looking to implement robust anomaly detection systems on resource-constrained IoT/edge devices.
Key Topics Covered
1. Introduction to Anomaly Detection
- Understand the fundamentals of anomaly detection and its importance in identifying unusual behavior or failures in systems.
2. Understanding Edge (IoT)
- Learn about edge computing and IoT, and how they enable real-time data processing and decision-making at the source.
3. What is ArgoCD?
- Discover ArgoCD, a declarative, GitOps continuous delivery tool for Kubernetes, and its role in deploying applications on edge devices.
4. Deployment Using ArgoCD for Edge Devices
- Step-by-step guide on deploying anomaly detection models on edge devices using ArgoCD.
5. Introduction to Apache Kafka and S3
- Explore Apache Kafka for real-time data streaming and Amazon S3 for scalable storage solutions.
6. Viewing Kafka Messages in the Data Lake
- Learn how to view and analyze Kafka messages stored in a data lake for better insights.
7. What is Prometheus?
- Get to know Prometheus, an open-source monitoring and alerting toolkit, and its application in monitoring edge devices.
8. Monitoring Application Metrics with Prometheus
- Detailed instructions on setting up Prometheus to monitor the performance and health of your anomaly detection system.
9. What is Camel K?
- Introduction to Camel K, a lightweight integration framework built on Apache Camel, designed for Kubernetes.
10. Configuring Camel K Integrations for Data Pipelines
- Learn how to configure Camel K for seamless data pipeline integrations in your anomaly detection workflow.
11. What is a Jupyter Notebook?
- Overview of Jupyter Notebooks, an open-source web application for creating and sharing documents with live code, equations, visualizations, and narrative text.
12. Jupyter Notebooks with Code Examples
- Hands-on examples and code snippets in Jupyter Notebooks to help you implement and test anomaly detection models.
Full-RAG: A modern architecture for hyper-personalizationZilliz
Mike Del Balso, CEO & Co-Founder at Tecton, presents "Full RAG," a novel approach to AI recommendation systems, aiming to push beyond the limitations of traditional models through a deep integration of contextual insights and real-time data, leveraging the Retrieval-Augmented Generation architecture. This talk will outline Full RAG's potential to significantly enhance personalization, address engineering challenges such as data management and model training, and introduce data enrichment with reranking as a key solution. Attendees will gain crucial insights into the importance of hyperpersonalization in AI, the capabilities of Full RAG for advanced personalization, and strategies for managing complex data integrations for deploying cutting-edge AI solutions.
Building Production Ready Search Pipelines with Spark and MilvusZilliz
Spark is the widely used ETL tool for processing, indexing and ingesting data to serving stack for search. Milvus is the production-ready open-source vector database. In this talk we will show how to use Spark to process unstructured data to extract vector representations, and push the vectors to Milvus vector database for search serving.
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Speck&Tech
ABSTRACT: A prima vista, un mattoncino Lego e la backdoor XZ potrebbero avere in comune il fatto di essere entrambi blocchi di costruzione, o dipendenze di progetti creativi e software. La realtà è che un mattoncino Lego e il caso della backdoor XZ hanno molto di più di tutto ciò in comune.
Partecipate alla presentazione per immergervi in una storia di interoperabilità, standard e formati aperti, per poi discutere del ruolo importante che i contributori hanno in una comunità open source sostenibile.
BIO: Sostenitrice del software libero e dei formati standard e aperti. È stata un membro attivo dei progetti Fedora e openSUSE e ha co-fondato l'Associazione LibreItalia dove è stata coinvolta in diversi eventi, migrazioni e formazione relativi a LibreOffice. In precedenza ha lavorato a migrazioni e corsi di formazione su LibreOffice per diverse amministrazioni pubbliche e privati. Da gennaio 2020 lavora in SUSE come Software Release Engineer per Uyuni e SUSE Manager e quando non segue la sua passione per i computer e per Geeko coltiva la sua curiosità per l'astronomia (da cui deriva il suo nickname deneb_alpha).
Essentials of Automations: The Art of Triggers and Actions in FMESafe Software
In this second installment of our Essentials of Automations webinar series, we’ll explore the landscape of triggers and actions, guiding you through the nuances of authoring and adapting workspaces for seamless automations. Gain an understanding of the full spectrum of triggers and actions available in FME, empowering you to enhance your workspaces for efficient automation.
We’ll kick things off by showcasing the most commonly used event-based triggers, introducing you to various automation workflows like manual triggers, schedules, directory watchers, and more. Plus, see how these elements play out in real scenarios.
Whether you’re tweaking your current setup or building from the ground up, this session will arm you with the tools and insights needed to transform your FME usage into a powerhouse of productivity. Join us to discover effective strategies that simplify complex processes, enhancing your productivity and transforming your data management practices with FME. Let’s turn complexity into clarity and make your workspaces work wonders!
For the full video of this presentation, please visit: https://www.edge-ai-vision.com/2024/06/building-and-scaling-ai-applications-with-the-nx-ai-manager-a-presentation-from-network-optix/
Robin van Emden, Senior Director of Data Science at Network Optix, presents the “Building and Scaling AI Applications with the Nx AI Manager,” tutorial at the May 2024 Embedded Vision Summit.
In this presentation, van Emden covers the basics of scaling edge AI solutions using the Nx tool kit. He emphasizes the process of developing AI models and deploying them globally. He also showcases the conversion of AI models and the creation of effective edge AI pipelines, with a focus on pre-processing, model conversion, selecting the appropriate inference engine for the target hardware and post-processing.
van Emden shows how Nx can simplify the developer’s life and facilitate a rapid transition from concept to production-ready applications.He provides valuable insights into developing scalable and efficient edge AI solutions, with a strong focus on practical implementation.
Ivanti’s Patch Tuesday breakdown goes beyond patching your applications and brings you the intelligence and guidance needed to prioritize where to focus your attention first. Catch early analysis on our Ivanti blog, then join industry expert Chris Goettl for the Patch Tuesday Webinar Event. There we’ll do a deep dive into each of the bulletins and give guidance on the risks associated with the newly-identified vulnerabilities.
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfMalak Abu Hammad
Discover how MongoDB Atlas and vector search technology can revolutionize your application's search capabilities. This comprehensive presentation covers:
* What is Vector Search?
* Importance and benefits of vector search
* Practical use cases across various industries
* Step-by-step implementation guide
* Live demos with code snippets
* Enhancing LLM capabilities with vector search
* Best practices and optimization strategies
Perfect for developers, AI enthusiasts, and tech leaders. Learn how to leverage MongoDB Atlas to deliver highly relevant, context-aware search results, transforming your data retrieval process. Stay ahead in tech innovation and maximize the potential of your applications.
#MongoDB #VectorSearch #AI #SemanticSearch #TechInnovation #DataScience #LLM #MachineLearning #SearchTechnology
Infrastructure Challenges in Scaling RAG with Custom AI modelsZilliz
Building Retrieval-Augmented Generation (RAG) systems with open-source and custom AI models is a complex task. This talk explores the challenges in productionizing RAG systems, including retrieval performance, response synthesis, and evaluation. We’ll discuss how to leverage open-source models like text embeddings, language models, and custom fine-tuned models to enhance RAG performance. Additionally, we’ll cover how BentoML can help orchestrate and scale these AI components efficiently, ensuring seamless deployment and management of RAG systems in the cloud.
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
Best 20 SEO Techniques To Improve Website Visibility In SERPPixlogix Infotech
Boost your website's visibility with proven SEO techniques! Our latest blog dives into essential strategies to enhance your online presence, increase traffic, and rank higher on search engines. From keyword optimization to quality content creation, learn how to make your site stand out in the crowded digital landscape. Discover actionable tips and expert insights to elevate your SEO game.
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxSitimaJohn
Ocean Lotus cyber threat actors represent a sophisticated, persistent, and politically motivated group that poses a significant risk to organizations and individuals in the Southeast Asian region. Their continuous evolution and adaptability underscore the need for robust cybersecurity measures and international cooperation to identify and mitigate the threats posed by such advanced persistent threat groups.
Generating privacy-protected synthetic data using Secludy and MilvusZilliz
During this demo, the founders of Secludy will demonstrate how their system utilizes Milvus to store and manipulate embeddings for generating privacy-protected synthetic data. Their approach not only maintains the confidentiality of the original data but also enhances the utility and scalability of LLMs under privacy constraints. Attendees, including machine learning engineers, data scientists, and data managers, will witness first-hand how Secludy's integration with Milvus empowers organizations to harness the power of LLMs securely and efficiently.
In the rapidly evolving landscape of technologies, XML continues to play a vital role in structuring, storing, and transporting data across diverse systems. The recent advancements in artificial intelligence (AI) present new methodologies for enhancing XML development workflows, introducing efficiency, automation, and intelligent capabilities. This presentation will outline the scope and perspective of utilizing AI in XML development. The potential benefits and the possible pitfalls will be highlighted, providing a balanced view of the subject.
We will explore the capabilities of AI in understanding XML markup languages and autonomously creating structured XML content. Additionally, we will examine the capacity of AI to enrich plain text with appropriate XML markup. Practical examples and methodological guidelines will be provided to elucidate how AI can be effectively prompted to interpret and generate accurate XML markup.
Further emphasis will be placed on the role of AI in developing XSLT, or schemas such as XSD and Schematron. We will address the techniques and strategies adopted to create prompts for generating code, explaining code, or refactoring the code, and the results achieved.
The discussion will extend to how AI can be used to transform XML content. In particular, the focus will be on the use of AI XPath extension functions in XSLT, Schematron, Schematron Quick Fixes, or for XML content refactoring.
The presentation aims to deliver a comprehensive overview of AI usage in XML development, providing attendees with the necessary knowledge to make informed decisions. Whether you’re at the early stages of adopting AI or considering integrating it in advanced XML development, this presentation will cover all levels of expertise.
By highlighting the potential advantages and challenges of integrating AI with XML development tools and languages, the presentation seeks to inspire thoughtful conversation around the future of XML development. We’ll not only delve into the technical aspects of AI-powered XML development but also discuss practical implications and possible future directions.
1. PELAGIOS
Linked Geo-Data & Early Geospatial Documents
Rainer Simon, Austrian Institute of Technology
Elton Barker, The Open University
http://pelagios-project.blogspot.co.uk
@pelagiosproject
Historical GIS Workshop | 15. 11. 2013, Berlin
2. Pelagios | Linking Data, Openly
Inscriptions
Texts
Museum
Objects
Connecting Ancient World Research Resources
through the Places they refer to
Archaeological
Sites
…or any other online resource that bears
a relation to a particular ancient place!
Archaeological
Finds
2
3.
4.
5.
6. One Ring to Rule them All | Non-Goals
Data aggregation
Federated search
Standardization of data representation
Schema alignment & interoperability on a metadata level
Connectivity through common references rather than a common schema!
6
11. Data | Annotations as Standoff Markup
Publish-able as dump files (no triple store required)
De-coupled from your data infrastructure – no impact on
schemas, conceptual models, technology
Pelagios data license is independent from the data itself – only
annotations need to be CC
11
12. Pelagios 3 | (Partly) Future Work
Early Geospatial Documents
Andrew W. Mellon Foundation Funding
September 2013 – August 2015
Pelagios Approach extended to
maps and geographic texts up to 1492
Latin, Greek & Byzantine, European
Medieval, European Maritime, Islamic,
Chinese
12
13. Pelagios 3 | Technology Topics
Aligning Gazetteers
Linking based on “connectivity through common references”
Using GeoNames and Wikidata as shared reference system
Automating Annotation
Geo-Parsing (Text -> Toponyms)
Geo-Resolution (Toponyms -> Gazetteer IDs)
Image processing to identify toponyms on maps scans
Workflows & tools for expert correction
Visualizing Pelagios Data
Visualizing the “Pelagios Network”
Publishing annotated materials (where possible)
14. Pelagios 3 | Open Source
Pelagios uses Open Source – e.g.
Stanford NLP Toolkit and Edina Unlock Geoparser
Lucene index engine
Neo4j graph database
OpenCV computer vision library
Leaflet, D3 et al.
Pelagios develops Open Source – e.g.
Scalagios
Pelagios Data Toolbelt
API & Visualization Workbench
http://github.com/pelagios