The document summarizes Dr. Timo Borst's presentation on improving library services with semantic web technology, specifically regarding repository systems.
[1] Current library applications generate and manage siloed metadata collections, making aggregation and standardization of metadata difficult for expanded services.
[2] Semantic web services as part of an overarching authority data infrastructure can help standardize metadata early in the process while still allowing for local differences.
[3] If these services are widely adopted and used, there is potential for greater networking of locally managed metadata while improving data-driven services through lightweight integration into existing library applications.
Presentation for the OCLC Linked Data Roundtable event for IFLA Helsinki 2012. Covers the reasoning behind the BL's linked open data version of the British National Bibliography, the processes needed to create the service and challenges to be addressed.
Making the most of metadata Feb 2014 - BNB Linked Data Updatenw13
Presentation given at the 'Making the Most of Metadata' BL Labs event at the British Library, London in February 2014. Provides an update on the BNB LOD service.
Presented at the International Internet Preservation Consortium (IIPC) Web Archiving Week, University of London, 16 June 2017.
Web archiving has become imperative to ensure that our digital heritage does not disappear forever, yet many institutions have not begun this work. In addition, archived websites are not easily discoverable, which severely limits their use. To address this challenge, OCLC Research has established the OCLC Research Library Partnership Web Archiving Metadata Working Group to develop a data dictionary that will be compatible with library and archives standards. Three reports on this project are available in July 2017, focused on metadata best practices guidelines, user needs and behaviors, and evaluation of web archiving tools.
More information: oc.lc/wam
Contact: Jackie Dooley, dooleyj@oclc.org
Bibliographic Infrastructure for Shared Print ManagementConstance Malpas
Slides from ALCTS pre-conference on Shared Print Management, 5 June 2012. Outlines strategy behind OCLC Print Archives Disclosure Pilot project. (First part of session; second half was by Lizanne Payne, on detailed metadata guidelines.)
Revolutionary and Evolutionary Innovation - Marshall Breeding CONUL Conference
Presented at the CONUL Conference, July 2015, Athlone, Ireland by Marshall Breeding.
Biography
Marshall Breeding is an independent consultant, speaker, and author. He is the creator and editor of Library Technology Guides and the libraries.org online directory of libraries on the Web. His monthly column Systems Librarian appears in Computers in Libraries; he is the Editor for Smart Libraries Newsletter published by the American Library Association, and has authored the annual Library Systems Report published by Library Journal from 2002-2013 and by American Libraries since 2014. He has authored nine issues of ALA’s Library Technology Reports, and has written many other articles and book chapters. Marshall has edited or authored seven books, including Cloud Computing for Libraries published by in 2012 by Neal-Schuman, now part of ALA TechSource. He regularly teaches workshops and gives presentations at library conferences on a wide range of topics.
He has been an invited speaker for many library conferences and workshops throughout the United States and internationally. He has spoken in throughout the United States and in Korea, Taiwan, Thailand, China, Singapore, India, Japan, Australia, New Zealand, Iceland, the Czech Republic, Slovenia, Israel, Austria, Germany, The Netherlands, Norway, Denmark, Sweden, Spain, Ireland, the United Kingdom, Israel, Lebanon, Jordan, Colombia, Chile, Mexico, and Argentina.
Marshall Breeding held a variety of positions for the Vanderbilt University Libraries in Nashville, TN from 1985 through May 2012, including as Director for Innovative Technologies and Research as the Executive Director the Vanderbilt Television News Archive.
Breeding was the 2010 recipient of the LITA LITA/Library Hi Tech Award for Outstanding Communication for Continuing Education in Library and Information Science.
Read his Guideposts blog on Library Technology Guides at:
www.librarytechnology.org
Presentation for the OCLC Linked Data Roundtable event for IFLA Helsinki 2012. Covers the reasoning behind the BL's linked open data version of the British National Bibliography, the processes needed to create the service and challenges to be addressed.
Making the most of metadata Feb 2014 - BNB Linked Data Updatenw13
Presentation given at the 'Making the Most of Metadata' BL Labs event at the British Library, London in February 2014. Provides an update on the BNB LOD service.
Presented at the International Internet Preservation Consortium (IIPC) Web Archiving Week, University of London, 16 June 2017.
Web archiving has become imperative to ensure that our digital heritage does not disappear forever, yet many institutions have not begun this work. In addition, archived websites are not easily discoverable, which severely limits their use. To address this challenge, OCLC Research has established the OCLC Research Library Partnership Web Archiving Metadata Working Group to develop a data dictionary that will be compatible with library and archives standards. Three reports on this project are available in July 2017, focused on metadata best practices guidelines, user needs and behaviors, and evaluation of web archiving tools.
More information: oc.lc/wam
Contact: Jackie Dooley, dooleyj@oclc.org
Bibliographic Infrastructure for Shared Print ManagementConstance Malpas
Slides from ALCTS pre-conference on Shared Print Management, 5 June 2012. Outlines strategy behind OCLC Print Archives Disclosure Pilot project. (First part of session; second half was by Lizanne Payne, on detailed metadata guidelines.)
Revolutionary and Evolutionary Innovation - Marshall Breeding CONUL Conference
Presented at the CONUL Conference, July 2015, Athlone, Ireland by Marshall Breeding.
Biography
Marshall Breeding is an independent consultant, speaker, and author. He is the creator and editor of Library Technology Guides and the libraries.org online directory of libraries on the Web. His monthly column Systems Librarian appears in Computers in Libraries; he is the Editor for Smart Libraries Newsletter published by the American Library Association, and has authored the annual Library Systems Report published by Library Journal from 2002-2013 and by American Libraries since 2014. He has authored nine issues of ALA’s Library Technology Reports, and has written many other articles and book chapters. Marshall has edited or authored seven books, including Cloud Computing for Libraries published by in 2012 by Neal-Schuman, now part of ALA TechSource. He regularly teaches workshops and gives presentations at library conferences on a wide range of topics.
He has been an invited speaker for many library conferences and workshops throughout the United States and internationally. He has spoken in throughout the United States and in Korea, Taiwan, Thailand, China, Singapore, India, Japan, Australia, New Zealand, Iceland, the Czech Republic, Slovenia, Israel, Austria, Germany, The Netherlands, Norway, Denmark, Sweden, Spain, Ireland, the United Kingdom, Israel, Lebanon, Jordan, Colombia, Chile, Mexico, and Argentina.
Marshall Breeding held a variety of positions for the Vanderbilt University Libraries in Nashville, TN from 1985 through May 2012, including as Director for Innovative Technologies and Research as the Executive Director the Vanderbilt Television News Archive.
Breeding was the 2010 recipient of the LITA LITA/Library Hi Tech Award for Outstanding Communication for Continuing Education in Library and Information Science.
Read his Guideposts blog on Library Technology Guides at:
www.librarytechnology.org
The network reshapes the research library collectionlisld
The library collection has been central to library identity and service, however we are now seeing major changes in how libraries help discover, curate and create collections. This is a response to evolving research and learning behaviors in a network environment. This presentation considers trends which are influencing how we think about library curatorial activities and are reshaping their collections. The first direction is the ‘inside-out library’ which is a response to the reorganization of research work by the digital environment. The second is the facilitated collections, which is a response to the reorganization of the information space by the network. The presentation discusses three ways in which we are thinking differently about collections: the inside out collection, the facilitated collection, and the collective collection.
Repositories are systems to safely store and publish digital objects and their descriptive metadata. Repositories mainly serve their data by using web interfaces which are primarily oriented towards human consumption. They either hide their data behind non-generic interfaces or do not publish them at all in a way a computer can process easily. At the same time the data stored in repositories are particularly suited to be used in the Semantic Web as metadata are already available. They do not have to be generated or entered manually for publication as Linked Data. In my talk I will present a concept of how metadata and digital objects stored in repositories can be woven into the Linked (Open) Data Cloud and which characteristics of repositories have to be considered while doing so. One problem it targets is the use of existing metadata to present Linked Data. The concept can be applied to almost every repository software. At the end of my talk I will present an implementation for DSpace, one of the software solutions for repositories most widely used. With this implementation every institution using DSpace should become able to export their repository content as Linked Data.
Going, going, gone - Can legal deposit save us from the digital black hole? -...CONUL Conference
Presented at the CONUL Conference, July 2015, Athlone, Ireland by Margaret Flood, Arlene Healy, Trinity College Dublin.
Abstract
The internet has evolved beyond recognition since its advent in 1980s; fundamentally changing the way we live, work and communicate. However its pervasiveness is mirrored by the transient nature of much of the content and the consequent loss of collective memory has been described as the digital black hole. Historically nations have relied on national libraries and other legal deposit libraries, to collect preserve and provide ongoing access to the intellectual, cultural and social outputs of their country, and in an increasingly digital world restricting legal deposit to publications in print has put the national record at risk. Over the last decade countries across the world have extended legal deposit provisions in their legislation to cover non-print formats. This presentation focuses on the experience of the UK, as a case study, from new legislation in 2003 through the experience of implementation in 2013 to where we are today. Challenges, viewed through the lens of an academic library, include defining what is national in a digital world; balancing the interests of multiple stakeholders; technical challenges to implement robust collection, preservation and access systems within legal constraints; dealing with multiple and rapidly evolving formats; the sheer scale and cost of collecting and preserving content and providing ongoing access to it. Two years on from UK implementation of the legislation how successful have the legal deposit libraries been in this endeavour, what does the future look like and what lessons might be applicable to the Irish digital environment?
Biography
"Margaret Flood heads the Collection Management Division of Trinity College Library. She has been actively engaged with the British Library and UK legal deposit libraries since 2003 in the planning to bring non-print legal deposit from legislation to implementation and ultimately business as usual. She represents TCD on a number of key committees including the Legal Deposit Implementation Group and Joint Committee for Legal Deposit which draws its representation from the publishing and library communities. She chairs the TCD internal Steering Group responsible for coordination of the implementation of UK Non-Print Legal Deposit within TCD. Margaret also chairs the CONUL Regulatory Affairs Sub-Committee which includes legal deposit in its remit. On behalf of CONUL the Sub-Committee responded to public the two public consultations initiated by the Copyright Review Committee including detailed submissions on the urgency of legislating for digital legal deposit for Ireland
Arlene Healy is Sub-librarian of the Digital Systems and Services (Readers’ Services Division) in Trinity College Library, Dublin, where she is a member of the Leadership Team. In her role she provides strategic leadership for digital services and
This presentation was delivered by Rebekah Cummings of the University of Utah during a NISO Virtual Conference on the topic of data curation, held on Wednesday, August 31, 2016
This presentation gives a brief overview on achievements and challenges of the Data Web and describes different aspects of using the Semantic Data Wiki OntoWiki for Linked Data management.
DBpedia - An Interlinking Hub in the Web of DataChris Bizer
Given and overview about the DBpedia project and the role of DBpedia in the Web of Data and outlines the next steps from the Dbpedia project as well as ideas for using DBpedia data within the BBC.
Data mining OCLC for translations.
Creating authority records for VIAF.
Remodelling the bibliorgraphic structure to make the best mutli-lingual displays from all available data in a work set.
Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream ...Sandra McIntyre
Webinar from the Mountain West Digital Library
Sandra McIntyre, MWDL Director
Anna Neatrour, MWDL Digital Metadata Librarian
Want to understand what happens behind the scenes with the MWDL harvesting? In this webinar, Sandra McIntyre and Anna Neatrour will explain the Open Archives Initiative Protocol for Metadata Harvestiong (OAI-PMH) and how it makes metadata aggregation possible in the MWDL. They will explain the process of harvesting and how MWDL normalizes your metadata. They will also show you how you can learn from your collections' OAI stream by using the six query verbs (requests) defined in the OAI-PMH.
Open Data management is still not trivial nor sustainable - COMSODE results are here to bring automation to publication and management of Open Data in public institutions and companies. Presentation includes Open Data Ready standard proposal, three use cases and invitation for Horizon 2020 projects 2016.
From local infrastructure to engagement - thinking about the library in the l...lisld
Libraries are rebalancing services and directions so that they are more active in the lives of their users. This presentation frames this discussion. It looks at shifts in user behaviours, collections, and spaces, and describes how OCLC Reseach is helping libraries make these transitions.
This presentation was given at the Minitex ILL Meeting in St Paul on 12 May 2015.
Repositories are systems mainly used to store and publish academic contents. This presentation discusses why repositories contents should be published as Linked (Open) Data and how repositories can be extended to do so.
The network reshapes the research library collectionlisld
The library collection has been central to library identity and service, however we are now seeing major changes in how libraries help discover, curate and create collections. This is a response to evolving research and learning behaviors in a network environment. This presentation considers trends which are influencing how we think about library curatorial activities and are reshaping their collections. The first direction is the ‘inside-out library’ which is a response to the reorganization of research work by the digital environment. The second is the facilitated collections, which is a response to the reorganization of the information space by the network. The presentation discusses three ways in which we are thinking differently about collections: the inside out collection, the facilitated collection, and the collective collection.
Repositories are systems to safely store and publish digital objects and their descriptive metadata. Repositories mainly serve their data by using web interfaces which are primarily oriented towards human consumption. They either hide their data behind non-generic interfaces or do not publish them at all in a way a computer can process easily. At the same time the data stored in repositories are particularly suited to be used in the Semantic Web as metadata are already available. They do not have to be generated or entered manually for publication as Linked Data. In my talk I will present a concept of how metadata and digital objects stored in repositories can be woven into the Linked (Open) Data Cloud and which characteristics of repositories have to be considered while doing so. One problem it targets is the use of existing metadata to present Linked Data. The concept can be applied to almost every repository software. At the end of my talk I will present an implementation for DSpace, one of the software solutions for repositories most widely used. With this implementation every institution using DSpace should become able to export their repository content as Linked Data.
Going, going, gone - Can legal deposit save us from the digital black hole? -...CONUL Conference
Presented at the CONUL Conference, July 2015, Athlone, Ireland by Margaret Flood, Arlene Healy, Trinity College Dublin.
Abstract
The internet has evolved beyond recognition since its advent in 1980s; fundamentally changing the way we live, work and communicate. However its pervasiveness is mirrored by the transient nature of much of the content and the consequent loss of collective memory has been described as the digital black hole. Historically nations have relied on national libraries and other legal deposit libraries, to collect preserve and provide ongoing access to the intellectual, cultural and social outputs of their country, and in an increasingly digital world restricting legal deposit to publications in print has put the national record at risk. Over the last decade countries across the world have extended legal deposit provisions in their legislation to cover non-print formats. This presentation focuses on the experience of the UK, as a case study, from new legislation in 2003 through the experience of implementation in 2013 to where we are today. Challenges, viewed through the lens of an academic library, include defining what is national in a digital world; balancing the interests of multiple stakeholders; technical challenges to implement robust collection, preservation and access systems within legal constraints; dealing with multiple and rapidly evolving formats; the sheer scale and cost of collecting and preserving content and providing ongoing access to it. Two years on from UK implementation of the legislation how successful have the legal deposit libraries been in this endeavour, what does the future look like and what lessons might be applicable to the Irish digital environment?
Biography
"Margaret Flood heads the Collection Management Division of Trinity College Library. She has been actively engaged with the British Library and UK legal deposit libraries since 2003 in the planning to bring non-print legal deposit from legislation to implementation and ultimately business as usual. She represents TCD on a number of key committees including the Legal Deposit Implementation Group and Joint Committee for Legal Deposit which draws its representation from the publishing and library communities. She chairs the TCD internal Steering Group responsible for coordination of the implementation of UK Non-Print Legal Deposit within TCD. Margaret also chairs the CONUL Regulatory Affairs Sub-Committee which includes legal deposit in its remit. On behalf of CONUL the Sub-Committee responded to public the two public consultations initiated by the Copyright Review Committee including detailed submissions on the urgency of legislating for digital legal deposit for Ireland
Arlene Healy is Sub-librarian of the Digital Systems and Services (Readers’ Services Division) in Trinity College Library, Dublin, where she is a member of the Leadership Team. In her role she provides strategic leadership for digital services and
This presentation was delivered by Rebekah Cummings of the University of Utah during a NISO Virtual Conference on the topic of data curation, held on Wednesday, August 31, 2016
This presentation gives a brief overview on achievements and challenges of the Data Web and describes different aspects of using the Semantic Data Wiki OntoWiki for Linked Data management.
DBpedia - An Interlinking Hub in the Web of DataChris Bizer
Given and overview about the DBpedia project and the role of DBpedia in the Web of Data and outlines the next steps from the Dbpedia project as well as ideas for using DBpedia data within the BBC.
Data mining OCLC for translations.
Creating authority records for VIAF.
Remodelling the bibliorgraphic structure to make the best mutli-lingual displays from all available data in a work set.
Harvesting Using the Open Archives Initiative Protocol: What Your OAI Stream ...Sandra McIntyre
Webinar from the Mountain West Digital Library
Sandra McIntyre, MWDL Director
Anna Neatrour, MWDL Digital Metadata Librarian
Want to understand what happens behind the scenes with the MWDL harvesting? In this webinar, Sandra McIntyre and Anna Neatrour will explain the Open Archives Initiative Protocol for Metadata Harvestiong (OAI-PMH) and how it makes metadata aggregation possible in the MWDL. They will explain the process of harvesting and how MWDL normalizes your metadata. They will also show you how you can learn from your collections' OAI stream by using the six query verbs (requests) defined in the OAI-PMH.
Open Data management is still not trivial nor sustainable - COMSODE results are here to bring automation to publication and management of Open Data in public institutions and companies. Presentation includes Open Data Ready standard proposal, three use cases and invitation for Horizon 2020 projects 2016.
From local infrastructure to engagement - thinking about the library in the l...lisld
Libraries are rebalancing services and directions so that they are more active in the lives of their users. This presentation frames this discussion. It looks at shifts in user behaviours, collections, and spaces, and describes how OCLC Reseach is helping libraries make these transitions.
This presentation was given at the Minitex ILL Meeting in St Paul on 12 May 2015.
Repositories are systems mainly used to store and publish academic contents. This presentation discusses why repositories contents should be published as Linked (Open) Data and how repositories can be extended to do so.
There are challenges in the contact center that make it necessary to have reports that help reduce the lack of productivity and poor agent performance. The positioning of those reports depends on the audience and the goals.
Cloud web scale discovery services landscape an overviewNikesh Narayanan
Abstract
The impact of Internet and Google like search engines radically influenced the information behavior of Net Generation users. They expect same environment in library services such that all their required information make available in a single set of results through unified search across all the available resources. Libraries have been striving to respond to this challenge for years. Until recently, federated search technology of the past decade was the better attempt in this area to meet these user expectations. But federated search solution is marked by the drawbacks of its slowness as it searches each database on the fly. New Generation cloud based Library Web scale discovery technology is a promising entrant in this landscape. This Paper attempts to provide a comprehensive overview of Library Web Scale Discovery solutions by depicting various facets of Web Scale Discovery solutions such as its importance to Library field, their possible role as the starting point for research, content coverage, and finally analyses the competition at the discovery front by comparing the services of major players. The comparative analysis shows that all the major service providers are extending competitive features and services, but varies in some areas and the adoption choice depends on the concerned library’s preferences and the cost involved.
Implementing web scale discovery services: special reference to Indian Librar...Nikesh Narayanan
Web scale Discovery services arebecoming the widely adopted Information Retrieval solution in libraries across the world to connect its patrons with the relevant information they seek. In lieu with the world trend, Resources Discovery Solution implementation is gathering momentum in Indian libraries also.
Considering the Indian Libraries scenario, this paper attempts to provide an overview of Library Web Scale Discovery solutions, its need in Indian Libraries, important parameters to be considered for evaluation of Discovery Services, essential factors to be considered prior to implementation, stages of implementation and finally some thoughts on post implementation analysis for measuring the success.
Web scale Discovery services are becoming the most sought after solution for Libraries to connect its patrons with the relevant information they seek. Many studies show that these services are getting wide acceptance from users as well as Library staff and making revolution in Library Information retrieval arena. Given such broad implications, selecting a new discovery service for libraries is an important undertaking. Library professionals should carefully evaluate options to meet their goal of finding the best potential match for their library. This Paper attempts to provide a comprehensive overview of Library Web Scale Discovery solutions by depicting various facets of Web Scale Discovery, how it differs from federated searching and highlights the important parameters to be considered for taking an informed and confident decision on selecting discovery service.
This presentation was provided by Marshall Breeding of Library Technology for a NISO webinar, Integrating Library Management Systems, held on June 8, 2016
In the recent past, Resource sharing concept has become prime factor and playing vital role in
libraries because of innovative developments in Information, Communication and Technology
(ICT). ICT has made easy to establish networks among libraries and share their information
resources quickly and instantly. Resource sharing has become prime reason for establishing
cooperation between libraries without any geographical barriers. The various reasons for resource
sharing are might be cost benefits, non-availability of resources, insufficient library funds, lack of
skills etc. In this paper, the attempt has been made to understand the various aspects of resource
sharing in modern library technological environment.
This presentation was provided by Oren Beit-Arie of Ex Libris, Inc. during the NISO event, "Library Resource Management Systems: New Challenges, New Opportunities," held October 8 - 9, 2009.
CLARIAH Toogdag 2018: A distributed network of digital heritage informationEnno Meijers
Slides of my keynote at the CLARIAH Toogdag 2018 on 9 March at the National Library of the Netherlands. The main topics were the development of the distributed digital heritage network and the alignment to and cooperation with the CLARIAH infrastructure and data. It also points at some of the current limitations of the semantic web technology.
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Tobias Schneck
As AI technology is pushing into IT I was wondering myself, as an “infrastructure container kubernetes guy”, how get this fancy AI technology get managed from an infrastructure operational view? Is it possible to apply our lovely cloud native principals as well? What benefit’s both technologies could bring to each other?
Let me take this questions and provide you a short journey through existing deployment models and use cases for AI software. On practical examples, we discuss what cloud/on-premise strategy we may need for applying it to our own infrastructure to get it to work from an enterprise perspective. I want to give an overview about infrastructure requirements and technologies, what could be beneficial or limiting your AI use cases in an enterprise environment. An interactive Demo will give you some insides, what approaches I got already working for real.
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Ramesh Iyer
In today's fast-changing business world, Companies that adapt and embrace new ideas often need help to keep up with the competition. However, fostering a culture of innovation takes much work. It takes vision, leadership and willingness to take risks in the right proportion. Sachin Dev Duggal, co-founder of Builder.ai, has perfected the art of this balance, creating a company culture where creativity and growth are nurtured at each stage.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
The key trends across hardware, cloud and open-source; exploring how these areas are likely to mature and develop over the short and long-term, and then considering how organisations can position themselves to adapt and thrive.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
PHP Frameworks: I want to break free (IPC Berlin 2024)Ralf Eggert
In this presentation, we examine the challenges and limitations of relying too heavily on PHP frameworks in web development. We discuss the history of PHP and its frameworks to understand how this dependence has evolved. The focus will be on providing concrete tips and strategies to reduce reliance on these frameworks, based on real-world examples and practical considerations. The goal is to equip developers with the skills and knowledge to create more flexible and future-proof web applications. We'll explore the importance of maintaining autonomy in a rapidly changing tech landscape and how to make informed decisions in PHP development.
This talk is aimed at encouraging a more independent approach to using PHP frameworks, moving towards a more flexible and future-proof approach to PHP development.
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”.
All of this illustrated with link prediction over knowledge graphs, but the argument is general.
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...UiPathCommunity
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Improving library services with semantic web technology in the realm of repositories
1. Improving Library Services with
Semantic Web Technology
- in the realm of Repository Systems
Dr. Timo Borst
Head of IT Development
German National Library for Economics /
Leibniz-Information Centre Economics
Kiel/Hamburg, Germany
ICDK 2011
14th – 16th February, Gurgaon/India
Die ZBW ist Mitglied der Leibniz-Gemeinschaft
2. Overview
1. Current situation: Distributed (meta-)data management in library
applications
2. Popular approaches towards aggregation and homogeneity of
metadata
3. Our approach: Integration and aggregation of authority values
with Semantic Web technology
a) General idea
b) Use case: Indexing
c) Use case: Retrieving
4. “Lightweight” integration into existing repository systems and
service providers
5. Conclusion
Seite 2
3. Current situation
• The rise of repository systems for academic publishing…
• …has led to a landscape of distributed systems, each of them
holding its own metadata…
• …which is harvested and aggregated by service providers
Seite 3
4. Popular approaches towards aggregation and
homogeneity of metadata
• Normalization in advance (before harvesting) requires
• a mandatory metadata scheme to be applied by the local repositories
• a set of controlled vocabularies (e.g. for publication types)
• an automatic validation of the harvested metadata
• Normalization afterwards (after harvesting) requires
• the definition of a minimum set of metadata fields
• the definition of a basic intermediate metadata scheme for normalizing
the heterogeneous metadata records,
• optionally data cleansing strategies like name disambiguation and
automatic indexing on the basis of thesauri
Both approaches are problematic and reveal ambiguities on the aggregation level !
Seite 4
5. Current situation
• …sounds easy and straight, but implies
severe problems esp. with regard to
ambiguity of
• author names
• subject headings
Seite 5
6. Current situation
„The major difficulty we have found is with DSpace’s handling of
metadata. While we feel that the number of fields in Dublin Core is
adequate for most if not all uses (DCMI Usage Board 2006), we are
troubled by the lack of authority control when completing its fields.
Without some control over uniform titles, authors and subjects
accessing the items in the future will very problematic.“
S. Chabot (http://subjectobject.net/2006/11/09/the-dspace-digital-
repository-a-project-analysis/)
„Neither the standards nor the software unterlying
institutional repositories anticipated performing naming
authority control on widely disparate metadata from
highly unreliable sources.“
D. Salo (http://minds.wisconsin.edu/handle/1793/31735)
Seite 6
7. Our approach: Integration of authority values with
Semantic Web technology
• General idea: “Provide a framework for integrating authority
data, which is both normative and flexible enough to tolerate
local idiosyncrasies on a string level.”
• Approach: Concept modelling based on Semantic Web / SKOS
standards
Seite 7
9. Our approach: Integration of authority values with
Semantic Web technology – Web service
Example queries (for concepts):
http://zbw.eu/beta/stw-ws/suggest?query=finanzkr
…delivers all terms beginning with “finanzkr”
http://zbw.eu/beta/stw-ws/stw-ws-wrapper.php?service=labels&
concept=http://zbw.eu/stw/descriptor/19664-4&lang=en
…delivers all english synonyms of the german “Finanzkrise”
Seite 9
10. Use case: (Self-)Indexing
• One of the most prominent use cases especially for librarians, but also
for scientists and active users not familiar with subject specific
vocabularies
• Main goals:
• Support the process of indexing in order to achieve a classification
of documents which is both coherent and flexible in the sense that
it permits local idiosyncrasies related to authority terms
• Align different vocabularies in the sense that indexing in one
vocabulary is automatically linked to another vocabulary
• Implementation: Extension of the submission interface of our repository by
integrating the terminology web service as an autosuggest function
Seite 10
12. Use case: Retrieving
• To be considered as the most important use case
• Often leading into the classical dilemma of precision and
recall
• Main goal:
• Support the process of retrieving, so users can find the
relevant set of documents
• Implementation: Automatic expansion of the original query with
synonyms, narrower and related terms
Seite 12
19. “Lightweight” integration into existing repository systems
and service providers
Benefits
• „Lightweight“ extension of legacy systems
• Strategy of „least intrusion“: No update or migration needed
• No changes to the core system, only some changes to the data model
may be required:
• Additional column for storing the URI of the authority key
• Export resp. harvesting of the authority as a resource must be able
(->OAI-ORE)
• Other types of library applications suitable for these adaptations:
• catalogues
• portals (e.g. to generate publication lists from an identified author or
thematic issues)
• Any collaborative system with annotation system
Seite 19
20. Zusammenfassung und Fazit
• Bibliotheksanwendungen erzeugen und verwalten jeweils eigene
idiosynkratische Datenbestände.
• Dies erschwert die Pflege, den Austausch, die Aggregation und die
Homogenisierung der (Meta-)Daten für erweiterte Dienste.
• Vorgelagerte Webservices als Teil einer übergreifenden Normdaten-
Infrastruktur können frühzeitig zur Homogenisierung der Metadaten
beitragen (bei gleichzeitiger Lokalisierung).
• Wenn diese Webservices verbreitet entstehen und genutzt werden,
besteht die Chance zu einer weitergehenden Vernetzung lokal
gepflegter Metadaten bei gleichzeitiger Verbesserung der
datenbasierten Services.
• Die Möglichkeit zur „leichtgewichtigen Integration“ ist ein Angebot an
Betreiber von Bibliotheksanwendungen, diese Webservices mit
möglichst minimalem Aufwand in ihre Anwendungen zu integrieren.
Seite 20
21. Vielen Dank!
Dr. Timo Borst
Deutsche Zentralbibliothek für
Wirtschaftswissenschaften /
Leibniz-Informationszentrum
Wirtschaft (ZBW)
t.borst@zbw.eu
Seite 21
22. Anwendungsfall_3: Erfassung von Autoren
•Der Normalfall in Katalogen - in anderen Erfassungssystemen bisher
der Ausnahmefall
•Nutzergruppen: BibliothekarInnen + WissenschaftlerInnen (?) +
BibliotheksnutzerInnen (?)
•Vorgang: Eingabe von AutorInnen-Namen
•Zielstellung: Den Vorgang der Autorenerfassung mit Hilfe von
Normdaten zu verbessern, die durch Webservices bereit gestellt werden
Seite 22
24. Bisherige Lösungsansätze zur Aggregierung &
Homogenisierung
•Metadatensuche durch Aggregatoren
• Parallele Abfrage entfernt-verteilter Systeme
• Rückgabe und Aufbereitung des Suchergebnisses als
zusammengesetzte Trefferliste
•Harvesting
• Regelmäßiges Einsammeln von entfernt-verteilten
Metadaten
• Homogenisierung ex ante oder ex post
•Föderierte Suche
•…
Seite 24