This document discusses a project funded by DEFRA to create a taxonomy backbone - a graph of taxonomy - for names and natural history collections. The project has three phases focusing on names and taxonomy, collections, and taxon-based information. The goal is to create, curate, and cite semantically meaningful objects or "things" rather than just strings. A three-layered model is proposed with strings at the bottom layer, things in the middle layer representing names in a nomenclatural sense, and a graph of things at the top layer representing taxonomic concepts and the relationships between them. This changes how data is created, curated, and cited, and allows for integration of different classification data and analysis of differences of opinions in
Short presentation on text and data mining from a digital heritage and library perspective, given at the FutureTDM Knowledge Café in Helsinki during the LIBER 2016 conference.
Between information retrieval services and bibliometrics research. New ...Andrea Scharnhorst
R. Koopman, S. Wang, A. Scharnhorst (2015) Between information retrieval services and bibliometrics research. New ways of semantic browsing and visual analytics. Presentation at the Sigmetrics workshop, ASIST 2015, November 7, 2015 St. Louis, Missouri
Connecting Heterogeneous Collections using Linked DataVictor de Boer
Presentation about connecting Heterogeneous Collections using Linked Data as presented at the NIAS Lorentz workshhop on Migrant Re-Collections (http://www.leiden-delft-erasmus.nl/nl/agenda/2016-08-22-nias-lorentz-workshop-migrant-re-collections-on-digitalising-migrant-heritage)
eMuseum Network is a search and collaboration platform designed and hosted by Gallery Systems. The project enables member museums to share their collections catalogues and to search and export data across all participating collections from a single access point, in a share-and-share-alike fashion. This presentation will give an overview of the project and how Gallery Systems plans to provide a path for museums to participate in the Linked Data Initiative.
discussion of interdisciplinary, international, multicultural, virtual working team on how to globalise an existing common names webservice on the example of japanese resources of different provenience (science: biology, lingustics, cultural studies; citizens; administrative etc.)
WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...Micah Altman
The WorldMap platform http://worldmap.harvard.edu is the largest open source collaborative mapping system in the world, with over 13,000 map layers contributed by thousands of users from Harvard and around the world. Researchers may upload large spatial datasets to the system, create data-driven visualizations, edit data, and control access. Users may keep their data private, share it in groups, or publish to the world.
The user base is interdisciplinary, including scholars from the humanities, social sciences, sciences, public health, design, planning, etc. All are able to access, view, and use one another’s data, either online, via map services, or by downloading.
Current work is underway to create and maintain a global registry of map services and take us a step closer to one-stop-access for public geospatial data. Another project is working on tools to support the visualization of spatial datasets with over a billion features. Current collaborations are underway with groups inside Harvard, such as Dataverse, HarvardX, and various departments, and with groups outside Harvard, such as Cornell University and the University of Pennsylvania. Major additional contributors to the underlying source code include the WorldBank, the U.S. State Department, and the United Nations.
The source code for the WorldMap platform is available on GitHub https://github.com/cga-harvard/cga-worldmap.
Location: E25-202
Discussant: Ben Lewis is system architect and project manager for WorldMap, an open source infrastructure that supports collaborative research centered on geospatial information. Before joining Harvard, Ben was a project manager with Advanced Technology Solutions of Pennsylvania, where he led the company in adopting platform independent approaches to GIS system development. Ben studied Chinese at the University of Wisconsin and has a Masters in Planning from the University of Pennsylvania. After Penn, Ben helped start the GIS Lab at U.C. Berkeley, founded the GIS group for transportation engineering firm McCormick Taylor, and coordinated the Land Acquisition Mapping System for South Florida Water Management District. Ben is especially interested in technologies that lower the barrier to spatial technology access.
Information Science Brown Bag talks, hosted by the Program on Information Science, consists of regular discussions and brainstorming sessions on all aspects of information science and uses of information science and technology to assess and solve institutional, social and research problems. These are informal talks. Discussions are often inspired by real-world problems being faced by the lead discussant.
Trove: More Than a Treasure? ALIA Conference Presentation 2010 Brisbane by Ro...Rose Holley
Describes the innovative development of Trove at the National Library of Australia. Trove is a search engine for Australians about Australians. It contains 90 million items from over 1000 contributing organisations.
Short presentation on text and data mining from a digital heritage and library perspective, given at the FutureTDM Knowledge Café in Helsinki during the LIBER 2016 conference.
Between information retrieval services and bibliometrics research. New ...Andrea Scharnhorst
R. Koopman, S. Wang, A. Scharnhorst (2015) Between information retrieval services and bibliometrics research. New ways of semantic browsing and visual analytics. Presentation at the Sigmetrics workshop, ASIST 2015, November 7, 2015 St. Louis, Missouri
Connecting Heterogeneous Collections using Linked DataVictor de Boer
Presentation about connecting Heterogeneous Collections using Linked Data as presented at the NIAS Lorentz workshhop on Migrant Re-Collections (http://www.leiden-delft-erasmus.nl/nl/agenda/2016-08-22-nias-lorentz-workshop-migrant-re-collections-on-digitalising-migrant-heritage)
eMuseum Network is a search and collaboration platform designed and hosted by Gallery Systems. The project enables member museums to share their collections catalogues and to search and export data across all participating collections from a single access point, in a share-and-share-alike fashion. This presentation will give an overview of the project and how Gallery Systems plans to provide a path for museums to participate in the Linked Data Initiative.
discussion of interdisciplinary, international, multicultural, virtual working team on how to globalise an existing common names webservice on the example of japanese resources of different provenience (science: biology, lingustics, cultural studies; citizens; administrative etc.)
WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...Micah Altman
The WorldMap platform http://worldmap.harvard.edu is the largest open source collaborative mapping system in the world, with over 13,000 map layers contributed by thousands of users from Harvard and around the world. Researchers may upload large spatial datasets to the system, create data-driven visualizations, edit data, and control access. Users may keep their data private, share it in groups, or publish to the world.
The user base is interdisciplinary, including scholars from the humanities, social sciences, sciences, public health, design, planning, etc. All are able to access, view, and use one another’s data, either online, via map services, or by downloading.
Current work is underway to create and maintain a global registry of map services and take us a step closer to one-stop-access for public geospatial data. Another project is working on tools to support the visualization of spatial datasets with over a billion features. Current collaborations are underway with groups inside Harvard, such as Dataverse, HarvardX, and various departments, and with groups outside Harvard, such as Cornell University and the University of Pennsylvania. Major additional contributors to the underlying source code include the WorldBank, the U.S. State Department, and the United Nations.
The source code for the WorldMap platform is available on GitHub https://github.com/cga-harvard/cga-worldmap.
Location: E25-202
Discussant: Ben Lewis is system architect and project manager for WorldMap, an open source infrastructure that supports collaborative research centered on geospatial information. Before joining Harvard, Ben was a project manager with Advanced Technology Solutions of Pennsylvania, where he led the company in adopting platform independent approaches to GIS system development. Ben studied Chinese at the University of Wisconsin and has a Masters in Planning from the University of Pennsylvania. After Penn, Ben helped start the GIS Lab at U.C. Berkeley, founded the GIS group for transportation engineering firm McCormick Taylor, and coordinated the Land Acquisition Mapping System for South Florida Water Management District. Ben is especially interested in technologies that lower the barrier to spatial technology access.
Information Science Brown Bag talks, hosted by the Program on Information Science, consists of regular discussions and brainstorming sessions on all aspects of information science and uses of information science and technology to assess and solve institutional, social and research problems. These are informal talks. Discussions are often inspired by real-world problems being faced by the lead discussant.
Trove: More Than a Treasure? ALIA Conference Presentation 2010 Brisbane by Ro...Rose Holley
Describes the innovative development of Trove at the National Library of Australia. Trove is a search engine for Australians about Australians. It contains 90 million items from over 1000 contributing organisations.
Challenges in developing names services - RDAnickyn
Explanation of how names data are gathered, structured, standardised and annotated - and how these data are mobilised using names services. Challenges are around credit and attribution, usage metrics on services.
Presented at the Research Data Alliance plenary 5, 9-11 March 2015, San Diego.
NISO Virtual Conference: Expanding the Assessment Toolbox: Blending the Old and New Assessment Practices
Dismantling a Single-Discipline Journal Bundle: A Triangulation Method for Assessment Diane (DeDe) Dawson MSc, MLIS, Science Liaison Librarian, Science Library, University of Saskatchewan
Jisc, the Wellcome Library, and non UK universities and professional societies, have been working on a three-year large-scale digitisation project of more than 15 million pages of 19th century published works, resulting in the UK Medical Heritage Library, a valuable resource for the exploration of medical humanities.
I hosted a live lab day on the 26th October, with researchers and developers, at the Wellcome Library, to look at how this resource can be developed. These are the results of the discussion.
A whirlwind introduction to digital humanities for CDP Digital Humanities: Collections & Heritage - current challenges and futures workshop. February 22, 2018 Imperial War Museum
Data Infrastructure and the Scholarly Ecosystem of the FutureAndrew Treloar
Talk delivered at forum at SURF in the Netherlands with the hashtag #disef. Talk deals with an overview of some thinking being done about elements of the ecosystem for scholarship, as well as some slides dealing with the Australian National Data Service (ands.org.au) and the Research Data Alliance (rd-alliance.org). These latter slides were used during a Q&A session as part of the talk.
The Computer Science Ontology: A Large-Scale Taxonomy of Research AreasAngelo Salatino
Ontologies of research areas are important tools for characterising, exploring, and analysing the research landscape. Some fields of research are comprehensively described by large-scale taxonomies, e.g., MeSH in Biology and PhySH in Physics. Conversely, current Computer Science taxonomies are coarse-grained and tend to evolve slowly. For instance, the ACM classification scheme contains only about 2K research topics and the last version dates back to 2012. In this paper, we introduce the Computer Science Ontology (CSO), a large-scale, automatically generated ontology of research areas, which includes about 15K topics and 70K semantic relationships. It was created by applying the Klink-2 algorithm on a very large dataset of 16M scientific articles. CSO presents two main advantages over the alternatives: i) it includes a very large number of topics that do not appear in other classifications, and ii) it can be updated automatically by running Klink-2 on recent corpora of publications. CSO powers several tools adopted by the editorial team at Springer Nature and has been used to enable a variety of solutions, such as classifying research publications, detecting research communities, and predicting research trends. To facilitate the uptake of CSO we have developed the CSO Portal, a web application that enables users to download, explore, and provide granular feedback on CSO at different levels. Users can use the portal to rate topics and relationships, suggest missing relationships, and visualise sections of the ontology. The portal will support the publication of and access to regular new releases of CSO, with the aim of providing a comprehensive resource to the various communities engaged with scholarly data.
Presented at the 2013 Annual Conference of the Council of American Jewish Museums (http://www.cajm.net/annual-conference). Based on the research exhibition "Case Study No. 3 | Sound Objects," created at The Magnes Collection of Jewish Art and Life, University of California, Berkeley, in 2012-2013 (http://bit.ly/sound-objects).
The Computer Science Ontology: A Large-Scale Taxonomy of Research AreasAngelo Salatino
Ontologies of research areas are important tools for characterising, exploring, and analysing the research landscape. Some fields of research are comprehensively described by large-scale taxonomies, e.g., MeSH in Biology and PhySH in Physics. Conversely, current Computer Science taxonomies are coarse-grained and tend to evolve slowly. For instance, the ACM classification scheme contains only about 2K research topics and the last version dates back to 2012. In this paper, we introduce the Computer Science Ontology (CSO), a large-scale, automatically generated ontology of research areas, which includes about 15K topics and 70K semantic relationships. It was created by applying the Klink-2 algorithm on a very large dataset of 16M scientific articles. CSO presents two main advantages over the alternatives: i) it includes a very large number of topics that do not appear in other classifications, and ii) it can be updated automatically by running Klink-2 on recent corpora of publications. CSO powers several tools adopted by the editorial team at Springer Nature and has been used to enable a variety of solutions, such as classifying research publications, detecting research communities, and predicting research trends. To facilitate the uptake of CSO we have developed the CSO Portal, a web application that enables users to download, explore, and provide granular feedback on CSO at different levels. Users can use the portal to rate topics and relationships, suggest missing relationships, and visualise sections of the ontology. The portal will support the publication of and access to regular new releases of CSO, with the aim of providing a comprehensive resource to the various communities engaged with scholarly data.
Sources of Change in Modern Knowledge Organization SystemsPaul Groth
Talk covering how knowledge graphs are making us rethink how change occurs in Knowledge Organization Systems. Based on https://arxiv.org/abs/1611.00217
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 3. In this session, we will cover desktop automation along with UI automation.
Topics covered:
UI automation Introduction,
UI automation Sample
Desktop automation flow
Pradeep Chinnala, Senior Consultant Automation Developer @WonderBotz and UiPath MVP
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Accelerate your Kubernetes clusters with Varnish CachingThijs Feryn
A presentation about the usage and availability of Varnish on Kubernetes. This talk explores the capabilities of Varnish caching and shows how to use the Varnish Helm chart to deploy it to Kubernetes.
This presentation was delivered at K8SUG Singapore. See https://feryn.eu/presentations/accelerate-your-kubernetes-clusters-with-varnish-caching-k8sug-singapore-28-2024 for more details.
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Ramesh Iyer
In today's fast-changing business world, Companies that adapt and embrace new ideas often need help to keep up with the competition. However, fostering a culture of innovation takes much work. It takes vision, leadership and willingness to take risks in the right proportion. Sachin Dev Duggal, co-founder of Builder.ai, has perfected the art of this balance, creating a company culture where creativity and growth are nurtured at each stage.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
Essentials of Automations: Optimizing FME Workflows with ParametersSafe Software
Are you looking to streamline your workflows and boost your projects’ efficiency? Do you find yourself searching for ways to add flexibility and control over your FME workflows? If so, you’re in the right place.
Join us for an insightful dive into the world of FME parameters, a critical element in optimizing workflow efficiency. This webinar marks the beginning of our three-part “Essentials of Automation” series. This first webinar is designed to equip you with the knowledge and skills to utilize parameters effectively: enhancing the flexibility, maintainability, and user control of your FME projects.
Here’s what you’ll gain:
- Essentials of FME Parameters: Understand the pivotal role of parameters, including Reader/Writer, Transformer, User, and FME Flow categories. Discover how they are the key to unlocking automation and optimization within your workflows.
- Practical Applications in FME Form: Delve into key user parameter types including choice, connections, and file URLs. Allow users to control how a workflow runs, making your workflows more reusable. Learn to import values and deliver the best user experience for your workflows while enhancing accuracy.
- Optimization Strategies in FME Flow: Explore the creation and strategic deployment of parameters in FME Flow, including the use of deployment and geometry parameters, to maximize workflow efficiency.
- Pro Tips for Success: Gain insights on parameterizing connections and leveraging new features like Conditional Visibility for clarity and simplicity.
We’ll wrap up with a glimpse into future webinars, followed by a Q&A session to address your specific questions surrounding this topic.
Don’t miss this opportunity to elevate your FME expertise and drive your projects to new heights of efficiency.
Connector Corner: Automate dynamic content and events by pushing a buttonDianaGray10
Here is something new! In our next Connector Corner webinar, we will demonstrate how you can use a single workflow to:
Create a campaign using Mailchimp with merge tags/fields
Send an interactive Slack channel message (using buttons)
Have the message received by managers and peers along with a test email for review
But there’s more:
In a second workflow supporting the same use case, you’ll see:
Your campaign sent to target colleagues for approval
If the “Approve” button is clicked, a Jira/Zendesk ticket is created for the marketing design team
But—if the “Reject” button is pushed, colleagues will be alerted via Slack message
Join us to learn more about this new, human-in-the-loop capability, brought to you by Integration Service connectors.
And...
Speakers:
Akshay Agnihotri, Product Manager
Charlie Greenberg, Host
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
A names backbone - a graph of taxonomy
1. A names backbone: a
graph of taxonomy
Nicky Nicolson, RBG Kew
Informatics Horizons for the Natural History Museum
24th July 2013
2. Project and aim
Project
• DEFRA funded
• Split into 3 phases:
• Names and taxonomy
• Collections
• Taxon based information
Aim
• To create, curate and cite semantically-meaningful
objects
• “Things “ not “strings”
13. Summary
Implications:
- Changes in the way we create, curate and cite data
and how we present it to systems and users
Applications:
- Integration of data, when that data has been stored
using different classifications
- Analysis - why do differences of opinion occur?
- Synthesis - propose classifications by integrating
existing overlapping concept data
Not pulling data together for the hell of it.It should increase efficiency, but also allow us to participate in analysis of existing data and synthesis of new dataParticipate in semantic web developments – un-covering the meaning in our data.
Attempts at transcribing an entity (“strings”) -> Recognised entities (“things”) -> Assertions about the inter-relationships between entities (a graph of “things”)Lexical entities -> semantic entities
Name occurrence layer – any informal attempt at the transcription of a nameWe struggled about what to call these. We called them name occurrences for a bit. Then we looked at the data and we thought “nomenclutter” was probably a better term.
Some name occurrences are code governed names – eligible to appear in the next layer – the names layer – this holds all the objective published facts about a name – its orthography, authorship, protologue reference, type citation and objective synonymy
Hypotheses about how names inter-relateConcepts layer – hypotheses draw these names together to form concepts via heterotypic synonymy.Projects such as World checklist (monographs) but also floras (regional checklists).The questions we are asked tend to be about concepts – species, their characteristics and how they inter-relate. But the resources we have tend to be name occurrences.So we want to answer scientific questions and operate at the concept level. But we too often have to start at the lowest level.
We need to provide ways to allow people to better navigate between the layers, and better focus their efforts – e.g. build classifications using the same objective bases.
We need to provide ways to allow people to better navigate between the layers, and better focus their efforts – e.g. build classifications using the same objective bases.We’ve recognised we need this three layer model. Conceptually it is a graph structure – and in implementation, we’re using graph technology to store and process the data. Graph technology has moved from computer science research to the mainstream with increasing use of social networks. This approach promotes the relations between items as “first class citizens” in the model.
Populating the centre layer – agreeing on the facts – is key.Changing our processesCollaboratively working on an authoritative set of names, which we use to build the graph of concepts
Holding data on the relations means that we can more precisely model the nomenclatural / taxonomic domain. We can reuse names to form many different, overlapping, conflicting hypotheses. We can compare hypotheses by looking at how a name (as an object – a “thing” not a “string”) is treated in different classifications.
Contrived example, but shows that completely opposing hypotheses can be modelled using the same basic elements.