Lecture presented at the Journals Club of the Naturhistorisches Museum Bern, March 17, 2014.
"Towards an (European) Open Biodiversity Knowledge Management System"
The Biodiversity Heritage Library 10 Years and More!Martin Kalfatovic
The Biodiversity Heritage Library 10 Years and More! Martin R. Kalfatovic. TDWG 2016. Centro de Transferencia Tecnológica y Educación Continua (CTEC) San Carlos, Santa Clara, Costa Rica. 7 December 2016.
Increasing Access, Promoting Progress: Empowering Global Research through the...Martin Kalfatovic
Increasing Access, Promoting Progress: Empowering Global Research through the BHL. Martin R. Kalfatovic. Group of 12 Meeting. Muséum national d’Histoire naturelle. Paris, 2 December 2016.
FAIR data requires FAIR ontologies, how do we do?EUDAT
This document discusses making ontologies FAIR (Findable, Accessible, Interoperable, Reusable) through the use of ontology repositories. It provides examples of existing repositories like NCBO BioPortal and how they help standardize ontologies and make them more discoverable and reusable. The document also describes related projects that are reusing NCBO's technology like AgroPortal for agricultural ontologies and SIFR for French biomedical ontologies and terminologies. Key challenges discussed are scaling repositories to many domains and large numbers of ontologies while maintaining features and curation.
Increasing Access, Promoting Progress: Empowering Global Research through the...Martin Kalfatovic
Increasing Access, Promoting Progress: Empowering Global Research through the BHL. Martin R. Kalfatovic. Expanding Access to Biodiversity Workshop. Atlanta History Center. Atlanta, GA. 24 January 2017.
This document discusses content mining of scientific literature in Europe. It describes what content mining is and why it is useful, particularly for tasks like mapping clinical trials to related papers. However, copyright restrictions and technical obstacles imposed by publishers currently limit widespread content mining. The document advocates for policies and technologies that enable open content mining of facts and data from the complete scientific literature for reproducible research.
This document describes a project to create a universal biological indexer and organizer. It aims to address the challenges of indexing and searching biological data using taxonomic names, which are complex due to issues like synonyms, homonyms, and changing classifications. The project involves creating databases called NameBank and ClassificationBank that index all taxonomic names and concepts. These will serve as the foundation for tools and applications to help integrate names into literature retrieval, search, and browsing of biological data from various sources. The end goal is a comprehensive portal and services that enable connections within the biological research community.
The Biodiversity Heritage Library 10 Years and More!Martin Kalfatovic
The Biodiversity Heritage Library 10 Years and More! Martin R. Kalfatovic. TDWG 2016. Centro de Transferencia Tecnológica y Educación Continua (CTEC) San Carlos, Santa Clara, Costa Rica. 7 December 2016.
Increasing Access, Promoting Progress: Empowering Global Research through the...Martin Kalfatovic
Increasing Access, Promoting Progress: Empowering Global Research through the BHL. Martin R. Kalfatovic. Group of 12 Meeting. Muséum national d’Histoire naturelle. Paris, 2 December 2016.
FAIR data requires FAIR ontologies, how do we do?EUDAT
This document discusses making ontologies FAIR (Findable, Accessible, Interoperable, Reusable) through the use of ontology repositories. It provides examples of existing repositories like NCBO BioPortal and how they help standardize ontologies and make them more discoverable and reusable. The document also describes related projects that are reusing NCBO's technology like AgroPortal for agricultural ontologies and SIFR for French biomedical ontologies and terminologies. Key challenges discussed are scaling repositories to many domains and large numbers of ontologies while maintaining features and curation.
Increasing Access, Promoting Progress: Empowering Global Research through the...Martin Kalfatovic
Increasing Access, Promoting Progress: Empowering Global Research through the BHL. Martin R. Kalfatovic. Expanding Access to Biodiversity Workshop. Atlanta History Center. Atlanta, GA. 24 January 2017.
This document discusses content mining of scientific literature in Europe. It describes what content mining is and why it is useful, particularly for tasks like mapping clinical trials to related papers. However, copyright restrictions and technical obstacles imposed by publishers currently limit widespread content mining. The document advocates for policies and technologies that enable open content mining of facts and data from the complete scientific literature for reproducible research.
This document describes a project to create a universal biological indexer and organizer. It aims to address the challenges of indexing and searching biological data using taxonomic names, which are complex due to issues like synonyms, homonyms, and changing classifications. The project involves creating databases called NameBank and ClassificationBank that index all taxonomic names and concepts. These will serve as the foundation for tools and applications to help integrate names into literature retrieval, search, and browsing of biological data from various sources. The end goal is a comprehensive portal and services that enable connections within the biological research community.
Liberating facts from the scientific literature - Jisc Digifest 2016 TheContentMine
Published on Mar 4, 2016 by PMR
Text and data mining (TDM) techniques can be applied to a wide range of materials, from published research papers, books and theses, to cultural heritage materials, digitised collections, administrative and management reports and documentation, etc. Use cases include academic research, resource discovery and business intelligence.
This workshop will show the value and benefits of TDM techniques and demonstrate how ContentMine aims to liberate 100,000,000 facts from the scientific literature, and ContentMine will provide a hands on demo on a topical and accessible scientific/medical subject.
An Introduction to the Biodiversity Heritage LibraryMartin Kalfatovic
An Introduction to the Biodiversity Heritage Library. Martin R. Kalfatovic. BHL Australian Node Meeting: National Library of Australia. 4 June 2010. Canberra, Australia.
Talk given at the symposium about government-funded databases and open chemistry at the national meeting of the American Chemical Society in Washington, 21 Aug 2017
Automatic Extraction of Knowledge from Biomedical literature TheContentMine
Published on Mar 16, 2016 by PMR
A plenary lecture to Cochrane Collaboration in Birmingham, on the value of automatically extracting knowledge. Covers the Why? How? What? Who? and problems and invites collaboration
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...Ross Mounce
A talk given at the Geological Society of London, UK on 2016/03/09 as part of the Lyell meeting on Palaeoinformatics. http://www.geolsoc.org.uk/lyell16 #lyell16
PRIDE and ProteomeXchange: supporting the cultural change in proteomics publi...Juan Antonio Vizcaino
The document discusses PRIDE and ProteomeXchange, which are resources that support the deposition of proteomics data to public repositories. PRIDE stores mass spectrometry-based proteomics data, and is one of the repositories that is part of ProteomeXchange, a framework that allows standard submission of proteomics data between major repositories. The document outlines the cultural change in proteomics towards public data sharing, and provides information on submitting proteomics data to PRIDE and accessing data deposited in PRIDE and ProteomeXchange.
Automatic Extraction of Knowledge from Biomedical literaturepetermurrayrust
a plenary lecture to Cochrane Collaboration in Birmingham, on the value of automatically extracting knowledge. Covers the Why? How? What? Who? and problems and invites collaboration
This document discusses the Biodiversity Heritage Library (BHL) project and its role in supporting other biodiversity initiatives like the Encyclopedia of Life (EOL). The BHL aims to digitize published literature on biodiversity and make it openly accessible online. It has already digitized over 4 million pages and works closely with groups like EOL to integrate taxonomic data. The document outlines the BHL's goals, partnerships, digitization process, and how it brings together distributed information on species through its use of taxonomic intelligence.
Automatic Extraction of Knowledge from the LiteratureTheContentMine
Published on May 11, 2016 by PMR
ContentMine tools (and the Harvest alliance) can be used to search the literature for knowledge, especially in biomedicine. All tools are Open and shortly we shall be indexing the complete daily scholarly literature
Sharing re-usable phylogenetic data: we're not there yetRoss Mounce
Ross Mounce discusses challenges with sharing phylogenetic data from published studies. Only a small percentage of studies archive their data, and researchers are often unwilling to share data upon request. Mounce developed tools to extract and reformat phylogenetic data from PDFs to make it more accessible and reusable. He received funding to continue this work and develop software to unlock and open phylogenetic literature data.
The scientific and medical literature is a vast resource of knowledge, but it needs turning into semantic FAIR form. The ContentMine can do this and we presented a rapid overview of the potential
This document discusses ontology and vocabulary management technologies in France. It describes challenges for ontology repositories, including metadata, evaluation, multilingual support, ontology alignment, scalability and interoperability. Two collaborative projects are highlighted that reuse NCBO ontology repository technology: SIFR BioPortal for French biomedical ontologies and AgroPortal for agronomy. Shared technology visions are proposed, with ontology repositories working together through a unified open source platform to support multiple domains. Open questions remain around long-term support, the European Open Science Cloud, and France's role in this area.
Automatic Extraction of Knowledge from the Literaturepetermurrayrust
ContentMine tools (and the Harvest alliance) can be used to search the literature for knowledge, especially in biomedicine. All tools are Open and shortly we shall be indexing the complete daily scholarly literature
Published on Jan 29, 2016 by PMR
Keynote talk to LEARN (LERU/H2020 project) for research data management. Emphasizes that problems are cultural not technical. Promotes modern approaches such as Git / continuous Integration, announces DAT. Asserts that the Right to Read in the Right to Mine. Calls for widespread development of content mining (TDM)
Liberating facts from the scientific literature - Jisc Digifest 2016 TheContentMine
Published on Mar 4, 2016 by PMR
Text and data mining (TDM) techniques can be applied to a wide range of materials, from published research papers, books and theses, to cultural heritage materials, digitised collections, administrative and management reports and documentation, etc. Use cases include academic research, resource discovery and business intelligence.
This workshop will show the value and benefits of TDM techniques and demonstrate how ContentMine aims to liberate 100,000,000 facts from the scientific literature, and ContentMine will provide a hands on demo on a topical and accessible scientific/medical subject.
An Introduction to the Biodiversity Heritage LibraryMartin Kalfatovic
An Introduction to the Biodiversity Heritage Library. Martin R. Kalfatovic. BHL Australian Node Meeting: National Library of Australia. 4 June 2010. Canberra, Australia.
Talk given at the symposium about government-funded databases and open chemistry at the national meeting of the American Chemical Society in Washington, 21 Aug 2017
Automatic Extraction of Knowledge from Biomedical literature TheContentMine
Published on Mar 16, 2016 by PMR
A plenary lecture to Cochrane Collaboration in Birmingham, on the value of automatically extracting knowledge. Covers the Why? How? What? Who? and problems and invites collaboration
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...Ross Mounce
A talk given at the Geological Society of London, UK on 2016/03/09 as part of the Lyell meeting on Palaeoinformatics. http://www.geolsoc.org.uk/lyell16 #lyell16
PRIDE and ProteomeXchange: supporting the cultural change in proteomics publi...Juan Antonio Vizcaino
The document discusses PRIDE and ProteomeXchange, which are resources that support the deposition of proteomics data to public repositories. PRIDE stores mass spectrometry-based proteomics data, and is one of the repositories that is part of ProteomeXchange, a framework that allows standard submission of proteomics data between major repositories. The document outlines the cultural change in proteomics towards public data sharing, and provides information on submitting proteomics data to PRIDE and accessing data deposited in PRIDE and ProteomeXchange.
Automatic Extraction of Knowledge from Biomedical literaturepetermurrayrust
a plenary lecture to Cochrane Collaboration in Birmingham, on the value of automatically extracting knowledge. Covers the Why? How? What? Who? and problems and invites collaboration
This document discusses the Biodiversity Heritage Library (BHL) project and its role in supporting other biodiversity initiatives like the Encyclopedia of Life (EOL). The BHL aims to digitize published literature on biodiversity and make it openly accessible online. It has already digitized over 4 million pages and works closely with groups like EOL to integrate taxonomic data. The document outlines the BHL's goals, partnerships, digitization process, and how it brings together distributed information on species through its use of taxonomic intelligence.
Automatic Extraction of Knowledge from the LiteratureTheContentMine
Published on May 11, 2016 by PMR
ContentMine tools (and the Harvest alliance) can be used to search the literature for knowledge, especially in biomedicine. All tools are Open and shortly we shall be indexing the complete daily scholarly literature
Sharing re-usable phylogenetic data: we're not there yetRoss Mounce
Ross Mounce discusses challenges with sharing phylogenetic data from published studies. Only a small percentage of studies archive their data, and researchers are often unwilling to share data upon request. Mounce developed tools to extract and reformat phylogenetic data from PDFs to make it more accessible and reusable. He received funding to continue this work and develop software to unlock and open phylogenetic literature data.
The scientific and medical literature is a vast resource of knowledge, but it needs turning into semantic FAIR form. The ContentMine can do this and we presented a rapid overview of the potential
This document discusses ontology and vocabulary management technologies in France. It describes challenges for ontology repositories, including metadata, evaluation, multilingual support, ontology alignment, scalability and interoperability. Two collaborative projects are highlighted that reuse NCBO ontology repository technology: SIFR BioPortal for French biomedical ontologies and AgroPortal for agronomy. Shared technology visions are proposed, with ontology repositories working together through a unified open source platform to support multiple domains. Open questions remain around long-term support, the European Open Science Cloud, and France's role in this area.
Automatic Extraction of Knowledge from the Literaturepetermurrayrust
ContentMine tools (and the Harvest alliance) can be used to search the literature for knowledge, especially in biomedicine. All tools are Open and shortly we shall be indexing the complete daily scholarly literature
Published on Jan 29, 2016 by PMR
Keynote talk to LEARN (LERU/H2020 project) for research data management. Emphasizes that problems are cultural not technical. Promotes modern approaches such as Git / continuous Integration, announces DAT. Asserts that the Right to Read in the Right to Mine. Calls for widespread development of content mining (TDM)
Data Sharing Principles and Legal Interoperability for Essential Biodiversity...agosti
The document discusses principles of open data sharing and legal interoperability of research data. It provides summaries of the GEO Data Sharing Principles from 2005 and a proposed updated version from 2015. The principles advocate sharing data as open data by default without charge or reuse restrictions. Exceptions can be made for reasons of national security, endangered species protection, or other restrictions allowed by law. The document also summarizes proposed principles from RDA/CODATA on facilitating lawful access to research data while balancing various legal interests through transparent communication of rights.
The document discusses Plazi's work on converting biodiversity literature into structured data through text mining and markup. Key points include:
- Plazi extracts scientific names, tables, references and geographic data from literature and converts it into semantically enriched text and RDF.
- Their pipelines currently have over 50,000 taxonomic treatments life and are providing data to databases like NCBI, GBIF and EOL.
- Future plans include collaborating with ContentMine for daily treatment extraction, releasing RDF and text mining versions 1.0, and expanding the biodiversity literature repository to 100,000 references.
DOI and the Mitteilungen: communicating scientific results in the futureagosti
lecture presented at Ento.CH, Neuchâtel, Switzerland; March 4, 2016. Developing a scenario for the future of the Mitteilungen der Schweizerischen Entomologischen Gesellschaft into the direction of semantically enhanced publications.
This document summarizes a presentation about the Plazi Treatment Repository project. It discusses how Plazi aims to make over 1 million taxonomic treatments openly accessible by semantically enhancing and linking content from biodiversity literature. A major challenge is copyright restrictions on publications, which Plazi addresses by only including non-copyrighted content and material for internal use. The presentation argues for legal changes like mandatory research licenses to further remove barriers to information exchange.
Plazi or the challenge to free biodiversity data caught in hundreds of millio...agosti
A call for participation of citizen scientists to help to convert the half billion of pages of biodiversity legacy literature into Linked Open Data, an imperative for conservation of the worlds declining biodiversity, but as much as a scientific frontier: How many species are there? Traditionally, citizen scientist plaid a decisive role in describing the Earth's biodiversity. Now the confront a paradoxical challenge that they do not have access to their published scientifc corpus and might have to launch into a second wave of discovery: This time in the printed record.
Lecture held at the "Opportunities and challenges for citizen scientists" workshop at the ETH Zurich, January 23, 2015
A Step Towards (From) Read to Write Access to Taxonomic Publicationsagosti
Lecture provided at the 8th International Congress of Hymenopterists, Cuzco, Peru, July 23, 2014.
The lectures makes a case to join the bibligraphies and pdf of hymenoptera taxonomy literature on the Biodiersity Literature Repository / Zenodo
Setting the Scene for ViBRANT – Strategy, Philosophy and Communicationvbrant
The document discusses the future of scientific publishing and open access. It envisions a future where publications are semantically marked up and linked to external data sources to enable advanced text mining and knowledge extraction. Treatments of species would be structured using XML to define content and be linked to identifiers, bibliographic metadata, and other sources. This would allow publications to be queried and analyzed by both humans and machines. Prospective publications could be semantically enhanced from the start, while tools are needed to mark up legacy literature. The goal is open access to scientific knowledge beyond traditional PDFs.
The document discusses the future of scientific publishing and open access to scientific literature. It envisions a future where publications are semantically marked up and linked to external data sources to enable advanced text mining and knowledge extraction. Treatments of species would be structured using XML to define content and be linked to identifiers, bibliographic metadata, and other sources. This would allow publications to be queried and analyzed by both humans and machines. Prospective publications could be semantically enhanced from the start, while tools are needed to mark up legacy literature. The goal is open access and dissemination of scientific findings funded by public resources.
The Biodiversity Heritage Library (BHL) is a digital library that aims to make the literature of biodiversity openly available to the world. It has scanned over 70,000 volumes and 26 million pages of literature. The BHL was created in response to the "taxonomic impediment", which describes gaps in taxonomic knowledge. It works with natural history institutions around the world to digitize their collections. The BHL provides open access to this literature and develops tools to enhance access, such as name finding and citation services.
The Encyclopedia of Life, Biodiversity Heritage Library, Biodiversity Informa...drielinger
The document discusses the Encyclopedia of Life (EOL) project, which aims to create a web page for every known species. It provides details on the project's goals and structure, as well as its partnerships with other biodiversity organizations. These include the Biodiversity Heritage Library (BHL), which will digitize literature to support the EOL by providing the scientific underpinning. The BHL is forming collaborations internationally to make biodiversity literature openly accessible online.
The document summarizes the goals and activities of the Biodiversity Heritage Library (BHL) project, which aims to digitize the published literature on biodiversity and make it openly accessible online. It discusses BHL's partnerships with other organizations like the Encyclopedia of Life to aggregate content. It also provides details on BHL's scanning operations and efforts to engage international partners to expand its global coverage of literature.
ViBRANT—Virtual Biodiversity Research and Access Network for TaxonomyVince Smith
Presented by Dave Roberts and coauthored by Vince Smith at BioIdentify 2010, the National Muséum of Natural History (MNHN), Paris, France. 20-22 Sept, 2010.
Digital Services Division & The Biodiversity Heritage LibraryMartin Kalfatovic
Digital Services Division & The Biodiversity Heritage Library. Martin R. Kalfatovic. Smithsonian Science Executive Committee. Washington, DC. 12 January 2015
Bouchout Declaration
Introduction to the Bouchout Declaration for Open Biodiversity Knowledge Management.
The declaration has been initiated by the Pro-iBiosphere consortium and the official launch will be on June 12, 2014 at the Bouchout castle at the Plantentuin Meise, Meise, Belgium.
This presentation has been given at the Swiss "Konservatorentagung 2014, Frauenfeld", May 23.2014
Museum collections as research data - October 2019Dag Endresen
This document discusses how natural history museums can embrace open science principles by making their collections openly available as research data. It provides context on initiatives like GBIF and DiSSCo that aim to publish biodiversity data according to common standards. While only around 5-10% of specimen records are currently digitized globally, the push for open access to publicly funded research means that museums need to develop new approaches to remain relevant providers of scientific resources. Open science practices like data sharing, citation and reuse can help address reproducibility issues and enable new discovery.
Donat Agosti - Copyright, Biopiracy and the Taxonomic Impediment ICZN
The document discusses several issues related to open access of taxonomic publications, including:
1) Taxonomic publications currently face a "taxonomic impediment" due to copyright restrictions that limit access. Making publications open access could help address conservation and research needs.
2) Descriptions of new species in particular should be considered open access similar to gene sequences, as they represent factual scientific knowledge.
3) Potential solutions include making all taxonomic publications openly accessible online, or at least allowing open access to the actual species descriptions within publications.
ContentMining for France and Europe; Lessons from 2 years in UKpetermurrayrust
This document summarizes Peter Murray-Rust's presentation on two years of content mining in the UK and lessons for France and Europe. Some key points discussed include:
- Content mining can save lives by enabling researchers to search literature and find past warnings, as in the case of Ebola.
- However, publishers like Elsevier and Wiley have stopped researchers' content mining efforts, hampering their work.
- France, Europe and the UK must actively support content mining through funding, tools, training and protecting researchers from restrictive publishers.
- Examples are given of ContentMine fellows' projects mining literature on topics like weevil-plant associations, cell migration and depression in animals.
The document discusses biodiversity informatics and efforts to create comprehensive digital libraries and resources about organisms and taxonomy. It describes the Biodiversity Heritage Library project to digitize literature on biodiversity and make it openly accessible online. Key goals are to develop ways to reconcile different scientific names for the same organisms and integrate distributed content from various sources and languages. This will allow users to find all information about an organism by searching with any name.
FAO has developed several semantic technologies and ontologies to improve information sharing and interoperability across different knowledge domains, including AGROVOC, the Agricultural Ontology Service, ontologies for fisheries, crops, nutrition, and geopolitics. These projects use techniques such as concept mapping, multilingual support, and semantic search to facilitate knowledge organization and exchange.
An International Cooperative Digital Library for Taxonomic Literature: The Bi...Martin Kalfatovic
An International Cooperative Digital Library for Taxonomic Literature: The Biodiversity Heritage Library. Martin R. Kalfatovic. American Library Association Annual Meeting. Collaborative Digital Initiatives: Show and Tell and Lessons Learned. June 30, 2008. Anaheim, CA.
Biodiversity Heritage Library : Development and PartnerhipsNancy Gwinn
Biodiversity Heritage Library. Development and Partnerships. Nancy E. Gwinn. Biodiversity and Ecosystems Informatics Group, National Science Foundation, March 24, 2008, Washington, D.C.
The document summarizes the Digital Taiwan – Culture & Nature Project which aims to digitize and provide open access to Taiwan's cultural and natural heritage. It discusses two main phases: 1) The National Digital Archives Program from 2002-2006 digitized over 3 million cultural heritage pieces. 2) The Taiwan e-Learning and Digital Archives Program integrated an e-learning component and established an international collaboration division to share Taiwan's digital achievements globally. The project works to digitize museum collections, integrate biodiversity data, and increase translated English content to promote international awareness of Taiwan's archives.
The Biodiversity Heritage Library (BHL) is a project to digitize the published literature of biodiversity. It aims to provide open access to over 5.4 million books and publications dating back to 1469. The BHL involves many museum, botanical garden, and research institution libraries collaborating to scan materials. It uses taxonomic intelligence to link names in the literature to databases. The long-term goal is a sustainable platform to make biodiversity literature freely available online.
Pensoft is presenting Biodiversity Data Journal's Species Conservation Profile (SCP) - a streamlined workflow for collaborative authoring, peer-review and scholarly publication, serving the IUCN Red Data List
Revolutionizing the Research on Ants through new Methods and Technologies: th...agosti
Invited lecture presented at the XXII Simpósio de Mirmecologia, Ilhéus, BA, Brazil, October 22, 2015. The title takes reference to the ant conference to create a standard protocol to measure and monitor ants - the manual has been published 2000 as "Ants: measuring and monitoring biodiversity" which has been close to 2500 times cited by followup papers - and with the preparation started in 1995 in Ilhéus. The focus is on open access, digital library, sharing of data, publishing and the sociology of myrmecology, and how the data could be used in project like EU-BON.
Plazi's thrive to achive Tim Berners-Lee 5*data for biodiversity literature. Lecture from the Opendata.ch/2015 conference, July1, 2015, Berne, Switzerland http://opendata.ch/projects/opendata-ch2015-konferenz/
This document summarizes the Bouchout Declaration for Open Biodiversity Knowledge Management. The declaration encourages an overarching approach to open biodiversity knowledge management based on principles like open access, open licenses, attribution, infrastructure, registers, persistent identifiers, linked open data, development and business sustainability. It aims to foster free and open access to biodiversity data through policy developments and technical agreements.
This document discusses building an open biodiversity knowledge management system that can extract, store, and serve information on taxa in an interoperable way across kingdoms of life. It notes that the legacy literature contains over 200 million pages but data is incomplete and disconnected. The Pro-iBiosphere project aims to demonstrate how to markup taxon treatments to make them accessible and linkable. Pilots have marked up over 1,500 treatments of plants, fungi, bryophytes, insects and spiders. The document recommends standardizing markup and applying it prospectively to enhance semantic interoperability of biodiversity data.
This document discusses a schema for describing and exchanging the content of taxonomic publications in a way that allows both human and machine access. It proposes using semantic markup like XML to tag elements in publications like names, descriptions, and references in a way that links related data across sources. This would allow content to be more accessible for tasks like data mining while maintaining context. The schema is part of ongoing work by Plazi to apply semantic markup to digitize existing publications and structure new ones for improved dissemination and reuse of taxonomic knowledge.
20110222 behesty monitoring and measuring biodiversityagosti
This document discusses various topics related to monitoring and measuring biodiversity on a global scale. It mentions several key organizations and initiatives, including the Earth Summit, IPBES, NCBI, GBIF, TDWG, and Darwin Core, that are involved with assessing global biodiversity patterns and developing standards for exchanging biodiversity data. The document emphasizes that monitoring biodiversity as a comparative science requires access to data, use of identification aids, metadata standards, networks to share information, and applying data to understand changes over space and time.
The document discusses making taxonomic literature openly accessible in digital format. It proposes marking up publications with XML tags to encode semantic information that allows machines to extract and link data. This would facilitate access to the estimated 100 million pages of existing literature as well as integration of new data. Key recommendations include adopting open access policies, understanding copyright, self-archiving publications, using structured formats like XML, and developing standards and infrastructure to support digitization and interoperability of biodiversity data.
Best 20 SEO Techniques To Improve Website Visibility In SERPPixlogix Infotech
Boost your website's visibility with proven SEO techniques! Our latest blog dives into essential strategies to enhance your online presence, increase traffic, and rank higher on search engines. From keyword optimization to quality content creation, learn how to make your site stand out in the crowded digital landscape. Discover actionable tips and expert insights to elevate your SEO game.
Driving Business Innovation: Latest Generative AI Advancements & Success StorySafe Software
Are you ready to revolutionize how you handle data? Join us for a webinar where we’ll bring you up to speed with the latest advancements in Generative AI technology and discover how leveraging FME with tools from giants like Google Gemini, Amazon, and Microsoft OpenAI can supercharge your workflow efficiency.
During the hour, we’ll take you through:
Guest Speaker Segment with Hannah Barrington: Dive into the world of dynamic real estate marketing with Hannah, the Marketing Manager at Workspace Group. Hear firsthand how their team generates engaging descriptions for thousands of office units by integrating diverse data sources—from PDF floorplans to web pages—using FME transformers, like OpenAIVisionConnector and AnthropicVisionConnector. This use case will show you how GenAI can streamline content creation for marketing across the board.
Ollama Use Case: Learn how Scenario Specialist Dmitri Bagh has utilized Ollama within FME to input data, create custom models, and enhance security protocols. This segment will include demos to illustrate the full capabilities of FME in AI-driven processes.
Custom AI Models: Discover how to leverage FME to build personalized AI models using your data. Whether it’s populating a model with local data for added security or integrating public AI tools, find out how FME facilitates a versatile and secure approach to AI.
We’ll wrap up with a live Q&A session where you can engage with our experts on your specific use cases, and learn more about optimizing your data workflows with AI.
This webinar is ideal for professionals seeking to harness the power of AI within their data management systems while ensuring high levels of customization and security. Whether you're a novice or an expert, gain actionable insights and strategies to elevate your data processes. Join us to see how FME and AI can revolutionize how you work with data!
Full-RAG: A modern architecture for hyper-personalizationZilliz
Mike Del Balso, CEO & Co-Founder at Tecton, presents "Full RAG," a novel approach to AI recommendation systems, aiming to push beyond the limitations of traditional models through a deep integration of contextual insights and real-time data, leveraging the Retrieval-Augmented Generation architecture. This talk will outline Full RAG's potential to significantly enhance personalization, address engineering challenges such as data management and model training, and introduce data enrichment with reranking as a key solution. Attendees will gain crucial insights into the importance of hyperpersonalization in AI, the capabilities of Full RAG for advanced personalization, and strategies for managing complex data integrations for deploying cutting-edge AI solutions.
GraphRAG for Life Science to increase LLM accuracyTomaz Bratanic
GraphRAG for life science domain, where you retriever information from biomedical knowledge graphs using LLMs to increase the accuracy and performance of generated answers
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfPaige Cruz
Monitoring and observability aren’t traditionally found in software curriculums and many of us cobble this knowledge together from whatever vendor or ecosystem we were first introduced to and whatever is a part of your current company’s observability stack.
While the dev and ops silo continues to crumble….many organizations still relegate monitoring & observability as the purview of ops, infra and SRE teams. This is a mistake - achieving a highly observable system requires collaboration up and down the stack.
I, a former op, would like to extend an invitation to all application developers to join the observability party will share these foundational concepts to build on:
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024Neo4j
Neha Bajwa, Vice President of Product Marketing, Neo4j
Join us as we explore breakthrough innovations enabled by interconnected data and AI. Discover firsthand how organizations use relationships in data to uncover contextual insights and solve our most pressing challenges – from optimizing supply chains, detecting fraud, and improving customer experiences to accelerating drug discoveries.
HCL Notes and Domino License Cost Reduction in the World of DLAUpanagenda
Webinar Recording: https://www.panagenda.com/webinars/hcl-notes-and-domino-license-cost-reduction-in-the-world-of-dlau/
The introduction of DLAU and the CCB & CCX licensing model caused quite a stir in the HCL community. As a Notes and Domino customer, you may have faced challenges with unexpected user counts and license costs. You probably have questions on how this new licensing approach works and how to benefit from it. Most importantly, you likely have budget constraints and want to save money where possible. Don’t worry, we can help with all of this!
We’ll show you how to fix common misconfigurations that cause higher-than-expected user counts, and how to identify accounts which you can deactivate to save money. There are also frequent patterns that can cause unnecessary cost, like using a person document instead of a mail-in for shared mailboxes. We’ll provide examples and solutions for those as well. And naturally we’ll explain the new licensing model.
Join HCL Ambassador Marc Thomas in this webinar with a special guest appearance from Franz Walder. It will give you the tools and know-how to stay on top of what is going on with Domino licensing. You will be able lower your cost through an optimized configuration and keep it low going forward.
These topics will be covered
- Reducing license cost by finding and fixing misconfigurations and superfluous accounts
- How do CCB and CCX licenses really work?
- Understanding the DLAU tool and how to best utilize it
- Tips for common problem areas, like team mailboxes, functional/test users, etc
- Practical examples and best practices to implement right away
Unlocking Productivity: Leveraging the Potential of Copilot in Microsoft 365, a presentation by Christoforos Vlachos, Senior Solutions Manager – Modern Workplace, Uni Systems
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!SOFTTECHHUB
As the digital landscape continually evolves, operating systems play a critical role in shaping user experiences and productivity. The launch of Nitrux Linux 3.5.0 marks a significant milestone, offering a robust alternative to traditional systems such as Windows 11. This article delves into the essence of Nitrux Linux 3.5.0, exploring its unique features, advantages, and how it stands as a compelling choice for both casual users and tech enthusiasts.
Essentials of Automations: The Art of Triggers and Actions in FMESafe Software
In this second installment of our Essentials of Automations webinar series, we’ll explore the landscape of triggers and actions, guiding you through the nuances of authoring and adapting workspaces for seamless automations. Gain an understanding of the full spectrum of triggers and actions available in FME, empowering you to enhance your workspaces for efficient automation.
We’ll kick things off by showcasing the most commonly used event-based triggers, introducing you to various automation workflows like manual triggers, schedules, directory watchers, and more. Plus, see how these elements play out in real scenarios.
Whether you’re tweaking your current setup or building from the ground up, this session will arm you with the tools and insights needed to transform your FME usage into a powerhouse of productivity. Join us to discover effective strategies that simplify complex processes, enhancing your productivity and transforming your data management practices with FME. Let’s turn complexity into clarity and make your workspaces work wonders!
For the full video of this presentation, please visit: https://www.edge-ai-vision.com/2024/06/building-and-scaling-ai-applications-with-the-nx-ai-manager-a-presentation-from-network-optix/
Robin van Emden, Senior Director of Data Science at Network Optix, presents the “Building and Scaling AI Applications with the Nx AI Manager,” tutorial at the May 2024 Embedded Vision Summit.
In this presentation, van Emden covers the basics of scaling edge AI solutions using the Nx tool kit. He emphasizes the process of developing AI models and deploying them globally. He also showcases the conversion of AI models and the creation of effective edge AI pipelines, with a focus on pre-processing, model conversion, selecting the appropriate inference engine for the target hardware and post-processing.
van Emden shows how Nx can simplify the developer’s life and facilitate a rapid transition from concept to production-ready applications.He provides valuable insights into developing scalable and efficient edge AI solutions, with a strong focus on practical implementation.
Dr. Sean Tan, Head of Data Science, Changi Airport Group
Discover how Changi Airport Group (CAG) leverages graph technologies and generative AI to revolutionize their search capabilities. This session delves into the unique search needs of CAG’s diverse passengers and customers, showcasing how graph data structures enhance the accuracy and relevance of AI-generated search results, mitigating the risk of “hallucinations” and improving the overall customer journey.
UiPath Test Automation using UiPath Test Suite series, part 5DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 5. In this session, we will cover CI/CD with devops.
Topics covered:
CI/CD with in UiPath
End-to-end overview of CI/CD pipeline with Azure devops
Speaker:
Lyndsey Byblow, Test Suite Sales Engineer @ UiPath, Inc.
Communications Mining Series - Zero to Hero - Session 1DianaGray10
This session provides introduction to UiPath Communication Mining, importance and platform overview. You will acquire a good understand of the phases in Communication Mining as we go over the platform with you. Topics covered:
• Communication Mining Overview
• Why is it important?
• How can it help today’s business and the benefits
• Phases in Communication Mining
• Demo on Platform overview
• Q/A
Sudheer Mechineni, Head of Application Frameworks, Standard Chartered Bank
Discover how Standard Chartered Bank harnessed the power of Neo4j to transform complex data access challenges into a dynamic, scalable graph database solution. This keynote will cover their journey from initial adoption to deploying a fully automated, enterprise-grade causal cluster, highlighting key strategies for modelling organisational changes and ensuring robust disaster recovery. Learn how these innovations have not only enhanced Standard Chartered Bank’s data infrastructure but also positioned them as pioneers in the banking sector’s adoption of graph technology.
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slackshyamraj55
Discover the seamless integration of RPA (Robotic Process Automation), COMPOSER, and APM with AWS IDP enhanced with Slack notifications. Explore how these technologies converge to streamline workflows, optimize performance, and ensure secure access, all while leveraging the power of AWS IDP and real-time communication via Slack notifications.
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
20140317 pi b_nmbe_journal_club
1. Towards an (European) Open
Biodiversity Knowledge
Management System
Donat Agosti (Plazi, Bern)
March 17, 2014
Berne, Journal Club @ NMBE
2. El Bulli: Cooking in Progress (2011) Ferran Adria (Actor), Gereon Wetzel (Director)
3. The cook (Ferran Adriá) wants to know when he can
expect what seafood for his kitchen.
He assumes that phenological data is open and
accessible to anyone.
He has a question and needs to know: What seafood
at what time?
His goal is to provide a service based on the use of
observation data, i.e. treat you (and make some
money).
4. The fishmonger knows when what seafood is
available.
He considers his knowledge of seafood phenology
as his asset to make money.
His goal is to make money with knowledge based
on observation records and understanding the
characteristics of seafood.
5. What do YOU want to know?
How do YOU expect to get to your information?
6. • What are the main online resources you use?
• Do you maintain your own digital library?
• Do you participate in an online project, eg
scratchpads, catalogue, digital archive and
make your data accessible?
• … ?
7. What does this mean?
Meredith Lane, e-biosphere Conference, London 2009
8. Hardisty, Nature 502, 171 (2013)
BUT: predictive ecology has substantial data needs
Harfoot, BIH2013, Rome, 2013
The big question
What is the future of the biological world?
Imagine if we could:
…Predict community level dynamics of ecosystems at
scales from local to global, based on the ecology and
biology of all individual organisms
9. Decentralized biodiversity infrastructure
Plants
3,400 Herbaria worldwide
10,000 Associate curators and specialists
350,000,000 specimens in collections
180,000,000 specimens digitized
2,000,000,000 specimens including animals
Source: gbif.org; http://sciweb.nybg.org/science2/IndexHerbariorum.asp
10. 200,000,000+ printed pages
1,900,000 species described
20,000,000+ species treatments
17,000 new species per year
Biodiversity libraries
BUT: The data are hidden
Incomplete digitization
Publications are
unstructured
Collections are incomplete
Data is not linked
Most data are not open
11. Nationaal Herbarium Nederland collection on GBIF
Source: http://www.gbif.org/dataset/7b33b040-f762-11e1-a439-00145eb45e9a
One collection’s view of the world
12. Another collection’s view of the world
http://www.gbif.org/dataset/82b0f51c-f762-11e1-a439-00145eb45e9a
13. What does this mean?
The Linking Open Data cloud diagram
Linked Open Data Cloud
14. Names as information tags in life sciences
Names
Characteristics
Publications
GenesCollections
Specimens
Distribution
15. The enhanced and linked treatments, extracted, stored on Plazi.org, and served in
a human readable form, are linked to the underlying data: Fisher & Smith, 2008,
PLoS ONE.
17. Coordination and Policy Development in Preparation for a
European Open Biodiversity Knowledge Management
System
Supported by the European Commission through its FP7 research funding programme
pro-iBiosphere
19. Create digital objects
+ Identifiers and resolvers
+ Open Access
+ Adequate infrastructure
+ Sustainable and permanent infrastructure
+ Reliable services for partners in research projects and society
Seamless Global Virtual Research Knowledge Management System
(European Open Biodiversity Knowledge Management System)
Biodiversity Knowledge Management System
20. Impact
Support reliable and permanent open access to digital biodiversity
records
Create identifiers and link biodiversity literature, collections, digital
objects, genes, etc.
Ensure global interoperability and sharing of biodiversity data,
information and knowledge
Provide new services in support of open science
Provide the ground for modelling biosphere
Develop data policies to harness the potential of open access
European Open Biodiversity Management System
The envisaged
will:
24. Treatment
A publication or section of a publication documenting the
features or distribution of a related group of organisms
(called a “taxon”, plural “taxa”) in ways adhering to highly
formalized conventions.
http://terms.tdwg.org/wiki/tp:taxon-treatment
Catapano, 2010.
35. The enhanced and linked treatments, extracted, stored on Plazi.org, and served in
a human readable form, are linked to the underlying data: Fisher & Smith, 2008,
PLoS ONE.
36. Penestomus egazini Miller, Haddad & Griswold, 2010
Progress
Treatments (% complete): 4/4 (100%)
Data summary
Specimen records:41
adult female
adult male
other
51%
2%
46%
Specimen collections
Institutions: 3
Distribution
Muséum National d'Histoire Naturelle, Paris
California Academy of Sciences, San Francisco
Albany Museum, Grahamstown
2%
5%
76%
20%
Countries
Lesotho
South Africa
Georeferenced materials citations
Export species materials citations (DwC)
Export treatment materials citations (DwC)
37. 0
2000
4000
6000
8000
10000
12000
14000
16000
18000
20000
Materials Citations Records by Researcher
Other
Donat Agosti
David Grimaldi
Toby Schuh
James Carpenter
Norman Platnick
American Museum of Natural History
Data summary
Materials citations 2004-2013:111,364
Distribution
Georeferenced materials citations
Export species materials citations (DwC)
MaterialsCitationsRecords
38. 0
500
1000
1500
2000
2500
Materials Citations Records by Institution
Other
Muséum National d'Histoire
Naturelle, Paris
Natural History Museum,
London
Museum of Comparative
Zoology
Smithsonian Institution
American Museum of Natural
History
Zootaxa
Data summary
Materials citations 2004-2013:11,476
Distribution
Georeferenced materials citations
Export species materials citations (DwC)
MaterialsCitationsRecords
43. Unified marked up final output
Taxon treatments, keys, images, localities
PROSPECTIVE PUBLISHING | HISTORICAL LITERATURE
Legacy and new taxonomic literature
Content management systems &
repositories (e.g., Plazi, EOL, GBIF, SCRATCHPADS, EDIT)
TaxPub XML schema
PENSOFT MARK UP tool
Marked up publications
PDF, HTML and XML
archiving
WIKI
Species-ID, Wikispecies
Wikipedia
Indexing (IPNI,
ZooBank, Myco-
Bank, GNA)
Aggregators
(EOL, GBIF)
Electronic
archives; Data
Centers
END
USERS
TaxonX schema
PLAZI’ GOLDEN GATE editor
Automated
submission; peer-
review
45. Access to ant taxonomic publications through antbase.org /Smithsonian Institution, including currently the entire
body of non-copyrighted publications since 1758 (>4,000 publications or 85,000 pages)
48. Before antbase.org, Harvard‘s Museum of
Comparative Zoology could claim to be the only
location with a complete set of ant systematics
publications from 1758 - present.
49. Before antbase.org, Harvard‘s Museum of
Comparative Zoology could claim to be the only
location with a complete set of ant systematics
publications from 1758 - present.
Through antbase.org‘s
digital library, access
to this body of
literature is worldwide,
and it is actively used
(>10,000 visits in one
month only).
52. • The free and open use of content, services and other digital resources
about biodiversity;
• Licenses that grant all users a free, irrevocable, world-wide, right to
copy, use, distribute, transmit and display the work publicly as well as
build on the work and making derivative works, subject to proper
attribution consistent with community practices;
• Policy developments that will foster free and open access to biodiversity
data;
• Tracking the use of information to ensure that sources and suppliers of
data are assigned credit for their contributions;
• An agreed infrastructure, standards and protocols to improve access to
and use of open data;
Bouchout Declaration, 2014 (1)
53. • Registers for content and services to allow discovery, access and use of
open data;
• Persistent, dereferenceable identifiers for data objects and physical
objects such as specimens, images and taxonomic treatments;
• Linking data using agreed vocabularies, both within and beyond
biodiversity, that enable participation in the Linked Open Data Cloud;
• Dialogue coordinated by the leading signatories to refine the concept,
priorities and technical requirements of Open Biodiversity Knowledge
Management.
• A sustainable Open Biodiversity Knowledge Management that is
attentive to scientific, sociological, legal, and financial aspects.
Bouchout Declaration, 2014 (2)
60. founded in 2008
Swiss based NGO with members in
Switzerland, Germany, Bulgaria, US and
Iran
research based think tank with the
mission to promote open access to
scientific content
five pillars: Legal advice,
technical innovations and solutions,
maintenance of a treatment repository
and Biowikifarm, consultancy, advocacy
66. founded in 2008
Swiss based NGO with members in
Switzerland, Germany, Bulgaria, US and
Iran
research based think tank with the
mission to promote open access to
scientific content
five pillars: Legal advice,
technical innovations and solutions,
maintenance of a treatment repository
and Biowikifarm, consultancy, advocacy
Plazi GmbH founded in 2012 as
service SME owned by Plazi
67. research based think tank with the
mission to promote open access to
scientific content
five pillars: Legal advice,
technical innovations and solutions,
maintenance of a treatment repository
and Biowikifarm, consultancy, advocacy
Plazi GmbH founded in 2012 as
service SME owned by Plazi
Funding from public donors, eg. EU,
corporate and private
69. five pillars: Legal advice, technical
innovations and solutions, maintenance
of a treatment repository and
Biowikifarm, consultancy, advocacy
Plazi GmbH founded in 2012 as
service SME owned by Plazi
Funding from public donors, eg. EU,
corporate and private
Clients are global
70. Consultancies and Services:
Consulting publishers on how to
produce XML semantically enhanced
output (eg. EJT, Zootaxa, Smithsonian
Institution)
Service to mark-up literature
71. http://plazi.org
Thank you very much!
Donat Agosti
agosti@plazi.org
This project is funded under the European Union's Seventh Framework
Programme (FP7/2007-2013) under grant agreement №312848.