The EC-funded agINFRA project implemented a Linked Data layer of vocabularies and data. This presentation gives an overview of the methodology and the outcomes.
Presented at the 4th RDA Plenary Meeting in Amsterdam on 22/09/2014.
agINFRA work on germplasm and soil Linked Data by Luca Matteus, Giovanni L’Ab...CIARD Movement
Presentation delivered at the Agricultural Data Interoperability Interest Group -- Research Data Alliance (RDA) 4th Plenary Meeting -- Amsterdam, September 2014
The RDF Report Card: Beyond the Triple CountLeigh Dodds
My talk from the Semtech Biz conference in London.
I argued that it is time to move beyond discussing size of datasets and encourage a more nuanced view to understand quality and utility.
The RDF Report Card is offered as one simple, high-level visualization.
Overview of how data on the Web of Data can be consumed (first and foremost Linked Data) and implications for the development of usage mining approaches.
References:
Elbedweihy, K., Mazumdar, S., Cano, A. E., Wrigley, S. N., & Ciravegna, F. (2011). Identifying Information Needs by Modelling Collective Query Patterns. COLD, 782.
Elbedweihy, K., Wrigley, S. N., & Ciravegna, F. (2012). Improving Semantic Search Using Query Log Analysis. Interacting with Linked Data (ILD 2012), 61.
Raghuveer, A. (2012). Characterizing machine agent behavior through SPARQL query mining. In Proceedings of the International Workshop on Usage Analysis and the Web of Data, Lyon, France.
Arias, M., Fernández, J. D., Martínez-Prieto, M. A., & de la Fuente, P. (2011). An empirical study of real-world SPARQL queries. arXiv preprint arXiv:1103.5043.
Hartig, O., Bizer, C., & Freytag, J. C. (2009). Executing SPARQL queries over the web of linked data (pp. 293-309). Springer Berlin Heidelberg.
Verborgh, R., Hartig, O., De Meester, B., Haesendonck, G., De Vocht, L., Vander Sande, M., ... & Van de Walle, R. (2014). Querying datasets on the web with high availability. In The Semantic Web–ISWC 2014 (pp. 180-196). Springer International Publishing.
Verborgh, R., Vander Sande, M., Colpaert, P., Coppens, S., Mannens, E., & Van de Walle, R. (2014, April). Web-Scale Querying through Linked Data Fragments. In LDOW.
Luczak-Rösch, M., & Bischoff, M. (2011). Statistical analysis of web of data usage. In Joint Workshop on Knowledge Evolution and Ontology Dynamics (EvoDyn2011), CEUR WS.
Luczak-Rösch, M. (2014). Usage-dependent maintenance of structured Web data sets (Doctoral dissertation, Freie Universität Berlin, Germany), http://edocs.fu-berlin.de/diss/receive/FUDISS_thesis_000000096138.
agINFRA work on germplasm and soil Linked Data by Luca Matteus, Giovanni L’Ab...CIARD Movement
Presentation delivered at the Agricultural Data Interoperability Interest Group -- Research Data Alliance (RDA) 4th Plenary Meeting -- Amsterdam, September 2014
The RDF Report Card: Beyond the Triple CountLeigh Dodds
My talk from the Semtech Biz conference in London.
I argued that it is time to move beyond discussing size of datasets and encourage a more nuanced view to understand quality and utility.
The RDF Report Card is offered as one simple, high-level visualization.
Overview of how data on the Web of Data can be consumed (first and foremost Linked Data) and implications for the development of usage mining approaches.
References:
Elbedweihy, K., Mazumdar, S., Cano, A. E., Wrigley, S. N., & Ciravegna, F. (2011). Identifying Information Needs by Modelling Collective Query Patterns. COLD, 782.
Elbedweihy, K., Wrigley, S. N., & Ciravegna, F. (2012). Improving Semantic Search Using Query Log Analysis. Interacting with Linked Data (ILD 2012), 61.
Raghuveer, A. (2012). Characterizing machine agent behavior through SPARQL query mining. In Proceedings of the International Workshop on Usage Analysis and the Web of Data, Lyon, France.
Arias, M., Fernández, J. D., Martínez-Prieto, M. A., & de la Fuente, P. (2011). An empirical study of real-world SPARQL queries. arXiv preprint arXiv:1103.5043.
Hartig, O., Bizer, C., & Freytag, J. C. (2009). Executing SPARQL queries over the web of linked data (pp. 293-309). Springer Berlin Heidelberg.
Verborgh, R., Hartig, O., De Meester, B., Haesendonck, G., De Vocht, L., Vander Sande, M., ... & Van de Walle, R. (2014). Querying datasets on the web with high availability. In The Semantic Web–ISWC 2014 (pp. 180-196). Springer International Publishing.
Verborgh, R., Vander Sande, M., Colpaert, P., Coppens, S., Mannens, E., & Van de Walle, R. (2014, April). Web-Scale Querying through Linked Data Fragments. In LDOW.
Luczak-Rösch, M., & Bischoff, M. (2011). Statistical analysis of web of data usage. In Joint Workshop on Knowledge Evolution and Ontology Dynamics (EvoDyn2011), CEUR WS.
Luczak-Rösch, M. (2014). Usage-dependent maintenance of structured Web data sets (Doctoral dissertation, Freie Universität Berlin, Germany), http://edocs.fu-berlin.de/diss/receive/FUDISS_thesis_000000096138.
Rapid Digitization of Latin American Ephemera with HydraJon Stroop
Princeton University Library began to collect and build an archive of Latin American ephemera and gray literature in the mid 1970s to document the activities of political and social organizations and movements, as well as the broader political, socioeconomic and cultural developments of the region. Access to the material was provided by slowly accumulating and organizing thematic sub-collections, creating finding aids, and microfilming selected curated sub-collections. Reproductions of the microfilm were commercially distributed and resulting royalties were used to fund new acquisitions. That model gradually become unsustainable during the past decade and microfilming was halted in 2008.
Hydra breathes new life into this project by providing us with a framework for creating an end-to-end application that will facilitate rapid digitization, cataloging, and access to this important collection. Since the system went into production in April of 2014, nearly 1500 items have been cataloged, with the throughput rate ultimately accelerating to over 300 items per month in August.
Quickly re-publish CSV/TSV files from existing repositories as FAIR Data with just a few mouse clicks!
You select the columns to "project" as Linked Data, and the associated ontology terms. The FAIR Projector Builder will create a FAIR Projector for you: a Triple Pattern Fragment server to provide the Linked Data; a published DCAT Distribution containing metadata about those triples and their source; and an RML model (syntactic and semantic of the triples, to aid in third-party discovery of this novel projection.
(current status - first prototype, not ready for public consumption)
-------
Thanks to the NBDC/DBCLS for sponsoring the hackathon series.
MDW also funded by Ministerio de Economía y Competitividad grant number TIN2014-55993-RM
247th ACS Meeting: The Eureka Research WorkbenchStuart Chalk
Academic scientists need a tool to capture the science they do so that it can be shared in open science, integrated with linked data, and shared/searched. Eureka is an evolving platform to do this.
An exploration of a possible pipeline for RDF datasets from Timbuctoo instances to the digital archive EASY.
- Get, verify, ingest archive and disseminate (linked) data and metadata.
- What are the implications for an archive: serving linked data over (longer periods of) time
- Practical stuff.
Presentation delivered in the context of the Agricultural Data Interoperability WG meeeting, during the RDA 3rd Plenary Meeting in Dublin, Ireland. 26/3/2014.
The presentation is mostly focused on the work done by the agINFRA project towards proposing a methodology for the definition of Germplasm descriptors as RDF, based on the existing work of experts in the field and making use of the existing effort in this direction.
Tech. session : Interoperability and Data FAIRness emerges from a novel combi...Mark Wilkinson
My presentation to OAI10 - CERN - UNIGE Workshop on Innovations in Scholarly Communication, 21-23 June 2017
University of Geneva.
https://indico.cern.ch/event/405949/contributions/2487823/
A description of the FAIR Accessor and FAIR Projector technologies: REST-compliant approaches to publishing FAIR Metadata and FAIR Data (respectively)
Spanish Ministerio de Economía y Competitividad TIN2014-55993-R
The nature.com ontologies portal: nature.com/ontologiesTony Hammond
Presentation by Tony Hammond and Michele Pasin to Linked Science workshop, co-located with International Semantic Web Conference (ISWC) 2015, on October 12, 2015
nanopub-java: A Java Library for NanopublicationsTobias Kuhn
The concept of nanopublications was first proposed about six years ago, but it lacked openly available implementations. The library presented here is the first one that has become an official implementation of the nanopublication community. Its core features are stable, but it also contains unofficial and experimental extensions: for publishing to a decentralized server network, for defining sets of nanopublications with indexes, for informal assertions, and for digitally signing nanopublications. Most of the features of the library can also be accessed via an online validator interface.
Sharing data with lightweight data standards, such as schema.org and bioschemas. The Knetminer case, an application for the agrifood domain and molecular biology.
Presented at Open Data Sicilia (#ODS2021)
Data Integration is a key part of many of today’s data management challenges: from data warehousing, to MDM, to mergers & acquisitions. Issues can arise not only in trying to align technical formats from various databases and legacy systems, but in trying to achieve common business definitions and rules.
Join this webinar to see how a data model can help with both of these challenges – from ‘bottom-up’ technical integration, to the ‘top-down’ business alignment.
Rapid Digitization of Latin American Ephemera with HydraJon Stroop
Princeton University Library began to collect and build an archive of Latin American ephemera and gray literature in the mid 1970s to document the activities of political and social organizations and movements, as well as the broader political, socioeconomic and cultural developments of the region. Access to the material was provided by slowly accumulating and organizing thematic sub-collections, creating finding aids, and microfilming selected curated sub-collections. Reproductions of the microfilm were commercially distributed and resulting royalties were used to fund new acquisitions. That model gradually become unsustainable during the past decade and microfilming was halted in 2008.
Hydra breathes new life into this project by providing us with a framework for creating an end-to-end application that will facilitate rapid digitization, cataloging, and access to this important collection. Since the system went into production in April of 2014, nearly 1500 items have been cataloged, with the throughput rate ultimately accelerating to over 300 items per month in August.
Quickly re-publish CSV/TSV files from existing repositories as FAIR Data with just a few mouse clicks!
You select the columns to "project" as Linked Data, and the associated ontology terms. The FAIR Projector Builder will create a FAIR Projector for you: a Triple Pattern Fragment server to provide the Linked Data; a published DCAT Distribution containing metadata about those triples and their source; and an RML model (syntactic and semantic of the triples, to aid in third-party discovery of this novel projection.
(current status - first prototype, not ready for public consumption)
-------
Thanks to the NBDC/DBCLS for sponsoring the hackathon series.
MDW also funded by Ministerio de Economía y Competitividad grant number TIN2014-55993-RM
247th ACS Meeting: The Eureka Research WorkbenchStuart Chalk
Academic scientists need a tool to capture the science they do so that it can be shared in open science, integrated with linked data, and shared/searched. Eureka is an evolving platform to do this.
An exploration of a possible pipeline for RDF datasets from Timbuctoo instances to the digital archive EASY.
- Get, verify, ingest archive and disseminate (linked) data and metadata.
- What are the implications for an archive: serving linked data over (longer periods of) time
- Practical stuff.
Presentation delivered in the context of the Agricultural Data Interoperability WG meeeting, during the RDA 3rd Plenary Meeting in Dublin, Ireland. 26/3/2014.
The presentation is mostly focused on the work done by the agINFRA project towards proposing a methodology for the definition of Germplasm descriptors as RDF, based on the existing work of experts in the field and making use of the existing effort in this direction.
Tech. session : Interoperability and Data FAIRness emerges from a novel combi...Mark Wilkinson
My presentation to OAI10 - CERN - UNIGE Workshop on Innovations in Scholarly Communication, 21-23 June 2017
University of Geneva.
https://indico.cern.ch/event/405949/contributions/2487823/
A description of the FAIR Accessor and FAIR Projector technologies: REST-compliant approaches to publishing FAIR Metadata and FAIR Data (respectively)
Spanish Ministerio de Economía y Competitividad TIN2014-55993-R
The nature.com ontologies portal: nature.com/ontologiesTony Hammond
Presentation by Tony Hammond and Michele Pasin to Linked Science workshop, co-located with International Semantic Web Conference (ISWC) 2015, on October 12, 2015
nanopub-java: A Java Library for NanopublicationsTobias Kuhn
The concept of nanopublications was first proposed about six years ago, but it lacked openly available implementations. The library presented here is the first one that has become an official implementation of the nanopublication community. Its core features are stable, but it also contains unofficial and experimental extensions: for publishing to a decentralized server network, for defining sets of nanopublications with indexes, for informal assertions, and for digitally signing nanopublications. Most of the features of the library can also be accessed via an online validator interface.
Sharing data with lightweight data standards, such as schema.org and bioschemas. The Knetminer case, an application for the agrifood domain and molecular biology.
Presented at Open Data Sicilia (#ODS2021)
Data Integration is a key part of many of today’s data management challenges: from data warehousing, to MDM, to mergers & acquisitions. Issues can arise not only in trying to align technical formats from various databases and legacy systems, but in trying to achieve common business definitions and rules.
Join this webinar to see how a data model can help with both of these challenges – from ‘bottom-up’ technical integration, to the ‘top-down’ business alignment.
A global linked and open data infrastructure for agricultural developmentValeria Pesce
Valeria Pesce, FAO / GFAR, Italy.
Presented at the BigDataEurope workshop on Societal Challenge 2 "food and agriculture", held in Paris at INRA on 22 September 2015.
How to describe a dataset. Interoperability issuesValeria Pesce
Presented by Valeria Pesce during the pre-meeting of the Agricultural Data Interoperability Interest Group (IGAD) of the Research Data Alliance (RDA), held on 21 and 22 September 2015 in Paris at INRA.
Sharing Agricultural Events Information: When and where is that workshop?Gauri Salokhe
In the last few years a strong need has emerged for a standard way to interchange various types of information, such as on organizations, projects, experts, events and news, in the agricultural community. This paper focuses on the metadata set for events, the Agricultural Events Application Profile (Ag-Events AP), created specifically to enhance description, exchange and reuse of information on events. The Ag-Events AP provides a minimum interoperability layer through which information about upcoming events related to agriculture can be described, shared and reused. The Ag-Events AP was developed by FAO, in collaboration with its partners, the Global Forum on Agricultural Research (GFAR) and Global Forest Information Service (GFIS), to offer a “minimum” set of metadata elements necessary to share quality descriptions about events. This paper talks about the work done on creating the AP, its use in various applications and the next steps.
The path to a Modern Data Architecture in Financial ServicesHortonworks
Delivering Data-Driven Applications at the Speed of Business: Global Banking AML use case.
Chief Data Officers in financial services have unique challenges: they need to establish an effective data ecosystem under strict governance and regulatory requirements. They need to build the data-driven applications that enable risk and compliance initiatives to run efficiently. In this webinar, we will discuss the case of a global banking leader and the anti-money laundering solution they built on the data lake. With a single platform to aggregate structured and unstructured information essential to determine and document AML case disposition, they reduced mean time for case resolution by 75%. They have a roadmap for building over 150 data-driven applications on the same search-based data discovery platform so they can mitigate risks and seize opportunities, at the speed of business.
Big Data Integration & Analytics Data Flows with AWS Data Pipeline (BDT207) |...Amazon Web Services
AWS offers many data services, each optimized for a specific set of structure, size, latency, and concurrency requirements. Making the best use of all specialized services has historically required custom, error-prone data transformation and transport. Now, users can use the AWS Data Pipeline service to orchestrate data flows between Amazon S3, Amazon RDS, Amazon DynamoDB, Amazon Redshift, and on-premise data stores, seamlessly and efficiently applying EC2 instances and EMR clusters to process and transform data. In this session, we demonstrate how you can use AWS Data Pipeline to coordinate your Big Data workflows, applying the optimal data storage technology to each part of your data integration architecture. Swipely's Head of Engineering shows how Swipely uses AWS Data Pipeline to build batch analytics, backfilling all their data, while using resources efficiently. Consequently, Swipely launches novel product features with less development time and less operational complexity.
Microsoft Technologies for Data Science 201612Mark Tabladillo
Delivered to SQL Saturday BI Edition -- Atlanta, GA
Microsoft provides several technologies in and around Azure which can be used for casual to serious data science. This presentation provides an overview of the major Microsoft options for both on-premise and cloud-based data science (and hybrid). These technologies have been used by the presenter in various companies and industries, both as a Microsoft consultant and previously independent consultant. As well, the speaker provides insights into data science careers, information which helps imply where the business will likely be for consultants and partners.
A presentation by Gordon Dunsire.
Delivered at the Cataloguing and Indexing Group Scotland (CIGS) Linked Open Data (LOD) Conference which took place Fri 21 September 2012 at the Edinburgh Centre for Carbon Innovation.
Exposing Bibliographic Information as Linked Open Data using Standards-based ...Nikolaos Konstantinou
The Linked Open Data (LOD) movement is constantly gaining worldwide acceptance. In this paper we describe how LOD is generated in the case of digital repositories that contain bibliographic information, adopting international standards. The available options and respective choices are presented and justified while we also provide a technical description, the methodology we followed, the possibilities and difficulties in the way, and the respective benefits and drawbacks. Detailed examples are provided regarding the implementation and query capabilities, and the paper concludes after a discussion over the results and the challenges associated with our approach, and our most important observations and future plans.
IBC FAIR Data Prototype Implementation slideshowMark Wilkinson
Discussion about ways of achieving FAIRness of both metadata and data. Brute force approaches, and more elegant "projection" approaches are shown.
Relevant papers are at:
doi: 10.7717/peerj-cs.110 (https://peerj.com/articles/cs-110/)
doi: 10.3389/fpls.2016.00641 (https://doi.org/10.3389/fpls.2016.00641)
Spanish Ministerio de Economía y Competitividad grant number TIN2014-55993-R
A guide for finding scientific and technical grey literature. Topics include search engines and tools, repositories and subject specific databases. Emphasis on open access/low cost resources.
re3data.org – Registry of Research Data RepositoriesHeinz Pampel
Heinz Pampel | GFZ German Research Centre for Geosciences, LIS
Maxi Kindling | Humboldt-Universität zu Berlin, Berlin School of Library and Information Science Frank Scholze | Karlsruhe Institute of Technology, KIT Library
RDA-Deutschland-Treffen 2015| Potsdam, November 26, 2015
FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...Mark Wilkinson
This slide deck accompanies the manuscript "Interoperability and FAIRness through a novel combination of Web technologies", submitted to PeerJ Computer Science: https://doi.org/10.7287/peerj.preprints.2522v1
It describes the output of the "Skunkworks" FAIR implementation group, who were tasked with building a prototype infrastructure that would fulfill the FAIR Principles for scholarly data publishing. We show how a novel combination of the Linked Data Platform, RDF Mapping Language (RML) and Triple Pattern Fragments (TPF) can be combined to create a scholarly publishing infrastructure that is markedly interoperable, at both the metadata and the data level.
This slide deck (or something close) will be presented at the Dutch Techcenter for Life Sciences Partners Workshop, November 4, 2016.
Spanish Ministerio de Economía y Competitividad grant number TIN2014-55993-R
ARIADNE: Report on project metadata standards and thesauri in useariadnenetwork
D3.2 Describes the metadata standards and thesauri already used within the consortium and the approach for integrating these into the registry through a rich ontology such as CIDOC-CRM.
Authors: Paola Ronzino, Kate Fernie, Christos Papatheodorou, Holly Wright and Julian Richards
FAIR Workflows and Research Objects get a Workout Carole Goble
So, you want to build a pan-national digital space for bioscience data and methods? That works with a bunch of pre-existing data repositories and processing platforms? So you can share FAIR workflows and move them between services? Package them up with data and other stuff (or just package up data for that matter)? How? WorkflowHub (https://workflowhub.eu) and RO-Crate Research Objects (https://www.researchobject.org/ro-crate) that’s how! A step towards FAIR Digital Objects gets a workout.
Presented at DataVerse Community Meeting 2021
The new CIARD RING, a machine-readable directory of datasets for agricultureValeria Pesce
The CIARD RING, a global directory of datasets for agriculture, has been enhanced during the EC-funded agINFRA project. It has become a Linked Data hub that can be queried by other applications.
Presented at the 4th RDA Plenary Meeting in Amsterdam on 22/09/2014.
Codes of conduct for farm data sharing. Work done and ideas for a GODAN/CTA s...Valeria Pesce
Presentation given at the GODAN / CTA / KTBL workshop on "On legal and policy aspects of open data in agriculture affecting farmers" hed in Darmstadt on 25-26 July 2019.
Digital agriculture: ICT-amplified data asymmetries and power imbalances. Pol...Valeria Pesce
Presentation given at the World Bank / Georgetown University workshop "Digital Acceleration of Agricultural Transformation", 25 June 2019, Washington D.C.
Presentation prepared for CTA, for the CTA/AgriCord/PAFO workshop on "Building the next generation of farmers
Supporting capacity-development of African Farmers Organisations through improved Policies, Technologies and Capabilities", Brussels, 6-7 November 2018 .
Publishing Germplasm Vocabularies as Linked DataValeria Pesce
What has already been published?
What may still be needed?
How to do it?
This presentation is a part of the 3rd Session of the 1st International e-Conference on Germplasm Data Interoperability https://sites.google.com/site/germplasminteroperability/
Developing Agricultural Research Information Systems. The experience of the G...Valeria Pesce
Presented by Ajit Maru at the WCCA Congress 2010 held in Reno, US.
The role of the Global Forum has been that of fostering and supporting the development of agricultural research information systems at national, regional and global levels. This cross-stakeholder vision has allowed GFAR to have a privileged perspective on the evolution of the approach to building ARD information systems.
Information / software architectures based on Content Management Systems (CMS)Valeria Pesce
Presented at the WORKSHOP ON INFORMATION SYSTEMS ARCHITECTURES held on 25th April 2010 as a side event of the IAALD 13th Congress in Montpeller, France, 26-29 April 2010.
Information on the workshop:
http://aims.fao.org/events/ciard-workshop-information-systems-architectures-iaald-2010
The CIARD RING, an infrastructure for interoperability of agricultural resear...Valeria Pesce
Presented at the IAALD 13th Congress held in Montpellier, France, 26-29 April 2010.
Abstract: "Creating integrated information services in agriculture giving access and adding value to information residing in distributed sources remains a major challenge.
In distributed architectures, value added services by definition interface several information sources / services. Therefore value added services cannot be built without an awareness of what others have done: which sources are available, how to tap into them, how to exploit their semantics.
The Coherence in Information for Agricultural Research for Development (CIARD) Routemap to Information Nodes and Gateways (RING) is a portal offering an interlinked registry of existing information services in agriculture..."
The CIARD RING is available at:
http://ring.ciard.net
This presentation is dated May 2009.
The CIARD RING has since become available at:
http://ring.ciard.net
A more up-to-date presentation is available here:
http://www.slideshare.net/valeriap/the-ciard-ring-an-infrastructure-for-interoperability-of-agricultural-research-information-services
Presented at the IAALD-AFITA-WCCA Conference held in Atsugi (Japan) in August 2008.
The WebRing concept has evolved in the meantime and the resulting service is the CIARD RING, available at: http://ring.ciard.net
A more up-to-date presentation is available here:
http://www.slideshare.net/valeriap/the-ciard-ring-an-infrastructure-for-interoperability-of-agricultural-research-information-services
UiPath Test Automation using UiPath Test Suite series, part 5DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 5. In this session, we will cover CI/CD with devops.
Topics covered:
CI/CD with in UiPath
End-to-end overview of CI/CD pipeline with Azure devops
Speaker:
Lyndsey Byblow, Test Suite Sales Engineer @ UiPath, Inc.
Climate Impact of Software Testing at Nordic Testing DaysKari Kakkonen
My slides at Nordic Testing Days 6.6.2024
Climate impact / sustainability of software testing discussed on the talk. ICT and testing must carry their part of global responsibility to help with the climat warming. We can minimize the carbon footprint but we can also have a carbon handprint, a positive impact on the climate. Quality characteristics can be added with sustainability, and then measured continuously. Test environments can be used less, and in smaller scale and on demand. Test techniques can be used in optimizing or minimizing number of tests. Test automation can be used to speed up testing.
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionAggregage
Join Maher Hanafi, VP of Engineering at Betterworks, in this new session where he'll share a practical framework to transform Gen AI prototypes into impactful products! He'll delve into the complexities of data collection and management, model selection and optimization, and ensuring security, scalability, and responsible use.
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
Pushing the limits of ePRTC: 100ns holdover for 100 daysAdtran
At WSTS 2024, Alon Stern explored the topic of parametric holdover and explained how recent research findings can be implemented in real-world PNT networks to achieve 100 nanoseconds of accuracy for up to 100 days.
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfMalak Abu Hammad
Discover how MongoDB Atlas and vector search technology can revolutionize your application's search capabilities. This comprehensive presentation covers:
* What is Vector Search?
* Importance and benefits of vector search
* Practical use cases across various industries
* Step-by-step implementation guide
* Live demos with code snippets
* Enhancing LLM capabilities with vector search
* Best practices and optimization strategies
Perfect for developers, AI enthusiasts, and tech leaders. Learn how to leverage MongoDB Atlas to deliver highly relevant, context-aware search results, transforming your data retrieval process. Stay ahead in tech innovation and maximize the potential of your applications.
#MongoDB #VectorSearch #AI #SemanticSearch #TechInnovation #DataScience #LLM #MachineLearning #SearchTechnology
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofsAlex Pruden
This paper presents Reef, a system for generating publicly verifiable succinct non-interactive zero-knowledge proofs that a committed document matches or does not match a regular expression. We describe applications such as proving the strength of passwords, the provenance of email despite redactions, the validity of oblivious DNS queries, and the existence of mutations in DNA. Reef supports the Perl Compatible Regular Expression syntax, including wildcards, alternation, ranges, capture groups, Kleene star, negations, and lookarounds. Reef introduces a new type of automata, Skipping Alternating Finite Automata (SAFA), that skips irrelevant parts of a document when producing proofs without undermining soundness, and instantiates SAFA with a lookup argument. Our experimental evaluation confirms that Reef can generate proofs for documents with 32M characters; the proofs are small and cheap to verify (under a second).
Paper: https://eprint.iacr.org/2023/1886
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfPaige Cruz
Monitoring and observability aren’t traditionally found in software curriculums and many of us cobble this knowledge together from whatever vendor or ecosystem we were first introduced to and whatever is a part of your current company’s observability stack.
While the dev and ops silo continues to crumble….many organizations still relegate monitoring & observability as the purview of ops, infra and SRE teams. This is a mistake - achieving a highly observable system requires collaboration up and down the stack.
I, a former op, would like to extend an invitation to all application developers to join the observability party will share these foundational concepts to build on:
In the rapidly evolving landscape of technologies, XML continues to play a vital role in structuring, storing, and transporting data across diverse systems. The recent advancements in artificial intelligence (AI) present new methodologies for enhancing XML development workflows, introducing efficiency, automation, and intelligent capabilities. This presentation will outline the scope and perspective of utilizing AI in XML development. The potential benefits and the possible pitfalls will be highlighted, providing a balanced view of the subject.
We will explore the capabilities of AI in understanding XML markup languages and autonomously creating structured XML content. Additionally, we will examine the capacity of AI to enrich plain text with appropriate XML markup. Practical examples and methodological guidelines will be provided to elucidate how AI can be effectively prompted to interpret and generate accurate XML markup.
Further emphasis will be placed on the role of AI in developing XSLT, or schemas such as XSD and Schematron. We will address the techniques and strategies adopted to create prompts for generating code, explaining code, or refactoring the code, and the results achieved.
The discussion will extend to how AI can be used to transform XML content. In particular, the focus will be on the use of AI XPath extension functions in XSLT, Schematron, Schematron Quick Fixes, or for XML content refactoring.
The presentation aims to deliver a comprehensive overview of AI usage in XML development, providing attendees with the necessary knowledge to make informed decisions. Whether you’re at the early stages of adopting AI or considering integrating it in advanced XML development, this presentation will cover all levels of expertise.
By highlighting the potential advantages and challenges of integrating AI with XML development tools and languages, the presentation seeks to inspire thoughtful conversation around the future of XML development. We’ll not only delve into the technical aspects of AI-powered XML development but also discuss practical implications and possible future directions.
Removing Uninteresting Bytes in Software FuzzingAftab Hussain
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux libraries -- Libxml's xmllint, a tool for parsing xml documents, and Binutil's readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format). Our preliminary results show that AFL+DIAR does not only discover new paths more quickly but also achieves higher coverage overall. This work thus showcases how starting with lean and optimized seeds can lead to faster, more comprehensive fuzzing campaigns -- and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
Essentials of Automations: The Art of Triggers and Actions in FMESafe Software
In this second installment of our Essentials of Automations webinar series, we’ll explore the landscape of triggers and actions, guiding you through the nuances of authoring and adapting workspaces for seamless automations. Gain an understanding of the full spectrum of triggers and actions available in FME, empowering you to enhance your workspaces for efficient automation.
We’ll kick things off by showcasing the most commonly used event-based triggers, introducing you to various automation workflows like manual triggers, schedules, directory watchers, and more. Plus, see how these elements play out in real scenarios.
Whether you’re tweaking your current setup or building from the ground up, this session will arm you with the tools and insights needed to transform your FME usage into a powerhouse of productivity. Join us to discover effective strategies that simplify complex processes, enhancing your productivity and transforming your data management practices with FME. Let’s turn complexity into clarity and make your workspaces work wonders!
“An Outlook of the Ongoing and Future Relationship between Blockchain Technologies and Process-aware Information Systems.” Invited talk at the joint workshop on Blockchain for Information Systems (BC4IS) and Blockchain for Trusted Data Sharing (B4TDS), co-located with with the 36th International Conference on Advanced Information Systems Engineering (CAiSE), 3 June 2024, Limassol, Cyprus.
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
1. The agINFRA Linked Data layer
Valeria Pesce
Global Forum on Agricultural Research (GFAR)
Giovanni L’Abate
Consiglio per la Ricerca e la sperimentazione in Agricoltura
Centro di ricerca per l’agrobiologia e la pedologia (CRA-ABP)
Luca Matteis
Koraljka Golub
Research Data Alliance 4th Plenary Meeting
22-24 September 2014, Amsterdam
Agricultural Data Interoperability Interest Group
agINFRA project
EC 7th framework program INFRA-2011-1.2.2 - Grant agr. no: 283770
2. agINFRA - Background
• agINFRA: FP7 project
EC 7th framework program INFRA-2011-1.2.2 - Grant agr. no: 283770
Objective: Promoting data sharing and development
of trust in agricultural sciences
• agINFRA Knowledge Fair co-located with the
Agricultural Data Interoperability Interest group
meeting at the RDA 4th Plenary Meeting
• Types of data covered by agINFRA:
bibliographic, educational, germplasm, soil
3. Interoperability 1
They are often both called vocabularies
Metadata elements
to describe individual “things” (entities, datums, series…)
Aka metadata sets, metadata element sets, vocabularies
Sets of values
for (some of) the metadata elements
Aka controlled vocabularies, authority data, value
vocabularies, Knowledge Organization Systems (KOSs)
4. Various flavors of vocabularies
Title
Author(s)
Abstract
Subject(s)
Publication date
Publication place
Type of document
other features…
Entity to be describedType?
Bibliographic
resource
for describing
bibliographic
resources
Metadata
vocabulary
Authority data
KOS
“Value vocabularies”
Data of type Person
Authority data
Data of type
Geographic location
“Description
vocabularies”
Controlled list
Concepts suitable for
organizing by Topic
Concepts suitable for
organizing by Type
for describing
people
Metadata
vocabularyfor describing
geographic places
Ontology
5. Names of
things
URIs of
things
Links to
other URIs
Metadata vocabularies RDFs / OWL
KOS SKOS
Names of
metadata
elements
URIs of
classes and
properties
Links to
other URIs
Serialization into RDF
Interoperability 2: RDF and Linked Data
http://purl.org/dc/elements/1.1/contributorhttp://purl.org/ontology/bibo/editor“Editor”
rdfs:subPropertyOf
http://aims.fao.org/aos/agrovoc/c_6599 http://id.loc.gov/authorities/sh85113862#concept“Rice”
skos:exactMatch
6. Names of metadata elements URIs of classes and properties
Databases /
tables / series
Names of things URIs of things Links to other URIs
http://vocabularies.aginfra.eu/soil#isObservedOnLocation
• Then, other complex things like
“URI de-referencing” and “content negotiation”…
(some good triple store platforms do it out of the box)
Interoperability 2: RDF and Linked Data
http://purl.org/ontology/bibo/editor“Editor”
https://aginfra-
sg.ct.infn.it/rdf/cncp/resource/ObservedSoilSite/
16.4CLcch1-1
http://vocabularies.aginfra.eu/soil#Obs
ervedSoilProfile
“Observed soil
16.4CLcch1”
rdfs:type
Serialization into RDF
“Observed in location”
7. Tools used in agINFRA
• For building and managing SKOS:
the FAO VocBench
• For publishing KOSs as Linked Data:
SKOS loaded into Allegrograph
• For building and publishing RDF vocabularies:
Neologism
• For publishing data as Linked Data:
D2RQ from database to RDF
> mapping to published classes and properties
Links are provided in the last slide
8. Linked Data in agINFRA
• Linked Data Vocabularies
– Reference to existing relevant RDF vocabularies and SKOS
– New RDF vocabularies only when not existing (e.g. soil
ontology)
– New KOS only when:
• Not existing
• Mapping needed between local concepts and published concepts
• Extension needed
• Linked Data datasets
– Bibliographic data: AGRIS triple store
– Germplasm data:
• CAAS Linked Data API (presented later)
• CRA triple store (presented later)
– Soil data: CRA triple store (presented later)
10. URLs of agINFRA Linked Data
vocabulary platforms
• agINFRA overview of vocabularies:
http://vocabularies.aginfra.eu
• New agINFRA Soil Vocabulary:
http://vocabularies.aginfra.eu/soil#
• VocBench instances:
http://202.73.13.50:55481/aginfra/
http://artemide.art.uniroma2.it/vocbench2
• Allegrograph triple store of agINFRA KOSs:
http://202.45.139.84:10035/catalogs/fao/repositories/ag
INFRA
11. Namespaces of agINFRA new
vocabularies
• agINFRA Soil vocabulary:
http://vocabularies.aginfra.eu/soil#
• CRA Soil Terms:
http://data.entecra.it/rdf/kos/soil/ or
http://soilmaps.entecra.it/rdf/kos/soil/
• CRA Germplasm Terms:
http://data.entecra.it/rdf/germplasm/soil/ or
http://planta-res.entecra.it/rdf/kos/germplasm/
• agINFRA Resource Types Terms:
http://aginfra.eu/voc/aginfra_doctypes/
• agINFRA Educational Resources Terms:
http://aginfra.eu/voc/aginfra_eduterms/
12. Example 1: the Soil Terms KOS
Rationale:
• CRA had local lists of values for several soil
properties
• In most cases those values mapped
conceptually with terms in published KOSs
Local values published as new KOS with
mappings to USDA Soil Taxonomy terms and/or
WRB whenever possible
15. Example 2: Resource types in AGRIS
Starting from a table
Concept Type of Concept Relationship among concepts
Bibliography Top broadMatch http://purl.org/dc/dcmitype/Text
Book Top NT Handbook/Manual broadMatch http://purl.org/dc/dcmitype/Text
Conference BT Event broadMatch http://purl.org/dc/dcmitype/Event
Dictionary Top broadMatch http://purl.org/dc/dcmitype/Text
Directory Top relatedTerm http://purl.org/dc/dcmitype/Collection
Drawing BT Image broadMatch http://purl.org/dc/dcmitype/Image
Encyclopaedia Top broadMatch http://purl.org/dc/dcmitype/Text
Event Top NT Conference exactMatch http://purl.org/dc/dcmitype/Event
Extension Top
Film Top broadMatch http://purl.org/dc/dcmitype/MovingImage
Graphics BT Image broadMatch http://purl.org/dc/dcmitype/Image
Handbook/Manual BT Book broadMatch http://purl.org/dc/dcmitype/Text
Image Top NT Drawing, NT Graphics, NT MapsorAtlases exactMatch http://purl.org/dc/dcmitype/Image
JournalArticle Top NT Preprint broadMatch http://purl.org/dc/dcmitype/Text
Lit.Review Top broadMatch http://purl.org/dc/dcmitype/Text
Manuscript Top broadMatch http://purl.org/dc/dcmitype/Text
MapsorAtlases BT Image broadMatch http://purl.org/dc/dcmitype/Image
News Top
Non-Conventional Top
NumericalData Top broadMatch http://purl.org/dc/dcmitype/Dataset
Other Top
Patent Top broadMatch http://purl.org/dc/dcmitype/Text
Preprint BT JournalArticle broadMatch http://purl.org/dc/dcmitype/Text
Report Top broadMatch http://purl.org/dc/dcmitype/Text
Sound/Music Top broadMatch http://purl.org/dc/dcmitype/Sound
Speech Top broadMatch http://purl.org/dc/dcmitype/Sound
Standard Top broadMatch http://purl.org/dc/dcmitype/Text
Summary Top broadMatch http://purl.org/dc/dcmitype/Text
Thesaurus Top broadMatch http://purl.org/dc/dcmitype/Text
Thesis Top broadMatch http://purl.org/dc/dcmitype/Text
Website Top
External mapping
19. agINFRA LOD data
19
Germplasm data
http://[CAAS-API-base-URL]/germplasm/rest
https://aginfra-sg.ct.infn.it/rdf/cncp/
CIARD RING
Existing
datasets
CAAS
CRA
New agINFRA datasets
http://ring.ciard.net
DATASETS agINFRA shop? CIARD directories
Germplasm data
Soil data
API
https://aginfra-sg.ct.infn.it/rdf/... ?? CRA Triple store
CRA Triple store
CKAN
CKAN
Dataverse
AGRIS
GLN
20. Namespaces of agINFRA
partners’ Linked Data
• Sustainability namespaces with the data
owners
• CRA data: data.entecra.it (presented later)
– http://data.entecra.it/rdf/soil/
temporarily at https://aginfra-sg.ct.infn.it/rdf/cncp/
– http://data.entecra.it/rdf/germplasm/
temporarily at http://93.63.35.32:8080/d2rq/
• CAAS data (presented later)
– API: http://www.cgris.net/pquery.asp
• AGRIS data
– http://agris.fao.org/aos/
20