This document summarizes a talk on common errors found in Linked Data. It discusses several types of errors discovered through analyzing over 150,000 RDF documents, including HTTP-level issues like URIs that don't return RDF descriptions, inaccurate content-type reporting, and duplicate content served at different URIs. It also describes reasoning issues such as undefined classes and properties, non-unique values for inverse-functional properties, malformed datatypes, and instances of disjoint classes. The document provides solutions like application workarounds, publishing validators, and the Pedantic Web Group for improving Linked Data quality.
How to Build Linked Data Sites with Drupal 7 and RDFascorlosquet
Slides of the tutorial Stéphane Corlosquet, Lin Clark and Alexandre Passant presented at SemTech 2010 in San Francisco http://semtech2010.semanticuniverse.com/sessionPop.cfm?confid=42& proposalid=2889
These slides explain (1) the motivation for using RDFa, for embedding structured data on web pages, (2) RDF as the foundation of RDFa, and (3) RDFa through examples.
How to Build Linked Data Sites with Drupal 7 and RDFascorlosquet
Slides of the tutorial Stéphane Corlosquet, Lin Clark and Alexandre Passant presented at SemTech 2010 in San Francisco http://semtech2010.semanticuniverse.com/sessionPop.cfm?confid=42& proposalid=2889
These slides explain (1) the motivation for using RDFa, for embedding structured data on web pages, (2) RDF as the foundation of RDFa, and (3) RDFa through examples.
A Privacy Preference Manager for the Social Semantic WebOwen Sacco
Current Social Web applications provide users with means to easily publish their personal information on the Web. However, once published, users cannot control how their data can be accessed apart from applying generic preferences (such as “friends” or “family”). In this paper, we describe how we enable finer-grained privacy preferences using the Privacy Preference Ontology (PPO); a light-weight vocabulary for defining privacy settings on the Social Web. In particular, we describe the formal semantic model of PPO and also present MyPrivacyManager, a privacy preference manager that let users (1) create privacy preferences using the aforementioned ontology and (2) restrict access to their data to third-party users based on profile features such as interests, relationships and common attributes.
Talk about Exploring the Semantic Web, and particularly Linked Data, and the Rhizomer approach. Presented August 14th 2012 at the SRI AIC Seminar Series, Menlo Park, CA
A talk about the gap between theory and practice with W3C Semantic Web and Dublin Core standards, and how the DC Tools Community can help collectively reduce the cost of that gap.
Given as part of the DC Tools Community workshop at LIDA2009 in Zadar, Croatia.
Towards an RDF Analytics Language: Learning from Successful ExperiencesFadi Maali
This presentation is given in the Fourth International Workshop on Consuming Linked Data (COLD2013) @ ISWC2013
Paper at: http://ceur-ws.org/Vol-1034/MaaliAndDecker_COLD2013.pdf
This is part 2 of the ISWC 2009 tutorial on the GoodRelations ontology and RDFa for e-commerce on the Web of Linked Data.
See also
http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_ISWC2009
This is part 2 of the ISWC 2009 tutorial on the GoodRelations ontology and RDFa for e-commerce on the Web of Linked Data.
See also
http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_ISWC2009
Presentation about opportunities and challenges concerning Linked Data at the Open Science Data Cloud NSF PIRE Workshop [1] on 18 July 2012 in Edinburgh, UK.
[1] http://www.opensciencedatacloud.org/osdc-edinburgh-workshop-71612-71712/
dcat: An RDF vocabulary for interoperability of data cataloguesRichard Cyganiak
Governments produce large amounts of valuable data, such as spreadsheets and maps, as part of daily operations and decision-making. This data can be useful to many citizens and organizations beyond the government, and it is ultimately the citizen who has paid for producing it. More and more governments make this data publicly available through data catalogs, such as data.gov, data.gov.uk, or statcentral.ie. This slide set introduces dcat, a unified format for publishing the metadata of such catalogs, that supports the querying, federation, consumption and archival of these valuable data assets.
A Privacy Preference Manager for the Social Semantic WebOwen Sacco
Current Social Web applications provide users with means to easily publish their personal information on the Web. However, once published, users cannot control how their data can be accessed apart from applying generic preferences (such as “friends” or “family”). In this paper, we describe how we enable finer-grained privacy preferences using the Privacy Preference Ontology (PPO); a light-weight vocabulary for defining privacy settings on the Social Web. In particular, we describe the formal semantic model of PPO and also present MyPrivacyManager, a privacy preference manager that let users (1) create privacy preferences using the aforementioned ontology and (2) restrict access to their data to third-party users based on profile features such as interests, relationships and common attributes.
Talk about Exploring the Semantic Web, and particularly Linked Data, and the Rhizomer approach. Presented August 14th 2012 at the SRI AIC Seminar Series, Menlo Park, CA
A talk about the gap between theory and practice with W3C Semantic Web and Dublin Core standards, and how the DC Tools Community can help collectively reduce the cost of that gap.
Given as part of the DC Tools Community workshop at LIDA2009 in Zadar, Croatia.
Towards an RDF Analytics Language: Learning from Successful ExperiencesFadi Maali
This presentation is given in the Fourth International Workshop on Consuming Linked Data (COLD2013) @ ISWC2013
Paper at: http://ceur-ws.org/Vol-1034/MaaliAndDecker_COLD2013.pdf
This is part 2 of the ISWC 2009 tutorial on the GoodRelations ontology and RDFa for e-commerce on the Web of Linked Data.
See also
http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_ISWC2009
This is part 2 of the ISWC 2009 tutorial on the GoodRelations ontology and RDFa for e-commerce on the Web of Linked Data.
See also
http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_ISWC2009
Presentation about opportunities and challenges concerning Linked Data at the Open Science Data Cloud NSF PIRE Workshop [1] on 18 July 2012 in Edinburgh, UK.
[1] http://www.opensciencedatacloud.org/osdc-edinburgh-workshop-71612-71712/
dcat: An RDF vocabulary for interoperability of data cataloguesRichard Cyganiak
Governments produce large amounts of valuable data, such as spreadsheets and maps, as part of daily operations and decision-making. This data can be useful to many citizens and organizations beyond the government, and it is ultimately the citizen who has paid for producing it. More and more governments make this data publicly available through data catalogs, such as data.gov, data.gov.uk, or statcentral.ie. This slide set introduces dcat, a unified format for publishing the metadata of such catalogs, that supports the querying, federation, consumption and archival of these valuable data assets.
Essentials of Automations: Optimizing FME Workflows with ParametersSafe Software
Are you looking to streamline your workflows and boost your projects’ efficiency? Do you find yourself searching for ways to add flexibility and control over your FME workflows? If so, you’re in the right place.
Join us for an insightful dive into the world of FME parameters, a critical element in optimizing workflow efficiency. This webinar marks the beginning of our three-part “Essentials of Automation” series. This first webinar is designed to equip you with the knowledge and skills to utilize parameters effectively: enhancing the flexibility, maintainability, and user control of your FME projects.
Here’s what you’ll gain:
- Essentials of FME Parameters: Understand the pivotal role of parameters, including Reader/Writer, Transformer, User, and FME Flow categories. Discover how they are the key to unlocking automation and optimization within your workflows.
- Practical Applications in FME Form: Delve into key user parameter types including choice, connections, and file URLs. Allow users to control how a workflow runs, making your workflows more reusable. Learn to import values and deliver the best user experience for your workflows while enhancing accuracy.
- Optimization Strategies in FME Flow: Explore the creation and strategic deployment of parameters in FME Flow, including the use of deployment and geometry parameters, to maximize workflow efficiency.
- Pro Tips for Success: Gain insights on parameterizing connections and leveraging new features like Conditional Visibility for clarity and simplicity.
We’ll wrap up with a glimpse into future webinars, followed by a Q&A session to address your specific questions surrounding this topic.
Don’t miss this opportunity to elevate your FME expertise and drive your projects to new heights of efficiency.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
Accelerate your Kubernetes clusters with Varnish CachingThijs Feryn
A presentation about the usage and availability of Varnish on Kubernetes. This talk explores the capabilities of Varnish caching and shows how to use the Varnish Helm chart to deploy it to Kubernetes.
This presentation was delivered at K8SUG Singapore. See https://feryn.eu/presentations/accelerate-your-kubernetes-clusters-with-varnish-caching-k8sug-singapore-28-2024 for more details.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualityInflectra
In this insightful webinar, Inflectra explores how artificial intelligence (AI) is transforming software development and testing. Discover how AI-powered tools are revolutionizing every stage of the software development lifecycle (SDLC), from design and prototyping to testing, deployment, and monitoring.
Learn about:
• The Future of Testing: How AI is shifting testing towards verification, analysis, and higher-level skills, while reducing repetitive tasks.
• Test Automation: How AI-powered test case generation, optimization, and self-healing tests are making testing more efficient and effective.
• Visual Testing: Explore the emerging capabilities of AI in visual testing and how it's set to revolutionize UI verification.
• Inflectra's AI Solutions: See demonstrations of Inflectra's cutting-edge AI tools like the ChatGPT plugin and Azure Open AI platform, designed to streamline your testing process.
Whether you're a developer, tester, or QA professional, this webinar will give you valuable insights into how AI is shaping the future of software delivery.
JMeter webinar - integration with InfluxDB and GrafanaRTTS
Watch this recorded webinar about real-time monitoring of application performance. See how to integrate Apache JMeter, the open-source leader in performance testing, with InfluxDB, the open-source time-series database, and Grafana, the open-source analytics and visualization application.
In this webinar, we will review the benefits of leveraging InfluxDB and Grafana when executing load tests and demonstrate how these tools are used to visualize performance metrics.
Length: 30 minutes
Session Overview
-------------------------------------------
During this webinar, we will cover the following topics while demonstrating the integrations of JMeter, InfluxDB and Grafana:
- What out-of-the-box solutions are available for real-time monitoring JMeter tests?
- What are the benefits of integrating InfluxDB and Grafana into the load testing stack?
- Which features are provided by Grafana?
- Demonstration of InfluxDB and Grafana using a practice web application
To view the webinar recording, go to:
https://www.rttsweb.com/jmeter-integration-webinar
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Let's dive deeper into the world of ODC! Ricardo Alves (OutSystems) will join us to tell all about the new Data Fabric. After that, Sezen de Bruijn (OutSystems) will get into the details on how to best design a sturdy architecture within ODC.
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...UiPathCommunity
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
"Impact of front-end architecture on development cost", Viktor TurskyiFwdays
I have heard many times that architecture is not important for the front-end. Also, many times I have seen how developers implement features on the front-end just following the standard rules for a framework and think that this is enough to successfully launch the project, and then the project fails. How to prevent this and what approach to choose? I have launched dozens of complex projects and during the talk we will analyze which approaches have worked for me and which have not.
"Impact of front-end architecture on development cost", Viktor Turskyi
Weaving the Pedantic Web (LD
1. Digital Enterprise Research Institute www.deri.ie
Weaving the Pedantic Web
LDOW 2010
Aidan Hogan, Andreas Harth, Alexandre Passant, Stefan
Decker, Axel Polleres
0:39:00
Copyright 2009 Digital Enterprise Research Institute. All rights reserved.
1
3. Purpose of talk: Application developers…
how to not sink…
Digital Enterprise Research Institute www.deri.ie
3
4. Purpose of talk: RDF Publishers… how
to avoid common mistakes…
Digital Enterprise Research Institute www.deri.ie
4
5. Talking about errors in Linked Data…
Digital Enterprise Research Institute www.deri.ie
We’ll try not to ruin the party
…statistics based on crawl:
April 2009
5k domain limit
150k URIS, 55k RDF docs
12.5m triples (quads)
Mentioning 1.6m URIs
5,850 classes/9,507 props
Accept: application/rdf+xml
…okay… so no RDFa
Statistics are *illustrative* not
exhaustive!
5
6. Digital Enterprise Research Institute www.deri.ie
Chapter 1: HTTP-level issues…
…a good RDF description these days is hard to find
6
7. Waldo URIs:
URIs with no dereferencable RDF
Digital Enterprise Research Institute www.deri.ie
Not a crawler’s idea of fun…
7
8. Hmm not *so* many…
Digital Enterprise Research Institute www.deri.ie
5.3% of HTTP URIs return 40x/50x
Excluding redirects…
92.8% return 200 OK
In return, only 45.4% of 200 Okay return report
application/rdf+xml
34.8% return HTML… probably just HTML docs…
okay… maybe a *few* contain RDFa
8
9. Lies… Damned Lies…
& Content-Type Reporting
Digital Enterprise Research Institute www.deri.ie
“Trust me, it’s RDF/XML”
9
10. Okay… So he’s actually pretty honest
Digital Enterprise Research Institute www.deri.ie
16.9% of valid RDF/XML documents
returned with an invalid/more generic
Content-type:
text/xml (9.5%)
application/xml (5.9%)
text/plain (1%)
text/html (0.4%)
Of those returning
Content-type:application/rdf+xml
98.8% were valid RDF/XML
10
11. Same triples, different document
Digital Enterprise Research Institute www.deri.ie
I wish they’d used a redirect…
11
12. E.g., the Miracle at Calais:
turning 1,778 triples into ~∞ quads
Digital Enterprise Research Institute www.deri.ie
http://d.opencalais.com/1/type/em/r/SameTriplesDifferentDocument
(apologies to OpenCalais guys – it’s just a convenient example)
12
13. Digital Enterprise Research Institute www.deri.ie
Chapter 2: Reasoning issues…
…or, how I learned to start worrying and stop loving OWL
13
14. Undefined classes and properties…
Digital Enterprise Research Institute www.deri.ie
It looks important, but I’m afraid I don’t fully follow
14
15. Quite common…
Digital Enterprise Research Institute www.deri.ie
14.3% of triples use undeclared property
8.1% of triples use undeclared class
Three cases:
Case 1: Namespace has no vocabulary/
is not deferencable
(e.g., rss:item)
Case 2: Term invented in related namespace
(e.g., foaf:tagLine invented by LiveJournal)
Case 3: Term is misspelt version of term defined in namespace
(e.g., foaf:image vs. foaf:img)
15
16. Not-so-unique values for
Inverse-Functional Properties
Digital Enterprise Research Institute www.deri.ie
Despite what you claim,
not all of you can *actually be* Spartacus
16
17. Spartacus relived…
Digital Enterprise Research Institute www.deri.ie
08445a31a78661b5c746feff39a9db6e4e2cc5cf
sha1-sum of „mailto:‟
common value for foaf:mbox_sha1sum
An inverse-functional (uniquely identifying) property!!!
Any person who shares the same value will be considered
the same
*I’m Spartacus!*
…and so’s my wife
17
18. …unattended, can be pretty serious…
Digital Enterprise Research Institute www.deri.ie
foaf:mbox_sha1sum a owl:InverseFunctionalProperty .
?x foaf:mbox_sha1sum 08445a31a78661b5c746feff39a9db6e4e2cc5cf .
OWL 2 RL rule prp-ifp:
?p a owl:InverseFunctionalProperty . ?x1 ?p ?z . ?x2 ?p ?z .
⇒ ?x1 owl:sameAs ?x2 .
106 ?x1/?x2bindings in body
1012 inferred pair-wise and reflexive owl:sameAs statements
…or in simpler terms:
pow!
18
20. Not *too* bad…
Digital Enterprise Research Institute www.deri.ie
4.7% of typed literals were “ill-typed” (lexically
invalid)…
mostly xsd:dateTimes (26.4% of all date-time literals
were invalid; e.g., omitted the seconds field)
Also, literals are sometimes incompatible with
the datatype-range of a property:
E.g., 21.8% of ical:description triples used
language tags incompatible with the defined range of
xsd:string
E.g., 100% of sl:creationDate triples use plain literal
values incompatible with defined range of xsd:date
20
21. Mystical beings…
Members of disjoint classes
Digital Enterprise Research Institute www.deri.ie
Despite what FOAF says, it seems that
Persons can also be Documents
21
22. Again, not *too* bad…
Digital Enterprise Research Institute www.deri.ie
1,329 members of disjoint classes found
Generally caused by naïve URI naming:
Use of information resource URIs to name entities
(particularly foaf:Persons)
E.g., <me> foaf:knows <jim/foaf.rdf> .
22
23. Ontology hijacking…
Digital Enterprise Research Institute www.deri.ie
Anybody can say anything, anywhere, and unfortunately for everyone
else, have a good chance of being taken seriously
23
24. Redefining Everything…
…and home in time for tea
Digital Enterprise Research Institute www.deri.ie
From http://www.eiao.net/rdf/1.0
<owl:Property rdf:about="http://www.w3.org/1999/02/22-rdf-syntax-ns#type">
<rdfs:label xml:lang="en">type</rdfs:label>
<rdfs:comment xml:lang="en">Type of resource</rdfs:comment>
<rdfs:domain rdf:resource="http://www.eiao.net/rdf/1.0#testRun"/>
<rdfs:domain rdf:resource="http://www.eiao.net/rdf/1.0#pageSurvey"/>
<rdfs:domain rdf:resource="http://www.eiao.net/rdf/1.0#siteSurvey"/>
<rdfs:domain rdf:resource="http://www.eiao.net/rdf/1.0#scenario"/>
<rdfs:domain rdf:resource="http://www.eiao.net/rdf/1.0#rangeLocation"/>
<rdfs:domain rdf:resource="http://www.eiao.net/rdf/1.0#startPointer"/>
<rdfs:domain rdf:resource="http://www.eiao.net/rdf/1.0#endPointer"/>
<rdfs:domain rdf:resource="http://www.eiao.net/rdf/1.0#header"/>
<rdfs:domain rdf:resource="http://www.eiao.net/rdf/1.0#runs"/>
</owl:Property>
Ontology hijacking!!
(apologies to EIAO guys – it’s just a convenient example)
24
26. Application side: workarounds
Digital Enterprise Research Institute www.deri.ie
All presented issues have a suitable antidote, once
you know about them
See paper for discussion…
26
27. Publishing side: Validators!
Digital Enterprise Research Institute www.deri.ie
Syntax errors quite rare, partly due to popularity of
W3C RDF/XML syntax validator
Need an all-in-one validation service
Should not only validate strict errors, but give
feedback on suspected issues
We offer a prototypical service at:
http://swse.deri.org/RDFAlerts/
27
28. Publishing side: Pedantic Web Group
Digital Enterprise Research Institute www.deri.ie
Get the community to contact publishers about
errors/issues as they arise
Get involved: http://pedantic-web.org/
137 members!
Acknowledgements to: Aidan Hogan, Alex Passant, Me, Antoine Zimmermann, Axel
Polleres, Michael Hausenblas, Richard Cyganiak, Stéphane Corlosquet
28