A presentation of open data and its potential, especially seen in light of the linked open data development.
Presentation held for the Institute of Information and Media Science at the University of Bergen, 14.04.2011
Open data and reuse of public information
1. Open Data and its Potential
- reuse of public sector information
- Svein Ølnes, Vestlandsforsking, 13.04.2011
2. Outline
About Vestforsk and myself
Semantic technologies
Linked (Open) Data
Open Data
Open Data -> LOD -> Sem. Techn.
Relevant projects and resources
Literature
www.vestforsk.no
3. Vestlandsforsking
ICT themes
Semantic technologies, information structures ++
Regional development, organizational changes with ICT
ICT application areas
Public sector (eGovernment, eHealth)
Tourism sector (local, regional, national, international level)
Vestforsk also does research in
Climate change
Transport and environment
Sustainable tourism
Renewable energy
4. About me
Vestforsk since 1996
eGovernment
Municipalities
Government
Semantic technologies
Projects
Norge.no (establishment, 1999/2000)
MiSide (development of demonstrator in 2004)
LivsIT/Los (2003 – to date)
Evaluation of public websites (2001 – to date)
5. Naming things!
[the famous cartoon by Gary Larson showing a man
painting ’the cat’, ’the dog’, ’the house’ on his cat,
dog, and house and explaining ”Now, this should
clear up a few things around here!”]
6. Technology waves
Model driven
Focus: Semantic
Data: Ontology & Data
Component based
Focus: Services
Data: XML
Object oriented
Focus: Structure
Data: Relational
Procedure oriented
Focus: Syntax
Data: Hierarchical
1965 1975 1985 1995 2005 2015
Stian Danenbarger, Bouvet.no
7. The ontology spectrum
The ontology spectrum: From weak to strong semantics
1. Vocabulary
• plain text documents/HTML pages – almost no semantic structure
2. Controlled vocabularies (weak semantic structure)
• adding metadata to the information
3. Taxonomies
• metadata and hierarchy
4. Thesauri
• metadata, hierarchy and a limited set of relations (BT, NT, related to ...)
5. Stronger semantic structures/ontologies
• metadata, [hierarchy], any relations
(Daconta et al.: “The Semantic Web”)
8. Semantic technologies
AI tradition/Logics: Semantic web
W3C as the standardization body
Humanities/Library science: Topic Maps
ISO-standard
Light-weight, bottom-up: Microformats
Not a standard yet, but might be as part of HTML5
9. Semantic Web
”Web of data”
”Web 3.0”
”Semantic web” coined by Tim Berners-Lee in mid 1990s
The (in-)famous article ”The Semantic Web” in Scientific
American 2001 (TBL, Jim Hendler, Ora Lassila)
Wikipedia:
However, the Semantic Web as originally envisioned, a system that
enables machines to understand and respond to complex human
requests based on their meaning, has remained largely unrealized and
its critics have questioned its feasibility.
11. Lessons learned from the HTML history?
XHTML 1: HTML as XML
XHTML 2: get rid of HTML altogether
... it was a disaster!
WHATWG TF – a rebellion inside W3C
Web Apps 1.0
.. eventually led to HTML5
pragmatism won over idealism
Jeremy Keith: ”HTML5 for Web Designers”
12. Semantic Web light
Is the Semantic Web too complex?
difficult to scale to the WWW
more suitable for use within smaller domains
Introducing ”Light-weight” SW:
RDFa: RDF embedded in (X)HTML attributes – part of HTML
GRDDL: extracting RDF data from XML/XHTML documents
SKOS: Simple Knowledge Organization System – representation of
”classical” structures as taxonomies, thesauri in RDF
• organizing concepts with standard relations
Linked (Open) Data
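A minimal sketch of the SKOS idea, assuming plain Python with no RDF library: a small concept hierarchy is turned into triples using the standard SKOS relations (`skos:broader` for the thesaurus BT relation, `skos:narrower` for NT) and written out as N-Triples. The `http://example.org/concept/` namespace and the concept names are made up for illustration; only the SKOS and RDF property URIs are real.

```python
SKOS = "http://www.w3.org/2004/02/skos/core#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
EX = "http://example.org/concept/"  # hypothetical namespace

# Hypothetical taxonomy: concept -> broader concept (None marks a top concept).
BROADER = {
    "transport": None,
    "public-transport": "transport",
    "ferry": "public-transport",
}


def skos_triples(broader):
    """Return (subject, predicate, object) triples for a concept hierarchy."""
    triples = []
    for concept, parent in broader.items():
        triples.append((EX + concept, RDF_TYPE, SKOS + "Concept"))
        if parent is not None:
            # State the relation in both directions, as a thesaurus would.
            triples.append((EX + concept, SKOS + "broader", EX + parent))
            triples.append((EX + parent, SKOS + "narrower", EX + concept))
    return triples


def to_ntriples(triples):
    """Serialize URI-only triples as N-Triples lines."""
    return "\n".join(f"<{s}> <{p}> <{o}> ." for s, p, o in triples)


def ancestors(concept, broader):
    """Walk the broader chain from a concept up to its top concept."""
    chain = []
    parent = broader[concept]
    while parent is not None:
        chain.append(parent)
        parent = broader[parent]
    return chain


if __name__ == "__main__":
    print(to_ntriples(skos_triples(BROADER)))
    print(ancestors("ferry", BROADER))
```

The point of the sketch is that the "classical" structure (a hierarchy with a fixed set of relations) needs no custom ontology work: the relations are already standardized, so only the concepts themselves have to be supplied.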
13. Topic Maps
ISO standard from 2001 (present standard from 2003)
ISO 13250:2003
Strong Norwegian community
Small worldwide community compared to the Semantic Web (SW)
Large uptake in portals, especially public portals
”Fight” between TM and SW
Largely over, ”SW has won”
Linked Data as a common ground for further development
Focus has shifted from technology to utilizing data
14. Simple Topic Maps model
3 Topic types: person, project and publication
3 Association types: Project manager of, Author of, and Result of
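The model on this slide can be sketched as a small data structure. This is an illustration in plain Python, not the API of any Topic Maps engine, and it simplifies associations to directed pairs (real Topic Maps associations carry typed roles). The topic names ("Alice", "Project X", "Report Y") are hypothetical; the topic types and association types are the ones listed above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Topic:
    name: str
    topic_type: str  # "person", "project" or "publication"


@dataclass(frozen=True)
class Association:
    assoc_type: str  # "project manager of", "author of" or "result of"
    source: Topic
    target: Topic


# Hypothetical instances, one per topic type.
alice = Topic("Alice", "person")
project = Topic("Project X", "project")
report = Topic("Report Y", "publication")

associations = [
    Association("project manager of", alice, project),
    Association("author of", alice, report),
    Association("result of", report, project),
]


def related(topic, assocs):
    """List (association type, other topic name) pairs that involve a topic."""
    pairs = []
    for a in assocs:
        if a.source == topic:
            pairs.append((a.assoc_type, a.target.name))
        elif a.target == topic:
            pairs.append((a.assoc_type, a.source.name))
    return pairs
```

Calling `related(project, associations)` collects everything connected to the project regardless of direction, which is how a Topic Maps driven portal typically builds the "related items" part of a page.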
15. Topic Maps in use
Some Topic Maps driven portals:
uib.no
vestforsk.no
nofima.no
regjeringen.no
stortinget.no
bergen.kommune.no
17. Ultimate goal: My metadata is your data (and vice versa)
[figure showing interlinked data sources: Europeana, Yr.no, SERES, Volven, SSB, LOS, KS, Smilno, Kartverket, Lovdata, SKD]
18. Linked (Open) Data
using the Web to lower the barriers to linking data
use of RDF to make typed statements
Linked Data = Use the Web to make typed links between data
from different sources
Alex Wright: The Web That Wasn’t (Topic Maps 2008 Conf.)
David Weinberger: Thank God! (Topic Maps 2008 Conf.)
”small pieces loosely joined”
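A typed link is simply a statement whose predicate says what kind of relation holds, and whose object may live in somebody else's dataset. A minimal sketch, assuming plain Python tuples instead of an RDF library: `owl:sameAs` and `foaf:based_near` are real vocabulary terms, while the `http://example.org/id/` namespace and the resources in it are hypothetical.

```python
# Real vocabulary terms.
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"
FOAF_BASED_NEAR = "http://xmlns.com/foaf/0.1/based_near"

LOCAL = "http://example.org/id/"          # hypothetical local dataset
DBPEDIA = "http://dbpedia.org/resource/"  # DBpedia resource namespace

# Each statement is (subject, predicate, object); the predicate is the link type.
triples = [
    (LOCAL + "bergen", OWL_SAME_AS, DBPEDIA + "Bergen"),
    (LOCAL + "institute-1", FOAF_BASED_NEAR, LOCAL + "bergen"),
]


def external_links(statements, local_prefix):
    """Return statements that link a local subject to another dataset."""
    return [
        (s, p, o)
        for s, p, o in statements
        if s.startswith(local_prefix) and not o.startswith(local_prefix)
    ]


if __name__ == "__main__":
    for s, p, o in external_links(triples, LOCAL):
        print(f"<{s}> <{p}> <{o}> .")
```

Only the first statement crosses a dataset boundary; it is that kind of statement that joins the "small pieces" into one web of data.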
20. Linked Data Principles
1. Use URIs as names for things
2. Use HTTP URIs so that people can look up those names
3. When someone looks up a URI, provide useful information,
using the standards (RDF, SPARQL)
4. Include links to other URIs, so that they can discover more
things
Linked Data can be serialized as
RDF/XML
N3/Turtle (Turtle is a subset of N3)
RDFa
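Principles 2 and 3 amount to an ordinary HTTP lookup in which the client states which serialization it wants. A minimal sketch using only the Python standard library: `build_lookup` and `fetch` are illustrative helper names, and whether a given server honors the `Accept` header is an assumption that has to be checked per data source.

```python
import urllib.request


def build_lookup(uri, media_type="text/turtle"):
    """Build an HTTP GET request that asks for an RDF serialization of a URI."""
    return urllib.request.Request(uri, headers={"Accept": media_type})


def fetch(uri, media_type="text/turtle", timeout=10):
    """Dereference the URI and return (content type, body text).

    Requires network access; the server decides what it actually returns.
    """
    request = build_lookup(uri, media_type)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.headers.get("Content-Type"), response.read().decode("utf-8")


if __name__ == "__main__":
    # Ask for Turtle; pass "application/rdf+xml" to ask for RDF/XML instead.
    request = build_lookup("http://dbpedia.org/resource/Bergen")
    print(request.get_method(), request.full_url, request.get_header("Accept"))
```

The same URI thus serves both people (a browser asking for HTML) and programs (a client asking for RDF), which is what makes a name for a thing "look-up-able".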
21. Linked Open Data Star Scheme
Tim Berners-Lee/DERI – University of Galway
22. Linked Data example
”Populated place” is a concept defined in the DBpedia ontology
Use established ontologies wherever possible
FOAF (friend-of-a-friend)
Dublin Core
hCard, hCalendar, hAtom
23. Linked Data vs. Semantic Web
The Semantic Web, or the Web of Data, is the ultimate goal
Linked Data provides the means to reach that goal
Linked Data helps build the Web of Data that can later be
exploited by more advanced technologies such as intelligent agents
(it should be added that this is the claim made by proponents of
the Semantic Web and intelligent agents)
Tom Heath: ”Without Linked Data, no Semantic Web!”
Talis Nodalities no. 11
24. Open data
In principle all data, but mostly public data because that is the
easiest to start with
The EU's PSI directive is an important enabler (also included in
Offentleglova, the Norwegian Freedom of Information Act)
data.norge.no
data.norge.no from FAD to Difi
and from blog to data repository (?)
25. Why open data?
1. Increase democratic control and political participation
Empower citizens to exercise their democratic rights
2. Foster service and product innovation
New opportunities for innovation generated by open government
data
3. Strengthen law enforcement
Especially the US and the UK strategies emphasize this
Study published in the European Journal of ePractice, 2011
26. “Open data and its enemies”
Some pressure from FAD (recently expressed in ”Tildelingsbrevet”,
the annual allocation letter), but slow movement in general
cultural issues
budget issues
fear of losing control
transparency is seen as a threat
Map data is some of the most important – Map Authorities are
not willing to publish raw data
27. Closed map data a problem
Bente Kalsnes, Origo
30. Top 10 drivers of open data
1. Strategies and experiences
2. Political leadership
3. Regional initiatives
4. Citizen initiatives
5. Market initiatives
6. Emerging technologies
7. European legislation
8. Thought leaders
9. Possibility of monitoring government
10. Budget cuts
European Journal of ePractice, 2011
31. Top 10 barriers to open data
1. Closed government culture
2. Privacy legislation
3. Limited quality of data
4. Limited user-friendliness/Info overload
5. Lack of standardisation of open data
6. Security threats
7. Existing charging methods
8. Uncertain economic impact
9. Digital divide
10. Network overload
European Journal of ePractice, 2011
32. data.norge.no
Initiative from FAD started in 2010
(I will take credit for the name! :)
Mostly a blog
Gradually building up a data repository
From 01.05.2011 Difi will have the responsibility for
data.norge.no
33. data.norge.no as of April 2011
1. Byantikvarens gule liste (xls)
2. Einingsregisteret (rdf/xml)
3. Gardsmatrikkelen 1886 (xls)
4. Idrettsanlegg (csv)
5. Kraftprisar (tab-separated text)
6. Ladestasjonar (csv, ov2..)
7. Los (ods)
8. N5000 (various graphic formats + sosi/shape)
9. Statlege styre, råd og utval (html)
10. Statsbudsjettet og nasjonalbudsjettet 2011 (xls, csv)
11. Tenestemannsregisteret (csv)
• no.ckan.net lists 212 different data sources
34. 7 tips for publishing linked open data
1. Use standard Internet protocols for access (http)
2. All objects need a unique identifier (URI)
3. Avoid aggregation of data
4. Structure metadata in a machine-readable format (xml or xml/rdf/xtm)
5. Use an international character set (UTF-8)
6. Use at least Dublin Core as a standard way of describing metadata
7. Think about linking to other data sources by preparing for Linked Data
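Several of the tips above (URIs, machine-readable metadata, UTF-8, Dublin Core, links to other sources) can be illustrated in one small example. The following is a minimal sketch in Python and is not part of the original slides: the function name describe_dataset, the example.org URIs and the publisher name are hypothetical, and using dc:relation for the outgoing link is an assumption made for illustration.

```python
import xml.etree.ElementTree as ET

# Standard namespace URIs for RDF and the Dublin Core element set.
RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
DC_NS = "http://purl.org/dc/elements/1.1/"


def describe_dataset(uri, title, publisher, related_uri):
    """Return a UTF-8 encoded RDF/XML description of one dataset.

    Hypothetical helper: the dataset is identified by a URI (tip 2),
    described with Dublin Core elements (tips 4 and 6), serialized
    as UTF-8 (tip 5) and linked to another resource (tip 7).
    """
    ET.register_namespace("rdf", RDF_NS)
    ET.register_namespace("dc", DC_NS)

    root = ET.Element(f"{{{RDF_NS}}}RDF")
    desc = ET.SubElement(
        root, f"{{{RDF_NS}}}Description", {f"{{{RDF_NS}}}about": uri}
    )
    ET.SubElement(desc, f"{{{DC_NS}}}title").text = title
    ET.SubElement(desc, f"{{{DC_NS}}}publisher").text = publisher
    # Assumption: dc:relation is used here to point at a related source.
    ET.SubElement(
        desc, f"{{{DC_NS}}}relation", {f"{{{RDF_NS}}}resource": related_uri}
    )
    return ET.tostring(root, encoding="utf-8", xml_declaration=True)


if __name__ == "__main__":
    record = describe_dataset(
        "http://example.org/dataset/boards-and-councils",  # hypothetical URI
        "Statlege styre, råd og utval",
        "Example publisher",                               # hypothetical
        "http://example.org/dataset/related",              # hypothetical URI
    )
    print(record.decode("utf-8"))
```

The record could then be served over plain http next to the data file it describes, which covers tip 1.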
35. Relevant projects from Vestforsk
Sesam4 – Semantic technologies for SMEs
Los – a navigator for public services
Tourism concepts – a common vocabulary for the tourism industry
Seminars on semantic technologies
The WIMS’11 Conference
36. Sesam4
VERDIKT project 2008 – 2011 (ended 31st of March this year)
Use of semantic technologies in SMEs
Provided a set of tools for SMEs (and others) to use for ”semantisizing”
their data
Demonstrated semantic technologies in two pilots:
Tourism
Business information
NR, Vestlandsforsking, Esis, Computas, UNI Digital,
Cyberwatcher, TextUrgy, Ovitas, IKT-Norge
37. Sesam4 – lessons learned
Project planned in 2007
A lot of things have happened since 2007
Emergence of Linked Data
Sesam4 gradually tuned in to LOD
Too much focus, resources, and discussion (!) spent on ontologies!
Light-weight approach saves time & money
Valuable tools for semantic lifting and best practices remain
available for anybody to use (most of the project is open
source)
38. LOS – a navigator to public services
LivsIT (1996 – 2004)
Life situations
Los (2005 - ??)
Shared vocabulary for public services
More than 1/3 of the municipalities in Norway use Los as a
foundation of their web portal
Difi is the responsible agency
Los a success despite Difi’s lack of support and development
Problem with uptake in Governmental bodies
By using Los municipalities can share information with
39. What a difference a little semantics can make
Note: Bergen recently changed their internal search to Google search and lost the semantic support (Los) for search
43. Tourism concepts
Pre-project for the Norwegian tourism industry (VisitNorway, NCE
Tourism)
Advice on constructing a common vocabulary for tourism concepts
Initiated by Anders Waage Nilsen in NCE Tourism/Fjord Norway
(Anders now in MediArena)
44. Tourism concepts - advice
1. Simplification (today’s categorizing scheme is too complicated)
2. Develop a controlled vocabulary with emphasis on keywords (the Los
method)
3. Not everything can be solved with categorizing
A controlled vocabulary is necessary but not enough
4. Publish the vocabulary in the cloud
5. Publish the vocabulary in many formats (html, xml, xml/rdf, xtm)
6. Publish also the information resources in the cloud, as linked open
data
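Advice 2 and 5 above (a keyword-oriented controlled vocabulary, published in several formats) can be sketched as one in-memory vocabulary with several serializers. This is a minimal Python sketch under stated assumptions, not the project's actual tooling: the two concepts, their keywords and the example.org URIs are invented for illustration, and mapping keywords to skos:altLabel is an assumption rather than a description of the Los method.

```python
import html
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
SKOS_NS = "http://www.w3.org/2004/02/skos/core#"

# Hypothetical tourism concepts; each has one preferred label and
# a few keywords that people might search for instead.
VOCABULARY = [
    {
        "uri": "http://example.org/tourism/hiking",
        "label": "Hiking",
        "keywords": ["trekking", "mountain walk"],
    },
    {
        "uri": "http://example.org/tourism/fjord-cruise",
        "label": "Fjord cruise",
        "keywords": ["boat trip"],
    },
]


def to_html(vocabulary):
    """Render the vocabulary as a simple HTML list for human readers."""
    rows = []
    for concept in vocabulary:
        uri = html.escape(concept["uri"], quote=True)
        label = html.escape(concept["label"])
        keywords = html.escape(", ".join(concept["keywords"]))
        rows.append(f'<li><a href="{uri}">{label}</a>: {keywords}</li>')
    return "<ul>\n" + "\n".join(rows) + "\n</ul>"


def to_rdf_xml(vocabulary):
    """Render the same vocabulary as RDF/XML using SKOS terms."""
    ET.register_namespace("rdf", RDF_NS)
    ET.register_namespace("skos", SKOS_NS)
    root = ET.Element(f"{{{RDF_NS}}}RDF")
    for concept in vocabulary:
        node = ET.SubElement(
            root,
            f"{{{SKOS_NS}}}Concept",
            {f"{{{RDF_NS}}}about": concept["uri"]},
        )
        ET.SubElement(node, f"{{{SKOS_NS}}}prefLabel").text = concept["label"]
        # Assumption: keywords are modelled as alternative labels.
        for keyword in concept["keywords"]:
            ET.SubElement(node, f"{{{SKOS_NS}}}altLabel").text = keyword
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    print(to_html(VOCABULARY))
    print(to_rdf_xml(VOCABULARY))
```

Keeping a single source structure and generating every format from it means the html and xml/rdf versions cannot drift apart; further formats would be added as additional serializer functions.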
45. Seminars on semantic technologies
Vestforsk initiated a series of seminars on semantic technologies as
part of its 25th anniversary in 2010
A total of 7-8 seminars will be held; 4 have already been arranged
All seminars are streamed and archived for video on demand
We also run the project ”Kunnskap kryssar grenser”/”Access to Knowledge”,
where we focus on streaming and use of video
46. WIMS’11
International Conference on Web Intelligence, Mining, and Semantics
Sogndal, May 25 – 27
Keynote speakers:
Jim Hendler: The Semantic Web 10th Year Update (25.05)
Peter Mika: Making Things Findable (26.05)
Sören Auer: Creating Knowledge Out of Interlinked Data (26.05)
Ashwin Ram: Open Social Learning Communities (27.05)
Marko Grobelnik: Scalable Reasoning on Intensive Streams of Data (27.05)
wims.vestforsk.no
47. Some resources
Vestforsk series of seminars on semantic technologies
http://www.vestforsk.no/aktuelt/seminarserie-om-semantiske-teknologiar
Linked Data – The Story So Far (Bizer, Heath, Berners-Lee)
http://tomheath.com/papers/bizer-heath-berners-lee-ijswis-linked-data.pdf
Linked Data vs. Linked Open Data
http://datavisualization.ch/opinions/introduction-to-linked-data
Linked Data – Evolving the Web into a Global Data Space (Heath, Bizer)
http://linkeddatabook.com/editions/1.0/
Introduction to Linked Open Data for Visualization Creators:
http://datavisualization.ch/opinions/introduction-to-linked-data
CKAN: The Data Hub
http://ckan.net
48. More resources
Talis Nodalities:
http://www.talis.com/nodalities
Publishing Open Government Data (working draft)
http://www.w3.org/TR/2009/WD-gov-data-20090908/
Åpne data og journalistikk (Bente Kalsnes, Origo)
http://www.slideshare.net/benteka/pne-data-og-journalistikk
European Journal of ePractice
http://www.epractice.eu/en/journal/issues
Figshare: Sharing scientific data (http://figshare.com)
http://blog.okfn.org/2011/03/02/introducing-figshare-a-new-way-to-share-open-scientific-data/
49. Thank you for your attention!
Contact information:
Svein Ølnes – sol@vestforsk.no
www.vestforsk.no