SlideShare a Scribd company logo
1 of 13
www.sti-innsbruck.at© Copyright 2013 STI INNSBRUCK www.sti-innsbruck.at
“How to put an annotation in
HTML?”
Ioannis Stavrakantonakis
www.sti-innsbruck.at
Outline
2
• Research question
• ITS 2.0
• NIF
• What about Microdata?
• Demo
• References
www.sti-innsbruck.at
Research question
3
We want to annotate Springfield with an URI to make sure that the
computer understands we mean the Springfield in Massachusetts.
HTML:
<p>It is well known, that Springfield has mild summers and short, but hard
winters.</p>
HTML with annotation (something like that):
<p>It is well known, that
<span about="http://sws.geonames.org/4951788/">Springfield</span>
has mild summers and short, but hard winters.</p>
We don't want to add whole triples, but just annotate the HTML and say
"this element refers to the following URI".
From: Denny Vrandečić
Sent: Wednesday, April 24, 2013 1:59 PM
To: semantic-web at W3C
Subject: How to put an annotation in HTML?
www.sti-innsbruck.at
ITS 2.0
4
• International Tag Set (ITS) [2]
– enhances the foundation to integrate automated processing of human language into
core Web technologies;
– focuses on HTML, XML-based formats in general, and can leverage processing
based on the XML Localization Interchange File Format (XLIFF), as well as the
Natural Language Processing Interchange Format (NIF);
– is a technology to add metadata to Web content, for the benefit of localization,
language technologies, and internationalization (see more in [5] regarding localization
(l10n) and internationalization (i18n))
www.sti-innsbruck.at
ITS 2.0
5
• Potential Users of ITS [2]:
– Schema developers starting a schema from the ground up
(proposals for attribute and element names to be included in their new schema)
– Schema developers working with an existing schema
(should check whether their schemas support the markup proposed in this
specification, and, where appropriate, add the markup proposed here to their schema)
– Vendors of content-related tools (e.g. tools for authoring, translation, etc.)
– Content producers (may be used by them to mark up specific bits of content)
– Machine Translation Systems
– Text Analytics (automatically generated metadata for improving localization, data
integration or knowledge management workflows)
– Localization Workflow Managers
www.sti-innsbruck.at
ITS 2.0
6
The Text Analysis use case:
•This data category is used to annotate content with lexical or conceptual
information for the purpose of contextual disambiguation.
•3 pieces of annotation:
– Confidence: The confidence of the agent (that produced the annotation) in its own
computation – XSD double data type (e.g. 0.63)
– Entity type: The type of entity, or concept class of the text analysis target – IRI (e.g.
http://nerd.eurecom.fr/ontology#Location [8])
– Entity identifier: A unique identifier for the text analysis target – IRI or String (e.g.
http://dbpedia.org/page/Innsbruck or the identifier for “Capital” from Wordnet [9])
www.sti-innsbruck.at
ITS 2.0
7
Rendered HTML:
HTML with ITS metadata:
<html xmlns="http://www.w3.org/1999/xhtml">
<body>
<h2 translate="yes">Welcome to
<span its-ta-ident-ref="http://dbpedia.org/page/Innsbruck"
its-within-text="yes" translate="no">Innsbruck</span> in
<b translate="no" its-within-text="yes">Austria</b>!</h2>
</body>
</html>
www.sti-innsbruck.at
ITS 2.0
8
• Conversion to NIF [2]:
– Convert XML or HTML documents that contain ITS metadata to the RDF-based
format based on NIF. The conversion results in RDF.
– The conversion algorithm to generate NIF consists of seven steps. The output of the
algorithm uses the ITS RDF ontology [7].
– The conversion to NIF is a possible basis for a natural language processing (NLP)
application that creates, for example, named entity annotations.
– To integrate the RDF annotations into the original input document is given in [6]
(NIF2ITS).
www.sti-innsbruck.at
NLP Interchange Format (NIF)
9
• NIF is an RDF/OWL-based format that aims to achieve interoperability
between Natural Language Processing (NLP) tools, language resources
and annotations.
• NIF will soon be a normative part of the ITS 2.0
• NIF and its community project NLP2RDF serve as an umbrella project
liaising with other community of practices, especially:
– LOD2 FP7 EU project
– MultilingualWeb-LT Working Group
– Best Practices for Multilingual Linked Open Data Community Group
– Ontology-Lexica Community Group
– Named Entity Recognition and Disambiguation (NERD)
– Ontologies of Linguistic Annotation (OLiA)
• University of Leipzig
www.sti-innsbruck.at
How is it different to Microdata annotations?
10
What is the latitude and longitude of
the <span ?=?>Empire State Building</span>?
<span its-ta-ident-ref="http://live.dbpedia.org/page/Empire_State_Building">
Empire State Building</span>
<div itemscope itemtype="http://schema.org/Place">
What is the latitude and longitude of the
<span itemprop="name">Empire State Building</span>?
</div>
Microdata + schema.org
ITS2.0 + dbpedia resource
www.sti-innsbruck.at
How is it different to Microdata annotations?
11
What is the latitude and longitude of
the <span ?=?>Empire State Building</span>?
Semantics of ITS2.0 annotations:
Specify entity identifiers (IRIs) for the presented information item.
Semantics of Microdata annotations:
Specify the type of information that is presented.
Microdata
ITS2.0
www.sti-innsbruck.at
Hands-on / Demo
12
• HTML with ITS metadata
• Transformation of HTML with ITS metadata to NIF
Notes:
• Based on the XSLT files shared by the W3C Working Group member
Felix Sasaki (@fsasaki) [4]
• The Java internal XSLTC processor fails to compile the XSLTs. Use
Saxon 9 HE.
www.sti-innsbruck.at
References
[1] W3C semantic web list thread:
http://lists.w3.org/Archives/Public/semantic-web/2013Apr/0218.html
[2] ITS 2.0 W3C working draft: http://www.w3.org/TR/its20/
[3] NIF Core Ontology: http://persistence.uni-leipzig.org/nlp2rdf/
[4] Felix Sasaki ITS 2.0 extractor (github): https://github.com/fsasaki/its20-extractor
[5] W3C, Localization vs. Internationalization: http://www.w3.org/International/questions/qa-i18n
[6] W3C, Conversion NIF2ITS: http://www.w3.org/TR/its20/#nif-backconversion
[7] W3C, ITS 2.0 / RDF Ontology: http://www.w3.org/2005/11/its/rdf-content/its-rdf.html
[8] Named Entity Recognition and Disambiguation (NERD): http://nerd.eurecom.fr/ontology
[9] WordNet Search 3.1: http://wordnetweb.princeton.edu/perl/webwn
13

More Related Content

Similar to How to put an annotation in html

Web Archives and the dream of the Personal Search Engine
Web Archives and the dream of the Personal Search EngineWeb Archives and the dream of the Personal Search Engine
Web Archives and the dream of the Personal Search EngineArjen de Vries
 
Semantic technologies for the Internet of Things
Semantic technologies for the Internet of Things Semantic technologies for the Internet of Things
Semantic technologies for the Internet of Things PayamBarnaghi
 
IPTC News in JSON Spring 2013
IPTC News in JSON Spring 2013IPTC News in JSON Spring 2013
IPTC News in JSON Spring 2013Stuart Myles
 
The nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologiesThe nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologiesTony Hammond
 
Repositories thru the looking glass
Repositories thru the looking glassRepositories thru the looking glass
Repositories thru the looking glassEduserv Foundation
 
WOTS2E: A Search Engine for a Semantic Web of Things
WOTS2E: A Search Engine for a Semantic Web of ThingsWOTS2E: A Search Engine for a Semantic Web of Things
WOTS2E: A Search Engine for a Semantic Web of ThingsAndreas Kamilaris
 
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic MinerAutomatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic MinerFrancesco Osborne
 
Linked Open Data about Springer Nature conferences. The story so far
Linked Open Data about Springer Nature conferences. The story so farLinked Open Data about Springer Nature conferences. The story so far
Linked Open Data about Springer Nature conferences. The story so farAliaksandr Birukou
 
Introduction to Text Mining
Introduction to Text MiningIntroduction to Text Mining
Introduction to Text MiningMinha Hwang
 
Linked Open Data in the World of Patents
Linked Open Data in the World of Patents Linked Open Data in the World of Patents
Linked Open Data in the World of Patents Dr. Haxel Consult
 
Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Enrico Daga
 
An Introduction to Semantic Web Technology
An Introduction to Semantic Web TechnologyAn Introduction to Semantic Web Technology
An Introduction to Semantic Web TechnologyAnkur Biswas
 
What do we want computers to do for us?
What do we want computers to do for us? What do we want computers to do for us?
What do we want computers to do for us? Andrea Volpini
 
The Nature.com ontologies portal - Linked Science 2015
The Nature.com ontologies portal - Linked Science 2015The Nature.com ontologies portal - Linked Science 2015
The Nature.com ontologies portal - Linked Science 2015Michele Pasin
 
Why SKOS should be a Focal Point of your Linked Data Strategy
Why SKOS should be a Focal Point of your Linked Data StrategyWhy SKOS should be a Focal Point of your Linked Data Strategy
Why SKOS should be a Focal Point of your Linked Data StrategySemantic Web Company
 
Open Data - Principles and Techniques
Open Data - Principles and TechniquesOpen Data - Principles and Techniques
Open Data - Principles and TechniquesBernhard Haslhofer
 
Information Extraction from Text, presented @ Deloitte
Information Extraction from Text, presented @ DeloitteInformation Extraction from Text, presented @ Deloitte
Information Extraction from Text, presented @ DeloitteDeep Kayal
 
Multimedia Data Navigation and the Semantic Web (SemTech 2006)
Multimedia Data Navigation and the Semantic Web (SemTech 2006)Multimedia Data Navigation and the Semantic Web (SemTech 2006)
Multimedia Data Navigation and the Semantic Web (SemTech 2006)Bradley Allen
 

Similar to How to put an annotation in html (20)

Web Archives and the dream of the Personal Search Engine
Web Archives and the dream of the Personal Search EngineWeb Archives and the dream of the Personal Search Engine
Web Archives and the dream of the Personal Search Engine
 
Semantic technologies for the Internet of Things
Semantic technologies for the Internet of Things Semantic technologies for the Internet of Things
Semantic technologies for the Internet of Things
 
IPTC News in JSON Spring 2013
IPTC News in JSON Spring 2013IPTC News in JSON Spring 2013
IPTC News in JSON Spring 2013
 
The nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologiesThe nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologies
 
Repositories thru the looking glass
Repositories thru the looking glassRepositories thru the looking glass
Repositories thru the looking glass
 
WOTS2E: A Search Engine for a Semantic Web of Things
WOTS2E: A Search Engine for a Semantic Web of ThingsWOTS2E: A Search Engine for a Semantic Web of Things
WOTS2E: A Search Engine for a Semantic Web of Things
 
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic MinerAutomatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
 
Linked Open Data about Springer Nature conferences. The story so far
Linked Open Data about Springer Nature conferences. The story so farLinked Open Data about Springer Nature conferences. The story so far
Linked Open Data about Springer Nature conferences. The story so far
 
Introduction to Text Mining
Introduction to Text MiningIntroduction to Text Mining
Introduction to Text Mining
 
Linked Open Data in the World of Patents
Linked Open Data in the World of Patents Linked Open Data in the World of Patents
Linked Open Data in the World of Patents
 
Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.
 
An Introduction to Semantic Web Technology
An Introduction to Semantic Web TechnologyAn Introduction to Semantic Web Technology
An Introduction to Semantic Web Technology
 
What do we want computers to do for us?
What do we want computers to do for us? What do we want computers to do for us?
What do we want computers to do for us?
 
The Nature.com ontologies portal - Linked Science 2015
The Nature.com ontologies portal - Linked Science 2015The Nature.com ontologies portal - Linked Science 2015
The Nature.com ontologies portal - Linked Science 2015
 
Presentation at MTSR 2012
Presentation at MTSR 2012Presentation at MTSR 2012
Presentation at MTSR 2012
 
Why SKOS should be a Focal Point of your Linked Data Strategy
Why SKOS should be a Focal Point of your Linked Data StrategyWhy SKOS should be a Focal Point of your Linked Data Strategy
Why SKOS should be a Focal Point of your Linked Data Strategy
 
ITWS Capstone: Engineering a Semantic Web (Fall 2022)
ITWS Capstone: Engineering a Semantic Web (Fall 2022)ITWS Capstone: Engineering a Semantic Web (Fall 2022)
ITWS Capstone: Engineering a Semantic Web (Fall 2022)
 
Open Data - Principles and Techniques
Open Data - Principles and TechniquesOpen Data - Principles and Techniques
Open Data - Principles and Techniques
 
Information Extraction from Text, presented @ Deloitte
Information Extraction from Text, presented @ DeloitteInformation Extraction from Text, presented @ Deloitte
Information Extraction from Text, presented @ Deloitte
 
Multimedia Data Navigation and the Semantic Web (SemTech 2006)
Multimedia Data Navigation and the Semantic Web (SemTech 2006)Multimedia Data Navigation and the Semantic Web (SemTech 2006)
Multimedia Data Navigation and the Semantic Web (SemTech 2006)
 

More from STIinnsbruck

More from STIinnsbruck (20)

Unister
UnisterUnister
Unister
 
Twoo
TwooTwoo
Twoo
 
Twibes
TwibesTwibes
Twibes
 
Tweet deck 2012-01-02
Tweet deck 2012-01-02Tweet deck 2012-01-02
Tweet deck 2012-01-02
 
Tv handbook revised_100120141
Tv handbook revised_100120141Tv handbook revised_100120141
Tv handbook revised_100120141
 
Tv feratel 13032014
Tv feratel 13032014Tv feratel 13032014
Tv feratel 13032014
 
Tv evaluation 12032014
Tv evaluation 12032014Tv evaluation 12032014
Tv evaluation 12032014
 
T vb publication_rules_11032014
T vb publication_rules_11032014T vb publication_rules_11032014
T vb publication_rules_11032014
 
T vb mapping_implementation_25032014
T vb mapping_implementation_25032014T vb mapping_implementation_25032014
T vb mapping_implementation_25032014
 
T vb alignment_022814_0
T vb alignment_022814_0T vb alignment_022814_0
T vb alignment_022814_0
 
Ttr 20130701
Ttr 20130701Ttr 20130701
Ttr 20130701
 
Ttg mapping to_schema.org_
Ttg mapping to_schema.org_Ttg mapping to_schema.org_
Ttg mapping to_schema.org_
 
Ttb 08042014
Ttb 08042014Ttb 08042014
Ttb 08042014
 
Trust you
Trust youTrust you
Trust you
 
Tripwolf
TripwolfTripwolf
Tripwolf
 
Tripbirds
TripbirdsTripbirds
Tripbirds
 
Traveltainment
TraveltainmentTraveltainment
Traveltainment
 
Travelaudience
TravelaudienceTravelaudience
Travelaudience
 
Tourismuszukunft
TourismuszukunftTourismuszukunft
Tourismuszukunft
 
Tourismusverband innsbruck 24.09.2013
Tourismusverband innsbruck 24.09.2013Tourismusverband innsbruck 24.09.2013
Tourismusverband innsbruck 24.09.2013
 

Recently uploaded

Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...
Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...
Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...Salam Al-Karadaghi
 
OSCamp Kubernetes 2024 | A Tester's Guide to CI_CD as an Automated Quality Co...
OSCamp Kubernetes 2024 | A Tester's Guide to CI_CD as an Automated Quality Co...OSCamp Kubernetes 2024 | A Tester's Guide to CI_CD as an Automated Quality Co...
OSCamp Kubernetes 2024 | A Tester's Guide to CI_CD as an Automated Quality Co...NETWAYS
 
Motivation and Theory Maslow and Murray pdf
Motivation and Theory Maslow and Murray pdfMotivation and Theory Maslow and Murray pdf
Motivation and Theory Maslow and Murray pdfakankshagupta7348026
 
Genesis part 2 Isaiah Scudder 04-24-2024.pptx
Genesis part 2 Isaiah Scudder 04-24-2024.pptxGenesis part 2 Isaiah Scudder 04-24-2024.pptx
Genesis part 2 Isaiah Scudder 04-24-2024.pptxFamilyWorshipCenterD
 
Philippine History cavite Mutiny Report.ppt
Philippine History cavite Mutiny Report.pptPhilippine History cavite Mutiny Report.ppt
Philippine History cavite Mutiny Report.pptssuser319dad
 
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...Kayode Fayemi
 
Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...
Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...
Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...NETWAYS
 
George Lever - eCommerce Day Chile 2024
George Lever -  eCommerce Day Chile 2024George Lever -  eCommerce Day Chile 2024
George Lever - eCommerce Day Chile 2024eCommerce Institute
 
Russian Call Girls in Kolkata Vaishnavi 🤌 8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Vaishnavi 🤌  8250192130 🚀 Vip Call Girls KolkataRussian Call Girls in Kolkata Vaishnavi 🤌  8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Vaishnavi 🤌 8250192130 🚀 Vip Call Girls Kolkataanamikaraghav4
 
Open Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdf
Open Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdfOpen Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdf
Open Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdfhenrik385807
 
Work Remotely with Confluence ACE 2.pptx
Work Remotely with Confluence ACE 2.pptxWork Remotely with Confluence ACE 2.pptx
Work Remotely with Confluence ACE 2.pptxmavinoikein
 
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Delhi Call girls
 
The 3rd Intl. Workshop on NL-based Software Engineering
The 3rd Intl. Workshop on NL-based Software EngineeringThe 3rd Intl. Workshop on NL-based Software Engineering
The 3rd Intl. Workshop on NL-based Software EngineeringSebastiano Panichella
 
SBFT Tool Competition 2024 - CPS-UAV Test Case Generation Track
SBFT Tool Competition 2024 - CPS-UAV Test Case Generation TrackSBFT Tool Competition 2024 - CPS-UAV Test Case Generation Track
SBFT Tool Competition 2024 - CPS-UAV Test Case Generation TrackSebastiano Panichella
 
LANDMARKS AND MONUMENTS IN NIGERIA.pptx
LANDMARKS  AND MONUMENTS IN NIGERIA.pptxLANDMARKS  AND MONUMENTS IN NIGERIA.pptx
LANDMARKS AND MONUMENTS IN NIGERIA.pptxBasil Achie
 
Simulation-based Testing of Unmanned Aerial Vehicles with Aerialist
Simulation-based Testing of Unmanned Aerial Vehicles with AerialistSimulation-based Testing of Unmanned Aerial Vehicles with Aerialist
Simulation-based Testing of Unmanned Aerial Vehicles with AerialistSebastiano Panichella
 
Microsoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AIMicrosoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AITatiana Gurgel
 
call girls in delhi malviya nagar @9811711561@
call girls in delhi malviya nagar @9811711561@call girls in delhi malviya nagar @9811711561@
call girls in delhi malviya nagar @9811711561@vikas rana
 
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdfCTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdfhenrik385807
 
Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...
Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...
Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...Krijn Poppe
 

Recently uploaded (20)

Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...
Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...
Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...
 
OSCamp Kubernetes 2024 | A Tester's Guide to CI_CD as an Automated Quality Co...
OSCamp Kubernetes 2024 | A Tester's Guide to CI_CD as an Automated Quality Co...OSCamp Kubernetes 2024 | A Tester's Guide to CI_CD as an Automated Quality Co...
OSCamp Kubernetes 2024 | A Tester's Guide to CI_CD as an Automated Quality Co...
 
Motivation and Theory Maslow and Murray pdf
Motivation and Theory Maslow and Murray pdfMotivation and Theory Maslow and Murray pdf
Motivation and Theory Maslow and Murray pdf
 
Genesis part 2 Isaiah Scudder 04-24-2024.pptx
Genesis part 2 Isaiah Scudder 04-24-2024.pptxGenesis part 2 Isaiah Scudder 04-24-2024.pptx
Genesis part 2 Isaiah Scudder 04-24-2024.pptx
 
Philippine History cavite Mutiny Report.ppt
Philippine History cavite Mutiny Report.pptPhilippine History cavite Mutiny Report.ppt
Philippine History cavite Mutiny Report.ppt
 
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
 
Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...
Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...
Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...
 
George Lever - eCommerce Day Chile 2024
George Lever -  eCommerce Day Chile 2024George Lever -  eCommerce Day Chile 2024
George Lever - eCommerce Day Chile 2024
 
Russian Call Girls in Kolkata Vaishnavi 🤌 8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Vaishnavi 🤌  8250192130 🚀 Vip Call Girls KolkataRussian Call Girls in Kolkata Vaishnavi 🤌  8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Vaishnavi 🤌 8250192130 🚀 Vip Call Girls Kolkata
 
Open Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdf
Open Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdfOpen Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdf
Open Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdf
 
Work Remotely with Confluence ACE 2.pptx
Work Remotely with Confluence ACE 2.pptxWork Remotely with Confluence ACE 2.pptx
Work Remotely with Confluence ACE 2.pptx
 
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
 
The 3rd Intl. Workshop on NL-based Software Engineering
The 3rd Intl. Workshop on NL-based Software EngineeringThe 3rd Intl. Workshop on NL-based Software Engineering
The 3rd Intl. Workshop on NL-based Software Engineering
 
SBFT Tool Competition 2024 - CPS-UAV Test Case Generation Track
SBFT Tool Competition 2024 - CPS-UAV Test Case Generation TrackSBFT Tool Competition 2024 - CPS-UAV Test Case Generation Track
SBFT Tool Competition 2024 - CPS-UAV Test Case Generation Track
 
LANDMARKS AND MONUMENTS IN NIGERIA.pptx
LANDMARKS  AND MONUMENTS IN NIGERIA.pptxLANDMARKS  AND MONUMENTS IN NIGERIA.pptx
LANDMARKS AND MONUMENTS IN NIGERIA.pptx
 
Simulation-based Testing of Unmanned Aerial Vehicles with Aerialist
Simulation-based Testing of Unmanned Aerial Vehicles with AerialistSimulation-based Testing of Unmanned Aerial Vehicles with Aerialist
Simulation-based Testing of Unmanned Aerial Vehicles with Aerialist
 
Microsoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AIMicrosoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AI
 
call girls in delhi malviya nagar @9811711561@
call girls in delhi malviya nagar @9811711561@call girls in delhi malviya nagar @9811711561@
call girls in delhi malviya nagar @9811711561@
 
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdfCTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
 
Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...
Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...
Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...
 

How to put an annotation in html

  • 1. www.sti-innsbruck.at© Copyright 2013 STI INNSBRUCK www.sti-innsbruck.at “How to put an annotation in HTML?” Ioannis Stavrakantonakis
  • 2. www.sti-innsbruck.at Outline 2 • Research question • ITS 2.0 • NIF • What about Microdata? • Demo • References
  • 3. www.sti-innsbruck.at Research question 3 We want to annotate Springfield with an URI to make sure that the computer understands we mean the Springfield in Massachusetts. HTML: <p>It is well known, that Springfield has mild summers and short, but hard winters.</p> HTML with annotation (something like that): <p>It is well known, that <span about="http://sws.geonames.org/4951788/">Springfield</span> has mild summers and short, but hard winters.</p> We don't want to add whole triples, but just annotate the HTML and say "this element refers to the following URI". From: Denny Vrandečić Sent: Wednesday, April 24, 2013 1:59 PM To: semantic-web at W3C Subject: How to put an annotation in HTML?
  • 4. www.sti-innsbruck.at ITS 2.0 4 • International Tag Set (ITS) [2] – enhances the foundation to integrate automated processing of human language into core Web technologies; – focuses on HTML, XML-based formats in general, and can leverage processing based on the XML Localization Interchange File Format (XLIFF), as well as the Natural Language Processing Interchange Format (NIF); – is a technology to add metadata to Web content, for the benefit of localization, language technologies, and internationalization (see more in [5] regarding localization (l10n) and internationalization (i18n))
  • 5. www.sti-innsbruck.at ITS 2.0 5 • Potential Users of ITS [2]: – Schema developers starting a schema from the ground up (proposals for attribute and element names to be included in their new schema) – Schema developers working with an existing schema (should check whether their schemas support the markup proposed in this specification, and, where appropriate, add the markup proposed here to their schema) – Vendors of content-related tools (e.g. tools for authoring, translation, etc.) – Content producers (may be used by them to mark up specific bits of content) – Machine Translation Systems – Text Analytics (automatically generated metadata for improving localization, data integration or knowledge management workflows) – Localization Workflow Managers
  • 6. www.sti-innsbruck.at ITS 2.0 6 The Text Analysis use case: •This data category is used to annotate content with lexical or conceptual information for the purpose of contextual disambiguation. •3 pieces of annotation: – Confidence: The confidence of the agent (that produced the annotation) in its own computation – XSD double data type (e.g. 0.63) – Entity type: The type of entity, or concept class of the text analysis target – IRI (e.g. http://nerd.eurecom.fr/ontology#Location [8]) – Entity identifier: A unique identifier for the text analysis target – IRI or String (e.g. http://dbpedia.org/page/Innsbruck or the identifier for “Capital” from Wordnet [9])
  • 7. www.sti-innsbruck.at ITS 2.0 7 Rendered HTML: HTML with ITS metadata: <html xmlns="http://www.w3.org/1999/xhtml"> <body> <h2 translate="yes">Welcome to <span its-ta-ident-ref="http://dbpedia.org/page/Innsbruck" its-within-text="yes" translate="no">Innsbruck</span> in <b translate="no" its-within-text="yes">Austria</b>!</h2> </body> </html>
  • 8. www.sti-innsbruck.at ITS 2.0 8 • Conversion to NIF [2]: – Convert XML or HTML documents that contain ITS metadata to the RDF-based format based on NIF. The conversion results in RDF. – The conversion algorithm to generate NIF consists of seven steps. The output of the algorithm uses the ITS RDF ontology [7]. – The conversion to NIF is a possible basis for a natural language processing (NLP) application that creates, for example, named entity annotations. – To integrate the RDF annotations into the original input document is given in [6] (NIF2ITS).
  • 9. www.sti-innsbruck.at NLP Interchange Format (NIF) 9 • NIF is an RDF/OWL-based format that aims to achieve interoperability between Natural Language Processing (NLP) tools, language resources and annotations. • NIF will soon be a normative part of the ITS 2.0 • NIF and its community project NLP2RDF serve as an umbrella project liaising with other community of practices, especially: – LOD2 FP7 EU project – MultilingualWeb-LT Working Group – Best Practices for Multilingual Linked Open Data Community Group – Ontology-Lexica Community Group – Named Entity Recognition and Disambiguation (NERD) – Ontologies of Linguistic Annotation (OLiA) • University of Leipzig
  • 10. www.sti-innsbruck.at How is it different to Microdata annotations? 10 What is the latitude and longitude of the <span ?=?>Empire State Building</span>? <span its-ta-ident-ref="http://live.dbpedia.org/page/Empire_State_Building"> Empire State Building</span> <div itemscope itemtype="http://schema.org/Place"> What is the latitude and longitude of the <span itemprop="name">Empire State Building</span>? </div> Microdata + schema.org ITS2.0 + dbpedia resource
  • 11. www.sti-innsbruck.at How is it different to Microdata annotations? 11 What is the latitude and longitude of the <span ?=?>Empire State Building</span>? Semantics of ITS2.0 annotations: Specify entity identifiers (IRIs) for the presented information item. Semantics of Microdata annotations: Specify the type of information that is presented. Microdata ITS2.0
  • 12. www.sti-innsbruck.at Hands-on / Demo 12 • HTML with ITS metadata • Transformation of HTML with ITS metadata to NIF Notes: • Based on the XSLT files shared by the W3C Working Group member Felix Sasaki (@fsasaki) [4] • The Java internal XSLTC processor fails to compile the XSLTs. Use Saxon 9 HE.
  • 13. www.sti-innsbruck.at References [1] W3C semantic web list thread: http://lists.w3.org/Archives/Public/semantic-web/2013Apr/0218.html [2] ITS 2.0 W3C working draft: http://www.w3.org/TR/its20/ [3] NIF Core Ontology: http://persistence.uni-leipzig.org/nlp2rdf/ [4] Felix Sasaki ITS 2.0 extractor (github): https://github.com/fsasaki/its20-extractor [5] W3C, Localization vs. Internationalization: http://www.w3.org/International/questions/qa-i18n [6] W3C, Conversion NIF2ITS: http://www.w3.org/TR/its20/#nif-backconversion [7] W3C, ITS 2.0 / RDF Ontology: http://www.w3.org/2005/11/its/rdf-content/its-rdf.html [8] Named Entity Recognition and Disambiguation (NERD): http://nerd.eurecom.fr/ontology [9] WordNet Search 3.1: http://wordnetweb.princeton.edu/perl/webwn 13