Semantic Web Technologies: Annotation
Presented by: AlbaraAbdalkhalig Mansour, Sudan University, Web Technology
E-mail: Brra51@hotmail.com  Tel: 00249121200239

Definition
Annotations are comments, notes, explanations, or other types of external remarks that can be attached to a Web document or a selected part of the document. As they are external, it is possible to annotate any Web document independently, without needing to edit that document. From the technical point of view, annotations are usually seen as metadata, as they give additional information about an existing piece of data.
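
As a rough illustration of annotations as external metadata, the sketch below keeps a note outside the document and points at a character range inside it. The `Annotation` class, its field names, and the example URL are hypothetical and chosen only for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Annotation:
    """A stand-off annotation: stored outside the document it describes."""
    target_url: str  # document being annotated (hypothetical URL below)
    start: int       # character offset where the annotated span begins
    end: int         # character offset just past the annotated span
    body: str        # the comment, note, or explanation


def annotated_text(document: str, ann: Annotation) -> str:
    """Return the part of the document that the annotation points at."""
    return document[ann.start:ann.end]


# The document itself is never modified; the annotation only refers to it.
page = "GATE has been in development since 1995."
note = Annotation("http://example.org/page", 0, 4, "Text engineering framework")
print(annotated_text(page, note), "->", note.body)
```

Because the note only stores offsets into the page, the same document can carry any number of independent annotations without being edited.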

What is annotation?
People make notes to themselves in order to preserve ideas that arise during a variety of activities. The purpose of these notes is often to summarize, criticize, or emphasize specific phrases or events. Semantic annotation tags instance data and maps it to the classes of an ontology.

Why use annotation?
Having the world's knowledge at one's fingertips seems possible: the Internet is the platform for information. Unfortunately, most of that information is provided in an unstructured and non-standardized form. Annotations supply some of the missing structure.

Annotation methods
(1) Manual
(2) Semi-automatic
(3) Automatic

(1) Manual Annotation
Manual annotation is the transformation of existing syntactic resources into interlinked knowledge structures that represent relevant underlying information. It is an expensive process, and it often overlooks that multiple perspectives on a data source, each requiring its own ontology, can help to support the needs of different users.

(2) Semi-automatic Annotation
Semi-automatic annotation systems rely on human intervention at some point in the annotation process. The platforms vary in their architecture, information extraction tools and methods, initial ontology, amount of manual work required to perform annotation, performance, and other features, such as storage management.

(3) Automatic Annotation
The fully automatic creation of semantic annotations is an unsolved problem. Automatically annotating the natural-language sentences in Web pages is a daunting task, so annotation is often done manually, or semi-automatically using handwritten rules.

Semantic Annotation Concerns
- Scale, volume: existing and new documents on the Web
- Manual annotation: expensive (economic cost, time); subject to personal motivation
- Schema complexity
- Storage: support for multiple ontologies; within or external to the source document?; knowledge base refinement
- Access: how are annotations accessed? API, custom UI, plug-ins

TECHNICAL SOLUTION
2.1 Annotation of text
- GATE
- KIM
2.2 Multimedia annotation
- Tools for multimedia annotation

Annotation of text
Many systems apply manually created rules or wrappers that try to recognize patterns for the annotations. Some systems learn how to annotate with the help of the user. Supervised systems learn how to annotate from a training set that was manually created beforehand. Semi-automatic approaches often apply information extraction technology, which analyzes natural language to pull out the information the user is interested in.
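
The rule-based approach described above can be sketched with a couple of handwritten patterns. This is a minimal illustration and not how GATE or KIM are implemented; the two patterns, the labels, and the `annotate` function are assumptions made for the example.

```python
import re

# Hand-written rules: each pattern is paired with the label it assigns.
# Both the patterns and the labels are illustrative assumptions.
RULES = [
    (re.compile(r"\b(?:19|20)\d{2}\b"), "Date"),
    (re.compile(r"\b\d+(?:\.\d+)?%"), "Percent"),
]


def annotate(text):
    """Return (start, end, label, matched_text) tuples, ordered by position."""
    found = []
    for pattern, label in RULES:
        for match in pattern.finditer(text):
            found.append((match.start(), match.end(), label, match.group()))
    return sorted(found)


for annotation in annotate("Sales grew 12.5% after 1995."):
    print(annotation)
```

Rules like these are cheap to write but brittle, which is one reason the supervised and semi-automatic approaches mentioned above exist.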

A Walk-Through Example: GATE
GATE is a tool for:
- scientists performing experiments that involve processing human language;
- companies developing applications with language processing components;
- teachers and students of courses about language and language computation.
GATE comprises an architecture, framework (or SDK) and development environment, and has been in development since 1995 in the Sheffield NLP group. The system has been used for many language processing projects, in particular for Information Extraction in many languages. GATE is funded by the EPSRC and the EU.

KIM platform
- KIM = Knowledge and Information Management
- developed by the semantic technology lab "Ontotext"
- based on GATE

KIM platform
KIM performs information extraction (IE) based on an ontology and a massive knowledge base.

KIM KB
The KIM KB consists of more than 80,000 entities (50,000 locations, 8,400 organization instances, etc.). Each location has geographic coordinates and several aliases (usually including English, French, Spanish, and sometimes the local transcription of the location name) as well as co-positioning relations (e.g. subRegionOf). The organizations have locatedIn relations to the corresponding Country instances. The additionally imported information about the companies consists of a short description, URL, reference to an industry sector, reported sales, net income, and number of employees.
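
To make the shape of such a knowledge base concrete, here is a toy in-memory sketch. Only the relation names `subRegionOf` and `locatedIn` come from the slide; the dictionary layout, the entities, and the helper functions are hypothetical and do not reflect KIM's actual storage or API.

```python
# Tiny in-memory stand-in for a knowledge base. The layout and the
# entries are made up for illustration and are not KIM data.
KB = {
    "Europe": {"type": "Continent", "aliases": ["Europe"], "subRegionOf": None},
    "Bulgaria": {"type": "Country", "aliases": ["Bulgaria", "Bulgarie"],
                 "subRegionOf": "Europe"},
    "Sofia": {"type": "City", "aliases": ["Sofia", "Sofiya"],
              "subRegionOf": "Bulgaria"},
    "ExampleCorp": {"type": "Company", "aliases": ["ExampleCorp"],
                    "locatedIn": "Bulgaria"},
}


def enclosing_regions(location):
    """Follow subRegionOf links upward and list every enclosing region."""
    regions = []
    parent = KB[location].get("subRegionOf")
    while parent is not None:
        regions.append(parent)
        parent = KB[parent].get("subRegionOf")
    return regions


def find_by_alias(name):
    """Return the identifiers of all entities that carry the given alias."""
    return [key for key, entity in KB.items() if name in entity["aliases"]]


print(enclosing_regions("Sofia"))
print(find_by_alias("Bulgarie"))
```

Storing aliases per entity is what lets a mention in another language resolve to the same instance.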

KIM platform
The KIM platform provides a novel infrastructure and services for:
- automatic semantic annotation,
- indexing,
- retrieval of unstructured and semi-structured content.

KIM platform
The most direct applications of KIM are:
- Generation of metadata for the Semantic Web, which allows hyper-linking and advanced visualization and navigation.
- Knowledge Management, enhancing the efficiency of the existing indexing, retrieval, classification and filtering applications.

KIM platform
Automatic semantic annotation is seen as a named-entity recognition (NER) and annotation process. Traditional flat NE type sets consist of several general types (such as Organization, Person, Date, Location, Percent, Money). In KIM the NE type is specified by reference to an ontology. The semantic descriptions of entities and the relations between them are kept in a knowledge base (KB) encoded in the KIM ontology and residing in the same semantic repository. Thus KIM provides for each entity reference in the text (i) a link (URI) to the most specific class in the ontology and (ii) a link to the specific instance in the KB. Each extracted NE is linked to its specific type information (thus Arabian Sea would be identified as a Sea, rather than as the traditional, more general Location).
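
The two links described above, one to the most specific class and one to the instance, can be sketched as follows. The class hierarchy, the `example.org` URIs, and the function names are assumptions made for illustration and are not taken from the KIM ontology.

```python
# Hypothetical fragment of a class hierarchy: child class -> parent class.
SUBCLASS_OF = {
    "Sea": "WaterRegion",
    "WaterRegion": "Location",
    "Location": "Entity",
}

# Hypothetical instance data: instance URI -> most specific class.
INSTANCE_CLASS = {"http://example.org/kb#ArabianSea": "Sea"}
LABEL_TO_INSTANCE = {"Arabian Sea": "http://example.org/kb#ArabianSea"}


def class_chain(cls):
    """Return the class followed by all its ancestors, most specific first."""
    chain = [cls]
    while chain[-1] in SUBCLASS_OF:
        chain.append(SUBCLASS_OF[chain[-1]])
    return chain


def annotate_entity(mention):
    """Link a mention to (i) its most specific class and (ii) its instance."""
    instance = LABEL_TO_INSTANCE[mention]
    cls = INSTANCE_CLASS[instance]
    return {"instance": instance, "class": cls, "ancestors": class_chain(cls)[1:]}


print(annotate_entity("Arabian Sea"))
```

The general type Location is still recoverable from the ancestor chain, so nothing is lost compared with a flat type set.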

MULTIMEDIA ANNOTATION

Multimedia Annotation
Different levels of annotations:
- Metadata: often technical metadata
- Content level: semantic annotations (keywords, domain ontologies, free text)
- Multimedia level: low-level annotations (visual descriptors, such as dominant color)
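
As an example of a low-level visual descriptor, the sketch below estimates a dominant color by coarse quantization and counting. It is a simple stand-in for the idea, not a standardized descriptor; the `bucket` width and the toy pixel data are assumptions.

```python
from collections import Counter


def dominant_color(pixels, bucket=64):
    """Return the most frequent colour after coarse quantisation.

    pixels: iterable of (r, g, b) tuples with values 0..255.
    bucket: width of each quantisation bin per channel (an assumed setting).
    """
    quantised = Counter(
        (r // bucket * bucket, g // bucket * bucket, b // bucket * bucket)
        for r, g, b in pixels
    )
    color, _count = quantised.most_common(1)[0]
    return color


# Toy "image": mostly reddish pixels with a few blue ones.
image = [(250, 10, 10)] * 6 + [(240, 20, 5)] * 3 + [(10, 10, 250)] * 4
print(dominant_color(image))
```

Quantizing first groups near-identical shades, so the two slightly different reds count as one color.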

Metadata
Refers to information about technical details:
- creation details: creator, creationDate, …
- camera details: settings, resolution, format, EXIF
- access rights: administrated by the OS (owner, access rights, …)
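
The part of this metadata that the operating system administers can be read with the Python standard library, as sketched below. The `technical_metadata` function and its dictionary keys are hypothetical names; camera details such as EXIF would need an image library and are not shown.

```python
import os
import stat
import tempfile
import time


def technical_metadata(path):
    """Collect file-system metadata that the OS keeps about a file."""
    info = os.stat(path)
    return {
        "size_bytes": info.st_size,
        "modified": time.strftime("%Y-%m-%d", time.localtime(info.st_mtime)),
        "permissions": stat.filemode(info.st_mode),  # e.g. '-rw-r--r--'
        "owner_id": info.st_uid,  # numeric user id on POSIX systems
    }


# Create a throwaway file standing in for an image, inspect it, remove it.
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(b"fake image bytes")
meta = technical_metadata(tmp.name)
os.remove(tmp.name)
print(meta)
```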

Content Level
- Describes what is depicted and directly perceivable by a human
- Usually provided manually: keywords/tags, classification of content
- Seldom generated automatically: scene classification, object detection
- Different types of annotations: global vs. local, different semantic levels

Global vs. Local Annotations
Global annotations
- most widely used
- flickr: tagging is only global
- organization within categories
- free-text annotations
- provide information about the content as a whole
- no detailed information
Local annotations
- are less supported
- e.g. flickr and PhotoStuff allow annotations of regions
- especially important for semantic image understanding
- allow relations to be extracted
- provide a more complete view of the scene
- provide information about different regions and about the depicted relations and arrangements of objects
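
The difference between the two kinds of annotation can be sketched with a small data structure in which a missing region means "global". The class names, the bounding-box convention, and the sample photo are hypothetical and do not reproduce the data model of flickr or PhotoStuff.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple

BBox = Tuple[int, int, int, int]  # (x, y, width, height) in pixels


@dataclass
class ImageAnnotation:
    label: str
    region: Optional[BBox] = None  # None means the label applies globally


@dataclass
class AnnotatedImage:
    name: str
    annotations: List[ImageAnnotation] = field(default_factory=list)

    def global_labels(self):
        """Labels that describe the image as a whole."""
        return [a.label for a in self.annotations if a.region is None]

    def labels_at(self, x, y):
        """Labels of all local annotations whose box contains point (x, y)."""
        hits = []
        for a in self.annotations:
            if a.region is None:
                continue
            bx, by, w, h = a.region
            if bx <= x < bx + w and by <= y < by + h:
                hits.append(a.label)
        return hits


photo = AnnotatedImage("beach.jpg", [
    ImageAnnotation("holiday"),
    ImageAnnotation("dog", (40, 60, 100, 80)),
    ImageAnnotation("ball", (120, 100, 30, 30)),
])
print(photo.global_labels(), photo.labels_at(130, 110))
```

Overlapping boxes, as for the dog and the ball here, are the raw material from which spatial relations between objects could be derived.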

Semantic Levels
Free-text annotations cover large aspects, but are less appropriate for sharing, organization and retrieval.
Free-text annotations
- probably most natural for humans, but provide the least formal semantics
Tagging provides light-weight semantics
- only useful if a fixed vocabulary is used
- allows some simple inference of related concepts by tag analysis (clustering)
- no formal semantics, but provides benefits due to the fixed vocabulary
- requires more effort from the user
Ontologies
- provide syntax and semantics to define complex domain vocabularies
- allow for the inference of additional knowledge
- leverage interoperability
- powerful way of semantic annotation, but hardly comprehensible to "normal users"
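
The "simple inference of related concepts by tag analysis" mentioned for tagging can be approximated by counting which tags occur together. The sketch below uses plain co-occurrence counts rather than a real clustering algorithm; the sample tag sets and the `min_count` threshold are assumptions.

```python
from collections import Counter
from itertools import combinations

# Toy data: the set of tags attached to each of four photos.
photos = [
    {"beach", "sea", "sunset"},
    {"beach", "sea", "dog"},
    {"city", "night"},
    {"sea", "sunset"},
]


def cooccurrence(tag_sets):
    """Count how often each unordered pair of tags appears on the same item."""
    counts = Counter()
    for tags in tag_sets:
        for a, b in combinations(sorted(tags), 2):
            counts[(a, b)] += 1
    return counts


def related(tag, tag_sets, min_count=2):
    """Tags that appear together with `tag` at least `min_count` times."""
    result = set()
    for (a, b), n in cooccurrence(tag_sets).items():
        if n >= min_count and tag in (a, b):
            result.add(b if a == tag else a)
    return sorted(result)


print(related("sea", photos))
```

This only works when users agree on the vocabulary: spelling variants of the same tag would be counted as unrelated concepts.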

Tools
Web-based tools:
- flickr
- riya

flickr
- Web 2.0 application
- tagging photos globally
- add comments to image regions marked by a bounding box
- large user community and tagging allow for easy sharing of images
- partly fixed vocabularies evolved, e.g. geo-tagging

Annotation seminar

  • 1. Semantic Web Technologies Annotation Presented By : AlbaraAbdalkhalig Mansour Sudan University-Web Technology E-mail : Brra51@hotmail.com Tel : 00249121200239
  • 2. Definition : Annotations are comments, notes, explanations, or other types of external remarks that can be attached to a Web document or a selected part of the document. As they are external, it is possible to annotate any Web document independently, without needing to edit that document. From the technical point of view, annotations are usually seen as metadata, as they give additional information about an existing piece of data. 2
  • 3. What is annotation? People make notes to themselves in order to preserve ideas that arise during a variety of activities. The purpose of these notes is often to summarize, criticize, or emphasize specific phrases or events. Semantic annotations are to tag ontology class instance data and map it into ontology classes. 3
  • 4. Why use annotation? To have the world knowledge at one's finger tips seems possible. The Internet is the platform for information. Unfortunately most of the information is provided in an unstructured and non-standardized form. 4
  • 5. Annotation methods Manually Semi-automatically Automatically 5
  • 6. (1) Manually Manual annotation is the transformation of existing syntactic resources into interlinked knowledge structures that represent relevant underlying information. Manual annotation is an expensive process, and often does not consider that multiple perspectives of a data source, requiring multiple ontologies, can be beneficial to support the needs of different users. 6
  • 7. (2) Semi-automatic Annotation Semi-automatic annotation systems rely on human intervention at some point in the annotation process. The platforms vary in their architecture, information extraction tools and methods, initial ontology, amount of manual work required to perform annotation, performance and other features, such as storage management. 7
  • 8. (3) Automatic Annotation The fully automatic creation of semantic annotations is an unsolved problem. Automatic semantic annotation for the natural language sentences in these pages is a daunting task and we are often forced to do it manually or semi-automatically using handwritten rules 8
  • 9. Semantic Annotation Concerns Scale, Volume Existing & new documents on the Web Manual annotation Expensive – economic, time Subject to personal motivation Schema Complexity Storage support for multiple ontologies within or external to source document? Knowledge base refinement Access - How are annotations accessed? API, custom UI, plug-ins 9
  • 11.
  • 12. GATE
  • 13.
  • 14. Tools for multimedia annotation11
  • 15. Annotation of text Many systems apply rules or wrappers that were manually created that try to recognize patterns for the annotations. Some systems learn how to annotate with the help of the user. Supervised systems learn how to annotate from a training set that was manually created beforehand. Semi-automatic approaches often apply information extraction technology, which analyzes natural language for pulling out information the user is interested in. 12
  • 16. A Walk-Through Example: GATE GATEis a tool for : scientists performing experiments that involve processing human language; companies developing applications with language processing components; teachers and students of courses about language and language computation. GATE comprises an architecture, framework (or SDK) and development environment, and has been in development since 1995 in the Sheffield NLP group. The system has been used for many language processing projects; in particular for Information Extraction in many languages. GATE is funded by the EPSRC and the EU. 13
  • 17. KIM platform KIM = Knowledge and Information Management developed by semantic technology lab “Ontotext“ based on GATE 14
  • 18. KIM platform KIM performs IE based on an ontology and a massive knowledge base. 15
  • 19. KIM KB KIM KB consists of above 80,000 entities (50,000 locations, 8,400 organization instances, etc.) Each location has geographic coordinates and several aliases (usually including English, French, Spanish, and sometimes the local transcription of the location name) as well as co-positioning relations (e.g. subRegionOf.) The organizations have locatedInrelations to the corresponding Countryinstances. The additionally imported information about the companies consists of short description, URL, reference to an industry sector, reported sales, net income,and number of employees. 16
  • 20. KIM platform The KIM platform provides a novel infrastructure and services for: automatic semantic annotation, indexing, retrieval of unstructured and semi-structured content. 17
  • 21. KIM platform The most direct applications of KIM are: Generation of meta-data for the Semantic Web, which allows hyper-linking and advanced visualization and navigation. Knowledge Management, enhancing the efficiency of the existing indexing, retrieval, classification and filtering applications. 18
  • 22. KIM platform: The automatic semantic annotation is seen as a named-entity recognition (NER) and annotation process. The traditional flat NE type sets consist of several general types (such as Organization, Person, Date, Location, Percent, Money). In KIM the NE type is specified by reference to an ontology. The semantic descriptions of entities and the relations between them are kept in a knowledge base (KB) encoded in the KIM ontology and residing in the same semantic repository. Thus, for each entity reference in the text, KIM provides (i) a link (URI) to the most specific class in the ontology and (ii) a link to the specific instance in the KB. Each extracted NE is linked to its specific type information (thus Arabian Sea would be identified as Sea, instead of the traditional Location). 19
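The two links that KIM attaches to each entity reference can be sketched as a small Python data structure. The namespaces, the toy knowledge base, and the string-matching lookup below are assumptions for illustration only; the real KIM ontology URIs and its recognition pipeline are not shown in these slides.

```python
from dataclasses import dataclass

# Hypothetical namespaces; the actual KIM ontology and KB URIs may differ.
ONTO = "http://example.org/ontology#"
KB = "http://example.org/kb#"


@dataclass(frozen=True)
class SemanticAnnotation:
    start: int          # character offset where the reference begins
    end: int            # character offset just past the reference
    class_uri: str      # (i) link to the most specific ontology class
    instance_uri: str   # (ii) link to the specific instance in the KB


# Toy knowledge base: alias -> (most specific class, instance identifier).
TOY_KB = {
    "Arabian Sea": ("Sea", "Sea.ArabianSea"),
    "Sudan": ("Country", "Country.Sudan"),
}


def annotate_entities(text):
    """Annotate every occurrence of a known alias with both links."""
    found = []
    for alias, (cls, inst) in TOY_KB.items():
        pos = text.find(alias)
        while pos != -1:
            found.append(
                SemanticAnnotation(pos, pos + len(alias), ONTO + cls, KB + inst)
            )
            pos = text.find(alias, pos + len(alias))
    return sorted(found, key=lambda a: a.start)


text = "Ships sail from Sudan to the Arabian Sea."
result = annotate_entities(text)
for ann in result:
    print(text[ann.start:ann.end], ann.class_uri, ann.instance_uri)
```

Note that the Arabian Sea reference carries the class Sea rather than a generic Location, which is the distinction the slide draws against flat NE type sets.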
  • 24. Multimedia Annotation: Different levels of annotations. Metadata: often technical metadata. Content level: semantic annotations (keywords, domain ontologies, free-text). Multimedia level: low-level annotations (visual descriptors, such as dominant color). 21
  • 25. Metadata: Refers to information about technical details. Creation details: creator, creationDate, … Camera details: settings, resolution, format, EXIF. Access rights: administrated by the OS (owner, access rights, …). 22
  • 26. Content Level: Describes what is depicted and directly perceivable by a human. Usually provided manually: keywords/tags, classification of content. Seldom generated automatically: scene classification, object detection. Different types of annotations: global vs. local, different semantic levels. 23
  • 27. Global vs. Local Annotations: Global annotations are the most widely used (flickr: tagging is only global; organization within categories; free-text annotations). They provide information about the content as a whole, with no detailed information. Local annotations are less supported (e.g. flickr and PhotoStuff allow annotations of regions). They are especially important for semantic image understanding: they allow relations to be extracted and provide a more complete view of the scene, with information about different regions and about the depicted relations and arrangements of objects. 24
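The difference between global and local annotations can be made concrete with a small Python sketch. The class names, the bounding-box layout, and the sample image are hypothetical and do not reflect the data model of flickr or PhotoStuff.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple

# Assumed region format: (x, y, width, height) in pixels.
BoundingBox = Tuple[int, int, int, int]


@dataclass
class ImageAnnotation:
    label: str
    region: Optional[BoundingBox] = None  # None means the whole image

    @property
    def is_global(self) -> bool:
        return self.region is None


@dataclass
class AnnotatedImage:
    filename: str
    annotations: List[ImageAnnotation] = field(default_factory=list)

    def global_labels(self) -> List[str]:
        return [a.label for a in self.annotations if a.is_global]

    def local_labels(self) -> List[str]:
        return [a.label for a in self.annotations if not a.is_global]


photo = AnnotatedImage("beach.jpg")
photo.annotations.append(ImageAnnotation("holiday"))                   # global
photo.annotations.append(ImageAnnotation("boat", (40, 120, 200, 80)))  # local
photo.annotations.append(ImageAnnotation("sea", (0, 100, 640, 380)))   # local
print(photo.global_labels(), photo.local_labels())
```

Only the local annotations carry enough information to reason about arrangements of objects, for example that the boat region lies within the sea region.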
  • 28. Semantic Levels: Free-text annotations cover large aspects, but are less appropriate for sharing, organization and retrieval; they are probably the most natural for humans, but provide the least formal semantics. Tagging provides light-weight semantics: it is only useful if a fixed vocabulary is used; it allows some simple inference of related concepts by tag analysis (clustering); it has no formal semantics, but provides benefits due to the fixed vocabulary; it requires more effort from the user. Ontologies provide syntax and semantics to define complex domain vocabularies, allow for the inference of additional knowledge, and leverage interoperability; they are a powerful way of semantic annotation, but hardly comprehensible by “normal users”. 25
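The "simple inference of related concepts by tag analysis" mentioned on this slide can be approximated by counting tag co-occurrence. The sketch below uses plain pair counting with a threshold rather than a full clustering algorithm, and the photo tag sets and the `min_count` parameter are invented for illustration.

```python
from collections import Counter
from itertools import combinations

# Hypothetical tag sets, one per photo.
photos = [
    {"beach", "sea", "sunset"},
    {"beach", "sea"},
    {"city", "night"},
    {"sea", "sunset"},
]


def cooccurrence(tag_sets):
    """Count how often each unordered pair of tags appears on the same photo."""
    counts = Counter()
    for tags in tag_sets:
        for a, b in combinations(sorted(tags), 2):
            counts[(a, b)] += 1
    return counts


def related(tag, tag_sets, min_count=2):
    """Return tags that co-occur with `tag` on at least `min_count` photos."""
    out = set()
    for (a, b), n in cooccurrence(tag_sets).items():
        if n >= min_count:
            if a == tag:
                out.add(b)
            elif b == tag:
                out.add(a)
    return sorted(out)


print(related("sea", photos))
```

This kind of analysis only works when users draw on a shared vocabulary, which is the slide's point about fixed vocabularies.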
  • 29. Tools: Web-based tools: flickr, riya. 26
  • 30. flickr: Web 2.0 application; tagging photos globally; comments can be added to image regions marked by a bounding box; the large user community and tagging allow for easy sharing of images; partly fixed vocabularies have evolved, e.g. Geo-Tagging. 27
  • 31. riya: Similar to flickr in functionality; adds automatic annotation features. Face recognition: mark faces in photos, associate a name, train the system, automatic recognition of the person in the future. 28
  • 33. References: Further reading: B. Popov, A. Kiryakov, A. Kirilov, D. Manov, D. Ognyanoff, M. Goranov: “KIM – Semantic Annotation Platform”, 2003. GATE: http://gate.ac.uk/overview.html. M-OntoMat-Annotizer: http://www.acemedia.org/aceMedia/results/software/m-ontomat-annotizer.html. KIM platform: http://www.ontotext.com/kim/. ALIPR: http://www.alipr.com. Wikipedia links: http://en.wikipedia.org/wiki/Automatic_image_annotation, http://en.wikipedia.org/wiki/Games_with_a_purpose, http://en.wikipedia.org/wiki/General_Architecture_for_Text_Engineering. 30

Editor's Notes

  1. KIM provides a Knowledge and Information Management (KIM) infrastructure and services for automatic semantic annotation, indexing, and retrieval of unstructured and semi-structured content. Within the process of annotation, KIM also performs ontology population. As a baseline, KIM analyzes texts and recognizes references to entities (such as persons, organizations, locations, dates). Then it tries to match each reference with a known entity that has a unique URI and description in the knowledge base. Alternatively, a new URI and entity description are automatically generated. Finally, the reference in the document is annotated with the URI of the entity. This process, as well as its result, is what KIM offers for semantic annotation. This sort of meta-data is later used for semantic indexing, retrieval, visualization, and automatic hyper-linking of documents. KIM is a platform which offers a server, a web user interface, and an Internet Explorer plug-in. KIM is equipped with an upper-level ontology (KIMO) of about 250 classes and 100 properties. Further, a knowledge base (KIM KB), pre-populated with up to 200,000 entity descriptions, is bundled with KIM. In terms of underlying technology, KIM uses GATE, Sesame, and Lucene.
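The match-or-create step described in this note (ontology population) can be sketched in Python as follows. The namespace, the `KnowledgeBase` class, and the URI-minting scheme are hypothetical stand-ins; KIM itself stores entities in a semantic repository, and its actual matching logic is not described here.

```python
import re

KB_NS = "http://example.org/kb#"  # hypothetical namespace, not KIM's


class KnowledgeBase:
    """Toy alias index standing in for a semantic repository."""

    def __init__(self):
        self._by_alias = {}

    def add(self, alias, uri):
        self._by_alias[alias.lower()] = uri

    def resolve(self, mention):
        """Return (uri, created): reuse a known entity or mint a new URI."""
        key = mention.lower()
        if key in self._by_alias:
            return self._by_alias[key], False
        # Assumed minting scheme: replace non-word characters with "_".
        uri = KB_NS + re.sub(r"\W+", "_", mention.strip())
        self._by_alias[key] = uri
        return uri, True


kb = KnowledgeBase()
kb.add("Ontotext", KB_NS + "Ontotext")

known = kb.resolve("ontotext")             # matches the existing entity
new = kb.resolve("Sheffield NLP Group")    # unknown, so a URI is generated
again = kb.resolve("Sheffield NLP Group")  # now resolves to the same URI
print(known, new, again)
```

The second lookup of the same mention returns the previously generated URI, so the knowledge base grows as a side effect of annotation.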