2. „Man muss das Unmögliche tun um das Mögliche
zu erreichen.“ - Hermann Hesse.
3. Web documents without meta-data
● Plain HTML format without metadata
description→ Problem: Search engines need to
extract useful information trough parsing tools
and also using natural language processing
4. Web documents with metadata
● Better documents, describing information about
keywords, author, kontext
Problem: Even if the header of a web document
is annotated, the content remains unknown
5. Web documents with structured data
● Content of web documents is annotated and
can be used to be recognized by machines
A topological query is possible which has
access to the nodes of structured data.
Any parsing or less parsing is needed
6. Semantic Web
● Entities are described via URIs
in Dublin Core Format, RDFa, RichSnippet
There is a tool from Google to annotate
semantics for a web document:
http://www.google.com/webmasters/tools/richsni
ppets