Semantic Search for Media Portals - Presentation Transcript
Semantische Suche in
Medienportalen
Dr. Sebastian Schaffert
Salzburg Research / Salzburg NewMediaLab
sebastian.schaffert@salzburgresearch.at
1
Introduction
2
Sebastian Schaffert
• Doktorat in Informatik,
Uni München
• Senior Researcher bei Salzburg
Research
• Forschungsgebiete Social Software,
Web 2.0 und Semantic Web
• Projektkoordinator des EU-Projekts
„KiWi - Knowledge in a Wiki“
3
Salzburg Research
• Forschungsgesellschaft des Landes Salzburg
• Fokus auf interdiszipliäre IT-Forschung
• Wissens- und Medienmanagement
• Mobilität und ortsbasierte Dienste
• Bildung und Medien
• E-Culture
• Netzwerktechnologien
4
Salzburg NewMediaLab
• Österreichisches Kompetenzzentrum zu
Neuen Medien
• „public private partnership“-Modell mit
öffentlicher Kofinanzierung
• Forschung in den Bereichen
„Multimediatechnologien“, „Social
Software“ und „Semantischen Systemen“
5
Information Organisation
6
video by M. Wesch/YouTube
7
classical paper-based
information organisation
is limited by physical
constraints and thus
follows a single hierarchy
8
Example: Dewey Decimal System
• developed by US librarian Melvil
Dewey
• arranging books in a numerically
encoded hierarchical order by
subject
9
Figure from Politt & Tinker (2003)
10
but what if your world view does not
match Dewey‘s 1930s world view?
11
12
13
This also holds for
newspapers!
photo by birdfarm/Flickr
14
15
Computers offer to organise information
along multiple dimensions, detached from
physical constraints http://universe.daylife.com/
16
Computers offer to organise information
along multiple dimensions, detached from
physical constraints http://universe.daylife.com/
16
Computers offer to organise information
along multiple dimensions, detached from
physical constraints http://universe.daylife.com/
16
Different Hierarchies
17
Example: Holiday Photos
18
you could organise as ...
Italy Photos 2008
2008
19
or as ...
2008
Italy
Photos
2008
20
or even as ...
Italy
2008
Photos
2008
21
or maybe as ?
Italy Photos
2008
2008
22
all this makes sense ...
... to someone
23
but: how
many
dimensions
are there?
photo by Alex Kessler/Flickr
24
5!
(exactly)
25
Location
Alphabet
Time
Category
Richard Saul Wurman
Hierarchy Information Designer
26
Location ...
http://tagit.salzburgresearch.at
27
Alphabet ...
http://www.linkedin.com
28
Time ...
http://simile.mit.edu/timeline/
29
Category ...
30
Hierarchy ...
31
What does this mean
for News Portals?
32
most existing news portals
follow the classical, resort
oriented navigation like in
paper-based news - physical
limitation lifted to virtual space
33
34
35
• resort = category (sort of ...)
• but: not necessarily topic!
36
Article on soccer EM
could be in ...
• sports
• economy
• politics
• culture
• Salzburg
37
LATCH in Online News
38
News by Location ...
http://atlas.tagesschau.de
39
News by Alphabet ...
40
News by Time ...
41
News by Category ...
sorry, no good example (except resort-based) :-(
42
News by Category ...
but there is:
http://www.iptc.org
43
News by Category ...
so why not offer it for navigation?
44
News by Hierarchy ...
45
Challenges & Opportunities
46
from big ambitions to realisable goal
47
Challenges ...
1. user centred design means „intuitiveness“ of
interface
48
but intuitiveness only exists when facing a bear ...
from: user „randy_harris“ at Flickr
49
User Interface ...
otherwise, it is rather patterns and idioms we already know ...
bread crumps tabs
dropdown selection
home link
tag clouds
50
User Interface ...
• when visiting an online news paper, people
almost expect a classical navigation structure
• new idioms need to be introduced very
carefully (e.g. blog style, ...)
• more complex structures need to be hidden (in
salzburg.com: only in search, not in navigation)
51
Managing Topics ...
2. assuming that editors become „knowledge
engineers“ that properly maintain complex
knowledge models was unrealistic
52
Managing Topics ...
• need to do as much automatic processing
as possible (but this is limited)
• possibility to involve users!
53
Tagging
54
Linking
55
Structuring
from: user „liber“ at Flickr
56
Integration ...
3. integration with other kinds of content
beyond news
57
from: „Wikis in plain English“
58
from: „Blogs in plain English“
59
60
Future Content Platforms
61
Project Deliverables ...
• Semantic Search (completed 2008):
http://search.salzburg.com
• KiWi (platform developed by EU Project):
• Content Integration Framework (2009):
integration and connection of different kinds
of content
• TagIT (2009):
geolocation & social tagging of news and
places
62
search.salzburg.com
keyword-based
interface, refine
search results by
map, category, time,
location
63
DEMO!
http://search.salzburg.com
64
Technology (Productive) ...
• UI: Ruby on Rails, AJAX
• Logic: mostly PL/SQL
• DB: PostgreSQL
• XML feed of news articles
• optimized full-text index, time index,
location index, resort
• 700.000 articles
65
Data Import ...
Articles Geolocation
(XML) (named entities + geo field)
Database Fulltext Index
(PostgreSQL) (PostgreSQL built-in)
66
KiWi - Knowledge in a Wiki
• EU project funded under 7th Framework
Programme
• 7 partners, 3.8 Million Euro
• develops a platform for „Semantic Social
Software“
• builds on the „Wiki Principles“
http://www.kiwi-project.eu
67
KiWi - Core Components
• content + semantic metadata (finished)
• transactions & versioning (mostly finished)
• semantic tagging (mostly finished)
• facetted search (in progress)
• social networking (in progress)
• personalisation (in progress)
• reasoning (in progress)
http://www.kiwi-project.eu
68
KiWi - Applications
• KiWi Wiki (finished)
• TagIT (mostly finished)
• Dashboard (in progress)
• Blog (planned)
important:
content shared between applications!
http://www.kiwi-project.eu
69
Demo!
http://showcase.kiwi-project.eu
70
Conclusion
71
Where do we go?
• reimplementation on top of the KiWi
platform
• integration of community features
(social networking, sharing, ...)
• integration of different kinds of content
(news, wiki, blogs, photos, ...)
• backed by advanced Semantic Web
technology
(reasoning, information extraction)
72
Book tips ...
• Richard Saul Wurman: Information Anxiety 2
• David Weinberger: Everything is Miscellaneous
• Clay Shirky: Here Comes Everybody - the Power
of Organising without Organisatons
73
SNML Books (German)
Nachrichten 2.0:
Eine Analyse internationaler
Nachrichtenangebote im Internet
ISBN: 978-3-8370-5731-7
Erfolgreicher Aufbau von Online-
Communitys: Konzepte, Szenarien und
Handlungsempfehlungen (April 2009)
ISBN: 978-3-902448-13-2
74
Thanks!
Dr. Sebastian Schaffert
| sebastian.schaffert@salzburgresearch.at
| http://www.salzburgresearch.at
| http://www.newmedialab.at
| http://www.kiwi-project.eu (KiWi Website)
| http://planet.kiwi-project.eu (KiWi blog)
75
A presentation I have given several times illustrat more
A presentation I have given several times illustrating to non-technical people how the Internet can change information access in media portals. It focusses on the different ways of information organisation and architecture that are possible in digital media because of taking away physical constraints. less
0 comments
Post a comment