Semantische Suche in
     Medienportalen
           Dr. Sebastian Schaffert
Salzburg Research / Salzburg NewMediaLab
  seb...
Introduction




               2
Sebastian Schaffert
 • Doktorat in Informatik,
   Uni München
 • Senior Researcher bei Salzburg
   Research
 • Forschungsg...
Salzburg Research
• Forschungsgesellschaft des Landes Salzburg
• Fokus auf interdiszipliäre IT-Forschung
 • Wissens- und M...
Salzburg NewMediaLab
• Österreichisches Kompetenzzentrum zu
  Neuen Medien
• „public private partnership“-Modell mit
  öff...
Information Organisation




                           6
video by M. Wesch/YouTube

                        7
classical paper-based
information organisation
   is limited by physical
   constraints and thus
follows a single hierarch...
Example: Dewey Decimal System

• developed by US librarian Melvil
  Dewey
• arranging books in a numerically
  encoded hie...
Figure from Politt & Tinker (2003)

                                     10
but what if your world view does not
 match Dewey‘s 1930s world view?




                                       11
12
13
This also holds for
   newspapers!




                      photo by birdfarm/Flickr

                                   ...
15
Computers offer to organise information
along multiple dimensions, detached from
           physical constraints    http:/...
Computers offer to organise information
along multiple dimensions, detached from
           physical constraints    http:/...
Computers offer to organise information
along multiple dimensions, detached from
           physical constraints    http:/...
Different Hierarchies




                        17
Example: Holiday Photos




                          18
you could organise as ...


Italy               Photos    2008




                             2008


                   ...
or as ...


                              2008
                     Italy
Photos




                             2008


 ...
or even as ...


                                 Italy
                          2008
Photos




                     200...
or maybe as ?


                        Italy   Photos
 2008




2008


                                         22
all this makes sense ...
    ... to someone




                           23
but: how
   many
dimensions
are there?




             photo by Alex Kessler/Flickr

                                    ...
5!
(exactly)




            25
Location
Alphabet
Time
Category
            Richard Saul Wurman
Hierarchy   Information Designer



                      ...
Location ...




               http://tagit.salzburgresearch.at
                                                  27
Alphabet ...




               http://www.linkedin.com
                                         28
Time ...




           http://simile.mit.edu/timeline/
                                             29
Category ...




               30
Hierarchy ...




                31
What does this mean
 for News Portals?


                      32
most existing news portals
   follow the classical, resort
   oriented navigation like in
  paper-based news - physical
li...
34
35
• resort = category (sort of ...)
• but: not necessarily topic!



                                    36
Article on soccer EM
could be in ...



                       • sports
                       • economy
                 ...
LATCH in Online News




                       38
News by Location ...




                       http://atlas.tagesschau.de
                                               ...
News by Alphabet ...




                       40
News by Time ...




                   41
News by Category ...




sorry, no good example (except resort-based) :-(




                                            ...
News by Category ...

but there is:




                           http://www.iptc.org
                                   ...
News by Category ...




    so why not offer it for navigation?




                                          44
News by Hierarchy ...




                        45
Challenges & Opportunities




                             46
from big ambitions to realisable goal




                                        47
Challenges ...




1. user centred design means „intuitiveness“ of
   interface




                                      ...
but intuitiveness only exists when facing a bear ...




                                       from: user „randy_harris“ ...
User Interface ...
otherwise, it is rather patterns and idioms we already know ...
           bread crumps                ...
User Interface ...

• when visiting an online news paper, people
  almost expect a classical navigation structure

• new i...
Managing Topics ...




2. assuming that editors become „knowledge
   engineers“ that properly maintain complex
   knowled...
Managing Topics ...



• need to do as much automatic processing
  as possible (but this is limited)

• possibility to inv...
Tagging




          54
Linking



          55
Structuring




              from: user „liber“ at Flickr
                                             56
Integration ...




3. integration with other kinds of content
   beyond news




                                        ...
from: „Wikis in plain English“
                                 58
from: „Blogs in plain English“
                                 59
60
Future Content Platforms




                           61
Project Deliverables ...
• Semantic Search (completed 2008):
  http://search.salzburg.com

• KiWi (platform developed by E...
search.salzburg.com




  keyword-based
  interface, refine
 search results by
map, category, time,
      location



     ...
DEMO!




        http://search.salzburg.com
                                     64
Technology (Productive) ...



 • UI: Ruby on Rails, AJAX
 • Logic: mostly PL/SQL
 • DB: PostgreSQL
 • XML feed of news ar...
Data Import ...




 Articles                 Geolocation
   (XML)             (named entities + geo field)




Database   ...
KiWi - Knowledge in a Wiki

  • EU project funded under 7th Framework
    Programme

  • 7 partners, 3.8 Million Euro
  • ...
KiWi - Core Components

 • content + semantic metadata (finished)
 • transactions & versioning (mostly finished)
 • semantic...
KiWi - Applications

• KiWi Wiki (finished)
• TagIT (mostly finished)
• Dashboard (in progress)
• Blog (planned)
important:
...
Demo!




        http://showcase.kiwi-project.eu
                                          70
Conclusion




             71
Where do we go?


     • reimplementation on top of the KiWi
       platform
     • integration of community features
    ...
Book tips ...


• Richard Saul Wurman: Information Anxiety 2
• David Weinberger: Everything is Miscellaneous
• Clay Shirky...
SNML Books (German)

              Nachrichten 2.0:
              Eine Analyse internationaler
              Nachrichtenan...
Thanks!


Dr. Sebastian Schaffert

| sebastian.schaffert@salzburgresearch.at

| http://www.salzburgresearch.at
| http://ww...
Upcoming SlideShare
Loading in …5
×

Semantic Search for Media Portals

1,157 views

Published on

A presentation I have given several times illustrating to non-technical people how the Internet can change information access in media portals. It focusses on the different ways of information organisation and architecture that are possible in digital media because of taking away physical constraints.

Published in: Technology, Education
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
1,157
On SlideShare
0
From Embeds
0
Number of Embeds
7
Actions
Shares
0
Downloads
20
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Semantic Search for Media Portals

  1. 1. Semantische Suche in Medienportalen Dr. Sebastian Schaffert Salzburg Research / Salzburg NewMediaLab sebastian.schaffert@salzburgresearch.at 1
  2. 2. Introduction 2
  3. 3. Sebastian Schaffert • Doktorat in Informatik, Uni München • Senior Researcher bei Salzburg Research • Forschungsgebiete Social Software, Web 2.0 und Semantic Web • Projektkoordinator des EU-Projekts „KiWi - Knowledge in a Wiki“ 3
  4. 4. Salzburg Research • Forschungsgesellschaft des Landes Salzburg • Fokus auf interdiszipliäre IT-Forschung • Wissens- und Medienmanagement • Mobilität und ortsbasierte Dienste • Bildung und Medien • E-Culture • Netzwerktechnologien 4
  5. 5. Salzburg NewMediaLab • Österreichisches Kompetenzzentrum zu Neuen Medien • „public private partnership“-Modell mit öffentlicher Kofinanzierung • Forschung in den Bereichen „Multimediatechnologien“, „Social Software“ und „Semantischen Systemen“ 5
  6. 6. Information Organisation 6
  7. 7. video by M. Wesch/YouTube 7
  8. 8. classical paper-based information organisation is limited by physical constraints and thus follows a single hierarchy 8
  9. 9. Example: Dewey Decimal System • developed by US librarian Melvil Dewey • arranging books in a numerically encoded hierarchical order by subject 9
  10. 10. Figure from Politt & Tinker (2003) 10
  11. 11. but what if your world view does not match Dewey‘s 1930s world view? 11
  12. 12. 12
  13. 13. 13
  14. 14. This also holds for newspapers! photo by birdfarm/Flickr 14
  15. 15. 15
  16. 16. Computers offer to organise information along multiple dimensions, detached from physical constraints http://universe.daylife.com/ 16
  17. 17. Computers offer to organise information along multiple dimensions, detached from physical constraints http://universe.daylife.com/ 16
  18. 18. Computers offer to organise information along multiple dimensions, detached from physical constraints http://universe.daylife.com/ 16
  19. 19. Different Hierarchies 17
  20. 20. Example: Holiday Photos 18
  21. 21. you could organise as ... Italy Photos 2008 2008 19
  22. 22. or as ... 2008 Italy Photos 2008 20
  23. 23. or even as ... Italy 2008 Photos 2008 21
  24. 24. or maybe as ? Italy Photos 2008 2008 22
  25. 25. all this makes sense ... ... to someone 23
  26. 26. but: how many dimensions are there? photo by Alex Kessler/Flickr 24
  27. 27. 5! (exactly) 25
  28. 28. Location Alphabet Time Category Richard Saul Wurman Hierarchy Information Designer 26
  29. 29. Location ... http://tagit.salzburgresearch.at 27
  30. 30. Alphabet ... http://www.linkedin.com 28
  31. 31. Time ... http://simile.mit.edu/timeline/ 29
  32. 32. Category ... 30
  33. 33. Hierarchy ... 31
  34. 34. What does this mean for News Portals? 32
  35. 35. most existing news portals follow the classical, resort oriented navigation like in paper-based news - physical limitation lifted to virtual space 33
  36. 36. 34
  37. 37. 35
  38. 38. • resort = category (sort of ...) • but: not necessarily topic! 36
  39. 39. Article on soccer EM could be in ... • sports • economy • politics • culture • Salzburg 37
  40. 40. LATCH in Online News 38
  41. 41. News by Location ... http://atlas.tagesschau.de 39
  42. 42. News by Alphabet ... 40
  43. 43. News by Time ... 41
  44. 44. News by Category ... sorry, no good example (except resort-based) :-( 42
  45. 45. News by Category ... but there is: http://www.iptc.org 43
  46. 46. News by Category ... so why not offer it for navigation? 44
  47. 47. News by Hierarchy ... 45
  48. 48. Challenges & Opportunities 46
  49. 49. from big ambitions to realisable goal 47
  50. 50. Challenges ... 1. user centred design means „intuitiveness“ of interface 48
  51. 51. but intuitiveness only exists when facing a bear ... from: user „randy_harris“ at Flickr 49
  52. 52. User Interface ... otherwise, it is rather patterns and idioms we already know ... bread crumps tabs dropdown selection home link tag clouds 50
  53. 53. User Interface ... • when visiting an online news paper, people almost expect a classical navigation structure • new idioms need to be introduced very carefully (e.g. blog style, ...) • more complex structures need to be hidden (in salzburg.com: only in search, not in navigation) 51
  54. 54. Managing Topics ... 2. assuming that editors become „knowledge engineers“ that properly maintain complex knowledge models was unrealistic 52
  55. 55. Managing Topics ... • need to do as much automatic processing as possible (but this is limited) • possibility to involve users! 53
  56. 56. Tagging 54
  57. 57. Linking 55
  58. 58. Structuring from: user „liber“ at Flickr 56
  59. 59. Integration ... 3. integration with other kinds of content beyond news 57
  60. 60. from: „Wikis in plain English“ 58
  61. 61. from: „Blogs in plain English“ 59
  62. 62. 60
  63. 63. Future Content Platforms 61
  64. 64. Project Deliverables ... • Semantic Search (completed 2008): http://search.salzburg.com • KiWi (platform developed by EU Project): • Content Integration Framework (2009): integration and connection of different kinds of content • TagIT (2009): geolocation & social tagging of news and places 62
  65. 65. search.salzburg.com keyword-based interface, refine search results by map, category, time, location 63
  66. 66. DEMO! http://search.salzburg.com 64
  67. 67. Technology (Productive) ... • UI: Ruby on Rails, AJAX • Logic: mostly PL/SQL • DB: PostgreSQL • XML feed of news articles • optimized full-text index, time index, location index, resort • 700.000 articles 65
  68. 68. Data Import ... Articles Geolocation (XML) (named entities + geo field) Database Fulltext Index (PostgreSQL) (PostgreSQL built-in) 66
  69. 69. KiWi - Knowledge in a Wiki • EU project funded under 7th Framework Programme • 7 partners, 3.8 Million Euro • develops a platform for „Semantic Social Software“ • builds on the „Wiki Principles“ http://www.kiwi-project.eu 67
  70. 70. KiWi - Core Components • content + semantic metadata (finished) • transactions & versioning (mostly finished) • semantic tagging (mostly finished) • facetted search (in progress) • social networking (in progress) • personalisation (in progress) • reasoning (in progress) http://www.kiwi-project.eu 68
  71. 71. KiWi - Applications • KiWi Wiki (finished) • TagIT (mostly finished) • Dashboard (in progress) • Blog (planned) important: content shared between applications! http://www.kiwi-project.eu 69
  72. 72. Demo! http://showcase.kiwi-project.eu 70
  73. 73. Conclusion 71
  74. 74. Where do we go? • reimplementation on top of the KiWi platform • integration of community features (social networking, sharing, ...) • integration of different kinds of content (news, wiki, blogs, photos, ...) • backed by advanced Semantic Web technology (reasoning, information extraction) 72
  75. 75. Book tips ... • Richard Saul Wurman: Information Anxiety 2 • David Weinberger: Everything is Miscellaneous • Clay Shirky: Here Comes Everybody - the Power of Organising without Organisatons 73
  76. 76. SNML Books (German) Nachrichten 2.0: Eine Analyse internationaler Nachrichtenangebote im Internet ISBN: 978-3-8370-5731-7 Erfolgreicher Aufbau von Online- Communitys: Konzepte, Szenarien und Handlungsempfehlungen (April 2009) ISBN: 978-3-902448-13-2 74
  77. 77. Thanks! Dr. Sebastian Schaffert | sebastian.schaffert@salzburgresearch.at | http://www.salzburgresearch.at | http://www.newmedialab.at | http://www.kiwi-project.eu (KiWi Website) | http://planet.kiwi-project.eu (KiWi blog) 75

×