Improving the Search Experience
in a Social Network with Cross
Media Contents
Daniele Cenni, Paolo Nesi
University of Flor...
ECLAP Social Network
 ECLAP is a Digital Library on Performing
Arts connected with Europeana
 ECLAP is a Best Practice a...
Goals/Requirements
 Develop an Indexing/Searching solution for ECLAP Social
Network allowing:
 Indexing multilingual cro...
ECLAP ANY content kind
 Informative Content
 Video, audio, images,
documents
 3D, animations, Braille
 Slide, Video-Sl...
ECLAP Semantic Model 1
DMS2013, August 2013, UK, Paolo Nesi
Media Object
Video Audio
Document
Group/Channel
CollectionPlay...
ECLAP Semantic Model 2
DMS2013, August 2013, UK, Paolo Nesi
User Group/Channel
Content
Media Object
Comment
Annotation
Tax...
Indexing
 Indexing & Search system
 Based on Apache Solr
 Multilingual aspects
 Translate the metadata or translate th...
Indexing
DMS2013, August 2013, UK, Paolo Nesi 8
Metadata Schema Indexing
DMS2013, August 2013, UK, Paolo Nesi 9
Search Facilities
 Full text search
 Uses the catch all fields to search for keywords in
most important fields in all la...
Search Facilities vs Information
DMS2013, August 2013, UK, Paolo Nesi 11
Searching
 Faceted search
DMS2013, August 2013, UK, Paolo Nesi 12
Weighted Query Model
 Where for the “q” query
 Weights are boosting fields
 Title is DC.Title, description DC.Descripti...
Model Optimization
 Optimization of the Precision&Recall to
improve search quality
 50 reference queries
 Optimization ...
Monte Carlo Analysis
MAP: Mean Average PrecisionDMS2013, August 2013, UK, Paolo Nesi 15
DMS2013, August 2013, UK, Paolo Nesi 16
Some weights’ Trends
DMS2013, August 2013, UK, Paolo Nesi 17
Comparative Results
MAP: Mean Average PrecisionDMS2013, August 2013, UK, Paolo Nesi 18
Usage Results
 Over than 500.000 visits
 7.29 minutes of permanence on the
portal
DMS2013, August 2013, UK, Paolo Nesi 19
Assessment of Search Facility
 Distribution of performed clicks
First page
DMS2013, August 2013, UK, Paolo Nesi 20
Conclusions
 indexing solution for
 cross media for multilingual metadata and texts
 Improved Searching & filtering res...
Upcoming SlideShare
Loading in...5
×

Improving the Search Experience in a Social Network with Cross Media Contents

71

Published on

Indexing/Searching solution for ECLAP Social Network allowing:
Indexing multilingual crossmedia content metadata and data (e.g. documents)
Indexing portal blogs, forums, events, group pages, comments, etc.
Efficient multilingual search (keyword search and advanced search) supporting:
misspelled words (e.g. shespeare)
partial word search
Sorting and filtering search results
re-index the whole data without blocking the system
Log and monitor users activity

Evaluate the Indexing/Searching service

Indexing & Search system
Based on Apache Solr
Multilingual aspects
Translate the metadata or translate the query?.. both
metadata translation
Query translation
Indexing schema
Dublin Core + DCTerms (multi language)
Performing Arts
Technical (provider, content type, GPS, IPR, duration, quality, …)
Groups associations (multi language)
Taxonomy associations (multi language)
Comments & multi language tags
FullText of the textual digital resources

Published in: Technology, Education
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
71
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
3
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Improving the Search Experience in a Social Network with Cross Media Contents

  1. 1. Improving the Search Experience in a Social Network with Cross Media Contents Daniele Cenni, Paolo Nesi University of Florence Department of Systems and Informatics Distributed Systems and Internet Technology Laboratory Paolo.nesi@unifi.it cenni@dsi.unifi.it , http://www.disit.dinfo.unifi.it DMS2013, August 2013, UK, Paolo Nesi 1
  2. 2. ECLAP Social Network  ECLAP is a Digital Library on Performing Arts connected with Europeana  ECLAP is a Best Practice and Social Network (blogs, forums, comments, tagging, voting, …) DMS2013, August 2013, UK, Paolo Nesi 2
  3. 3. Goals/Requirements  Develop an Indexing/Searching solution for ECLAP Social Network allowing:  Indexing multilingual crossmedia content metadata and data (e.g. documents)  Indexing portal blogs, forums, events, group pages, comments, etc.  Efficient multilingual search (keyword search and advanced search) supporting:  misspelled words (e.g. shespeare)  partial word search  Sorting and filtering search results  re-index the whole data without blocking the system  Log and monitor users activity  …  Evaluate the Indexing/Searchig service DMS2013, August 2013, UK, Paolo Nesi 3
  4. 4. ECLAP ANY content kind  Informative Content  Video, audio, images, documents  3D, animations, Braille  Slide, Video-Slide, courses  eBook, ePub, Mpeg21, intelligent  Aggregated Content:  Playlist, Collections  Annotations, Synchronization  Support and networking content:  Blog, WebPage, Events, comments, forum, votes, messages, … 4 comments rating relationships technical Dynamic recommend …………… • Performance • Master classes • Scene Sketches • Scenography • Scenes • Private lives of artists • Scores • Braille • BackStage Stills • Choreography • Morals • Poster • Booklets • Magazines Music • Audio ballets
  5. 5. ECLAP Semantic Model 1 DMS2013, August 2013, UK, Paolo Nesi Media Object Video Audio Document Group/Channel CollectionPlaylist 0..n 0..n 1..n 0..n Image AVObjectAnnotation 0..n 1..2 1..n 0..n ForumWebPage CommentContentTaxonomyTerm 0..n 0..n 0..n1 0..n 0..n Blog Metadata Performing Arts Dublin Core Technical Main Annotation Side Annotation 1..n 1 GeoName Crossmedia Archive Event epub 3D IPR Braille Music Score 5
  6. 6. ECLAP Semantic Model 2 DMS2013, August 2013, UK, Paolo Nesi User Group/Channel Content Media Object Comment Annotation TaxonomyTerm foaf:member admin isProvidedBy isFavouriteOf dc:creator dc:creator foaf:topic_interest isFeaturedBy foaf:knows 6
  7. 7. Indexing  Indexing & Search system  Based on Apache Solr  Multilingual aspects  Translate the metadata or translate the query?.. both  metadata translation  Query translation  Indexing schema  Dublin Core + DCTerms (multi language)  Performing Arts  Technical (provider, content type, GPS, IPR, duration, quality, …)  Groups associations (multi language)  Taxonomy associations (multi language)  Comments & multi language tags  FullText of the textual digital resources DMS2013, August 2013, UK, Paolo Nesi 7
  8. 8. Indexing DMS2013, August 2013, UK, Paolo Nesi 8
  9. 9. Metadata Schema Indexing DMS2013, August 2013, UK, Paolo Nesi 9
  10. 10. Search Facilities  Full text search  Uses the catch all fields to search for keywords in most important fields in all languages (title, description, text, body, subject,…)  Fuzzy search  Allows matching mistyped words  Deep search  Allows searching for partial words  Faceted Search  Maximasing Precision and Recall:  Relevance & boosting terms DMS2013, August 2013, UK, Paolo Nesi 10
  11. 11. Search Facilities vs Information DMS2013, August 2013, UK, Paolo Nesi 11
  12. 12. Searching  Faceted search DMS2013, August 2013, UK, Paolo Nesi 12
  13. 13. Weighted Query Model  Where for the “q” query  Weights are boosting fields  Title is DC.Title, description DC.Description….,  Body is textual body, subject…,  taxonomy the full description of the taxonomy branch DMS2013, August 2013, UK, Paolo Nesi 13
  14. 14. Model Optimization  Optimization of the Precision&Recall to improve search quality  50 reference queries  Optimization Methods  Simulated Annealing  Genetic Algorithms  7 parameters DMS2013, August 2013, UK, Paolo Nesi 14
  15. 15. Monte Carlo Analysis MAP: Mean Average PrecisionDMS2013, August 2013, UK, Paolo Nesi 15
  16. 16. DMS2013, August 2013, UK, Paolo Nesi 16
  17. 17. Some weights’ Trends DMS2013, August 2013, UK, Paolo Nesi 17
  18. 18. Comparative Results MAP: Mean Average PrecisionDMS2013, August 2013, UK, Paolo Nesi 18
  19. 19. Usage Results  Over than 500.000 visits  7.29 minutes of permanence on the portal DMS2013, August 2013, UK, Paolo Nesi 19
  20. 20. Assessment of Search Facility  Distribution of performed clicks First page DMS2013, August 2013, UK, Paolo Nesi 20
  21. 21. Conclusions  indexing solution for  cross media for multilingual metadata and texts  Improved Searching & filtering results and thus user experience quality  Providing: (full text, operators), advanced, faceted, etc.  Precision and Recall analysis allowed to tune the search services  Simulated Annealing and Genetic Algorithms produced similar results  User behavior assessment has shown that search facility appreciation has been improved wrt to early previous settings, grounded on common sense and classical metadata relevance DMS2013, August 2013, UK, Paolo Nesi 21
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×