SlideShare a Scribd company logo
© Fraunhofer IDMT
T. Köllmer†, E. Berndl‡, T. Weißgerber‡, P. Aichroth† , H. Kosch‡
A Workflow for Cross Media Recommendations
based on Linked Data Analysis
† Fraunhofer IDMT ‡ University of Passau
Thomas Köllmer
thomas.koellmer@idmt.fraunhofer.de
© Fraunhofer IDMT 2
The MICO Project
 http://www.mico-project.eu
 FP7 STREP, GA# 610480
 7 Partners: Salzburg Research, Fraunhofer IDMT, Oxford University,
University of Passau, Umeå University, Zaizi, InsideOut10
 Duration: 36 Months (end: 10/2016)
 Goal of MICO is to develop a platform to analyse “media in context” …
 ...by orchestrating different content extraction tools that can work
simultaneously or in a sequence and …..
 ...provide valuable metadata to third party applications.
© Fraunhofer IDMT 3
Cross Media Recommendations in MICO
 The Recommendation Problem
 “Providing suggestions for items to be of use to a user”
 Broad Application domains:
 e-Commerce, Travel, Multimedia, News, …
 Two main data sources:
 User behaviour data (Collaborative Filtering)
 Item Metadata (Content based recommendation)
© Fraunhofer IDMT 4
Cross Media Recommendations in MICO
 Media Recommendation
 “Classic” Recommendation problems (Music-Recommendation, Netflix
Challenge)
 Mostly based on collaborative filtering, but also content analysis
 Cross Media Recommendation
 Cross Media ≠ Cross Modal
 Multiple Input
 Media Items + Associated Media context, e.g., commentary
 Multiple Output
 Not necessarily more of the same thing
© Fraunhofer IDMT 5
Editor Support Usecase
As an editor, while I create or edit articles using WordPress, I want to
automatically
get related articles and videos that I might link to the article.
 Project Use Case: Recommendations for Greenpeace Magazine Italia
 Articles
 Videos
 User behaviour data
© Fraunhofer IDMT 6
Editor Support Usecase
As an editor, while I create or edit articles using WordPress, I want to
automatically
get related articles and videos that I might link to the article.
 How can Mico Help?
 Articles: NER components for content spotting
 Videos
 Text to speech + NER
 Metadata Analysis
 User behaviour data
 Generic collaborative filtering component (prediction.io)
© Fraunhofer IDMT 7
Chat Analysis Usecase
As a zooniverse user, I want get more information that helps me during the
current classification task, e.g., by the recommendation of similar subjects
or training data.
 Input:
 Results from animal detection
 Sentiment and competence classification of talks
 NER for animal spotting
 Output:
 Classification Hints
 Hints for moderators / experts
© Fraunhofer IDMT 8
Processing Workflow for Editor Support Use Case
1. Crawler feeds media items to platform
e.g., videos from Greenpeace channel, news articles, internal archive….
2. Analysis results are stored as Linked Data
Currently: using Apache Marmotta as triplestore
© Fraunhofer IDMT 9
Processing Workflow for Editor Support Use Case
3. Item gets into Focus (someone is writing an article on a specific topic)
4. Focused item is fed to the same platform, using customized pipelines
5. Results are processed by LD matching, i.e Recommendation component..
6. … enriched by stored annotations …
7. And returned to editing platform
© Fraunhofer IDMT 10
Recommendation Workflow in MICO
© Fraunhofer IDMT 11
Ontology Matching
 Semantic distance of entities important for analysis
 Animal detection: “I see a bird on the tree”, “Yes it’s definitely a
sparrow”
 Topic detection: Wind energy, sustainability, renewables, oil, …
 Non-Mico:
 Genres! (e.g., currently 1400 genres on Spotify…)
 Not less controversial: movie genres
© Fraunhofer IDMT 12
Evaluation
 How to evaluate a recommender system?
 Forget about precision and recall!
 Observing user behaviour
 Online experiment
 Offline: extrapolating past user behavior
 Lab Studies / Questionnaire
 Open problem: standardized + comparable testing of recommender
systems
© Fraunhofer IDMT 13
Conclusions
 Proposed setting tries to formalize a certain subset of recommendation
problems: Cross media recommendation
 Focused on Media Items
 Profits from Content Analysis
 New items, have a high contextual dependency on existing items, that
will be exploited for recommendation
 Work in Progress:
 Stable release of Mico-platform and extractors: right now
 PoC recommendation code will be integrated in the platform
 Demo Services: available in June
 Incorporate feedback into recommendation
© Fraunhofer IDMT
T. Köllmer†, E. Berndl‡, T. Weißgerber‡, P. Aichroth† , H. Kosch‡
A Workflow for Cross Media Recommendations
based on Linked Data Analysis
† Fraunhofer IDMT ‡ University of Passau
Thank you!
Questions?

More Related Content

Similar to Lime recommendation

Research Careers in Applied Computer Science
Research Careers in Applied Computer ScienceResearch Careers in Applied Computer Science
Research Careers in Applied Computer ScienceChristoph Lange
 
Summer school bz_fp7research_20100708
Summer school bz_fp7research_20100708Summer school bz_fp7research_20100708
Summer school bz_fp7research_20100708Sandro D'Elia
 
Go open2010 sde_20100417
Go open2010 sde_20100417Go open2010 sde_20100417
Go open2010 sde_20100417Sandro D'Elia
 
Fraunhofer iais audio mining - automatic metadata gereration of audio streams...
Fraunhofer iais audio mining - automatic metadata gereration of audio streams...Fraunhofer iais audio mining - automatic metadata gereration of audio streams...
Fraunhofer iais audio mining - automatic metadata gereration of audio streams...FIAT/IFTA
 
GoOpen 2010: Sandro D'Elia
GoOpen 2010: Sandro D'EliaGoOpen 2010: Sandro D'Elia
GoOpen 2010: Sandro D'EliaFriprogsenteret
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better ResearchCarole Goble
 
LEARNTEC 2018 - HANDELkompetent - Enabling Adaptive Learning Pathways by Situ...
LEARNTEC 2018 - HANDELkompetent - Enabling Adaptive Learning Pathways by Situ...LEARNTEC 2018 - HANDELkompetent - Enabling Adaptive Learning Pathways by Situ...
LEARNTEC 2018 - HANDELkompetent - Enabling Adaptive Learning Pathways by Situ...Sebastian Opriel
 
NLP on Hadoop: A Distributed Framework for NLP-Based Keyword and Keyphrase Ex...
NLP on Hadoop: A Distributed Framework for NLP-Based Keyword and Keyphrase Ex...NLP on Hadoop: A Distributed Framework for NLP-Based Keyword and Keyphrase Ex...
NLP on Hadoop: A Distributed Framework for NLP-Based Keyword and Keyphrase Ex...Paolo Nesi
 
Open and Collaborative Software for Digital Pathology
Open and Collaborative Software for Digital Pathology Open and Collaborative Software for Digital Pathology
Open and Collaborative Software for Digital Pathology William Baird
 
MICO — Towards Contextual Media Analysis
MICO — Towards Contextual Media AnalysisMICO — Towards Contextual Media Analysis
MICO — Towards Contextual Media AnalysisThomas Kurz
 
Hamburg Requirements Engineering Symposium
Hamburg Requirements Engineering SymposiumHamburg Requirements Engineering Symposium
Hamburg Requirements Engineering SymposiumSupersede
 
Wageningen phenotype meeting
Wageningen phenotype meetingWageningen phenotype meeting
Wageningen phenotype meetingthehyve
 
Overview of methodologies
Overview of methodologiesOverview of methodologies
Overview of methodologiesMickael Pero
 
Frankfurt Big Data Lab & Refugee Projeect
Frankfurt Big Data Lab & Refugee ProjeectFrankfurt Big Data Lab & Refugee Projeect
Frankfurt Big Data Lab & Refugee ProjeectGoethe Univeristy
 
Jonathan bright - collecting social media data with the python programming la...
Jonathan bright - collecting social media data with the python programming la...Jonathan bright - collecting social media data with the python programming la...
Jonathan bright - collecting social media data with the python programming la...oiisdp
 
ISCRAM Summer School lecture Prof. Ralph Morelli
ISCRAM Summer School lecture Prof. Ralph MorelliISCRAM Summer School lecture Prof. Ralph Morelli
ISCRAM Summer School lecture Prof. Ralph Morelliwjm
 
OpenNTF Overview DanNotes 11/23/11
OpenNTF Overview DanNotes 11/23/11OpenNTF Overview DanNotes 11/23/11
OpenNTF Overview DanNotes 11/23/11Niklas Heidloff
 
Data management plans – EUDAT Best practices and case study | www.eudat.eu
Data management plans – EUDAT Best practices and case study | www.eudat.euData management plans – EUDAT Best practices and case study | www.eudat.eu
Data management plans – EUDAT Best practices and case study | www.eudat.euEUDAT
 

Similar to Lime recommendation (20)

Research Careers in Applied Computer Science
Research Careers in Applied Computer ScienceResearch Careers in Applied Computer Science
Research Careers in Applied Computer Science
 
Summer school bz_fp7research_20100708
Summer school bz_fp7research_20100708Summer school bz_fp7research_20100708
Summer school bz_fp7research_20100708
 
Go open2010 sde_20100417
Go open2010 sde_20100417Go open2010 sde_20100417
Go open2010 sde_20100417
 
Fraunhofer iais audio mining - automatic metadata gereration of audio streams...
Fraunhofer iais audio mining - automatic metadata gereration of audio streams...Fraunhofer iais audio mining - automatic metadata gereration of audio streams...
Fraunhofer iais audio mining - automatic metadata gereration of audio streams...
 
GoOpen 2010: Sandro D'Elia
GoOpen 2010: Sandro D'EliaGoOpen 2010: Sandro D'Elia
GoOpen 2010: Sandro D'Elia
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better Research
 
LEARNTEC 2018 - HANDELkompetent - Enabling Adaptive Learning Pathways by Situ...
LEARNTEC 2018 - HANDELkompetent - Enabling Adaptive Learning Pathways by Situ...LEARNTEC 2018 - HANDELkompetent - Enabling Adaptive Learning Pathways by Situ...
LEARNTEC 2018 - HANDELkompetent - Enabling Adaptive Learning Pathways by Situ...
 
NLP on Hadoop: A Distributed Framework for NLP-Based Keyword and Keyphrase Ex...
NLP on Hadoop: A Distributed Framework for NLP-Based Keyword and Keyphrase Ex...NLP on Hadoop: A Distributed Framework for NLP-Based Keyword and Keyphrase Ex...
NLP on Hadoop: A Distributed Framework for NLP-Based Keyword and Keyphrase Ex...
 
2016 nov-ieee-sdn-wiki
2016 nov-ieee-sdn-wiki2016 nov-ieee-sdn-wiki
2016 nov-ieee-sdn-wiki
 
Open and Collaborative Software for Digital Pathology
Open and Collaborative Software for Digital Pathology Open and Collaborative Software for Digital Pathology
Open and Collaborative Software for Digital Pathology
 
MICO — Towards Contextual Media Analysis
MICO — Towards Contextual Media AnalysisMICO — Towards Contextual Media Analysis
MICO — Towards Contextual Media Analysis
 
Hamburg Requirements Engineering Symposium
Hamburg Requirements Engineering SymposiumHamburg Requirements Engineering Symposium
Hamburg Requirements Engineering Symposium
 
Wageningen phenotype meeting
Wageningen phenotype meetingWageningen phenotype meeting
Wageningen phenotype meeting
 
Overview of methodologies
Overview of methodologiesOverview of methodologies
Overview of methodologies
 
Frankfurt Big Data Lab & Refugee Projeect
Frankfurt Big Data Lab & Refugee ProjeectFrankfurt Big Data Lab & Refugee Projeect
Frankfurt Big Data Lab & Refugee Projeect
 
Jonathan bright - collecting social media data with the python programming la...
Jonathan bright - collecting social media data with the python programming la...Jonathan bright - collecting social media data with the python programming la...
Jonathan bright - collecting social media data with the python programming la...
 
ISCRAM Summer School lecture Prof. Ralph Morelli
ISCRAM Summer School lecture Prof. Ralph MorelliISCRAM Summer School lecture Prof. Ralph Morelli
ISCRAM Summer School lecture Prof. Ralph Morelli
 
OpenNTF Overview DanNotes 11/23/11
OpenNTF Overview DanNotes 11/23/11OpenNTF Overview DanNotes 11/23/11
OpenNTF Overview DanNotes 11/23/11
 
Annual Report
Annual ReportAnnual Report
Annual Report
 
Data management plans – EUDAT Best practices and case study | www.eudat.eu
Data management plans – EUDAT Best practices and case study | www.eudat.euData management plans – EUDAT Best practices and case study | www.eudat.eu
Data management plans – EUDAT Best practices and case study | www.eudat.eu
 

Recently uploaded

UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2DianaGray10
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
 
Introduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG EvaluationIntroduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG EvaluationZilliz
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...UiPathCommunity
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersSafe Software
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...Product School
 
Powerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaPowerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaCzechDreamin
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaRTTS
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlPeter Udo Diehl
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...Product School
 
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka DoktorováCzechDreamin
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupCatarinaPereira64715
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...Product School
 
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxUnpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxDavid Michel
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor TurskyiFwdays
 
IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024IoTAnalytics
 
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀DianaGray10
 
In-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT ProfessionalsIn-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT ProfessionalsExpeed Software
 
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya HalderCustom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya HalderCzechDreamin
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsPaul Groth
 

Recently uploaded (20)

UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2UiPath Test Automation using UiPath Test Suite series, part 2
UiPath Test Automation using UiPath Test Suite series, part 2
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
Introduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG EvaluationIntroduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG Evaluation
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
Powerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaPowerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara Laskowska
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxUnpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
 
IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024
 
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
 
In-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT ProfessionalsIn-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT Professionals
 
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya HalderCustom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 

Lime recommendation

  • 1. © Fraunhofer IDMT T. Köllmer†, E. Berndl‡, T. Weißgerber‡, P. Aichroth† , H. Kosch‡ A Workflow for Cross Media Recommendations based on Linked Data Analysis † Fraunhofer IDMT ‡ University of Passau Thomas Köllmer thomas.koellmer@idmt.fraunhofer.de
  • 2. © Fraunhofer IDMT 2 The MICO Project  http://www.mico-project.eu  FP7 STREP, GA# 610480  7 Partners: Salzburg Research, Fraunhofer IDMT, Oxford University, University of Passau, Umeå University, Zaizi, InsideOut10  Duration: 36 Months (end: 10/2016)  Goal of MICO is to develop a platform to analyse “media in context” …  ...by orchestrating different content extraction tools that can work simultaneously or in a sequence and …..  ...provide valuable metadata to third party applications.
  • 3. © Fraunhofer IDMT 3 Cross Media Recommendations in MICO  The Recommendation Problem  “Providing suggestions for items to be of use to a user”  Broad Application domains:  e-Commerce, Travel, Multimedia, News, …  Two main data sources:  User behaviour data (Collaborative Filtering)  Item Metadata (Content based recommendation)
  • 4. © Fraunhofer IDMT 4 Cross Media Recommendations in MICO  Media Recommendation  “Classic” Recommendation problems (Music-Recommendation, Netflix Challenge)  Mostly based on collaborative filtering, but also content analysis  Cross Media Recommendation  Cross Media ≠ Cross Modal  Multiple Input  Media Items + Associated Media context, e.g., commentary  Multiple Output  Not necessarily more of the same thing
  • 5. © Fraunhofer IDMT 5 Editor Support Usecase As an editor, while I create or edit articles using WordPress, I want to automatically get related articles and videos that I might link to the article.  Project Use Case: Recommendations for Greenpeace Magazine Italia  Articles  Videos  User behaviour data
  • 6. © Fraunhofer IDMT 6 Editor Support Usecase As an editor, while I create or edit articles using WordPress, I want to automatically get related articles and videos that I might link to the article.  How can Mico Help?  Articles: NER components for content spotting  Videos  Text to speech + NER  Metadata Analysis  User behaviour data  Generic collaborative filtering component (prediction.io)
  • 7. © Fraunhofer IDMT 7 Chat Analysis Usecase As a zooniverse user, I want get more information that helps me during the current classification task, e.g., by the recommendation of similar subjects or training data.  Input:  Results from animal detection  Sentiment and competence classification of talks  NER for animal spotting  Output:  Classification Hints  Hints for moderators / experts
  • 8. © Fraunhofer IDMT 8 Processing Workflow for Editor Support Use Case 1. Crawler feeds media items to platform e.g., videos from Greenpeace channel, news articles, internal archive…. 2. Analysis results are stored as Linked Data Currently: using Apache Marmotta as triplestore
  • 9. © Fraunhofer IDMT 9 Processing Workflow for Editor Support Use Case 3. Item gets into Focus (someone is writing an article on a specific topic) 4. Focused item is fed to the same platform, using customized pipelines 5. Results are processed by LD matching, i.e Recommendation component.. 6. … enriched by stored annotations … 7. And returned to editing platform
  • 10. © Fraunhofer IDMT 10 Recommendation Workflow in MICO
  • 11. © Fraunhofer IDMT 11 Ontology Matching  Semantic distance of entities important for analysis  Animal detection: “I see a bird on the tree”, “Yes it’s definitely a sparrow”  Topic detection: Wind energy, sustainability, renewables, oil, …  Non-Mico:  Genres! (e.g., currently 1400 genres on Spotify…)  Not less controversial: movie genres
  • 12. © Fraunhofer IDMT 12 Evaluation  How to evaluate a recommender system?  Forget about precision and recall!  Observing user behaviour  Online experiment  Offline: extrapolating past user behavior  Lab Studies / Questionnaire  Open problem: standardized + comparable testing of recommender systems
  • 13. © Fraunhofer IDMT 13 Conclusions  Proposed setting tries to formalize a certain subset of recommendation problems: Cross media recommendation  Focused on Media Items  Profits from Content Analysis  New items, have a high contextual dependency on existing items, that will be exploited for recommendation  Work in Progress:  Stable release of Mico-platform and extractors: right now  PoC recommendation code will be integrated in the platform  Demo Services: available in June  Incorporate feedback into recommendation
  • 14. © Fraunhofer IDMT T. Köllmer†, E. Berndl‡, T. Weißgerber‡, P. Aichroth† , H. Kosch‡ A Workflow for Cross Media Recommendations based on Linked Data Analysis † Fraunhofer IDMT ‡ University of Passau Thank you! Questions?