SlideShare a Scribd company logo
Sophia Antipolis, France, October 29-31, 2018
NewsREEL Multimedia at MediaEval 2018:
News Recommendation with Image and Text Content
Andreas Lommatzsch, Benjamin Kille,
Frank Hopfgartner, Torben Brodt
• Recommender systems help users to find relevant items / matching the
individual preferences
• Can be applied in a growing number of domains
• Online shops
• Videos and movie portals
• Social media
• News Portals
• Scenarios may differ with respect to
• Set of users (explicit profiles, sessions, anonymous users; context data)
• Set of items (static / dynamic, collection size)
• Available user-item interactions and content
(ratings, reviews, impressions, sales)
• Metrics
Motivation
Recommender Systems
• Images have an strong influence on our
perception
Living Labs and Stream-based Evaluation are on the
rise. A recent inquiry of recommendation experts
reveals shockingly low levels of confidence in
offline evaluation.
9 out of 10 experts agree that existing protocols can
Recommendation #1
Recommended Article
Headline
Recommendation #2
Recommended Article
Headline
Recommendation #3
Recommended Article
Headline
Recommendation #4
Recommended Article
Headline
The TUB presents NewsREEL at ECIR 2016
Motivation
Multi-Media in Recommender Systems
• Recommender Research focuses on Collaborative Filtering
• Text content-based approaches
• Not much research how to use
multimedia content in recommender systems
• NewsREEL Multimedia addresses this challenge
• Real-world scenario
• Dataset
• Baseline
• Evaluation Tools
Motivation
Objectives of NewsREEL Multimedia
Living Labs and Stream-based Evaluation are on the
rise. A recent inquiry of recommendation experts
reveals shockingly low levels of confidence in
offline evaluation.
9 out of 10 experts agree that existing protocols can
provide spurious results. They suggest to rely on
socalled „real“ user feedback. Such feedback can be
obtained in living labs such as clef Newsreel.
Researchers connect their servers with the Open
Recommendation Platform (http://orp.plista.com). They
Recommendation #1
Recommended Article
Headline
Recommendation #2
Recommended Article
Headline
Recommendation #3
Recommended Article
Headline
Recommendation #4
Recommended Article
Headline
The TUB presents NewsREEL at ECIR 2016
Recommendation A: Recommended Article Headline
NewsREEL scenario
• Recommend interesting news on
several different German News portals
• News items consist of
• The headline [text]
• A snippet [text]
• An image [URL]
• Additional meta data could be provided
• User-Item interactions
• Impressions
• Click on recommendations
Scenario
NewsREEL
Living Labs and Stream-based Evaluation are on the
rise. A recent inquiry of recommendation experts
reveals shockingly low levels of confidence in
offline evaluation.
9 out of 10 experts agree that existing protocols can
provide spurious results. They suggest to rely on
socalled „real“ user feedback. Such feedback can be
obtained in living labs such as clef Newsreel.
Researchers connect their servers with the Open
Recommendation Platform (http://orp.plista.com). They
Recommendation #1
Recommended Article
Headline
Recommendation #2
Recommended Article
Headline
Recommendation #3
Recommended Article
Headline
Recommendation #4
Recommended Article
Headline
The TUB presents NewsREEL at ECIR 2016
Recommendation A: Recommended Article Headline
Scenario
NewsREEL
The NewsREEL architecture
• Recommender teams
register for the challenge
• Receive requests that
must be answered with
recommendations
• Evaluation based on
live user feedback
• An offline version is provided
• Replay data and bin-based
• Long-term statistics
• No real time constraints
Scenario
NewsREEL
• Motivation
• The Dataset and Statistics
• Evaluation Metrics
• Discussion and Conclusions
Outline
Structure of the Presentation
NewsREEL MultiMedia
• Offline dataset
(image processing is still time consuming)
• Aggregated data on a weekly basis
• The dataset consists of
• Item data (headline, snippets, imageURL)
• Image labels for most items
(computed with 6 different configurations)
• A low level image representation
• Impression statistic (training weeks)
• Potentially relevant items (test weeks)
Dataset
NewsREEL Multimedia
Image
labels
itemIDs
in 12 bins
(weeks)
#impressions
Text
data
NewsREEL MultiMedia – dataset statistic
• Size: 200 MB (txt/csv) ;
8.5G low level features
• Number of pairs (items, week): 311k
• #tupels (item, config, label, score): 3Mio
• 7 image annotator configurations:
Dataset
NewsREEL Multimedia
Image
labels
itemIDs
#impressions
Textual
data
Dataset
NewsREEL Multimedia
• publishers exhibit different engagement patterns
• impressions show a power law distributions:
• few articles accumulate majority of impressions
• many articles get few impressions
• Motivation
• The Dataset and Statistics
• Evaluation Metrics
• Discussion and Conclusions
Outline
Structure of the Presentation
Task: Predict the most popular items for the test weeks
• What are appropriate metrics?
• Precision@10
• Precision@N
• Average Precision
• RMSE / MAE
NewsREEL MultiMedia
Evaluation
Task: Predict the most popular items for the test weeks
NewsREEL MultiMedia
Evaluation
f(text, image)  score; sort(score)
Article : #Impressions
438737736 : 342,428
438007519 : 340,840
437351823 : 268,656
437515691 : 253,596
437560377 : 203,943
439089552 : 169,430
439128028 : 160,133
438283775 : 152,881
436755527 : 145,553
437918358 : 127,624
week 0 | publisher 17614
Article : #Impressions
438737736 : -
438007519 : +
437351823 : -
437515691 : -
437560377 : -
439089552 : +
439128028 : +
438283775 : -
436755527 : -
437918358 : -
p@10 = 0.3
Evaluation Scheme
NewsREEL MultiMedia
Evaluation
w0 w2 w4 w6 w8 w10 w12
Train Train
Test Test
• Precision@10
+ Fits best with the application scenario
 Unstable, hard metric, small codomain {0.0, 0.1, .., 1.0}
• Precision@N
• Generalization of Precision@10
+ Overcomes the problem of small codomain,
weak recommender can reach a score > 0
• Find an appropriate N; we decided to use N = 0.1 * #items
• Average Precision@N
• Precision does not consider the order of the elements
+ Average Precision overcomes this problem
 More difficult to interpret
• RMSE / MAE
• Common metric is recommender system
 Inappropriate due to the power law
distribution of the #impressions
NewsREEL MultiMedia
Evaluation-Metrics
• Motivation
• The Dataset and statistics
• Evaluation Metrics
• Discussion and Conclusions
Outline
Structure of the Presentation
• Problem setting  ranking optimisation
• Dataset characteristics
• impressions
• preprocessed images
• texts
• own pipeline can be used
• Evaluation scheme  precision@10, precision@10%
• 8 registrations
• 1 submission ( how to improve?)
• To continue with
• baseline (presented remotely)
• contribution (presented remotely)
Conclusion
?
Questions?
• The code is available on request (Java/Maven)
Contact

More Related Content

Similar to MediaEval 2018: NewsREEL Multimedia at MediaEval 2018: News Recommendation with Image and Text Content

Digital disruption in information industries
Digital disruption in information industriesDigital disruption in information industries
Digital disruption in information industries
Sumeet Rohatgi
 
Week2 chapters1 3
Week2 chapters1 3Week2 chapters1 3
Week2 chapters1 3
Michael Germano
 
Supercharge Your Corporate Dashboards With UX Analytics
Supercharge Your Corporate Dashboards With UX AnalyticsSupercharge Your Corporate Dashboards With UX Analytics
Supercharge Your Corporate Dashboards With UX Analytics
UserZoom
 
Crafting a Compelling Data Science Resume
Crafting a Compelling Data Science ResumeCrafting a Compelling Data Science Resume
Crafting a Compelling Data Science Resume
Arushi Prakash, Ph.D.
 
Advanced Project Data Analytics for Improved Project Delivery
Advanced Project Data Analytics for Improved Project DeliveryAdvanced Project Data Analytics for Improved Project Delivery
Advanced Project Data Analytics for Improved Project Delivery
Mark Constable
 
IdeaScreen 2013
IdeaScreen 2013IdeaScreen 2013
IdeaScreen 2013
Vision Critical
 
Agile data science
Agile data scienceAgile data science
Agile data science
Joel Horwitz
 
A quick overview of Eaagle
A quick overview of EaagleA quick overview of Eaagle
A quick overview of Eaagle
Eaagle
 
Using analytics in ux design my view
Using analytics in ux design   my viewUsing analytics in ux design   my view
Using analytics in ux design my view
Outi Aramo
 
In metrics we trust?
In metrics we trust?In metrics we trust?
In metrics we trust?
ORCID, Inc
 
Presentation quaest strategic sourcing software
Presentation quaest strategic sourcing softwarePresentation quaest strategic sourcing software
Presentation quaest strategic sourcing software
Bernard ARRATEIG
 
Brainstorming session - Providing top-notch (online) research that clients ca...
Brainstorming session - Providing top-notch (online) research that clients ca...Brainstorming session - Providing top-notch (online) research that clients ca...
Brainstorming session - Providing top-notch (online) research that clients ca...
Merlien Institute
 
Telling the Full Story: Adding Qualitative Data To Executive Dashboards
Telling the Full Story: Adding Qualitative Data To Executive DashboardsTelling the Full Story: Adding Qualitative Data To Executive Dashboards
Telling the Full Story: Adding Qualitative Data To Executive Dashboards
UserZoom
 
Research and Community Building with a Roadmap
Research and Community Building with a RoadmapResearch and Community Building with a Roadmap
Research and Community Building with a Roadmap
QuestionPro
 
2014 CrossRef Annual Meeting Peer Review Panel: PRE: Securing Trust & Transpa...
2014 CrossRef Annual Meeting Peer Review Panel: PRE: Securing Trust & Transpa...2014 CrossRef Annual Meeting Peer Review Panel: PRE: Securing Trust & Transpa...
2014 CrossRef Annual Meeting Peer Review Panel: PRE: Securing Trust & Transpa...
Crossref
 
Large language models in higher education
Large language models in higher educationLarge language models in higher education
Large language models in higher education
Peter Trkman
 
UX research
UX researchUX research
UX research
Billy Choi
 
See to believe: capturing insights using contextual inquiry
See to believe: capturing insights using contextual inquirySee to believe: capturing insights using contextual inquiry
See to believe: capturing insights using contextual inquiry
Deirdre Costello
 
Elsevier
ElsevierElsevier
Elsevier
Christina Azzam
 
Optimising Your Content for findability
Optimising Your Content for findabilityOptimising Your Content for findability
Optimising Your Content for findability
Kristian Norling
 

Similar to MediaEval 2018: NewsREEL Multimedia at MediaEval 2018: News Recommendation with Image and Text Content (20)

Digital disruption in information industries
Digital disruption in information industriesDigital disruption in information industries
Digital disruption in information industries
 
Week2 chapters1 3
Week2 chapters1 3Week2 chapters1 3
Week2 chapters1 3
 
Supercharge Your Corporate Dashboards With UX Analytics
Supercharge Your Corporate Dashboards With UX AnalyticsSupercharge Your Corporate Dashboards With UX Analytics
Supercharge Your Corporate Dashboards With UX Analytics
 
Crafting a Compelling Data Science Resume
Crafting a Compelling Data Science ResumeCrafting a Compelling Data Science Resume
Crafting a Compelling Data Science Resume
 
Advanced Project Data Analytics for Improved Project Delivery
Advanced Project Data Analytics for Improved Project DeliveryAdvanced Project Data Analytics for Improved Project Delivery
Advanced Project Data Analytics for Improved Project Delivery
 
IdeaScreen 2013
IdeaScreen 2013IdeaScreen 2013
IdeaScreen 2013
 
Agile data science
Agile data scienceAgile data science
Agile data science
 
A quick overview of Eaagle
A quick overview of EaagleA quick overview of Eaagle
A quick overview of Eaagle
 
Using analytics in ux design my view
Using analytics in ux design   my viewUsing analytics in ux design   my view
Using analytics in ux design my view
 
In metrics we trust?
In metrics we trust?In metrics we trust?
In metrics we trust?
 
Presentation quaest strategic sourcing software
Presentation quaest strategic sourcing softwarePresentation quaest strategic sourcing software
Presentation quaest strategic sourcing software
 
Brainstorming session - Providing top-notch (online) research that clients ca...
Brainstorming session - Providing top-notch (online) research that clients ca...Brainstorming session - Providing top-notch (online) research that clients ca...
Brainstorming session - Providing top-notch (online) research that clients ca...
 
Telling the Full Story: Adding Qualitative Data To Executive Dashboards
Telling the Full Story: Adding Qualitative Data To Executive DashboardsTelling the Full Story: Adding Qualitative Data To Executive Dashboards
Telling the Full Story: Adding Qualitative Data To Executive Dashboards
 
Research and Community Building with a Roadmap
Research and Community Building with a RoadmapResearch and Community Building with a Roadmap
Research and Community Building with a Roadmap
 
2014 CrossRef Annual Meeting Peer Review Panel: PRE: Securing Trust & Transpa...
2014 CrossRef Annual Meeting Peer Review Panel: PRE: Securing Trust & Transpa...2014 CrossRef Annual Meeting Peer Review Panel: PRE: Securing Trust & Transpa...
2014 CrossRef Annual Meeting Peer Review Panel: PRE: Securing Trust & Transpa...
 
Large language models in higher education
Large language models in higher educationLarge language models in higher education
Large language models in higher education
 
UX research
UX researchUX research
UX research
 
See to believe: capturing insights using contextual inquiry
See to believe: capturing insights using contextual inquirySee to believe: capturing insights using contextual inquiry
See to believe: capturing insights using contextual inquiry
 
Elsevier
ElsevierElsevier
Elsevier
 
Optimising Your Content for findability
Optimising Your Content for findabilityOptimising Your Content for findability
Optimising Your Content for findability
 

More from multimediaeval

Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
multimediaeval
 
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
multimediaeval
 
Sports Video Classification: Classification of Strokes in Table Tennis for Me...
Sports Video Classification: Classification of Strokes in Table Tennis for Me...Sports Video Classification: Classification of Strokes in Table Tennis for Me...
Sports Video Classification: Classification of Strokes in Table Tennis for Me...
multimediaeval
 
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
multimediaeval
 
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
Essex-NLIP at MediaEval Predicting Media Memorability 2020 TaskEssex-NLIP at MediaEval Predicting Media Memorability 2020 Task
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
multimediaeval
 
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
multimediaeval
 
Fooling an Automatic Image Quality Estimator
Fooling an Automatic Image Quality EstimatorFooling an Automatic Image Quality Estimator
Fooling an Automatic Image Quality Estimator
multimediaeval
 
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
multimediaeval
 
Pixel Privacy: Quality Camouflage for Social Images
Pixel Privacy: Quality Camouflage for Social ImagesPixel Privacy: Quality Camouflage for Social Images
Pixel Privacy: Quality Camouflage for Social Images
multimediaeval
 
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-MatchingHCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
multimediaeval
 
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
multimediaeval
 
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
multimediaeval
 
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
multimediaeval
 
Deep Conditional Adversarial learning for polyp Segmentation
Deep Conditional Adversarial learning for polyp SegmentationDeep Conditional Adversarial learning for polyp Segmentation
Deep Conditional Adversarial learning for polyp Segmentation
multimediaeval
 
A Temporal-Spatial Attention Model for Medical Image Detection
A Temporal-Spatial Attention Model for Medical Image DetectionA Temporal-Spatial Attention Model for Medical Image Detection
A Temporal-Spatial Attention Model for Medical Image Detection
multimediaeval
 
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
multimediaeval
 
Fine-tuning for Polyp Segmentation with Attention
Fine-tuning for Polyp Segmentation with AttentionFine-tuning for Polyp Segmentation with Attention
Fine-tuning for Polyp Segmentation with Attention
multimediaeval
 
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
multimediaeval
 
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
multimediaeval
 
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
 Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ... Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
multimediaeval
 

More from multimediaeval (20)

Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
 
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
 
Sports Video Classification: Classification of Strokes in Table Tennis for Me...
Sports Video Classification: Classification of Strokes in Table Tennis for Me...Sports Video Classification: Classification of Strokes in Table Tennis for Me...
Sports Video Classification: Classification of Strokes in Table Tennis for Me...
 
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
 
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
Essex-NLIP at MediaEval Predicting Media Memorability 2020 TaskEssex-NLIP at MediaEval Predicting Media Memorability 2020 Task
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
 
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
 
Fooling an Automatic Image Quality Estimator
Fooling an Automatic Image Quality EstimatorFooling an Automatic Image Quality Estimator
Fooling an Automatic Image Quality Estimator
 
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
 
Pixel Privacy: Quality Camouflage for Social Images
Pixel Privacy: Quality Camouflage for Social ImagesPixel Privacy: Quality Camouflage for Social Images
Pixel Privacy: Quality Camouflage for Social Images
 
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-MatchingHCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
 
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
 
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
 
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
 
Deep Conditional Adversarial learning for polyp Segmentation
Deep Conditional Adversarial learning for polyp SegmentationDeep Conditional Adversarial learning for polyp Segmentation
Deep Conditional Adversarial learning for polyp Segmentation
 
A Temporal-Spatial Attention Model for Medical Image Detection
A Temporal-Spatial Attention Model for Medical Image DetectionA Temporal-Spatial Attention Model for Medical Image Detection
A Temporal-Spatial Attention Model for Medical Image Detection
 
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
 
Fine-tuning for Polyp Segmentation with Attention
Fine-tuning for Polyp Segmentation with AttentionFine-tuning for Polyp Segmentation with Attention
Fine-tuning for Polyp Segmentation with Attention
 
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
 
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
 
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
 Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ... Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
 

Recently uploaded

Eukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptxEukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptx
RitabrataSarkar3
 
molar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptxmolar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptx
Anagha Prasad
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
Sharon Liu
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
MAGOTI ERNEST
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
Sérgio Sacani
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
by6843629
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills MN
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
HongcNguyn6
 
Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
muralinath2
 
Immersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths ForwardImmersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths Forward
Leonel Morgado
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
kejapriya1
 
SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
KrushnaDarade1
 
Applied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdfApplied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdf
University of Hertfordshire
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
Sérgio Sacani
 
Medical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptxMedical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptx
terusbelajar5
 
NuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyerNuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyer
pablovgd
 
Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...
Leonel Morgado
 
The debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically youngThe debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically young
Sérgio Sacani
 
aziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobelaziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobel
İsa Badur
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
TinyAnderson
 

Recently uploaded (20)

Eukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptxEukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptx
 
molar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptxmolar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptx
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
 
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
Travis Hills' Endeavors in Minnesota: Fostering Environmental and Economic Pr...
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
 
Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
 
Immersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths ForwardImmersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths Forward
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
 
SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
 
Applied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdfApplied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdf
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
 
Medical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptxMedical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptx
 
NuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyerNuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyer
 
Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...
 
The debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically youngThe debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically young
 
aziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobelaziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobel
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
 

MediaEval 2018: NewsREEL Multimedia at MediaEval 2018: News Recommendation with Image and Text Content

  • 1. Sophia Antipolis, France, October 29-31, 2018 NewsREEL Multimedia at MediaEval 2018: News Recommendation with Image and Text Content Andreas Lommatzsch, Benjamin Kille, Frank Hopfgartner, Torben Brodt
  • 2. • Recommender systems help users to find relevant items / matching the individual preferences • Can be applied in a growing number of domains • Online shops • Videos and movie portals • Social media • News Portals • Scenarios may differ with respect to • Set of users (explicit profiles, sessions, anonymous users; context data) • Set of items (static / dynamic, collection size) • Available user-item interactions and content (ratings, reviews, impressions, sales) • Metrics Motivation Recommender Systems
  • 3. • Images have an strong influence on our perception Living Labs and Stream-based Evaluation are on the rise. A recent inquiry of recommendation experts reveals shockingly low levels of confidence in offline evaluation. 9 out of 10 experts agree that existing protocols can Recommendation #1 Recommended Article Headline Recommendation #2 Recommended Article Headline Recommendation #3 Recommended Article Headline Recommendation #4 Recommended Article Headline The TUB presents NewsREEL at ECIR 2016 Motivation Multi-Media in Recommender Systems
  • 4. • Recommender Research focuses on Collaborative Filtering • Text content-based approaches • Not much research how to use multimedia content in recommender systems • NewsREEL Multimedia addresses this challenge • Real-world scenario • Dataset • Baseline • Evaluation Tools Motivation Objectives of NewsREEL Multimedia Living Labs and Stream-based Evaluation are on the rise. A recent inquiry of recommendation experts reveals shockingly low levels of confidence in offline evaluation. 9 out of 10 experts agree that existing protocols can provide spurious results. They suggest to rely on socalled „real“ user feedback. Such feedback can be obtained in living labs such as clef Newsreel. Researchers connect their servers with the Open Recommendation Platform (http://orp.plista.com). They Recommendation #1 Recommended Article Headline Recommendation #2 Recommended Article Headline Recommendation #3 Recommended Article Headline Recommendation #4 Recommended Article Headline The TUB presents NewsREEL at ECIR 2016 Recommendation A: Recommended Article Headline
  • 5. NewsREEL scenario • Recommend interesting news on several different German News portals • News items consist of • The headline [text] • A snippet [text] • An image [URL] • Additional meta data could be provided • User-Item interactions • Impressions • Click on recommendations Scenario NewsREEL Living Labs and Stream-based Evaluation are on the rise. A recent inquiry of recommendation experts reveals shockingly low levels of confidence in offline evaluation. 9 out of 10 experts agree that existing protocols can provide spurious results. They suggest to rely on socalled „real“ user feedback. Such feedback can be obtained in living labs such as clef Newsreel. Researchers connect their servers with the Open Recommendation Platform (http://orp.plista.com). They Recommendation #1 Recommended Article Headline Recommendation #2 Recommended Article Headline Recommendation #3 Recommended Article Headline Recommendation #4 Recommended Article Headline The TUB presents NewsREEL at ECIR 2016 Recommendation A: Recommended Article Headline
  • 7. The NewsREEL architecture • Recommender teams register for the challenge • Receive requests that must be answered with recommendations • Evaluation based on live user feedback • An offline version is provided • Replay data and bin-based • Long-term statistics • No real time constraints Scenario NewsREEL
  • 8. • Motivation • The Dataset and Statistics • Evaluation Metrics • Discussion and Conclusions Outline Structure of the Presentation
  • 9. NewsREEL MultiMedia • Offline dataset (image processing is still time consuming) • Aggregated data on a weekly basis • The dataset consists of • Item data (headline, snippets, imageURL) • Image labels for most items (computed with 6 different configurations) • A low level image representation • Impression statistic (training weeks) • Potentially relevant items (test weeks) Dataset NewsREEL Multimedia Image labels itemIDs in 12 bins (weeks) #impressions Text data
  • 10. NewsREEL MultiMedia – dataset statistic • Size: 200 MB (txt/csv) ; 8.5G low level features • Number of pairs (items, week): 311k • #tupels (item, config, label, score): 3Mio • 7 image annotator configurations: Dataset NewsREEL Multimedia Image labels itemIDs #impressions Textual data
  • 11. Dataset NewsREEL Multimedia • publishers exhibit different engagement patterns • impressions show a power law distributions: • few articles accumulate majority of impressions • many articles get few impressions
  • 12. • Motivation • The Dataset and Statistics • Evaluation Metrics • Discussion and Conclusions Outline Structure of the Presentation
  • 13. Task: Predict the most popular items for the test weeks • What are appropriate metrics? • Precision@10 • Precision@N • Average Precision • RMSE / MAE NewsREEL MultiMedia Evaluation
  • 14. Task: Predict the most popular items for the test weeks NewsREEL MultiMedia Evaluation f(text, image)  score; sort(score) Article : #Impressions 438737736 : 342,428 438007519 : 340,840 437351823 : 268,656 437515691 : 253,596 437560377 : 203,943 439089552 : 169,430 439128028 : 160,133 438283775 : 152,881 436755527 : 145,553 437918358 : 127,624 week 0 | publisher 17614 Article : #Impressions 438737736 : - 438007519 : + 437351823 : - 437515691 : - 437560377 : - 439089552 : + 439128028 : + 438283775 : - 436755527 : - 437918358 : - p@10 = 0.3
  • 15. Evaluation Scheme NewsREEL MultiMedia Evaluation w0 w2 w4 w6 w8 w10 w12 Train Train Test Test
  • 16. • Precision@10 + Fits best with the application scenario  Unstable, hard metric, small codomain {0.0, 0.1, .., 1.0} • Precision@N • Generalization of Precision@10 + Overcomes the problem of small codomain, weak recommender can reach a score > 0 • Find an appropriate N; we decided to use N = 0.1 * #items • Average Precision@N • Precision does not consider the order of the elements + Average Precision overcomes this problem  More difficult to interpret • RMSE / MAE • Common metric is recommender system  Inappropriate due to the power law distribution of the #impressions NewsREEL MultiMedia Evaluation-Metrics
  • 17. • Motivation • The Dataset and statistics • Evaluation Metrics • Discussion and Conclusions Outline Structure of the Presentation
  • 18. • Problem setting  ranking optimisation • Dataset characteristics • impressions • preprocessed images • texts • own pipeline can be used • Evaluation scheme  precision@10, precision@10% • 8 registrations • 1 submission ( how to improve?) • To continue with • baseline (presented remotely) • contribution (presented remotely) Conclusion
  • 20. • The code is available on request (Java/Maven) Contact

Editor's Notes

  1. Distinct number of items: 52k
  2. Distinct number of items: 52k