Can AI say from our eyes when we read relevant information?

Nilavra Bhattacharya
Nilavra BhattacharyaPhD Student at The University of Texas at Austin
EYE TRACKING
COMPUTER VISION
INFORMATION
RELEVANCE
Can AI say from our eyes
when we read relevant information?
Nilavra Bhattacharya1, Somnath Rakshit1, Jacek Gwizdka1, Paul Kogut2
ACM SIGIR CHIIR 2020 • VANCOUVER VIRTUAL
RELEVANCE PREDICTION
FROM EYE MOVEMENTS
Using Semi-interpretable Convolutional Neural Networks
1 School of Information, The University of Texas at Austin
2 Rotary and Mission Systems, Lockheed Martin Corporation
ixlab.ischool.utexas.edu
INTRODUCTION
Introduction User Study Scanpath Image Classification Interpretability Conclusion
Two Worlds in the Information Field
Introduction User Study Scanpath Image Classification Interpretability Conclusion
Image: Tefko Saracevic (https://studylib.net/doc/15399702)
• Situational relevance or utility:
“situationally relevant items of
information are those that
answer, or logically help to
answer, questions of concern”
(Wilson, 1973)
• This work: situational relevance
= users’ perceived-relevance of
the documents they examine for
answering a question
Image: https://www.noldus.com/applications/eye-tracking-
physiology
Eye-tracking
Background: Eye-tracking & Information Relevance
Introduction User Study Scanpath Image Classification Interpretability Conclusion
• Drawback 1: aggregate ET data at stimulus / trial / participant level
• aggregated fixation counts/durations (Fahey+ 2011; Frey+ 2013; Gwizdka 2014; Loboda+ 2011; Puolamäki+ 2008; Wenzel+ 2017; Wittek+ 2016)
• reading related preprocessing before aggregation (Buscher+ 2008; 2012; Gwizdka, 2014a; 2014b, 2017; Gwizdka+ 2017)
• ET features from 2-second windows near the end of trial has more discriminating power (Gwizdka+ 2017)
=> collapsing ET data leads to loss of information
• Drawback 2: lack of standard feature selection => varied prediction performance; accuracy rarely above 70%
(Simola+ 2008; Slanzi+ 2017; Wenzel+ 2017; Gwizdka+ 2017);
Eye Movement Scanpath 1 Eye Movement Scanpath 2
Similar? Different?
How Much?
Background: Convolutional Neural Networks
Introduction User Study Scanpath Image Classification Interpretability Conclusion
• image classification is a major application of CNNs
• take an input image and predict a label for the image (e.g. “cat” or “dog”?)
• transfer learning: training received by a CNN for solving one task can be re-used to solve another related task
• e.g. training from cat/dog classifier can be re-used to classify traffic symbols
• benchmark CNN models, pre-trained on millions of images for classification tasks (ImageNet challenge) are readily available
• e.g. VGG, ResNet, DenseNet, etc.
Image: https://towardsdatascience.com/covolutional-neural-network-cb0883dd6529
Proposed Approach
Introduction User Study Scanpath Image Classification Interpretability Conclusion
Image: https://dev.to/frosnerd/handwritten-digit-recognition-using-convolutional-neural-networks-11g0
Scanpath - Image CNN Image Classifier
User
Perceived
Relevant?
Prediction
Eye movement
Scanpath
EYE-TRACKING
USER STUDY
Introduction User Study Scanpath Image Classification Interpretability Conclusion
Experimental Design
Introduction User Study Scanpath Image Classification Interpretability Conclusion
• Participants (N = 25, college-age students)
Example Trigger Q: The submarine Kursk was part of which Russian fleet?
Perceived Relevant Perceived Irrelevant
Trigger
Question
TREC 2005
Q&A Task
Spacebar
Relevance
Judgement
(binary)
Y/N then
Spacebar
+
1s
Short News
Article
AQUAINT Corpus
of English News
Text
+
Fixation
>= 2s
GENERATING
SCANPATH-IMAGES
Introduction User Study Scanpath Image Classification Interpretability Conclusion
Generating Scanpath-Images
Introduction User Study Scanpath Image Classification Interpretability Conclusion
SCANPATH - IMAGESCANPATH
Encode three attributes of eye fixations:
1. fixation location
2. fixation duration
3. fixation start time, for temporal ordering
Generating Scanpath-Images: Fixation Duration
Introduction User Study Scanpath Image Classification Interpretability Conclusion
Generating Scanpath-Images: Fixation Start Time
Introduction User Study Scanpath Image Classification Interpretability Conclusion
First
Saccade
Last
Saccade
Matplotlib’s winter colourmap
• each linearized saccade has a solid colour
Saccade Colour
Scanpath-Images
Introduction User Study Scanpath Image Classification Interpretability Conclusion
PERCEIVED RELEVANT PERCEIVED IRRELEVANT
SCANPATH-IMAGE
CLASSIFICATION
Introduction User Study Scanpath Image Classification Interpretability Conclusion
Scanpath-Image Classification
Introduction User Study Scanpath Image Classification Interpretability Conclusion
Given only the scanpath-image of a user’s eye movements on
the news article, predict if the user perceived the article to be
relevant for answering the trigger question.
Image: https://dev.to/frosnerd/handwritten-digit-recognition-using-convolutional-neural-networks-11g0
Scanpath - Image CNN Image Classifier
Perceived
Relevance
Prediction
Scanpath-Image Classification: Neural Network Architecture
Introduction User Study Scanpath Image Classification Interpretability Conclusion
Final hyperparameters:
epochs: 6, batch-size: 16, momentum: 0.9
Shallow(er) models:
VGG16, VGG19
Really Deep Models:
ResNet50
DenseNet121, DenseNet201
InceptionResNetV2
Optimizer: Stochastic Gradient Descent (SGD) with momentum
Pre-trained
CNN model
(ImageNet
Weights)
Fully
Connected
Layer
(256 nodes, ReLU,
with/without L1L2)
Dropout
(prob = 0.2)
Output Layer
(1 node, Sigmoid)
Scanpath-Image Classification: Results
Introduction User Study Scanpath Image Classification Interpretability Conclusion
For this specific task:
• Models do not overfit
• Shallow models classify better than deep models
Shallow
Deep
Table 1 from paper
CNN PREDICTION
INTERPRETABILITY
Introduction User Study Scanpath Image Classification Interpretability Conclusion
Attempt to Interpret CNN Predictions
Introduction User Study Scanpath Image Classification Interpretability Conclusion
Gradient-Weighted Class Activation Mapping (Grad-CAM)
Original Image CAM for “Cat” classCAM for “Dog” class
2017 IEEE International Conference on Computer Vision
Attempt to Interpret CNN Predictions
Introduction User Study Scanpath Image Classification Interpretability Conclusion
SCANPATH CLASS ACTIVATION MAP (CAM) AVERAGE CAM
Across all scanpath-images in this relevance class
Perceived Irrelevant
Perceived Relevant
CONCLUSION
Introduction User Study Scanpath Image Classification Interpretability Conclusion
Conclusion
Limitations:
• very simple information search task
• short texts of similar type
• relatively uniform group of participants (college-age
students)
Future Directions:
• complex scenarios, e.g., freely searching on the open
web
• diverse participants, e.g., young vs. older adults
• Eye-movement scanpath-image
classification:
• no aggregate measures: all eye-tracking data is
used
• spatio-temporal aspects of eye-movements are
preserved
• knowledge of screen content not needed
• additional insights (e.g. reading / scanning) not
needed
• Proof of concept:
• promising results, even with small dataset, without
overfitting
• CNNs trained for a different task can detect
patterns in eye-movements which are concordant
with prior literature
Introduction User Study Scanpath Image Classification Interpretability Conclusion
Acknowledgements
Student Travel Grant
Experimental Design Contribution,
Data Collection
Prof. Bradley Hatfield
Dr. Rodolphe Gentili
Dr. Joe Dien
Hyuk Oh
Kyle James Jaquess
Li-Chuan Lo
Department of Kinesiology,
University of Maryland, College Park
For inspiration:
Blog post on using mouse
trajectories for fraud detection
Gleb Esman
Splunk Inc.
THANK
YOU
@NilavraBnilavra@ieee.org ixlab.ischool.utexas.edu
Full paper:
https://dl.acm.org/doi/10.1145/3343413.3377960
https://arxiv.org/abs/2001.05152
1 of 24

Recommended

AlexNet by
AlexNetAlexNet
AlexNetBertil Hatt
2.9K views55 slides
Devil in the Details: Analysing the Performance of ConvNet Features by
Devil in the Details: Analysing the Performance of ConvNet FeaturesDevil in the Details: Analysing the Performance of ConvNet Features
Devil in the Details: Analysing the Performance of ConvNet FeaturesKen Chatfield
1.7K views27 slides
Scene classification using Convolutional Neural Networks - Jayani Withanawasam by
Scene classification using Convolutional Neural Networks - Jayani WithanawasamScene classification using Convolutional Neural Networks - Jayani Withanawasam
Scene classification using Convolutional Neural Networks - Jayani WithanawasamWithTheBest
1.4K views27 slides
Convolutional neural networks for image classification — evidence from Kaggle... by
Convolutional neural networks for image classification — evidence from Kaggle...Convolutional neural networks for image classification — evidence from Kaggle...
Convolutional neural networks for image classification — evidence from Kaggle...Dmytro Mishkin
4.8K views42 slides
Convolutional Neural Network for Alzheimer’s disease diagnosis with Neuroim... by
Convolutional Neural Network for Alzheimer’s disease diagnosis with Neuroim...Convolutional Neural Network for Alzheimer’s disease diagnosis with Neuroim...
Convolutional Neural Network for Alzheimer’s disease diagnosis with Neuroim...Seonho Park
3.5K views43 slides
A ROBUST CHAOTIC AND FAST WALSH TRANSFORM ENCRYPTION FOR GRAY SCALE BIOMEDICA... by
A ROBUST CHAOTIC AND FAST WALSH TRANSFORM ENCRYPTION FOR GRAY SCALE BIOMEDICA...A ROBUST CHAOTIC AND FAST WALSH TRANSFORM ENCRYPTION FOR GRAY SCALE BIOMEDICA...
A ROBUST CHAOTIC AND FAST WALSH TRANSFORM ENCRYPTION FOR GRAY SCALE BIOMEDICA...sipij
411 views22 slides

More Related Content

What's hot

[OSGeo-KR Tech Workshop] Deep Learning for Single Image Super-Resolution by
[OSGeo-KR Tech Workshop] Deep Learning for Single Image Super-Resolution[OSGeo-KR Tech Workshop] Deep Learning for Single Image Super-Resolution
[OSGeo-KR Tech Workshop] Deep Learning for Single Image Super-ResolutionTaegyun Jeon
3.2K views52 slides
DeepFix: a fully convolutional neural network for predicting human fixations... by
DeepFix:  a fully convolutional neural network for predicting human fixations...DeepFix:  a fully convolutional neural network for predicting human fixations...
DeepFix: a fully convolutional neural network for predicting human fixations...Universitat Politècnica de Catalunya
5.7K views34 slides
Self-supervised Learning from Video Sequences - Xavier Giro - UPC Barcelona 2019 by
Self-supervised Learning from Video Sequences - Xavier Giro - UPC Barcelona 2019Self-supervised Learning from Video Sequences - Xavier Giro - UPC Barcelona 2019
Self-supervised Learning from Video Sequences - Xavier Giro - UPC Barcelona 2019Universitat Politècnica de Catalunya
3K views64 slides
Parallel wisard object tracker a rambased tracking system by
Parallel wisard object tracker a rambased tracking systemParallel wisard object tracker a rambased tracking system
Parallel wisard object tracker a rambased tracking systemcseij
292 views13 slides
Convolutional Neural Network by
Convolutional Neural NetworkConvolutional Neural Network
Convolutional Neural NetworkJunho Cho
554 views151 slides
Deep Visual Saliency - Kevin McGuinness - UPC Barcelona 2017 by
Deep Visual Saliency - Kevin McGuinness - UPC Barcelona 2017Deep Visual Saliency - Kevin McGuinness - UPC Barcelona 2017
Deep Visual Saliency - Kevin McGuinness - UPC Barcelona 2017Universitat Politècnica de Catalunya
603 views32 slides

What's hot(20)

[OSGeo-KR Tech Workshop] Deep Learning for Single Image Super-Resolution by Taegyun Jeon
[OSGeo-KR Tech Workshop] Deep Learning for Single Image Super-Resolution[OSGeo-KR Tech Workshop] Deep Learning for Single Image Super-Resolution
[OSGeo-KR Tech Workshop] Deep Learning for Single Image Super-Resolution
Taegyun Jeon3.2K views
Parallel wisard object tracker a rambased tracking system by cseij
Parallel wisard object tracker a rambased tracking systemParallel wisard object tracker a rambased tracking system
Parallel wisard object tracker a rambased tracking system
cseij292 views
Convolutional Neural Network by Junho Cho
Convolutional Neural NetworkConvolutional Neural Network
Convolutional Neural Network
Junho Cho554 views
Transfer Learning and Fine-tuning Deep Neural Networks by PyData
 Transfer Learning and Fine-tuning Deep Neural Networks Transfer Learning and Fine-tuning Deep Neural Networks
Transfer Learning and Fine-tuning Deep Neural Networks
PyData10.5K views
Image Segmentation Using Deep Learning : A survey by NUPUR YADAV
Image Segmentation Using Deep Learning : A surveyImage Segmentation Using Deep Learning : A survey
Image Segmentation Using Deep Learning : A survey
NUPUR YADAV369 views
Image Captioning Generator using Deep Machine Learning by ijtsrd
Image Captioning Generator using Deep Machine LearningImage Captioning Generator using Deep Machine Learning
Image Captioning Generator using Deep Machine Learning
ijtsrd556 views
IRJET- Real-Time Object Detection using Deep Learning: A Survey by IRJET Journal
IRJET- Real-Time Object Detection using Deep Learning: A SurveyIRJET- Real-Time Object Detection using Deep Learning: A Survey
IRJET- Real-Time Object Detection using Deep Learning: A Survey
IRJET Journal70 views
Convolutional Neural Networks for Computer vision Applications by Alex Conway
Convolutional Neural Networks for Computer vision ApplicationsConvolutional Neural Networks for Computer vision Applications
Convolutional Neural Networks for Computer vision Applications
Alex Conway746 views
Dario izzo - Machine Learning methods and space engineering by Advanced-Concepts-Team
Dario izzo - Machine Learning methods and space engineeringDario izzo - Machine Learning methods and space engineering
Dario izzo - Machine Learning methods and space engineering
Low complexity features for jpeg steganalysis using undecimated dct by Pvrtechnologies Nellore
Low complexity features for jpeg steganalysis using undecimated dctLow complexity features for jpeg steganalysis using undecimated dct
Low complexity features for jpeg steganalysis using undecimated dct

Similar to Can AI say from our eyes when we read relevant information?

Obscenity Detection in Images by
Obscenity Detection in ImagesObscenity Detection in Images
Obscenity Detection in ImagesAnil Kumar Gupta
652 views29 slides
CNNs: from the Basics to Recent Advances by
CNNs: from the Basics to Recent AdvancesCNNs: from the Basics to Recent Advances
CNNs: from the Basics to Recent AdvancesDmytro Mishkin
3.9K views51 slides
Classification of Images Using CNN Model and its Variants by
Classification of Images Using CNN Model and its VariantsClassification of Images Using CNN Model and its Variants
Classification of Images Using CNN Model and its VariantsIRJET Journal
2 views7 slides
Image super resolution using Generative Adversarial Network. by
Image super resolution using Generative Adversarial Network.Image super resolution using Generative Adversarial Network.
Image super resolution using Generative Adversarial Network.IRJET Journal
3 views6 slides
D1L5 Visualization (D1L2 Insight@DCU Machine Learning Workshop 2017) by
D1L5 Visualization (D1L2 Insight@DCU Machine Learning Workshop 2017)D1L5 Visualization (D1L2 Insight@DCU Machine Learning Workshop 2017)
D1L5 Visualization (D1L2 Insight@DCU Machine Learning Workshop 2017)Universitat Politècnica de Catalunya
549 views43 slides
IRJET - Visual Question Answering – Implementation using Keras by
IRJET -  	  Visual Question Answering – Implementation using KerasIRJET -  	  Visual Question Answering – Implementation using Keras
IRJET - Visual Question Answering – Implementation using KerasIRJET Journal
20 views4 slides

Similar to Can AI say from our eyes when we read relevant information?(20)

CNNs: from the Basics to Recent Advances by Dmytro Mishkin
CNNs: from the Basics to Recent AdvancesCNNs: from the Basics to Recent Advances
CNNs: from the Basics to Recent Advances
Dmytro Mishkin3.9K views
Classification of Images Using CNN Model and its Variants by IRJET Journal
Classification of Images Using CNN Model and its VariantsClassification of Images Using CNN Model and its Variants
Classification of Images Using CNN Model and its Variants
IRJET Journal2 views
Image super resolution using Generative Adversarial Network. by IRJET Journal
Image super resolution using Generative Adversarial Network.Image super resolution using Generative Adversarial Network.
Image super resolution using Generative Adversarial Network.
IRJET Journal3 views
IRJET - Visual Question Answering – Implementation using Keras by IRJET Journal
IRJET -  	  Visual Question Answering – Implementation using KerasIRJET -  	  Visual Question Answering – Implementation using Keras
IRJET - Visual Question Answering – Implementation using Keras
IRJET Journal20 views
物件偵測與辨識技術 by CHENHuiMei
物件偵測與辨識技術物件偵測與辨識技術
物件偵測與辨識技術
CHENHuiMei368 views
improving Profile detection using Deep Learning by Sahil Kaw
improving Profile detection using Deep Learningimproving Profile detection using Deep Learning
improving Profile detection using Deep Learning
Sahil Kaw207 views
最近の研究情勢についていくために - Deep Learningを中心に - by Hiroshi Fukui
最近の研究情勢についていくために - Deep Learningを中心に - 最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に -
Hiroshi Fukui1.3K views
NUMBER PLATE IMAGE DETECTION FOR FAST MOTION VEHICLES USING BLUR KERNEL ESTIM... by paperpublications3
NUMBER PLATE IMAGE DETECTION FOR FAST MOTION VEHICLES USING BLUR KERNEL ESTIM...NUMBER PLATE IMAGE DETECTION FOR FAST MOTION VEHICLES USING BLUR KERNEL ESTIM...
NUMBER PLATE IMAGE DETECTION FOR FAST MOTION VEHICLES USING BLUR KERNEL ESTIM...
A Survey on Image Processing using CNN in Deep Learning by IRJET Journal
A Survey on Image Processing using CNN in Deep LearningA Survey on Image Processing using CNN in Deep Learning
A Survey on Image Processing using CNN in Deep Learning
IRJET Journal3 views
Scene recognition using Convolutional Neural Network by DhirajGidde
Scene recognition using Convolutional Neural NetworkScene recognition using Convolutional Neural Network
Scene recognition using Convolutional Neural Network
DhirajGidde193 views
Omni-Modeler: Rapid Adaptive Visual Recognition with Dynamic Learning by sipij
Omni-Modeler: Rapid Adaptive Visual Recognition with Dynamic LearningOmni-Modeler: Rapid Adaptive Visual Recognition with Dynamic Learning
Omni-Modeler: Rapid Adaptive Visual Recognition with Dynamic Learning
sipij3 views

Recently uploaded

domestic waste_100013.pptx by
domestic waste_100013.pptxdomestic waste_100013.pptx
domestic waste_100013.pptxpadmasriv25
10 views17 slides
PRINCIPLES-OF ASSESSMENT by
PRINCIPLES-OF ASSESSMENTPRINCIPLES-OF ASSESSMENT
PRINCIPLES-OF ASSESSMENTrbalmagro
9 views12 slides
"How can I develop my learning path in bioinformatics? by
"How can I develop my learning path in bioinformatics?"How can I develop my learning path in bioinformatics?
"How can I develop my learning path in bioinformatics?Bioinformy
17 views13 slides
scopus cited journals.pdf by
scopus cited journals.pdfscopus cited journals.pdf
scopus cited journals.pdfKSAravindSrivastava
5 views15 slides
MILK LIPIDS 2.pptx by
MILK LIPIDS 2.pptxMILK LIPIDS 2.pptx
MILK LIPIDS 2.pptxabhinambroze18
5 views15 slides
Class 2 (12 july).pdf by
  Class 2 (12 july).pdf  Class 2 (12 july).pdf
Class 2 (12 july).pdfclimber9977
8 views1 slide

Recently uploaded(20)

domestic waste_100013.pptx by padmasriv25
domestic waste_100013.pptxdomestic waste_100013.pptx
domestic waste_100013.pptx
padmasriv2510 views
PRINCIPLES-OF ASSESSMENT by rbalmagro
PRINCIPLES-OF ASSESSMENTPRINCIPLES-OF ASSESSMENT
PRINCIPLES-OF ASSESSMENT
rbalmagro9 views
"How can I develop my learning path in bioinformatics? by Bioinformy
"How can I develop my learning path in bioinformatics?"How can I develop my learning path in bioinformatics?
"How can I develop my learning path in bioinformatics?
Bioinformy17 views
Class 2 (12 july).pdf by climber9977
  Class 2 (12 july).pdf  Class 2 (12 july).pdf
Class 2 (12 july).pdf
climber99778 views
RemeOs science and clinical evidence by PetrusViitanen1
RemeOs science and clinical evidenceRemeOs science and clinical evidence
RemeOs science and clinical evidence
PetrusViitanen114 views
Company Fashion Show ChemAI 231116.pptx by Marco Tibaldi
Company Fashion Show ChemAI 231116.pptxCompany Fashion Show ChemAI 231116.pptx
Company Fashion Show ChemAI 231116.pptx
Marco Tibaldi69 views
Conventional and non-conventional methods for improvement of cucurbits.pptx by gandhi976
Conventional and non-conventional methods for improvement of cucurbits.pptxConventional and non-conventional methods for improvement of cucurbits.pptx
Conventional and non-conventional methods for improvement of cucurbits.pptx
gandhi97614 views
Max Welling ChemAI 231116.pptx by Marco Tibaldi
Max Welling ChemAI 231116.pptxMax Welling ChemAI 231116.pptx
Max Welling ChemAI 231116.pptx
Marco Tibaldi132 views
application of genetic engineering 2.pptx by SankSurezz
application of genetic engineering 2.pptxapplication of genetic engineering 2.pptx
application of genetic engineering 2.pptx
SankSurezz6 views
Gold Nanoparticle as novel Agent for Drug targeting (1).pptx by sakshijadhav9843
Gold Nanoparticle as novel Agent for Drug targeting (1).pptxGold Nanoparticle as novel Agent for Drug targeting (1).pptx
Gold Nanoparticle as novel Agent for Drug targeting (1).pptx
sakshijadhav984316 views
Distinct distributions of elliptical and disk galaxies across the Local Super... by Sérgio Sacani
Distinct distributions of elliptical and disk galaxies across the Local Super...Distinct distributions of elliptical and disk galaxies across the Local Super...
Distinct distributions of elliptical and disk galaxies across the Local Super...
Sérgio Sacani30 views
MODULE-9-Biotechnology, Genetically Modified Organisms, and Gene Therapy.pdf by KerryNuez1
MODULE-9-Biotechnology, Genetically Modified Organisms, and Gene Therapy.pdfMODULE-9-Biotechnology, Genetically Modified Organisms, and Gene Therapy.pdf
MODULE-9-Biotechnology, Genetically Modified Organisms, and Gene Therapy.pdf
KerryNuez119 views

Can AI say from our eyes when we read relevant information?

  • 1. EYE TRACKING COMPUTER VISION INFORMATION RELEVANCE Can AI say from our eyes when we read relevant information?
  • 2. Nilavra Bhattacharya1, Somnath Rakshit1, Jacek Gwizdka1, Paul Kogut2 ACM SIGIR CHIIR 2020 • VANCOUVER VIRTUAL RELEVANCE PREDICTION FROM EYE MOVEMENTS Using Semi-interpretable Convolutional Neural Networks 1 School of Information, The University of Texas at Austin 2 Rotary and Mission Systems, Lockheed Martin Corporation ixlab.ischool.utexas.edu
  • 3. INTRODUCTION Introduction User Study Scanpath Image Classification Interpretability Conclusion
  • 4. Two Worlds in the Information Field Introduction User Study Scanpath Image Classification Interpretability Conclusion Image: Tefko Saracevic (https://studylib.net/doc/15399702) • Situational relevance or utility: “situationally relevant items of information are those that answer, or logically help to answer, questions of concern” (Wilson, 1973) • This work: situational relevance = users’ perceived-relevance of the documents they examine for answering a question Image: https://www.noldus.com/applications/eye-tracking- physiology Eye-tracking
  • 5. Background: Eye-tracking & Information Relevance Introduction User Study Scanpath Image Classification Interpretability Conclusion • Drawback 1: aggregate ET data at stimulus / trial / participant level • aggregated fixation counts/durations (Fahey+ 2011; Frey+ 2013; Gwizdka 2014; Loboda+ 2011; Puolamäki+ 2008; Wenzel+ 2017; Wittek+ 2016) • reading related preprocessing before aggregation (Buscher+ 2008; 2012; Gwizdka, 2014a; 2014b, 2017; Gwizdka+ 2017) • ET features from 2-second windows near the end of trial has more discriminating power (Gwizdka+ 2017) => collapsing ET data leads to loss of information • Drawback 2: lack of standard feature selection => varied prediction performance; accuracy rarely above 70% (Simola+ 2008; Slanzi+ 2017; Wenzel+ 2017; Gwizdka+ 2017); Eye Movement Scanpath 1 Eye Movement Scanpath 2 Similar? Different? How Much?
  • 6. Background: Convolutional Neural Networks Introduction User Study Scanpath Image Classification Interpretability Conclusion • image classification is a major application of CNNs • take an input image and predict a label for the image (e.g. “cat” or “dog”?) • transfer learning: training received by a CNN for solving one task can be re-used to solve another related task • e.g. training from cat/dog classifier can be re-used to classify traffic symbols • benchmark CNN models, pre-trained on millions of images for classification tasks (ImageNet challenge) are readily available • e.g. VGG, ResNet, DenseNet, etc. Image: https://towardsdatascience.com/covolutional-neural-network-cb0883dd6529
  • 7. Proposed Approach Introduction User Study Scanpath Image Classification Interpretability Conclusion Image: https://dev.to/frosnerd/handwritten-digit-recognition-using-convolutional-neural-networks-11g0 Scanpath - Image CNN Image Classifier User Perceived Relevant? Prediction Eye movement Scanpath
  • 8. EYE-TRACKING USER STUDY Introduction User Study Scanpath Image Classification Interpretability Conclusion
  • 9. Experimental Design Introduction User Study Scanpath Image Classification Interpretability Conclusion • Participants (N = 25, college-age students) Example Trigger Q: The submarine Kursk was part of which Russian fleet? Perceived Relevant Perceived Irrelevant Trigger Question TREC 2005 Q&A Task Spacebar Relevance Judgement (binary) Y/N then Spacebar + 1s Short News Article AQUAINT Corpus of English News Text + Fixation >= 2s
  • 10. GENERATING SCANPATH-IMAGES Introduction User Study Scanpath Image Classification Interpretability Conclusion
  • 11. Generating Scanpath-Images Introduction User Study Scanpath Image Classification Interpretability Conclusion SCANPATH - IMAGESCANPATH Encode three attributes of eye fixations: 1. fixation location 2. fixation duration 3. fixation start time, for temporal ordering
  • 12. Generating Scanpath-Images: Fixation Duration Introduction User Study Scanpath Image Classification Interpretability Conclusion
  • 13. Generating Scanpath-Images: Fixation Start Time Introduction User Study Scanpath Image Classification Interpretability Conclusion First Saccade Last Saccade Matplotlib’s winter colourmap • each linearized saccade has a solid colour Saccade Colour
  • 14. Scanpath-Images Introduction User Study Scanpath Image Classification Interpretability Conclusion PERCEIVED RELEVANT PERCEIVED IRRELEVANT
  • 15. SCANPATH-IMAGE CLASSIFICATION Introduction User Study Scanpath Image Classification Interpretability Conclusion
  • 16. Scanpath-Image Classification Introduction User Study Scanpath Image Classification Interpretability Conclusion Given only the scanpath-image of a user’s eye movements on the news article, predict if the user perceived the article to be relevant for answering the trigger question. Image: https://dev.to/frosnerd/handwritten-digit-recognition-using-convolutional-neural-networks-11g0 Scanpath - Image CNN Image Classifier Perceived Relevance Prediction
  • 17. Scanpath-Image Classification: Neural Network Architecture Introduction User Study Scanpath Image Classification Interpretability Conclusion Final hyperparameters: epochs: 6, batch-size: 16, momentum: 0.9 Shallow(er) models: VGG16, VGG19 Really Deep Models: ResNet50 DenseNet121, DenseNet201 InceptionResNetV2 Optimizer: Stochastic Gradient Descent (SGD) with momentum Pre-trained CNN model (ImageNet Weights) Fully Connected Layer (256 nodes, ReLU, with/without L1L2) Dropout (prob = 0.2) Output Layer (1 node, Sigmoid)
  • 18. Scanpath-Image Classification: Results Introduction User Study Scanpath Image Classification Interpretability Conclusion For this specific task: • Models do not overfit • Shallow models classify better than deep models Shallow Deep Table 1 from paper
  • 19. CNN PREDICTION INTERPRETABILITY Introduction User Study Scanpath Image Classification Interpretability Conclusion
  • 20. Attempt to Interpret CNN Predictions Introduction User Study Scanpath Image Classification Interpretability Conclusion Gradient-Weighted Class Activation Mapping (Grad-CAM) Original Image CAM for “Cat” classCAM for “Dog” class 2017 IEEE International Conference on Computer Vision
  • 21. Attempt to Interpret CNN Predictions Introduction User Study Scanpath Image Classification Interpretability Conclusion SCANPATH CLASS ACTIVATION MAP (CAM) AVERAGE CAM Across all scanpath-images in this relevance class Perceived Irrelevant Perceived Relevant
  • 22. CONCLUSION Introduction User Study Scanpath Image Classification Interpretability Conclusion
  • 23. Conclusion Limitations: • very simple information search task • short texts of similar type • relatively uniform group of participants (college-age students) Future Directions: • complex scenarios, e.g., freely searching on the open web • diverse participants, e.g., young vs. older adults • Eye-movement scanpath-image classification: • no aggregate measures: all eye-tracking data is used • spatio-temporal aspects of eye-movements are preserved • knowledge of screen content not needed • additional insights (e.g. reading / scanning) not needed • Proof of concept: • promising results, even with small dataset, without overfitting • CNNs trained for a different task can detect patterns in eye-movements which are concordant with prior literature Introduction User Study Scanpath Image Classification Interpretability Conclusion
  • 24. Acknowledgements Student Travel Grant Experimental Design Contribution, Data Collection Prof. Bradley Hatfield Dr. Rodolphe Gentili Dr. Joe Dien Hyuk Oh Kyle James Jaquess Li-Chuan Lo Department of Kinesiology, University of Maryland, College Park For inspiration: Blog post on using mouse trajectories for fraud detection Gleb Esman Splunk Inc. THANK YOU @NilavraBnilavra@ieee.org ixlab.ischool.utexas.edu Full paper: https://dl.acm.org/doi/10.1145/3343413.3377960 https://arxiv.org/abs/2001.05152