09/05/14 pag. 1
Information visualization lecture 8
visual analytics
Katrien Verbert
Department of Computer Science
Faculty of Science
Vrije Universiteit Brussel
katrien.verbert@vub.ac.be
09/05/14 pag. 2
Motivation
http://bigdatablog.emc.com/2013/08/08/mlb-a-big-fan-of-big-data/
09/05/14 pag. 3
Motivation
•  Volume: Gigabyte (10^9), Terabyte (10^12), Petabyte (10^15), Exabyte (10^18), Zettabyte (10^21)
•  Variety: 
–  structured, semi-structured, unstructured; 
–  text, image, audio, video, record
•  Velocity: dynamic, sometimes time-varying
Big Data refers to datasets that grow so large that it is difficult to capture, store, manage, share, analyze and visualize with the typical database software tools.
09/05/14 pag. 4
The social layer in an interconnected world
•  2+ billion people on the Web by end 2011
•  30 billion RFID tags today (1.3B in 2005)
•  4.6 billion camera phones worldwide
•  100s of millions of GPS-enabled devices sold annually
•  76 million smart meters in 2009… 200M by 2014
•  12+ TBs of tweet data every day
•  25+ TBs of log data every day
•  ? TBs of data every day
09/05/14 pag. 5
Motivation
Raw data has no value in itself, only the extracted information has value.
Src: http://www.sas.com/knowledge-exchange/business-analytics/featured/big-data-drives-performance-%E2%80%93-if-you-can-exploit-the-information-overload/index.html
09/05/14 pag. 6
Information overload
•  Refers to the danger of getting lost in data, which may be:
–  irrelevant to the current task at hand,
–  processed in an inappropriate way, or
–  presented in an inappropriate way.
Src: http://supermarketpeople.co.uk/archives/713
09/05/14 pag. 7
Visual Analytics - Mastering the Information Age
https://www.youtube.com/watch?v=5i3xbitEVfs
09/05/14 pag. 8
Definition
Visual analytics is the science of analytical reasoning facilitated by interactive visual interfaces.
09/05/14 pag. 9
To whom does it matter?
•  Research community
•  Business community: new tools, new capabilities, new infrastructure, new business models, etc.
Financial Services...
09/05/14 pag. 10
Visual analytics is a multidisciplinary field
09/05/14 pag. 11
History
09/05/14 pag. 12
History
discovery tools
•  Shneiderman (IV ‘02) suggests combining computational analysis approaches such as data mining with information visualization
–  Too often viewed as competitors in the past
–  Instead, they can complement each other
–  Each has something valuable to contribute
•  Alternatives
–  Issues influencing the design of discovery tools:
•  Statistical algorithms vs. visual data presentation
•  Hypothesis testing vs. exploratory data analysis
–  Each has pros and cons
09/05/14 pag. 13
Hypothesis testing & exploratory data analysis
•  Hypothesis testing
–  Advocates:
•  Stating hypotheses up front limits the variables, sharpens thinking, and allows more precise measurement
–  Critics:
•  Too far from reality; initial hypotheses bias the analysis toward finding evidence that supports them
•  Exploratory data analysis
–  Advocates:
•  Interesting things are found this way, and we now have the computational capabilities to do it
–  Skeptics:
•  Not generalizable; everything is a special case; detecting statistical relationships does not imply cause and effect
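The contrast between the two styles can be made concrete with a small sketch. It is only an illustration on made-up data: the column names (`ads`, `sales`, `noise`), the helper names, and the choice of a permutation test are assumptions, not something the slides prescribe. The exploratory step ranks every pair of columns by correlation strength; the confirmatory step tests one hypothesis stated up front.

```python
import itertools
import random

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)

def permutation_p_value(xs, ys, rounds=2000, seed=0):
    """Confirmatory step: how often does shuffled data correlate as strongly?"""
    rng = random.Random(seed)
    observed = abs(pearson(xs, ys))
    shuffled = list(ys)
    hits = 0
    for _ in range(rounds):
        rng.shuffle(shuffled)
        if abs(pearson(xs, shuffled)) >= observed:
            hits += 1
    return hits / rounds

def explore(columns):
    """Exploratory step: rank every pair of columns by |correlation|."""
    pairs = itertools.combinations(sorted(columns), 2)
    scored = [(abs(pearson(columns[a], columns[b])), a, b) for a, b in pairs]
    return sorted(scored, reverse=True)

# Hypothetical toy data: 'sales' tracks 'ads'; 'noise' is unrelated.
rng = random.Random(1)
ads = [float(i) for i in range(30)]
columns = {
    "ads": ads,
    "sales": [2.0 * a + rng.gauss(0, 3) for a in ads],
    "noise": [rng.gauss(0, 1) for _ in ads],
}

ranking = explore(columns)                                        # exploratory
p_value = permutation_p_value(columns["ads"], columns["sales"])   # confirmatory
```

Neither step says anything about causation: a strong correlation or a small p-value only indicates a statistical relationship, which is the skeptics' point.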
09/05/14 pag. 14
Recommendations
•  Integrate data mining and information visualization
•  Allow users to specify what they are seeking
•  Recognize that users are situated in a social context
•  Respect human responsibility
09/05/14 pag. 15
History
•  Information visualization systems inadequately supported decision making (Amar & Stasko IV ‘04):
–  Limited affordances
–  Predetermined representations
–  Decline of determinism in decision-making
•  “Representational primacy” versus “Analytic primacy”
–  Telling the truth about the data vs. providing analytically useful visualizations
09/05/14 pag. 16
Representational primacy
•  Pursuit of faithful data replication and comprehension
•  Above all else, we must show the data
•  Can be a limiting notion
•  May focus on low level tasks not relevant to users
•  Some data is hard to represent faithfully (think high
dimensional data)
09/05/14 pag. 17
Gaps between representation and analysis
•  When explanatory or correlative models are the desired outcome, model visualization may be more important than data visualization
•  One extreme is to build black boxes where data goes in and the answer comes out
•  Not a good idea to trust black boxes
–  What can go wrong with this?
–  A white box approach is better.
09/05/14 pag. 18
Application of Precepts
Worldview-based tasks (“Did we show the right thing to the user?”)
–  Determine Domain Parameters
–  Expose Multivariate Explanation
–  Facilitate Hypothesis Testing
Rationale-based tasks (“Will the user believe what she sees?”)
–  Expose Uncertainty
–  Concretize Relationships
–  Expose Cause and Effect
Amar & Stasko 2005
09/05/14 pag. 19
Task level emphases
•  Don’t just help “low-level” tasks
–  Find, filter, correlate, etc.
•  Facilitate analytical thinking
–  Complex decision-making, especially under uncertainty
–  Learning a domain
–  Identifying the nature of trends
–  Predicting the future
09/05/14 pag. 20
History: coining the term
•  2003-04 Jim Thomas of PNNL, together with colleagues, develops
notion of visual analytics
–  Holds workshops at PNNL and at InfoVis ‘04 to help define a research agenda
•  Agenda is formalized in book, Illuminating the Path
–  Available online, http://nvac.pnl.gov/
•  People use visual analytics tools and techniques to
–  Synthesize information and derive insight from massive, dynamic, ambiguous, and
often conflicting data
–  Detect the expected and discover the unexpected
–  Provide timely, defensible, and understandable assessments
–  Communicate assessment effectively for action.
Src: Stasko
09/05/14 pag. 21
Visual Analytics
•  Not really an “area” per se
–  More of an “umbrella” notion
–  Combines multiple areas or disciplines
–  Ultimately about using data to improve our knowledge and help make
decisions
•  Main Components:
Src: Stasko
09/05/14 pag. 22
Visual Analytics: alternate definition
Visual analytics combines automated analysis techniques with
interactive visualizations for an effective understanding,
reasoning and decision making on the basis of very large and
complex data sets
Keim et al, chapter in Information Visualization: Human-Centered Issues
and Perspectives, 2008
09/05/14 pag. 23
In-Spire Video
https://www.youtube.com/watch?v=YONTBZaxz8g
09/05/14 pag. 24
Synergy
•  Combine strengths of both human and electronic data
processing
–  Gives a semi-automated analytical process
–  Use strengths from each
–  Below from Keim, 2008
09/05/14 pag. 25
InfoVis Comparison
•  Clearly much overlap
•  Perhaps fair to say that infovis hasn’t always focused on analysis tasks so much and that it doesn’t always include advanced data analysis algorithms
–  Not a criticism, just not its focus
–  InfoVis has a narrower scope
Src: Stasko
09/05/14 pag. 26
Visual Analytics
•  Encompassing, integrated approach to data analysis
–  Use computational algorithms where helpful
–  Use human-directed visual exploration where helpful
–  Not just “Apply A, then apply B” though
–  Integrate the two tightly
Stasko, 2013
Src: Stasko
09/05/14 pag. 27
Visual analytics aims at making data and
information processing transparent
Just as information visualization has changed our view on databases, the goal of visual analytics is to make our way of processing data and information transparent for an analytic discourse.
09/05/14 pag. 28
The visual analytics process
The visual analytics process is characterized through interaction between data, visualizations, models about the data, and the users in order to discover knowledge.
09/05/14 pag. 29
Information seeking mantra for visual
analytics applications
Information seeking mantra (Shneiderman): Overview first, zoom/filter, details on demand
Visual analytics mantra: Analyze first, show the important, zoom/filter, analyze further, details on demand.
Keim et al. 2006
09/05/14 pag. 30
Applications
09/05/14 pag. 31
09/05/14 pag. 32
UTOPIAN: user-driven topic modeling
https://www.youtube.com/watch?v=du6_s6hcaRA&feature=youtu.be
09/05/14 pag. 33
SketchPadN-D: WYDIWYG Sculpting and Editing in High-Dimensional Space
https://www.youtube.com/watch?v=ar8kAdAfx6w
09/05/14 pag. 34
Some features
09/05/14 pag. 35
Relationship discovery
Scale independent representations, whole and parts at same time at
multiple levels of abstraction, often linked
09/05/14 pag. 36
Relationship discovery
Explore high dimensional relationships, theme groupings, outlier
detection, searching by proximity at multiple scales
09/05/14 pag. 37
Combined exploratory and confirmatory analytics
•  Develop and refine hypotheses
•  Evidence collection, management, and matching to hypotheses
•  Tailor views/displays to the theme or hypothesis of interest
•  Often suggestive of predictions, enabling proactive thinking
09/05/14 pag. 38
Multiple data types
•  Supports multiple data types: structured/unstructured text, imagery/video, cyber
•  Systems are either data-type specific or application specific
09/05/14 pag. 39
Temporal views and interactions
•  Most analytics situations involve time, pace, velocity
•  Group segments of thoughts by time
•  Compare time segments
•  Often combined with geospatial
09/05/14 pag. 40
Reasoning workspace
Stu Card, PARC
09/05/14 pag. 41
Grouping and outlier detection
•  Form groups of thought/data
•  Labels and annotation
•  Compare groupings
•  Find small groups or outliers
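One simple way to form groups and find small groups or outliers is to split sorted values wherever the gap to the next value is large. The sketch below assumes one-dimensional data and a hand-picked gap threshold; both the threshold and the function names are illustrative assumptions, and real systems typically use richer clustering methods.

```python
def group_by_gap(values, max_gap):
    """Split sorted values into groups wherever neighbours differ by more than max_gap."""
    groups = []
    for v in sorted(values):
        if groups and v - groups[-1][-1] <= max_gap:
            groups[-1].append(v)
        else:
            groups.append([v])
    return groups

def small_groups(groups, max_size=1):
    """Treat groups with at most max_size members as candidate outliers."""
    return [g for g in groups if len(g) <= max_size]

# Hypothetical measurements with two dense groups and one isolated value.
values = [1.0, 1.2, 0.9, 5.1, 5.0, 4.8, 12.5]
groups = group_by_gap(values, max_gap=1.0)
outliers = small_groups(groups)
```

The groups can then be labeled, annotated, and compared in the interface; whether a small group is a true outlier remains a judgment for the analyst.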
09/05/14 pag. 42
Labeling
•  Critically important
•  Dynamic in scope, number of labels, size, color
•  Positioning
•  Almost everything has labels
•  Labels convey semantic meaning
09/05/14 pag. 43
Multiple linked views
Temporal, geospatial, theme, cluster, list views with association linkages
between views
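The association linkage between views is often implemented as a shared selection that every view observes. The following sketch shows that idea only; the class names, the view names, and the item ids are hypothetical, and no rendering is involved.

```python
class Selection:
    """Shared selection model: views subscribe and are told when it changes."""

    def __init__(self):
        self._ids = set()
        self._listeners = []

    def subscribe(self, callback):
        self._listeners.append(callback)

    def set(self, ids):
        self._ids = set(ids)
        for callback in self._listeners:
            callback(self._ids)

class LinkedView:
    """Hypothetical view that highlights whichever of its items are selected."""

    def __init__(self, name, item_ids, selection):
        self.name = name
        self.item_ids = set(item_ids)
        self.highlighted = set()
        selection.subscribe(self.on_selection)

    def on_selection(self, ids):
        self.highlighted = self.item_ids & ids

selection = Selection()
timeline = LinkedView("timeline", {"e1", "e2", "e3"}, selection)
map_view = LinkedView("map", {"e2", "e3", "e4"}, selection)

# Brushing items e2 and e4 in any view updates the highlights in all views.
selection.set({"e2", "e4"})
```

Keeping the selection in one shared object means a new view type (cluster, list, theme) only has to subscribe; existing views need no changes.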
09/05/14 pag. 44
Challenges
09/05/14 pag. 45
Challenge 1: scalability
•  Challenge with regard to both:
–  visual representations
–  automatic analysis
•  Requirements/needs:
–  Solution needs to scale in size, dimensionality, data types, levels of quality.
–  Methods are needed to deal with input data that is noisy and continuous.
–  Patterns and relationships need to be visualized on different levels of detail, and with appropriate levels of data and visual abstraction.
09/05/14 pag. 46
Challenge 2: uncertainty
•  Challenge:
–  large amount of noise and missing values from heterogeneous data sources
–  bias introduced by automatic analysis methods as well as human perception
•  Requirements/needs:
–  representation of the notion of data quality and the confidence
–  analysts need to be aware of the uncertainty and be able to analyze quality
09/05/14 pag. 47
Challenge 3: text data stream
•  Analysis and visualization of text streams is still a relatively new field.
•  Text stream data often
–  have little structure, grammar and context,
–  are provided as a high-frequency multilingual stream,
–  contain a high percentage of non-meaningful and irrelevant messages.
•  The challenge is to handle the dynamic nature of the stream data and the low tolerance for monitoring delay in emergency cases.
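To keep up with a dynamic stream, term statistics are usually maintained over a sliding window rather than recomputed from scratch. The sketch below illustrates this under strong simplifications: whitespace tokenization, a tiny made-up stopword list, and a window counted in messages rather than in time. All names are hypothetical.

```python
from collections import Counter, deque

STOPWORDS = {"the", "a", "is", "to", "and", "of", "in"}  # hypothetical, tiny list

class SlidingTermCounter:
    """Keep term counts for only the most recent `window` messages."""

    def __init__(self, window):
        self.messages = deque()
        self.window = window
        self.counts = Counter()

    def add(self, message):
        terms = [t for t in message.lower().split() if t not in STOPWORDS]
        if not terms:          # drop messages with no meaningful terms
            return
        self.messages.append(terms)
        self.counts.update(terms)
        if len(self.messages) > self.window:
            expired = self.messages.popleft()
            self.counts.subtract(expired)
            self.counts += Counter()   # discard zero and negative counts

    def top(self, k):
        return self.counts.most_common(k)

counter = SlidingTermCounter(window=2)
for msg in ["Flood in the city", "the a of",
            "flood warning issued", "Power outage reported"]:
    counter.add(msg)
```

Each update touches only the incoming and the expired message, which keeps the monitoring delay low; multilingual input would need language-specific tokenization and stopword lists.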
09/05/14 pag. 48
Challenge 4: interaction
•  Novel interaction techniques and tangible user interfaces for seamless intuitive visual communication with the system.
•  The analyst should be able to
–  fully focus on the task at hand and
–  not be distracted by overly technical or complex user interfaces and interactions.
•  User feedback should be taken as intelligently as possible, requiring as little user input as possible.
09/05/14 pag. 49
Challenge 5: evaluation
•  Challenge:
–  hard to assess the quality of visual analytics solutions due to the interdisciplinary nature and complex process
•  Needs:
–  evaluation framework to assess effectiveness, efficiency and acceptance of new visual analytics techniques, methods, and models
09/05/14 pag. 50
Challenge 6: infrastructure
•  Challenge:
–  most solutions develop their own infrastructures
–  mismatch between the level of service provided and real needs:
•  fast and precise answers with progressive refinement,
•  incremental re-computation,
•  steering the computation towards data regions that are of interest.
•  Requirements/needs:
–  infrastructure to bind together all the processes, functions, and services
–  repositories of available visual analytics solutions
09/05/14 pag. 51
http://www.visual-analytics.eu/
09/05/14 pag. 52
Questions?
09/05/14 pag. 53
Readings
•  Ch. 1-2
http://www.vismaster.eu/wp-content/uploads/2010/11/VisMaster-book-lowres.pdf
09/05/14 pag. 54
References
•  Amar, R. A., & Stasko, J. T. (2005). Knowledge precepts for design and evaluation of information visualizations. IEEE Transactions on Visualization and Computer Graphics, 11(4), 432-442.
•  Keim, D. A., Mansmann, F., Schneidewind, J., & Ziegler, H. (2006). Challenges in visual data analysis. In Information Visualization (IV 2006), Invited Paper, July 5-7, London, United Kingdom. IEEE Press.
•  Keim, D., Andrienko, G., Fekete, J.-D., Görg, C., Kohlhammer, J., & Melancon, G. (2008). Visual Analytics: Definition, Process, and Challenges. In A. Kerren, J. T. Stasko, J.-D. Fekete, & C. North (Eds.), Information Visualization. Lecture Notes in Computer Science, Vol. 4950. Springer-Verlag, Berlin, Heidelberg, 154-175.
•  Shneiderman, B. (2002). Inventing discovery tools: combining information visualization with data mining. Information Visualization, 1(1), 5-12.

More Related Content

What's hot

What's hot (20)

Tableau ppt
Tableau pptTableau ppt
Tableau ppt
 
OLAP OnLine Analytical Processing
OLAP OnLine Analytical ProcessingOLAP OnLine Analytical Processing
OLAP OnLine Analytical Processing
 
Introduction to Data Visualization
Introduction to Data VisualizationIntroduction to Data Visualization
Introduction to Data Visualization
 
Big data lecture notes
Big data lecture notesBig data lecture notes
Big data lecture notes
 
Oltp vs olap
Oltp vs olapOltp vs olap
Oltp vs olap
 
Data visualisation & analytics with Tableau
Data visualisation & analytics with Tableau Data visualisation & analytics with Tableau
Data visualisation & analytics with Tableau
 
Tableau
TableauTableau
Tableau
 
Business analytics
Business analyticsBusiness analytics
Business analytics
 
Data Visualization
Data VisualizationData Visualization
Data Visualization
 
Data Visualization
Data VisualizationData Visualization
Data Visualization
 
DATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MININGDATA WAREHOUSING AND DATA MINING
DATA WAREHOUSING AND DATA MINING
 
3 pillars of big data : structured data, semi structured data and unstructure...
3 pillars of big data : structured data, semi structured data and unstructure...3 pillars of big data : structured data, semi structured data and unstructure...
3 pillars of big data : structured data, semi structured data and unstructure...
 
OLAP
OLAPOLAP
OLAP
 
Tableau Presentation
Tableau PresentationTableau Presentation
Tableau Presentation
 
Database Basics
Database BasicsDatabase Basics
Database Basics
 
3 data visualization
3 data visualization3 data visualization
3 data visualization
 
Tableau data types
Tableau   data typesTableau   data types
Tableau data types
 
Data Visualization in Exploratory Data Analysis
Data Visualization in Exploratory Data AnalysisData Visualization in Exploratory Data Analysis
Data Visualization in Exploratory Data Analysis
 
Data Visualization.pptx
Data Visualization.pptxData Visualization.pptx
Data Visualization.pptx
 
Data analytics vs. Data analysis
Data analytics vs. Data analysisData analytics vs. Data analysis
Data analytics vs. Data analysis
 

Similar to Visual analytics

poem_presentation_v5_linkedIn_version
poem_presentation_v5_linkedIn_versionpoem_presentation_v5_linkedIn_version
poem_presentation_v5_linkedIn_version
Iliada Eleftheriou
 
The Story of the Semantic Grid
The Story of the Semantic GridThe Story of the Semantic Grid
The Story of the Semantic Grid
butest
 
tableau material, to creat a good and wonderful presentation
tableau material, to creat a good and wonderful presentationtableau material, to creat a good and wonderful presentation
tableau material, to creat a good and wonderful presentation
IruolagbePius
 
Data analytics to support awareness and recommendation
Data analytics to support awareness and recommendationData analytics to support awareness and recommendation
Data analytics to support awareness and recommendation
Katrien Verbert
 
Supporting Emergence: Interaction Design for Visual Analytics Approach to ESDA
Supporting Emergence: Interaction Design for Visual Analytics Approach to ESDASupporting Emergence: Interaction Design for Visual Analytics Approach to ESDA
Supporting Emergence: Interaction Design for Visual Analytics Approach to ESDA
Jesse Lingeman
 

Similar to Visual analytics (20)

BigData Visualization and Usecase@TDGA-Stelligence-11july2019-share
BigData Visualization and Usecase@TDGA-Stelligence-11july2019-shareBigData Visualization and Usecase@TDGA-Stelligence-11july2019-share
BigData Visualization and Usecase@TDGA-Stelligence-11july2019-share
 
poem_presentation_v5_linkedIn_version
poem_presentation_v5_linkedIn_versionpoem_presentation_v5_linkedIn_version
poem_presentation_v5_linkedIn_version
 
Scalable Learning Analytics and Interoperability – an assessment of potential...
Scalable Learning Analytics and Interoperability – an assessment of potential...Scalable Learning Analytics and Interoperability – an assessment of potential...
Scalable Learning Analytics and Interoperability – an assessment of potential...
 
Exploratory Analysis of User Data
Exploratory Analysis of User DataExploratory Analysis of User Data
Exploratory Analysis of User Data
 
Introduction to the FP7 CODE project @ BDBC
Introduction to the FP7 CODE project @ BDBCIntroduction to the FP7 CODE project @ BDBC
Introduction to the FP7 CODE project @ BDBC
 
FAIR data_ Superior data visibility and reuse without warehousing.pdf
FAIR data_ Superior data visibility and reuse without warehousing.pdfFAIR data_ Superior data visibility and reuse without warehousing.pdf
FAIR data_ Superior data visibility and reuse without warehousing.pdf
 
The Story of the Semantic Grid
The Story of the Semantic GridThe Story of the Semantic Grid
The Story of the Semantic Grid
 
Data Science Lecture: Overview and Information Collateral
Data Science Lecture: Overview and Information CollateralData Science Lecture: Overview and Information Collateral
Data Science Lecture: Overview and Information Collateral
 
Bottoms up: building a service on a solid foundation of user needs
Bottoms up: building a service on a solid foundation of user needsBottoms up: building a service on a solid foundation of user needs
Bottoms up: building a service on a solid foundation of user needs
 
Data visualisations as a gateway to programming
Data visualisations as a gateway to programmingData visualisations as a gateway to programming
Data visualisations as a gateway to programming
 
Challenges in Analytics for BIG Data
Challenges in Analytics for BIG DataChallenges in Analytics for BIG Data
Challenges in Analytics for BIG Data
 
tableau material, to creat a good and wonderful presentation
tableau material, to creat a good and wonderful presentationtableau material, to creat a good and wonderful presentation
tableau material, to creat a good and wonderful presentation
 
Data analytics to support awareness and recommendation
Data analytics to support awareness and recommendationData analytics to support awareness and recommendation
Data analytics to support awareness and recommendation
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
 
DMP lessons from Europe
DMP lessons from EuropeDMP lessons from Europe
DMP lessons from Europe
 
Challenges of Big Data Research
Challenges of Big Data ResearchChallenges of Big Data Research
Challenges of Big Data Research
 
Combining analytics and user research
Combining analytics and user researchCombining analytics and user research
Combining analytics and user research
 
Supporting Emergence: Interaction Design for Visual Analytics Approach to ESDA
Supporting Emergence: Interaction Design for Visual Analytics Approach to ESDASupporting Emergence: Interaction Design for Visual Analytics Approach to ESDA
Supporting Emergence: Interaction Design for Visual Analytics Approach to ESDA
 
How to Prepare for a Career in Data Science
How to Prepare for a Career in Data ScienceHow to Prepare for a Career in Data Science
How to Prepare for a Career in Data Science
 
data mining
data mining data mining
data mining
 

More from Katrien Verbert

Human-centered AI: towards the next generation of interactive and adaptive ex...
Human-centered AI: towards the next generation of interactive and adaptive ex...Human-centered AI: towards the next generation of interactive and adaptive ex...
Human-centered AI: towards the next generation of interactive and adaptive ex...
Katrien Verbert
 
Explaining and Exploring Job Recommendations: a User-driven Approach for Inte...
Explaining and Exploring Job Recommendations: a User-driven Approach for Inte...Explaining and Exploring Job Recommendations: a User-driven Approach for Inte...
Explaining and Exploring Job Recommendations: a User-driven Approach for Inte...
Katrien Verbert
 

More from Katrien Verbert (20)

Explainability methods
Explainability methodsExplainability methods
Explainability methods
 
Human-centered AI: how can we support end-users to interact with AI?
Human-centered AI: how can we support end-users to interact with AI?Human-centered AI: how can we support end-users to interact with AI?
Human-centered AI: how can we support end-users to interact with AI?
 
Human-centered AI: how can we support end-users to interact with AI?
Human-centered AI: how can we support end-users to interact with AI?Human-centered AI: how can we support end-users to interact with AI?
Human-centered AI: how can we support end-users to interact with AI?
 
Human-centered AI: how can we support lay users to understand AI?
Human-centered AI: how can we support lay users to understand AI?Human-centered AI: how can we support lay users to understand AI?
Human-centered AI: how can we support lay users to understand AI?
 
Explaining job recommendations: a human-centred perspective
Explaining job recommendations: a human-centred perspectiveExplaining job recommendations: a human-centred perspective
Explaining job recommendations: a human-centred perspective
 
Explaining recommendations: design implications and lessons learned
Explaining recommendations: design implications and lessons learnedExplaining recommendations: design implications and lessons learned
Explaining recommendations: design implications and lessons learned
 
Designing Learning Analytics Dashboards: Lessons Learned
Designing Learning Analytics Dashboards: Lessons LearnedDesigning Learning Analytics Dashboards: Lessons Learned
Designing Learning Analytics Dashboards: Lessons Learned
 
Human-centered AI: towards the next generation of interactive and adaptive ex...
Human-centered AI: towards the next generation of interactive and adaptive ex...Human-centered AI: towards the next generation of interactive and adaptive ex...
Human-centered AI: towards the next generation of interactive and adaptive ex...
 
Explainable AI for non-expert users
Explainable AI for non-expert usersExplainable AI for non-expert users
Explainable AI for non-expert users
 
Towards the next generation of interactive and adaptive explanation methods
Towards the next generation of interactive and adaptive explanation methodsTowards the next generation of interactive and adaptive explanation methods
Towards the next generation of interactive and adaptive explanation methods
 
Personalized food recommendations: combining recommendation, visualization an...
Personalized food recommendations: combining recommendation, visualization an...Personalized food recommendations: combining recommendation, visualization an...
Personalized food recommendations: combining recommendation, visualization an...
 
Explaining and Exploring Job Recommendations: a User-driven Approach for Inte...
Explaining and Exploring Job Recommendations: a User-driven Approach for Inte...Explaining and Exploring Job Recommendations: a User-driven Approach for Inte...
Explaining and Exploring Job Recommendations: a User-driven Approach for Inte...
 
Learning analytics for feedback at scale
Learning analytics for feedback at scaleLearning analytics for feedback at scale
Learning analytics for feedback at scale
 
Interactive recommender systems and dashboards for learning
Interactive recommender systems and dashboards for learningInteractive recommender systems and dashboards for learning
Interactive recommender systems and dashboards for learning
 
Interactive recommender systems: opening up the “black box”
Interactive recommender systems: opening up the “black box”Interactive recommender systems: opening up the “black box”
Interactive recommender systems: opening up the “black box”
 
Interactive Recommender Systems
Interactive Recommender SystemsInteractive Recommender Systems
Interactive Recommender Systems
 
Web Information Systems Lecture 2: HTML
Web Information Systems Lecture 2: HTMLWeb Information Systems Lecture 2: HTML
Web Information Systems Lecture 2: HTML
 
Information Visualisation: perception and principles
Information Visualisation: perception and principlesInformation Visualisation: perception and principles
Information Visualisation: perception and principles
 
Web Information Systems Lecture 1: Introduction
Web Information Systems Lecture 1: IntroductionWeb Information Systems Lecture 1: Introduction
Web Information Systems Lecture 1: Introduction
 
Information Visualisation: Introduction
Information Visualisation: IntroductionInformation Visualisation: Introduction
Information Visualisation: Introduction
 

Recently uploaded

Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
kauryashika82
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 

Recently uploaded (20)

ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 

Visual analytics

  • 1. 09/05/14 pag. 1 Information visualization lecture 8 visual analytics Katrien Verbert Department of Computer Science Faculty of Science Vrije Universiteit Brussel katrien.verbert@vub.ac.be
  • 3. Motivation • Volume: Gigabyte (10^9), Terabyte (10^12), Petabyte (10^15), Exabyte (10^18), Zettabyte (10^21) • Variety: – structured, semi-structured, unstructured; – text, image, audio, video, record • Velocity: dynamic, sometimes time-varying Big Data refers to datasets that grow so large that it is difficult to capture, store, manage, share, analyze and visualize them with typical database software tools.
  • 4. The social layer in an interconnected world: 2+ billion people on the Web by end of 2011; 30 billion RFID tags today (1.3B in 2005); 4.6 billion camera phones worldwide; 100s of millions of GPS-enabled devices sold annually; 76 million smart meters in 2009, 200M by 2014; 12+ TBs of tweet data every day; 25+ TBs of log data every day; ? TBs of data every day
  • 5. Motivation Raw data has no value in itself; only the extracted information has value. Src: http://www.sas.com/knowledge-exchange/business-analytics/featured/big-data-drives-performance-%E2%80%93-if-you-can-exploit-the-information-overload/index.html
  • 6. Information overload • Refers to the danger of getting lost in data, which may be: – irrelevant to the current task at hand, – processed in an inappropriate way, or – presented in an inappropriate way. Src: http://supermarketpeople.co.uk/archives/713
  • 7. Visual Analytics - Mastering the Information Age https://www.youtube.com/watch?v=5i3xbitEVfs
  • 8. Definition: Visual analytics is the science of analytical reasoning facilitated by interactive visual interfaces.
  • 9. To whom does it matter • Research community • Business community - New tools, new capabilities, new infrastructure, new business models etc. Financial Services...
  • 10. Visual analytics is a multidisciplinary field
  • 12. History: discovery tools • Shneiderman (IV ‘02) suggests combining computational analysis approaches such as data mining with information visualization – Too often viewed as competitors in the past – Instead, they can complement each other – Each has something valuable to contribute • Alternatives – Issues influencing the design of discovery tools: • Statistical algorithms vs. visual data presentation • Hypothesis testing vs. exploratory data analysis – Each has pros and cons
  • 13. Hypothesis testing & exploratory data analysis • Hypothesis testing – Advocates: • Stating hypotheses up front limits variables, sharpens thinking, and leads to more precise measurement – Critics: • Too far from reality; initial hypotheses bias the analysis toward finding evidence that supports them • Exploratory data analysis – Advocates: • Interesting things are found this way, and we now have the computational capabilities to do it – Skeptics: • Not generalizable, everything is a special case; detecting statistical relationships does not imply cause and effect
  • 14. Recommendations • Integrate data mining and information visualization • Allow users to specify what they are seeking • Recognize that users are situated in a social context • Respect human responsibility
  • 15. History • Information visualization systems inadequately supported decision making (Amar & Stasko IV ‘04): – Limited affordances – Predetermined representations – Decline of determinism in decision-making • “Representational primacy” versus “analytic primacy” – Telling the truth about data vs. providing analytically useful visualizations
  • 16. Representational primacy • Pursuit of faithful data replication and comprehension • Above all else, we must show the data • Can be a limiting notion • May focus on low-level tasks not relevant to users • Some data is hard to represent faithfully (think high-dimensional data)
  • 17. Gaps between representation and analysis • When explanatory or correlative models are the desired outcome, model visualization may be more important than data visualization • One extreme is to build black boxes where data goes in and the answer comes out • Not a good idea to trust black boxes – What can go wrong with this? – A white-box approach is better.
  • 18. Application of Precepts Worldview-based tasks (“Did we show the right thing to the user?”) – Determine Domain Parameters – Expose Multivariate Explanation – Facilitate Hypothesis Testing Rationale-based tasks (“Will the user believe what she sees?”) – Expose Uncertainty – Concretize Relationships – Expose Cause and Effect Amar & Stasko 2005
  • 19. Task level emphases • Don’t just help “low-level” tasks – Find, filter, correlate, etc. • Facilitate analytical thinking – Complex decision-making, especially under uncertainty – Learning a domain – Identifying the nature of trends – Predicting the future
  • 20. History: coining the term • 2003-04: Jim Thomas of PNNL, together with colleagues, develops the notion of visual analytics – Holds workshops at PNNL and at InfoVis ‘04 to help define a research agenda • Agenda is formalized in the book Illuminating the Path – Available online, http://nvac.pnl.gov/ • People use visual analytics tools and techniques to – Synthesize information and derive insight from massive, dynamic, ambiguous, and often conflicting data – Detect the expected and discover the unexpected – Provide timely, defensible, and understandable assessments – Communicate assessments effectively for action. Src: Stasko
  • 21. Visual Analytics • Not really an “area” per se – More of an “umbrella” notion – Combines multiple areas or disciplines – Ultimately about using data to improve our knowledge and help make decisions • Main Components: Src: Stasko
  • 22. Visual Analytics: alternate definition “Visual analytics combines automated analysis techniques with interactive visualizations for an effective understanding, reasoning and decision making on the basis of very large and complex data sets.” Keim et al, chapter in Information Visualization: Human-Centered Issues and Perspectives, 2008
  • 23. In-Spire Video https://www.youtube.com/watch?v=YONTBZaxz8g
  • 24. Synergy • Combine strengths of both human and electronic data processing – Gives a semi-automated analytical process – Use strengths from each – Below from Keim, 2008
  • 25. InfoVis Comparison • Clearly much overlap • Perhaps fair to say that InfoVis hasn’t always focused so much on analysis tasks and that it doesn’t always include advanced data analysis algorithms – Not a criticism, just not its focus – InfoVis has a narrower scope Src: Stasko
  • 26. Visual Analytics • Encompassing, integrated approach to data analysis – Use computational algorithms where helpful – Use human-directed visual exploration where helpful – Not just “Apply A, then apply B” though – Integrate the two tightly Src: Stasko, 2013
  • 27. Visual analytics aims at making data and information processing transparent: just as information visualization has changed our view on databases, the goal of visual analytics is to make our way of processing data and information transparent for an analytic discourse.
  • 28. The visual analytics process The visual analytics process is characterized through interaction between data, visualizations, models about the data, and the users in order to discover knowledge.
  • 29. Information seeking mantra for visual analytics applications • Original mantra: Overview first, zoom/filter, details on demand • Visual analytics mantra: Analyze first, show the important, zoom/filter, analyze further, details on demand. Keim et al. 2006
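The visual analytics mantra on this slide can be read as a small pipeline. The following is a minimal Python sketch under stated assumptions, not code from Keim et al.: the record layout, the deviation-from-mean score used for "analyze", and all function names are hypothetical choices made for illustration.

```python
from statistics import mean, pstdev

# Hypothetical sample records; the fields are assumptions for illustration.
RECORDS = [
    {"id": 1, "region": "north", "value": 10},
    {"id": 2, "region": "north", "value": 12},
    {"id": 3, "region": "south", "value": 11},
    {"id": 4, "region": "south", "value": 95},
    {"id": 5, "region": "north", "value": 9},
]

def analyze(records):
    """Analyze first: score each record by its distance from the mean value."""
    if not records:
        return []
    values = [r["value"] for r in records]
    mu, sigma = mean(values), pstdev(values)
    return [
        dict(r, score=(abs(r["value"] - mu) / sigma) if sigma else 0.0)
        for r in records
    ]

def show_important(scored, top_k=3):
    """Show the important: keep only the top_k highest-scoring records."""
    return sorted(scored, key=lambda r: r["score"], reverse=True)[:top_k]

def zoom_filter(records, region):
    """Zoom/filter: restrict the data to one region chosen by the analyst."""
    return [r for r in records if r["region"] == region]

def details_on_demand(records, record_id):
    """Details on demand: return the full record for one id, or None."""
    return next((r for r in records if r["id"] == record_id), None)
```

A session would call `analyze` and `show_important` first, then `zoom_filter` on a region of interest, then `analyze` again on that subset ("analyze further"), and finally `details_on_demand` for a single record. In a real system each step would drive a visualization rather than return a list.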
  • 32. UTOPIAN: user-driven topic modeling https://www.youtube.com/watch?v=du6_s6hcaRA&feature=youtu.be
  • 33. SketchPadN-D: WYDIWYG Sculpting and Editing in High-Dimensional Space https://www.youtube.com/watch?v=ar8kAdAfx6w
  • 35. Relationship discovery Scale-independent representations, whole and parts at the same time at multiple levels of abstraction, often linked
  • 36. Relationship discovery Explore high-dimensional relationships, theme groupings, outlier detection, searching by proximity at multiple scales
  • 37. Combined exploratory and confirmatory analytics • Develop and refine hypotheses • Evidence collection, management, and matching to hypotheses • Tailor views/displays for the thematic/hypothesis focus of interest • Often suggestive of predictions, enabling proactive thinking
  • 38. Multiple data types • Supports multiple data types: structured/unstructured text • Imagery/video, cyber • Systems are either data-type specific or application specific
  • 39. Temporal views and interactions • Most analytics situations involve time, pace, velocity • Group segments of thoughts by time • Compare time segments • Often combined with geospatial
  • 40. Reasoning workspace Stu Card, PARC
  • 41. Grouping and outlier detection • Form groups of thought/data • Labels and annotation • Compare groupings • Find small groups or outliers
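The grouping steps on this slide can be sketched in a few lines of Python. This is an illustrative sketch only: the (title, theme) document pairs, the size threshold, and the function names are assumptions, and treating a small group as an outlier candidate is one simple heuristic among many.

```python
from collections import defaultdict

# Hypothetical sample: (document title, theme label) pairs.
DOCUMENTS = [
    ("doc1", "finance"),
    ("doc2", "finance"),
    ("doc3", "health"),
    ("doc4", "finance"),
    ("doc5", "health"),
    ("doc6", "sports"),
]

def form_groups(items, key):
    """Form groups: bucket items under the label returned by key(item)."""
    groups = defaultdict(list)
    for item in items:
        groups[key(item)].append(item)
    return dict(groups)

def compare_groupings(groups):
    """Compare groupings: (label, size) pairs, largest group first."""
    sizes = ((label, len(members)) for label, members in groups.items())
    return sorted(sizes, key=lambda pair: pair[1], reverse=True)

def small_groups(groups, max_size=1):
    """Find small groups: groups with at most max_size members."""
    return {
        label: members
        for label, members in groups.items()
        if len(members) <= max_size
    }
```

With the sample data, grouping by theme and calling `small_groups` singles out the lone "sports" document, which an analyst could then inspect as a possible outlier.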
  • 42. Labeling • Critically important • Dynamic in scope, number of labels, size, color • Positioning • Almost everything has labels • Labels convey semantic meaning
  • 43. Multiple linked views Temporal, geospatial, theme, cluster, list views with association linkages between views
  • 45. Challenge 1: scalability • Challenge with regard to both: – visual representations – automatic analysis • Requirements/needs: – Solution needs to scale in size, dimensionality, data types, levels of quality. – Methods are needed to deal with input data that is noisy and continuous. – Patterns and relationships need to be visualized on different levels of detail, and with appropriate levels of data and visual abstraction.
  • 46. Challenge 2: uncertainty • Challenge: – large amount of noise and missing values from heterogeneous data sources – bias introduced by automatic analysis methods as well as human perception • Requirements/needs: – representation of the notion of data quality and the confidence – analysts need to be aware of the uncertainty and be able to analyze quality
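One very simple way to make data quality visible to an analyst is a per-field completeness score. The Python sketch below assumes rows are dictionaries and that a missing value is either an absent key or `None`; the sensor readings and function names are hypothetical, and completeness is only one of many possible quality indicators.

```python
# Hypothetical sensor readings with missing values.
READINGS = [
    {"sensor": "a", "temp": 21.5, "humidity": 40},
    {"sensor": "b", "temp": None, "humidity": 42},
    {"sensor": "c", "temp": 19.0},
    {"sensor": "d", "temp": 20.1, "humidity": None},
]

def completeness(rows, field):
    """Fraction of rows in which `field` is present and not None."""
    if not rows:
        return 0.0
    present = sum(1 for row in rows if row.get(field) is not None)
    return present / len(rows)

def quality_report(rows, fields):
    """Map each field name to its completeness score in [0, 1]."""
    return {field: completeness(rows, field) for field in fields}
```

A visualization could, for example, map these scores to opacity or to an explicit quality legend so that sparse fields are not read with the same confidence as complete ones.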
  • 47. Challenge 3: text data stream • Analysis and visualization of text streams is still a relatively new field. • Text stream data often – have little structure, grammar and context, – are provided as high-frequency multilingual streams, – contain a high percentage of non-meaningful and irrelevant messages. • The challenge is to handle the dynamic nature of the stream data and the low tolerance for monitoring delays in emergency cases.
  • 48. Challenge 4: interaction • Novel interaction techniques and tangible user interfaces for seamless, intuitive visual communication with the system. • The analyst should be able to – fully focus on the task at hand and – not be distracted by overly technical or complex user interfaces and interactions. • User feedback should be taken as intelligently as possible, requiring as little user input as possible.
  • 49. Challenge 5: evaluation • Challenge: – hard to assess the quality of visual analytics solutions – due to the interdisciplinary nature and complex process • Needs: – evaluation framework to assess effectiveness, efficiency and acceptance – of new visual analytics techniques, methods, and models
  • 50. Challenge 6: infrastructure • Challenge: – most solutions develop their own infrastructures – mismatch between the level of service provided and real needs: • fast and precise answers with progressive refinement, • incremental re-computation, • steering the computation towards data regions that are of interest. • Requirements/needs: – infrastructure to bind together all the processes, functions, and services – repositories of available visual analytics solutions
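Progressive refinement and incremental re-computation can be illustrated with a running mean that reports an estimate after every chunk of data instead of waiting for the full dataset. This is a minimal Python sketch under assumptions: the chunked input format and the function name are hypothetical, and steering towards regions of interest is not covered.

```python
def progressive_mean(chunks):
    """Yield (rows_seen, mean_so_far) after each chunk of numbers.

    The mean is updated incrementally, so earlier chunks are never
    re-read when a new chunk arrives.
    """
    count = 0
    mean = 0.0
    for chunk in chunks:
        for x in chunk:
            count += 1
            mean += (x - mean) / count  # incremental update of the mean
        yield count, mean
```

A front end could redraw its view after each yielded estimate, giving the analyst a fast approximate answer that sharpens as more data is processed.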
  • 53. Readings • Ch. 1-2 http://www.vismaster.eu/wp-content/uploads/2010/11/VisMaster-book-lowres.pdf
  • 54. References • Amar, R. A., & Stasko, J. T. (2005). Knowledge precepts for design and evaluation of information visualizations. IEEE Transactions on Visualization and Computer Graphics, 11(4), 432-442. • Keim, D. A., Mansmann, F., Schneidewind, J., & Ziegler, H. (2006). Challenges in visual data analysis. In Information Visualization (IV 2006), Invited Paper, July 5-7, London, United Kingdom. IEEE Press. • Keim, D., Andrienko, G., Fekete, J.-D., Görg, C., Kohlhammer, J., & Melancon, G. (2008). Visual analytics: Definition, process, and challenges. In A. Kerren, J. T. Stasko, J.-D. Fekete, & C. North (Eds.), Information Visualization. Lecture Notes in Computer Science, Vol. 4950. Springer-Verlag, Berlin, Heidelberg, 154-175. • Shneiderman, B. (2002). Inventing discovery tools: combining information visualization with data mining. Information Visualization, 1(1), 5-12.