SlideShare a Scribd company logo
1 of 22
Download to read offline
VINCI 2017
The 10th International Symposium on Visual
Information Communication and Interaction
Towards Glyph-based
Visualizations for Big
Data Clustering
AUG 15, 2017
Chair of Media Design
Technische Universität Dresden
Mandy Keck, Dietrich Kammer, Thomas Gründer, Thomas Thom, Martin Kleinsteuber,
Alexander Maasch, Rainer Groh
Structure
Part 1 Part 2
Part 3 Part 4
Problem Description
Related Work
Data Set,
Glyph Design
User Study Lessons Learned,
Future Work
Research Project
VANDA - Visual Analytics Interfaces for Big Data Environments
Data Analytics,
Copyright Observation
Data Crawling, Content
Exploration
Data Analytics and
Text Mining for Smart
Adaptive Learning
Environments
Research on Human
Computer Interaction
and Information
Visualization
Purchasing Platform bet-
ween Businesses with
Millions of Products
www.vanda-project.de
VINCI 2017 AUG 15, 2017 M. Keck et al. 3 | 22
Multidimensional Scaling [Torgerson 1952, Munzer 2014]
HD DATA
2D DATA
Item 1
Item 2
Item 3
Item 1
Item 2
Item 3
Dimension1
Dimension2
Dimension3
Dimension4
...
Item n Dimension5
Dimensionn
......
Item n
Dimension1
Dimension2
Visualization of Multi-dimensional Data Sets [Keim 2000]
Scatterplot (Geometric Technique) [w1] Glyphs (Icon-based Technique) [w2] Pixel-oriented Technique [Stefaner 2010]
VINCI 2017 AUG 15, 2017 M. Keck et al. 5 | 22
Using Glyphs for Cluster Analysis
VINCI 2017 AUG 15, 2017 M. Keck et al. 6 | 22
Glyph Design
Flower Glyph Star Plot
B1
Star plot: Whisker Plot with
connected endpoints of each line
B2
Filling the resulting shape for
color encoding
B3
Absolute axes to improve the
identification of extreme values +
transparency to enhance the
visibility of the coordinate system
A1
Length of each petal encodes
a quantitative attribute
A2
Redundant encoding: attribute
value is mapped to length and
brightness for different LoD
A3
Radius border to enhance the
identification of maximum values
Data Set
Event Data Set with 15 attributes
6 attributes are selected for glyph design:
price, popularity, time, distance,
estimationmusic and category
Quantitative attributes are normalized to
a value between 0 and 10
Categories are mapped to color
price
popularity
time
estimation-
music
distance
entertainment sports education
band tourism beauty
Hypotheses
H1
H2
Glyph-based visualizations reduce completion times in
comparison tasks when compared to tabular display
Glyph-based visualizations reduce completion times to identify outliers and
extreme values when compared to tabular displays
H3
H4
Tabular display of data has the highest accuracy when
compared to other visualization techniques
Flower glyphs reduce completion times and increase accuracy when identifying
extreme values
H5
Star plots reduce completion times and increase
accuracy in comparison tasks
Test Setting
Implementation in JavaScript
(jQuery, RequireJS, d3.js)
226 data items of the event data set
27’’ display with WQHD resolution
in portrait orientation
Table Flower Star
Conditions
VINCI 2017 AUG 15, 2017 M. Keck et al. 11 | 22
Methodology
25 tasks per interface, divided into 5 task types:
Id_High: Identification of high
extreme values
Id_Low: Identification of low
extreme values
Cat_High: Identification of high ex-
treme values in a specific category
Cat_Low: Identification of low
extrem values in a specific category
Comp: Comparison of all 5 values
to a provided example
„Find an event with a value in
popularity as high as possible!“
„Find an event with a
price as low as possible!“
„Find an event in the category tourism which has a
high chance that music is played there!“
„Find an event in the category
education that is very close to your city!“
„Find an event that is similar
to the shown example!“
price
popularity
time
estimation-
music
distance
VINCI 2017 AUG 15, 2017 M. Keck et al. 12 | 22
Participants
1427
41
USAGE
EVENT DATA
INFORMATION VISUALIZATION
GLYPH-BASED VISUALIZATION
very infrequent very frequent
TOTAL
AGE
EXPERIENCE
64
14
36,6%
19,5%
7,3%
14,6%
22,0%
no experience extensive experience
17,0%
7,3%
36,6%
29,3%
9,8%
31,7%
31,7%
17,1%
17,1%
2,4%
VINCI 2017 AUG 15, 2017 M. Keck et al. 13 | 22
Results | Time
0
10
20
30
40
Id_High Id_Low Cat_High Cat_Low Comp
TABLE FLOWER STAR
SOLUTION TIME IN SECONDSMain effects
Glyphs faster than Table, p < 0.001,
Starplot faster than Table, p < 0.042
Pairwise comparison for tasks
Id_High: Glyphs faster than Table,
Star faster than Flower
Cat_High, Cat_Low & Comp:
Glyphs faster than Table
VINCI 2017 AUG 15, 2017 M. Keck et al. 14 | 22
Results | Accuracy and Error Rate
0
0,4
0,8
1,2
Id_High Id_Low Cat_High Cat_Low Comp
TABLE FLOWER STAR
ACCURACYMain effects
Accuracy: no significant main effect for
conditions, p = 0.504
Error Rate: no significant main effect for
conditions, p = 0.122
Pairwise comparison for tasks
Id_Low: Table more accurate than Flower
Cat_Low: Table more accurate than Glyphs
VINCI 2017 AUG 15, 2017 M. Keck et al. 15 | 22
Questionnaire
User Experience Questionnaire (UEQ) for each
condition [Laugwitz et al. 2008]
26 adjective pairs - assigned to 6 factors:
Attractiveness, Perspicuity, Efficiency,
Dependability, Stimulation and Novelity
Each adjective pair uses a seven stage scale
(polarity is determined randomly)
attractive unattractive
VINCI 2017 AUG 15, 2017 M. Keck et al. 16 | 22
Results | Questionnaire
-2,00
-1,00
0,00
1,00
2,00
Attractiveness
Perspicuity
Efficiency
Dependability
Stimulation
Novelty
TABLE FLOWER STAR
QUESTIONNAIREMain effects
Glyphs rated better than Table, p < 0.001
Pairwise comparison for factors
Attractiveness, Efficiency, Stimulation:
Glyphs rated better than Table
Novelity: Flower better rated than Star;
Star rated better than Table
VINCI 2017 AUG 15, 2017 M. Keck et al. 17 | 22
Results | Hypotheses
Hypothesis MeasuresHypothesis
Description
Pairwise
comparison (p-value)
Task
H1 CompSolution Time
Significant
values found
Hypothesis
rejected
Glyphs faster than Table
for comparison tasks
yes no
H2
H3
Id_High
Id_Low
Cat_High
Cat_Low
Solution TimeGlyphs faster than Table
for extreme values
< 0.001
0.709
< 0.001
< 0.001
< 0.001
< 0.001
< 0.001
< 0.001
Flower – Table Star – Table
yes no
OverallAccuracyTable more accurate
than Glyphs
Table – Flower Table – Star
0.258 0.057 no no
H4 Flowers more accruate
and faster for extreme values
H5 Stars more accruate and
faster for comparison tasks
Solution Time
Accuracy
Solution Time
Accuracy
Id_High
Id_Low
Cat_High
Cat_Low
Comp
0.014
1.000
0.607
1.000
Solution Time Accuracy
1.000
0.140
0.153
1.000
Flower – Table Star – Table
< 0.001 < 0.001
0.084
Solution Time Accuracy
1.000
partially partially
no no
Missing flower petals could be more easily
perceived than zero values on the axes in
star plots
Discussion
More difficult to identify if the petal
encodes the maximum because of its
curvature
When flower glyphs encode data values with low
values in all dimensions, it was more difficult to
assign the value to a specific dimension
Star plot with many high extreme values pop out ea-
sier because of the larger surface area, so other star
plots with just one high value were often disregarded
More difficult to identify high extreme
values in star plots when adjacent
dimensions exhibit zero values
Improvements of Flower Glyphs and Star Plots
Future Work
[Kammer et al. 2017]
References
W.S. Torgerson. 1952. Multidimensional scaling: I. Theory and method. Psychometrika 17 (1952), 401–419.
T. Munzner. 2014. Visualization Analysis and Design. A.K. Peters visualization series. ISBN 978-1-466-50891-0. http://www.cs.ubc.ca/~tmm/
vadbook
Daniel A. Keim. 2000. Designing Pixel-Oriented Visualization Techniques: Theory	 and Applications. IEEE Trans. on Visualization and Compu-
ter Graphics 6, 1 (Jan. 2000), 59–78. DOI:https://doi.org/10.1109/2945.841121
M. Stefaner. 2010. The Design of “X by Y”. In: Beautiful Visualization: Looking at Data through the Eyes of Experts. O‘Reilly Media; 1 edition
(July 1, 2010). ISBN 978-1449379865
B. Laugwitz, T. Held, and M. Schrepp. 2008. Construction and Evaluation of a
User Experience Questionnaire. Springer Berlin Heidelberg, Berlin, Heidelberg, 63–76. DOI:https://doi.org/10.1007/978-3-540-89350-9 6
D. Kammer, M. Keck, M. Müller, T. Gründer, R. Groh. 2017. Exploring Big Data Landscapes with Elastic Displays Conference. Mensch & Com-
puter 2017 - Workshop Begreifbare Interaktion, Oldenbourg Verlag, Regensburg, Germany, (in press)
w1 - The Antibiotic Abacus. Information is Beautiful. http://www.informationisbeautiful.net/visualizations/antibiotic-resistance/, retrieved on
04.08.2017
w2- Stations & Lines. A visual catalog of every station in major rapid transit systems and the lines they serve. https://c82.net/work/?id=335,
retrieved on 04.08.2017
VINCI 2017 AUG 15, 2017 M. Keck et al. 22 | 22

More Related Content

Similar to Towards Glyph-based Visualizations for Big Data Clustering

ICCES 2017 - Crowd Density Estimation Method using Regression Analysis
ICCES 2017 - Crowd Density Estimation Method using Regression AnalysisICCES 2017 - Crowd Density Estimation Method using Regression Analysis
ICCES 2017 - Crowd Density Estimation Method using Regression AnalysisAhmed Gad
 
Application of deep leaning to computer vision
Application of deep leaning to computer visionApplication of deep leaning to computer vision
Application of deep leaning to computer visionDjamal Abide, MSc
 
Multi-Perspective Detail+Overview Visualization for 3D Building Exploration
Multi-Perspective Detail+Overview Visualization for 3D Building ExplorationMulti-Perspective Detail+Overview Visualization for 3D Building Exploration
Multi-Perspective Detail+Overview Visualization for 3D Building ExplorationMatthias Trapp
 
If a picture is worth a thousand words, Interactive data visualizations are w...
If a picture is worth a thousand words, Interactive data visualizations are w...If a picture is worth a thousand words, Interactive data visualizations are w...
If a picture is worth a thousand words, Interactive data visualizations are w...Olga Scrivner
 
STINGER: Multi-threaded Graph Streaming
STINGER: Multi-threaded Graph StreamingSTINGER: Multi-threaded Graph Streaming
STINGER: Multi-threaded Graph StreamingJason Riedy
 
Knowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender SystemsKnowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender SystemsEnrico Palumbo
 
Cupum 2013 Marco te Brömmelstroet
Cupum 2013 Marco te BrömmelstroetCupum 2013 Marco te Brömmelstroet
Cupum 2013 Marco te BrömmelstroetMarco
 
Application and evaluation of a K-Medoidsbased shape clustering method for an...
Application and evaluation of a K-Medoidsbased shape clustering method for an...Application and evaluation of a K-Medoidsbased shape clustering method for an...
Application and evaluation of a K-Medoidsbased shape clustering method for an...Venkat Projects
 
Visualization of Knowledge Distribution across Development Teams using 2.5D S...
Visualization of Knowledge Distribution across Development Teams using 2.5D S...Visualization of Knowledge Distribution across Development Teams using 2.5D S...
Visualization of Knowledge Distribution across Development Teams using 2.5D S...Matthias Trapp
 
Explaining job recommendations: a human-centred perspective
Explaining job recommendations: a human-centred perspectiveExplaining job recommendations: a human-centred perspective
Explaining job recommendations: a human-centred perspectiveKatrien Verbert
 
Data fusion for city live event detection
Data fusion for city live event detectionData fusion for city live event detection
Data fusion for city live event detectionAlket Cecaj
 
New Research Articles 2022 January Issue International Journal of Software En...
New Research Articles 2022 January Issue International Journal of Software En...New Research Articles 2022 January Issue International Journal of Software En...
New Research Articles 2022 January Issue International Journal of Software En...ijseajournal
 
From Representation to Mediation: A New Agenda for Conceptual Modeling Resear...
From Representation to Mediation: A New Agenda for Conceptual Modeling Resear...From Representation to Mediation: A New Agenda for Conceptual Modeling Resear...
From Representation to Mediation: A New Agenda for Conceptual Modeling Resear...Jan Recker @ University of Hamburg
 
“And Then a Miracle Occurs …” Engaging the challenge of operationalizing theo...
“And Then a Miracle Occurs …” Engaging the challenge of operationalizing theo...“And Then a Miracle Occurs …” Engaging the challenge of operationalizing theo...
“And Then a Miracle Occurs …” Engaging the challenge of operationalizing theo...Michael von Kutzschenbach
 
Official resume titash_mandal_
Official resume titash_mandal_Official resume titash_mandal_
Official resume titash_mandal_Titash Mandal
 
CSUN 2024 Simplifying Accessible Data Visualizations - 5 April 2024.pptx
CSUN 2024 Simplifying Accessible Data Visualizations - 5 April 2024.pptxCSUN 2024 Simplifying Accessible Data Visualizations - 5 April 2024.pptx
CSUN 2024 Simplifying Accessible Data Visualizations - 5 April 2024.pptxTed Gies
 

Similar to Towards Glyph-based Visualizations for Big Data Clustering (20)

ICCES 2017 - Crowd Density Estimation Method using Regression Analysis
ICCES 2017 - Crowd Density Estimation Method using Regression AnalysisICCES 2017 - Crowd Density Estimation Method using Regression Analysis
ICCES 2017 - Crowd Density Estimation Method using Regression Analysis
 
Technology Acceptance of Virtual Reality for Travel Planning
Technology Acceptance of Virtual Reality for Travel PlanningTechnology Acceptance of Virtual Reality for Travel Planning
Technology Acceptance of Virtual Reality for Travel Planning
 
Application of deep leaning to computer vision
Application of deep leaning to computer visionApplication of deep leaning to computer vision
Application of deep leaning to computer vision
 
Multi-Perspective Detail+Overview Visualization for 3D Building Exploration
Multi-Perspective Detail+Overview Visualization for 3D Building ExplorationMulti-Perspective Detail+Overview Visualization for 3D Building Exploration
Multi-Perspective Detail+Overview Visualization for 3D Building Exploration
 
If a picture is worth a thousand words, Interactive data visualizations are w...
If a picture is worth a thousand words, Interactive data visualizations are w...If a picture is worth a thousand words, Interactive data visualizations are w...
If a picture is worth a thousand words, Interactive data visualizations are w...
 
Ontology-based Matchmaking to Provide Personalized Offers
Ontology-based Matchmaking to Provide Personalized OffersOntology-based Matchmaking to Provide Personalized Offers
Ontology-based Matchmaking to Provide Personalized Offers
 
STINGER: Multi-threaded Graph Streaming
STINGER: Multi-threaded Graph StreamingSTINGER: Multi-threaded Graph Streaming
STINGER: Multi-threaded Graph Streaming
 
Icpc13.ppt
Icpc13.pptIcpc13.ppt
Icpc13.ppt
 
Knowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender SystemsKnowledge Graph Embeddings for Recommender Systems
Knowledge Graph Embeddings for Recommender Systems
 
Cupum 2013 Marco te Brömmelstroet
Cupum 2013 Marco te BrömmelstroetCupum 2013 Marco te Brömmelstroet
Cupum 2013 Marco te Brömmelstroet
 
Application and evaluation of a K-Medoidsbased shape clustering method for an...
Application and evaluation of a K-Medoidsbased shape clustering method for an...Application and evaluation of a K-Medoidsbased shape clustering method for an...
Application and evaluation of a K-Medoidsbased shape clustering method for an...
 
Visualization of Knowledge Distribution across Development Teams using 2.5D S...
Visualization of Knowledge Distribution across Development Teams using 2.5D S...Visualization of Knowledge Distribution across Development Teams using 2.5D S...
Visualization of Knowledge Distribution across Development Teams using 2.5D S...
 
Explaining job recommendations: a human-centred perspective
Explaining job recommendations: a human-centred perspectiveExplaining job recommendations: a human-centred perspective
Explaining job recommendations: a human-centred perspective
 
Data fusion for city live event detection
Data fusion for city live event detectionData fusion for city live event detection
Data fusion for city live event detection
 
New Research Articles 2022 January Issue International Journal of Software En...
New Research Articles 2022 January Issue International Journal of Software En...New Research Articles 2022 January Issue International Journal of Software En...
New Research Articles 2022 January Issue International Journal of Software En...
 
From Representation to Mediation: A New Agenda for Conceptual Modeling Resear...
From Representation to Mediation: A New Agenda for Conceptual Modeling Resear...From Representation to Mediation: A New Agenda for Conceptual Modeling Resear...
From Representation to Mediation: A New Agenda for Conceptual Modeling Resear...
 
“And Then a Miracle Occurs …” Engaging the challenge of operationalizing theo...
“And Then a Miracle Occurs …” Engaging the challenge of operationalizing theo...“And Then a Miracle Occurs …” Engaging the challenge of operationalizing theo...
“And Then a Miracle Occurs …” Engaging the challenge of operationalizing theo...
 
Official resume titash_mandal_
Official resume titash_mandal_Official resume titash_mandal_
Official resume titash_mandal_
 
Tangible 3D Tabletops
Tangible 3D TabletopsTangible 3D Tabletops
Tangible 3D Tabletops
 
CSUN 2024 Simplifying Accessible Data Visualizations - 5 April 2024.pptx
CSUN 2024 Simplifying Accessible Data Visualizations - 5 April 2024.pptxCSUN 2024 Simplifying Accessible Data Visualizations - 5 April 2024.pptx
CSUN 2024 Simplifying Accessible Data Visualizations - 5 April 2024.pptx
 

Recently uploaded

Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...shivangimorya083
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxolyaivanovalion
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Onlineanilsa9823
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 

Recently uploaded (20)

Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 

Towards Glyph-based Visualizations for Big Data Clustering

  • 1. VINCI 2017 The 10th International Symposium on Visual Information Communication and Interaction Towards Glyph-based Visualizations for Big Data Clustering AUG 15, 2017 Chair of Media Design Technische Universität Dresden Mandy Keck, Dietrich Kammer, Thomas Gründer, Thomas Thom, Martin Kleinsteuber, Alexander Maasch, Rainer Groh
  • 2. Structure Part 1 Part 2 Part 3 Part 4 Problem Description Related Work Data Set, Glyph Design User Study Lessons Learned, Future Work
  • 3. Research Project VANDA - Visual Analytics Interfaces for Big Data Environments Data Analytics, Copyright Observation Data Crawling, Content Exploration Data Analytics and Text Mining for Smart Adaptive Learning Environments Research on Human Computer Interaction and Information Visualization Purchasing Platform bet- ween Businesses with Millions of Products www.vanda-project.de VINCI 2017 AUG 15, 2017 M. Keck et al. 3 | 22
  • 4. Multidimensional Scaling [Torgerson 1952, Munzer 2014] HD DATA 2D DATA Item 1 Item 2 Item 3 Item 1 Item 2 Item 3 Dimension1 Dimension2 Dimension3 Dimension4 ... Item n Dimension5 Dimensionn ...... Item n Dimension1 Dimension2
  • 5. Visualization of Multi-dimensional Data Sets [Keim 2000] Scatterplot (Geometric Technique) [w1] Glyphs (Icon-based Technique) [w2] Pixel-oriented Technique [Stefaner 2010] VINCI 2017 AUG 15, 2017 M. Keck et al. 5 | 22
  • 6. Using Glyphs for Cluster Analysis VINCI 2017 AUG 15, 2017 M. Keck et al. 6 | 22
  • 7. Glyph Design Flower Glyph Star Plot B1 Star plot: Whisker Plot with connected endpoints of each line B2 Filling the resulting shape for color encoding B3 Absolute axes to improve the identification of extreme values + transparency to enhance the visibility of the coordinate system A1 Length of each petal encodes a quantitative attribute A2 Redundant encoding: attribute value is mapped to length and brightness for different LoD A3 Radius border to enhance the identification of maximum values
  • 8. Data Set Event Data Set with 15 attributes 6 attributes are selected for glyph design: price, popularity, time, distance, estimationmusic and category Quantitative attributes are normalized to a value between 0 and 10 Categories are mapped to color price popularity time estimation- music distance entertainment sports education band tourism beauty
  • 9. Hypotheses H1 H2 Glyph-based visualizations reduce completion times in comparison tasks when compared to tabular display Glyph-based visualizations reduce completion times to identify outliers and extreme values when compared to tabular displays H3 H4 Tabular display of data has the highest accuracy when compared to other visualization techniques Flower glyphs reduce completion times and increase accuracy when identifying extreme values H5 Star plots reduce completion times and increase accuracy in comparison tasks
  • 10. Test Setting Implementation in JavaScript (jQuery, RequireJS, d3.js) 226 data items of the event data set 27’’ display with WQHD resolution in portrait orientation
  • 11. Table Flower Star Conditions VINCI 2017 AUG 15, 2017 M. Keck et al. 11 | 22
  • 12. Methodology 25 tasks per interface, divided into 5 task types: Id_High: Identification of high extreme values Id_Low: Identification of low extreme values Cat_High: Identification of high ex- treme values in a specific category Cat_Low: Identification of low extrem values in a specific category Comp: Comparison of all 5 values to a provided example „Find an event with a value in popularity as high as possible!“ „Find an event with a price as low as possible!“ „Find an event in the category tourism which has a high chance that music is played there!“ „Find an event in the category education that is very close to your city!“ „Find an event that is similar to the shown example!“ price popularity time estimation- music distance VINCI 2017 AUG 15, 2017 M. Keck et al. 12 | 22
  • 13. Participants 1427 41 USAGE EVENT DATA INFORMATION VISUALIZATION GLYPH-BASED VISUALIZATION very infrequent very frequent TOTAL AGE EXPERIENCE 64 14 36,6% 19,5% 7,3% 14,6% 22,0% no experience extensive experience 17,0% 7,3% 36,6% 29,3% 9,8% 31,7% 31,7% 17,1% 17,1% 2,4% VINCI 2017 AUG 15, 2017 M. Keck et al. 13 | 22
  • 14. Results | Time 0 10 20 30 40 Id_High Id_Low Cat_High Cat_Low Comp TABLE FLOWER STAR SOLUTION TIME IN SECONDSMain effects Glyphs faster than Table, p < 0.001, Starplot faster than Table, p < 0.042 Pairwise comparison for tasks Id_High: Glyphs faster than Table, Star faster than Flower Cat_High, Cat_Low & Comp: Glyphs faster than Table VINCI 2017 AUG 15, 2017 M. Keck et al. 14 | 22
  • 15. Results | Accuracy and Error Rate 0 0,4 0,8 1,2 Id_High Id_Low Cat_High Cat_Low Comp TABLE FLOWER STAR ACCURACYMain effects Accuracy: no significant main effect for conditions, p = 0.504 Error Rate: no significant main effect for conditions, p = 0.122 Pairwise comparison for tasks Id_Low: Table more accurate than Flower Cat_Low: Table more accurate than Glyphs VINCI 2017 AUG 15, 2017 M. Keck et al. 15 | 22
  • 16. Questionnaire User Experience Questionnaire (UEQ) for each condition [Laugwitz et al. 2008] 26 adjective pairs - assigned to 6 factors: Attractiveness, Perspicuity, Efficiency, Dependability, Stimulation and Novelity Each adjective pair uses a seven stage scale (polarity is determined randomly) attractive unattractive VINCI 2017 AUG 15, 2017 M. Keck et al. 16 | 22
  • 17. Results | Questionnaire -2,00 -1,00 0,00 1,00 2,00 Attractiveness Perspicuity Efficiency Dependability Stimulation Novelty TABLE FLOWER STAR QUESTIONNAIREMain effects Glyphs rated better than Table, p < 0.001 Pairwise comparison for factors Attractiveness, Efficiency, Stimulation: Glyphs rated better than Table Novelity: Flower better rated than Star; Star rated better than Table VINCI 2017 AUG 15, 2017 M. Keck et al. 17 | 22
  • 18. Results | Hypotheses Hypothesis MeasuresHypothesis Description Pairwise comparison (p-value) Task H1 CompSolution Time Significant values found Hypothesis rejected Glyphs faster than Table for comparison tasks yes no H2 H3 Id_High Id_Low Cat_High Cat_Low Solution TimeGlyphs faster than Table for extreme values < 0.001 0.709 < 0.001 < 0.001 < 0.001 < 0.001 < 0.001 < 0.001 Flower – Table Star – Table yes no OverallAccuracyTable more accurate than Glyphs Table – Flower Table – Star 0.258 0.057 no no H4 Flowers more accruate and faster for extreme values H5 Stars more accruate and faster for comparison tasks Solution Time Accuracy Solution Time Accuracy Id_High Id_Low Cat_High Cat_Low Comp 0.014 1.000 0.607 1.000 Solution Time Accuracy 1.000 0.140 0.153 1.000 Flower – Table Star – Table < 0.001 < 0.001 0.084 Solution Time Accuracy 1.000 partially partially no no
  • 19. Missing flower petals could be more easily perceived than zero values on the axes in star plots Discussion More difficult to identify if the petal encodes the maximum because of its curvature When flower glyphs encode data values with low values in all dimensions, it was more difficult to assign the value to a specific dimension Star plot with many high extreme values pop out ea- sier because of the larger surface area, so other star plots with just one high value were often disregarded More difficult to identify high extreme values in star plots when adjacent dimensions exhibit zero values
  • 20. Improvements of Flower Glyphs and Star Plots
  • 22. References W.S. Torgerson. 1952. Multidimensional scaling: I. Theory and method. Psychometrika 17 (1952), 401–419. T. Munzner. 2014. Visualization Analysis and Design. A.K. Peters visualization series. ISBN 978-1-466-50891-0. http://www.cs.ubc.ca/~tmm/ vadbook Daniel A. Keim. 2000. Designing Pixel-Oriented Visualization Techniques: Theory and Applications. IEEE Trans. on Visualization and Compu- ter Graphics 6, 1 (Jan. 2000), 59–78. DOI:https://doi.org/10.1109/2945.841121 M. Stefaner. 2010. The Design of “X by Y”. In: Beautiful Visualization: Looking at Data through the Eyes of Experts. O‘Reilly Media; 1 edition (July 1, 2010). ISBN 978-1449379865 B. Laugwitz, T. Held, and M. Schrepp. 2008. Construction and Evaluation of a User Experience Questionnaire. Springer Berlin Heidelberg, Berlin, Heidelberg, 63–76. DOI:https://doi.org/10.1007/978-3-540-89350-9 6 D. Kammer, M. Keck, M. Müller, T. Gründer, R. Groh. 2017. Exploring Big Data Landscapes with Elastic Displays Conference. Mensch & Com- puter 2017 - Workshop Begreifbare Interaktion, Oldenbourg Verlag, Regensburg, Germany, (in press) w1 - The Antibiotic Abacus. Information is Beautiful. http://www.informationisbeautiful.net/visualizations/antibiotic-resistance/, retrieved on 04.08.2017 w2- Stations & Lines. A visual catalog of every station in major rapid transit systems and the lines they serve. https://c82.net/work/?id=335, retrieved on 04.08.2017 VINCI 2017 AUG 15, 2017 M. Keck et al. 22 | 22