This document presents an approach called Approximate and User Steerable t-Distributed Stochastic Neighbor Embedding (A-tSNE) for progressive visual analytics of large and streaming datasets. A-tSNE speeds up t-SNE by approximating similarity computations during layout using k-nearest neighbors. It allows interactive steering and refinement of the layout. The approach is evaluated on two case studies, speeding up t-SNE over 100x on gene expression data and enabling interactive layout of streaming activity sensor data.
User Steerable A-tSNE for Progressive Visual Analytics
1. Approximate and User Steerable tSNE
for Progressive Visual Analytics
Nicola Pezzotti, Boudewijn P.F. Lelieveldt, Laurens van der Maaten,
Thomas Höllt, Elmar Eisemann, Anna Vilanova
17. 17
Case Study II : High-dimensional data streams
Chest - Ankle - Wrist
52 Dimensions every 100 ms
Image courtesy of www.activ8all.com
18. 18[1] Hierarchical Stochastic Neighbor Embedding - Pezzotti et al. - 2016
Conclusions
• Approximation in Progressive
Visual Analytics
• Approximated-tSNE
• Data manipulation
• Refinement
• Scalability issues of the gradient
descent
• Hierarchical SNE [1]
19. 19
Thank you for your attention!
A-tSNE
Precision: 35%
tSNE A-tSNE
Precision: 5%
Similarities
computation time: 12 sSimilarities
computation time: 29 s
Precomp. 3195 s
Speed 4x
29 s 12 s