3. t-SNE:
Student T Distributed-Stochastic Neighbor Embedding
▷ Nonlinear Dimension Reduction for Visualization (2-D or 3-D)
▷ Advance Version of SNE (G. Hinton, NIPS 2003)
▷ Gradient-based Machine Learning Algorithm
33. SNE t-SNE
▷ Crowding Problem Student t-Distribution
Student t-Distribution in Low-Dimension
This High-Dimension Data
34. SNE t-SNE
▷ Crowding Problem Student t-Distribution
Student t-Distribution in Low-Dimension
This High-Dimension Data
Loses its Probability
Closer
35. SNE t-SNE
▷ Crowding Problem Student t-Distribution
Student t-Distribution in Low-Dimension
This High-Dimension Data
36. SNE t-SNE
▷ Crowding Problem Student t-Distribution
Student t-Distribution in Low-Dimension
This High-Dimension Data
Gains its Probability
More far away
37. High-D Low-D 𝑝𝑖𝑗 𝑞𝑖𝑗 (𝑝𝑖𝑗 − 𝑞𝑖𝑗) (𝑦𝑖 − 𝑦𝑗) Gradient
Large Large 1 1 0 Large 0
Small Small 0 0 0 Small 0
Small Large 0 1 -1 Large Large
Attraction
Large Small 1 0 1 Small Small
Repulsion
Small Replusion
38. Adding Slight Repulsion (Uniform Dist. in 𝑞𝑖𝑗)
Often Not the Case
Low-D Initialized by Gaussian
39. High-D Low-D 𝑝𝑖𝑗 𝑞𝑖𝑗 (𝑝𝑖𝑗 − 𝑞𝑖𝑗) (𝑦𝑖 − 𝑦𝑗) 1 + 𝑦𝑖 − 𝑦𝑗
2 −1
Gradient
Large Large 1 1 0 Large Small 0
Small Small 0 0 0 Small Large 0
Small Large 0 1 -1 Large Small Attraction
Large Small 1 0 1 Small Large Repulsion
Strong Replusion