Description
http://dataconf.com.ua/speaker-page/andrii-malenko.php
Video
https://www.youtube.com/watch?v=tBgNBeO5-rA&list=PL5_LBM8-5sLjbRFUtXaUpg84gtJtyc4Pu&t=0s&index=13
This document presents five lessons learned from using Clojure at a text analytics startup that processes 3 million documents per day. It notes that Clojure provides great operability and powerful yet simple parallelism tools. However, using Clojure alone does not guarantee good design, and the language lacks a machine learning ecosystem as relevant as Python's. Overall, Clojure is concluded to be one of the best multi-purpose languages, but not a silver bullet on its own.
Clojure's three killer design decisions that boosted its adoption and maturity were:
1) Solving everyday problems through its small core and terse syntax which addresses boilerplate code and provides primitives for concurrency and parallelism.
2) Being hosted on the JVM to leverage the trusted and performant runtime as well as the large Java ecosystem.
3) Having effective governance through significant innovations within the first 10 years and an openness to adopting ideas from other communities.
OpenPonk (formerly DynaCASE): The open modeling platform (ESUG)
OpenPonk (formerly DynaCASE) is an open modeling platform for conceptual modeling. It allows for diagramming, live modeling where changes are immediately visible, simulations for more visual modeling, bi-directional DSLs to link models and text, validations to ensure model quality, code generation from models, and is fully open source. The document introduces OpenPonk, discusses conceptual modeling, and outlines several features of the modeling platform.
Roberto Navigli - From Text to Concepts and Back: Going Multilingual with Bab... (MeetupDataScienceRoma)
The document discusses Roberto Navigli giving a talk on BabelNet, a multilingual semantic network created by merging various knowledge resources, and Babelfy, a state-of-the-art multilingual word sense disambiguation system that leverages BabelNet. The talk outlines BabelNet and Babelfy, demonstrates Babelfy for word sense disambiguation, and discusses how working with BabelNet provides coverage of numerous resources by annotating with their combined knowledge.
Multilingual Access to Cultural Heritage Content on the Semantic Web - ACL 2013 (Mariana Damova, Ph.D)
The document discusses building an ontology-based application to communicate museum content in multiple languages on the Semantic Web. It aims to make cultural heritage accessible to both humans and computers by generating natural language descriptions from semantic data. The application uses Grammatical Framework to linearize multiple museum datasets and ontologies into 15 languages. It addresses challenges in cross-linguistically representing classes, properties, word order, tense, and reference. The system was demonstrated to generate descriptions of paintings from the Louvre museum in English and French.
Modeling software systems at a macroscopic scale (Ralf Laemmel)
This document discusses megamodeling, which involves creating models that represent other models, languages, and technologies used in software projects. A megamodel captures the relationships between these different elements at a macro scale. The document proposes using MegaL, a general-purpose megamodeling language, to precisely model software systems in terms of the languages, technologies, and concepts involved and their relationships. MegaL aims to help manage diversity and heterogeneity in software while providing cognitive value through abstract understanding and documentation of designs.
The Global Environment Facility (GEF) initiated the International Waters: Learning Exchange and Resource Network (IW:LEARN) project in 1997 to enhance knowledge sharing for GEF international waters projects. IW:LEARN is a UNEP/UNDP project that works to improve online sharing of data and information relevant to managing international waters. The IW:LEARN team is spread globally and maintains a website to serve as an information hub, document repository, and portal for learning materials and project visualization.
Deep Learning勉強会@小町研 "Learning Character-level Representations for Part-of-Sp..." (Yuki Tomo)
Presented at the 12/22 Deep Learning study group at Komachi Lab (Deep Learning勉強会@小町研):
"Learning Character-level Representations for Part-of-Speech Tagging" by Cícero Nogueira dos Santos and Bianca Zadrozny.
IW:LEARN is a joint UNEP/UNDP project which works with GEF (Global Environment Facility) International Waters projects to improve online sharing of data and information relevant to managing international waters, including marine, coastal and freshwater ecosystems.
Event Details
The IW:LEARN website acts as a document clearinghouse and as an information and news hub for the GEF IW Projects. IW:LEARN also supplies Plone training and hosting for projects.
The IW:LEARN Team is spread around the globe (Bratislava, Nairobi, Bangkok and formerly Washington DC).
This talk will give an overview of the challenges and experiences of using Plone in an internationally distributed project with requirements that are a good match for Plone's capabilities yet exceed what Plone offers out of the box.
For the needs of our content editors we have developed a set of tools that make metadata entry and extraction easier, to encourage them to provide better-quality metadata. This improves the user experience and discoverability.
collective.ots to extract a meaningful description from the content
collective.simserver to automatically relate similar items
collective.langdet to determine the language of content
IW:LEARN integrates and visualizes geospatial information about the projects and their areas of work, using collective.geo extensively. In this talk we will show how we use choropleth and cluster maps for thematic mapping with collective.geo.
We also provide services for which Plone is not a good match, so we use other technologies as well, such as KARL (knowledge management and sharing) and GeoNode (sharing of geospatial data).
This document discusses contextual word embeddings and how they address the limitations of context-free word embeddings. It begins by explaining that context-free word embeddings cannot model polysemy since words have the same embedding regardless of context. It then introduces contextual word embeddings as a solution, discussing early approaches like CoVe and ELMo that learn contextual embeddings from language models. The document emphasizes that contextual embeddings allow words to have different representations depending on the surrounding context.
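The polysemy point can be made concrete with a small sketch. Everything here is an illustrative stand-in: the three-dimensional vectors are invented, and the neighbour-averaging rule is only a crude proxy for what models like CoVe and ELMo actually learn.

```python
# Toy static embeddings (hypothetical 3-d vectors, not trained).
STATIC = {
    "bank":  [0.5, 0.5, 0.0],
    "river": [0.0, 0.9, 0.1],
    "money": [0.9, 0.0, 0.1],
}

def static_embed(word, sentence):
    # A context-free model ignores the sentence entirely.
    return STATIC[word]

def contextual_embed(word, sentence):
    # Crude stand-in for a contextual model: average the word's vector
    # with the vectors of its in-vocabulary neighbours in the sentence.
    vecs = [STATIC[w] for w in sentence if w in STATIC]
    return [sum(v[i] for v in vecs) / len(vecs) for i in range(3)]

s1 = "deposit money at the bank".split()
s2 = "fish on the river bank".split()

# Context-free: identical vector for "bank" in both sentences.
assert static_embed("bank", s1) == static_embed("bank", s2)
# Contextual: the two occurrences of "bank" now differ.
assert contextual_embed("bank", s1) != contextual_embed("bank", s2)
```

The assertions capture the document's core claim: the static lookup cannot separate the financial and riverside senses of "bank", while any context-dependent encoding can.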
The Datalift Project aims to publish and interconnect government open data. It develops tools and methodologies to transform raw datasets into interconnected semantic data. The project's first phase focuses on opening data by developing an infrastructure to ease publication. The second phase will validate the platform by publishing real datasets. The goal of Datalift is to move data from its raw published state to being fully interconnected on the Semantic Web.
This document discusses Universal Design for Learning (UDL) and how Web 2.0 tools can support its three principles of multiple means of engagement, representation, and expression. It provides examples of several free Web 2.0 tools that meet UDL criteria by allowing flexible learning solutions and collaboration. These include Wallwisher for online corkboards, Glogster for multimedia presentations, and Piratepad for shared document editing. Contact information is given for further resources on UDL and Web 2.0 tools.
WIDOCO: A Wizard for Documenting Ontologies (dgarijo)
WIDOCO is a WIzard for DOCumenting Ontologies that guides users through the documentation process of their vocabularies. Given an RDF vocabulary, WIDOCO detects missing vocabulary metadata and creates documentation with diagrams, human-readable descriptions of the ontology terms, and a summary of changes with respect to previous versions of the ontology. The documentation consists of a set of linked, enriched HTML pages that can be further extended by end users. WIDOCO is open source and builds on well-established Semantic Web tools. So far, it has been used to document more than one hundred ontologies in different domains.
A Controlled Natural Language Interface for Semantic MediaWiki (Jie Bao)
This document proposes using a controlled natural language interface for the semantic wiki Semantic MediaWiki. It aims to improve usability and expressivity. Key points:
- Using a controlled natural language instead of formal logic improves ease of use for non-experts and allows knowledge input without thinking in "subject-property-object" terms.
- An ontology meta-model extends Semantic MediaWiki to support the full range of OWL/RDF constructs like class domains and ranges.
- Forms, templates, and a natural language generation module allow editing knowledge in controlled natural language and translating between the wiki, ontology meta-model, and RDF formats.
- The approach supports multiple controlled natural languages.
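The round trip between controlled natural language and triples can be sketched with a toy parser. The single sentence pattern below is invented for illustration and is not Semantic MediaWiki's actual syntax.

```python
import re

def cnl_to_triple(sentence):
    """Parse a tiny controlled-English pattern into an RDF-like triple.

    Supported pattern (illustrative only): "<subject> has <property> <object>."
    """
    m = re.fullmatch(r"(\w+) has (\w+) (\w+)\.", sentence)
    if not m:
        raise ValueError("sentence is outside the controlled language")
    return m.groups()  # (subject, property, object)

def triple_to_cnl(triple):
    # Inverse direction: naive natural-language generation from a triple.
    s, p, o = triple
    return f"{s} has {p} {o}."

triple = cnl_to_triple("Berlin has population 3645000.")
assert triple == ("Berlin", "population", "3645000")
# Round trip: generating text from the triple recovers the sentence.
assert triple_to_cnl(triple) == "Berlin has population 3645000."
```

The appeal of a controlled language is exactly this property: every sentence inside the fragment parses deterministically, so wiki text and the ontology can stay synchronized in both directions.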
Oxford University Press (OUP) publishes dictionaries and other reference works in over 70 languages. It aims to further the University of Oxford's research and education through worldwide publishing. OUP provides dictionary content and data to power various digital applications, including tools that track word frequencies over time, analyze text, map metaphors, and visualize the development of specialized vocabularies like surfing terms. Its datasets cover dozens of languages and include information on headwords, definitions, and etymologies. OUP seeks to enable rich language communication globally and reduce the "digital divide" through its language solutions and datasets.
This document discusses static analysis of programs written in domain-specific languages (DSLs) developed with Xtext. It presents a model-driven strategy to bridge the gap between Xtext and SonarQube grammar formats, allowing quality analysis of DSL programs. The strategy involves automatically generating the Java code infrastructure for a new language in SonarQube based on the Xtext grammar. An example DSL for teaching programming called Vary is presented to demonstrate the approach.
Analyzing the Evolution of Vocabulary Terms and Their Impact on the LOD Cloud (MOVING Project)
Vocabularies are used for modeling data in Knowledge Graphs (KGs) like the Linked Open Data Cloud and Wikidata. During their lifetime, vocabularies are subject to changes: new terms are coined while existing terms are modified or deprecated. We first quantify the amount and frequency of changes in vocabularies. Subsequently, we investigate to what extent and when the changes are adopted in the evolution of KGs.
We conduct our experiments on three large-scale KGs for which time-stamped information is available, namely the Billion Triples Challenge datasets, the Dynamic Linked Data Observatory dataset, and Wikidata. Our results show that the change frequency of terms is rather low, but changes can have high impact due to the large amount of distributed graph data on the web. Furthermore, not all coined terms are used, and most of the deprecated terms are still used by data publishers. The adoption time of terms coming from different vocabularies ranges from very fast (a few days) to very slow (a few years). Surprisingly, we could observe some adoptions before the vocabulary changes were published. Understanding the evolution of vocabulary terms is important to avoid wrong assumptions about the modeling status of data published on the web, which may result in difficulties when querying the data from distributed sources.
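The adoption-time measurement described above reduces to simple date arithmetic; the terms and dates below are fabricated for illustration and are not results from the paper.

```python
from datetime import date

def adoption_lag_days(term_published, first_observed_use):
    """Days between a term's publication in its vocabulary and its first
    observed use in a dataset. Negative values correspond to the paper's
    surprising case: the term was used before the change was published."""
    return (first_observed_use - term_published).days

# Hypothetical terms with made-up dates, for illustration only.
lag_fast  = adoption_lag_days(date(2015, 3, 1), date(2015, 3, 4))   # a few days
lag_slow  = adoption_lag_days(date(2012, 1, 1), date(2015, 6, 1))   # a few years
lag_early = adoption_lag_days(date(2015, 3, 1), date(2015, 2, 20))  # pre-publication use

assert lag_fast == 3
assert lag_slow == 1247
assert lag_early == -9
```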
Can Deep Learning Techniques Improve Entity Linking? (Julien PLU)
Julien Plu presented on using deep learning techniques to improve entity linking. He discussed using word embeddings and neural networks to better recognize and link entities in documents by understanding semantic relationships between words. Current supervised methods require large training sets and are not robust to new entity types, while unsupervised methods have difficulties computing relatedness between candidate entities. Deep learning approaches may help address these issues through their ability to learn complex patterns from large amounts of unlabeled text data.
Video game controlled vocabulary in Wikidata (peterchanws)
Peter Chan is a digital archivist at Stanford Libraries who has led several projects related to preserving born-digital archival materials and developing software to support processing digital archives. He is involved with organizations working on video game preservation and proposed using Wikidata to publish controlled vocabularies and game metadata. The presentation outlined steps to include the OLAC video game genre vocabulary in Wikidata, including proposing it as an external identifier property, creating missing terms, and adding broader relationships between concepts. Issues with Wikidata like lack of control over edits and the need for multiple models were also discussed.
This presentation begins with a specific issue in text mining that connects it with word embeddings. Later, the importance of Wikipedia is highlighted, and finally, lessons to be learned from Wikipedia are discussed.
This document summarizes an agenda for a responsive design roundtable discussion. The roundtable will cover topics including what responsive design is, the user continuum, and mobile web vs responsive design vs apps. It will take place on June 12, 2013 from 8am to 10am and be led by Christian Glover Wilson. The agenda includes 4 topics to discuss and time for Q&A.
The document discusses the latest programming languages as of 2011-2012. It provides an example report format listing Boo and D as two of the latest languages. Boo, developed in 2011, is a general-purpose language for the .NET platform seeking to support Unicode and web applications. D, developed in 2012, is a general systems language for Unix-like, Windows, and Mac OS X platforms using an object-oriented approach and compiler.
Overview of text classification approaches algorithms & software v lyubin... (Olga Zinkevych)
The main points of the presentation: Overview of text classification approaches: algorithms & software
Summary: For the last 2 months I've been building a system for classifying customer support tickets into several categories in terms of product area, importance, etc. Throughout that time I've tried several approaches and benchmarked them against each other. In this talk I would like to showcase some of my findings, including research algorithms that perform well and relevant software. This talk would be useful for someone who needs to build a text categorization system, or someone who just wants to get an overview of one of the most popular NLP research problems (classification).
In this talk you will learn:
* About various approaches used for text classification (e.g. approaches based on TF-IDF, or approaches based on word embeddings and RNNs - recurrent neural nets).
* How these approaches perform against each other on real-world data.
* Software that is useful for implementing these approaches.
* Research behind some of these approaches.
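As a minimal illustration of the TF-IDF family of approaches listed above, the sketch below builds TF-IDF vectors by hand and classifies a ticket by cosine similarity to class centroids. The tiny billing/login corpus is invented; a real system would use a library such as scikit-learn.

```python
from collections import Counter
from math import log, sqrt

def tfidf_vectors(docs):
    """TF-IDF weights for a list of tokenized documents."""
    n = len(docs)
    df = Counter(t for d in docs for t in set(d))   # document frequency
    idf = {t: log(n / df[t]) for t in df}           # inverse document frequency
    return [{t: c / len(d) * idf[t] for t, c in Counter(d).items()}
            for d in docs]

def cosine(a, b):
    dot = sum(a[t] * b.get(t, 0.0) for t in a)
    na = sqrt(sum(v * v for v in a.values()))
    nb = sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def centroid(vecs):
    keys = {t for v in vecs for t in v}
    return {t: sum(v.get(t, 0.0) for v in vecs) / len(vecs) for t in keys}

# Invented two-class toy corpus: billing vs. login support tickets.
train = [("refund charged twice invoice".split(), "billing"),
         ("invoice payment refund".split(), "billing"),
         ("cannot login password reset".split(), "login"),
         ("login page error password".split(), "login")]
vecs = tfidf_vectors([d for d, _ in train])
cents = {lab: centroid([v for v, (_, l) in zip(vecs, train) if l == lab])
         for lab in {"billing", "login"}}

def classify(tokens):
    # Re-use the training corpus's IDF; tokens unseen in training are ignored.
    n = len(train)
    df = Counter(t for d, _ in train for t in set(d))
    v = {t: c / len(tokens) * log(n / df[t])
         for t, c in Counter(tokens).items() if t in df}
    return max(cents, key=lambda lab: cosine(v, cents[lab]))

assert classify("please refund my invoice".split()) == "billing"
assert classify("reset my password".split()) == "login"
```

The embedding/RNN approaches mentioned in the talk replace these sparse term vectors with dense learned representations, but the classify-by-similarity skeleton stays recognizably the same.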
http://dataconf.com.ua/speaker-page/volodymyr-lyubinets.php
https://www.youtube.com/watch?v=shmc-MI-xbo&index=5&list=PL5_LBM8-5sLjbRFUtXaUpg84gtJtyc4Pu
What it takes to build a model for detecting patients that defaults from medi... (Olga Zinkevych)
Topic of presentation: What it takes to build a model for detecting patients that default from medication
The main points of the presentation:
Why data exploration is important
Clean data is half of the success
Why subject-matter experts are crucial in healthcare projects
Feature engineering as a way to make your model more accurate
We will talk about how we use clinical data to try to predict whether patients will or will not default from their medication.
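One adherence feature commonly engineered from refill records is the proportion of days covered (PDC). A minimal sketch, assuming hypothetical (start_day, days_supplied) refill data:

```python
def proportion_of_days_covered(refills, window_days):
    """PDC: the fraction of days in the observation window on which the
    patient had medication on hand, given (start_day, days_supplied)
    refill records. A PDC below roughly 0.8 is a widely used
    non-adherence signal."""
    covered = set()
    for start_day, days_supplied in refills:
        for day in range(start_day, start_day + days_supplied):
            if 0 <= day < window_days:
                covered.add(day)   # overlapping refills count once
    return len(covered) / window_days

# Hypothetical patient: two 30-day refills with a 15-day gap, over 90 days.
pdc = proportion_of_days_covered([(0, 30), (45, 30)], window_days=90)
assert abs(pdc - 60 / 90) < 1e-9   # covered 60 of 90 days
```

Features like this turn raw dispensing events into a number a model can consume, which is the kind of feature engineering the talk argues makes the model more accurate.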
http://dataconf.com.ua/speaker-page/jaya-plmanabhan.php
https://www.youtube.com/watch?v=vjvwzhyLOX4&list=PL5_LBM8-5sLjbRFUtXaUpg84gtJtyc4Pu&t=0s&index=7
http://dataconf.com.ua/speaker-page/khrystyna-kosenko.php
More Related Content
Similar to "Evolution of words through time" (A. Malenko, DataConf 21.04.18)
Topic of presentation: Variational autoencoders for speech processing
The main points of the presentation: Variational autoencoders (or VAEs) have become one of the most popular unsupervised learning techniques for modelling complex data distributions, such as images and audio. In this talk I'll begin with a general introduction to VAEs and then review a recent technique called VQ-VAE, which is capable of learning a rudimentary phoneme-level language model from raw audio without any supervision.
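The piece of the VAE that makes end-to-end training possible is the reparameterization trick, z = mu + sigma * eps; a numpy sketch with toy batch and latent sizes (the dimensions are illustrative, not from the talk):

```python
import numpy as np

rng = np.random.default_rng(0)

def reparameterize(mu, log_var, rng):
    """Sample z = mu + sigma * eps with eps ~ N(0, I).
    Gradients can flow through mu and log_var because the
    randomness is isolated in eps."""
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * log_var) * eps

def kl_divergence(mu, log_var):
    """KL(q(z|x) || N(0, I)) for a diagonal Gaussian, per sample."""
    return -0.5 * np.sum(1 + log_var - mu**2 - np.exp(log_var), axis=1)

# Toy encoder outputs for a batch of 4 inputs, latent dimension 8.
mu = rng.standard_normal((4, 8)) * 0.1
log_var = np.full((4, 8), -1.0)
z = reparameterize(mu, log_var, rng)
print(z.shape)  # (4, 8)
```

The training loss would add this KL term to a reconstruction term; VQ-VAE replaces the Gaussian latent with a learned discrete codebook.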
http://dataconf.com.ua/speaker-page/dmytro-bielievtsov.php
https://www.youtube.com/watch?v=euYSAL-aKMI&list=PL5_LBM8-5sLjbRFUtXaUpg84gtJtyc4Pu&t=0s&index=9
Dataservices based on mesos and kafka kostiantyn bokhan dataconf 21 04 18Olga Zinkevych
Topic of presentation: Dataservices based on mesos and kafka
The main points of the presentation: In his talk, Kostiantyn will share his experience building data services on top of technologies such as Kafka, Docker, Mesos, Aerospike, and Spark. The following questions will be covered: orchestration, isolation, resource management, service discovery and load balancing, and interaction between data services. He will also discuss resource-management problems of Java-based and Spark-based services running under a Mesos cluster, as well as implementing CI and CD for data services.
*CI - continuous integration, CD - continuous delivery
http://dataconf.com.ua/speaker-page/kostiantyn-bokhan.php
https://www.youtube.com/watch?v=4d41DDyKuwU&list=PL5_LBM8-5sLjbRFUtXaUpg84gtJtyc4Pu&t=0s&index=3
Azure data catalog your data your way eugene polonichko dataconf 21 04 18Olga Zinkevych
Topic of presentation: Azure Data Catalog: your data, your way
The main points of the presentation: It's a fully managed service that lets you, from analyst to data scientist to data developer, register, enrich, discover, understand, and consume data sources.
http://dataconf.com.ua/speaker-page/eugene-polonichko.php
https://www.youtube.com/watch?v=wceGzcQcPOo&list=PL5_LBM8-5sLjbRFUtXaUpg84gtJtyc4Pu&t=0s&index=4
Aibdconference chat bot for every product Maksym VolchenkoOlga Zinkevych
This document discusses conversational interfaces and chatbots. It begins with an introduction to the author's background and experience in artificial intelligence and as an Android developer. It then discusses why conversational interfaces are becoming more popular as people prefer interacting with products and services through human conversation versus many separate apps. Chatbots provide a cross-platform solution for this using natural language processing. The document defines key terms like bots, chatbots, virtual assistants and describes common NLP techniques. It provides examples of chatbot architectures and development tools like API.ai and discusses analytics. It concludes that while conversing with bots is interesting, human interaction is more meaningful.
Ai&bigdataconference oleksandr saienko machine learning use cases in telecomOlga Zinkevych
Topic of presentation: Machine Learning use cases in Telecom
The main points of the presentation: Oleksandr will talk about some interesting examples of using Machine Learning in Telecom: optimizing the cellular network, improving customer experience, models for predicting mobile device location, customer churn prediction, fraud detection, and others. He will cover the main modern approaches based on machine learning.
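Churn prediction, one of the use cases listed, is commonly framed as binary classification; a hedged sketch with synthetic data and a from-scratch logistic regression (the features and the generating rule are invented for illustration):

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy telecom features (invented): monthly charge, support calls, tenure.
X = rng.standard_normal((200, 3))
# Synthetic rule: more support calls and shorter tenure -> more churn.
true_w = np.array([0.5, 2.0, -1.5])
y = (X @ true_w + 0.3 * rng.standard_normal(200) > 0).astype(float)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Logistic regression trained by plain gradient descent.
w = np.zeros(3)
for _ in range(500):
    p = sigmoid(X @ w)
    w -= 0.1 * X.T @ (p - y) / len(y)

accuracy = ((sigmoid(X @ w) > 0.5) == y).mean()
```

In practice one would use a library implementation plus gradient boosting, and evaluate on held-out data rather than the training set.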
http://dataconf.com.ua/oleksandr-saienko.php
Ai big dataconference_volodymyr getmanskyi colorization distance measuringOlga Zinkevych
Topic of presentation: Deep learning for satellite imagery colorization and distance measuring.
The main points of the presentation:
Using modern techniques, we compared existing colorization methods from the perspective of satellite maps. After that, we built our own engine for measuring distances on the maps.
http://dataconf.com.ua/index.php#agenda
#dataconf
#AIBDConference
Ai big dataconference_taras firman how to build advanced prediction with addi...Olga Zinkevych
Topic of presentation: How to build advanced prediction with adding external data.
The main points of the presentation:
We will discuss different types of time series, the main approaches to building forecasts, how to work with missing data, and how to add external data using Machine Learning techniques. After that, we will look at existing Python forecasting libraries.
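The steps above (filling missing values, then fitting a model that uses external data as an extra feature) can be sketched as follows; the series and the seasonal covariate are synthetic, invented for illustration:

```python
import numpy as np

def fill_missing(series):
    """Linearly interpolate NaN gaps in a time series."""
    series = series.copy()
    idx = np.arange(len(series))
    missing = np.isnan(series)
    series[missing] = np.interp(idx[missing], idx[~missing], series[~missing])
    return series

# Toy series: trend plus a seasonal external covariate plus noise,
# with two missing observations.
rng = np.random.default_rng(42)
t = np.arange(48, dtype=float)
external = np.sin(2 * np.pi * t / 12)          # e.g. a monthly covariate
y = 0.5 * t + 3.0 * external + rng.normal(0, 0.2, 48)
y[[10, 30]] = np.nan
y = fill_missing(y)

# Least-squares fit on [1, t, external]: the external data enters the
# forecast as one more regressor.
A = np.column_stack([np.ones_like(t), t, external])
coef, *_ = np.linalg.lstsq(A, y, rcond=None)
forecast = coef[0] + coef[1] * 48 + coef[2] * np.sin(2 * np.pi * 48 / 12)
```

A real project would reach for a forecasting library, but the mechanics of adding an external regressor are the same.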
http://dataconf.com.ua/index.php#agenda
#dataconf
#AIBDConference
Ai big dataconference_sparkinonehour_vitalii bashunOlga Zinkevych
Topic of presentation: First Spark application in one hour
Are you a beginner in the Big Data world? Don't know where to start? This session is for you: an introduction to distributed computations, Hadoop, and the most popular and powerful framework in the Big Data world, Apache Spark. This session aims to explain Big Data from scratch in simple words and to show how you can write and run your first Spark application in one hour.
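The canonical first Spark application is a word count; the same map/reduce flow can be sketched in plain Python (the PySpark equivalent would use flatMap, map, and reduceByKey on an RDD):

```python
from functools import reduce
from itertools import chain

lines = ["big data big world", "hello big data"]

# map phase: split each line into (word, 1) pairs,
# like rdd.flatMap(str.split).map(lambda w: (w, 1)) in PySpark.
pairs = chain.from_iterable(((w, 1) for w in line.split()) for line in lines)

# reduce phase: sum counts per key, like reduceByKey(operator.add).
def merge(acc, pair):
    word, n = pair
    acc[word] = acc.get(word, 0) + n
    return acc

counts = reduce(merge, pairs, {})
print(counts["big"])  # 3
```

Spark runs the same two phases, but partitioned across a cluster with the shuffle happening between map and reduce.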
http://dataconf.com.ua/index.php#agenda
#dataconf
#AIBDConference
Ai big dataconference_semantic image segmentatation using word embeddings_ole...Olga Zinkevych
Topic of presentation: Semantic image segmentation using word embeddings
The main points of the presentation:
Semantic image segmentation
Word embeddings
Unsupervised learning
Object detection
Multimodal learning
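One way word embeddings enter segmentation is by assigning each segment the class whose name embedding is closest to the segment's predicted embedding, which lets the model handle labels unseen at training time; a toy sketch (the 4-dimensional label embeddings are invented, real ones would come from a model such as word2vec or GloVe):

```python
import numpy as np

# Toy word embeddings for class labels (invented for illustration).
label_embeddings = {
    "cat":  np.array([0.9, 0.1, 0.0, 0.2]),
    "dog":  np.array([0.8, 0.2, 0.1, 0.1]),
    "car":  np.array([0.0, 0.9, 0.8, 0.0]),
    "road": np.array([0.1, 0.8, 0.9, 0.1]),
}

def nearest_label(region_embedding, label_embeddings):
    """Assign a segment the label whose embedding has the highest
    cosine similarity to the segment's predicted embedding."""
    def cos(a, b):
        return a @ b / (np.linalg.norm(a) * np.linalg.norm(b))
    return max(label_embeddings,
               key=lambda l: cos(region_embedding, label_embeddings[l]))

# A region whose predicted embedding lies near "road".
region = np.array([0.05, 0.85, 0.95, 0.05])
print(nearest_label(region, label_embeddings))  # road
```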
http://dataconf.com.ua/index.php#agenda
#dataconf
#AIBDConference
Ai big dataconference_ml_fastdata_vitalii bondarenkoOlga Zinkevych
This document discusses machine learning on fast data. It presents an agenda covering ML on production systems, TensorFlow, Kafka, Docker and Kubernetes. It then describes the machine learning process and shows how an enterprise analytics platform can integrate data sources, a machine learning cluster using Kafka, and data destinations. It provides details on TensorFlow and how it can be used for linear regression and neural networks. It also explains Apache Kafka as a streaming data service bus and how Confluent Platform extends it. Finally, it briefly introduces Docker and Kubernetes.
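The linear-regression part can be illustrated without TensorFlow: the update below is the same gradient-descent step an optimizer would apply to the weight and bias variables, written out in numpy on synthetic data:

```python
import numpy as np

rng = np.random.default_rng(7)

# Synthetic data: y = 2x + 1 plus noise (invented for illustration).
x = rng.uniform(-1, 1, 100)
y = 2.0 * x + 1.0 + rng.normal(0, 0.1, 100)

# Gradient descent on the mean squared error.
w, b = 0.0, 0.0
lr = 0.1
for _ in range(300):
    err = w * x + b - y
    w -= lr * 2 * np.mean(err * x)
    b -= lr * 2 * np.mean(err)

print(round(w, 1), round(b, 1))  # close to 2.0 and 1.0
```

In TensorFlow the loop body would be replaced by an optimizer minimizing the same loss, which is what makes the jump from this sketch to a neural network mostly a change of model, not of training procedure.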
Ai big dataconference_krakovetskyi_microsoft ai a new era of smart solutionsOlga Zinkevych
Topic of presentation: Microsoft AI: a new era of smart solutions
The main points of the presentation: In this presentation we will talk about Microsoft's tools and products that can add some intelligence to your apps and solutions. We will cover Cognitive Services, chatbots, Cortana and Alexa, Deep Learning, and Azure Machine Learning.
http://dataconf.com.ua/index.php#agenda
#dataconf
#AIBDConference
Ai big dataconference_jeffrey ricker_kappa_architectureOlga Zinkevych
Topic of presentation: Kappa architecture (and beyond)
The main points of the presentation:
We will discuss the evolution of big data architecture, from batch to Lambda to Kappa. I will walk through how to implement a Kappa architecture with practical examples, focusing on how to reach its full potential and avoid the pitfalls. We will finish by reviewing what lies ahead, including the inevitable consolidation between microservices, GPGPU, and Hadoop.
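The defining property of a Kappa architecture is that every view is derived by replaying a single immutable log; a minimal sketch in plain Python (a real deployment would replay a Kafka topic, and the events here are invented):

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Event:
    user: str
    amount: int

# In a Kappa architecture the log is the source of truth; any
# materialized view is derived by replaying it from the beginning.
log = [
    Event("alice", 10),
    Event("bob", 5),
    Event("alice", -3),
]

def replay(log):
    """Rebuild a per-user balance view from the event log. Upgrading
    the processing logic just means replaying the log through a new
    version of this function, no separate batch layer required."""
    view = {}
    for e in log:
        view[e.user] = view.get(e.user, 0) + e.amount
    return view

balances = replay(log)
print(balances["alice"])  # 7
```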
http://dataconf.com.ua/index.php#agenda
#dataconf
#AIBDConference
Ai big dataconference_eugene_polonichko_azure data lake Olga Zinkevych
Topic of presentation: Azure Data Lake: what is it? why is it? where is it?
The main points of the presentation:
What is Azure Data Lake? Why does Microsoft call this technology Big Data? Azure Data Lake includes all the capabilities required to make it easy for developers, data scientists, and analysts to store data of any size, shape, and speed, and to do all types of processing and analytics across platforms and languages. It removes the complexities of ingesting and storing all of your data while making it faster to get up and running with batch, streaming, and interactive analytics.
http://dataconf.com.ua/index.php#agenda
#dataconf
#AIBDConference
Macroeconomics- Movie Location
This will be used as part of your Personal Professional Portfolio once graded.
Objective:
Prepare a presentation or a paper using research, basic comparative analysis, data organization and application of economic information. You will make an informed assessment of an economic climate outside of the United States to accomplish an entertainment industry objective.
A Strategic Approach: GenAI in EducationPeter Windle
Artificial Intelligence (AI) technologies such as Generative AI, Image Generators and Large Language Models have had a dramatic impact on teaching, learning and assessment over the past 18 months. The most immediate threat AI posed was to Academic Integrity with Higher Education Institutes (HEIs) focusing their efforts on combating the use of GenAI in assessment. Guidelines were developed for staff and students, policies put in place too. Innovative educators have forged paths in the use of Generative AI for teaching, learning and assessments leading to pockets of transformation springing up across HEIs, often with little or no top-down guidance, support or direction.
This Gasta posits a strategic approach to integrating AI into HEIs to prepare staff, students and the curriculum for an evolving world and workplace. We will highlight the advantages of working with these technologies beyond the realm of teaching, learning and assessment by considering prompt engineering skills, industry impact, curriculum changes, and the need for staff upskilling. In contrast, not engaging strategically with Generative AI poses risks, including falling behind peers, missed opportunities and failing to ensure our graduates remain employable. The rapid evolution of AI technologies necessitates a proactive and strategic approach if we are to remain relevant.
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...PECB
Denis is a dynamic and results-driven Chief Information Officer (CIO) with a distinguished career spanning information systems analysis and technical project management. With a proven track record of spearheading the design and delivery of cutting-edge Information Management solutions, he has consistently elevated business operations, streamlined reporting functions, and maximized process efficiency.
Certified as an ISO/IEC 27001: Information Security Management Systems (ISMS) Lead Implementer, Data Protection Officer, and Cyber Risks Analyst, Denis brings a heightened focus on data security, privacy, and cyber resilience to every endeavor.
His expertise extends across a diverse spectrum of reporting, database, and web development applications, underpinned by an exceptional grasp of data storage and virtualization technologies. His proficiency in application testing, database administration, and data cleansing ensures seamless execution of complex projects.
What sets Denis apart is his comprehensive understanding of Business and Systems Analysis technologies, honed through involvement in all phases of the Software Development Lifecycle (SDLC). From meticulous requirements gathering to precise analysis, innovative design, rigorous development, thorough testing, and successful implementation, he has consistently delivered exceptional results.
Throughout his career, he has taken on multifaceted roles, from leading technical project management teams to owning solutions that drive operational excellence. His conscientious and proactive approach is unwavering, whether he is working independently or collaboratively within a team. His ability to connect with colleagues on a personal level underscores his commitment to fostering a harmonious and productive workplace environment.
Date: May 29, 2024
Tags: Information Security, ISO/IEC 27001, ISO/IEC 42001, Artificial Intelligence, GDPR
-------------------------------------------------------------------------------
Find out more about ISO training and certification services
Training: ISO/IEC 27001 Information Security Management System - EN | PECB
ISO/IEC 42001 Artificial Intelligence Management System - EN | PECB
General Data Protection Regulation (GDPR) - Training Courses - EN | PECB
Webinars: https://pecb.com/webinars
Article: https://pecb.com/article
-------------------------------------------------------------------------------
For more information about PECB:
Website: https://pecb.com/
LinkedIn: https://www.linkedin.com/company/pecb/
Facebook: https://www.facebook.com/PECBInternational/
Slideshare: http://www.slideshare.net/PECBCERTIFICATION
A workshop hosted by the South African Journal of Science aimed at postgraduate students and early career researchers with little or no experience in writing and publishing journal articles.
Introduction to AI for Nonprofits with Tapp NetworkTechSoup
Dive into the world of AI! Experts Jon Hill and Tareq Monaur will guide you through AI's role in enhancing nonprofit websites and basic marketing strategies, making it easy to understand and apply.
A review of the growth of the Israel Genealogy Research Association Database Collection over the last 12 months. Our collection has now passed the 3 million mark and is still growing. See which archives have contributed the most, the different types of records we have, and which years have had records added. You can also see what we have planned for the future.
How to Fix the Import Error in the Odoo 17Celine George
An import error occurs when a program fails to import a module or library, disrupting its execution. In languages like Python, this issue arises when the specified module cannot be found or accessed, hindering the program's functionality. Resolving import errors is crucial for maintaining smooth software operation and uninterrupted development processes.
2. The idea
● The meaning of words in natural language changes slowly but surely
● A word's meaning can be estimated from its context or the nearest neighbours occurring in texts
● A language model represents relations between words in a numeric way
● Changes in the language model show the evolution of natural language across different time periods
● New concepts in mass consciousness appear as a synthesis of previous concepts
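The last points can be illustrated by comparing a word's nearest neighbours in embedding spaces trained on different periods; a toy sketch with invented 3-dimensional embeddings (real experiments would use models trained on dated corpora):

```python
import numpy as np

def neighbours(word, space, k=2):
    """Return the k nearest neighbours of `word` by cosine similarity."""
    v = space[word]
    def cos(a, b):
        return a @ b / (np.linalg.norm(a) * np.linalg.norm(b))
    others = [w for w in space if w != word]
    return sorted(others, key=lambda w: cos(v, space[w]), reverse=True)[:k]

# Toy embeddings (invented) for "cell" in two periods: its
# neighbourhood shifts from prison/biology terms towards "phone".
space_1950 = {
    "cell":   np.array([0.9, 0.1, 0.0]),
    "prison": np.array([0.8, 0.2, 0.1]),
    "tissue": np.array([0.7, 0.3, 0.0]),
    "phone":  np.array([0.0, 0.1, 0.9]),
}
space_2000 = {
    "cell":   np.array([0.2, 0.1, 0.9]),
    "prison": np.array([0.8, 0.2, 0.1]),
    "tissue": np.array([0.7, 0.3, 0.0]),
    "phone":  np.array([0.1, 0.1, 0.95]),
}
print(neighbours("cell", space_1950))  # ['prison', 'tissue']
print(neighbours("cell", space_2000)[0])  # phone
```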