Topic models are probabilistic models for discovering the underlying semantic structure of a document collection through hierarchical Bayesian analysis. Latent Dirichlet allocation (LDA) is a commonly used topic model that represents each document as a mixture of topics and each topic as a distribution over words. The posterior distribution over topic assignments, given the words in each document, is typically approximated with Gibbs sampling.
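As a concrete, minimal illustration of fitting an LDA model, the sketch below uses scikit-learn; the toy corpus, the choice of two topics, and the parameter values are assumptions made only for this example, and note that scikit-learn's implementation uses variational inference rather than Gibbs sampling.

# Minimal sketch (assumptions: scikit-learn installed; corpus, topic count,
# and random_state are illustrative choices, not from the slides).
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import LatentDirichletAllocation

docs = [
    "the cat sat on the mat",
    "dogs and cats make good pets",
    "stocks fell as markets reacted to interest rates",
    "investors bought bonds and stocks",
]

# Bag-of-words counts: one row per document, one column per vocabulary word
vectorizer = CountVectorizer(stop_words="english")
X = vectorizer.fit_transform(docs)

# Fit LDA with 2 topics (scikit-learn uses online variational Bayes,
# not Gibbs sampling, but the underlying LDA model is the same)
lda = LatentDirichletAllocation(n_components=2, random_state=0)
doc_topic = lda.fit_transform(X)   # per-document topic mixtures
topic_word = lda.components_       # per-topic word weights

# Show the top words of each topic
vocab = vectorizer.get_feature_names_out()
for k, weights in enumerate(topic_word):
    top = weights.argsort()[::-1][:3]
    print("topic", k, ":", [vocab[i] for i in top])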
Explore topic modeling via LDA (Latent Dirichlet Allocation) and its steps in detail.
Thanks for your time. If you enjoyed this short video, there are plenty more topics in advanced analytics, data science, and machine learning available in my Medium repo: https://medium.com/@bobrupakroy
Introduction to Latent Dirichlet Allocation (LDA). We cover the basic ideas necessary to understand LDA and then construct the model from its generative process. Intuitions are emphasized, but little guidance is given for fitting the model, which is not very insightful.
word2vec, LDA, and introducing a new hybrid algorithm: lda2vec (Christopher Moody)
Available with notes:
http://www.slideshare.net/ChristopherMoody3/word2vec-lda-and-introducing-a-new-hybrid-algorithm-lda2vec
(Data Day 2016)
Standard natural language processing (NLP) is a messy and difficult affair. It requires teaching a computer about English-specific word ambiguities as well as the hierarchical, sparse nature of words in sentences. At Stitch Fix, word vectors help computers learn from the raw text in customer notes. Our systems need to identify a medical professional when she writes that she 'used to wear scrubs to work', and distill 'taking a trip' into a Fix for vacation clothing. Applied appropriately, word vectors are dramatically more meaningful and more flexible than current techniques and let computers peer into text in a fundamentally new way. I'll try to convince you that word vectors give us a simple and flexible platform for understanding text, covering word2vec and LDA and introducing our hybrid algorithm lda2vec.
Convolutional Neural Networks and Natural Language Processing (Thomas Delteil)
Presentation on Convolutional Neural Networks and their application to Natural Language Processing, with an in-depth walk-through of the Crepe architecture from Xiang Zhang, Junbo Zhao, and Yann LeCun, 'Character-level Convolutional Networks for Text Classification', Advances in Neural Information Processing Systems 28 (NIPS 2015).
Loosely based on ODSC London 2016 talk: https://www.slideshare.net/MiguelFierro1/deep-learning-for-nlp-67182819
Code: https://github.com/ThomasDelteil/TextClassificationCNNs_MXNet
Demo: https://thomasdelteil.github.io/TextClassificationCNNs_MXNet/
(flattened pdf, no animation, email author for .pptx)
This describes supervised machine learning, the categorisation of supervised learning into regression and classification, their types, applications of supervised machine learning, etc.
Uncertainty & Probability
Bayes' rule
Choosing Hypotheses - Maximum a Posteriori
Maximum Likelihood - Bayes Concept Learning
Maximum Likelihood of a Real-Valued Function
Bayes Optimal Classifier
Joint distributions
Naive Bayes Classifier
A talk by Sergey Koltsov (NRU HSE) at the International Conference on Big Data and its Applications (ICBDA).
ICBDA is a conference for entrepreneurs and developers on how to solve business problems effectively using big data analytics.
http://icbda2015.org/
KDD 2014 Presentation (Best Research Paper Award): Alias Topic Modelling (Reducing the Sampling Complexity of Topic Models) by Aaron Li
Video (2014): http://videolectures.net/kdd2014_li_sampling_complexity/
This paper presents an approximate sampler for topic models that theoretically and experimentally outperforms existing samplers, thereby allowing topic models to scale to industry-scale datasets.
In this natural language understanding (NLU) project, we implemented and compared various approaches for predicting the topics of paragraph-length texts. This paper explains our methodology and results for the following approaches: Naive Bayes, One-vs-Rest Support Vector Machine (OvR SVM) with GloVe vectors, Latent Dirichlet Allocation (LDA) with OvR SVM, Convolutional Neural Networks (CNN), and Long Short-Term Memory networks (LSTM).
Calculating Projections via Type Checking (Daisuke Bekki)
Bekki Daisuke and Miho Sato (2015).
A presentation at TYpe Theory and LExical Semantics (TYTLES), part of the 27th European Summer School in Logic, Language and Information (ESSLLI 2015), Barcelona, Spain.
This was my final project back in 2009 for the Natural Language Processing class in the CS department at the University of Pittsburgh, PA, USA, taught by Professor Rebecca Hwa.
The backup slides contain many details about LDA, hyperparameters, how to calculate the distributions based on MLE, etc.
5. Topic Models. (Slide shows two example topics, Topic 1 and Topic 2.) Three latent variables: the word distribution per topic (word-topic matrix), the topic distribution per document (topic-doc matrix), and the topic assignment of each word (Steyvers, 2006).
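To make these three latent variables concrete, here is a minimal sketch of the LDA generative process in Python (not from the slides; the topic count, vocabulary size, document sizes, and Dirichlet hyperparameter values are illustrative assumptions):

import numpy as np

rng = np.random.default_rng(0)
n_topics, vocab_size, n_docs, doc_len = 2, 10, 5, 20
alpha, beta = 0.1, 0.01  # Dirichlet hyperparameters (illustrative values)

# phi: word distribution per topic (rows of the word-topic matrix)
phi = rng.dirichlet(np.full(vocab_size, beta), size=n_topics)

docs = []
for d in range(n_docs):
    # theta: topic distribution of this document (a row of the topic-doc matrix)
    theta = rng.dirichlet(np.full(n_topics, alpha))
    words = []
    for _ in range(doc_len):
        z = rng.choice(n_topics, p=theta)     # topic assignment for this word
        w = rng.choice(vocab_size, p=phi[z])  # word drawn from that topic's distribution
        words.append(w)
    docs.append(words)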
20. Gibbs Sampling for LDA. The probability that topic j is chosen for word w_i, conditioned on all other topic assignments of words in this document and all other observed variables: count the number of times word token w_i was assigned to topic j across all documents, and count the number of times topic j was already assigned to some word token in document d_i. The result is unnormalized, so divide the probability of assigning topic j to word w_i by the sum over all T topics.
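Written out, this is the standard collapsed Gibbs update from Griffiths and Steyvers, which the slide is describing. Here C^{WT} and C^{DT} are the word-topic and topic-doc count matrices introduced above (with the current assignment excluded from the counts), W is the vocabulary size, and T the number of topics:

P(z_i = j \mid z_{-i}, w) \;\propto\;
  \frac{C^{WT}_{w_i j} + \beta}{\sum_{w=1}^{W} C^{WT}_{w j} + W\beta}
  \cdot
  \frac{C^{DT}_{d_i j} + \alpha}{\sum_{t=1}^{T} C^{DT}_{d_i t} + T\alpha}

The first factor counts how often word token w_i was assigned to topic j across all documents; the second counts how often topic j was assigned in document d_i; the denominators are the normalizing sums the slide refers to.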
21.–30. (no text content extracted for these slides)
31. AT Model Latent Variables. The latent variables are: 1) the author-topic assignment for each word; 2) the topic distribution of each author, which determines which topics are used by which authors (count matrix C^AT); 3) the word distribution of each topic (count matrix C^WT).
32. Matrix Representation of the Author-Topic Model. Source: http://www.ics.uci.edu/~smyth/kddpapers/UCI_KD-D_author_topic_preprint.pdf (the diagram shows θ (x) and φ (z) together with a_d, with the observed and latent variables labeled).
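For reference, and not part of the original slides: in the author-topic model (Rosen-Zvi et al.) the collapsed Gibbs sampler draws an author x_i and a topic z_i jointly for each word. In the count-matrix notation above (C^AT counting author-topic assignments, C^WT counting word-topic assignments, both excluding the current word), the update takes the same form as the LDA one:

P(x_i = k, z_i = j \mid w_i, z_{-i}, x_{-i}, a_d) \;\propto\;
  \frac{C^{WT}_{w_i j} + \beta}{\sum_{w=1}^{W} C^{WT}_{w j} + W\beta}
  \cdot
  \frac{C^{AT}_{k j} + \alpha}{\sum_{t=1}^{T} C^{AT}_{k t} + T\alpha}

where k ranges only over the authors a_d of the document containing w_i.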
33.–36. (no text content extracted for these slides)
37. Predictive Power of Different Models (Rosen-Zvi, 2005). Experiment: training data of 1,557 papers; test data of 183 papers (102 of which are single-authored). Test documents were chosen so that each author of a test-set document also appears as an author in the training set.
38. (no text content extracted for this slide)
39. Gibbs Sampling for the ART Model. Random start: sample an author-recipient pair for each word and sample a topic for each word. Then compute, for each word w_i: the number of recipients of the message to which w_i belongs; the number of times topic t was assigned to an author-recipient pair; the number of times the current word token was assigned to topic t; the number of times all other topics were assigned to that author-recipient pair; the number of times all other words were assigned to topic t; and the number of words times beta.
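These counts combine into a collapsed Gibbs update of the same form as the LDA and author-topic updates above. A sketch of that update, with the symbols N, M, and R introduced here only to name the quantities the slide lists (N^{(a,r)}_t = times topic t was assigned to author-recipient pair (a, r), M_{t,w} = times word w was assigned to topic t, both excluding the current word; R_{d_i} = number of recipients of the message containing w_i; W words in the vocabulary, T topics):

P(z_i = t, x_i = (a, r) \mid z_{-i}, x_{-i}, w) \;\propto\;
  \frac{1}{R_{d_i}}
  \cdot
  \frac{N^{(a,r)}_{t} + \alpha}{\sum_{t'=1}^{T} N^{(a,r)}_{t'} + T\alpha}
  \cdot
  \frac{M_{t, w_i} + \beta}{\sum_{w=1}^{W} M_{t, w} + W\beta}

The 1/R_{d_i} factor comes from drawing the recipient uniformly from the message's recipients; it is the same for every candidate pair within a message, so it cancels when the probabilities are normalized.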