Forgot about LDA by Jake Bohall of Virante, Inc. - A look at nTopic and content relevancy

A Simple On-Site Method for Increasing Search Traffic using Latent Dirichlet Allocation and nTopic content relevancy scoring.

Published in: Marketing

Transcript of "Forgot about LDA by Jake Bohall of Virante, Inc. - A look at nTopic and content relevancy"

  1. Did you forget LDA? A Simple On-Site Method for Increasing Search Traffic! VIRANTE WEB MARKETING @JakeBohall
  2. www.Virante.org @JakeBohall
  3. So.. What is it? IS: Topical relevance. Topic models are algorithms used to uncover hidden thematic structures in a collection of data … a statistical model for abstract themes. Every document has several topics, and together they make up the document's theme. IS NOT: keyword usage, TF*IDF, or co-occurrence.
  4. So… What is it? Example document: PubCon Vegas. Topic "Marketing": SEO, SEM, Networking, Analytics, Affiliates. Topic "Vegas": Drinking, Gambling, Convention Center, Money, Late Nights.
  5. How it works… Create a topical model of the English language using a dictionary-restricted sample of 1,000,000 random Wikipedia articles. Accept a keyword and build an ideal document based on content that ranks for that term. Accept content and compare it to the ideal model. Build a confidence score that your content is more closely related to the keyword than two randomly selected Wikipedia articles are to one another.
  6. seoMoz deserves a bunch of credit! Ben Hendrickson … was a Senior Scientist at seoMoz, researched this, and is now a Software Development Engineer at Google … just a coincidence? They built a model and gave us an LDA scoring tool. What they got right … high LDA scores do correlate with rankings.
  7. What happened? Data reported wrong … reported .32 vs .17; suddenly the 'buzz' is about an error as opposed to this amazing discovery. Experimental validation … did not run experiments to determine the method through which this impacts rankings or traffic (keyword breadth / long tail being the primary vector). Did not refine the model based on experiments. seoMoz had too many other awesome projects happening.
  8. What did we do about it? Built a tool using collocation to increase LDA scoring. Spent 1.5 years in R&D fleshing out our own LDA model 4x. Did an organic traffic study to try to find causation: 1. nTopic-modified content saw an organic traffic lift of 17.5%, random-keyword-modified content saw a 10% drop, and unmodified content saw a drop in traffic of 15%. 2. We now know unequivocally that improving topical relevancy can increase organic traffic.
  9. www.Virante.org @JakeBohall
  10. What this doesn't mean: This does not prove that LDA or topic modeling is used in Google's algorithm. We cannot determine the exact mechanism by which inserting nTopic-recommended terms increases Google traffic. BUT … we can provide evidence that nTopic-recommended terms do increase Google traffic. … http://www.ntopic.org/causal-study.php
  11. Virante, Inc. http://www.virante.org 1-800-650-0820 @virante. Jake Bohall jbohall@virante.com 919-459-2834 @jakebohall http://www.thegooglecache.com/ http://www.virante.org/blog/ http://www.ntopic.org
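
The "How it works" steps on slide 5 can be sketched roughly as follows. This is not Virante's nTopic implementation. The slides do not say how content is compared to the ideal document, so the Hellinger-based similarity, the function names, and the hand-written topic mixtures below are all assumptions made for illustration. In a real pipeline the topic mixtures would come from an LDA model trained on the Wikipedia sample, and the background set would be real articles instead of random vectors.

```python
import math
import random
from itertools import combinations


def similarity(p, q):
    """1 minus the Hellinger distance between two topic mixtures (1.0 = identical)."""
    overlap = sum(math.sqrt(a * b) for a, b in zip(p, q))
    return 1.0 - math.sqrt(max(0.0, 1.0 - overlap))


def random_mixture(rng, n_topics):
    """Hypothetical stand-in for the topic mixture of a random Wikipedia article."""
    weights = [rng.expovariate(1.0) for _ in range(n_topics)]
    total = sum(weights)
    return [w / total for w in weights]


def confidence_score(content, ideal, background):
    """Share of background article pairs that are less related to each other
    than the content is to the ideal document."""
    target = similarity(content, ideal)
    pairs = [similarity(a, b) for a, b in combinations(background, 2)]
    return sum(s < target for s in pairs) / len(pairs)


# Assumed data: 50 random "articles" over 5 topics as the background sample.
rng = random.Random(0)
background = [random_mixture(rng, 5) for _ in range(50)]

# Assumed topic mixtures for the ideal document and two candidate pages.
ideal = [0.7, 0.2, 0.1, 0.0, 0.0]
on_topic = [0.6, 0.3, 0.1, 0.0, 0.0]
off_topic = [0.0, 0.0, 0.1, 0.2, 0.7]

print("on-topic :", confidence_score(on_topic, ideal, background))
print("off-topic:", confidence_score(off_topic, ideal, background))
```

The score here is an empirical percentile against the background pairs, so it lies between 0 and 1 and its resolution depends on how many background articles are sampled.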