• Like
  • Save
Forgot about LDA by Jake Bohall of Virante, Inc. - A look at nTopic and content relevancy
Upcoming SlideShare
Loading in...5
×
 

Forgot about LDA by Jake Bohall of Virante, Inc. - A look at nTopic and content relevancy

on

  • 340 views

A Simple On-Site Method for Increasing Search Traffic using Latent Direchlet Allocation and nTopic content relevancy scoring.

A Simple On-Site Method for Increasing Search Traffic using Latent Direchlet Allocation and nTopic content relevancy scoring.

Statistics

Views

Total Views
340
Views on SlideShare
332
Embed Views
8

Actions

Likes
1
Downloads
2
Comments
0

1 Embed 8

http://www.linkedin.com 8

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

    Forgot about LDA by Jake Bohall of Virante, Inc. - A look at nTopic and content relevancy Forgot about LDA by Jake Bohall of Virante, Inc. - A look at nTopic and content relevancy Presentation Transcript

    • Did you forget LDA?A Simple On-Site Method forIncreasing Search Traffic! VIRANTE WEB MARKETING @JakeBohall
    • www.Virante.org@JakeBohall
    • So.. What is it? IS: Topical relevance Topic Models are algorithms used to uncover hidden thematic structures in a collection of data … statistical model for abstract themes. Every document has a several topics, and when used together create a documents theme. IS Not: Keyword usage, TF*IDF or co-occurance www.Virante.org @JakeBohall
    • So….... What is it? PubCon Vegas Marketing Vegas SEO Drinking SEM Gambling Networking Convention Center Analytics Money Affiliates Late Nights www.Virante.org @JakeBohall
    • How it works… Create a topical model of the English language using a dictionary restricted sample of 1,000,000 random Wikipedia articles Accept a keyword and build an ideal document based on content that ranks for that term Accept content and compare to an ideal model Build confidence score that these your content is related to the keyword more than two randomly selected Wikipedia articles are related to one another. www.Virante.org @JakeBohall
    • seoMoz deserves a bunch of credit! Ben Hendrickson was … a Senior Scientist at seoMoz researched this and is now a Software Development Engineer at Google … just a coincidence? They built a model and gave us LDA scoring tool What they got right …High LDA scores do correlate to rankings www.Virante.org @JakeBohall
    • What happened? Data reported wrong. … reported .32 vs .17, suddenly the ‘buzz’ is about an error as opposed to this amazing discovery Experimental Validation. …Did not run experiments to determine the method through which this impacts rankings or traffic (keyword breadth / long tail being the primary vector) Did not refine the model based on experiments seoMoz had too many other awesome projects happening www.Virante.org @JakeBohall
    • What did we do about it? Built a tool using collocation to increase LDA scoring Spent 1.5 years in R&D fleshing out our own LDA model 4x Did an organic traffic study to try and find causation 1. nTopic modified content saw an organic traffic lift of 17.5%, random keyword modified content saw a 10% drop and unmodified content saw drop in traffic of 15% 2. We now know unequivocally that improving topical relevancy can increase organic traffic. www.Virante.org @JakeBohall
    • www.Virante.org@JakeBohall
    • What this doesn’t mean This does not prove that LDA or topic modeling is used in Google’s algorithm We cannot determine the exact mechanism by which inserting nTopic recommended terms increase Google trafficBUT … We can provide evidence that nTopic recommended terms do increase Google traffic. … http://www.ntopic.org/causal-study.php www.Virante.org @JakeBohall
    • Virante, Inc. http://www.virante.org 1-800-650-0820 @virante Jake Bohall jbohall@virante.com 919-459-2834 @jakebohallhttp://www.thegooglecache.com/ http://www.virante.org/blog/ http://www.ntopic.org www.Virante.org