Information Architecture for SEO - Presentation Transcript
INTERACTIVE STRATEGY - 55, AV. MONT-ROYAL O., SUITE 999, MONTRÉAL (QC) H2T 2S6 T 514.524.7149 NVISOLUTIONS.COM Search Engine Optimization: Indexation and Link Juice
Indexation and Link Juice
The Holistic SEO Recipe:
One Part Marketing
One Part Editorial
One Part Webmaster
One Part SEO education
Don’t expect to find it in one person! Put together your best team worker from each group and have them ALL learn SEO then co-ordinate on its implementation. INTERACTIVE STRATEGY – NVISOLUTIONS.COM
Marketing :: Keywords to Fuel Site Expansion INTERACTIVE STRATEGY – NVISOLUTIONS.COM
Editorial :: Page and Link Placement INTERACTIVE STRATEGY – NVISOLUTIONS.COM
EVERYBODY NEEDS TO GET THE SEO BASICS AND SEE THE SEO BIG PICTURE
What is it they all need to ‘get’?
That search engines ‘crawl’ sites by following HTML and other links
That the quality and quantity of links pointing to a page = Link Popularity
Pages Need Link Popularity (or juice) to get indexed and rank
YOU get to control which pages on your site get indexed , and which ones get link juice .
A page being indexed by search engines is separate from that page’s ability to accumulate or pass on link juice.
INTERACTIVE STRATEGY – NVISOLUTIONS.COM
Link Juice and Indexation Cheat Sheet: INTERACTIVE STRATEGY – NVISOLUTIONS.COM Tag/Command Indexation Link Juice Robots.txt (file in root of site) EG: Disallow: /news/pdf-copies/ Stops pages or directories from appearing in Search Engine indexes – (except ‘uncrawled references’ ) Pages ‘blocked’ by robots.txt can still accumulate and pass link-juice Block what you don’t want indexed: Session IDs / dupe URLs: Disallow: *partner=* Entire directories: Disallow: /news/pdf-copies/ Internal SERPs: Disallow: *car-search-query=* External affiliate links: Disallow: *GO.cgi* If excluded pages have external links, expect 'URL-only' listings: No title, snippet, size or cache. For no listing at all, allow bots & use Meta noindex On-Page Meta No-Index (<head> of page) EG: <meta name=“robots” content=“noindex”> Stops the page from appearing in Search Engine indexes entirely The page can still accumulate and pass link-juice You may want to use Meta Noindex if: you can’t alter your robots.txt, or if robots.txt standard is not flexible enough, or if you don’t want URL listings
Link Juice and Indexation Cheat Sheet: INTERACTIVE STRATEGY – NVISOLUTIONS.COM Rel=nofollow (in an <a href> link) EG: <a href=“http://non-trusted-site.com” rel=“nofollow”> Stops spiders from following a specific link. They don’t crawl or discover through nofollow links. Stops Link Juice from flowing through a specific link Can be used from one domain to another when a link does not imply trust. Lots of controversy recently on its use within a domain to ‘sculpt PR flow’ 301 Redirection (many types of implementation) EG: redirect /old/page.php /new_page.php [301, permanent] Spiders follow redirect and discover new pages Search Engines transfer link juice from old pages to new pages If any URLs change, this is the best way to shift link juice from old to new. 301s are the only way to transfer link juice from one domain to another. Tag/Command Indexation Link Juice On-Page Meta No-follow (<head> of page) EG: <meta name=“robots” content=“nofollow”> Stops spiders from following the links on the page (which may still get indexed via other links) The pages can still accumulate link-juice (and rank), but can’t pass it on You may wish to nofollow an entire page like a list of paid sponsors.
Link Juice and Indexation Cheat Sheet: INTERACTIVE STRATEGY – NVISOLUTIONS.COM Tag/Command Indexation Link Juice Canonicalization tag (<head> of page) EG: <canonical = “/proper/product/page.php“> Spiders go to referred page like a 301 redirect . Does not work across domains. Search Engines transfer link juice from variation pages to real page
A new approach to both indexation control and link juice control
Supported by The Big Three: Google, Yahoo, MSN
May be cheaper than redoing your entire site from scratch, for now
May see faster results than redoing your entire site
May become a maintenance nightmare
Can turn out to be as or more complex than doing it right from scratch
Javascript Link EG: <div onclick="document.location.href='http://www.domain.com/'"> Google tries to crawl and index if URL is easily to access – in onclick or href If crawlable, Google will try to pass link juice
Before rel=nofollow, many SEOs used uncrawlable JS links for sculpting
May not carry as much weight, and should not be used as main navigation
Two Free Tools
Google Webmaster Central:
Identify Crawl problems (spider data over time!)
Find duplicate titles and meta descriptions
Quickly identify 404 issues
List your pages by internal or external links
Manage and find errors in sitemaps
Test your robots.txt file against specific URLs
Basic domain canonicalization (www to non-www)
Xenu Link Sleuth:
Find broken links (sort by status)
Find duplicate title tags (sort by title)
Find heavy pages (sort by size)
Find pages too many click from home (sort by level)
Find pages with too few internal links (sort by In links)
Find images without ALT text (sort by type, scan title field)
Test non-canonical URLs (from a text file) and view status
Find outgoing links to broken pages or expired content
Find bot traps (like open ended calendars)
INTERACTIVE STRATEGY – NVISOLUTIONS.COM
INTERACTIVE STRATEGY – NVISOLUTIONS.COM
Dos and Don’ts
Do:
Define your IA and determine canonical URLs for hub pages across all major categories, with expansion ability
Use breadcrumb style navigation – Put all new content in it! (EG: Home > Kitchen > Major Appliances > Stoves )
Include relevant category-specific navigation at each level
Make interlinking mandatory! I nclude in-content links to similar pages around the site, and give links from other pages
Keep updated HTML and XML sitemaps for all new content
Learn all the ways to control indexation and link juice flow
Don’t:
Let the same content appear on more than one URL
Just throw content up without linking to it, or linking from it
Spread your link juice thin over pages don’t have unique content
Leave open ended page scripts like calendars
Don’t archive poorly without respect to your IA
Return server headers other than 404 for error pages
Think you can fix link juice distribution issues with robots.txt
INTERACTIVE STRATEGY – NVISOLUTIONS.COM
Thank You! Keep up with NVI: Website - NVIsolutions.com NVI Blog EN - NVIsolutions.com/blog NVI Blog FR - GO-Referencement.org E-mail me: nosborne@nvisolutions.com Follow me on Twitter @NaoiseOsborne INTERACTIVE STRATEGY – NVISOLUTIONS.COM
A look at the Information Architecture for SEO. A p more
A look at the Information Architecture for SEO. A presention given by Naoise Osborne (NVI) at Search Engine Strategies, SES Toronto in June 9th 2009. less
0 comments
Post a comment