URLs and Domains (SMX East 2008)

The top six common issues Microsoft Live Search sees in the URLs and domains of websites.

Published in: Technology, Design

    1. URLs and Domains. Nathan Buggia, Live Search Webmaster Center. Oct 7th, 2008
    2. What’s a URL (and where search engines get stuck)
       • http://auto.msn.co.uk/autos/default.aspx?id=AA#found
       • Parts, in order: Protocol, Hostname, Path, Query, Fragment
       • Within the hostname auto.msn.co.uk: Subdomain, TLD (ccTLD)
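As a rough illustration of the breakdown on this slide, the sketch below splits the example URL with Python's standard `urllib.parse`. The helper name `split_url` and the dictionary keys are made up for this example. Only the last hostname label is pulled out as the country-code TLD; separating the subdomain from the registered domain reliably would need a public suffix list, which is not attempted here.

```python
from urllib.parse import urlsplit


def split_url(url):
    """Return the URL parts named on the slide (hypothetical helper)."""
    parts = urlsplit(url)
    hostname = parts.hostname or ""
    return {
        "protocol": parts.scheme,
        "hostname": hostname,
        "path": parts.path,
        "query": parts.query,
        "fragment": parts.fragment,
        # Last dot-separated label only; a naive stand-in for real TLD handling.
        "cc_tld": hostname.rsplit(".", 1)[-1],
    }


if __name__ == "__main__":
    for name, value in split_url(
        "http://auto.msn.co.uk/autos/default.aspx?id=AA#found"
    ).items():
        print(f"{name}: {value}")
```

Note that the fragment (`found`) is handled by the browser and is not sent to the server in the HTTP request.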
    3. HTTP Status Codes
       • 200 – Everything’s okay (make sure you don’t return this code on a “Page not Found”!)
       • 404 – File not found
       • 301 – File has been moved
       • 302 – File is temporarily somewhere else
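One way to picture these four codes is a tiny routing function that decides which status to send for a requested path. This is only a sketch: the page tables (`PAGES`, `MOVED_PERMANENTLY`, `MOVED_TEMPORARILY`) and the function name `respond` are invented for illustration and do not come from the talk. The point it tries to show is that an unknown path falls through to 404 rather than returning 200 with a "Page not Found" message.

```python
from http import HTTPStatus

# Hypothetical site content and redirect tables, for illustration only.
PAGES = {"/": "home page", "/about": "about page"}
MOVED_PERMANENTLY = {"/old-about": "/about"}
MOVED_TEMPORARILY = {"/sale": "/holiday-sale"}


def respond(path):
    """Return (status, headers) for a requested path."""
    if path in PAGES:
        return HTTPStatus.OK, {}
    if path in MOVED_PERMANENTLY:
        return HTTPStatus.MOVED_PERMANENTLY, {"Location": MOVED_PERMANENTLY[path]}
    if path in MOVED_TEMPORARILY:
        return HTTPStatus.FOUND, {"Location": MOVED_TEMPORARILY[path]}
    # Unknown paths must not return 200 with an error page body.
    return HTTPStatus.NOT_FOUND, {}


if __name__ == "__main__":
    for path in ("/", "/old-about", "/sale", "/no-such-page"):
        status, headers = respond(path)
        print(path, int(status), status.phrase, headers)
```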
    4. Robots Exclusion Protocol – Common Mistakes
       • microsoft.com/robots.txt does not apply to technet.microsoft.com
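The scoping rule on this slide can be sketched as follows: a crawler looks for robots.txt at the root of the exact host it is fetching from, so each subdomain needs its own file. The helper name `robots_url` and the sample page paths are assumptions made for this example.

```python
from urllib.parse import urlsplit


def robots_url(page_url):
    """Return the robots.txt URL that governs page_url (hypothetical helper)."""
    parts = urlsplit(page_url)
    # robots.txt is looked up per protocol and host, always at the root path.
    return f"{parts.scheme}://{parts.netloc}/robots.txt"


if __name__ == "__main__":
    # Sample paths are made up; only the hostnames come from the slide.
    print(robots_url("http://microsoft.com/some/page"))
    print(robots_url("http://technet.microsoft.com/some/page"))
```

The two calls produce different robots.txt locations, which is why rules placed in microsoft.com/robots.txt have no effect on technet.microsoft.com.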
    5. Robots Exclusion Protocol
       • http://janeandrobot.com/post/Managing-Robots-Access-To-Your-Website.aspx
    6. Parameter Tracking
       • Example: http://mysite.com/?from=PROMO_1
       • Trap the request
       • Create a cookie with from=PROMO_1
       • Set the Cache-Control: no-cache response header
       • Do a 301 redirect to http://mysite.com
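The steps on this slide could be sketched roughly as below. This is a framework-free illustration, not the speaker's implementation: `handle_tracking` is a hypothetical helper that takes the requested URL and returns the status and headers a server would send, and the cookie attributes (`Path=/`) are an assumption.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

TRACKING_PARAM = "from"  # parameter name taken from the slide's example URL


def handle_tracking(url):
    """Return (status, headers) for a tracked URL, or None if nothing to do."""
    parts = urlsplit(url)
    query = parse_qsl(parts.query, keep_blank_values=True)
    values = [v for k, v in query if k == TRACKING_PARAM]
    if not values:
        return None  # no tracking parameter: serve the page normally

    # Rebuild the URL without the tracking parameter.
    remaining = [(k, v) for k, v in query if k != TRACKING_PARAM]
    clean_url = urlunsplit(
        (parts.scheme, parts.netloc, parts.path, urlencode(remaining), "")
    )
    headers = [
        # Real code should validate the value before echoing it into a cookie.
        ("Set-Cookie", f"{TRACKING_PARAM}={values[0]}; Path=/"),
        ("Cache-Control", "no-cache"),
        ("Location", clean_url),
    ]
    return 301, headers


if __name__ == "__main__":
    print(handle_tracking("http://mysite.com/?from=PROMO_1"))
```

The campaign value moves into the cookie, so search engines only ever see and index the clean URL.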
    7. Duplicate Content
       • When there is more than one URL for the same content
       • http://oreilly.com
       • http://oreilly.com/index.csp
       • http://www.oreilly.com
       • http://www.oreilly.com/index.csp
       • https://oreilly.com
       • https://oreilly.com/index.csp
       Create a few simple rules that will remove duplicate URLs by 301 redirecting all variations to the shortest, most authoritative URL. Often called “Domain Canonicalization”.
       http://janeandrobot.com/post/canonical-url-canonicalization-domain.aspx
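A minimal sketch of such rules, assuming the canonical form is plain `http`, no `www.` prefix, and no default document name. Those three choices, the constant names, and the function `canonical_url` are assumptions for this example; a real site would pick its own canonical form and issue the 301 from the web server.

```python
from urllib.parse import urlsplit, urlunsplit

CANONICAL_SCHEME = "http"           # assumed canonical protocol
DEFAULT_DOCUMENTS = ("index.csp",)  # default page name from the slide's URLs

VARIANTS = [
    "http://oreilly.com",
    "http://oreilly.com/index.csp",
    "http://www.oreilly.com",
    "http://www.oreilly.com/index.csp",
    "https://oreilly.com",
    "https://oreilly.com/index.csp",
]


def canonical_url(url):
    """Map a URL variation to the single URL it should 301 redirect to."""
    parts = urlsplit(url)
    host = (parts.hostname or "").lower()  # note: any explicit port is dropped
    if host.startswith("www."):
        host = host[len("www."):]
    path = parts.path or "/"
    for doc in DEFAULT_DOCUMENTS:
        if path.endswith("/" + doc):
            path = path[: -len(doc)]  # keep the trailing slash
    return urlunsplit((CANONICAL_SCHEME, host, path, parts.query, ""))


if __name__ == "__main__":
    for variant in VARIANTS:
        print(variant, "->", canonical_url(variant))
```

All six variations from the slide collapse to one target, so a crawler that follows the redirects ends up with a single URL for the content.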
    8. Duplicate Content – Gateway Pages
       • When a log-in page or region-select page is served at every URL unless the visitor already has a cookie
       Because search engines don’t support cookies, they may see every URL on your site as having the same content.
    9. Sitemaps
    10. Great Tools From Search Engines
        • Live Search (webmaster.live.com)
            • Crawl Issues (404s, Too many parameters, REP, Bad ContentType)
            • Rank Info (PageRank, DomainRank)
            • Backlinks/Outbound links
        • Google (google.com/webmaster)
            • Crawl Issues (404s, REP, Timeout, Unreachable)
            • Comprehensive Link Explorer (outbound, inbound links)
            • Set WWW vs. Non-WWW
        • Yahoo (SiteExplorer.search.yahoo.com)
            • Feedback on URL Parameters
            • Backlinks/Outbound links
    11. Issues Encountered by URL
    12. webmaster.live.com
