MULTIMEDIA and COMMUNICATIONS
Computer Science CS1033a/b
Marketing the Website
Search Engines
A little History
Instructors: Laura Reid (section 001), Vivi Tryphonopoulos (section 002)
A Vision of Students Today
Publishing your Website
Stages of creating a Website: Publish
Once you have created a website on your hard drive, you need to get it up onto the Web. This is called “uploading”, “publishing”, or “ftp’ing”.
With FTP software you can delete, rename, move, and copy files on a server.
What do you need?
Downloading: the process of receiving a program, document, or file via a network from another computer. Direction: remote site (server) to local computer.
Uploading: the reverse. Direction: local computer to remote site (server, such as an Internet provider's host).
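Uploading can also be scripted instead of done by hand in an FTP program. The sketch below uses Python's standard `ftplib`; the function name `upload_site`, the host name, the account, and the folder name are all hypothetical placeholders, and it only handles files sitting directly in one folder.

```python
from pathlib import Path


def upload_site(ftp, local_dir):
    """Upload every file directly inside local_dir to the server's current folder.

    `ftp` is expected to be a connected, logged-in ftplib.FTP object.
    Returns the list of uploaded file names.
    """
    uploaded = []
    for path in sorted(Path(local_dir).iterdir()):
        if path.is_file():
            with path.open("rb") as fh:
                # STOR copies a local file up to the remote site (server)
                ftp.storbinary(f"STOR {path.name}", fh)
            uploaded.append(path.name)
    return uploaded


# Typical use (hypothetical host and account, not run here):
#
#   from ftplib import FTP
#   with FTP("ftp.example.com") as ftp:
#       ftp.login(user="username", passwd="password")
#       upload_site(ftp, "my_website")
#       ftp.rename("old.html", "new.html")   # rename a file on the server
#       ftp.delete("unused.html")            # delete a file on the server
```

Downloading is the same idea in the other direction: `ftplib` offers `retrbinary` for fetching a file from the server to the local computer.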
Search engines search a database of information about the Internet
Uses spiders (web crawlers) to gather information about websites; indexes sites, scores pages, and puts the information into a database
Publisher registers the site in the database, or waits for a spider
Google, AltaVista, Lycos, HotBot
Directories are human-selected (hand-picked) Internet resources, arranged and classified in hierarchical topics.
Human editors review web pages, rank them, and organize them into a categorized list with brief descriptions
Wait for human editors
Open Directory is 1% the size of Google
Yahoo, Open Directory, AOL, About.com
______________________ is web software that constantly searches for new Web pages and follows any links
Database: addresses, page titles, significant words, topics
____________ the database, NOT the Internet sites
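The "follows any links" step can be sketched in a few lines. This is only an illustration of pulling link addresses out of one page with Python's standard `html.parser`; the class and function names and the example URLs are made up, and real crawlers do far more (fetching pages, politeness rules, storing page titles and significant words).

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collects the address of every <a href="..."> link on a page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    # Turn relative links into full addresses
                    self.links.append(urljoin(self.base_url, value))


def extract_links(html, base_url):
    collector = LinkCollector(base_url)
    collector.feed(html)
    return collector.links


page = '<a href="/about.html">About</a> <a href="http://example.org/">Other</a>'
print(extract_links(page, "http://example.com/index.html"))
```

Each address found this way would be queued as another page to visit, which is how new pages end up in the database.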
Internet search engine which ______________________________
Uses the searches of other engines (and tells you where each result came from)
Returns the “top” results
Doesn’t create its own database
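A rough sketch of that idea: pass the query to several other engines, keep their top results, and record which engine each result came from. The engines here are stand-in functions with made-up results, and ordering by best position is only an assumption for the example, not how any particular service works.

```python
def metasearch(query, engines, top_n=3):
    """engines maps an engine name to a function(query) -> ranked list of URLs."""
    merged = {}
    for name, search in engines.items():
        for rank, url in enumerate(search(query)[:top_n], start=1):
            entry = merged.setdefault(
                url, {"url": url, "sources": [], "best_rank": rank}
            )
            entry["sources"].append(name)  # remember where the result came from
            entry["best_rank"] = min(entry["best_rank"], rank)
    # No database of its own: the output is only the merged results of others
    return sorted(merged.values(), key=lambda e: (e["best_rank"], e["url"]))


# Hypothetical stand-ins for real search engines
engines = {
    "EngineA": lambda q: ["a.example/1", "shared.example", "a.example/3"],
    "EngineB": lambda q: ["shared.example", "b.example/2"],
}

for result in metasearch("civil war", engines, top_n=2):
    print(result["url"], "from", ", ".join(result["sources"]))
```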
Top 10 Search Providers http://www.marketingcharts.com/wp/wp-content/uploads/2008/06/nielsen-top-10-search-engines-share-of-searches-april-2008.jpg
Top 10 Search Terms
Top 10 Search Terms in 10 Categories, August 2008
http://searchenginewatch.com/showPage.html?page=3631004
http://searchenginewatch.com/showPage.html?page=3630718
How does Google work?
STEP 1: Fetch pages. Crawl and index the billions of pages of the World Wide Web. This job is performed by Googlebot, a "spider" which connects to web servers around the world to fetch documents. The spider gives each retrieved page a number so it can refer to the pages it fetched.
STEP 2: Build an index. List every document that contains a certain word. For example, the word "civil" might occur in documents 3, 8, 22, 56, 68, and 92, while the word "war" might occur in documents 2, 8, 15, 22, 68, and 77.
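The index in STEP 2 can be tried out on a small scale. The sketch below is a toy version, assuming plain lower-cased words and three made-up documents; the function names are illustrative. It also reuses the document numbers from the "civil" and "war" example to show which documents contain both words.

```python
from collections import defaultdict


def build_index(documents):
    """Map each word to the set of document numbers that contain it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for word in text.lower().split():
            index[word].add(doc_id)
    return index


def search_all(index, query):
    """Return the documents that contain every word in the query."""
    postings = [index.get(word, set()) for word in query.lower().split()]
    return sorted(set.intersection(*postings)) if postings else []


# Toy documents (made up for the example)
docs = {1: "The civil war began", 2: "Civil engineering basics", 3: "War and peace"}
print(search_all(build_index(docs), "civil war"))

# The document numbers from the example above
slide_index = {"civil": {3, 8, 22, 56, 68, 92}, "war": {2, 8, 15, 22, 68, 77}}
print(search_all(slide_index, "civil war"))
```

With the example numbers, only documents 8, 22, and 68 appear in both lists, so those are the candidates for the query civil war.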
An exercise for students: Understand why a search engine returns certain results over others.
Pretend that you're a search engine. Pick a query like civil war or recycling or whatever you want. Search for the phrase on Google, pick three or four pages from the results, and print them out.
On each printout, find the individual words from your query (such as "civil" and "war") and use a highlighter to mark each word with color. Do that for each of the documents that you printed out.
Now tape those documents on a wall, step back a few feet, and squint your eyes.
Which document do you think would be most relevant?
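The highlighter exercise can be imitated in code by counting how often the query words appear in each document. This is a deliberately crude sketch with made-up page text and illustrative function names; real search engines weigh many more signals than a raw count.

```python
import re
from collections import Counter


def highlight_score(text, query):
    """Count how many times the query words appear (the 'amount of highlighter')."""
    words = Counter(re.findall(r"[a-z0-9']+", text.lower()))
    return sum(words[term] for term in query.lower().split())


def rank_documents(documents, query):
    """Order document names from most to least highlighted."""
    return sorted(
        documents,
        key=lambda name: highlight_score(documents[name], query),
        reverse=True,
    )


# Made-up printouts
printouts = {
    "page_a": "The Civil War was a war fought in the United States.",
    "page_b": "Civil engineering is a branch of engineering.",
    "page_c": "Recycling tips for your home.",
}
print(rank_documents(printouts, "civil war"))
```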
The page title is the heading people will see in the search engines, and it is what will make them click on your link or not.
<title>MIT ILP - Industry Liaison Program</title>
IMPORTANT _________ Property
Title attribute and every webpage
MIT ILP - Industry Liaison Program (not “Homepage”)
MIT ILP – Industry at ILP
MIT ILP – About the ILP
MIT ILP – ILP Services
http://ilp-www.mit.edu/display_page.a4d?key=H1
Here’s another one: http://www.thedancemovement.ca/
In Dreamweaver it is the ____________________
Allow time! With countless millions of pages on the World Wide Web, it may take 2-6 weeks for new sites or pages to get indexed in the database.
How does one submit? It differs for a search engine vs. a directory.
Submit directly to Directories: http://www.entheosweb.com/website_promotion/directory_submission.asp
Submit directly to Search Engines: http://www.entheosweb.com/website_promotion/directory_submission.asp
Robots are not used; a human editor reviews the submission
META tags, ALT image tags – DO NOT HELP WITH RANKING
Best to describe your site accurately, since an editor reviews your website and decides
MAKE SURE SITE IS COMPLETELY DONE BEFORE SUBMISSION
Website Statistics: Quick Sheet
http://www.suestudios.com/articles/article27.htm
Every provider provides different stats
TERM: DEFINITION
Hits: # of files sent to a user after a page request (includes graphic images)
Files: # files retrieved from a web site
Visits / Unique Visitors & Repeat Visitors: # of users to your site
Pages / Page Views: # of distinct html files or pages looked at on your website (stickiness)
Bandwidth (Kbytes) / Kilobytes: Total size of pages (or files) viewed by visitors
Entry Pages: A list ranking the most popular entry pages (the page in which a visitor enters your site)
Exit Pages: A list ranking the most popular exit pages (the last page your users visited prior to leaving your site)
Click Path: Order in which people visit the various pages of your site
Referrers: Web site where a visitor was just prior to reaching your site (filter out your own pages)
Direct Request: # of times a visitor accessed your pages by directly typing your URL in the address bar, by using a bookmark, or by following a link in an email message
Search String / Search Terms: keywords and/or keyword phrases that were used in searching for your website
User Agents / Browser: what kind of browser visitors are using (Explorer, Netscape, Mozilla, etc.)
Platform Usage: what operating system (Windows, Mac, Linux) or screen resolution visitors are using
Countries: # of visits from different countries
Robots / Spiders visitors: # of times a robot or spider (search engine) ran over a website to submit its pages to the search engine
Errors: errors recorded while users were visiting your site
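To see how a few of these terms differ, here is a small sketch that computes them from a list of requests. The log entries are invented, treating each address as one visitor is a simplifying assumption, and a "page" is taken to be any path ending in .html, .htm, or a slash; real statistics packages use their own rules.

```python
from collections import Counter

PAGE_SUFFIXES = (".html", ".htm", "/")


def summarize(requests):
    """requests is a list of (visitor, path) pairs in the order they arrived."""
    pages_by_visitor = {}
    for visitor, path in requests:
        if path.endswith(PAGE_SUFFIXES):
            pages_by_visitor.setdefault(visitor, []).append(path)
    return {
        # Hits: every file sent, including images and stylesheets
        "hits": len(requests),
        # Page views: only the HTML pages
        "page_views": sum(len(p) for p in pages_by_visitor.values()),
        # Unique visitors: assumption, one address = one visitor
        "unique_visitors": len({visitor for visitor, _ in requests}),
        # Entry / exit pages: first and last page each visitor looked at
        "entry_pages": Counter(p[0] for p in pages_by_visitor.values()),
        "exit_pages": Counter(p[-1] for p in pages_by_visitor.values()),
    }


# Invented log entries
log = [
    ("10.0.0.1", "/index.html"),
    ("10.0.0.1", "/logo.gif"),
    ("10.0.0.1", "/style.css"),
    ("10.0.0.2", "/index.html"),
    ("10.0.0.2", "/logo.gif"),
    ("10.0.0.1", "/about.html"),
]
print(summarize(log))
```

In this example there are more hits than page views, because every image and stylesheet counts as a hit while only the HTML pages count as page views.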