Google knows about 1 trillion non-duplicate URLs (http://googleblog.blogspot.com/2008/07/we-knew-web-was-big.html)
Google’s index: 40B (4% of what they know exists!)
Cuil’s index: 124B (anecdotally: 3x Google’s)
Crawler limitations?
Search company editorially limited (spock.com) [preventing index jamming]
Publisher TOS limited (Craigslist)
Crawler technically limited (form-fronted deep web, AJAX-fronted web)
Fresh content, fast-changing sites (Digg, Twitter, etc.) [not just special 'news' sites -- the whole web is moving real-time!]
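The index-size figures on this slide can be sanity-checked with a few lines of arithmetic. The sketch below is only illustrative: the constants are the numbers quoted on the slide (not independently verified), and the names `KNOWN_URLS`, `GOOGLE_INDEX`, `CUIL_INDEX`, and `coverage` are made up for this example.

```python
# Figures as quoted on the slide (2008-2009 era claims, not verified here).
KNOWN_URLS = 1_000_000_000_000   # ~1 trillion non-duplicate URLs known to Google
GOOGLE_INDEX = 40_000_000_000    # ~40B pages in Google's index
CUIL_INDEX = 124_000_000_000     # ~124B pages in Cuil's index


def coverage(indexed: int, known: int) -> float:
    """Return the fraction of known URLs that made it into an index."""
    return indexed / known


google_coverage = coverage(GOOGLE_INDEX, KNOWN_URLS)
cuil_coverage = coverage(CUIL_INDEX, KNOWN_URLS)
cuil_vs_google = CUIL_INDEX / GOOGLE_INDEX

print(f"Google indexes {google_coverage:.1%} of known URLs")
print(f"Cuil indexes {cuil_coverage:.1%} of known URLs")
print(f"Cuil's index is {cuil_vs_google:.1f}x the size of Google's")
```

Under these figures the 4% coverage and the roughly 3x ratio stated on the slide are consistent with each other.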
Real-Time Search Panel at OMMA Global 2009 - Presentation Transcript
Tom Daly, Group Manager, Strategy & Planning, The Coca-Cola Company - @travelingparent
Rob Garner, Search Strategy Director, iCrossing - @robgarner
Jonathan Mendez, Founder & CEO, RAMP Digital – @jonathanmendez
Tobias Peggs, General Manager, OneRiot - @tobiaspeggs
Stephanie Sarka, Strategic Advisor, Wowd - @wowd
Moderator: David Berkowitz, Senior Director of Emerging Media & Innovation, 360i - @dberkowitz
Read More at http://bit.ly/RTSpanel
Wowd indexes the whole web, in real time
All human-accessible web pages
All crawler-accessible web pages
All legitimately indexable web pages
G, Y!, B
These slides accompany the OMMA Global panel on Real-Time Search in September 2009. More info can be found at http://bit.ly/RTSpanel. Panelists include: Tom Daly, Group Manager, Strategy & Planning, The Coca-Cola Company - @travelingparent; Rob Garner, Search Strategy Director, iCrossing - @robgarner; Jonathan Mendez, Founder & CEO, RAMP Digital – @jonathanmendez; Tobias Peggs, General Manager, OneRiot - @tobiaspeggs; Stephanie Sarka, Strategic Advisor, Wowd - @stephaniesarka. Moderator: David Berkowitz, Senior Director of Emerging Media & Innovation, 360i - @dberkowitz