The number of devices connected to IP networkswill be nearly three times as high as the globalpopulation in 2016 22
kilo mega tera giga pita The Zettabyte Era exa zetta 9,444,732,965,739,290,427,392 bits (1024 exbibytes) yottahttp://www.cisco.com/en/US/solutions/collateral/ns341/ns525/ns537/ns705/ns827/VNI_Hyperconnectivity_WP.html 23
“Reports that say that something hasnthappened are always interesting to me, becauseas we know, there are known knowns; there arethings we know we know. We also know thereare known unknowns; that is to say we knowthere are some things we do not know. Butthere are also unknown unknowns – the oneswe dont know we dont know.” Donald Rumsfeld, US Defense Secretary at a press conference at NATO Headquarters, Brussels, Belgium, June 6, 2002 Image: planetization.org 24
Nicholas Carr worriesthat the flood of digitalinformation is changingnot only our habits, buteven our mentalcapacities: Forced to scanand skim to keep up, weare losing our abilities topay sustainedattention, reflectdeeply, or rememberwhat we’ve learned. 25
Information overload?http://blogs.tusc.k12.al.us/bhslibrary/files/2012/01/Information_overload.jpg 26
DO YOU KNOW WHAT ARE YOU LOOKING FOR? 27 http://www.teachersdiary.com/.a/6a0115703931fc970c0128765537ba970c-800wi
DO YOU KNOW WHERE TO FIND WHAT YOU WANT? http://www.flickr.com/photos/special/1597251/ 28
IS THERE A SUPERHERO WHO CAN HELP?http://www.flickr.com/photos/sumrow/1267682594/sizes/l/ 30
BUILD YOUR OWN SEARCH SERVICE Yes, you are the superhero
BOSS IS BUILD YOUR OWN SEARCH SERVICEhttp://developer.yahoo.com/search/boss/
BOSS PROVIDES APIS TO OUR SEARCH DATA STORES
TO BUILD YOUR OWN POWERFULSEARCH APPLICATIONS
BOSS allows you to search over Web, images, news & Blogs
You can even monetize yourapplications using Search Ads from BOSS and get support.
What can be done on top of BOSS?• Blend and re-rank search results• Your own look and feel• Mix it with other APIs
Free for building your hacks!!
Where do I start?
What’s in it?Restful XML and JSON API Web Image Spelling News Search Ads http//www.flickr.com/photos/joeshlabotnik/419914250/sizes/o/in/photostream/.jpg
Oauth based Autenticationhttp//www.flickr.com/photos/friarsbalsam/5736126308/sizes/o/in/photostream/.jpg
What else do I get? Web and Limited Web results Image attributes like height, width, etc Time span filtering for News Search Document type filtering Extended abstracts http//www.flickr.com/photos/acidpix/6021203584/sizes/o/in/photostream/.jpg
BOSS + YQL• Table Name: boss.search Example Parameters Consumer Key ck - Consumer Secret secret - Query Term q ‘iitd’• e.g. select * from boss.search where ck=… and secret=… and q=‘openhackindia’
Searching “The Dark Knight”
Finding images of “The Dark Knight Rises”select * from boss.search where q="The Dark Knight Rises" and service="images" and ck="..." and secret="..."
Finding “The Dark Knight Rises” in IMDB, movies.yahoo.comselect * from boss.search where q="The Dark Knight Rises" and sites="imdb.com,movies.yahoo.com" and ck="..." and secret="..."
Spell Check and Correctionselect * from boss.search where q="The Dark Knight Rises" and service="spelling" and ck="..." and secret="..."
Finding news on “The Dark Knight Rises”select * from boss.search where q="The DarkKnight Rises" and service="news" and ck="..." and secret="..."
And through the BOSS API Getting multiple data sets /ysearch/web,images,news?q=anna /ysearch/web,images,news?web.q=anna&images.q=anna&news.q=lokpal Searching through sites A Simple Movie Search /ysearch/web?q=“Dark Knight”& sites=movies.yahoo.com,netflix.com,imdb.com AND/OR operators /ysearch/web?q="steve jobs"AND((ipad)OR(iphone))&sites=bestbuy.com,newegg.com Important: Use Braces or quotes
Unary Operators Search for Batman but not “Dark Knight” q=(batman -“Dark Knight") Find pages with “Heath Ledger” but not “Dark Knight” q=+”heath ledger”–”Dark Knight”&sites=movies.yahoo.com Force auto-spelling off q=+”drk knight”
Searching in body and in title Searching for Dark Knight in the Title on Yahoo movies q=reviews intitle:"dark knight"&sites=movies.yahoo.com Searching for Dark Knight in the Title in Yahoo movies containing Christian Bale q=reviews intitle:"dark knight" inbody:"christian bale"&sites=movies.yahoo.com
Market and document specific Filters Search for “Dark Knight” in India specific sites q=“Dark Knight”&market=en-in Search for “PDF’s containing “Dark Knight” q=“Dark Knight”&type=pdf Search for MS Office type (except PPT’s) containing “Dark Knight” q=“Dark Knight”&type=msoffice,-ppt
Image search parameters Search for images that are not offensive /ysearch/images?q=“san francisco”&filter=yes Search for images that are wallpaper size /ysearch/images?q=“san francisco”&dimensions=wallpaper Search for a image at a certain refer URL /ysearch/images?q=yahoo&refererurl=http://www.flickr.com• Interesting Output Fields format, file size, height, width, title, total result count
News search parameters Search news that is less than 7 days old /ysearch/news?q=lokpal&age=7dSearch news that is between 20hrs and 2 days old /ysearch/news?q=lokpal&age=20h2dRe-rank news results by date /ysearch/news?q=lokpal&ranking=trueInteresting Output Fields Source, Date, Source URL
I did BOSS and got data, now how to extract information of out it?
make sense out of it?
Content Analysisselect * from contentanalysis.analyze where text="Yahoo! kicks off hackday”
Content Analysis from a URLselect * from contentanalysis.analyze where url="http://www.cnn.com/"
Term Exractionselect * from search.termextract where context in (select description from rss where url=‘’)
More resources Yahoo! BOSS: http://developer.yahoo.com/boss BOSS Technical Documentation: http://developer.yahoo.com/search/boss/boss_api_guide/ YQL: http://developer.yahoo.com/yql Amazon Web Services: http://aws.amazon.com oAuth: http://oauth.net/ Open Data: http://theinfo.org Alt Search Engines: http://www.altsearchengines.com/