Build your Own Search Service



     Chris Heilmann
     Saurabh Sahni


HackU: IIT Bombay, 5th Feb 2009
Outline



   •  Search engines using BOSS
   •  About BOSS API
      –  What?
      –  Why?
      –  Features
   •  How to use it
      –  BOSS API
      –  BOSS Mashup framework




                                 -2-
Search engines
using BOSS



 -3-
hakia: http://hakia.com/




                           -4-
Cluuz: http://cluuz.com




                          -7-
Keyword finder - http://keywordfinder.org/




                                - 10 -
askBOSS: http://ask-boss.appspot.com/




                             - 11 -
About BOSS API



 - 16 -
What?




  •  Open Yahoo’s core search features via web services to
     let 3rd parties revolutionize Search


  •  Unrestricted



        http://developer.yahoo.com/search/boss



                             - 17 -
Opening the search technology stack




    [Diagram: the search technology stack, with stages Crawl, Extract, Analyze, Spam <-> Gold, Web Map, Usage, Index, Retrieve, Assist, Rank]

    50B pages * 20ms page download = 31 years


                                      - 18 -
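
The crawl-time figure on the slide can be checked with a few lines of arithmetic. This is a minimal sketch that assumes pages are downloaded strictly one after another and that a year has 365.25 days; the constant and function names are illustrative only.

```python
# Back-of-the-envelope check of the crawl-time figure from the slide.
PAGES = 50_000_000_000                    # 50B pages
SECONDS_PER_PAGE = 0.020                  # 20 ms per page download
SECONDS_PER_YEAR = 365.25 * 24 * 60 * 60  # assumption: average year length


def sequential_crawl_years(pages=PAGES, seconds_per_page=SECONDS_PER_PAGE):
    """Years needed to download every page one after another."""
    return pages * seconds_per_page / SECONDS_PER_YEAR


if __name__ == "__main__":
    # 50e9 pages * 0.02 s = 1e9 s, which is a little under 32 years.
    print(f"{sequential_crawl_years():.1f} years")
```

The result lands between 31 and 32 years, which matches the slide's point: a single machine cannot crawl the web in any useful time, so building the stack from scratch is a large investment.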
Opening the search technology stack

    [Diagram: the search technology stack (Crawl, Extract, Analyze, Spam <-> Gold, Web Map, Usage, Index, Retrieve, Assist, Rank) exposed through a Web API, with "Your App here" on top]

    50B pages * 20ms page download = 31 years


                                      - 19 -
Why?

  •  Removes entry barriers
       –  massive capital investment
       –  access to top technical talent

  •  Asset to Innovate
       –  Develop new relevance models
           • Leverage user insights
           • Use tags, bookmarks
       –  Change presentation style
  •  Search anywhere
       –  Improve Vertical Quality w/ Web comprehensiveness
       –  Fragment the market, foster more players, choice, competition

                                      - 20 -
BOSS API features



   •  Unlimited queries per day
   •  No branding or attribution
   •  No restrictions on presentation
   •  Ability to re-order results and blend in additional content
   •  Access to multiple verticals (web search, image, news)
   •  Spell checks, keyword suggestions
   •  40+ supported language and region pairs
   •  Ability to monetize




                                 - 21 -
How to use it?



- 22 -
Get Started



  •  Register for an application id
     http://developer.yahoo.com/wsregapp/

  •  Documentation
     http://developer.yahoo.com/search/boss/boss_guide/

  •  Code samples: JavaScript, PHP and Python
     http://www.saurabhsahni.com/boss-examples.zip




                             - 23 -
BOSS API



  Searching
  Slumdog
  Millionaire




                         (Source: http://en.wikipedia.org/wiki/File:Slumdog_Millionaire_poster.jpg)


                - 24 -
BOSS API



  •  Search for slumdog millionaire:
    –  http://boss.yahooapis.com/ysearch/web/v1/slumdog+millionaire?appid=xyz&format=xml




                        - 25 -
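
The request URL above can be assembled programmatically. The following is a minimal sketch using only the Python 3 standard library; the function name `web_search_url` is made up for this example, and `xyz` stands in for a real application id as on the slide.

```python
from urllib.parse import quote_plus, urlencode

BOSS_BASE = "http://boss.yahooapis.com/ysearch"


def web_search_url(query, appid, fmt="xml"):
    """Build a BOSS v1 web search URL; the query is part of the URL path."""
    path = f"{BOSS_BASE}/web/v1/{quote_plus(query)}"
    return f"{path}?{urlencode({'appid': appid, 'format': fmt})}"


print(web_search_url("slumdog millionaire", "xyz"))
# http://boss.yahooapis.com/ysearch/web/v1/slumdog+millionaire?appid=xyz&format=xml
```

Note that the search terms go into the path rather than into a query-string parameter, so they need to be URL-encoded before being appended.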
BOSS API: XML response
http://boss.yahooapis.com/ysearch/web/v1/slumdog+millionaire?appid=xyz&format=xml




                                       - 26 -
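
Once the XML comes back, the results can be pulled out with a standard XML parser. The sketch below is only an illustration: the sample document and its element names (`ysearchresponse`, `resultset_web`, `result`, `title`, `url`) are assumptions made for this example and should be checked against the BOSS guide. A real response may also declare an XML namespace, in which case the tag names would need to be qualified.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample response; element names are assumed, not taken from the slides.
SAMPLE = """<ysearchresponse responsecode="200">
  <resultset_web count="2" start="0">
    <result><title>Slumdog Millionaire</title><url>http://example.com/a</url></result>
    <result><title>Slumdog review</title><url>http://example.com/b</url></result>
  </resultset_web>
</ysearchresponse>"""


def extract_results(xml_text):
    """Return a list of (title, url) pairs, one per <result> element."""
    root = ET.fromstring(xml_text)
    return [(r.findtext("title"), r.findtext("url")) for r in root.iter("result")]


for title, url in extract_results(SAMPLE):
    print(title, url)
```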
BOSS API




  •  Exact search for “slumdog millionaire”
     –  http://boss.yahooapis.com/ysearch/web/v1/%22slumdog+millionaire%22?appid=xyz&format=xml




                               - 27 -
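
The `%22` in the exact-search URL is simply a URL-encoded double quote. A small sketch of how the phrase could be encoded in Python 3 follows; the helper name `exact_phrase` is hypothetical.

```python
from urllib.parse import quote_plus


def exact_phrase(phrase):
    """Wrap a phrase in double quotes and URL-encode it for use in the path."""
    return quote_plus(f'"{phrase}"')


print(exact_phrase("slumdog millionaire"))
# %22slumdog+millionaire%22
```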
BOSS API



  •  Search for slumdog millionaire only on indiatimes.com:
     –  Add site:indiatimes.com to your query
     –  http://boss.yahooapis.com/ysearch/web/v1/slumdog+millionaire+site%3Aindiatimes.com?appid=xyz&format=xml


  •  Search for slumdog millionaire on selected movie sites
     –  Add param sites=indiatimes.com,movies.yahoo.com,imdb.com
     –  http://boss.yahooapis.com/ysearch/web/v1/slumdog+millionaire?appid=xyz&sites=indiatimes.com%2Cmovies.yahoo.com&format=xml




                                   - 28 -
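
Both ways of restricting a search to particular sites can be sketched as follows. The function names are made up for this example; the `site:` operator and the `sites` parameter are the ones shown on the slide.

```python
from urllib.parse import quote_plus, urlencode

BASE = "http://boss.yahooapis.com/ysearch/web/v1/"


def single_site_url(query, site, appid):
    """Restrict the search to one site by adding site:<host> to the query itself."""
    full_query = f"{query} site:{site}"
    return BASE + quote_plus(full_query) + "?" + urlencode({"appid": appid, "format": "xml"})


def multi_site_url(query, sites, appid):
    """Restrict the search to several sites via the comma-separated sites parameter."""
    params = {"appid": appid, "sites": ",".join(sites), "format": "xml"}
    return BASE + quote_plus(query) + "?" + urlencode(params)


print(single_site_url("slumdog millionaire", "indiatimes.com", "xyz"))
print(multi_site_url("slumdog millionaire", ["indiatimes.com", "movies.yahoo.com"], "xyz"))
```

The colon in `site:` and the commas in the `sites` list are percent-encoded as `%3A` and `%2C`, which is why the URLs on the slide look the way they do.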
http://boss.yahooapis.com/ysearch/web/v1/slumdog+millionaire?appid=xyz&sites=indiatimes.com%2Cmovies.yahoo.com&format=xml




                               - 29 -
BOSS API



  •  Find related keywords
     –  Add parameter view=keyterms
     –  http://boss.yahooapis.com/ysearch/web/v1/slumdog+millionaire?appid=xyz&view=keyterms&format=xml




                                - 30 -
http://boss.yahooapis.com/ysearch/web/v1/slumdog+millionaire?appid=xyz&view=keyterms&format=xml




                                                - 31 -
BOSS API




  •  Search images
     –  http://boss.yahooapis.com/ysearch/images/v1/slumdog+millionaire?dimensions=small




                              - 32 -
http://boss.yahooapis.com/ysearch/images/v1/slumdog+millionaire?dimensions=small




                             - 33 -
BOSS API




  •  Search news
     –  http://boss.yahooapis.com/ysearch/news/v1/slumdog+millionaire?age=15d




                              - 34 -
http://boss.yahooapis.com/ysearch/news/v1/slumdog+millionaire?age=15d





                              - 35 -
BOSS API

 Spell check request

 http://boss.yahooapis.com/ysearch/spelling/v1/milionare?format=xml



 Response




                           - 36 -
BOSS API REST Interface

  http://boss.yahooapis.com/ysearch/{vert}/v1/{query}

  •  {query}: term to look for (url-encoded)
  •  {vert} := {web, news, images, spelling}
  •  Required parameters
     –  appid
  •  Optional parameters
     –  start, count, lang, region, format, callback, sites




                                   - 37 -
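
The REST pattern above lends itself to one small helper that covers every vertical. This is a sketch under stated assumptions, not an official client: the names `boss_url` and `page_urls` are hypothetical, and treating `start` as a zero-based offset into the result list is an assumption that should be confirmed in the BOSS guide.

```python
from urllib.parse import quote_plus, urlencode

# Verticals listed on the slide.
VERTICALS = {"web", "news", "images", "spelling"}


def boss_url(vertical, query, appid, **optional):
    """Build http://boss.yahooapis.com/ysearch/{vert}/v1/{query} with its parameters."""
    if vertical not in VERTICALS:
        raise ValueError(f"unknown vertical: {vertical!r}")
    params = {"appid": appid}  # the only required parameter
    # Optional parameters (start, count, lang, region, format, callback, sites, ...)
    params.update({k: v for k, v in optional.items() if v is not None})
    return (
        f"http://boss.yahooapis.com/ysearch/{vertical}/v1/"
        f"{quote_plus(query)}?{urlencode(params)}"
    )


def page_urls(vertical, query, appid, pages, count=10):
    """URLs for consecutive result pages (assumes start is a zero-based offset)."""
    return [
        boss_url(vertical, query, appid, start=i * count, count=count)
        for i in range(pages)
    ]


print(boss_url("news", "slumdog millionaire", "xyz", age="15d"))
print(page_urls("web", "slumdog", "xyz", pages=2))
```

Validating the vertical up front turns a typo into an immediate error instead of a failed HTTP request later on.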
BOSS Mashup Framework



  •  Python (v2.5+) library

  •  BOSS Search SDK plus …

  •  SQL for remixing arbitrary XML/JSON sources




         http://developer.yahoo.com/search/boss/mashup.html




                               - 38 -
BMF + Google App Engine



  •  Enhanced version of BMF for the Google App Engine (GAE) platform
  •    http://zooie.wordpress.com/2008/08/04/yahoo-boss-google-app-engine-integrated/




  •  Enables quick deployment of BOSS applications online




                                               - 39 -
One more thing…




 - 40 -
BOSS in Academic Research



  •  The biggest dataset available on the web
  •  Very useful for Web-mining research experiments
     –    Natural language processing
     –    Semantic extraction
     –    Related keywords
     –    Similarity detection
     –    Clustering algorithms
     –    Spelling corrections




                                    - 41 -
Thank You




                              Questions?
        More: http://developer.yahoo.com/search/boss/




                           - 42 -
Appendix



- 43 -
Search UI Templates are Included in the
BOSS Mashup Framework




                       http://www.yahoo.com




    BOSS Mashup Framework simplifies aggregating and presenting multiple data sources

                                              - 44 -
BMF Features


  •  select, group, sort, union, joins, udfs, where
  •  Text normalization and duplicate removal
  •  Auto-transformation of resource-oriented API results
     into tables w/o parsing
  •  All-in-memory storage and retrieval operations
  •  Ability to join lists of tables via an arbitrary predicate
     function (map-like)
  •  Search UI template framework
  •  Single search function provides total access to
     BOSS REST API

                                - 45 -
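
To make the remixing ideas above concrete, here is a plain-Python sketch of text normalization, duplicate removal and a join driven by an arbitrary predicate function. It does not use the BOSS Mashup Framework and does not reproduce its API; every function name and the sample rows are invented for illustration.

```python
def normalize(text):
    """Lowercase and collapse whitespace so near-identical strings compare equal."""
    return " ".join(text.lower().split())


def dedupe(rows, key):
    """Keep the first row for each normalized value of rows[key]."""
    seen = set()
    unique = []
    for row in rows:
        marker = normalize(row[key])
        if marker not in seen:
            seen.add(marker)
            unique.append(row)
    return unique


def join(left, right, predicate):
    """Nested-loop join: merge every pair of rows for which predicate is true."""
    return [{**l, **r} for l in left for r in right if predicate(l, r)]


# Illustrative in-memory "tables" (lists of dicts), not real API output.
web = [
    {"title": "Slumdog Millionaire", "url": "http://example.com/a"},
    {"title": "slumdog   millionaire", "url": "http://example.com/b"},
]
news = [{"headline": "Slumdog Millionaire wins again", "source": "Example News"}]

unique_web = dedupe(web, "title")
mashup = join(
    unique_web,
    news,
    lambda w, n: normalize(w["title"]) in normalize(n["headline"]),
)
print(mashup)
```

Keeping everything in memory as lists of dicts mirrors the "all-in-memory" idea from the slide, and passing the predicate as a function is what allows joins on fuzzy conditions rather than only on equal keys.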
