Your SlideShare is downloading. ×
0
slides
slides
slides
slides
slides
slides
slides
slides
slides
slides
slides
slides
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×
Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
260
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
1
Comments
0
Likes
0
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide

Transcript

  • 1. INFORMATION EXTRACTION FROM QUERIES Ed Snelson, Joaquin Quiñonero Candela, Ralf Herbrich, Thore Graepel
  • 2. Information extraction from queries
  • 3. Templates
  • 4. Probabilistic query modelling
  • 5. Key details
    • EP message passing for inference within single query model
    • ADF single pass through queries
    • Sparse messages within query
    • Bootstrap from initial seed sets of instances/attributes
    • Directed processing of queries based on current top beliefs
  • 6. Data
    • 10 months, Live Search query logs
    • 100 Million unique queries, with associated counts
    • Preliminary experiments on small specific subsets
    • e.g. 50,000 unique queries related to actors, cars and national parks
  • 7. Seed lists
  • 8. Actors Instances Attributes tom cruise movies brad pitt pictures johnny depp dealer.com matt damon photos george clooney angelina jolie cameron diaz nude scarlett johansson biography mel gibson news grand canyon height sharon stone wedding
  • 9. Cars Instances Attributes dealer {Year} honda civic parts honda accord hybrid ford mustang dealer dodge charger used toyota camry world ford explorer accessories toyota corolla ford ford focus cleveland plain dodge durango wachovia
  • 10. National Parks Instances Attributes grand canyon national park yellowstone park yosemite tours redwood lodging denali hotels everglades lodge algonquin west joshua tree skywalk west yellowstone gmc shenandoah college
  • 11. Templates Templates [Inst] [Attr] [Attr] [Inst] {Year} [Inst] [Attr] [Attr] of [Inst] [Inst] and [Attr] [Attr] and [Inst] [Attr] in [Inst] the [Attr] [Inst] how [Attr] is [Inst] [Attr] [Inst] coupe [Attr] [Inst] parts the [Inst] [Attr] [Inst] 's [Attr] [Inst] in [Attr]
  • 12. Future improvements
    • Class/Attribute dependent templates
    • A garbage class to deal with “noise”
    • Reducing sensitivity to order of processing initial queries
    • Disambiguation, synonyms etc.
    • Use of part-of-speech tagger
    • Combination with standard hand-crafted entity extraction techniques

×