Dont Cry for me Google: Semantic Analysis Of Travel Reviews to Understand Feelings


Published on

Presentation about semantic analysis of travel reviews for machine understanding of sentiments

Published in: Technology, News & Politics
  • Be the first to comment

  • Be the first to like this

No Downloads
Total Views
On Slideshare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide

Dont Cry for me Google: Semantic Analysis Of Travel Reviews to Understand Feelings

  1. 1. Semantic analysis of travel reviews to understand feelings  Sept 17 th , 2007 "Don't cry for me, Google"  Let me tell you...I would highly recommend that unless you are a party animal, a heavy drinker I would not let my dog to search at Google My dog is better behaved than most kids I saw in this hotel I ate a 3.5 pound lobster for just $16.99 at the seafood restaurant inside the hotel which is the only thing I felt good about. Boris Galitsky
  2. 2. Conventional search engines <ul><li>Deliver documents which include significant occurrences of keywords in user queries. </li></ul><ul><li>Additionally, Google is good at selecting those documents which has satisfied users in previous similar searches. </li></ul><ul><li>Powerset is trying to understand linguistic links between words in queries and documents </li></ul><ul><li>While this works well enough for general horizontal searches, vertical searches for specific kinds of products can do much better. </li></ul>Horizontal (about everything) By using domain knowledge (ontology)
  3. 3. Ontologies for vertical search <ul><li>Computers cannot really “know” what each word “means” </li></ul><ul><li>However, we can teach them how entities are related to each other </li></ul><ul><li>In a vertical search, for every word (or entity, denoted by it) there should be coded relationships with other words </li></ul><ul><li>Then the index for each document contains multiple inter-related entities : skeleton of a document for search </li></ul>Dive in Monterey activity Relationship = ‘performed at’ location dive - boat dive - lesson dive – deep - oxygen
  4. 4. To recommend a product <ul><li>It is very convincing to refer to the experience of those who used  it before </li></ul><ul><li>To provide argumentation why this product is good for particular user, having discovered it suited well similar users </li></ul>a search engine must not only &quot;understand&quot; the features of products such as a hotel close to outdoor activities , but also feeling of people about these products like not impressed with a view but nice for guys' getaway .   It is not a recommendation opposite is understood
  5. 5. To handle recommendation queries <ul><ul><li>a search engine must know that: </li></ul></ul><ul><ul><li>hotels are characterized with locations, </li></ul></ul><ul><ul><li>sometimes good locations are those which are close to activities ( in particular, outdoor activities). </li></ul></ul><ul><ul><li>“ views” are important considerations while staying in hotels, </li></ul></ul><ul><ul><li>expression &quot;not impressed&quot; refers to a negative feeling, which is nevertheless combined with positive reference to the category such as &quot;guys getaway&quot;. </li></ul></ul>
  6. 6. Understanding sarcastic expressions <ul><ul><li>It is also necessary to understand sarcastic expressions like: </li></ul></ul>Not clean => Dogs are not allowed =>
  7. 7. Technology: using semantic templates
  8. 8. A traveler profile Confidential - Contains Trade Secrets
  9. 9. Positive and negative sentiments Confidential - Contains Trade Secrets
  10. 10. Overview: we understand facts and feelings about travel products and pages <ul><li>Why: we know how to link the underlying facts and consumer feelings about a product to return recommendations relevant to the consumers’ explicit and implicit intentions </li></ul><ul><li>Example: Portola Plaza is the top family hotel because it is rated highly: </li></ul><ul><ul><li>Based on quantifiable facts - it has a pool with a slide, kids eat and stay free and it has an endorsement from parenting magazine. It also has kitchenettes and babysitting available </li></ul></ul><ul><ul><li>Based on the feelings about the hotel -15% of reviews mentioned the children enjoyed it e.g. “great little indoor pool and jacuzzi which my kids loved”, “family friendly, easy access to state parks”, “right on the beach, very family friendly” and “staff was attentive to my kids” </li></ul></ul><ul><li>How - based on our ontology, we extracted facts & feelings about products: </li></ul><ul><ul><li>Used natural language processing to extract consumers and experts feelings from reviews, blogs and articles </li></ul></ul><ul><ul><li>Applied rule-based reasoning to derive data and meta-data from underlying facts and match them against consumers intentions and themes </li></ul></ul><ul><ul><li>Mined guidebooks and other offline sources for facts and feelings not online </li></ul></ul><ul><ul><li>Created a search relevancy algorithm that marries the extracts of feelings, the derived data, and facts to match them against the consumer’s intention and theme-based intentions </li></ul></ul><ul><li>Why our results are better: </li></ul><ul><ul><li>Ontology required to represent facts and feelings cannot be scaled up for horizontal search </li></ul></ul><ul><ul><li>Most vertical searches do not integrate facts and feelings into the form which can be matched against user queries </li></ul></ul>