Red Opal: Product-Feature Scoring from Reviews ACM EC 2007 - Presentation Transcript
Red Opal: Product-Feature Scoring from Reviews Christopher Scaffidi Kevin Bierhoff Eric Chang Mikhael Felker Herman Ng Chun Jin School of Computer Science Carnegie Mellon University ACM EC 2007, San Diego, CA
Motivation
Searching for quality digital camera
Picture quality
Auto mode
Battery
Shutter
Memory
Searching by product feature: Red Opal
Red Opal Overview
Feature Extraction
Product scoring
User Interface
Evaluation
Feature extraction
Prior work identified technical terms as mostly nouns.
For a given product category, if a certain noun occurs in reviews far more frequently than in generic English text, then that word is likely to be a product feature.
A Poisson distribution was previously used to extract technical terms from texts on physics & politics.
Feature extraction
For each product category,
1. Retrieve Amazon reviews for products in this category
2. Tag part-of-speech to text (e.g. “games” NN/“game”)
3. Compute the count of each noun and compound noun
4. Compute their probability
5. Sort the most common nouns and compound nouns together according to probability, yielding the feature list
0 comments
Post a comment