The document presents a study on detecting fake reviews using supervised machine learning techniques. The study applies various machine learning classifiers to identify fake reviews based on the content of reviews as well as features extracted from reviewers' behaviors.
The proposed approach involves three phases: data preprocessing, feature extraction, and feature engineering. It extracts features representing reviewers' behaviors like capital letters, punctuation, and emojis used. It then compares classifier performance on Yelp restaurant reviews in identifying fake reviews, with and without these extracted behavioral features.
The results show that classifiers generally perform better at detecting fake reviews when the extracted behavioral features are included. For example, the KNN classifier's f-score improved from 82.40% to 83.20%