Aspect Level Sentiment Analysis for Arabic Language


Published on

This is the presentation I used in my proposal seminar for master degree in ISSR.
the thesis about Aspect Level Sentiment Classification for Arabic Language.
Any further info. please contact me at (

Published in: Education, Technology
  • Be the first to comment

No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide

Aspect Level Sentiment Analysis for Arabic Language

  1. 1. Aspect Level Sentiment Classification For Arabic Language Mahmoud El Razzaz ISSR.CU Under the Supervision of Dr. Mohamed Farouk Prof. Dr. Hesham A. Hefny 1
  2. 2. Agenda 1. 2. 3. 4. 5. 6. 7. Introduction Problem definition Difficulties and chalenges Related work Objective Work plan References
  3. 3. Introduction to Sentiment Analysis 3
  4. 4. What is Sentiment Analysis   Sentiment Classification is a sub domain of text Classification or text categorization. Text classification is concerned with automatically identify the category or the domain of a text document (Political, Financial, … etc.,) 4
  5. 5. What is Sentiment Analysis  Identifying the opinion in a piece of text My Phone is awesome! [ Sentimental ]  My phone has 5MP camera [ Factual ] My Phone is horrible! [ Sentimental ] It can be generalized over a wider set of emotions 5
  6. 6. Advantages >>A lower cost than traditional methods of getting customer insight. >>A faster way of getting insight from customer data. >>The ability to act on customer suggestions. >>Identifies an organisation's Strengths, Opportunities & Threats (SWOT Analysis) . Weaknesses, >>More accurate and insightful customer perceptions and feedback. 6
  7. 7. Sentiment Analysis at different levels 7
  8. 8. Document Level Sentiment Analysis The task at this level is to classify whether a whole opinion document express a positive or negative sentiment. Researchers developed machine learning classifiers to classify document level sentiments for both English Language [1] and Also Arabic Language [2] References: [1] Pang, Bo, Lillian Lee, and Shivakumar Vaithyanathan. Thumbs up?: Sentiment classification using machine learning techniques. In Proceedings of Conference on Empirical methods in Natural Language processing (EMNLP-2002). 2002. [2] Mohamed Aly and Amir Atiya: LABR: A Large Scale Arabic Book Reviews Dataset. In Proceedinds of the 51st Annual Meeting of the Association for Computational Linguistics, Pages 494-498 Sofia, Bulgaria, August 4-9-2013. 8
  9. 9. Document Level Sentiment Analysis This level of Analysis assumes that each document expresses opinions on a single entity (e.g., a single product). Thus, it is not applicable to documents which evaluate or compare multiple entities. Example in English: positive Sentiment about a smart phone [1] “My mpop is very amazing even thought its battery drains fast the performance and the speed of the phone is very good even in playing high graphic games the camera is bright ” Example In Arabic: positive Sentiment about a book [2] “ ” References: [1] [2] 9
  10. 10. Sentence Level Sentiment Analysis The task at this level goes to the sentences and determines whether each sentence expressed a positive, negative, or neutral opinion. Neutral usually means no opinion. Ex., The poverty of India is decreasing Reference: N. Farra, E. Challita, R. Assi, and H. Hajj. Sentence-Level and Document-Level Sentiment mining for Arabic Texts. In proceedings of International Conference on data mining workshops. Pages 1114-1119. IEEE, 2010 10
  11. 11. Aspect Level Sentiment Analysis Why Aspect Level is better represent of a product review? Document and sentence level assumes that each document evaluates one entity. Even though that does not mean that in positive opinions the author of the review has a positive opinion about all aspects of the product. Likewise, a negative opinion document does not mean that the author is negative about every thing. For more complete Analysis we need to discover the aspects and determine whether the sentiment is positive or negative on each aspect. 11
  12. 12. Aspect Level Sentiment Analysis Aspect Level Sentiment Analysis is based on the idea that an opinion consists of a sentiment (positive or negative) and target of opinion “Aspect”. Realizing the importance of opinion targets also helps us understand the sentiment analysis problem better. For example, “although the service is not that great, I Still love this restaurant.” clearly has a positive tone, we can not say that this sentence is entirely positive. In fact it is positive about the restaurant but negative about the service. 12
  13. 13. Aspect Level Sentiment Analysis Example “My mpop is very amazing even thought its battery drains fast the performance and the speed of the phone is very good even in playing high graphic games the camera is bright ” The Sentiment on mpop, performance, speed and camera is positive. The sentiment on the battery is negative. The mpop, performance, speed and battery are the opinion targets 13
  14. 14. Advantages of Aspect Level Sentiment Analysis Based on this level of analysis a structured summary of opinions about entities and their aspects can be produced. Reference: Tun Thura Thet, Jin-Cheon Na and Christopher S.G. Khoo: “Aspect-based sentiment analysis of movie reviews on discussion boards” Journal of Information Science 2010 14
  15. 15. Advantages of Aspect Level Sentiment Analysis Thus it would be more useful for both customers and service provider or product producers. - For product producers or service providers they would know exactly what are the main aspects of the product/service that customers are not satisfied about rather than just knowing that customers are not satisfied about the service or product in general. 15
  16. 16. Advantages of Aspect Level Sentiment Analysis For customers it would be more important and this is because each customer usually concerned about a few number of product features “Aspects” and do not care about the other features. Thus customers may concentrate on the aspects the care much about rather than having an overall review of other users about the product or service. For example some may be concerned about the life time of the battery, the quality of the camera and the clearance of the screen while shows no concern about the color, weight and the insurance period of the mobile phone thus using aspect analysis would give customers a brief summary of user opinions specifically about each aspect of the mobile so he can decide which is better for him. 16
  17. 17. Challenges and Difficulties Both the Document Level and sentence level classifications are already highly Challenging. The aspect-level is even more difficult. It constricts or several sub-problems: 1- Entity Extraction. 2- Entity categorization (picture, image and photo are the same aspects for cameras) Each entity category should have a unique name in a particular application. 3- implicit Entities (this book is expensive) 17
  18. 18. Challenges and Difficulties (continuous) Difficulties related to Arabic language 1- Rare resources (few number of Arabic datasets are available) 2- Rare resources (few NLP tools are available for Arabic Slang) 3- The variance of Arabic dialects or tones from country to country. (ex., 3eda gamda gedan bas el battery taba3ha yefda bsor3a) 4- Some Arabic natives writes reviews in Franco Arab and some other write reviews in multiple languages. Ex., : apps Asha Reference: Soha Ahmed, Michel Pasquier, and Ghassan Qadah: “Key issues in conducting sentiment analysis on arabic social media texts” 2012 18
  19. 19. Related work Recently researchers bayed more attention to the problem of sentiment analysis for Arabic language such as: - Mohamed El Arnaoty et al., who provided “a machine learning approach for opinion holder extraction in Arabic language” 2012 -Mohamed Aly et al., who provided “A Large Scale Arabic Book reviews Data Set” 2013. -Also a Survey on Sentiment And Subjectivity Analysis of Arabic were introduced by Mohamed Korayem et al., in “Subjectivity and Sentiment Analysis of Arabic: A Survey” 2012. 19
  20. 20. Related work - Furthermore the difficulties of applying sentiment classification in Arabic Language were disused by Soha Ahmed et al., in “Key Issues in Conducting Sentiment Analysis on Arabic Social Media Text” 2010. 20
  21. 21. Some of the Review Websites (book reviews)  (mobile phones reviews)  (digital cameras reviews)  (restaurants reviews)  (reviews on multiple subjects)  (movies reviews)  21
  22. 22. Example of a Review website 22
  23. 23. Objective Construct An aspect level sentiment classification system to automatically Summarize the Arabic sentiments of users of a specific product or service. 23
  24. 24. Work plan 1. Overview of Data collection 2. Overview of data preprocessing (entity extraction, entity categorization, feature selection, and feature extraction) 3. Overview of the Sentiment Analysis levels and techniques 4. The proposed approach for Sentiment Analysis: Aspect Level Sentiment classification. 5. Testing the proposal approach and comparing the results with related work. 6. Conclusion and future work. 24
  25. 25. Thank you 25