Seminar presentation made by me for the topic of 'Resources for Sentiment Analysis' at IIT Bombay. Includes a set of bonus slides for additional information which was not actually presented.
This document summarizes a thesis analyzing stress concentrations around doors and windows on the Boeing 787 aircraft under uniform shear loading. It presents analytical solutions using complex variable and Schwarz alternating techniques to model openings as rectangular holes in an infinite plate. Finite element analysis is also conducted and results show good agreement with analytical solutions. Stress concentrations are highest at corners and depend on geometry. Door and window interaction increases window stresses up to 4.8% but negligibly impacts door stresses.
Natural language processing for requirements engineering: ICSE 2021 Technical...alessio_ferrari
These are the slides for the technical briefing given at ICSE 2021, given by Alessio Ferrari, Liping Zhao, and Waad Alhoshan
It covers RE tasks to which NLP is applied, an overview of a recent systematic mapping study on the topic, and a hands-on tutorial on using transfer learning for requirements classification.
Please find the links to the colab notebooks here:
https://colab.research.google.com/drive/158H-lEJE1pc-xHc1ISBAKGDHMt_eg4Gn?usp=sharing
https://colab.research.google.com/d rive/1B_5ow3rvS0Qz1y-KyJtlMNnm gmx9w3kJ?usp=sharing
https://colab.research.google.com/d rive/1Xrm0gNaa41YwlM5g2CRYYX cRvpbDnTRT?usp=sharing
The document discusses Particle Swarm Optimization (PSO), which is an optimization technique inspired by swarm intelligence and the social behavior of bird flocking. PSO initializes a population of random solutions and searches for optima by updating generations of candidate solutions. Each candidate, or particle, updates its position based on its own experience and the experience of neighboring highly-ranked particles. The algorithm is simple to implement and converges quickly to produce approximate solutions to difficult optimization problems.
Grid integrated system
study on Integration of DG’s
Key challenges observed
Modelling and study of hybrid systems under different fault conditions
Propose suitable methods to over come some of these challenges
This document provides an overview of a course on the finite element method. The course objectives are for students to learn how to write simple programs to solve problems using FEM. Assessment includes assignments, quizzes, a course project, midterm exam, and final exam. Fundamental agreements include electronic homework submission and using MATLAB or Mathematica. References on FEM are also provided. The document outlines numerical methods for solving boundary value problems and introduces weighted residual methods like the collocation method, subdomain method, and Galerkin method.
twin well cmos fabrication steps using Synopsys TCADTeam-VLSI-ITMU
The document describes the design and simulation of a CMOS fabrication process using TCAD (Technology Computer-Aided Design). It involves dimensioning the design using MOSIS design rules, creating masks, and defining 44 steps for the process in Synopsys TCAD. This includes doping wells, growing oxides, depositing polysilicon, implanting sources and drains, and depositing metals. The process aims to fabricate n-well and p-well CMOS on a silicon substrate using a minimum number of masks. Characterization and optimization of the modeled device can be done using additional TCAD tools.
The Indian Dental Academy is the Leader in continuing dental education , training dentists in all aspects of dentistry and
offering a wide range of dental certified courses in different formats.for more details please visit
www.indiandentalacademy.com
This document discusses a crystal oscillator model for simulation purposes. It contains three figures showing the circuit model, simulation results, and a zoomed in view of the simulation results. The figures are copyrighted by Bee Technologies Corporation and are from 2016.
This document summarizes a thesis analyzing stress concentrations around doors and windows on the Boeing 787 aircraft under uniform shear loading. It presents analytical solutions using complex variable and Schwarz alternating techniques to model openings as rectangular holes in an infinite plate. Finite element analysis is also conducted and results show good agreement with analytical solutions. Stress concentrations are highest at corners and depend on geometry. Door and window interaction increases window stresses up to 4.8% but negligibly impacts door stresses.
Natural language processing for requirements engineering: ICSE 2021 Technical...alessio_ferrari
These are the slides for the technical briefing given at ICSE 2021, given by Alessio Ferrari, Liping Zhao, and Waad Alhoshan
It covers RE tasks to which NLP is applied, an overview of a recent systematic mapping study on the topic, and a hands-on tutorial on using transfer learning for requirements classification.
Please find the links to the colab notebooks here:
https://colab.research.google.com/drive/158H-lEJE1pc-xHc1ISBAKGDHMt_eg4Gn?usp=sharing
https://colab.research.google.com/d rive/1B_5ow3rvS0Qz1y-KyJtlMNnm gmx9w3kJ?usp=sharing
https://colab.research.google.com/d rive/1Xrm0gNaa41YwlM5g2CRYYX cRvpbDnTRT?usp=sharing
The document discusses Particle Swarm Optimization (PSO), which is an optimization technique inspired by swarm intelligence and the social behavior of bird flocking. PSO initializes a population of random solutions and searches for optima by updating generations of candidate solutions. Each candidate, or particle, updates its position based on its own experience and the experience of neighboring highly-ranked particles. The algorithm is simple to implement and converges quickly to produce approximate solutions to difficult optimization problems.
Grid integrated system
study on Integration of DG’s
Key challenges observed
Modelling and study of hybrid systems under different fault conditions
Propose suitable methods to over come some of these challenges
This document provides an overview of a course on the finite element method. The course objectives are for students to learn how to write simple programs to solve problems using FEM. Assessment includes assignments, quizzes, a course project, midterm exam, and final exam. Fundamental agreements include electronic homework submission and using MATLAB or Mathematica. References on FEM are also provided. The document outlines numerical methods for solving boundary value problems and introduces weighted residual methods like the collocation method, subdomain method, and Galerkin method.
twin well cmos fabrication steps using Synopsys TCADTeam-VLSI-ITMU
The document describes the design and simulation of a CMOS fabrication process using TCAD (Technology Computer-Aided Design). It involves dimensioning the design using MOSIS design rules, creating masks, and defining 44 steps for the process in Synopsys TCAD. This includes doping wells, growing oxides, depositing polysilicon, implanting sources and drains, and depositing metals. The process aims to fabricate n-well and p-well CMOS on a silicon substrate using a minimum number of masks. Characterization and optimization of the modeled device can be done using additional TCAD tools.
The Indian Dental Academy is the Leader in continuing dental education , training dentists in all aspects of dentistry and
offering a wide range of dental certified courses in different formats.for more details please visit
www.indiandentalacademy.com
This document discusses a crystal oscillator model for simulation purposes. It contains three figures showing the circuit model, simulation results, and a zoomed in view of the simulation results. The figures are copyrighted by Bee Technologies Corporation and are from 2016.
This document provides an overview of finite element analysis (FEA). It defines FEA as a numerical method for solving governing equations over the domain of a continuous physical system that is discretized into simple shapes. It lists several common structural and non-structural applications of FEA, such as stress analysis, buckling problems, vibration analysis, and heat transfer. Finally, it provides the course outline, textbooks, references, and some common FEA software packages.
This document is a master's project report on Hadamard matrices by Raymond Nguyen, advised by Peter Casazza at the University of Missouri. It begins with an introduction to the basic theory of Hadamard matrices, including definitions, examples, properties and the open Hadamard conjecture. Subsequent sections will cover constructions of Hadamard matrices using methods like Sylvester's, Paley's and Williamson's, as well as applications of Hadamard matrices. The document is organized into chapters on basic theory, constructions and applications.
The document provides an introduction to the finite element method (FEM) through lecture notes. It discusses the basic concepts of dividing a complex problem into smaller, simpler pieces called finite elements. A brief history is given of the FEM from its origins in the 1940s to its widespread use today in engineering fields. The typical procedure of FEM for structural analysis is outlined as dividing a structure into finite elements connected at nodes.
This document provides an overview of Kalyan Acharjya's proposed work on face recognition for his M.Tech dissertation. It discusses conducting literature research on existing face recognition techniques, identifying challenges in real-time applications, and exploring standard face image databases. The presentation covers topics such as how face recognition works, applications, and concludes with plans to modify existing algorithms and compare results to related work to enhance recognition rates.
The document discusses a model called C2AE (Class Conditioned Auto-Encoder) for open-set recognition. C2AE is an auto-encoder trained with a conditional decoder to learn class-specific representations, allowing it to detect unknown classes during open-set identification. The document examines what threshold scores and operating points work best for open-set identification tasks using C2AE, which models unknown class likelihoods through extreme value theory modeling during conditional decoder training.
This is related to describing various types of Ansys stimulations and it's application at industrial level. It will give an overview of benefits of using Ansys software.
The document provides an introduction to the finite element method (FEM). It discusses that FEM is a numerical technique used to approximate solutions to boundary value problems defined by partial differential equations. It can handle complex geometries, loadings, and material properties that have no analytical solution. The document outlines the historical development of FEM and describes different numerical methods like the finite difference method, variational method, and weighted residual methods that FEM evolved from. It also discusses key concepts in FEM like discretization into elements, node points, and interpolation functions.
This document provides an introduction to and overview of engineering mathematics for the GATE exam. It aims to give GATE aspirants basic knowledge of topics like linear algebra, calculus, differential equations, and probability and statistics. The author advises students to review the syllabus and previous questions for their specific paper to understand what depth of topics is required. The book contains solved problems from past papers of popular branches but is still a work in progress. Readers are asked to provide feedback to the author.
This document discusses thermography testing as a non-destructive testing method. It describes how thermography detects infrared radiation emitted from all objects based on their temperature. Defects appear as temperature variations that can be visualized using thermal cameras. There are different thermography techniques including pulsed thermography, lock-in thermography, and vibrothermography. Pulsed thermography involves heating the material with a short pulse and observing defects. Thermography allows for rapid inspection of large areas and can detect defects like delaminations. While it is useful for many applications, it has limitations in penetrating deep within materials.
The presentation the most important issues on copper and copper alloys for railway traction supply systems - conventional and high speed railway according to the UIC directives. The construction of overhead railway traction with various elements (e.g. trolley wires, catenary wires, equipment) made from copper and copper alloys will be presented. Presentation will also include the selected EU normalizations of different products designed for railway traction and their mechanical and electrical properties requirements. Furthermore authors will present and discuss the most important exploitation problems of overhead railway traction. In addition selected industrial production technologies of different railway traction copper based elements, the laboratory results and its exploitation properties will be also discussed.
MRAM is a type of non-volatile memory that uses magnetism instead of electricity to store data. It has the potential for unlimited read/write endurance, high speed performance, and lower power consumption compared to other RAM technologies. MRAM uses magnetic tunnel junctions consisting of two ferromagnetic plates separated by an insulator, where one plate's magnetization represents a 1 and the other a 0. Over time, MRAM development has improved switching speed and density, with multi-Mbit demonstration chips produced in the 2000s and research continuing on thermal assisted switching and spin torque transfer to enable smaller geometries.
This lecture provides an introduction to finite element analysis (FEA). It discusses the basic concepts of FEA, including dividing a complex object into simple finite elements and using polynomial terms to describe field quantities within each element. The lecture covers the history and applications of FEA, as well as the basic procedure, which involves meshing a structure into elements, describing element behavior, assembling elements at nodes, solving the system of equations, and calculating results. It also reviews matrix algebra concepts needed for FEA. Finally, it presents the simple example of a spring element and spring system to demonstrate the finite element modeling process.
The document discusses techniques for measuring resistivity and mapping resistivity variations across semiconductor wafers. It begins by defining resistivity and listing typical resistivity values for different materials. It then describes two common measurement techniques: two-point probe and four-point probe. Four-point probe is more accurate as it eliminates lead and contact resistance. Factors that affect measurement accuracy like sample size, carrier injection, and probe spacing are also covered. The document concludes by explaining techniques for wafer mapping like double implant, modulated photoreflectance, and optical densitometry.
MCM (Multiple-chip-module) packages multiple integrated circuits and semiconductor dies onto a single substrate. There are different types of MCMs based on the substrate material, including MCM-L which uses a laminated substrate, MCM-C which uses a ceramic substrate, and MCM-D which uses a deposited substrate with thin-film metals and dielectrics. The physical design of an MCM involves partitioning the circuit, placing the chips on the substrate while considering timing constraints, power constraints, and thermal characteristics, and routing the interconnects between chips.
CDS is the criminal face identification by capsule neural network.
Solving the common problems in image recognition such as illumination problem, scale variability, and to fight against a most common problem like pose problem, we are introducing Face Reconstruction System.
For the full video of this presentation, please visit:
http://www.embedded-vision.com/platinum-members/embedded-vision-alliance/embedded-vision-training/videos/pages/dec-2016-member-meeting-uofw
For more information about embedded vision, please visit:
http://www.embedded-vision.com
Professor Jeff Bilmes of the University of Washington delivers the presentation "Image and Video Summarization" at the December 2016 Embedded Vision Alliance Member Meeting. Bilmes provides an overview of the state of the art in image and video summarization.
Presentation of my PhD work about a preliminary design tool for flapping-wing systems. The presentation is about the definition/implementation of an aeroelastic framework that coupled an aerodynamic model of insect flight with a FEM solver, its numerical and experimental validation for preliminary design tasks and finally about its applications to the specific case of a resonant nano-air vehicle: the OVMI. Thus the designers can evaluate quickly the performance of a wing and then determine a wing geometry via an optimization environment. Enjoy!
Finite element analysis (FEA) involves breaking a model down into small pieces called finite elements. FEA was first developed in 1943 and involved numerical analysis techniques. By the 1970s, FEA was used primarily by aerospace, automotive, and defense industries due to the high cost of computers. Modern FEA involves preprocessing like meshing a model, applying properties and boundary conditions, solving the model using software, and postprocessing to analyze results like stresses and displacements.
Human Activity Recognition (HAR) systems aim to recognize human activities through sensors in order to provide assistance. The key steps in designing a HAR system are:
1) Acquiring sensor data and preprocessing it by removing noise.
2) Segmenting the preprocessed data into windows that may contain activities.
3) Extracting features from each window to reduce the data into discriminative features.
4) Training a classification model on the extracted features to predict activity labels, and evaluating the model's performance using methods like a confusion matrix.
This document proposes a system for sentiment classification in Hindi language texts. It involves building a training dataset from Hindi corpora by identifying sentiment scores. A classification model is then built and applied to new test data to predict sentiment. Key steps include tokenization, removing stop words, stemming using a Hindi stemmer, identifying sentiment using Hindi WordNet, and aggregating word-level sentiment scores to determine overall sentiment. Challenges noted include limited coverage of Hindi WordNet and accuracy issues. Future work could focus on expanding Hindi WordNet. The proposed system aims to efficiently classify sentiment in Hindi texts.
Senti-Lexicon and Analysis for Restaurant Reviews of Myanmar TextIJAEMSJORNAL
This document summarizes research on sentiment analysis of restaurant reviews written in Myanmar language. The authors built a Myanmar language sentiment lexicon and analyzed 800 restaurant reviews collected from social media. They performed preprocessing including syllable segmentation and merging. Then conducted dictionary-based sentiment analysis and opinion word extraction to classify reviews as positive, negative, or neutral. The research addressed challenges in analyzing sentiment in Myanmar language due to the lack of existing language resources.
This document provides an overview of finite element analysis (FEA). It defines FEA as a numerical method for solving governing equations over the domain of a continuous physical system that is discretized into simple shapes. It lists several common structural and non-structural applications of FEA, such as stress analysis, buckling problems, vibration analysis, and heat transfer. Finally, it provides the course outline, textbooks, references, and some common FEA software packages.
This document is a master's project report on Hadamard matrices by Raymond Nguyen, advised by Peter Casazza at the University of Missouri. It begins with an introduction to the basic theory of Hadamard matrices, including definitions, examples, properties and the open Hadamard conjecture. Subsequent sections will cover constructions of Hadamard matrices using methods like Sylvester's, Paley's and Williamson's, as well as applications of Hadamard matrices. The document is organized into chapters on basic theory, constructions and applications.
The document provides an introduction to the finite element method (FEM) through lecture notes. It discusses the basic concepts of dividing a complex problem into smaller, simpler pieces called finite elements. A brief history is given of the FEM from its origins in the 1940s to its widespread use today in engineering fields. The typical procedure of FEM for structural analysis is outlined as dividing a structure into finite elements connected at nodes.
This document provides an overview of Kalyan Acharjya's proposed work on face recognition for his M.Tech dissertation. It discusses conducting literature research on existing face recognition techniques, identifying challenges in real-time applications, and exploring standard face image databases. The presentation covers topics such as how face recognition works, applications, and concludes with plans to modify existing algorithms and compare results to related work to enhance recognition rates.
The document discusses a model called C2AE (Class Conditioned Auto-Encoder) for open-set recognition. C2AE is an auto-encoder trained with a conditional decoder to learn class-specific representations, allowing it to detect unknown classes during open-set identification. The document examines what threshold scores and operating points work best for open-set identification tasks using C2AE, which models unknown class likelihoods through extreme value theory modeling during conditional decoder training.
This is related to describing various types of Ansys stimulations and it's application at industrial level. It will give an overview of benefits of using Ansys software.
The document provides an introduction to the finite element method (FEM). It discusses that FEM is a numerical technique used to approximate solutions to boundary value problems defined by partial differential equations. It can handle complex geometries, loadings, and material properties that have no analytical solution. The document outlines the historical development of FEM and describes different numerical methods like the finite difference method, variational method, and weighted residual methods that FEM evolved from. It also discusses key concepts in FEM like discretization into elements, node points, and interpolation functions.
This document provides an introduction to and overview of engineering mathematics for the GATE exam. It aims to give GATE aspirants basic knowledge of topics like linear algebra, calculus, differential equations, and probability and statistics. The author advises students to review the syllabus and previous questions for their specific paper to understand what depth of topics is required. The book contains solved problems from past papers of popular branches but is still a work in progress. Readers are asked to provide feedback to the author.
This document discusses thermography testing as a non-destructive testing method. It describes how thermography detects infrared radiation emitted from all objects based on their temperature. Defects appear as temperature variations that can be visualized using thermal cameras. There are different thermography techniques including pulsed thermography, lock-in thermography, and vibrothermography. Pulsed thermography involves heating the material with a short pulse and observing defects. Thermography allows for rapid inspection of large areas and can detect defects like delaminations. While it is useful for many applications, it has limitations in penetrating deep within materials.
The presentation the most important issues on copper and copper alloys for railway traction supply systems - conventional and high speed railway according to the UIC directives. The construction of overhead railway traction with various elements (e.g. trolley wires, catenary wires, equipment) made from copper and copper alloys will be presented. Presentation will also include the selected EU normalizations of different products designed for railway traction and their mechanical and electrical properties requirements. Furthermore authors will present and discuss the most important exploitation problems of overhead railway traction. In addition selected industrial production technologies of different railway traction copper based elements, the laboratory results and its exploitation properties will be also discussed.
MRAM is a type of non-volatile memory that uses magnetism instead of electricity to store data. It has the potential for unlimited read/write endurance, high speed performance, and lower power consumption compared to other RAM technologies. MRAM uses magnetic tunnel junctions consisting of two ferromagnetic plates separated by an insulator, where one plate's magnetization represents a 1 and the other a 0. Over time, MRAM development has improved switching speed and density, with multi-Mbit demonstration chips produced in the 2000s and research continuing on thermal assisted switching and spin torque transfer to enable smaller geometries.
This lecture provides an introduction to finite element analysis (FEA). It discusses the basic concepts of FEA, including dividing a complex object into simple finite elements and using polynomial terms to describe field quantities within each element. The lecture covers the history and applications of FEA, as well as the basic procedure, which involves meshing a structure into elements, describing element behavior, assembling elements at nodes, solving the system of equations, and calculating results. It also reviews matrix algebra concepts needed for FEA. Finally, it presents the simple example of a spring element and spring system to demonstrate the finite element modeling process.
The document discusses techniques for measuring resistivity and mapping resistivity variations across semiconductor wafers. It begins by defining resistivity and listing typical resistivity values for different materials. It then describes two common measurement techniques: two-point probe and four-point probe. Four-point probe is more accurate as it eliminates lead and contact resistance. Factors that affect measurement accuracy like sample size, carrier injection, and probe spacing are also covered. The document concludes by explaining techniques for wafer mapping like double implant, modulated photoreflectance, and optical densitometry.
MCM (Multiple-chip-module) packages multiple integrated circuits and semiconductor dies onto a single substrate. There are different types of MCMs based on the substrate material, including MCM-L which uses a laminated substrate, MCM-C which uses a ceramic substrate, and MCM-D which uses a deposited substrate with thin-film metals and dielectrics. The physical design of an MCM involves partitioning the circuit, placing the chips on the substrate while considering timing constraints, power constraints, and thermal characteristics, and routing the interconnects between chips.
CDS is the criminal face identification by capsule neural network.
Solving the common problems in image recognition such as illumination problem, scale variability, and to fight against a most common problem like pose problem, we are introducing Face Reconstruction System.
For the full video of this presentation, please visit:
http://www.embedded-vision.com/platinum-members/embedded-vision-alliance/embedded-vision-training/videos/pages/dec-2016-member-meeting-uofw
For more information about embedded vision, please visit:
http://www.embedded-vision.com
Professor Jeff Bilmes of the University of Washington delivers the presentation "Image and Video Summarization" at the December 2016 Embedded Vision Alliance Member Meeting. Bilmes provides an overview of the state of the art in image and video summarization.
Presentation of my PhD work about a preliminary design tool for flapping-wing systems. The presentation is about the definition/implementation of an aeroelastic framework that coupled an aerodynamic model of insect flight with a FEM solver, its numerical and experimental validation for preliminary design tasks and finally about its applications to the specific case of a resonant nano-air vehicle: the OVMI. Thus the designers can evaluate quickly the performance of a wing and then determine a wing geometry via an optimization environment. Enjoy!
Finite element analysis (FEA) involves breaking a model down into small pieces called finite elements. FEA was first developed in 1943 and involved numerical analysis techniques. By the 1970s, FEA was used primarily by aerospace, automotive, and defense industries due to the high cost of computers. Modern FEA involves preprocessing like meshing a model, applying properties and boundary conditions, solving the model using software, and postprocessing to analyze results like stresses and displacements.
Human Activity Recognition (HAR) systems aim to recognize human activities through sensors in order to provide assistance. The key steps in designing a HAR system are:
1) Acquiring sensor data and preprocessing it by removing noise.
2) Segmenting the preprocessed data into windows that may contain activities.
3) Extracting features from each window to reduce the data into discriminative features.
4) Training a classification model on the extracted features to predict activity labels, and evaluating the model's performance using methods like a confusion matrix.
This document proposes a system for sentiment classification in Hindi language texts. It involves building a training dataset from Hindi corpora by identifying sentiment scores. A classification model is then built and applied to new test data to predict sentiment. Key steps include tokenization, removing stop words, stemming using a Hindi stemmer, identifying sentiment using Hindi WordNet, and aggregating word-level sentiment scores to determine overall sentiment. Challenges noted include limited coverage of Hindi WordNet and accuracy issues. Future work could focus on expanding Hindi WordNet. The proposed system aims to efficiently classify sentiment in Hindi texts.
Senti-Lexicon and Analysis for Restaurant Reviews of Myanmar TextIJAEMSJORNAL
This document summarizes research on sentiment analysis of restaurant reviews written in Myanmar language. The authors built a Myanmar language sentiment lexicon and analyzed 800 restaurant reviews collected from social media. They performed preprocessing including syllable segmentation and merging. Then conducted dictionary-based sentiment analysis and opinion word extraction to classify reviews as positive, negative, or neutral. The research addressed challenges in analyzing sentiment in Myanmar language due to the lack of existing language resources.
Opinion Mining Techniques for Non-English Languages: An OverviewCSCJournals
The amount of user-generated data on web is increasing day by day giving rise to necessity of automatic tools to analyze huge data and extract useful information from it. Opinion Mining is an emerging area of research concerning with extracting and analyzing opinions expressed in texts. It is a language and domain dependent task having number of applications like recommender systems, review analysis, marketing systems, etc. Early research in the field of opinion mining has concentrated on English language. Many opinion mining tools and linguistic resources have been built for English language. Availability of information in regional languages has motivated researchers to develop tools and resources for non-English languages. In this paper we present a survey on the opinion mining research for non-English languages.
Nowadays peoples are actively involved in giving comments and reviews on social networking websites
and other websites like shopping websites, news websites etc. large number of people everyday share
their opinion on the web, results is a large number of user data is collected .users also find it trivial task
to read all the reviews and then reached into the decision. It would be better if these reviews are
classified into some category so that the user finds it easier to read. Opinion Mining or Sentiment
Analysis is a natural language processing task that mines information from various text forms such as
reviews, news, and blogs and classify them on the basis of their polarity as positive, negative or neutral.
But, from the last few years, user content in Hindi language is also increasing at a rapid rate on the Web.
So it is very important to perform opinion mining in Hindi language as well. In this paper a Hindi
language opinion mining system is proposed. The system classifies the reviews as positive, negative and
neutral for Hindi language. Negation is also handled in the proposed system. Experimental results using
reviews of movies show the effectiveness of the system.
Sentiment classification is an ongoing field and interesting area of research because of its application in various fields collecting review from people about products and social and political events through the web. Currently, Sentiment Analysis concentrates for subjective statements or on subjectivity and overlook objective statements which carry sentiment(s). During the sentiment classification more challenging problem are faced due to the ambiguous sense of words, negation words and intensifier. Due to its importance the correct sense of target word is extracted and determined for which the similarity arise in WordNet Glosses. This paper presents a survey covering the techniques and methods in sentiment analysis and challenges appear in the field.
HTY 110HA
Module 8
Presentation Project Instructions
Choose one immigrant or refugee group and prepare an audio-narrated
PowerPoint presentation about the group.
You may not choose the following groups that have already been covered extensively
within the modules:
Irish Germans Chinese Jews African Americans
Your presentation must include slides that include the following information:
1. Images/visuals for each slide in the form of:
• Photos (Required)
• Maps (Required)
• Charts
• graphs
2. An introduction your group and an overview of its place of origin.
3. Push and Pull factors that affected your chosen group
4. Skills and assets of this group
5. Liabilities of this group
6. Early settlement patterns of the group
7. Occupations in which this group was concentrated
8. Challenges this group faced
9. Settlement patterns and experiences of this group in American society today
10. Use your critical thinking skills to answer the following question:
• Based on your research, would you say that your chosen group has
attained the “American Dream?” In other words, has America been a
“Promised Land” for your group? Why or why not?
11. List of sources for your presentation (articles, websites, books, etc.)
Important
1. Slides must not be covered with paragraphs of writing. Include only short phrases
(bullets) and images/visuals. You should explain the content of each slide with
your voice, rather than with writing.
2. Do not read from your notes when recording your presentation. Your words
should flow smoothly as though you are speaking to someone rather than
reading mechanically from your notes. Try to be animated when you speak rather
than speaking in a monotone. Try to engage your listener and keep him or her
interested in what you have to say.
Page 2 of 2
Review the grading rubric to see how this presentation will be graded.
You will need to record and embed a narrative for each of your slides; i.e., say what you
would say if you were presenting in front of a live audience. As such, you will need to
attach a microphone/headset with microphone to your computer to record the audio.
Prices will vary, but an inexpensive headset with microphone will work fine.
Instructions for how to record and add audio narrations to your presentation can be
found by using the PowerPoint help feature. This link may also help you as you create
your audio PowerPoint with appropriate timing: http://office.microsoft.com/en-
us/powerpoint-help/record-and-add-narration-and-timings-to-a-slide-show-
HA010338313.aspx?CTT=1 . Note that you will not be able to edit your audio if you
choose to record the audio from within PowerPoint, so if you need to correct any
mistakes, you will just need to rerecord that audio for that particular slide.
If you wish to record and edit your audio prior to adding it to your PowerPoint
presentation, you will need recording/editing softw ...
Opinion mining in hindi language a surveyijfcstjournal
Opinions are very important in the life of human beings. These Opinions helped the humans to carry out
the decisions. As the impact of the Web is increasing day by day, Web documents can be seen as a new
source of opinion for human beings. Web contains a huge amount of information generated by the users
through blogs, forum entries, and social networking websites and so on To analyze this large amount of
information it is required to develop a method that automatically classifies the information available on the
Web. This domain is called Sentiment Analysis and Opinion Mining. Opinion Mining or Sentiment Analysis
is a natural language processing task that mine information from various text forms such as reviews, news,
and blogs and classify them on the basis of their polarity as positive, negative or neutral. But, from the last
few years, enormous increase has been seen in Hindi language on the Web. Research in opinion mining
mostly carried out in English language but it is very important to perform the opinion mining in Hindi
language also as large amount of information in Hindi is also available on the Web. This paper gives an
overview of the work that has been done Hindi language.
Opinion mining of movie reviews at document levelijitjournal
The whole world is changed rapidly and using the current technologies Internet becomes an essential
need for everyone. Web is used in every field. Most of the people use web for a common purpose like
online shopping, chatting etc. During an online shopping large number of reviews/opinions are given by
the users that reflect whether the product is good or bad. These reviews need to be explored, analyse and
organized for better decision making. Opinion Mining is a natural language processing task that deals
with finding orientation of opinion in a piece of text with respect to a topic. In this paper a document
based opinion mining system is proposed that classify the documents as positive, negative and neutral.
Negation is also handled in the proposed system. Experimental results using reviews of movies show the
effectiveness of the system.
Required resources articlesbrown, l. (2015). a quick guide toMARK547399
This document provides guidance and resources for Week 2 of a course on evidence-based instructional methods for students with mild to moderate disabilities. It discusses formative assessment and data collection methods that are part of evidence-based practice in special education. Teachers are encouraged to use individualized and efficient data collection systems to monitor student progress on IEP goals and objectives. Examples of data recording systems are provided that measure different dimensions of behavior, such as frequency, duration, and latency. The resources for the week focus on analyzing student performance data and developing assessment-driven IEP goals and objectives.
One fundamental problem in sentiment analysis is categorization of sentiment polarity. Given a piece of written text, the problem is to categorize the text into one specific sentiment polarity, positive or negative (or neutral). Based on the scope of the text, there are three distinctions of sentiment polarity categorization, namely the document level, the sentence level, and the entity and aspect level. Consider a review “I like multimedia features but the battery life sucks.†This sentence has a mixed emotion. The emotion regarding multimedia is positive whereas that regarding battery life is negative. Hence, it is required to extract only those opinions relevant to a particular feature (like battery life or multimedia) and classify them, instead of taking the complete sentence and the overall sentiment. In this paper, we present a novel approach to identify pattern specific expressions of opinion in text.
To Label or Not? Advances and Open Challenges in SE-specific Sentiment AnalysisNicole Novielli
What makes developers happy? What makes them upset? Is it possible to monitor the mood of a developer to determine when and where additional help is needed? How are emotions conveyed in developers’ communication channels and how they affect collaboration? Answering these questions involves being able to reliably implement sentiment analysis, which is the automatic processing of texts to map and capture the polarity of emotions and opinions. In this talk, I will provide an overview of recent research about sentiment analysis in software engineering (SE), address the open challenges, and provide empirically-based guidelines for safe (re)use of SE-specific tools in order to obtain meaningful results.
Keynote @SEmotion 2020: https://semotion.github.io/2020/keynote.html
Sentiment analysis is inevitable in current era. Internet is growing day-by-day. Now-a-days everything is online. We can shop, buy, and sell online. People can give feedbacks / opinions on the internet. Customers can compare among various products by analyzing the product reviews. As more and more people from different age groups and languages are becoming new internet users, we need it in regional languages. Till date most of the work related to sentiment analysis has been done in English language. But when it comes to Indian languages, not much research has done except for few languages. This paper mainly focuses on performing sentiment analysis in one of the Indian languages i.e. Marathi.
Peer Mentoring/Peer Assisted Learning Leicester Award Guidancemartau3
This document outlines the stages for a peer mentoring project, including planning, implementation, and evaluation. It discusses developing targeted support sessions or resources in areas like study skills or specific topics. The stages include analyzing student needs, designing sessions or online materials, ensuring students access the support, obtaining feedback, and reflecting on what worked and improvements for the future.
A decision tree based word sense disambiguation system in manipuri languageacijjournal
This paper manifests a primary attempt on building a word sense disambiguation system in Manipuri
language. The paper discusses related attempts made in the Manipuri language followed by the proposed
plan. A database, consisting of 650 sentences, is collected in Manipuri language in the course of the study.
Conventional positional and context based features are suggested to capture the sense of the words, which
have ambiguous and multiple senses. The proposed work is expected to predict the senses of the
polysemous words with high accuracy with the help of the suitable knowledge acquisition techniques. The
system produces an accuracy of 71.75 %.
This is my presentation on my study on The Beginning Reader at the Walden University. They are a summary of what I learnt to create a Literate environment.
Mining of product reviews at aspect levelijfcstjournal
Today’s world is a world of Internet, almost all work can be done with the help of it, from simple mobile
phone recharge to biggest business deals can be done with the help of this technology. People spent their
most of the times on surfing on the Web; it becomes a new source of entertainment, education,
communication, shopping etc. Users not only use these websites but also give their feedback and
suggestions that will be useful for other users. In this way a large amount of reviews of users are collected
on the Web that needs to be explored, analyse and organized for better decision making. Opinion Mining or
Sentiment Analysis is a Natural Language Processing and Information Extraction task that identifies the
user’s views or opinions explained in the form of positive, negative or neutral comments and quotes
underlying the text. Aspect based opinion mining is one of the level of Opinion mining that determines the
aspect of the given reviews and classify the review for each feature. In this paper an aspect based opinion
mining system is proposed to classify the reviews as positive, negative and neutral for each feature.
Negation is also handled in the proposed system. Experimental results using reviews of products show the
effectiveness of the system.
This document discusses the design of a questionnaire for an E&A Survey. It outlines the objectives of designing a questionnaire, including determining what questions to ask, how to phrase them, the proper sequencing, and optimal layout. It also covers pretesting the questionnaire to revise any issues and ensure it meets the research objectives. Specific guidelines are provided on funnel technique, filter questions, pivot questions, status bars, response buttons, and open-ended boxes for internet questionnaires. The pretesting process and preliminary tabulation are described to analyze responses and finalize the questionnaire.
IRJET- A Review on: Sentiment Polarity Analysis on Twitter Data from Diff...IRJET Journal
This document summarizes research on sentiment polarity analysis of Twitter data from different events. It discusses how Twitter data can be used for opinion mining and sentiment analysis. Several papers that used techniques like naive Bayes classifier, support vector machines, and dual sentiment analysis on Twitter data are summarized. The document also provides an overview of the key steps involved in a Twitter sentiment analysis system, including data collection, preprocessing, feature extraction, training a classification model, and evaluating accuracy. The goal of analyzing sentiments on Twitter is to understand public opinions on different topics and events.
A Survey On Sentiment Analysis Of Movie ReviewsShannon Green
This document provides a literature review on sentiment analysis of movie reviews. It discusses how sentiment analysis uses natural language processing, computational linguistics and text analytics to categorize the polarity of opinions in text as positive, negative or neutral. The document summarizes several research papers on sentiment analysis methods at the document, sentence and entity levels. Supervised machine learning classifiers like SVM generally perform better than unsupervised lexicon-based approaches. The document also discusses challenges in aspect-level sentiment analysis and analyzing sentiments in other domains like social media posts.
An Improved sentiment classification for objective word.IJSRD
Sentiment classification is an ongoing field and interesting area of research because of its application in various fields. Customer sentiments play a very important role in daily life. Currently, Sentiment classification focused on subjective statements and ignores objective statements which also carry sentiment. During the sentiment classification, problem is faced due to the ambiguous sense (meaning) of words and negation words. In word sense disambiguation method semantic scores calculated from SentiWordNet of WordNet glosses terms. The correct sense of the word is extracted and determined similarity in WordNet glosses terms. SentiWordNet extract first sense of word which used in general sense. This work aims at improving the sentiment classification by modifying the sentiment values returned by SentiWordNet and compare classification accuracy of support vector machine and naïve bays.
Similar to MTech Seminar Presentation [IIT-Bombay] (20)
The document discusses Wordnet-Affect, an extension of Wordnet that adds affective information. It was built by manually adding affect labels and information to 1903 terms, then projecting this information to Wordnet synsets. The extension was done semi-automatically using Wordnet relations. Possible applications of Wordnet-Affect include sentiment analysis, verbal expressiveness for conversational agents, and computer-assisted creativity. In general, affective computing aims to enable computers to perceive and express emotion and can be applied in areas like adaptive entertainment and modifying environments.
This document summarizes SentiWordNet, a sentiment lexicon that associates sentiment scores to WordNet synsets. It describes how SentiWordNet was built by training classifiers on seed sentiment terms and expanding the training sets iteratively through WordNet relations. Classifiers were combined to assign positive, negative, and objective scores to each synset. The document outlines enhancements such as performing random walks on the WordNet graph to update sentiment scores.
Paper Presentation: HMM-based AlignmentSagar Ahire
The paper presentation I did for HMM-based Alignment at IIT Bombay as a part of the Topics in NLP course.
The paper treats alignment as an HMM problem, which is a different approach compared to the IBM models approach which is predominantly used.
Paper Presentation: A Pendulum Swung Too FarSagar Ahire
A paper presentation made by me for the paper 'A Pendulum Swung Too Far' by Kenneth Church at IIT Bombay as a part of preparation for the MTech Seminar.
Get the paper on which this presentation is based here: http://languagelog.ldc.upenn.edu/myl/ldc/swung-too-far.pdf
NLP Asignment Final Presentation [IIT-Bombay]Sagar Ahire
The final presentation I did with Lekha & Deepali for the Natural Language Processing assignments at IIT-Bombay.
Assignments included:
1: Spelling Correction
2: Part-of-speech Tagging
3: Metaphor Detection
Sarcasm & Thwarting in Sentiment Analysis [IIT-Bombay]Sagar Ahire
1) The document discusses various linguistic phenomena including irony, sarcasm, and thwarting. It presents algorithms for detecting sarcasm and thwarting in text.
2) For sarcasm detection, a semi-supervised algorithm uses pattern-based and punctuation-based features to classify sentences, achieving up to 81% accuracy.
3) Thwarting detection compares sentiment across levels of a domain ontology, using either rule-based or machine learning approaches, with the latter approach achieving up to 81% accuracy.
Neuro-fuzzy systems combine neural networks and fuzzy logic to overcome the limitations of each. They were created to achieve the mapping precision of neural networks and the interpretability of fuzzy systems. There are different types of neuro-fuzzy systems depending on whether the inputs, outputs, and weights are crisp or fuzzy. Two common models are fuzzy systems providing input to neural networks, and neural networks providing input to fuzzy systems. Neuro-fuzzy systems have applications in domains like measuring water opacity, improving financial ratings, and automatically adjusting devices.
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIVladimir Iglovikov, Ph.D.
Presented by Vladimir Iglovikov:
- https://www.linkedin.com/in/iglovikov/
- https://x.com/viglovikov
- https://www.instagram.com/ternaus/
This presentation delves into the journey of Albumentations.ai, a highly successful open-source library for data augmentation.
Created out of a necessity for superior performance in Kaggle competitions, Albumentations has grown to become a widely used tool among data scientists and machine learning practitioners.
This case study covers various aspects, including:
People: The contributors and community that have supported Albumentations.
Metrics: The success indicators such as downloads, daily active users, GitHub stars, and financial contributions.
Challenges: The hurdles in monetizing open-source projects and measuring user engagement.
Development Practices: Best practices for creating, maintaining, and scaling open-source libraries, including code hygiene, CI/CD, and fast iteration.
Community Building: Strategies for making adoption easy, iterating quickly, and fostering a vibrant, engaged community.
Marketing: Both online and offline marketing tactics, focusing on real, impactful interactions and collaborations.
Mental Health: Maintaining balance and not feeling pressured by user demands.
Key insights include the importance of automation, making the adoption process seamless, and leveraging offline interactions for marketing. The presentation also emphasizes the need for continuous small improvements and building a friendly, inclusive community that contributes to the project's growth.
Vladimir Iglovikov brings his extensive experience as a Kaggle Grandmaster, ex-Staff ML Engineer at Lyft, sharing valuable lessons and practical advice for anyone looking to enhance the adoption of their open-source projects.
Explore more about Albumentations and join the community at:
GitHub: https://github.com/albumentations-team/albumentations
Website: https://albumentations.ai/
LinkedIn: https://www.linkedin.com/company/100504475
Twitter: https://x.com/albumentations
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!SOFTTECHHUB
As the digital landscape continually evolves, operating systems play a critical role in shaping user experiences and productivity. The launch of Nitrux Linux 3.5.0 marks a significant milestone, offering a robust alternative to traditional systems such as Windows 11. This article delves into the essence of Nitrux Linux 3.5.0, exploring its unique features, advantages, and how it stands as a compelling choice for both casual users and tech enthusiasts.
Threats to mobile devices are more prevalent and increasing in scope and complexity. Users of mobile devices desire to take full advantage of the features
available on those devices, but many of the features provide convenience and capability but sacrifice security. This best practices guide outlines steps the users can take to better protect personal devices and information.
For the full video of this presentation, please visit: https://www.edge-ai-vision.com/2024/06/building-and-scaling-ai-applications-with-the-nx-ai-manager-a-presentation-from-network-optix/
Robin van Emden, Senior Director of Data Science at Network Optix, presents the “Building and Scaling AI Applications with the Nx AI Manager,” tutorial at the May 2024 Embedded Vision Summit.
In this presentation, van Emden covers the basics of scaling edge AI solutions using the Nx tool kit. He emphasizes the process of developing AI models and deploying them globally. He also showcases the conversion of AI models and the creation of effective edge AI pipelines, with a focus on pre-processing, model conversion, selecting the appropriate inference engine for the target hardware and post-processing.
van Emden shows how Nx can simplify the developer’s life and facilitate a rapid transition from concept to production-ready applications.He provides valuable insights into developing scalable and efficient edge AI solutions, with a strong focus on practical implementation.
How to Get CNIC Information System with Paksim Ga.pptxdanishmna97
Pakdata Cf is a groundbreaking system designed to streamline and facilitate access to CNIC information. This innovative platform leverages advanced technology to provide users with efficient and secure access to their CNIC details.
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Speck&Tech
ABSTRACT: A prima vista, un mattoncino Lego e la backdoor XZ potrebbero avere in comune il fatto di essere entrambi blocchi di costruzione, o dipendenze di progetti creativi e software. La realtà è che un mattoncino Lego e il caso della backdoor XZ hanno molto di più di tutto ciò in comune.
Partecipate alla presentazione per immergervi in una storia di interoperabilità, standard e formati aperti, per poi discutere del ruolo importante che i contributori hanno in una comunità open source sostenibile.
BIO: Sostenitrice del software libero e dei formati standard e aperti. È stata un membro attivo dei progetti Fedora e openSUSE e ha co-fondato l'Associazione LibreItalia dove è stata coinvolta in diversi eventi, migrazioni e formazione relativi a LibreOffice. In precedenza ha lavorato a migrazioni e corsi di formazione su LibreOffice per diverse amministrazioni pubbliche e privati. Da gennaio 2020 lavora in SUSE come Software Release Engineer per Uyuni e SUSE Manager e quando non segue la sua passione per i computer e per Geeko coltiva la sua curiosità per l'astronomia (da cui deriva il suo nickname deneb_alpha).
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024Neo4j
Neha Bajwa, Vice President of Product Marketing, Neo4j
Join us as we explore breakthrough innovations enabled by interconnected data and AI. Discover firsthand how organizations use relationships in data to uncover contextual insights and solve our most pressing challenges – from optimizing supply chains, detecting fraud, and improving customer experiences to accelerating drug discoveries.
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
UiPath Test Automation using UiPath Test Suite series, part 5DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 5. In this session, we will cover CI/CD with devops.
Topics covered:
CI/CD with in UiPath
End-to-end overview of CI/CD pipeline with Azure devops
Speaker:
Lyndsey Byblow, Test Suite Sales Engineer @ UiPath, Inc.
In the rapidly evolving landscape of technologies, XML continues to play a vital role in structuring, storing, and transporting data across diverse systems. The recent advancements in artificial intelligence (AI) present new methodologies for enhancing XML development workflows, introducing efficiency, automation, and intelligent capabilities. This presentation will outline the scope and perspective of utilizing AI in XML development. The potential benefits and the possible pitfalls will be highlighted, providing a balanced view of the subject.
We will explore the capabilities of AI in understanding XML markup languages and autonomously creating structured XML content. Additionally, we will examine the capacity of AI to enrich plain text with appropriate XML markup. Practical examples and methodological guidelines will be provided to elucidate how AI can be effectively prompted to interpret and generate accurate XML markup.
Further emphasis will be placed on the role of AI in developing XSLT, or schemas such as XSD and Schematron. We will address the techniques and strategies adopted to create prompts for generating code, explaining code, or refactoring the code, and the results achieved.
The discussion will extend to how AI can be used to transform XML content. In particular, the focus will be on the use of AI XPath extension functions in XSLT, Schematron, Schematron Quick Fixes, or for XML content refactoring.
The presentation aims to deliver a comprehensive overview of AI usage in XML development, providing attendees with the necessary knowledge to make informed decisions. Whether you’re at the early stages of adopting AI or considering integrating it in advanced XML development, this presentation will cover all levels of expertise.
By highlighting the potential advantages and challenges of integrating AI with XML development tools and languages, the presentation seeks to inspire thoughtful conversation around the future of XML development. We’ll not only delve into the technical aspects of AI-powered XML development but also discuss practical implications and possible future directions.
Dr. Sean Tan, Head of Data Science, Changi Airport Group
Discover how Changi Airport Group (CAG) leverages graph technologies and generative AI to revolutionize their search capabilities. This session delves into the unique search needs of CAG’s diverse passengers and customers, showcasing how graph data structures enhance the accuracy and relevance of AI-generated search results, mitigating the risk of “hallucinations” and improving the overall customer journey.
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...Neo4j
Leonard Jayamohan, Partner & Generative AI Lead, Deloitte
This keynote will reveal how Deloitte leverages Neo4j’s graph power for groundbreaking digital twin solutions, achieving a staggering 100x performance boost. Discover the essential role knowledge graphs play in successful generative AI implementations. Plus, get an exclusive look at an innovative Neo4j + Generative AI solution Deloitte is developing in-house.
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionAggregage
Join Maher Hanafi, VP of Engineering at Betterworks, in this new session where he'll share a practical framework to transform Gen AI prototypes into impactful products! He'll delve into the complexities of data collection and management, model selection and optimization, and ensuring security, scalability, and responsible use.
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...Zilliz
Join us to introduce Milvus Lite, a vector database that can run on notebooks and laptops, share the same API with Milvus, and integrate with every popular GenAI framework. This webinar is perfect for developers seeking easy-to-use, well-integrated vector databases for their GenAI apps.
Communications Mining Series - Zero to Hero - Session 1DianaGray10
This session provides introduction to UiPath Communication Mining, importance and platform overview. You will acquire a good understand of the phases in Communication Mining as we go over the platform with you. Topics covered:
• Communication Mining Overview
• Why is it important?
• How can it help today’s business and the benefits
• Phases in Communication Mining
• Demo on Platform overview
• Q/A
3. Introduction
Roadmap: We Are Here
1 Introduction
2 Sentiwordnet
3 SO-CAL
4 Wordnet-Affect
5 Indian-Language Sentiwordnets
6 Conclusions
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 3 / 48
4. Introduction Overview
Overview
An overview of today’s presentation:
This presentation covers lexical resources for sentiment analysis.
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 4 / 48
5. Introduction Overview
Overview
An overview of today’s presentation:
This presentation covers lexical resources for sentiment analysis.
Four resources are covered, each using a different approach for
representation and creation:
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 4 / 48
6. Introduction Overview
Overview
An overview of today’s presentation:
This presentation covers lexical resources for sentiment analysis.
Four resources are covered, each using a different approach for
representation and creation:
Sentiwordnet, created automatically, with 3 graded scores per synset
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 4 / 48
7. Introduction Overview
Overview
An overview of today’s presentation:
This presentation covers lexical resources for sentiment analysis.
Four resources are covered, each using a different approach for
representation and creation:
Sentiwordnet, created automatically, with 3 graded scores per synset
SO-CAL, created manually, with a graded score per word
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 4 / 48
8. Introduction Overview
Overview
An overview of today’s presentation:
This presentation covers lexical resources for sentiment analysis.
Four resources are covered, each using a different approach for
representation and creation:
Sentiwordnet, created automatically, with 3 graded scores per synset
SO-CAL, created manually, with a graded score per word
Wordnet-Affect, created semi-automatically, with affect information for
each synset
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 4 / 48
9. Introduction Overview
Overview
An overview of today’s presentation:
This presentation covers lexical resources for sentiment analysis.
Four resources are covered, each using a different approach for
representation and creation:
Sentiwordnet, created automatically, with 3 graded scores per synset
SO-CAL, created manually, with a graded score per word
Wordnet-Affect, created semi-automatically, with affect information for
each synset
Indian-Language Sentiwordnet, created by projecting the English
Sentiwordnet
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 4 / 48
10. Introduction Sentiment Analysis
Sentiment Analysis
Sentiment Analysis: Determining the opinion expressed in a text
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 5 / 48
11. Introduction Sentiment Analysis
Sentiment Analysis
Sentiment Analysis: Determining the opinion expressed in a text
Approaches:
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 5 / 48
12. Introduction Sentiment Analysis
Sentiment Analysis
Sentiment Analysis: Determining the opinion expressed in a text
Approaches:
Classifier-based
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 5 / 48
13. Introduction Sentiment Analysis
Sentiment Analysis
Sentiment Analysis: Determining the opinion expressed in a text
Approaches:
Classifier-based
Lexicon-based
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 5 / 48
14. Introduction Sentiment Analysis
Why Lexicon-based Approach?
The classifier-based approach has the following drawbacks:
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 6 / 48
15. Introduction Sentiment Analysis
Why Lexicon-based Approach?
The classifier-based approach has the following drawbacks:
Domain Specificity (Example: Movie reviews mentioning ‘writer’,
‘plot’, etc.) [Bro01]
Lack of Context (Example: ‘good’ vs ‘not good’ vs ‘not very good’)
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 6 / 48
16. Introduction Sentiment Analysis
Why Lexicon-based Approach?
The classifier-based approach has the following drawbacks:
Domain Specificity (Example: Movie reviews mentioning ‘writer’,
‘plot’, etc.) [Bro01]
Lack of Context (Example: ‘good’ vs ‘not good’ vs ‘not very good’)
The lexicon-based approach aims at solving these problems.
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 6 / 48
17. Introduction Sentiment Lexicons
Sentiment Lexicons
A sentiment lexicon is a sentiment database for language units of the form
(lexical unit, sentiment).
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 7 / 48
18. Introduction Sentiment Lexicons
Sentiment Lexicons
A sentiment lexicon is a sentiment database for language units of the form
(lexical unit, sentiment).
Choices for lexical unit:
Word
Word sense
Phrase, etc.
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 7 / 48
19. Introduction Sentiment Lexicons
Sentiment Lexicons
A sentiment lexicon is a sentiment database for language units of the form
(lexical unit, sentiment).
Choices for lexical unit:
Word
Word sense
Phrase, etc.
Choices for sentiment:
Fixed categorization into ‘positive’ and ‘negative’
Graded sets like ‘strongly positive’, ‘mildly positive’, ‘neutral’, ‘mildly
negative’, ‘strongly negative’
Score in an interval like [0, 1] or [−1, +1]
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 7 / 48
21. Sentiwordnet
Roadmap: We Are Here
1 Introduction
2 Sentiwordnet
3 SO-CAL
4 Wordnet-Affect
5 Indian-Language Sentiwordnets
6 Conclusions
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 9 / 48
22. Sentiwordnet
Introduction to Sentiwordnet
Sentiwordnet [ES06] is an automatically generated sentiment lexicon made
using Wordnet. Its salient features are:
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 10 / 48
23. Sentiwordnet
Introduction to Sentiwordnet
Sentiwordnet [ES06] is an automatically generated sentiment lexicon made
using Wordnet. Its salient features are:
High coverage
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 10 / 48
24. Sentiwordnet
Introduction to Sentiwordnet
Sentiwordnet [ES06] is an automatically generated sentiment lexicon made
using Wordnet. Its salient features are:
High coverage
Support for graded sentiment labels
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 10 / 48
25. Sentiwordnet
Introduction to Sentiwordnet
Sentiwordnet [ES06] is an automatically generated sentiment lexicon made
using Wordnet. Its salient features are:
High coverage
Support for graded sentiment labels
Support for both sentiment classification and subjectivity detection
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 10 / 48
27. Sentiwordnet Structure
Structure of Sentiwordnet
Sentiwordnet = Wordnet + Sentiment Information.
Each synset s is given three sentiment scores:
Positive score Pos(s)
Negative score Neg(s)
Objective score Obj(s)
Pos(s) + Neg(s) + Obj(s) = 1
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 11 / 48
28. Sentiwordnet Structure
Structure of Sentiwordnet
Sentiwordnet = Wordnet + Sentiment Information.
Each synset s is given three sentiment scores:
Positive score Pos(s)
Negative score Neg(s)
Objective score Obj(s)
Pos(s) + Neg(s) + Obj(s) = 1
Example Synset
beautifula: Pos = 0.75, Neg = 0.00, Obj = 0.25
a
URL: http://sentiwordnet.isti.cnr.it/search.php?q=beautiful
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 11 / 48
29. Sentiwordnet Creation
Creation Steps
The top-level steps in the algorithm to create Sentiwordnet are as follows:
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 12 / 48
30. Sentiwordnet Creation
Creation Steps
The top-level steps in the algorithm to create Sentiwordnet are as follows:
1 Selection of seed set
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 12 / 48
31. Sentiwordnet Creation
Creation Steps
The top-level steps in the algorithm to create Sentiwordnet are as follows:
1 Selection of seed set
2 Expansion using Wordnet’s semantic relations
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 12 / 48
32. Sentiwordnet Creation
Creation Steps
The top-level steps in the algorithm to create Sentiwordnet are as follows:
1 Selection of seed set
2 Expansion using Wordnet’s semantic relations
3 Training of a team of ternary classifiers
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 12 / 48
33. Sentiwordnet Creation
Creation Steps
The top-level steps in the algorithm to create Sentiwordnet are as follows:
1 Selection of seed set
2 Expansion using Wordnet’s semantic relations
3 Training of a team of ternary classifiers
4 Classification of each Wordnet synset using the classifiers
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 12 / 48
34. SO-CAL
Roadmap: We Are Here
1 Introduction
2 Sentiwordnet
3 SO-CAL
4 Wordnet-Affect
5 Indian-Language Sentiwordnets
6 Conclusions
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 13 / 48
35. SO-CAL
Introduction to SO-CAL
SO-CAL is a system that uses a manually-constructed lexicon. Its salient
features are:
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 14 / 48
36. SO-CAL
Introduction to SO-CAL
SO-CAL is a system that uses a manually-constructed lexicon. Its salient
features are:
Highly detailed lexicon
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 14 / 48
37. SO-CAL
Introduction to SO-CAL
SO-CAL is a system that uses a manually-constructed lexicon. Its salient
features are:
Highly detailed lexicon
Graded sentiment label
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 14 / 48
38. SO-CAL
Introduction to SO-CAL
SO-CAL is a system that uses a manually-constructed lexicon. Its salient
features are:
Highly detailed lexicon
Graded sentiment label
Low coverage, but high accuracy
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 14 / 48
39. SO-CAL Structure
Features Used
SO-CAL classifies words into various features and treats each feature
differently in the lexicon. They are:
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 15 / 48
40. SO-CAL Structure
Features Used
SO-CAL classifies words into various features and treats each feature
differently in the lexicon. They are:
Adjectives
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 15 / 48
41. SO-CAL Structure
Features Used
SO-CAL classifies words into various features and treats each feature
differently in the lexicon. They are:
Adjectives
Nouns, Verbs, Adverbs and Multiwords
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 15 / 48
42. SO-CAL Structure
Features Used
SO-CAL classifies words into various features and treats each feature
differently in the lexicon. They are:
Adjectives
Nouns, Verbs, Adverbs and Multiwords
Intensifiers and Downtoners
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 15 / 48
43. SO-CAL Structure
Features Used
SO-CAL classifies words into various features and treats each feature
differently in the lexicon. They are:
Adjectives
Nouns, Verbs, Adverbs and Multiwords
Intensifiers and Downtoners
Negation
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 15 / 48
44. SO-CAL Structure
Features Used
SO-CAL classifies words into various features and treats each feature
differently in the lexicon. They are:
Adjectives
Nouns, Verbs, Adverbs and Multiwords
Intensifiers and Downtoners
Negation
Irrealis Blocking
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 15 / 48
45. SO-CAL Structure
Structure of SO-CAL
Sentiment scoring:
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 16 / 48
46. SO-CAL Structure
Structure of SO-CAL
Sentiment scoring:
Words are scored in [−5, +5]
Intensifiers and negation further act upon these scores
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 16 / 48
47. SO-CAL Structure
Structure of SO-CAL
Sentiment scoring:
Words are scored in [−5, +5]
Intensifiers and negation further act upon these scores
Examples
good: +3
monstrosity: −5
masterpiece: +5
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 16 / 48
48. Wordnet-Affect
Roadmap: We Are Here
1 Introduction
2 Sentiwordnet
3 SO-CAL
4 Wordnet-Affect
5 Indian-Language Sentiwordnets
6 Conclusions
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 17 / 48
49. Wordnet-Affect
Introduction to Wordnet-Affect
Wordnet-Affect [SV04] is a semi-automatically generated sentiment lexicon
made using Wordnet. It associates affective information with each
synset. Its salient features are:
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 18 / 48
50. Wordnet-Affect
Introduction to Wordnet-Affect
Wordnet-Affect [SV04] is a semi-automatically generated sentiment lexicon
made using Wordnet. It associates affective information with each
synset. Its salient features are:
Highly detailed
Ability to handle sentiment differently depending on emotion
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 18 / 48
52. Wordnet-Affect Structure
Structure of Wordnet-Affect
Wordnet-Affect = Wordnet + Affect Information.
Affect is represented using the following:
An a-label which represents the emotion,
The valency which indicates the sentiment.
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 19 / 48
53. Wordnet-Affect Structure
Structure of Wordnet-Affect
The a-label is a tree of emotions starting at a root node with each
leaf node corresponding to a synset.
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 20 / 48
54. Wordnet-Affect Structure
Structure of Wordnet-Affect
The a-label is a tree of emotions starting at a root node with each
leaf node corresponding to a synset.
The valency can be any of positive, negative, neutral or ambiguous.
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 20 / 48
58. Wordnet-Affect Creation
Creation Steps
Wordnet-Affect was created using the following steps:
Manual creation of initial resource
Automatic expansion using Wordnet relations
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 22 / 48
59. Indian-Language Sentiwordnets
Roadmap: We Are Here
1 Introduction
2 Sentiwordnet
3 SO-CAL
4 Wordnet-Affect
5 Indian-Language Sentiwordnets
6 Conclusions
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 23 / 48
60. Indian-Language Sentiwordnets
Introduction to Indian-Language Sentiwordnets
Indian-language Sentiwordnets can be created using Wordnet projection
[JRB10]. This approach has the following salient features:
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 24 / 48
61. Indian-Language Sentiwordnets
Introduction to Indian-Language Sentiwordnets
Indian-language Sentiwordnets can be created using Wordnet projection
[JRB10]. This approach has the following salient features:
Easy to create once backing resources are available
No reduplication of effort
Use of tried-and-tested representations
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 24 / 48
62. Indian-Language Sentiwordnets Creation
Creation Steps
The process of projecting a Sentiwordnet has the following steps:
Fetch a synset from the English Sentiwordnet.
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 25 / 48
63. Indian-Language Sentiwordnets Creation
Creation Steps
The process of projecting a Sentiwordnet has the following steps:
Fetch a synset from the English Sentiwordnet.
Find the corresponding Hindi synset using Indowordnet.
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 25 / 48
64. Indian-Language Sentiwordnets Creation
Creation Steps
The process of projecting a Sentiwordnet has the following steps:
Fetch a synset from the English Sentiwordnet.
Find the corresponding Hindi synset using Indowordnet.
Assign sentiment scores from English synset to Hindi synset.
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 25 / 48
65. Conclusions
Roadmap: We Are Here
1 Introduction
2 Sentiwordnet
3 SO-CAL
4 Wordnet-Affect
5 Indian-Language Sentiwordnets
6 Conclusions
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 26 / 48
66. Conclusions
A Comparison of the Resources
Criterion SWN SO-CAL WN-Affect IL-SWN
Sentiment 3 x [0, 1] [−5, +5] Affect 3 x [0, 1]
Lexical Unit Synset Word Synset Synset
Backing Resource Wordnet None Wordnet SWN + In-
dowordnet
Creation Automatic Manual Automatic Projection
No of Entries 117,000 5,000 900 16,000
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 27 / 48
68. Conclusions
Concluding Remarks
To conclude, there are three choices in making a sentiment lexicon:
Creation Approach: Manual, Automatic, Semi-Automatic or
Projection
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 28 / 48
69. Conclusions
Concluding Remarks
To conclude, there are three choices in making a sentiment lexicon:
Creation Approach: Manual, Automatic, Semi-Automatic or
Projection
Lexical Unit: Word, Synset or Higher Representations
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 28 / 48
70. Conclusions
Concluding Remarks
To conclude, there are three choices in making a sentiment lexicon:
Creation Approach: Manual, Automatic, Semi-Automatic or
Projection
Lexical Unit: Word, Synset or Higher Representations
Sentiment: Labels, Graded Scores or Affect Information
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 28 / 48
71. Conclusions
Concluding Remarks: Creation Approach
Manual Approach Automatic Approach
High annotation accuracy Low annotation accuracy
High time investment Low time investment
More details supported Less details supported
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 29 / 48
72. Conclusions
Concluding Remarks: Lexical Unit
Word Synset
Unreliable for polysemous words Reliable for polysemous words
No pre-processing required Requires WSD
Projection is comparatively difficult Projection is comparatively easier
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 30 / 48
73. Conclusions
Concluding Remarks: Sentiment
Graded scores have been shown to be better than mere labels in general.
Moreover, a graded score resource can always be converted to a
label-based resource.
Affect information can help in specialized circumstances.
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 31 / 48
75. Conclusions
Future Work
Possible directions in the future:
Automatic resources for higher-level lexical units like phrases, trees,
etc.
Manual resources for synsets
Manual lexicons for Indian languages
Techniques for building dynamic resources to incorporate ‘netspeak’
and other slang
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 32 / 48
76. Conclusions
References I
Julian Brooke, A semantic approach to automatic text sentiment
analysis, M.A. thesis, Stanford University, 2001.
Andrea Esuli and Fabrizio Sebastiani, SentiWordNet: A publicly
available lexical resource for opinion mining, Proceedings of the 5th
Conference on Language Resources and Evaluation (LREC-06), 2006,
pp. 417–422.
Andrea Esuli, Automatic generation of lexical resources for opinion
mining: Models, algorithms and applications, Ph.D. thesis, Universita
di Pisa, 2008.
Christiane Fellbaum, Wordnet: An electronic lexical database, A
Bradford Book, 1998.
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 33 / 48
77. Conclusions
References II
Vasileios Hatzivassiloglou and Kathleen R. McKeown, Predicting the
semantic orientation of adjectives, Proceedings of the 35th Annual
Meeting of the Association for Computational Linguistics and Eighth
Conference of the European Chapter of the Association for
Computational Linguistics, Association for Computational Linguistics,
1997, pp. 174–181.
Aditya Joshi, Balamurali A R, and Pushpak Bhattacharyya, A
fall-back strategy for sentiment analysis in hindi: a case study,
Proceedings of ICON 2010: 8th International Conference on Natural
Language Processing, Macmillan Publishers, India, 2010.
Jaap Kamps, Maarten Marx, Robert J. Mokken, and Maarten
de Rijke, Using wordnet to measure semantic orientations of
adjectives, Proceedings of LREC-04, 4th International Conference on
Language Resources and Evaluation, 2004, pp. 1115–1118.
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 34 / 48
78. Conclusions
References III
Ellen Riloff and Janyce Wiebe, Learning extraction patterns for
subjective expressions, Proceedings of the 2003 Conference on
Empirical Methods in Natural Language Processing, Association for
Computational Linguistics, 2003, pp. 105–112.
Carlo Strapparava and Alessandro Valitutti, WordNet-Affect: an
affective extension of WordNet, Proceedings of the 4th International
Conference on Language Resources and Evaluation (LREC-04), 2004,
pp. 1083–1086.
Peter D. Turney and Michael L. Littman, Measuring praise and
criticism: Inference of semantic orientation from association, ACM
Transactions on Information Systems 21 (2003), no. 4, 315–346.
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 35 / 48
79. Additional Slides Wordnet
Wordnet
Wordnet [Fel98] is a lexical database organized by word sense. The
fundamental unit of storage is called a synset.
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 36 / 48
80. Additional Slides Wordnet
Wordnet
Wordnet [Fel98] is a lexical database organized by word sense. The
fundamental unit of storage is called a synset.
An Example Synset
brilliant, superba: of surpassing excellence
“a brilliant performance”; “a superb actor”
a
URL: http://wordnetweb.princeton.edu/perl/webwn?s=brilliant
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 36 / 48
81. Additional Slides Wordnet
Semantic Relations in Wordnet
Wordnet synsets are linked to each other by relations called semantic
relations. Some of them are:
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 37 / 48
82. Additional Slides Wordnet
Semantic Relations in Wordnet
Wordnet synsets are linked to each other by relations called semantic
relations. Some of them are:
Antonymy
Meronymy
Hypernymy
Hyponymy
Similar to, etc.
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 37 / 48
83. Additional Slides Wordnet
Semantic Relations in Wordnet
Wordnet synsets are linked to each other by relations called semantic
relations. Some of them are:
Antonymy
Meronymy
Hypernymy
Hyponymy
Similar to, etc.
These relations are helpful in creating the training set for classifying
synsets to create Sentiwordnet.
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 37 / 48
84. Additional Slides Background
Sentiment Classification
Initial work that automatically detected the sentiment of a word led to
today’s modern lexicons. This included:
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 38 / 48
85. Additional Slides Background
Sentiment Classification
Initial work that automatically detected the sentiment of a word led to
today’s modern lexicons. This included:
Use of conjunction-separated adjectives [HM97]
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 38 / 48
86. Additional Slides Background
Sentiment Classification
Initial work that automatically detected the sentiment of a word led to
today’s modern lexicons. This included:
Use of conjunction-separated adjectives [HM97]
PMI-based Extraction using Web Queries [TL03]
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 38 / 48
87. Additional Slides Background
Sentiment Classification
Initial work that automatically detected the sentiment of a word led to
today’s modern lexicons. This included:
Use of conjunction-separated adjectives [HM97]
PMI-based Extraction using Web Queries [TL03]
Graph Expansion using Wordnet [KMMdR04]
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 38 / 48
88. Additional Slides Background
Sentiment Classification
Initial work that automatically detected the sentiment of a word led to
today’s modern lexicons. This included:
Use of conjunction-separated adjectives [HM97]
PMI-based Extraction using Web Queries [TL03]
Graph Expansion using Wordnet [KMMdR04]
Classification using Wordnet Glosses [Esu08]
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 38 / 48
89. Additional Slides Background
Subjectivity Detection
Work that identifies whether a term is indeed subjective is necessary to
filter out objective words from sentiment classification. This includes:
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 39 / 48
90. Additional Slides Background
Subjectivity Detection
Work that identifies whether a term is indeed subjective is necessary to
filter out objective words from sentiment classification. This includes:
Adapting Wordnet Glosses to Subjectivity Detection [Esu08]
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 39 / 48
91. Additional Slides Background
Subjectivity Detection
Work that identifies whether a term is indeed subjective is necessary to
filter out objective words from sentiment classification. This includes:
Adapting Wordnet Glosses to Subjectivity Detection [Esu08]
Bootstrapping Subjective Expressions from a Corpus [RW03]
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 39 / 48
92. Additional Slides Structure of SO-CAL
Adjectives
Adjectives were collected from a 500-document corpus and annotated with
a sentiment score from −5 to +5.
Examples
good: +3
sleazy: −3
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 40 / 48
93. Additional Slides Structure of SO-CAL
Nouns, Verbs, Adverbs, Multiwords
This was extended to other parts of speech and multiword expressions, for
a total of about 5,000 words.
Examples
monstrosity: −5
masterpiece: +5
inspire: +2
funny: +2 vs. act funny: −1
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 41 / 48
94. Additional Slides Structure of SO-CAL
Intensifiers and Downtoners
Intensifiers are words that increase sentiment intensity while downtoners
are words that reduce sentiment intensity. For example extraordinarily and
somewhat.
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 42 / 48
95. Additional Slides Structure of SO-CAL
Intensifiers and Downtoners
Intensifiers are words that increase sentiment intensity while downtoners
are words that reduce sentiment intensity. For example extraordinarily and
somewhat.
Intensifiers and downtoners are modeled as percentage modifiers.
Examples
slightly: −50%
extraordinarily: +50%
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 42 / 48
96. Additional Slides Structure of SO-CAL
Negation
Negation is modeled as a numeric shift of value 4 towards the opposite
sentiment.
Examples
good: +3 ⇒ not good: −1
atrocious: −5 ⇒ not atrocious: −1
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 43 / 48
97. Additional Slides Structure of SO-CAL
Irrealis Blocking
An irrealis marker is a word that indicates that the sentiment may not be
reliable because the event hasn’t actually happened. For example, ‘would’,
‘expect’, ‘if’, quotation marks, etc.
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 44 / 48
98. Additional Slides Structure of SO-CAL
Irrealis Blocking
An irrealis marker is a word that indicates that the sentiment may not be
reliable because the event hasn’t actually happened. For example, ‘would’,
‘expect’, ‘if’, quotation marks, etc.
Sentences with irrealis markers are ignored for sentiment analysis.
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 44 / 48
99. Additional Slides Sentiwordnet Creation
Seed Set
Two seed sets are created:
Lp for positive synsets
Ln for negative synsets
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 45 / 48
100. Additional Slides Sentiwordnet Creation
Seed Set
Two seed sets are created:
Lp for positive synsets
Ln for negative synsets
Each synset representation consists of:
The terms
The defninition
The sample phrases
Explicit indication of negation
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 45 / 48
101. Additional Slides Sentiwordnet Creation
Wordnet Expansion
Relations of Wordnet used for expansion:
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 46 / 48
102. Additional Slides Sentiwordnet Creation
Wordnet Expansion
Relations of Wordnet used for expansion:
Direct antonymy
Similarity
Derived from
Pertains to
Attribute
Also see
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 46 / 48
103. Additional Slides Sentiwordnet Creation
Classifiers
8 classifiers were created differing in:
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 47 / 48
104. Additional Slides Sentiwordnet Creation
Classifiers
8 classifiers were created differing in:
No of iterations of expansion (0, 2, 4, 6)
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 47 / 48
105. Additional Slides Sentiwordnet Creation
Classifiers
8 classifiers were created differing in:
No of iterations of expansion (0, 2, 4, 6)
Learning algorithm (SVM, Rocchio)
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 47 / 48
106. Additional Slides Sentiwordnet Creation
Classifiers
Each ternary classifier is a sum of 2 binary classifiers:
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 48 / 48
107. Additional Slides Sentiwordnet Creation
Classifiers
Each ternary classifier is a sum of 2 binary classifiers:
Positive vs. Not Positive
Negative vs. Not Negative
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 48 / 48
108. Additional Slides Sentiwordnet Creation
Classifiers
Each ternary classifier is a sum of 2 binary classifiers:
Positive vs. Not Positive
Negative vs. Not Negative
The results are combined as:
Positive Not Positive
Negative Objective Negative
Not Negative Positive Objective
Sagar Ahire (IIT Bombay) Sentiment Resources 02 May, 2014 48 / 48