SlideShare a Scribd company logo
1 of 11
University of Sheffield, NLP
Opinion Mining in ARCOMEM
University of Sheffield, UK
© The University of Sheffield, 1995-2013
This work is licenced under the Creative Commons Attribution-NonCommercial-ShareAlike Licence
University of Sheffield, NLP
Opinion mining in Arcomem
• The ARCOMEM project is about preserving community memories and
retrieving interesting information from social media
• Aims to answer questions such as:
• What are the opinions on crucial social events and the key people
involved?
• How are these opinions distributed in relation to demographic user
data?
• How have these opinions evolved?
• Who are the opinion leaders?
• What is their impact and influence?
University of Sheffield, NLP
Opinion Mining Subtasks
● Opinion extraction: extract the piece of text which represents the opinion
● I just bought a new camera yesterday. It was a bit expensive, but the
battery life is very good.
● Sentiment classification/orientation: extract the polarity of the opinion
(e.g. positive, negative, neutral, or classify on a numerical scale)
● negative: expensive
● positive: good battery life
● Target identification: what is the topic of the opinion expressed?
● Opinion summarisation: summarise the overall opinion about something
● price:negative, battery life: positive --> overall 7/10
University of Sheffield, NLP
Arcomem Application
• Application for opinion mining from social media in English and
German
• Uses a variety of social media including twitter, facebook and forum
posts
• Based on entity and event extraction, and uses a rule-based approach
• Structural pre-processing, specific to social media types (such as
separating the actual content of the tweet from the metadata)
• Linguistic pre-processing (including language detection), entity and
event recognition
• Additional targeted gazetteer lookup
• JAPE grammars
• Aggregation of opinions
• Dynamics
University of Sheffield, NLP
Opinion mining architecture
University of Sheffield, NLP
Why Rule-based?
• Although ML applications are typically used for Opinion Mining, this
task involves documents from many different text types, genres,
languages and domains
• This is problematic for ML because it requires many applications
trained on the different datasets, and methods to deal with
acquisition of training material
• Aim of using a rule-based system is that the bulk of it can be used
across different kinds of texts, with only the pre-processing and some
sentiment dictionaries which are domain and language-specific
University of Sheffield, NLP
Linguistic pre-processing
• Language identification (per sentence) using TextCat
• Standard tokenisation, POS tagging etc using GATE
• NE and Term recognition using modified versions of ANNIE and
TermRaider
• Event recognition using specially developed GATE application (e.g.
band performance, economic crisis, industrial strike)
University of Sheffield, NLP
Basic approach for opinion finding
• Find sentiment-containing words in a linguistic relation with
entities/events (opinion-target matching)
• Use a number of linguistic sub-components to deal with issues such as
negatives, irony, swear words etc.
• Starting from basic sentiment lookup, we then adjust the scores and
polarity of the opinions via these components
University of Sheffield, NLP
Sentiment finding components
• Flexible Gazetteer Lookup: matches lists of affect/emotion words
against the text, in any morphological variant
• Gazetteer Lookup: matches lists of affect/emotion words against the
text only in non-variant forms, i.e. exact string match (mainly the case
for specific phrases, swear words, emoticons etc.)
• Sentiment Grammars: set of hand-crafted JAPE rules which annotate
sentiments and link them with the relevant targets and opinion
holders
• RDF Generation: create the relevant RDF-XML for the annotations
according to the data model (so they can be used by other
components)
University of Sheffield, NLP
Opinion scoring
• Sentiment gazetteers (developed from sentiment words in WordNet)
have a starting “strength” score
• These get modified by context words, e.g. adverbs, swear words,
negatives and so on
The film was amazing --> The film was absolutely amazing.
The film was awful --> The film was bloody awful.
• Swear words on their own are classified as negative, however.
Damned politicians and their lies.
RIP Fergie? It's SIR Alex Ferguson to you, Carlos, you runt.
:-)
University of Sheffield, NLP
More information
You can try a simple demo of the English rule-based system at
http://demos.gate.ac.uk/arcomem/opinions/
● More details on entity, event and opinion mining from a
technical point of view are available from
http://gate.ac.uk/projects/arcomem/training.html
● Comprehensive training material about using GATE for text
mining is available at http://gate.ac.uk/family/training.html

More Related Content

What's hot

Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
harit66
 
Preference Elicitation Interface
Preference Elicitation InterfacePreference Elicitation Interface
Preference Elicitation Interface
晓愚 孟
 
Research Developments and Directions in Speech Recognition and ...
Research Developments and Directions in Speech Recognition and ...Research Developments and Directions in Speech Recognition and ...
Research Developments and Directions in Speech Recognition and ...
butest
 

What's hot (17)

Natural Language Processing: L01 introduction
Natural Language Processing: L01 introductionNatural Language Processing: L01 introduction
Natural Language Processing: L01 introduction
 
Text Analytics for Security
Text Analytics for SecurityText Analytics for Security
Text Analytics for Security
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
 
Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysis
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
 
Natural Language Processing (NLP) for Requirements Engineering (RE): an Overview
Natural Language Processing (NLP) for Requirements Engineering (RE): an OverviewNatural Language Processing (NLP) for Requirements Engineering (RE): an Overview
Natural Language Processing (NLP) for Requirements Engineering (RE): an Overview
 
Topic Modelling: Tutorial on Usage and Applications
Topic Modelling: Tutorial on Usage and ApplicationsTopic Modelling: Tutorial on Usage and Applications
Topic Modelling: Tutorial on Usage and Applications
 
Preference Elicitation Interface
Preference Elicitation InterfacePreference Elicitation Interface
Preference Elicitation Interface
 
ADAPT Centre and My NLP journey: MT, MTE, QE, MWE, NER, Treebanks, Parsing.
ADAPT Centre and My NLP journey: MT, MTE, QE, MWE, NER, Treebanks, Parsing.ADAPT Centre and My NLP journey: MT, MTE, QE, MWE, NER, Treebanks, Parsing.
ADAPT Centre and My NLP journey: MT, MTE, QE, MWE, NER, Treebanks, Parsing.
 
Research Developments and Directions in Speech Recognition and ...
Research Developments and Directions in Speech Recognition and ...Research Developments and Directions in Speech Recognition and ...
Research Developments and Directions in Speech Recognition and ...
 
Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4
 
Word representation: SVD, LSA, Word2Vec
Word representation: SVD, LSA, Word2VecWord representation: SVD, LSA, Word2Vec
Word representation: SVD, LSA, Word2Vec
 
Chinese Character Decomposition for Neural MT with Multi-Word Expressions
Chinese Character Decomposition for  Neural MT with Multi-Word ExpressionsChinese Character Decomposition for  Neural MT with Multi-Word Expressions
Chinese Character Decomposition for Neural MT with Multi-Word Expressions
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Roeder rocky 2011_46
Roeder rocky 2011_46Roeder rocky 2011_46
Roeder rocky 2011_46
 
2104 Talk @SSU
2104 Talk @SSU2104 Talk @SSU
2104 Talk @SSU
 
Online Learning to Rank
Online Learning to RankOnline Learning to Rank
Online Learning to Rank
 

Viewers also liked

Opinion Mining
Opinion MiningOpinion Mining
Opinion Mining
Ali Habeeb
 

Viewers also liked (7)

Arcomem training – Enrichment Advanced (update)
Arcomem training – Enrichment Advanced (update)Arcomem training – Enrichment Advanced (update)
Arcomem training – Enrichment Advanced (update)
 
Arcomem training – Enrichment Beginner (update)
Arcomem training – Enrichment Beginner (update)Arcomem training – Enrichment Beginner (update)
Arcomem training – Enrichment Beginner (update)
 
Arcomem training enrichment_advanced
Arcomem training enrichment_advancedArcomem training enrichment_advanced
Arcomem training enrichment_advanced
 
Opinion Mining
Opinion MiningOpinion Mining
Opinion Mining
 
Identifying features in opinion mining via intrinsic and extrinsic domain rel...
Identifying features in opinion mining via intrinsic and extrinsic domain rel...Identifying features in opinion mining via intrinsic and extrinsic domain rel...
Identifying features in opinion mining via intrinsic and extrinsic domain rel...
 
Aspect Opinion Mining From User Reviews on the web
Aspect Opinion Mining From User Reviews on the webAspect Opinion Mining From User Reviews on the web
Aspect Opinion Mining From User Reviews on the web
 
Tutorial on Opinion Mining and Sentiment Analysis
Tutorial on Opinion Mining and Sentiment AnalysisTutorial on Opinion Mining and Sentiment Analysis
Tutorial on Opinion Mining and Sentiment Analysis
 

Similar to Arcomem training opinions_advanced

NLP, Expert system and pattern recognition
NLP, Expert system and pattern recognitionNLP, Expert system and pattern recognition
NLP, Expert system and pattern recognition
Mohammad Ilyas Malik
 
Natural Language Processing for development
Natural Language Processing for developmentNatural Language Processing for development
Natural Language Processing for development
Aravind Reddy
 
Natural Language Processing for development
Natural Language Processing for developmentNatural Language Processing for development
Natural Language Processing for development
Aravind Reddy
 

Similar to Arcomem training opinions_advanced (20)

GATE: a text analysis tool for social media
GATE: a text analysis tool for social mediaGATE: a text analysis tool for social media
GATE: a text analysis tool for social media
 
Text Analysis and Semantic Search with GATE
Text Analysis and Semantic Search with GATEText Analysis and Semantic Search with GATE
Text Analysis and Semantic Search with GATE
 
Tools for (Almost) Real-Time Social Media Analysis
Tools for (Almost) Real-Time Social Media AnalysisTools for (Almost) Real-Time Social Media Analysis
Tools for (Almost) Real-Time Social Media Analysis
 
Arcomem training entities-and-events_advanced
Arcomem training entities-and-events_advancedArcomem training entities-and-events_advanced
Arcomem training entities-and-events_advanced
 
Unit 5f.pptx
Unit 5f.pptxUnit 5f.pptx
Unit 5f.pptx
 
Sld-Natural-Language-Processing-for-large-volumes-of-human-text-data-Sozzi-Br...
Sld-Natural-Language-Processing-for-large-volumes-of-human-text-data-Sozzi-Br...Sld-Natural-Language-Processing-for-large-volumes-of-human-text-data-Sozzi-Br...
Sld-Natural-Language-Processing-for-large-volumes-of-human-text-data-Sozzi-Br...
 
Supervised Learning Based Approach to Aspect Based Sentiment Analysis
Supervised Learning Based Approach to Aspect Based Sentiment AnalysisSupervised Learning Based Approach to Aspect Based Sentiment Analysis
Supervised Learning Based Approach to Aspect Based Sentiment Analysis
 
A tailor-made one-size-fits-all approach to sentiment analysis
A tailor-made one-size-fits-all approach to sentiment analysisA tailor-made one-size-fits-all approach to sentiment analysis
A tailor-made one-size-fits-all approach to sentiment analysis
 
Introduction to natural language processing (NLP)
Introduction to natural language processing (NLP)Introduction to natural language processing (NLP)
Introduction to natural language processing (NLP)
 
Social Media Sentiments Analysis
Social Media Sentiments AnalysisSocial Media Sentiments Analysis
Social Media Sentiments Analysis
 
HotSoS16 Tutorial "Text Analytics for Security" by Tao Xie and William Enck
HotSoS16 Tutorial "Text Analytics for Security" by Tao Xie and William EnckHotSoS16 Tutorial "Text Analytics for Security" by Tao Xie and William Enck
HotSoS16 Tutorial "Text Analytics for Security" by Tao Xie and William Enck
 
LLM.pdf
LLM.pdfLLM.pdf
LLM.pdf
 
NLP, Expert system and pattern recognition
NLP, Expert system and pattern recognitionNLP, Expert system and pattern recognition
NLP, Expert system and pattern recognition
 
Natural Language Processing for development
Natural Language Processing for developmentNatural Language Processing for development
Natural Language Processing for development
 
Natural Language Processing for development
Natural Language Processing for developmentNatural Language Processing for development
Natural Language Processing for development
 
II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...
II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...
II-SDV 2017: Localizing International Content for Search, Data Mining and Ana...
 
Sequence to sequence model speech recognition
Sequence to sequence model speech recognitionSequence to sequence model speech recognition
Sequence to sequence model speech recognition
 
Kishaloy Haldar and Wenqiang Lei - WESST - Sentiment Analysis of Social Media
Kishaloy Haldar and Wenqiang Lei - WESST - Sentiment Analysis of Social MediaKishaloy Haldar and Wenqiang Lei - WESST - Sentiment Analysis of Social Media
Kishaloy Haldar and Wenqiang Lei - WESST - Sentiment Analysis of Social Media
 
Twitter data analysis using R
Twitter data analysis using RTwitter data analysis using R
Twitter data analysis using R
 
Natural Language Processing Advancements By Deep Learning - A Survey
Natural Language Processing Advancements By Deep Learning - A SurveyNatural Language Processing Advancements By Deep Learning - A Survey
Natural Language Processing Advancements By Deep Learning - A Survey
 

More from arcomem

Diata12 ARCOMEM
Diata12 ARCOMEMDiata12 ARCOMEM
Diata12 ARCOMEM
arcomem
 

More from arcomem (20)

Arcomem training Specifying Crawls Advanced
Arcomem training Specifying Crawls AdvancedArcomem training Specifying Crawls Advanced
Arcomem training Specifying Crawls Advanced
 
Arcomem training Specifying Crawls Beginners
Arcomem training Specifying Crawls BeginnersArcomem training Specifying Crawls Beginners
Arcomem training Specifying Crawls Beginners
 
Arcomem training Topic Analysis Models advanced
Arcomem training Topic Analysis Models advancedArcomem training Topic Analysis Models advanced
Arcomem training Topic Analysis Models advanced
 
Arcomem training Topic Analysis Models beginners
Arcomem training Topic Analysis Models beginnersArcomem training Topic Analysis Models beginners
Arcomem training Topic Analysis Models beginners
 
Arcomem training Twitter Domain Experts advanced
Arcomem training Twitter Domain Experts advancedArcomem training Twitter Domain Experts advanced
Arcomem training Twitter Domain Experts advanced
 
Arcomem training Cultural Analysis Advanced
Arcomem training Cultural Analysis AdvancedArcomem training Cultural Analysis Advanced
Arcomem training Cultural Analysis Advanced
 
Arcomem training Cultural Analysis Beginner
Arcomem training Cultural Analysis BeginnerArcomem training Cultural Analysis Beginner
Arcomem training Cultural Analysis Beginner
 
Arcomem training twitter-dynamics_advanced
Arcomem training twitter-dynamics_advancedArcomem training twitter-dynamics_advanced
Arcomem training twitter-dynamics_advanced
 
Arcomem training system-overview_advanced
Arcomem training system-overview_advancedArcomem training system-overview_advanced
Arcomem training system-overview_advanced
 
Arcomem training specifying-crawls
Arcomem training specifying-crawlsArcomem training specifying-crawls
Arcomem training specifying-crawls
 
Arcomem training simple-text-mining_beginner
Arcomem training simple-text-mining_beginnerArcomem training simple-text-mining_beginner
Arcomem training simple-text-mining_beginner
 
Arcomem training neer_beginner
Arcomem training neer_beginnerArcomem training neer_beginner
Arcomem training neer_beginner
 
Arcomem training neer_advanced
Arcomem training neer_advancedArcomem training neer_advanced
Arcomem training neer_advanced
 
Arcomem training heritrix_beginner
Arcomem training heritrix_beginnerArcomem training heritrix_beginner
Arcomem training heritrix_beginner
 
Arcomem training heritrix_advanced
Arcomem training heritrix_advancedArcomem training heritrix_advanced
Arcomem training heritrix_advanced
 
Arcomem training enrichment_beginner
Arcomem training enrichment_beginnerArcomem training enrichment_beginner
Arcomem training enrichment_beginner
 
Arcomem training diversification
Arcomem training diversificationArcomem training diversification
Arcomem training diversification
 
Arcomem training twitter-dynamics_beginner
Arcomem training twitter-dynamics_beginnerArcomem training twitter-dynamics_beginner
Arcomem training twitter-dynamics_beginner
 
Arcomem TPDL poster
Arcomem TPDL posterArcomem TPDL poster
Arcomem TPDL poster
 
Diata12 ARCOMEM
Diata12 ARCOMEMDiata12 ARCOMEM
Diata12 ARCOMEM
 

Recently uploaded

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Recently uploaded (20)

A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 

Arcomem training opinions_advanced

  • 1. University of Sheffield, NLP Opinion Mining in ARCOMEM University of Sheffield, UK © The University of Sheffield, 1995-2013 This work is licenced under the Creative Commons Attribution-NonCommercial-ShareAlike Licence
  • 2. University of Sheffield, NLP Opinion mining in Arcomem • The ARCOMEM project is about preserving community memories and retrieving interesting information from social media • Aims to answer questions such as: • What are the opinions on crucial social events and the key people involved? • How are these opinions distributed in relation to demographic user data? • How have these opinions evolved? • Who are the opinion leaders? • What is their impact and influence?
  • 3. University of Sheffield, NLP Opinion Mining Subtasks ● Opinion extraction: extract the piece of text which represents the opinion ● I just bought a new camera yesterday. It was a bit expensive, but the battery life is very good. ● Sentiment classification/orientation: extract the polarity of the opinion (e.g. positive, negative, neutral, or classify on a numerical scale) ● negative: expensive ● positive: good battery life ● Target identification: what is the topic of the opinion expressed? ● Opinion summarisation: summarise the overall opinion about something ● price:negative, battery life: positive --> overall 7/10
  • 4. University of Sheffield, NLP Arcomem Application • Application for opinion mining from social media in English and German • Uses a variety of social media including twitter, facebook and forum posts • Based on entity and event extraction, and uses a rule-based approach • Structural pre-processing, specific to social media types (such as separating the actual content of the tweet from the metadata) • Linguistic pre-processing (including language detection), entity and event recognition • Additional targeted gazetteer lookup • JAPE grammars • Aggregation of opinions • Dynamics
  • 5. University of Sheffield, NLP Opinion mining architecture
  • 6. University of Sheffield, NLP Why Rule-based? • Although ML applications are typically used for Opinion Mining, this task involves documents from many different text types, genres, languages and domains • This is problematic for ML because it requires many applications trained on the different datasets, and methods to deal with acquisition of training material • Aim of using a rule-based system is that the bulk of it can be used across different kinds of texts, with only the pre-processing and some sentiment dictionaries which are domain and language-specific
  • 7. University of Sheffield, NLP Linguistic pre-processing • Language identification (per sentence) using TextCat • Standard tokenisation, POS tagging etc using GATE • NE and Term recognition using modified versions of ANNIE and TermRaider • Event recognition using specially developed GATE application (e.g. band performance, economic crisis, industrial strike)
  • 8. University of Sheffield, NLP Basic approach for opinion finding • Find sentiment-containing words in a linguistic relation with entities/events (opinion-target matching) • Use a number of linguistic sub-components to deal with issues such as negatives, irony, swear words etc. • Starting from basic sentiment lookup, we then adjust the scores and polarity of the opinions via these components
  • 9. University of Sheffield, NLP Sentiment finding components • Flexible Gazetteer Lookup: matches lists of affect/emotion words against the text, in any morphological variant • Gazetteer Lookup: matches lists of affect/emotion words against the text only in non-variant forms, i.e. exact string match (mainly the case for specific phrases, swear words, emoticons etc.) • Sentiment Grammars: set of hand-crafted JAPE rules which annotate sentiments and link them with the relevant targets and opinion holders • RDF Generation: create the relevant RDF-XML for the annotations according to the data model (so they can be used by other components)
  • 10. University of Sheffield, NLP Opinion scoring • Sentiment gazetteers (developed from sentiment words in WordNet) have a starting “strength” score • These get modified by context words, e.g. adverbs, swear words, negatives and so on The film was amazing --> The film was absolutely amazing. The film was awful --> The film was bloody awful. • Swear words on their own are classified as negative, however. Damned politicians and their lies. RIP Fergie? It's SIR Alex Ferguson to you, Carlos, you runt. :-)
  • 11. University of Sheffield, NLP More information You can try a simple demo of the English rule-based system at http://demos.gate.ac.uk/arcomem/opinions/ ● More details on entity, event and opinion mining from a technical point of view are available from http://gate.ac.uk/projects/arcomem/training.html ● Comprehensive training material about using GATE for text mining is available at http://gate.ac.uk/family/training.html