SlideShare a Scribd company logo
EVALITA 2014 
EVALUATION OF NLP AND SPEECH TOOLS FOR ITALIAN 
SENTIPOLC 
SENTIment POLarity Classification 
di.unito.it/sentipolc14 
Valerio Basile, University of Groningen 
Andrea Bolioli, CELI, Torino 
Malvina Nissim, University of Groningen, University of Bologna 
Viviana Patti, University of Torino, Dip. di Informatica 
Paolo Rosso, Universitat Politècnica de València
Task description 
A new shared task in the Evalita evaluation campaign 
• sentiment analysis at the message level on Italian tweets 
• three independent sub-tasks: 
Task 1 - Subjectivity Classification: a system 
must decide whether a given message is 
subjective or objective 
Task 2 - Polarity Classification: 
a system must decide whether a given message is of 
positive, negative, neutral or mixed sentiment 
Task 3 (Pilot): Irony Detection: a system 
must decide whether a given message is 
ironic or not 
EVALITA 
2014 
Workshop 
December 
11 
2014, 
Pisa 
Semeval 
2013, 
task 
2 
Semeval 
2014, 
task 
9
Development and Test Data 
Collection 
• 6,448 (training set 4,513; test set: 1,935) 
tweets derived from two existing corpora: 
o SENTI-TUT (Bosco, Patti, Bolioli, 2013) 
o TWITA (Basile and Nissim, 2013) 
EVALITA 
2014 
Workshop 
December 
11 
2014, 
Pisa 
• two main components: 
o political: extraction based on 
specific keywords and hashtags 
marking political topics 
(#grillo, Monti) 
o generic: random tweets on any topic.
Development and Test Data 
Data format 
• Each tweet is presented as a sequence of comma separated fields 
id, subj, pos, neg, iro, topic, text 
• Manual annotation: subj (subjectivity)/pos (positive polarity)/neg 
(negative polarity)/iro (ironic) 
• Apart from the id, which is a string of numeric characters, 
the value of all the other fields can be either “0” or “1”. 
• For the four manually annotated classes: 
o 0 means that the feature is absent 
o 1 means that the feature is present 
EVALITA 
2014 
Workshop 
December 
11 
2014, 
Pisa
Development and Test Data 
Data format 
• Each tweet is presented as a sequence of comma separated fields 
id, subj, pos, neg, iro, topic, text 
Constraints in the annotation scheme: 
• An objective tweet will not have any polarity nor irony 
• A subjective tweet can exhibit at the same time positive and 
negative polarity (mixed!) 
• A subjective tweet can exhibit no specific polarity and be just 
neutral but with a clear subjective flavour 
• An ironic tweet is always subjective and it must have one 
defined polarity 
EVALITA 
2014 
Workshop 
December 
11 
2014, 
Pisa
Development and Test Data 
• Constraints in the annotation scheme: 
EVALITA 
2014 
Workshop 
December 
11 
2014, 
Pisa
EVALITA 
2014 
Workshop 
December 
11 
2014, 
Pisa 
Examples 
l’articolo di Roberto Ciccarelli dal manifesto di oggi 
http://fb.me/1BQVy5WAk 
o subj = 0 
o pos = 0 
o neg = 0 
o iro = 0 
• Objective tweet: …0, 0, 0, 0… 
id, subj, pos, neg, iro, topic, text
EVALITA 
2014 
Workshop 
December 
11 
2014, 
Pisa 
Examples 
Dati negativi da Confindustria che spera nel nuovo 
governo Monti. Castiglione: “Avanti con le riforme” 
http://t.co/kIKnbFY7 
o subj = 1 
o pos = 1 
o neg = 1 
o iro = 0 
• Subjective, mixed: …1, 1, 1, 0 … 
id, subj, pos, neg, iro, topic, text
EVALITA 
2014 
Workshop 
December 
11 
2014, 
Pisa 
Examples 
Botta di ottimismo a #lInfedele: Governo Monti, o la 
va o la spacca. 
o subj = 1 
o pos = 0 
o neg = 1 
o iro = 1 
• Subjective, negative, ironic: …1, 0, 1, 1 … 
id, subj, pos, neg, iro, topic, text 
• Underlying assumptions on irony 
o 1111: not allowed! 
o 1001: not allowed! 
o 0XX1: not allowed! 
An 
ironic 
tweet 
is 
always 
subjec?ve 
and 
it 
must 
have 
one 
defined 
polarity
Development and Test Data 
Data format 
• Each tweet is presented as a sequence of comma separated fields 
id, subj, pos, neg, ironic, topic, text 
• id: Twitter status id (necessary to retrieve the text). 
• topic: 0 means “generic” and 1 means “political”. 
• text: this column will be filled with the 
actual tweet's text 
o Due to Twitter’s privacy policy, tweets 
cannot be distributed directly 
o Participants were provided with a web interface (RESTful Web API 
technology) through which they could download the tweet’s text on the 
fly --when still available-- for all the ids provided 
EVALITA 
2014 
Workshop 
December 
11 
2014, 
Pisa 
TwiEer’s 
peculiar 
issue 
in 
the 
evalua?on 
phase: 
same 
training/ 
test 
data 
for 
all 
teams
Evaluation 
• Evaluation set: tweets classified by all participating 
teams 
o current twitter policies! 
o no big differences 
• Metrics: precision, recall and F-measure for each 
field/class 
o polarity classification: adapted in order to take 
into account the peculiarities of the annotation 
scheme (e.g. possible to have mixed sentiment) 
o details on evaluation metrics applied for the 
evaluation of the participant results in the 
organizers’ report 
EVALITA 
2014 
Workshop 
December 
11 
2014, 
Pisa
Participants 
• A total of 11 teams from 4 different countries participated in 
at least one of the three tasks 
• SENTIPOLC was the most participated Evalita task with a total 
of 35 submitted runs: great interest of the NLP community on 
sentiment analysis in Italian social media 
o Most of the submissions were constrained (training only on task data) 
EVALITA 
2014 
Workshop 
December 
11 
2014, 
Pisa 
• Only academy (no industry)
Results – Task 1 subjectivity 
• The highest F-score was achieved by uniba2930 at 0.7140 
(constrained run) 
o All participating systems show an improvement over the 
baseline 
• majority class baseline (for all tasks) 
EVALITA 
2014 
Workshop 
December 
11 
2014, 
Pisa
Results – Task 2 polarity 
• Again, the highest F-score was achieved by uniba2930 at 
0.6771 (constrained). 
EVALITA 
2014 
Workshop 
December 
11 
2014, 
Pisa 
o the most popular 
subtask 
o all participating 
systems show 
an improvement 
over the baseline
Results – pilot Task 3 irony 
• The highest F-score was achieved by UNITOR at 0.5959 
(unconstrained run) and 0.5759 (constrained run). 
o some systems score very close to the baseline: 
hight complexity of the task 
EVALITA 
2014 
Workshop 
December 
11 
2014, 
Pisa
Comparison, issues 
EVALITA 
2014 
Workshop 
December 
11 
2014, 
Pisa 
• Comparison lines: 
o exploitation of further Twitter annotated data for training 
o classification framework (approaches, algorithms, features) 
o exploitation of available resources (e.g. sentiment lexicons, NLP 
tools, etc.), 
o interdependency of tasks in case of systems participating in 
several subtasks 
…in the organizers’ report 
• Issues 
o Irony and polarity reversal 
o Mixed sentiment is hard to recognise
What’s next 
• uniba2930: best system on tasks 1 & 2 
Pierpaolo Basile and Nicole Novielli 
UNIBA at EVALITA 2014-SENTIPOLC Task: Predicting tweet 
sentiment polarity combining micro-blogging, lexicon and 
semantic features 
• UNITOR: best system on pilot task 3 
Giuseppe Castellucci, Danilo Croce, Diego De Cao, Roberto 
Basili 
A Multiple Kernel Approach for Twitter Sentiment Analysis in 
Italian 
Discussion! 
17.45: Poster session 
EVALITA 
2014 
Workshop 
December 
11 
2014, 
Pisa 
Proceedings on-line! 
http://clic.humnet.unipi.it/proceedings/Proceedings-EVALITA-2014.pdf
Discussion 
• Feedback from 2014 Sentipolc teams? 
• Next edition?  Ideas? Proposal? 
EVALITA 
2014 
Workshop 
December 
11 
2014, 
Pisa 
o data 
- Twitter data 
Facebook data (conversational threads, friends 
network)? 
- format 
o tasks 
- aspect-based sentiment analysis (target)? emotions? 
o systems 
- Sentipolc systems available as services via API/ 
download? 
o evaluation metrics

More Related Content

Similar to SENTIment POLarity Classification Task - Sentipolc@Evalita 2014

How we can teach Educational Robotics to foster 21st learning skills through ...
How we can teach Educational Robotics to foster 21st learning skills through ...How we can teach Educational Robotics to foster 21st learning skills through ...
How we can teach Educational Robotics to foster 21st learning skills through ...
Alexandra Sierra
 
Introduction to nlp
Introduction to nlpIntroduction to nlp
Introduction to nlp
Amaan Shaikh
 
UNIBA at EVALITA 2014-SENTIPOLC Task: Predicting tweet sentiment polarity com...
UNIBA at EVALITA 2014-SENTIPOLC Task: Predicting tweet sentiment polarity com...UNIBA at EVALITA 2014-SENTIPOLC Task: Predicting tweet sentiment polarity com...
UNIBA at EVALITA 2014-SENTIPOLC Task: Predicting tweet sentiment polarity com...
Nicole Novielli
 
An Ensemble Model for Cross-Domain Polarity Classification on Twitter
An Ensemble Model for Cross-Domain Polarity Classification on TwitterAn Ensemble Model for Cross-Domain Polarity Classification on Twitter
An Ensemble Model for Cross-Domain Polarity Classification on Twitter
Symeon Papadopoulos
 
Venturing from the “Back Room”: Do Technical Services Librarians have a Role ...
Venturing from the “Back Room”: Do Technical Services Librarians have a Role ...Venturing from the “Back Room”: Do Technical Services Librarians have a Role ...
Venturing from the “Back Room”: Do Technical Services Librarians have a Role ...
Charleston Conference
 
Cross-cultural discussions in a 3D virtual environment and their affordances ...
Cross-cultural discussions in a 3D virtual environment and their affordances ...Cross-cultural discussions in a 3D virtual environment and their affordances ...
Cross-cultural discussions in a 3D virtual environment and their affordances ...
Kristi Jauregi Ondarra
 
Perceptual Data_04182016
Perceptual Data_04182016Perceptual Data_04182016
Perceptual Data_04182016Kunal Dash
 
Case study narratives
Case study narrativesCase study narratives
Case study narratives
Gemma Derrick
 
The CLEF Initiative From 2010 to 2012 and Onwards
The CLEF Initiative From 2010 to 2012 and OnwardsThe CLEF Initiative From 2010 to 2012 and Onwards
The CLEF Initiative From 2010 to 2012 and OnwardsMediaEval2012
 
To Label or Not? Advances and Open Challenges in SE-specific Sentiment Analysis
To Label or Not? Advances and Open Challenges in SE-specific Sentiment AnalysisTo Label or Not? Advances and Open Challenges in SE-specific Sentiment Analysis
To Label or Not? Advances and Open Challenges in SE-specific Sentiment Analysis
Nicole Novielli
 
Materiales didacticos JMRT
Materiales didacticos JMRTMateriales didacticos JMRT
Materiales didacticos JMRT
Manuel Romero
 
[OOFHEC2018] Anna Dipace:Analysing emotions to personalise learning on EduOpe...
[OOFHEC2018] Anna Dipace:Analysing emotions to personalise learning on EduOpe...[OOFHEC2018] Anna Dipace:Analysing emotions to personalise learning on EduOpe...
[OOFHEC2018] Anna Dipace:Analysing emotions to personalise learning on EduOpe...
EADTU
 
14.30 kurilovas bireniene
14.30 kurilovas bireniene14.30 kurilovas bireniene
14.30 kurilovas birenieneprogettoacariss
 
Designing Rebo, the Reflection Guidance Chatbot - Poster
Designing Rebo, the Reflection Guidance Chatbot - PosterDesigning Rebo, the Reflection Guidance Chatbot - Poster
Designing Rebo, the Reflection Guidance Chatbot - Poster
IrmtraudWolfbauer
 
Infographics: E-volving Instruction for Visual Literacy
Infographics: E-volving Instruction for Visual LiteracyInfographics: E-volving Instruction for Visual Literacy
Infographics: E-volving Instruction for Visual Literacy
Melanie Parlette-Stewart
 
Reflections on teaching and information behaviour in a Futurelearn MOOC
Reflections on teaching and information behaviour in a Futurelearn MOOCReflections on teaching and information behaviour in a Futurelearn MOOC
Reflections on teaching and information behaviour in a Futurelearn MOOC
Sheila Webber
 
Listening deck conference call 3
Listening deck conference call 3Listening deck conference call 3
Listening deck conference call 3Julie Pippert
 
Evaluating without numbers: exploration of a story based evaluation method
Evaluating without numbers: exploration of a story based evaluation methodEvaluating without numbers: exploration of a story based evaluation method
Evaluating without numbers: exploration of a story based evaluation method
LearningandTeaching
 
Interestingness of articles using twitter sentiments
Interestingness of articles using twitter sentimentsInterestingness of articles using twitter sentiments
Interestingness of articles using twitter sentiments
Arpit Bhayani
 
Should feedback be at the centre of Personalised Learning?
Should feedback be at the centre of Personalised Learning?Should feedback be at the centre of Personalised Learning?
Should feedback be at the centre of Personalised Learning?
Denise Whitelock
 

Similar to SENTIment POLarity Classification Task - Sentipolc@Evalita 2014 (20)

How we can teach Educational Robotics to foster 21st learning skills through ...
How we can teach Educational Robotics to foster 21st learning skills through ...How we can teach Educational Robotics to foster 21st learning skills through ...
How we can teach Educational Robotics to foster 21st learning skills through ...
 
Introduction to nlp
Introduction to nlpIntroduction to nlp
Introduction to nlp
 
UNIBA at EVALITA 2014-SENTIPOLC Task: Predicting tweet sentiment polarity com...
UNIBA at EVALITA 2014-SENTIPOLC Task: Predicting tweet sentiment polarity com...UNIBA at EVALITA 2014-SENTIPOLC Task: Predicting tweet sentiment polarity com...
UNIBA at EVALITA 2014-SENTIPOLC Task: Predicting tweet sentiment polarity com...
 
An Ensemble Model for Cross-Domain Polarity Classification on Twitter
An Ensemble Model for Cross-Domain Polarity Classification on TwitterAn Ensemble Model for Cross-Domain Polarity Classification on Twitter
An Ensemble Model for Cross-Domain Polarity Classification on Twitter
 
Venturing from the “Back Room”: Do Technical Services Librarians have a Role ...
Venturing from the “Back Room”: Do Technical Services Librarians have a Role ...Venturing from the “Back Room”: Do Technical Services Librarians have a Role ...
Venturing from the “Back Room”: Do Technical Services Librarians have a Role ...
 
Cross-cultural discussions in a 3D virtual environment and their affordances ...
Cross-cultural discussions in a 3D virtual environment and their affordances ...Cross-cultural discussions in a 3D virtual environment and their affordances ...
Cross-cultural discussions in a 3D virtual environment and their affordances ...
 
Perceptual Data_04182016
Perceptual Data_04182016Perceptual Data_04182016
Perceptual Data_04182016
 
Case study narratives
Case study narrativesCase study narratives
Case study narratives
 
The CLEF Initiative From 2010 to 2012 and Onwards
The CLEF Initiative From 2010 to 2012 and OnwardsThe CLEF Initiative From 2010 to 2012 and Onwards
The CLEF Initiative From 2010 to 2012 and Onwards
 
To Label or Not? Advances and Open Challenges in SE-specific Sentiment Analysis
To Label or Not? Advances and Open Challenges in SE-specific Sentiment AnalysisTo Label or Not? Advances and Open Challenges in SE-specific Sentiment Analysis
To Label or Not? Advances and Open Challenges in SE-specific Sentiment Analysis
 
Materiales didacticos JMRT
Materiales didacticos JMRTMateriales didacticos JMRT
Materiales didacticos JMRT
 
[OOFHEC2018] Anna Dipace:Analysing emotions to personalise learning on EduOpe...
[OOFHEC2018] Anna Dipace:Analysing emotions to personalise learning on EduOpe...[OOFHEC2018] Anna Dipace:Analysing emotions to personalise learning on EduOpe...
[OOFHEC2018] Anna Dipace:Analysing emotions to personalise learning on EduOpe...
 
14.30 kurilovas bireniene
14.30 kurilovas bireniene14.30 kurilovas bireniene
14.30 kurilovas bireniene
 
Designing Rebo, the Reflection Guidance Chatbot - Poster
Designing Rebo, the Reflection Guidance Chatbot - PosterDesigning Rebo, the Reflection Guidance Chatbot - Poster
Designing Rebo, the Reflection Guidance Chatbot - Poster
 
Infographics: E-volving Instruction for Visual Literacy
Infographics: E-volving Instruction for Visual LiteracyInfographics: E-volving Instruction for Visual Literacy
Infographics: E-volving Instruction for Visual Literacy
 
Reflections on teaching and information behaviour in a Futurelearn MOOC
Reflections on teaching and information behaviour in a Futurelearn MOOCReflections on teaching and information behaviour in a Futurelearn MOOC
Reflections on teaching and information behaviour in a Futurelearn MOOC
 
Listening deck conference call 3
Listening deck conference call 3Listening deck conference call 3
Listening deck conference call 3
 
Evaluating without numbers: exploration of a story based evaluation method
Evaluating without numbers: exploration of a story based evaluation methodEvaluating without numbers: exploration of a story based evaluation method
Evaluating without numbers: exploration of a story based evaluation method
 
Interestingness of articles using twitter sentiments
Interestingness of articles using twitter sentimentsInterestingness of articles using twitter sentiments
Interestingness of articles using twitter sentiments
 
Should feedback be at the centre of Personalised Learning?
Should feedback be at the centre of Personalised Learning?Should feedback be at the centre of Personalised Learning?
Should feedback be at the centre of Personalised Learning?
 

Recently uploaded

SluggerPunk Angel Investor Final Proposal
SluggerPunk Angel Investor Final ProposalSluggerPunk Angel Investor Final Proposal
SluggerPunk Angel Investor Final Proposal
grogshiregames
 
快速办理(BCR毕业证书)加州大学河滨分校毕业证文凭证书一模一样
快速办理(BCR毕业证书)加州大学河滨分校毕业证文凭证书一模一样快速办理(BCR毕业证书)加州大学河滨分校毕业证文凭证书一模一样
快速办理(BCR毕业证书)加州大学河滨分校毕业证文凭证书一模一样
ryxqoswi
 
Unlock TikTok Success with Sociocosmos..
Unlock TikTok Success with Sociocosmos..Unlock TikTok Success with Sociocosmos..
Unlock TikTok Success with Sociocosmos..
SocioCosmos
 
LORRAINE ANDREI_LEQUIGAN_HOW TO USE TELEGRAM
LORRAINE ANDREI_LEQUIGAN_HOW TO USE TELEGRAMLORRAINE ANDREI_LEQUIGAN_HOW TO USE TELEGRAM
LORRAINE ANDREI_LEQUIGAN_HOW TO USE TELEGRAM
lorraineandreiamcidl
 
Project Serenity — 33% Life-time Commissions.docx
Project Serenity — 33% Life-time Commissions.docxProject Serenity — 33% Life-time Commissions.docx
Project Serenity — 33% Life-time Commissions.docx
zeqirielmedina8
 
Your Path to YouTube Stardom Starts Here
Your Path to YouTube Stardom Starts HereYour Path to YouTube Stardom Starts Here
Your Path to YouTube Stardom Starts Here
SocioCosmos
 
The Evolution of SEO: Insights from a Leading Digital Marketing Agency
The Evolution of SEO: Insights from a Leading Digital Marketing AgencyThe Evolution of SEO: Insights from a Leading Digital Marketing Agency
The Evolution of SEO: Insights from a Leading Digital Marketing Agency
Digital Marketing Lab
 
SluggerPunk Final Angel Investor Proposal
SluggerPunk Final Angel Investor ProposalSluggerPunk Final Angel Investor Proposal
SluggerPunk Final Angel Investor Proposal
grogshiregames
 
EASY TUTORIAL OF HOW TO USE G-TEAMS BY: FEBLESS HERNANE
EASY TUTORIAL OF HOW TO USE G-TEAMS BY: FEBLESS HERNANEEASY TUTORIAL OF HOW TO USE G-TEAMS BY: FEBLESS HERNANE
EASY TUTORIAL OF HOW TO USE G-TEAMS BY: FEBLESS HERNANE
Febless Hernane
 
Your LinkedIn Success Starts Here.......
Your LinkedIn Success Starts Here.......Your LinkedIn Success Starts Here.......
Your LinkedIn Success Starts Here.......
SocioCosmos
 
Exploring The Dimensions and Dynamics of Felt Obligation: A Bibliometric Anal...
Exploring The Dimensions and Dynamics of Felt Obligation: A Bibliometric Anal...Exploring The Dimensions and Dynamics of Felt Obligation: A Bibliometric Anal...
Exploring The Dimensions and Dynamics of Felt Obligation: A Bibliometric Anal...
AJHSSR Journal
 
Grow Your Reddit Community Fast.........
Grow Your Reddit Community Fast.........Grow Your Reddit Community Fast.........
Grow Your Reddit Community Fast.........
SocioCosmos
 
Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...
Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...
Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...
AJHSSR Journal
 
Multilingual SEO Services | Multilingual Keyword Research | Filose
Multilingual SEO Services |  Multilingual Keyword Research | FiloseMultilingual SEO Services |  Multilingual Keyword Research | Filose
Multilingual SEO Services | Multilingual Keyword Research | Filose
madisonsmith478075
 
Surat Digital Marketing School - course curriculum
Surat Digital Marketing School - course curriculumSurat Digital Marketing School - course curriculum
Surat Digital Marketing School - course curriculum
digitalcourseshop4
 
7 Tips on Social Media Marketing strategy
7 Tips on Social Media Marketing strategy7 Tips on Social Media Marketing strategy
7 Tips on Social Media Marketing strategy
Digital Marketing Lab
 
Buy Pinterest Followers, Reactions & Repins Go Viral on Pinterest with Socio...
Buy Pinterest Followers, Reactions & Repins  Go Viral on Pinterest with Socio...Buy Pinterest Followers, Reactions & Repins  Go Viral on Pinterest with Socio...
Buy Pinterest Followers, Reactions & Repins Go Viral on Pinterest with Socio...
SocioCosmos
 
Social Media Marketing Strategies .
Social Media Marketing Strategies                     .Social Media Marketing Strategies                     .
Social Media Marketing Strategies .
Virtual Real Design
 
HOW TO USE FACEBOOK _ by Clarissa Credito
HOW TO USE FACEBOOK _ by Clarissa CreditoHOW TO USE FACEBOOK _ by Clarissa Credito
HOW TO USE FACEBOOK _ by Clarissa Credito
ClarissaAlanoCredito
 

Recently uploaded (19)

SluggerPunk Angel Investor Final Proposal
SluggerPunk Angel Investor Final ProposalSluggerPunk Angel Investor Final Proposal
SluggerPunk Angel Investor Final Proposal
 
快速办理(BCR毕业证书)加州大学河滨分校毕业证文凭证书一模一样
快速办理(BCR毕业证书)加州大学河滨分校毕业证文凭证书一模一样快速办理(BCR毕业证书)加州大学河滨分校毕业证文凭证书一模一样
快速办理(BCR毕业证书)加州大学河滨分校毕业证文凭证书一模一样
 
Unlock TikTok Success with Sociocosmos..
Unlock TikTok Success with Sociocosmos..Unlock TikTok Success with Sociocosmos..
Unlock TikTok Success with Sociocosmos..
 
LORRAINE ANDREI_LEQUIGAN_HOW TO USE TELEGRAM
LORRAINE ANDREI_LEQUIGAN_HOW TO USE TELEGRAMLORRAINE ANDREI_LEQUIGAN_HOW TO USE TELEGRAM
LORRAINE ANDREI_LEQUIGAN_HOW TO USE TELEGRAM
 
Project Serenity — 33% Life-time Commissions.docx
Project Serenity — 33% Life-time Commissions.docxProject Serenity — 33% Life-time Commissions.docx
Project Serenity — 33% Life-time Commissions.docx
 
Your Path to YouTube Stardom Starts Here
Your Path to YouTube Stardom Starts HereYour Path to YouTube Stardom Starts Here
Your Path to YouTube Stardom Starts Here
 
The Evolution of SEO: Insights from a Leading Digital Marketing Agency
The Evolution of SEO: Insights from a Leading Digital Marketing AgencyThe Evolution of SEO: Insights from a Leading Digital Marketing Agency
The Evolution of SEO: Insights from a Leading Digital Marketing Agency
 
SluggerPunk Final Angel Investor Proposal
SluggerPunk Final Angel Investor ProposalSluggerPunk Final Angel Investor Proposal
SluggerPunk Final Angel Investor Proposal
 
EASY TUTORIAL OF HOW TO USE G-TEAMS BY: FEBLESS HERNANE
EASY TUTORIAL OF HOW TO USE G-TEAMS BY: FEBLESS HERNANEEASY TUTORIAL OF HOW TO USE G-TEAMS BY: FEBLESS HERNANE
EASY TUTORIAL OF HOW TO USE G-TEAMS BY: FEBLESS HERNANE
 
Your LinkedIn Success Starts Here.......
Your LinkedIn Success Starts Here.......Your LinkedIn Success Starts Here.......
Your LinkedIn Success Starts Here.......
 
Exploring The Dimensions and Dynamics of Felt Obligation: A Bibliometric Anal...
Exploring The Dimensions and Dynamics of Felt Obligation: A Bibliometric Anal...Exploring The Dimensions and Dynamics of Felt Obligation: A Bibliometric Anal...
Exploring The Dimensions and Dynamics of Felt Obligation: A Bibliometric Anal...
 
Grow Your Reddit Community Fast.........
Grow Your Reddit Community Fast.........Grow Your Reddit Community Fast.........
Grow Your Reddit Community Fast.........
 
Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...
Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...
Improving Workplace Safety Performance in Malaysian SMEs: The Role of Safety ...
 
Multilingual SEO Services | Multilingual Keyword Research | Filose
Multilingual SEO Services |  Multilingual Keyword Research | FiloseMultilingual SEO Services |  Multilingual Keyword Research | Filose
Multilingual SEO Services | Multilingual Keyword Research | Filose
 
Surat Digital Marketing School - course curriculum
Surat Digital Marketing School - course curriculumSurat Digital Marketing School - course curriculum
Surat Digital Marketing School - course curriculum
 
7 Tips on Social Media Marketing strategy
7 Tips on Social Media Marketing strategy7 Tips on Social Media Marketing strategy
7 Tips on Social Media Marketing strategy
 
Buy Pinterest Followers, Reactions & Repins Go Viral on Pinterest with Socio...
Buy Pinterest Followers, Reactions & Repins  Go Viral on Pinterest with Socio...Buy Pinterest Followers, Reactions & Repins  Go Viral on Pinterest with Socio...
Buy Pinterest Followers, Reactions & Repins Go Viral on Pinterest with Socio...
 
Social Media Marketing Strategies .
Social Media Marketing Strategies                     .Social Media Marketing Strategies                     .
Social Media Marketing Strategies .
 
HOW TO USE FACEBOOK _ by Clarissa Credito
HOW TO USE FACEBOOK _ by Clarissa CreditoHOW TO USE FACEBOOK _ by Clarissa Credito
HOW TO USE FACEBOOK _ by Clarissa Credito
 

SENTIment POLarity Classification Task - Sentipolc@Evalita 2014

  • 1. EVALITA 2014 EVALUATION OF NLP AND SPEECH TOOLS FOR ITALIAN SENTIPOLC SENTIment POLarity Classification di.unito.it/sentipolc14 Valerio Basile, University of Groningen Andrea Bolioli, CELI, Torino Malvina Nissim, University of Groningen, University of Bologna Viviana Patti, University of Torino, Dip. di Informatica Paolo Rosso, Universitat Politècnica de València
  • 2. Task description A new shared task in the Evalita evaluation campaign • sentiment analysis at the message level on Italian tweets • three independent sub-tasks: Task 1 - Subjectivity Classification: a system must decide whether a given message is subjective or objective Task 2 - Polarity Classification: a system must decide whether a given message is of positive, negative, neutral or mixed sentiment Task 3 (Pilot): Irony Detection: a system must decide whether a given message is ironic or not EVALITA 2014 Workshop December 11 2014, Pisa Semeval 2013, task 2 Semeval 2014, task 9
  • 3. Development and Test Data Collection • 6,448 (training set 4,513; test set: 1,935) tweets derived from two existing corpora: o SENTI-TUT (Bosco, Patti, Bolioli, 2013) o TWITA (Basile and Nissim, 2013) EVALITA 2014 Workshop December 11 2014, Pisa • two main components: o political: extraction based on specific keywords and hashtags marking political topics (#grillo, Monti) o generic: random tweets on any topic.
  • 4. Development and Test Data Data format • Each tweet is presented as a sequence of comma separated fields id, subj, pos, neg, iro, topic, text • Manual annotation: subj (subjectivity)/pos (positive polarity)/neg (negative polarity)/iro (ironic) • Apart from the id, which is a string of numeric characters, the value of all the other fields can be either “0” or “1”. • For the four manually annotated classes: o 0 means that the feature is absent o 1 means that the feature is present EVALITA 2014 Workshop December 11 2014, Pisa
  • 5. Development and Test Data Data format • Each tweet is presented as a sequence of comma separated fields id, subj, pos, neg, iro, topic, text Constraints in the annotation scheme: • An objective tweet will not have any polarity nor irony • A subjective tweet can exhibit at the same time positive and negative polarity (mixed!) • A subjective tweet can exhibit no specific polarity and be just neutral but with a clear subjective flavour • An ironic tweet is always subjective and it must have one defined polarity EVALITA 2014 Workshop December 11 2014, Pisa
  • 6. Development and Test Data • Constraints in the annotation scheme: EVALITA 2014 Workshop December 11 2014, Pisa
  • 7. EVALITA 2014 Workshop December 11 2014, Pisa Examples l’articolo di Roberto Ciccarelli dal manifesto di oggi http://fb.me/1BQVy5WAk o subj = 0 o pos = 0 o neg = 0 o iro = 0 • Objective tweet: …0, 0, 0, 0… id, subj, pos, neg, iro, topic, text
  • 8. EVALITA 2014 Workshop December 11 2014, Pisa Examples Dati negativi da Confindustria che spera nel nuovo governo Monti. Castiglione: “Avanti con le riforme” http://t.co/kIKnbFY7 o subj = 1 o pos = 1 o neg = 1 o iro = 0 • Subjective, mixed: …1, 1, 1, 0 … id, subj, pos, neg, iro, topic, text
  • 9. EVALITA 2014 Workshop December 11 2014, Pisa Examples Botta di ottimismo a #lInfedele: Governo Monti, o la va o la spacca. o subj = 1 o pos = 0 o neg = 1 o iro = 1 • Subjective, negative, ironic: …1, 0, 1, 1 … id, subj, pos, neg, iro, topic, text • Underlying assumptions on irony o 1111: not allowed! o 1001: not allowed! o 0XX1: not allowed! An ironic tweet is always subjec?ve and it must have one defined polarity
  • 10. Development and Test Data Data format • Each tweet is presented as a sequence of comma separated fields id, subj, pos, neg, ironic, topic, text • id: Twitter status id (necessary to retrieve the text). • topic: 0 means “generic” and 1 means “political”. • text: this column will be filled with the actual tweet's text o Due to Twitter’s privacy policy, tweets cannot be distributed directly o Participants were provided with a web interface (RESTful Web API technology) through which they could download the tweet’s text on the fly --when still available-- for all the ids provided EVALITA 2014 Workshop December 11 2014, Pisa TwiEer’s peculiar issue in the evalua?on phase: same training/ test data for all teams
  • 11. Evaluation • Evaluation set: tweets classified by all participating teams o current twitter policies! o no big differences • Metrics: precision, recall and F-measure for each field/class o polarity classification: adapted in order to take into account the peculiarities of the annotation scheme (e.g. possible to have mixed sentiment) o details on evaluation metrics applied for the evaluation of the participant results in the organizers’ report EVALITA 2014 Workshop December 11 2014, Pisa
  • 12. Participants • A total of 11 teams from 4 different countries participated in at least one of the three tasks • SENTIPOLC was the most participated Evalita task with a total of 35 submitted runs: great interest of the NLP community on sentiment analysis in Italian social media o Most of the submissions were constrained (training only on task data) EVALITA 2014 Workshop December 11 2014, Pisa • Only academy (no industry)
  • 13. Results – Task 1 subjectivity • The highest F-score was achieved by uniba2930 at 0.7140 (constrained run) o All participating systems show an improvement over the baseline • majority class baseline (for all tasks) EVALITA 2014 Workshop December 11 2014, Pisa
  • 14. Results – Task 2 polarity • Again, the highest F-score was achieved by uniba2930 at 0.6771 (constrained). EVALITA 2014 Workshop December 11 2014, Pisa o the most popular subtask o all participating systems show an improvement over the baseline
  • 15. Results – pilot Task 3 irony • The highest F-score was achieved by UNITOR at 0.5959 (unconstrained run) and 0.5759 (constrained run). o some systems score very close to the baseline: hight complexity of the task EVALITA 2014 Workshop December 11 2014, Pisa
  • 16. Comparison, issues EVALITA 2014 Workshop December 11 2014, Pisa • Comparison lines: o exploitation of further Twitter annotated data for training o classification framework (approaches, algorithms, features) o exploitation of available resources (e.g. sentiment lexicons, NLP tools, etc.), o interdependency of tasks in case of systems participating in several subtasks …in the organizers’ report • Issues o Irony and polarity reversal o Mixed sentiment is hard to recognise
  • 17. What’s next • uniba2930: best system on tasks 1 & 2 Pierpaolo Basile and Nicole Novielli UNIBA at EVALITA 2014-SENTIPOLC Task: Predicting tweet sentiment polarity combining micro-blogging, lexicon and semantic features • UNITOR: best system on pilot task 3 Giuseppe Castellucci, Danilo Croce, Diego De Cao, Roberto Basili A Multiple Kernel Approach for Twitter Sentiment Analysis in Italian Discussion! 17.45: Poster session EVALITA 2014 Workshop December 11 2014, Pisa Proceedings on-line! http://clic.humnet.unipi.it/proceedings/Proceedings-EVALITA-2014.pdf
  • 18. Discussion • Feedback from 2014 Sentipolc teams? • Next edition?  Ideas? Proposal? EVALITA 2014 Workshop December 11 2014, Pisa o data - Twitter data Facebook data (conversational threads, friends network)? - format o tasks - aspect-based sentiment analysis (target)? emotions? o systems - Sentipolc systems available as services via API/ download? o evaluation metrics