Project: Named Entity Extraction in Twitter
- Md Tareque Khan (201505521)
- Sourav Sarangi (201301014)
- Darshan Agarwal (201225189)
[Team 55] [April 2016]
Information Retrieval and Extraction (CSE474) Spring ‘16
Professor: Vasudeva Verma
Mentor: Priyanka Bajaj
Problem Statement
https://noisy-text.github.io/ner-shared-task.html
Problem Statement Continued…
A baseline system was provided by the organisers, with
accuracy: 96.06%
F1 measure: 42.09
It classifies named entities into the following
categories:
- company - facility - geo-loc
- movie - musicartist - other
- person - product - sportsteam
- tvshow
Goal: Improve the accuracy and F1 measure
of the baseline code
Baseline Code Review
- Train and test data, each containing 500 tweets
- Lexicons for people's first names, English stop words, product
names, a location database, sports teams and TV programs
- A Python script generates the features in the format
required by CRFsuite
- CRFsuite trains a model on the training data and
dumps the model in text format
- CRFsuite's tag mode is then run on the test data to extract
named entities
- A Perl script performs the evaluation
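The feature-generation step above can be sketched as follows. This is a minimal illustration, not the baseline's actual script: the feature names, the `first_names` lexicon and the sample tweet are all hypothetical. It assumes the usual CRFsuite text layout of one token per line, with the label first and tab-separated attributes after it, and it assumes that colons in attribute names need escaping because the format reserves them for attribute values.

```python
def escape(text):
    """Escape backslashes and colons (assumed to be reserved by the data format)."""
    return text.replace("\\", "\\\\").replace(":", "\\:")


def token_features(tokens, i, first_names):
    """Build a few illustrative attributes for tokens[i] (hypothetical feature set)."""
    word = tokens[i]
    feats = [
        "w=" + escape(word.lower()),
        "is_title=" + str(word.istitle()),
        "in_first_names=" + str(word.lower() in first_names),
    ]
    if i == 0:
        feats.append("__BOS__")  # marks the first token of the tweet
    if i == len(tokens) - 1:
        feats.append("__EOS__")  # marks the last token of the tweet
    return feats


def to_crfsuite_lines(tokens, labels, first_names):
    """One line per token: the label, then tab-separated attributes."""
    return [
        "\t".join([label] + token_features(tokens, i, first_names))
        for i, label in enumerate(labels)
    ]


# Hypothetical tweet and labels, for illustration only.
tweet = ["Sachin", "visits", "Mumbai", "at", "10:30"]
labels = ["B-person", "O", "B-geo-loc", "O", "O"]
lines = to_crfsuite_lines(tweet, labels, first_names={"sachin"})
print("\n".join(lines))
```

In a full file, the lines of each tweet would be followed by an empty line so that tweets are treated as separate sequences.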
CRFsuite (Averaged Perceptron)
CRFsuite is run with the Averaged Perceptron algorithm.
This algorithm takes the average of the feature
weights over all updates in the training process.
The algorithm is the fastest in terms of training
speed (for example, compared with l2sgd: Stochastic Gradient
Descent (SGD) with L2 regularization).
Even though the algorithm is very simple, it
exhibits high prediction performance. In
practice, the training process has to be stopped
by specifying the maximum number
of iterations (120 in our case).
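The weight-averaging idea can be sketched with a small token-level classifier. This is a deliberate simplification and an assumption on our part: CRFsuite's version is a structured perceptron that also learns label-transition weights, which this sketch leaves out. The toy data, the feature names and the helper names `predict` and `train_averaged_perceptron` are hypothetical.

```python
from collections import defaultdict


def predict(feats, weights, labels):
    """Return the label with the highest summed weight (ties go to the first label)."""
    return max(labels, key=lambda y: sum(weights.get((f, y), 0.0) for f in feats))


def train_averaged_perceptron(data, labels, max_iterations=120):
    """data is a list of (features, gold_label) pairs; returns averaged weights."""
    weights = defaultdict(float)  # current weights, keyed by (feature, label)
    totals = defaultdict(float)   # sum of the weights after every example
    steps = 0
    for _ in range(max_iterations):
        mistakes = 0
        for feats, gold in data:
            guess = predict(feats, weights, labels)
            if guess != gold:
                mistakes += 1
                for f in feats:
                    weights[(f, gold)] += 1.0   # reward the correct label
                    weights[(f, guess)] -= 1.0  # penalise the wrong guess
            for key, value in weights.items():
                totals[key] += value
            steps += 1
        if mistakes == 0:  # converged before the iteration cap
            break
    return {key: value / steps for key, value in totals.items()}


# Toy training set, for illustration only.
labels = ["O", "B-geo-loc", "B-person"]
data = [
    (["w=london", "is_title"], "B-geo-loc"),
    (["w=the"], "O"),
    (["w=obama", "is_title"], "B-person"),
    (["w=in"], "O"),
]
averaged = train_averaged_perceptron(data, labels, max_iterations=120)
print(predict(["w=obama", "is_title"], averaged, labels))  # B-person
```

Averaging damps the effect of late, noisy updates, which is one reason the method tends to generalise better than the final weight vector alone. The `max_iterations` cap matters because the loop only stops by itself when the training data is separated perfectly.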
Changes done
1. Code changes: logical bug fixed
The Python code did not actually consider contiguous words
when extracting phrase features with the windowing approach.
Fixing this raised the accuracy by about 0.3 percentage points,
to 96.57%
2. Non-code changes:
- Wikipedia titles were heavily pruned and added as
lexicons, which raised the accuracy to 96.24%, i.e. by about
0.2 percentage points
- Open data from government websites, such as world university
names, geographical data like river names, and company names, was
also
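The windowing approach over contiguous words, as mentioned under the code changes, can be sketched like this. It is a hypothetical reconstruction of the idea, not the project's code: the function name, the feature names and the tiny lexicon are assumptions. Every window of up to `max_window` adjacent tokens is joined into a phrase and looked up in a lexicon, and each token covered by a match receives a feature.

```python
def phrase_lexicon_features(tokens, lexicon, max_window=3):
    """Mark every token covered by a contiguous n-gram that appears in the lexicon."""
    hits = [set() for _ in tokens]
    for size in range(1, max_window + 1):
        # Slide a window of `size` adjacent tokens across the tweet.
        for start in range(len(tokens) - size + 1):
            phrase = " ".join(t.lower() for t in tokens[start:start + size])
            if phrase in lexicon:
                for pos in range(start, start + size):
                    hits[pos].add("in_lexicon_len%d" % size)
    return [sorted(h) for h in hits]


# Hypothetical tweet and lexicon, for illustration only.
tokens = ["I", "love", "New", "York", "City"]
lexicon = {"york", "new york", "new york city"}
feats = phrase_lexicon_features(tokens, lexicon)
for token, token_feats in zip(tokens, feats):
    print(token, token_feats)
```

Slicing `tokens[start:start + size]` is what keeps the words contiguous; a lookup that pairs non-adjacent words would miss multi-word entries such as "new york city".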
Base Output
processed 11570 tokens with 356 phrases; found: 244 phrases; correct: 128.
accuracy: 96.07%; precision: 52.46%; recall: 35.96%; FB1: 42.67
company: precision: 72.41%; recall: 51.22%; FB1: 60.00 29
facility: precision: 40.00%; recall: 30.00%; FB1: 34.29 15
geo-loc: precision: 64.44%; recall: 50.00%; FB1: 56.31 45
movie: precision: 11.11%; recall: 33.33%; FB1: 16.67 9
musicartist: precision: 16.67%; recall: 8.33%; FB1: 11.11 6
other: precision: 35.00%; recall: 11.48%; FB1: 17.28 20
person: precision: 60.44%; recall: 47.01%; FB1: 52.88 91
product: precision: 26.67%; recall: 22.22%; FB1: 24.24 15
sportsteam: precision: 25.00%; recall: 16.67%; FB1: 20.00 12
tvshow: precision: 50.00%; recall: 12.50%; FB1: 20.00 2
Build Output
processed 11570 tokens with 356 phrases; found: 261 phrases; correct: 157.
accuracy: 96.57%; precision: 60.15%; recall: 44.10%; FB1: 50.89
company: precision: 70.97%; recall: 53.66%; FB1: 61.11 31
facility: precision: 50.00%; recall: 30.00%; FB1: 37.50 12
geo-loc: precision: 68.09%; recall: 55.17%; FB1: 60.95 47
movie: precision: 33.33%; recall: 33.33%; FB1: 33.33 3
musicartist: precision: 12.50%; recall: 8.33%; FB1: 10.00 8
other: precision: 42.42%; recall: 22.95%; FB1: 29.79 33
person: precision: 68.27%; recall: 60.68%; FB1: 64.25 104
product: precision: 62.50%; recall: 27.78%; FB1: 38.46 8
sportsteam: precision: 40.00%; recall: 22.22%; FB1: 28.57 10
tvshow: precision: 20.00%; recall: 12.50%; FB1: 15.38 5
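The overall precision, recall and FB1 figures in the two outputs follow directly from the phrase counts on their first lines. The sketch below recomputes them with the standard definitions; the function name `prf` is ours, and the counts are copied from the outputs above.

```python
def prf(gold, found, correct):
    """Phrase-level precision, recall and F1, all in percent."""
    precision = 100.0 * correct / found if found else 0.0
    recall = 100.0 * correct / gold if gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# (found, correct) phrase counts from each output; both runs have 356 gold phrases.
runs = {"base": (244, 128), "build": (261, 157)}
results = {name: prf(356, found, correct) for name, (found, correct) in runs.items()}
for name, (p, r, f) in results.items():
    print("%s: precision %.2f%%; recall %.2f%%; FB1 %.2f" % (name, p, r, f))
# base: precision 52.46%; recall 35.96%; FB1 42.67
# build: precision 60.15%; recall 44.10%; FB1 50.89
```

The "accuracy" figure in the same outputs is computed per token rather than per phrase, which is why it stays above 96% even when phrase-level precision and recall are far lower.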
External Links
http://www.chokkan.org/software/crfsuite/
https://noisy-text.github.io/ner-shared-task.html
Papers referred to
- TwiNER: Named Entity Recognition in Targeted Twitter Stream [Chenliang
Li, Jianshu Weng]
- Named Entity Recognition in Tweets: An Experimental Study [Alan Ritter,
Sam Clark, Mausam and Oren Etzioni]
Thank You
