SlideShare a Scribd company logo
1 of 10
Download to read offline
Project: Named Entity Extraction in
Twitter
- Md Tareque Khan (201505521)
- Sourav Sarangi (201301014)
- Darshan Agarwal (201225189)
[Team 55] [April 2016]
Information Retrieval and Extraction (CSE474) Spring ‘16
Professor: Vasudeva Verma
Mentor: Priyanka Bajaj
Problem Statement
https://noisy-text.github.io/ner-shared-
task.html
Problem Statement Continued…
A baseline code was provided by Organisers
having
precision : 96.06%
F1 Measure: 42.09
and categorizing named entities into
following categories
- Company - facility - geo-loc
- movie - musicartist - other
- person - product - sportsteam
- tvshow
Goal: Improve the precision and F1 Measure
of the baseline code
Baseline Code Review
 Train and test data each containing 500 tweets
 Lexicons for people first name, english stop words, product
names, location database, sports team, tv programs
 A python code is used to generate the feature in format
required by CRFSuite ()
 CRFSuite generates model using the training data, and
dumps the model in txt format
 CRFSuite tag mode is then used on the test data to extract
named entities.
 Perl script did the job of evaluation
Crfsuite (Averaged Perceptron)
Crfsuite uses Averaged Perceptron algorithm
This algorithm takes the average of feature
weights at all updates in the training process.
The algorithm is fastest in terms of training
speed(as compared to l2sgd: Stochastic Gradient
Descent (SGD) with L2 regularization).
Even though the algorithm is very simple, it
exhibits high prediction performance. In
practice, it is necessary to stop a training
process by specifying the maximum number
of iterations (120 in our case).
Changes done
1. Code changes : Logical bug fixed
python code didn’t actually considered contiguous words
to
extract the phrase features using windowing approach.
Fixing
this boosted the precision by .3% to 96.57%
2. Non-code changes :
- Wikipedia titles were heavily pruned and added as
lexicons,
which boosted the precision to 96.24 % i.e. by a factor
of .2%
- OpenData from gov websites like world university
names,
geographical data like river names, company names was
also
Base Output
processed 11570 tokens with 356 phrases; found: 244 phrases; correct: 128.
accuracy: 96.07%; precision: 52.46%; recall: 35.96%; FB1: 42.67
company: precision: 72.41%; recall: 51.22%; FB1: 60.00 29
facility: precision: 40.00%; recall: 30.00%; FB1: 34.29 15
geo-loc: precision: 64.44%; recall: 50.00%; FB1: 56.31 45
movie: precision: 11.11%; recall: 33.33%; FB1: 16.67 9
musicartist: precision: 16.67%; recall: 8.33%; FB1: 11.11 6
other: precision: 35.00%; recall: 11.48%; FB1: 17.28 20
person: precision: 60.44%; recall: 47.01%; FB1: 52.88 91
product: precision: 26.67%; recall: 22.22%; FB1: 24.24 15
sportsteam: precision: 25.00%; recall: 16.67%; FB1: 20.00 12
tvshow: precision: 50.00%; recall: 12.50%; FB1: 20.00 2
Build Output
processed 11570 tokens with 356 phrases; found: 261 phrases; correct: 157.
accuracy: 96.57%; precision: 60.15%; recall: 44.10%; FB1: 50.89
company: precision: 70.97%; recall: 53.66%; FB1: 61.11 31
facility: precision: 50.00%; recall: 30.00%; FB1: 37.50 12
geo-loc: precision: 68.09%; recall: 55.17%; FB1: 60.95 47
movie: precision: 33.33%; recall: 33.33%; FB1: 33.33 3
musicartist: precision: 12.50%; recall: 8.33%; FB1: 10.00 8
other: precision: 42.42%; recall: 22.95%; FB1: 29.79 33
person: precision: 68.27%; recall: 60.68%; FB1: 64.25 104
product: precision: 62.50%; recall: 27.78%; FB1: 38.46 8
sportsteam: precision: 40.00%; recall: 22.22%; FB1: 28.57 10
tvshow: precision: 20.00%; recall: 12.50%; FB1: 15.38 5
External Links
http://www.chokkan.org/software/crfsuite/
https://noisy-text.github.io/ner-shared-
task.html
Papers referred to
 TwiNER: Named Entity Recognition in Targeted Twitter Stream [Chenliang
Li1, Jianshu Weng2]
 Named Entity Recognition in Tweets: An Experimental Study [Alan Ritter,
Sam Clark, Mausam and Oren Etzioni]
Thank You

More Related Content

Recently uploaded

Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...anjaliyadav012327
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
The byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxThe byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxShobhayan Kirtania
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...Pooja Nehwal
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 

Recently uploaded (20)

Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
JAPAN: ORGANISATION OF PMDA, PHARMACEUTICAL LAWS & REGULATIONS, TYPES OF REGI...
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
The byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxThe byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptx
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 

Featured

How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at WorkGetSmarter
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...DevGAMM Conference
 
Barbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationBarbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationErica Santiago
 
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them wellGood Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them wellSaba Software
 
Introduction to C Programming Language
Introduction to C Programming LanguageIntroduction to C Programming Language
Introduction to C Programming LanguageSimplilearn
 

Featured (20)

How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
 
Barbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationBarbie - Brand Strategy Presentation
Barbie - Brand Strategy Presentation
 
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them wellGood Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
 
Introduction to C Programming Language
Introduction to C Programming LanguageIntroduction to C Programming Language
Introduction to C Programming Language
 

Ire project ner-team55-spring16-iiith

  • 1. Project: Named Entity Extraction in Twitter - Md Tareque Khan (201505521) - Sourav Sarangi (201301014) - Darshan Agarwal (201225189) [Team 55] [April 2016] Information Retrieval and Extraction (CSE474) Spring ‘16 Professor: Vasudeva Verma Mentor: Priyanka Bajaj
  • 3. Problem Statement Continued… A baseline code was provided by Organisers having precision : 96.06% F1 Measure: 42.09 and categorizing named entities into following categories - Company - facility - geo-loc - movie - musicartist - other - person - product - sportsteam - tvshow Goal: Improve the precision and F1 Measure of the baseline code
  • 4. Baseline Code Review  Train and test data each containing 500 tweets  Lexicons for people first name, english stop words, product names, location database, sports team, tv programs  A python code is used to generate the feature in format required by CRFSuite ()  CRFSuite generates model using the training data, and dumps the model in txt format  CRFSuite tag mode is then used on the test data to extract named entities.  Perl script did the job of evaluation
  • 5. Crfsuite (Averaged Perceptron) Crfsuite uses Averaged Perceptron algorithm This algorithm takes the average of feature weights at all updates in the training process. The algorithm is fastest in terms of training speed(as compared to l2sgd: Stochastic Gradient Descent (SGD) with L2 regularization). Even though the algorithm is very simple, it exhibits high prediction performance. In practice, it is necessary to stop a training process by specifying the maximum number of iterations (120 in our case).
  • 6. Changes done 1. Code changes : Logical bug fixed python code didn’t actually considered contiguous words to extract the phrase features using windowing approach. Fixing this boosted the precision by .3% to 96.57% 2. Non-code changes : - Wikipedia titles were heavily pruned and added as lexicons, which boosted the precision to 96.24 % i.e. by a factor of .2% - OpenData from gov websites like world university names, geographical data like river names, company names was also
  • 7. Base Output processed 11570 tokens with 356 phrases; found: 244 phrases; correct: 128. accuracy: 96.07%; precision: 52.46%; recall: 35.96%; FB1: 42.67 company: precision: 72.41%; recall: 51.22%; FB1: 60.00 29 facility: precision: 40.00%; recall: 30.00%; FB1: 34.29 15 geo-loc: precision: 64.44%; recall: 50.00%; FB1: 56.31 45 movie: precision: 11.11%; recall: 33.33%; FB1: 16.67 9 musicartist: precision: 16.67%; recall: 8.33%; FB1: 11.11 6 other: precision: 35.00%; recall: 11.48%; FB1: 17.28 20 person: precision: 60.44%; recall: 47.01%; FB1: 52.88 91 product: precision: 26.67%; recall: 22.22%; FB1: 24.24 15 sportsteam: precision: 25.00%; recall: 16.67%; FB1: 20.00 12 tvshow: precision: 50.00%; recall: 12.50%; FB1: 20.00 2
  • 8. Build Output processed 11570 tokens with 356 phrases; found: 261 phrases; correct: 157. accuracy: 96.57%; precision: 60.15%; recall: 44.10%; FB1: 50.89 company: precision: 70.97%; recall: 53.66%; FB1: 61.11 31 facility: precision: 50.00%; recall: 30.00%; FB1: 37.50 12 geo-loc: precision: 68.09%; recall: 55.17%; FB1: 60.95 47 movie: precision: 33.33%; recall: 33.33%; FB1: 33.33 3 musicartist: precision: 12.50%; recall: 8.33%; FB1: 10.00 8 other: precision: 42.42%; recall: 22.95%; FB1: 29.79 33 person: precision: 68.27%; recall: 60.68%; FB1: 64.25 104 product: precision: 62.50%; recall: 27.78%; FB1: 38.46 8 sportsteam: precision: 40.00%; recall: 22.22%; FB1: 28.57 10 tvshow: precision: 20.00%; recall: 12.50%; FB1: 15.38 5
  • 9. External Links http://www.chokkan.org/software/crfsuite/ https://noisy-text.github.io/ner-shared- task.html Papers referred to  TwiNER: Named Entity Recognition in Targeted Twitter Stream [Chenliang Li1, Jianshu Weng2]  Named Entity Recognition in Tweets: An Experimental Study [Alan Ritter, Sam Clark, Mausam and Oren Etzioni]