SlideShare a Scribd company logo
1 of 5
Download to read offline
Entityclassifier.eu: Real-time Classification
of Entities in Text with Wikipedia
Milan Dojchinovski1,2, Tomáš Kliegr2
1 Faculty of Information Technology
Czech Technical University in Prague
2Faculty of Informatics and Statistics
University of Economics, Prague
European Conference on Machine Learning and Principles and Practice of
Knowledge Discovery Discovery in Databases (ECMLPKDD 2013)
September 23-27, 2013, Prague, CZ
Milan Dojchinovski
milan.dojchinovski@vse.cz - @m1ci - http://dojchinovski.mk
Except where otherwise noted, the content of this presentation is licensed under
Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported
Czech Technical University
in Prague
University of Economics
Prague
What is Entityclassifier.eu?
‣ Fully-automated Named Entity Recognition (NER) system
- entity spotting - rule based lexico-syntactic patterns
- entity disambiguation - unique identification with Wikipedia/DBpedia URIs
- entity classification - using types from the DBpedia Ontology
- entity linking - entities linked with concepts from DBpedia and YAGO
2Entityclassifier.eu: Real-time Classification of Entities in Text with Wikipedia - @m1ci - http://dojchinovski.mk
Advantages of using Entityclassifier.eu
‣ Real-time mining
- previously unknown entities can be disambiguated and classified in real-time
‣ Right type granularity
- most frequent type, as selected by the Wikipedia editors, extracted from free text
‣ Multilinguality
- can process English, German and Dutch texts
3Entityclassifier.eu: Real-time Classification of Entities in Text with Wikipedia - @m1ci - http://dojchinovski.mk
Availability
‣ Web application
- http://entityclassfier.eu
‣ REST API
- API documentation http://entityclassifier.eu/thd/docs/
4Entityclassifier.eu: Real-time Classification of Entities in Text with Wikipedia - @m1ci - http://dojchinovski.mk
Live demo!
http://entityclassifier.eu
Feedback
5
Thank you!
Questions, comments, ideas?
Milan Dojchinovski @m1ci
milan.dojchinovski@fit.cvut.cz http://dojchinovski.mk
Except where otherwise noted, the content of this presentation is licensed under
Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported

More Related Content

Viewers also liked

Recognizing, Classifying and Linking Entities with Wikipedia and DBpedia
Recognizing, Classifying and Linking Entities with Wikipedia and DBpediaRecognizing, Classifying and Linking Entities with Wikipedia and DBpedia
Recognizing, Classifying and Linking Entities with Wikipedia and DBpediaMilan Dojchinovski
 
7 kalimah allah
7 kalimah allah7 kalimah allah
7 kalimah allahIcha Brow
 
Anggaran penjualan
Anggaran penjualanAnggaran penjualan
Anggaran penjualanIcha Brow
 
01. kebijakan binsus (palopo) 2014
01. kebijakan binsus (palopo) 201401. kebijakan binsus (palopo) 2014
01. kebijakan binsus (palopo) 2014Icha Brow
 
Presentase pemasaran
Presentase pemasaranPresentase pemasaran
Presentase pemasaranIcha Brow
 
Prada H & D in Tokyo
Prada H & D in Tokyo Prada H & D in Tokyo
Prada H & D in Tokyo Emma Pereira
 
Keuangan dan tata kelola lkp
Keuangan dan tata kelola lkpKeuangan dan tata kelola lkp
Keuangan dan tata kelola lkpIcha Brow
 
Manajemen mutu, visi, renstra
Manajemen mutu, visi, renstraManajemen mutu, visi, renstra
Manajemen mutu, visi, renstraIcha Brow
 

Viewers also liked (9)

Recognizing, Classifying and Linking Entities with Wikipedia and DBpedia
Recognizing, Classifying and Linking Entities with Wikipedia and DBpediaRecognizing, Classifying and Linking Entities with Wikipedia and DBpedia
Recognizing, Classifying and Linking Entities with Wikipedia and DBpedia
 
7 kalimah allah
7 kalimah allah7 kalimah allah
7 kalimah allah
 
Humor kocak
Humor kocakHumor kocak
Humor kocak
 
Anggaran penjualan
Anggaran penjualanAnggaran penjualan
Anggaran penjualan
 
01. kebijakan binsus (palopo) 2014
01. kebijakan binsus (palopo) 201401. kebijakan binsus (palopo) 2014
01. kebijakan binsus (palopo) 2014
 
Presentase pemasaran
Presentase pemasaranPresentase pemasaran
Presentase pemasaran
 
Prada H & D in Tokyo
Prada H & D in Tokyo Prada H & D in Tokyo
Prada H & D in Tokyo
 
Keuangan dan tata kelola lkp
Keuangan dan tata kelola lkpKeuangan dan tata kelola lkp
Keuangan dan tata kelola lkp
 
Manajemen mutu, visi, renstra
Manajemen mutu, visi, renstraManajemen mutu, visi, renstra
Manajemen mutu, visi, renstra
 

Recently uploaded

MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbuapidays
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 

Recently uploaded (20)

MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 

Entityclassifier. eu: Real-Time Classification of Entities in Text with Wikipedia

  • 1. Entityclassifier.eu: Real-time Classification of Entities in Text with Wikipedia Milan Dojchinovski1,2, Tomáš Kliegr2 1 Faculty of Information Technology Czech Technical University in Prague 2Faculty of Informatics and Statistics University of Economics, Prague European Conference on Machine Learning and Principles and Practice of Knowledge Discovery Discovery in Databases (ECMLPKDD 2013) September 23-27, 2013, Prague, CZ Milan Dojchinovski milan.dojchinovski@vse.cz - @m1ci - http://dojchinovski.mk Except where otherwise noted, the content of this presentation is licensed under Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported Czech Technical University in Prague University of Economics Prague
  • 2. What is Entityclassifier.eu? ‣ Fully-automated Named Entity Recognition (NER) system - entity spotting - rule based lexico-syntactic patterns - entity disambiguation - unique identification with Wikipedia/DBpedia URIs - entity classification - using types from the DBpedia Ontology - entity linking - entities linked with concepts from DBpedia and YAGO 2Entityclassifier.eu: Real-time Classification of Entities in Text with Wikipedia - @m1ci - http://dojchinovski.mk
  • 3. Advantages of using Entityclassifier.eu ‣ Real-time mining - previously unknown entities can be disambiguated and classified in real-time ‣ Right type granularity - most frequent type, as selected by the Wikipedia editors, extracted from free text ‣ Multilinguality - can process English, German and Dutch texts 3Entityclassifier.eu: Real-time Classification of Entities in Text with Wikipedia - @m1ci - http://dojchinovski.mk
  • 4. Availability ‣ Web application - http://entityclassfier.eu ‣ REST API - API documentation http://entityclassifier.eu/thd/docs/ 4Entityclassifier.eu: Real-time Classification of Entities in Text with Wikipedia - @m1ci - http://dojchinovski.mk Live demo! http://entityclassifier.eu
  • 5. Feedback 5 Thank you! Questions, comments, ideas? Milan Dojchinovski @m1ci milan.dojchinovski@fit.cvut.cz http://dojchinovski.mk Except where otherwise noted, the content of this presentation is licensed under Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported