SlideShare a Scribd company logo
1 of 19
Download to read offline
Linked Data for content analytics in Celi 
Semantics 2014 - Leipzig 
Alessio Bosca
Agenda 
ü Presentation of Celi 
ü Technologies (and what we do with 
them) 
ü Focus on LOD for content analytics 
in Celi 
ü … what we’d like to do 
2
1999 
CELI srl was born 
2002 
Speech Technology 
2006 
BlogMeter 
1999 2005 2010 
2013 
Korean Market 
2011 
Cross Library 
2010 
Milan, Rome, 
Trento 
3
4 Seats 
Torino 
Milano 
Trento 
Roma 
Italy 
Belgium 
France 
Spain 
Corea 
Poland 
50 
Employees + 
Collaborators 
6 Markets 
>100 
Active clients 
4 
Business branches 
15 Years of experience 
NLP components 
Speech technology 
Social Media Intelligence 
Digital Humanities 
4
>50 Published papers 15 Research projects 
Scuola Normale Superiore 
Università di Torino 
Università di Pisa 
Università di Trento 
Fondazione Bruno Kessler 
Politecnico di Milano 
6 Agreements with research centers 
Relationships with the scientific community 
5
6 
Core technology 
opinion 
mining, 
mood and 
sentiment 
analysis 
normalization 
language 
identification 
NSW 
processing morphological 
tokenization 
analysis 
disambiguation 
chunking and 
phrasing 
phonetic 
transcription 
with word 
stress 
semantic 
clustering 
automatic 
classification 
named entities
Techs 
Guava 
Kestrel 
Virtuoso 
OpenSource 
7
8 
Clients 
Semantic Solutions Speech Technology Social Media Monitoring
Linked (and/or Open) Data 
Linked Data 
Open Data 
? 
LOD 
9
Private Sector: how Celi exploits L(O)D 
• as user 
LODs as linguistic resources for NER, 
content enrichment, machine linking, 
discovery search… 
• as provider for the PA 
publishing, data integration 
• internal use 
(e.g. assets management) 
• crafting of RDF artifacts 
for custom projects and applications 
10
LOD for NER 
• GENDER GUESSER 
• LOCATION GUESSER 
• ENTITY LINKER 
• ETC . 
11 
INDEXER 
DUMP 
CELI TRIPLE STORES 
INDEXES 
Linguistic Analysis 
SPARQL QUERIES 
SEARCHER 
CUSTOM RDF 
WEBAPPS
Faceted Semantic Search 
Browse through 
documents 
and contents 
Relations between Facets 
12
LOD for CLIR 
THE AGROVOC 
THESAURUS HAS 
BEEN USED IN 
THE 
ORGANIC.LINGU 
A PROJECT FOR 
ONTOLOGY-BASED 
CLIR 
13
Sem-web techs for internal models 
Information in the 
CRUNCHED BOOK is 
represented using 
combinations of RDF 
and GRAPH DBS 
14
Public Sector: clear process … 
acquire 
data 
set open 
license 
open 
formats publish 
15 
Celi for the public sector 
(CSI Piemonte): 
the Homer project
(Public sector contd.) … but … 
LACK 
OF MONEY 
USE OF “STANDARDS” 
… hard 
problems OPAQUE DATASETS 
LACK 
OF WILLINGNESS 
POOR RDF/SPARQL 
SUPPORT 
16
Why companies’ RDF is not published 
• It reflects customers’ needs • It reflects internal data models 
HENCE à OVERFITTING: 
Provocation 
It would not be interesting nor usable 
WAY OUTS: 
having more standard models for particular micro-domains could permit their direct 
(re)use by the private company (and hence the publication of enhanced versions) 
17
Receipts 
Public Sector: use “true” LOD technologies (RDF dumps and SPARQL endpoints) 
Private companies: use standard data models, internally and for their artifacts 
OpenData Community: please stress the linked in LOD! 
The success of LOD is bound to the use of Linked Data 
(as a technology) 
The use of LD in the Private Sector will positively 
feedback on the diffusion of the necessary expertise and 
sensibility in the Public Sector too 
18
Thank You!

More Related Content

Similar to Celi Semantics 2014 Presentation on Linked Data for Content Analytics

“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...
“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...
“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...Marta Villegas
 
Notes for talk on 12th June 2013 to Open Innovation meeting, Glasgow
Notes for talk on 12th June 2013 to Open Innovation meeting, GlasgowNotes for talk on 12th June 2013 to Open Innovation meeting, Glasgow
Notes for talk on 12th June 2013 to Open Innovation meeting, GlasgowPeterWinstanley1
 
A Lightning Introduction To Clouds & HLT - Human Language Technology Conference
A Lightning Introduction To Clouds & HLT - Human Language Technology ConferenceA Lightning Introduction To Clouds & HLT - Human Language Technology Conference
A Lightning Introduction To Clouds & HLT - Human Language Technology ConferenceBasis Technology
 
Hello Open World - Semtech 2009
Hello Open World - Semtech 2009Hello Open World - Semtech 2009
Hello Open World - Semtech 2009Alexandre Passant
 
Michael Lang Sr. Presentation
Michael Lang Sr. PresentationMichael Lang Sr. Presentation
Michael Lang Sr. PresentationMediabistro
 
SURFconext: a next generation collaboration infrastructure across institution...
SURFconext: a next generation collaboration infrastructure across institution...SURFconext: a next generation collaboration infrastructure across institution...
SURFconext: a next generation collaboration infrastructure across institution...University of Amsterdam
 
Clarin nl odijk-final_event_2015-03-13
Clarin nl odijk-final_event_2015-03-13Clarin nl odijk-final_event_2015-03-13
Clarin nl odijk-final_event_2015-03-13CLARIAH
 
SWSIG wlic2016
SWSIG wlic2016SWSIG wlic2016
SWSIG wlic2016Figoblog
 
Bg linkedin bigdata_martinschultz_symposium_yale_oct2012
Bg linkedin bigdata_martinschultz_symposium_yale_oct2012Bg linkedin bigdata_martinschultz_symposium_yale_oct2012
Bg linkedin bigdata_martinschultz_symposium_yale_oct2012Bhaskar Ghosh
 
BDS14 Big Data Analytics to the masses
BDS14 Big Data Analytics to the massesBDS14 Big Data Analytics to the masses
BDS14 Big Data Analytics to the massesJose Luis Lopez Pino
 
Linked Open Library Data @hbz
Linked Open Library Data @hbzLinked Open Library Data @hbz
Linked Open Library Data @hbzAdrian Pohl
 
Data Production Pipelines: Legacy, practices, and innovation
Data Production Pipelines: Legacy, practices, and innovationData Production Pipelines: Legacy, practices, and innovation
Data Production Pipelines: Legacy, practices, and innovationNatalino Busa
 
Grand Challenges Learning Analytics
Grand Challenges Learning AnalyticsGrand Challenges Learning Analytics
Grand Challenges Learning Analyticsamberg
 
Peter Bjørn Larsen - Öresund Smart City Hub
Peter Bjørn Larsen - Öresund Smart City HubPeter Bjørn Larsen - Öresund Smart City Hub
Peter Bjørn Larsen - Öresund Smart City HubBigDataViz
 
An introduction to repository reference models
An introduction to repository reference modelsAn introduction to repository reference models
An introduction to repository reference modelsJulie Allinson
 

Similar to Celi Semantics 2014 Presentation on Linked Data for Content Analytics (20)

“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...
“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...
“Publishing and Consuming Linked Data. (Lessons learnt when using LOD in an a...
 
Weaving a Web of Linked Data - September 26th, 2019
Weaving a Web of Linked Data - September 26th, 2019Weaving a Web of Linked Data - September 26th, 2019
Weaving a Web of Linked Data - September 26th, 2019
 
Notes for talk on 12th June 2013 to Open Innovation meeting, Glasgow
Notes for talk on 12th June 2013 to Open Innovation meeting, GlasgowNotes for talk on 12th June 2013 to Open Innovation meeting, Glasgow
Notes for talk on 12th June 2013 to Open Innovation meeting, Glasgow
 
A Lightning Introduction To Clouds & HLT - Human Language Technology Conference
A Lightning Introduction To Clouds & HLT - Human Language Technology ConferenceA Lightning Introduction To Clouds & HLT - Human Language Technology Conference
A Lightning Introduction To Clouds & HLT - Human Language Technology Conference
 
Methodology for the publication of Linked Open Data from small and medium siz...
Methodology for the publication of Linked Open Data from small and medium siz...Methodology for the publication of Linked Open Data from small and medium siz...
Methodology for the publication of Linked Open Data from small and medium siz...
 
Hello Open World - Semtech 2009
Hello Open World - Semtech 2009Hello Open World - Semtech 2009
Hello Open World - Semtech 2009
 
Michael Lang Sr. Presentation
Michael Lang Sr. PresentationMichael Lang Sr. Presentation
Michael Lang Sr. Presentation
 
SURFconext: a next generation collaboration infrastructure across institution...
SURFconext: a next generation collaboration infrastructure across institution...SURFconext: a next generation collaboration infrastructure across institution...
SURFconext: a next generation collaboration infrastructure across institution...
 
Linked Open Data and Ontotext Projects
Linked Open Data and Ontotext ProjectsLinked Open Data and Ontotext Projects
Linked Open Data and Ontotext Projects
 
Clarin nl odijk-final_event_2015-03-13
Clarin nl odijk-final_event_2015-03-13Clarin nl odijk-final_event_2015-03-13
Clarin nl odijk-final_event_2015-03-13
 
Jones "Enabling Discovery in the Library"
Jones "Enabling Discovery in the Library"Jones "Enabling Discovery in the Library"
Jones "Enabling Discovery in the Library"
 
SWSIG wlic2016
SWSIG wlic2016SWSIG wlic2016
SWSIG wlic2016
 
Bg linkedin bigdata_martinschultz_symposium_yale_oct2012
Bg linkedin bigdata_martinschultz_symposium_yale_oct2012Bg linkedin bigdata_martinschultz_symposium_yale_oct2012
Bg linkedin bigdata_martinschultz_symposium_yale_oct2012
 
CORFU-MTSR 2013
CORFU-MTSR 2013CORFU-MTSR 2013
CORFU-MTSR 2013
 
BDS14 Big Data Analytics to the masses
BDS14 Big Data Analytics to the massesBDS14 Big Data Analytics to the masses
BDS14 Big Data Analytics to the masses
 
Linked Open Library Data @hbz
Linked Open Library Data @hbzLinked Open Library Data @hbz
Linked Open Library Data @hbz
 
Data Production Pipelines: Legacy, practices, and innovation
Data Production Pipelines: Legacy, practices, and innovationData Production Pipelines: Legacy, practices, and innovation
Data Production Pipelines: Legacy, practices, and innovation
 
Grand Challenges Learning Analytics
Grand Challenges Learning AnalyticsGrand Challenges Learning Analytics
Grand Challenges Learning Analytics
 
Peter Bjørn Larsen - Öresund Smart City Hub
Peter Bjørn Larsen - Öresund Smart City HubPeter Bjørn Larsen - Öresund Smart City Hub
Peter Bjørn Larsen - Öresund Smart City Hub
 
An introduction to repository reference models
An introduction to repository reference modelsAn introduction to repository reference models
An introduction to repository reference models
 

More from mbruemmer

Dirk Goldhahn: Introduction to the German Wortschatz Project
Dirk Goldhahn: Introduction to the German Wortschatz ProjectDirk Goldhahn: Introduction to the German Wortschatz Project
Dirk Goldhahn: Introduction to the German Wortschatz Projectmbruemmer
 
Oscar Muñoz: Content Analytics for Media Agencies
Oscar Muñoz: Content Analytics for Media AgenciesOscar Muñoz: Content Analytics for Media Agencies
Oscar Muñoz: Content Analytics for Media Agenciesmbruemmer
 
Marc Egger: Text Analytics for Brand Research -Non-reactive Concept Mapping t...
Marc Egger: Text Analytics for Brand Research -Non-reactive Concept Mapping t...Marc Egger: Text Analytics for Brand Research -Non-reactive Concept Mapping t...
Marc Egger: Text Analytics for Brand Research -Non-reactive Concept Mapping t...mbruemmer
 
Mark Zöpfgen: Software-Supported Bibliographic Recording and Linked Data
Mark Zöpfgen: Software-Supported Bibliographic Recording and Linked DataMark Zöpfgen: Software-Supported Bibliographic Recording and Linked Data
Mark Zöpfgen: Software-Supported Bibliographic Recording and Linked Datambruemmer
 
Ilan Kernerman: Generating Multilingual Lexicographic Resources
Ilan Kernerman: Generating Multilingual Lexicographic ResourcesIlan Kernerman: Generating Multilingual Lexicographic Resources
Ilan Kernerman: Generating Multilingual Lexicographic Resourcesmbruemmer
 
Tatiana Gornostay: Language Meets Knowledge in Digital Content Management
Tatiana Gornostay: Language Meets Knowledge in Digital Content ManagementTatiana Gornostay: Language Meets Knowledge in Digital Content Management
Tatiana Gornostay: Language Meets Knowledge in Digital Content Managementmbruemmer
 
Lemon-aid: using Lemon to aid quantitative historical linguistic analysis
Lemon-aid: using Lemon to aid quantitative historical linguistic analysisLemon-aid: using Lemon to aid quantitative historical linguistic analysis
Lemon-aid: using Lemon to aid quantitative historical linguistic analysismbruemmer
 

More from mbruemmer (7)

Dirk Goldhahn: Introduction to the German Wortschatz Project
Dirk Goldhahn: Introduction to the German Wortschatz ProjectDirk Goldhahn: Introduction to the German Wortschatz Project
Dirk Goldhahn: Introduction to the German Wortschatz Project
 
Oscar Muñoz: Content Analytics for Media Agencies
Oscar Muñoz: Content Analytics for Media AgenciesOscar Muñoz: Content Analytics for Media Agencies
Oscar Muñoz: Content Analytics for Media Agencies
 
Marc Egger: Text Analytics for Brand Research -Non-reactive Concept Mapping t...
Marc Egger: Text Analytics for Brand Research -Non-reactive Concept Mapping t...Marc Egger: Text Analytics for Brand Research -Non-reactive Concept Mapping t...
Marc Egger: Text Analytics for Brand Research -Non-reactive Concept Mapping t...
 
Mark Zöpfgen: Software-Supported Bibliographic Recording and Linked Data
Mark Zöpfgen: Software-Supported Bibliographic Recording and Linked DataMark Zöpfgen: Software-Supported Bibliographic Recording and Linked Data
Mark Zöpfgen: Software-Supported Bibliographic Recording and Linked Data
 
Ilan Kernerman: Generating Multilingual Lexicographic Resources
Ilan Kernerman: Generating Multilingual Lexicographic ResourcesIlan Kernerman: Generating Multilingual Lexicographic Resources
Ilan Kernerman: Generating Multilingual Lexicographic Resources
 
Tatiana Gornostay: Language Meets Knowledge in Digital Content Management
Tatiana Gornostay: Language Meets Knowledge in Digital Content ManagementTatiana Gornostay: Language Meets Knowledge in Digital Content Management
Tatiana Gornostay: Language Meets Knowledge in Digital Content Management
 
Lemon-aid: using Lemon to aid quantitative historical linguistic analysis
Lemon-aid: using Lemon to aid quantitative historical linguistic analysisLemon-aid: using Lemon to aid quantitative historical linguistic analysis
Lemon-aid: using Lemon to aid quantitative historical linguistic analysis
 

Recently uploaded

VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAbdelrhman abooda
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一F La
 

Recently uploaded (20)

VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Decoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in ActionDecoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in Action
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
 

Celi Semantics 2014 Presentation on Linked Data for Content Analytics

  • 1. Linked Data for content analytics in Celi Semantics 2014 - Leipzig Alessio Bosca
  • 2. Agenda ü Presentation of Celi ü Technologies (and what we do with them) ü Focus on LOD for content analytics in Celi ü … what we’d like to do 2
  • 3. 1999 CELI srl was born 2002 Speech Technology 2006 BlogMeter 1999 2005 2010 2013 Korean Market 2011 Cross Library 2010 Milan, Rome, Trento 3
  • 4. 4 Seats Torino Milano Trento Roma Italy Belgium France Spain Corea Poland 50 Employees + Collaborators 6 Markets >100 Active clients 4 Business branches 15 Years of experience NLP components Speech technology Social Media Intelligence Digital Humanities 4
  • 5. >50 Published papers 15 Research projects Scuola Normale Superiore Università di Torino Università di Pisa Università di Trento Fondazione Bruno Kessler Politecnico di Milano 6 Agreements with research centers Relationships with the scientific community 5
  • 6. 6 Core technology opinion mining, mood and sentiment analysis normalization language identification NSW processing morphological tokenization analysis disambiguation chunking and phrasing phonetic transcription with word stress semantic clustering automatic classification named entities
  • 7. Techs Guava Kestrel Virtuoso OpenSource 7
  • 8. 8 Clients Semantic Solutions Speech Technology Social Media Monitoring
  • 9. Linked (and/or Open) Data Linked Data Open Data ? LOD 9
  • 10. Private Sector: how Celi exploits L(O)D • as user LODs as linguistic resources for NER, content enrichment, machine linking, discovery search… • as provider for the PA publishing, data integration • internal use (e.g. assets management) • crafting of RDF artifacts for custom projects and applications 10
  • 11. LOD for NER • GENDER GUESSER • LOCATION GUESSER • ENTITY LINKER • ETC . 11 INDEXER DUMP CELI TRIPLE STORES INDEXES Linguistic Analysis SPARQL QUERIES SEARCHER CUSTOM RDF WEBAPPS
  • 12. Faceted Semantic Search Browse through documents and contents Relations between Facets 12
  • 13. LOD for CLIR THE AGROVOC THESAURUS HAS BEEN USED IN THE ORGANIC.LINGU A PROJECT FOR ONTOLOGY-BASED CLIR 13
  • 14. Sem-web techs for internal models Information in the CRUNCHED BOOK is represented using combinations of RDF and GRAPH DBS 14
  • 15. Public Sector: clear process … acquire data set open license open formats publish 15 Celi for the public sector (CSI Piemonte): the Homer project
  • 16. (Public sector contd.) … but … LACK OF MONEY USE OF “STANDARDS” … hard problems OPAQUE DATASETS LACK OF WILLINGNESS POOR RDF/SPARQL SUPPORT 16
  • 17. Why companies’ RDF is not published • It reflects customers’ needs • It reflects internal data models HENCE à OVERFITTING: Provocation It would not be interesting nor usable WAY OUTS: having more standard models for particular micro-domains could permit their direct (re)use by the private company (and hence the publication of enhanced versions) 17
  • 18. Receipts Public Sector: use “true” LOD technologies (RDF dumps and SPARQL endpoints) Private companies: use standard data models, internally and for their artifacts OpenData Community: please stress the linked in LOD! The success of LOD is bound to the use of Linked Data (as a technology) The use of LD in the Private Sector will positively feedback on the diffusion of the necessary expertise and sensibility in the Public Sector too 18