SlideShare a Scribd company logo
1 of 28
Download to read offline
SESSION ON
MULTILINGUAL VOCABULARY
Development and extension
DanielVila-Suero (dvila@fi.upm.es)
Ontology Engineering Group, UPM (Madrid)
Slides available at
GOAL
Open discussion around vocabularies enabled for
multilingual environments (WWW)
Introduce some examples: current situation and efforts.
More open questions than answers.
Promote collaboration
ß
SESSION OUTLINE
1. Introduction to the session and the topic
2. “Representing multilingual lexical and
terminological information in RDF vocabularies”
Elena Montiel-Ponsoda, OEG-UPM
3. “Metadata registry of the Publications office of the
EU”
Michael Düro. PO-EU
OPENDISCUSSION
OPEN DISCUSSION
THISTALK
1. Why should we care about multilingual vocabularies?
2. What is a multilingual vocabulary?
3. Current situation: when and who
WHY
The primary design principle underlying the Web’s
usefulness and growth is universality.When you make
a link, you can link to anything.That means people
must be able to put anything on the Web, no matter
what computer they have, software they use or human
language they speak…
Tim Berners-Lee
ß
ß	

	

WHY
The primary design principle underlying the Web’s
usefulness and growth is universality.When you make
a link, you can link to anything.That means people
must be able to put anything on the Web, no matter
what computer they have, software they use or human
language they speak…
Tim Berners-LeeVocabularies are becoming a central part of the WWW
LANGUAGES ARE USEFUL
•For Humans
★Finding vocabularies, terms, etc.
★Understading their semantics, how to use them
★...
•and Machines...
★ Search, ranking, resource discovery
★Natural Language Processing applications: multilingual
question answering, localized presentation of data
★....
WHY
Search for
プロジェクト
WHY
24 ranked results
including the term
project in Japanese
SOME FACTS ABOUT LOV
• Data retrieved 12.04.2013* out of 326 vocabs
0
75
150
225
300
Monolingual (EN) Monolingual (No EN) Multilingual Non especified
425358
223
Number of vocabularies
* “Guidelines for Multilingual Linked Data” Gómez-Pérez et al., 2013
SOME FACTS ABOUT LOV
• LOV loves multilingual descriptions: indexing, ranked
search results.
•But, still very low usage of language tags for
vocabulary elements < 60%
•Other semantic search engines (Sindice, Falcons, SWSE..)
lack support for multiple languages
WHAT IS AN MLVOCAB?
• Simple (general) answer:
“A vocabulary which includes labels and
documentation in multiple languages”
•Are there other flavors of multilingual vocabularies?
FLAVORS
1
1. LABELLING
@en
@es
FLAVORS
2. EXTERNAL
MODEL
personal docente
profesor titular
catedrático
academic staff
associate professor
(full) professor
Lehrpersonal
Privatdozent
Professor
P01
P011
P013
P012
Legend!
subClassOf!
Mappings!
ISSUES:
Directionality of links, different namespaces, resolution of
URIs (at http level with header, htaccess, external
service..-)
FLAVORS
3. MAPPING MODEL
cuerpo docente
catedrático
Lehrpersonal
profesor
contratado
assistant
professor
associate
professor
full
professor
Dozent
…
academic staff
P011 P013P012
profesor titular
…
…
P01
Legend!
subClassOf!
Mappings!
Professor
wissenschaftlicher
Mitarbeiter Privatdozent…
WHAT FLAVOR IS MINE ?
WHAT FLAVOR IS MINE?
• Depends on a number of factors:
★Your starting point (starting from scratch? can you
modify the terms within your original namespace? are
there similar vocabularies in other langs?)
★Your needs (linguistically complex model, simplicity,
efficiency, et)
★Your available resources (time, people, money...)
★.........
•Selection should be USE CASE DRIVEN
LAYERED FRAMEWORK
PROCESS
REPRESENTATION
POLICY
INFRASTRUCTURE
ORGANIZATIONAL
TECHNICAL
POLICY
POLICY
★Vocabulary publishers should commit to
a translation policy:
e.g.,What are the protocols for including/
developing/validating a new translation?
★Establish the neccesary mechanisms to
manage and assess the quality,
sinchronization and appropriate
coverage between different languages.
★Again, should be based on
requirements, goals, etc. and be UC
driven
PROCESS
PROCESS
★Translation workflows: versioning,
notification, edition, validation
mechanisms, etc.
★Develop methodologies,
guidelines and best practices for
translating and including new languages.
★Establish communication
protocols between the responsibles of
the different translations (languages)
★Coordination among the people
involved
REPRESENTATION
REPRESENTATION
★Choose your modelling approach:
★rdfs and skos labels and descriptions
★Specialized models (lemon, ontolex
etc.)
★ Mappings
★ Guidelines for:
★ Naming: coining new URIs for
terms
★Labeling: Defining the structure
of the labels (should we use verbs, full
sentences, etc.)
INFRASTRUCTURE
INFRASTRUCTURE
★Manage different aspects:
★Management of translation/
edition workflows: notifications, review
process, versioning, etc.
★Access to vocabulary elements:
localize access? different namespace for
the linguistic descriptions?
★Generation of human-readable
documentation
★Look at MLOD patterns and
guidelines
WHEN AND WHO
• Learn from (succesful) initiatives:
★ FAO’s AGROVOC
★ EUROVOC
★WORDNET
★ IFLAVocabularies and Guidelines for translations
★....
•Get involved in initiatives around the topic:
★W3C Internationalization Activity
★W3C Best practices for Multilingual LOD CG
★W3C Ontology-Lexica CG
★EU Lider project
W3C BPMLOD
REPRESENTATION
INFRASTRUCTURE
Use cases wanted!
W3C ONTO-LEX
REPRESENTATION
LIDER-PROJECT.EU
Linguistic Linked Data (including vocabularies)
can serve as an enabler technology for content analytics
on the Multilingual Web.
Universidad Politécnica de Madrid (Spain) Trinity College (Ireland)
DFKI (Germany) National University of Ireland , Galway (Ireland)
Institut für Angewandte Informatik (Germany)
Universität Bielefeld (Germany)
Universita Roma la Sapienza (Italy) W3C/ERCIM (France)
LIDER-PROJECT.EU
• Development of best practices and guidelines for
publishing multilingual linked data resources (including
vocabularies).
•Events: W3C Multilingual Web workshop, hackhathons,
industrial events, etc.
•Help organizations with publishing Multilingual
Linked Data resources
Get involved!
THANK YOU VERY MUCH
dvila@fi.upm, @dvilasuero

More Related Content

Similar to Multilingual vocabularies for the Web: Session on multilingual vocabularies, VocDay, DC conference Lisbon 2013

OER: insights into a multilingual landscape - EUROCALL 2014 conference
OER: insights into a multilingual landscape - EUROCALL 2014 conference  OER: insights into a multilingual landscape - EUROCALL 2014 conference
OER: insights into a multilingual landscape - EUROCALL 2014 conference LangOER
 
Developing corpus-based resources for language learning: looking back in "hope"
Developing corpus-based resources for language learning: looking back in "hope"Developing corpus-based resources for language learning: looking back in "hope"
Developing corpus-based resources for language learning: looking back in "hope"Pascual Pérez-Paredes
 
Comp app lexicography
Comp app lexicographyComp app lexicography
Comp app lexicographysyila239
 
The World Is Not Flat (Rossomondo & Lord, ACTFL2015)
The World Is Not Flat (Rossomondo & Lord, ACTFL2015)The World Is Not Flat (Rossomondo & Lord, ACTFL2015)
The World Is Not Flat (Rossomondo & Lord, ACTFL2015)Gillian Lord
 
Linq 2013 session_red_1_diadori_peppoloni
Linq 2013 session_red_1_diadori_peppoloniLinq 2013 session_red_1_diadori_peppoloni
Linq 2013 session_red_1_diadori_peppoloniLINQ_Conference
 
Embedding_OER_Into_Your_Learning_Managemen_System
Embedding_OER_Into_Your_Learning_Managemen_SystemEmbedding_OER_Into_Your_Learning_Managemen_System
Embedding_OER_Into_Your_Learning_Managemen_SystemJacqueline L. Frank
 
TDWG VoMaG Vocabulary management workflow, 2013-10-31
TDWG VoMaG Vocabulary management workflow, 2013-10-31TDWG VoMaG Vocabulary management workflow, 2013-10-31
TDWG VoMaG Vocabulary management workflow, 2013-10-31Dag Endresen
 
Definiendo el enfoque lfe
Definiendo el enfoque lfeDefiniendo el enfoque lfe
Definiendo el enfoque lfeNatalia Rojo
 
Presentation of Adaptive Software at CLIL 2010 Conference
Presentation of Adaptive Software at CLIL 2010 ConferencePresentation of Adaptive Software at CLIL 2010 Conference
Presentation of Adaptive Software at CLIL 2010 ConferenceTon Koenraad
 
Word processors in the classroom
Word processors in the classroomWord processors in the classroom
Word processors in the classroomDavid Mendocoreas
 
4. Edurne Ecay and Irune Labiano - Lack of a theoretical framework
4. Edurne Ecay  and Irune Labiano - Lack of a theoretical framework4. Edurne Ecay  and Irune Labiano - Lack of a theoretical framework
4. Edurne Ecay and Irune Labiano - Lack of a theoretical frameworkÚcar Marian
 
Lack of a theoretical framework
Lack of a theoretical framework Lack of a theoretical framework
Lack of a theoretical framework Edurne
 
Corpus Construction & Specialist Vocabulary Learning
Corpus Construction & Specialist Vocabulary LearningCorpus Construction & Specialist Vocabulary Learning
Corpus Construction & Specialist Vocabulary LearningSimon Smith
 
Calico 2014 intelligent call - def
Calico 2014   intelligent call - defCalico 2014   intelligent call - def
Calico 2014 intelligent call - defPiet Desmet
 
Siop model-and-research-findings
Siop model-and-research-findingsSiop model-and-research-findings
Siop model-and-research-findingsrobinsondurango67
 
Siop model-and-research-findings
Siop model-and-research-findingsSiop model-and-research-findings
Siop model-and-research-findingsrobinsondurango67
 

Similar to Multilingual vocabularies for the Web: Session on multilingual vocabularies, VocDay, DC conference Lisbon 2013 (20)

Sacodeyl Birmingham 2007
Sacodeyl Birmingham 2007Sacodeyl Birmingham 2007
Sacodeyl Birmingham 2007
 
OER: insights into a multilingual landscape - EUROCALL 2014 conference
OER: insights into a multilingual landscape - EUROCALL 2014 conference  OER: insights into a multilingual landscape - EUROCALL 2014 conference
OER: insights into a multilingual landscape - EUROCALL 2014 conference
 
Developing corpus-based resources for language learning: looking back in "hope"
Developing corpus-based resources for language learning: looking back in "hope"Developing corpus-based resources for language learning: looking back in "hope"
Developing corpus-based resources for language learning: looking back in "hope"
 
Comp app lexicography
Comp app lexicographyComp app lexicography
Comp app lexicography
 
The World Is Not Flat (Rossomondo & Lord, ACTFL2015)
The World Is Not Flat (Rossomondo & Lord, ACTFL2015)The World Is Not Flat (Rossomondo & Lord, ACTFL2015)
The World Is Not Flat (Rossomondo & Lord, ACTFL2015)
 
Linq 2013 session_red_1_diadori_peppoloni
Linq 2013 session_red_1_diadori_peppoloniLinq 2013 session_red_1_diadori_peppoloni
Linq 2013 session_red_1_diadori_peppoloni
 
Embedding_OER_Into_Your_Learning_Managemen_System
Embedding_OER_Into_Your_Learning_Managemen_SystemEmbedding_OER_Into_Your_Learning_Managemen_System
Embedding_OER_Into_Your_Learning_Managemen_System
 
TDWG VoMaG Vocabulary management workflow, 2013-10-31
TDWG VoMaG Vocabulary management workflow, 2013-10-31TDWG VoMaG Vocabulary management workflow, 2013-10-31
TDWG VoMaG Vocabulary management workflow, 2013-10-31
 
Definiendo el enfoque lfe
Definiendo el enfoque lfeDefiniendo el enfoque lfe
Definiendo el enfoque lfe
 
Presentation of Adaptive Software at CLIL 2010 Conference
Presentation of Adaptive Software at CLIL 2010 ConferencePresentation of Adaptive Software at CLIL 2010 Conference
Presentation of Adaptive Software at CLIL 2010 Conference
 
Word processors in the classroom
Word processors in the classroomWord processors in the classroom
Word processors in the classroom
 
ESP for PR-managers
ESP for PR-managersESP for PR-managers
ESP for PR-managers
 
4. Edurne Ecay and Irune Labiano - Lack of a theoretical framework
4. Edurne Ecay  and Irune Labiano - Lack of a theoretical framework4. Edurne Ecay  and Irune Labiano - Lack of a theoretical framework
4. Edurne Ecay and Irune Labiano - Lack of a theoretical framework
 
Lack of a theoretical framework
Lack of a theoretical framework Lack of a theoretical framework
Lack of a theoretical framework
 
Corpus Construction & Specialist Vocabulary Learning
Corpus Construction & Specialist Vocabulary LearningCorpus Construction & Specialist Vocabulary Learning
Corpus Construction & Specialist Vocabulary Learning
 
Calico 2014 intelligent call - def
Calico 2014   intelligent call - defCalico 2014   intelligent call - def
Calico 2014 intelligent call - def
 
UDL Presentation
UDL PresentationUDL Presentation
UDL Presentation
 
Siop model-and-research-findings
Siop model-and-research-findingsSiop model-and-research-findings
Siop model-and-research-findings
 
Siop model-and-research-findings
Siop model-and-research-findingsSiop model-and-research-findings
Siop model-and-research-findings
 
OWN-PT: Taking Stock
OWN-PT: Taking Stock OWN-PT: Taking Stock
OWN-PT: Taking Stock
 

Recently uploaded

Healthy Minds, Flourishing Lives: A Philosophical Approach to Mental Health a...
Healthy Minds, Flourishing Lives: A Philosophical Approach to Mental Health a...Healthy Minds, Flourishing Lives: A Philosophical Approach to Mental Health a...
Healthy Minds, Flourishing Lives: A Philosophical Approach to Mental Health a...Osopher
 
The role of Geography in climate education: science and active citizenship
The role of Geography in climate education: science and active citizenshipThe role of Geography in climate education: science and active citizenship
The role of Geography in climate education: science and active citizenshipKarl Donert
 
MS4 level being good citizen -imperative- (1) (1).pdf
MS4 level   being good citizen -imperative- (1) (1).pdfMS4 level   being good citizen -imperative- (1) (1).pdf
MS4 level being good citizen -imperative- (1) (1).pdfMr Bounab Samir
 
Narcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfNarcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfPrerana Jadhav
 
How to Manage Buy 3 Get 1 Free in Odoo 17
How to Manage Buy 3 Get 1 Free in Odoo 17How to Manage Buy 3 Get 1 Free in Odoo 17
How to Manage Buy 3 Get 1 Free in Odoo 17Celine George
 
Tree View Decoration Attribute in the Odoo 17
Tree View Decoration Attribute in the Odoo 17Tree View Decoration Attribute in the Odoo 17
Tree View Decoration Attribute in the Odoo 17Celine George
 
Indexing Structures in Database Management system.pdf
Indexing Structures in Database Management system.pdfIndexing Structures in Database Management system.pdf
Indexing Structures in Database Management system.pdfChristalin Nelson
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptxmary850239
 
BÀI TẬP BỔ TRỢ 4 KĨ NĂNG TIẾNG ANH LỚP 11 (CẢ NĂM) - FRIENDS GLOBAL - NĂM HỌC...
BÀI TẬP BỔ TRỢ 4 KĨ NĂNG TIẾNG ANH LỚP 11 (CẢ NĂM) - FRIENDS GLOBAL - NĂM HỌC...BÀI TẬP BỔ TRỢ 4 KĨ NĂNG TIẾNG ANH LỚP 11 (CẢ NĂM) - FRIENDS GLOBAL - NĂM HỌC...
BÀI TẬP BỔ TRỢ 4 KĨ NĂNG TIẾNG ANH LỚP 11 (CẢ NĂM) - FRIENDS GLOBAL - NĂM HỌC...Nguyen Thanh Tu Collection
 
Farrington HS Streamlines Guest Entrance
Farrington HS Streamlines Guest EntranceFarrington HS Streamlines Guest Entrance
Farrington HS Streamlines Guest Entrancejulius27264
 
Employablity presentation and Future Career Plan.pptx
Employablity presentation and Future Career Plan.pptxEmployablity presentation and Future Career Plan.pptx
Employablity presentation and Future Career Plan.pptxryandux83rd
 
How to Uninstall a Module in Odoo 17 Using Command Line
How to Uninstall a Module in Odoo 17 Using Command LineHow to Uninstall a Module in Odoo 17 Using Command Line
How to Uninstall a Module in Odoo 17 Using Command LineCeline George
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationdeepaannamalai16
 
Transdisciplinary Pathways for Urban Resilience [Work in Progress].pptx
Transdisciplinary Pathways for Urban Resilience [Work in Progress].pptxTransdisciplinary Pathways for Urban Resilience [Work in Progress].pptx
Transdisciplinary Pathways for Urban Resilience [Work in Progress].pptxinfo924062
 
Sarah Lahm In Media Res Media Component
Sarah Lahm  In Media Res Media ComponentSarah Lahm  In Media Res Media Component
Sarah Lahm In Media Res Media ComponentInMediaRes1
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...
BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...
BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...Nguyen Thanh Tu Collection
 
Geoffrey Chaucer Works II UGC NET JRF TGT PGT MA PHD Entrance Exam II History...
Geoffrey Chaucer Works II UGC NET JRF TGT PGT MA PHD Entrance Exam II History...Geoffrey Chaucer Works II UGC NET JRF TGT PGT MA PHD Entrance Exam II History...
Geoffrey Chaucer Works II UGC NET JRF TGT PGT MA PHD Entrance Exam II History...DrVipulVKapoor
 

Recently uploaded (20)

Healthy Minds, Flourishing Lives: A Philosophical Approach to Mental Health a...
Healthy Minds, Flourishing Lives: A Philosophical Approach to Mental Health a...Healthy Minds, Flourishing Lives: A Philosophical Approach to Mental Health a...
Healthy Minds, Flourishing Lives: A Philosophical Approach to Mental Health a...
 
CARNAVAL COM MAGIA E EUFORIA _
CARNAVAL COM MAGIA E EUFORIA            _CARNAVAL COM MAGIA E EUFORIA            _
CARNAVAL COM MAGIA E EUFORIA _
 
The role of Geography in climate education: science and active citizenship
The role of Geography in climate education: science and active citizenshipThe role of Geography in climate education: science and active citizenship
The role of Geography in climate education: science and active citizenship
 
MS4 level being good citizen -imperative- (1) (1).pdf
MS4 level   being good citizen -imperative- (1) (1).pdfMS4 level   being good citizen -imperative- (1) (1).pdf
MS4 level being good citizen -imperative- (1) (1).pdf
 
Narcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfNarcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdf
 
How to Manage Buy 3 Get 1 Free in Odoo 17
How to Manage Buy 3 Get 1 Free in Odoo 17How to Manage Buy 3 Get 1 Free in Odoo 17
How to Manage Buy 3 Get 1 Free in Odoo 17
 
Tree View Decoration Attribute in the Odoo 17
Tree View Decoration Attribute in the Odoo 17Tree View Decoration Attribute in the Odoo 17
Tree View Decoration Attribute in the Odoo 17
 
Indexing Structures in Database Management system.pdf
Indexing Structures in Database Management system.pdfIndexing Structures in Database Management system.pdf
Indexing Structures in Database Management system.pdf
 
Spearman's correlation,Formula,Advantages,
Spearman's correlation,Formula,Advantages,Spearman's correlation,Formula,Advantages,
Spearman's correlation,Formula,Advantages,
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx
 
BÀI TẬP BỔ TRỢ 4 KĨ NĂNG TIẾNG ANH LỚP 11 (CẢ NĂM) - FRIENDS GLOBAL - NĂM HỌC...
BÀI TẬP BỔ TRỢ 4 KĨ NĂNG TIẾNG ANH LỚP 11 (CẢ NĂM) - FRIENDS GLOBAL - NĂM HỌC...BÀI TẬP BỔ TRỢ 4 KĨ NĂNG TIẾNG ANH LỚP 11 (CẢ NĂM) - FRIENDS GLOBAL - NĂM HỌC...
BÀI TẬP BỔ TRỢ 4 KĨ NĂNG TIẾNG ANH LỚP 11 (CẢ NĂM) - FRIENDS GLOBAL - NĂM HỌC...
 
Farrington HS Streamlines Guest Entrance
Farrington HS Streamlines Guest EntranceFarrington HS Streamlines Guest Entrance
Farrington HS Streamlines Guest Entrance
 
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
 
Employablity presentation and Future Career Plan.pptx
Employablity presentation and Future Career Plan.pptxEmployablity presentation and Future Career Plan.pptx
Employablity presentation and Future Career Plan.pptx
 
How to Uninstall a Module in Odoo 17 Using Command Line
How to Uninstall a Module in Odoo 17 Using Command LineHow to Uninstall a Module in Odoo 17 Using Command Line
How to Uninstall a Module in Odoo 17 Using Command Line
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentation
 
Transdisciplinary Pathways for Urban Resilience [Work in Progress].pptx
Transdisciplinary Pathways for Urban Resilience [Work in Progress].pptxTransdisciplinary Pathways for Urban Resilience [Work in Progress].pptx
Transdisciplinary Pathways for Urban Resilience [Work in Progress].pptx
 
Sarah Lahm In Media Res Media Component
Sarah Lahm  In Media Res Media ComponentSarah Lahm  In Media Res Media Component
Sarah Lahm In Media Res Media Component
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...
BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...
BÀI TẬP BỔ TRỢ TIẾNG ANH 11 THEO ĐƠN VỊ BÀI HỌC - CẢ NĂM - CÓ FILE NGHE (GLOB...
 
Geoffrey Chaucer Works II UGC NET JRF TGT PGT MA PHD Entrance Exam II History...
Geoffrey Chaucer Works II UGC NET JRF TGT PGT MA PHD Entrance Exam II History...Geoffrey Chaucer Works II UGC NET JRF TGT PGT MA PHD Entrance Exam II History...
Geoffrey Chaucer Works II UGC NET JRF TGT PGT MA PHD Entrance Exam II History...
 

Multilingual vocabularies for the Web: Session on multilingual vocabularies, VocDay, DC conference Lisbon 2013

  • 1. SESSION ON MULTILINGUAL VOCABULARY Development and extension DanielVila-Suero (dvila@fi.upm.es) Ontology Engineering Group, UPM (Madrid) Slides available at
  • 2. GOAL Open discussion around vocabularies enabled for multilingual environments (WWW) Introduce some examples: current situation and efforts. More open questions than answers. Promote collaboration ß
  • 3. SESSION OUTLINE 1. Introduction to the session and the topic 2. “Representing multilingual lexical and terminological information in RDF vocabularies” Elena Montiel-Ponsoda, OEG-UPM 3. “Metadata registry of the Publications office of the EU” Michael Düro. PO-EU OPENDISCUSSION OPEN DISCUSSION
  • 4. THISTALK 1. Why should we care about multilingual vocabularies? 2. What is a multilingual vocabulary? 3. Current situation: when and who
  • 5. WHY The primary design principle underlying the Web’s usefulness and growth is universality.When you make a link, you can link to anything.That means people must be able to put anything on the Web, no matter what computer they have, software they use or human language they speak… Tim Berners-Lee ß
  • 6. ß WHY The primary design principle underlying the Web’s usefulness and growth is universality.When you make a link, you can link to anything.That means people must be able to put anything on the Web, no matter what computer they have, software they use or human language they speak… Tim Berners-LeeVocabularies are becoming a central part of the WWW
  • 7. LANGUAGES ARE USEFUL •For Humans ★Finding vocabularies, terms, etc. ★Understading their semantics, how to use them ★... •and Machines... ★ Search, ranking, resource discovery ★Natural Language Processing applications: multilingual question answering, localized presentation of data ★....
  • 9. WHY 24 ranked results including the term project in Japanese
  • 10. SOME FACTS ABOUT LOV • Data retrieved 12.04.2013* out of 326 vocabs 0 75 150 225 300 Monolingual (EN) Monolingual (No EN) Multilingual Non especified 425358 223 Number of vocabularies * “Guidelines for Multilingual Linked Data” Gómez-Pérez et al., 2013
  • 11. SOME FACTS ABOUT LOV • LOV loves multilingual descriptions: indexing, ranked search results. •But, still very low usage of language tags for vocabulary elements < 60% •Other semantic search engines (Sindice, Falcons, SWSE..) lack support for multiple languages
  • 12. WHAT IS AN MLVOCAB? • Simple (general) answer: “A vocabulary which includes labels and documentation in multiple languages” •Are there other flavors of multilingual vocabularies?
  • 14. FLAVORS 2. EXTERNAL MODEL personal docente profesor titular catedrático academic staff associate professor (full) professor Lehrpersonal Privatdozent Professor P01 P011 P013 P012 Legend! subClassOf! Mappings! ISSUES: Directionality of links, different namespaces, resolution of URIs (at http level with header, htaccess, external service..-)
  • 15. FLAVORS 3. MAPPING MODEL cuerpo docente catedrático Lehrpersonal profesor contratado assistant professor associate professor full professor Dozent … academic staff P011 P013P012 profesor titular … … P01 Legend! subClassOf! Mappings! Professor wissenschaftlicher Mitarbeiter Privatdozent…
  • 16. WHAT FLAVOR IS MINE ?
  • 17. WHAT FLAVOR IS MINE? • Depends on a number of factors: ★Your starting point (starting from scratch? can you modify the terms within your original namespace? are there similar vocabularies in other langs?) ★Your needs (linguistically complex model, simplicity, efficiency, et) ★Your available resources (time, people, money...) ★......... •Selection should be USE CASE DRIVEN
  • 19. POLICY POLICY ★Vocabulary publishers should commit to a translation policy: e.g.,What are the protocols for including/ developing/validating a new translation? ★Establish the neccesary mechanisms to manage and assess the quality, sinchronization and appropriate coverage between different languages. ★Again, should be based on requirements, goals, etc. and be UC driven
  • 20. PROCESS PROCESS ★Translation workflows: versioning, notification, edition, validation mechanisms, etc. ★Develop methodologies, guidelines and best practices for translating and including new languages. ★Establish communication protocols between the responsibles of the different translations (languages) ★Coordination among the people involved
  • 21. REPRESENTATION REPRESENTATION ★Choose your modelling approach: ★rdfs and skos labels and descriptions ★Specialized models (lemon, ontolex etc.) ★ Mappings ★ Guidelines for: ★ Naming: coining new URIs for terms ★Labeling: Defining the structure of the labels (should we use verbs, full sentences, etc.)
  • 22. INFRASTRUCTURE INFRASTRUCTURE ★Manage different aspects: ★Management of translation/ edition workflows: notifications, review process, versioning, etc. ★Access to vocabulary elements: localize access? different namespace for the linguistic descriptions? ★Generation of human-readable documentation ★Look at MLOD patterns and guidelines
  • 23. WHEN AND WHO • Learn from (succesful) initiatives: ★ FAO’s AGROVOC ★ EUROVOC ★WORDNET ★ IFLAVocabularies and Guidelines for translations ★.... •Get involved in initiatives around the topic: ★W3C Internationalization Activity ★W3C Best practices for Multilingual LOD CG ★W3C Ontology-Lexica CG ★EU Lider project
  • 26. LIDER-PROJECT.EU Linguistic Linked Data (including vocabularies) can serve as an enabler technology for content analytics on the Multilingual Web. Universidad Politécnica de Madrid (Spain) Trinity College (Ireland) DFKI (Germany) National University of Ireland , Galway (Ireland) Institut für Angewandte Informatik (Germany) Universität Bielefeld (Germany) Universita Roma la Sapienza (Italy) W3C/ERCIM (France)
  • 27. LIDER-PROJECT.EU • Development of best practices and guidelines for publishing multilingual linked data resources (including vocabularies). •Events: W3C Multilingual Web workshop, hackhathons, industrial events, etc. •Help organizations with publishing Multilingual Linked Data resources Get involved!
  • 28. THANK YOU VERY MUCH dvila@fi.upm, @dvilasuero