META-NET and META-SHARE:
Language Technology for Europe
Georg Rehm
Network Manager META-NET
DFKI, Germany
georg.rehm@dfki.de

CHAT 2012 Workshop – Madrid, Spain
June 22, 2012

Co-funded by the 7th Framework Programme and the ICT Policy Support Programme of the European Commission through
the contracts T4ME, CESAR, METANET4U, META-NORD (grant agreements no. 249119, 271022, 270893, 270899).
Outline
• Introduction
• META-SHARE
• Language White Paper Series
• Strategic Research Agenda

http://www.meta-net.eu

Multilingual Europe
• Challenge: Providing each language community with the most advanced technologies for communication and information so that maintaining their mother tongue does not turn into a disadvantage.
• While research has made considerable progress in recent years, the pace of progress is not fast enough to meet the challenge within the next 10-20 years.
• All stakeholders – researchers, LT user and provider industries, language communities, funding programmes, policy makers – should team up for a major dedicated push.

Objectives
META-NET is a network of excellence dedicated to fostering the technological foundations of the European multilingual information society.

Four Funded Projects
• Initial project: T4ME (FP7; 13 partners, 10 countries)
• Three additional consortia (ICT-PSP) since Feb. 2011: CESAR, METANET4U, META-NORD
• All EU member states and several non-member states covered.
• META-NET in June 2012: 60 members in 34 countries. http://www.meta-net.eu/members

META-SHARE

META-SHARE at a Glance
• Open exchange infrastructure for language resources and tools.
• Language resources and tools are documented, uploaded, stored in repositories and catalogued; they can be downloaded, shared and discussed.
• Aim: improve their visibility, documentation, identification, availability, preservation and interoperability.
• Long-term goal: boost research, technology and innovation through wide availability, pooling, openness and sharing of resources.
• Repositories store and maintain inventories of resources and tools.
• Metadata inventories are exported and harvested in the network.
• Currently 13 repositories up and running; ~1,300 LRs available.
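
The export-and-harvest idea can be pictured with a small Python sketch. It is not the META-SHARE implementation or its protocol: the class names, fields and the "first node wins" rule for duplicate identifiers are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class MetadataRecord:
    """Hypothetical, heavily simplified description of one resource."""
    resource_id: str
    name: str
    resource_type: str  # e.g. "corpus" or "toolService"


@dataclass
class Repository:
    """A node that stores its own inventory and can export it."""
    name: str
    inventory: dict = field(default_factory=dict)

    def add(self, record: MetadataRecord) -> None:
        self.inventory[record.resource_id] = record

    def export_inventory(self) -> list:
        # Only metadata is exported; the resources themselves stay in place.
        return list(self.inventory.values())


def harvest(repositories) -> dict:
    """Merge the exported inventories of all nodes into one catalogue.

    Maps resource_id -> (source repository name, record). If two nodes
    export the same id, the first one seen wins (an arbitrary choice here).
    """
    catalogue = {}
    for repo in repositories:
        for record in repo.export_inventory():
            catalogue.setdefault(record.resource_id, (repo.name, record))
    return catalogue


# Made-up example data, not real META-SHARE entries.
node_a = Repository("node-a")
node_a.add(MetadataRecord("a-1", "Example corpus", "corpus"))
node_b = Repository("node-b")
node_b.add(MetadataRecord("b-1", "Example tagger", "toolService"))
node_b.add(MetadataRecord("a-1", "Duplicate entry", "corpus"))

catalogue = harvest([node_a, node_b])
print(sorted(catalogue))
```

In this sketch each node keeps its resources locally and shares only descriptions, which is one plausible reading of "metadata inventories are exported and harvested".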

Metadata Descriptions
• Unit of description: LRs/LTs and related objects
• Metadata descriptions for
  - Sets of text/audio/video etc. files (corpora)
  - Sets of lexical entries (lexical/conceptual resources)
  - Tools or services etc.
• Components group together elements and relations
  - General components common to all LR types
  - LR type-specific components
• Profiles for each LR type are built upon components
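
A minimal Python sketch of how profiles could be assembled from components. The component and element names below are invented for illustration and are not taken from the actual META-SHARE schema; only the overall shape (general components plus type-specific components) follows the slide.

```python
# Hypothetical component and element names, for illustration only.
GENERAL_COMPONENTS = {
    "identification": {"resourceName", "description"},
    "distribution": {"licence"},
}

TYPE_SPECIFIC_COMPONENTS = {
    "corpus": {"corpusInfo": {"mediaType", "size"}},
    "lexicalConceptualResource": {"lexiconInfo": {"entryCount"}},
    "toolService": {"toolInfo": {"inputFormat"}},
}


def build_profile(resource_type):
    """A profile is the general components plus the type-specific ones."""
    profile = dict(GENERAL_COMPONENTS)
    profile.update(TYPE_SPECIFIC_COMPONENTS[resource_type])
    return profile


def missing_elements(resource_type, description):
    """Return the profile elements absent from a metadata description."""
    required = set().union(*build_profile(resource_type).values())
    return sorted(required - set(description))


# Made-up metadata description of a corpus.
description = {
    "resourceName": "Example corpus",
    "licence": "CC-BY",
    "mediaType": "text",
}
print(missing_elements("corpus", description))  # ['description', 'size']
```

Sharing the general components across all types keeps descriptions comparable, while the type-specific part carries what only makes sense for, say, a corpus or a tool.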

Ontology excerpt

Current Contents
[Charts: "Resource types" (corpus, lexicalConceptualResource, toolService, languageDescription) and "Distribution per Language" (English, Spanish, French, German, Italian, Chinese, Dutch, Portuguese, Greek, Swedish, Catalan, Arabic, Japanese, Romanian, Danish, Polish, Other)]

Recent Developments
• Several organizations outside META-NET want to set up their own META-SHARE repositories (e.g., JRC Optima Activity).
• Incoming resources from CESAR, METANET4U and META-NORD and also from additional initiatives (collaboration agreements with >45 EU-funded projects).
• ELRA will adopt META-SHARE as the central model for LR sharing and will provide services in its operation.
• META-SHARE Version 3.0 to be released in September 2012.

META-VISION

Language White Paper Series

Language White Papers
• Key communication instruments to address decision makers and journalists.
• Cover all EU languages (30 volumes).
• Report on the state of a language (general, social, strategic and technological aspects) and the level of support through language technology.
• Inform target group about societal and technological problems and challenges as well as economic opportunities.
• Published by Springer.
• PDF versions available for free.

30 Languages Covered
• Basque
• Bulgarian*
• Catalan
• Czech*
• Danish*
• Dutch*
• English*
• Estonian*
• Finnish*
• French*
• Galician
• German*
• Greek*
• Hungarian*
• Icelandic
• Irish*
• Italian*
• Latvian*
• Lithuanian*
• Maltese*
• Norwegian
• Polish*
• Portuguese*
• Romanian*
• Serbian
• Slovak*
• Slovene*
• Spanish*
• Swedish*
• Croatian

* = Official EU language

A few Numbers …
• >160 national experts contributed as authors or co-authors (ca. 5 per language on average)
• >50 additional experts have contributed data and information
• >8,000 copies will be printed and distributed by META-NET

Cross-Lingual Ranking
• Journalists and politicians need simple and clear messages.
• For four application areas, each language is assigned to one of five clusters, ranging from excellent LT support to weak/no support:
  1. Machine Translation
  2. Speech Processing
  3. Text Analysis
  4. Resources
• Finalised at a meeting with representatives of all 30 languages (Oct. 21/22, 2011).

MT & Speech Processing

Machine Translation
• excellent: (no language)
• good: English
• moderate: French, Spanish
• fragmentary: Catalan, Dutch, German, Hungarian, Italian, Polish, Romanian
• weak or no support: Basque, Bulgarian, Croatian, Czech, Danish, Estonian, Finnish, Galician, Greek, Icelandic, Irish, Latvian, Lithuanian, Maltese, Norwegian, Portuguese, Serbian, Slovak, Slovene, Swedish

Speech Processing
• excellent: (no language)
• good: English
• moderate: Czech, Dutch, Finnish, French, German, Italian, Portuguese, Spanish
• fragmentary: Basque, Bulgarian, Catalan, Danish, Estonian, Galician, Greek, Hungarian, Irish, Norwegian, Polish, Serbian, Slovak, Slovene, Swedish
• weak or no support: Croatian, Icelandic, Latvian, Lithuanian, Maltese, Romanian

Text Analysis & Resources

Text Analysis
• excellent: (no language)
• good: English
• moderate: Dutch, French, German, Italian, Spanish
• fragmentary: Basque, Bulgarian, Catalan, Czech, Danish, Finnish, Galician, Greek, Hungarian, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovene, Swedish
• weak or no support: Croatian, Estonian, Icelandic, Irish, Latvian, Lithuanian, Maltese, Serbian

Resources
• excellent: (no language)
• good: English
• moderate: Czech, Dutch, French, German, Hungarian, Italian, Polish, Spanish, Swedish
• fragmentary: Basque, Bulgarian, Catalan, Croatian, Danish, Estonian, Finnish, Galician, Greek, Norwegian, Portuguese, Romanian, Serbian, Slovak, Slovene
• weak or no support: Icelandic, Irish, Latvian, Lithuanian, Maltese

Europe’s Languages and LT

Language groups, ordered from good support through Language Technology (first) to weak or no support (last):
• English
• Dutch, French, German, Italian, Spanish
• Catalan, Czech, Finnish, Hungarian, Polish, Portuguese, Swedish
• Basque, Bulgarian, Danish, Galician, Greek, Norwegian, Romanian, Slovak, Slovene
• Croatian, Estonian, Icelandic, Irish, Latvian, Lithuanian, Maltese, Serbian

Key Results
• When it comes to Language Technology support, there are massive differences between Europe’s languages and technology areas.
• Language Technology support for English is ahead of any other language – but far from being perfect.
• For 16 of 30 languages, Language Technology support is only fragmentary, very weak or non-existent!
• We now have a list of gaps and needs for all 30 languages which need to be addressed in the years to come.

Recent Developments
• On June 20 we launched a new version of the Language White Paper website: http://www.meta-net.eu/whitepapers.
• Ca. 20 of the 30 white papers are online (PDF).
• First batch of printed copies (published by Springer) will be available soon.
• New volumes upcoming (such as “European Sign Languages in the Digital Age”).
• Dissemination of printed volumes to politicians and journalists to start in July.

META-VISION

Strategic Research Agenda

Shared Vision and SRA
• Mobilize researchers, users and providers of LT for cooperation and collaboration to initiate and support a large-scale joint action:
  - building a community around Language Technology in Europe (META),
  - creating a shared vision,
  - preparing a Strategic Research Agenda for Multilingual Europe 2020, to be presented to national/international politicians, funding agencies and decision makers.

[Diagram labels: Vision & Agenda; Appropriate Actors; Appropriate Programme; Appropriate Support Funding; Commercialisation & Research]

From Visions to the SRA
• Three Vision Groups brought together researchers, developers, integrators and (corporate or professional) users of LT-based products, services and applications (ca. 25 members each).
• Collected domain-specific visions and prepared individual reports.
• Vision Group Translation and Localisation
  - July 23, 2010: Berlin, Germany
  - September 28, 2010: Brussels, Belgium
  - April 7/8, 2011: Prague, Czech Republic
• Vision Group Media and Information Services
  - September 10, 2010: Paris, France
  - October 15, 2010: Barcelona, Spain
  - April 1, 2011: Vienna, Austria
• Vision Group Interactive Systems
  - September 10, 2010: Paris, France
  - October 5, 2010: Prague, Czech Republic
  - March 28, 2011: Utrecht, The Netherlands

Steps taken towards the SRA
• Many suggestions by: Vision Group members; META-NET; META.
• Additional input in meetings, workshops, discussions etc.
• Language White Papers: research gaps; funding situation etc.
• We screened the Strategic Research Agendas of other initiatives.
• We discussed procedures, input, structure and content of the SRA in four meetings of the META Technology Council.
  - Brussels, Belgium, November 16, 2010
  - Venice, Italy, May 25, 2011
  - Berlin, Germany, September 30, 2011
  - Brussels, Belgium, June 19, 2012

From Visions to the SRA
[Diagram, timeline 2010 / 2011 / 2012: Expert meeting minutes; Vision Group reports (Translation and Localisation; Media and Information Services; Interactive Systems); Vision Paper; Priority Themes Paper; Strategic Research Agenda]
Priority Research Themes

LT 2020
Vision and Priority Themes for Language Technology Research in Europe until the Year 2020
Towards the META-NET Strategic Research Agenda

Do you have comments, ideas or suggestions with regard to the content of this document? Please send them to office@meta-net.eu or discuss them online: http://www.meta-net.eu/sra.

The development of this paper has been funded by the Seventh Framework Programme and the ICT Policy Support Programme of the European Commission under contracts T4ME (Grant Agreement 249119), CESAR (Grant Agreement 271022), METANET4U (Grant Agreement 270893) and META-NORD (Grant Agreement 270899).

META
• META-NET is a network of excellence.
• META is an open and growing strategic technology alliance: Multilingual Europe Technology Alliance.
  - 630+ members, including W3C, Google, Microsoft, GALA, research centres, LT companies, many EC funded projects etc.
  - Main goal: to support our Strategic Research Agenda.
  - Join us! http://www.meta-net.eu/join

Next Steps
• Solid first version of SRA to be available online on July 2 – we need as much feedback as possible! Check our website!
• Disseminate the Language White Papers widely.
• Make META-SHARE sustainable.
• Disseminate the SRA widely – meet with national research planners, funders, policy makers and inform them about the strategy.
• Roll out META-SHARE for the whole LT community in Sep. 2012.
• Upcoming opportunities can provide sufficient resources to make our visions for Europe’s citizens and economy, as described in the SRA, a reality (Horizon 2020 and Connecting Europe Facility, CEF).

Q/A

Thank you very much!

office@meta-net.eu
http://www.meta-net.eu
http://www.facebook.com/META.Alliance
34

More Related Content

What's hot

What's hot (20)

Computational Morphology and the META-NET Strategic Research Agenda for Multi...
Computational Morphology and the META-NET Strategic Research Agenda for Multi...Computational Morphology and the META-NET Strategic Research Agenda for Multi...
Computational Morphology and the META-NET Strategic Research Agenda for Multi...
 
Human Language Technologies in a Multilingual Europe
Human Language Technologies in a Multilingual EuropeHuman Language Technologies in a Multilingual Europe
Human Language Technologies in a Multilingual Europe
 
Language Technologies for Big Data – A Strategic Agenda for the Multilingual ...
Language Technologies for Big Data – A Strategic Agenda for the Multilingual ...Language Technologies for Big Data – A Strategic Agenda for the Multilingual ...
Language Technologies for Big Data – A Strategic Agenda for the Multilingual ...
 
The Strategic Agenda for the Multilingual Digital Single Market V0.9
The Strategic Agenda for the Multilingual Digital Single Market V0.9The Strategic Agenda for the Multilingual Digital Single Market V0.9
The Strategic Agenda for the Multilingual Digital Single Market V0.9
 
Multilingual Europe in late 2016 – A Strategic Research and Innovation Agenda...
Multilingual Europe in late 2016 – A Strategic Research and Innovation Agenda...Multilingual Europe in late 2016 – A Strategic Research and Innovation Agenda...
Multilingual Europe in late 2016 – A Strategic Research and Innovation Agenda...
 
Language Technology for Multilingual Europe
Language Technology for Multilingual EuropeLanguage Technology for Multilingual Europe
Language Technology for Multilingual Europe
 
Multilingualism for Digital Europe
Multilingualism for Digital EuropeMultilingualism for Digital Europe
Multilingualism for Digital Europe
 
AI for Translation Technologies and Multilingual Europe
AI for Translation Technologies and Multilingual EuropeAI for Translation Technologies and Multilingual Europe
AI for Translation Technologies and Multilingual Europe
 
European Language Technologies – Past, Present and Future
European Language Technologies – Past, Present and FutureEuropean Language Technologies – Past, Present and Future
European Language Technologies – Past, Present and Future
 
Promoting the Use of Basque via Language Technology
Promoting the Use of Basque via Language TechnologyPromoting the Use of Basque via Language Technology
Promoting the Use of Basque via Language Technology
 
The META-NET Strategic Research Agenda and Linked Open Data
The META-NET Strategic Research Agenda and Linked Open DataThe META-NET Strategic Research Agenda and Linked Open Data
The META-NET Strategic Research Agenda and Linked Open Data
 
Towards a Human Language Project for Multilingual Europe: AI and Interpretation
Towards a Human Language Project for Multilingual Europe: AI and InterpretationTowards a Human Language Project for Multilingual Europe: AI and Interpretation
Towards a Human Language Project for Multilingual Europe: AI and Interpretation
 
Celtic language technologies in the digital age
Celtic language technologies in the digital ageCeltic language technologies in the digital age
Celtic language technologies in the digital age
 
ELSE IF 2019: What’s next for Multilingual Europe?
ELSE IF 2019: What’s next for Multilingual Europe?ELSE IF 2019: What’s next for Multilingual Europe?
ELSE IF 2019: What’s next for Multilingual Europe?
 
Language Technologies for Multilingual Europe - Towards a Human Language Proj...
Language Technologies for Multilingual Europe - Towards a Human Language Proj...Language Technologies for Multilingual Europe - Towards a Human Language Proj...
Language Technologies for Multilingual Europe - Towards a Human Language Proj...
 
TAUS MT Showcase, MT@EC for European public administrations and online servic...
TAUS MT Showcase, MT@EC for European public administrations and online servic...TAUS MT Showcase, MT@EC for European public administrations and online servic...
TAUS MT Showcase, MT@EC for European public administrations and online servic...
 
Is MT ready for e-Government? The Latvian Story. Indra Samite, Tilde
Is MT ready for e-Government? The Latvian Story. Indra Samite, TildeIs MT ready for e-Government? The Latvian Story. Indra Samite, Tilde
Is MT ready for e-Government? The Latvian Story. Indra Samite, Tilde
 
META-NET and META-SHARE: An Overview
META-NET and META-SHARE: An OverviewMETA-NET and META-SHARE: An Overview
META-NET and META-SHARE: An Overview
 
NIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate reportNIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate report
 
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 1...
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 1...Europeana meeting under Finland’s Presidency of the Council of the EU - Day 1...
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 1...
 

Similar to META-NET and META-SHARE: Language Technology for Europe

Similar to META-NET and META-SHARE: Language Technology for Europe (13)

AI and Conference Interpretation – From Smart Assistants for the Human Interp...
AI and Conference Interpretation – From Smart Assistants for the Human Interp...AI and Conference Interpretation – From Smart Assistants for the Human Interp...
AI and Conference Interpretation – From Smart Assistants for the Human Interp...
 
TAUS Roundtable Moscow, Is MT Ready for e-Government, The Latvian Story, Indr...
TAUS Roundtable Moscow, Is MT Ready for e-Government, The Latvian Story, Indr...TAUS Roundtable Moscow, Is MT Ready for e-Government, The Latvian Story, Indr...
TAUS Roundtable Moscow, Is MT Ready for e-Government, The Latvian Story, Indr...
 
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...
 
Language technology market and components taxonomy
Language technology market and components taxonomyLanguage technology market and components taxonomy
Language technology market and components taxonomy
 
TAUS MT Showcace, MT Applications in the EU Public Sector, Adrejs Vasiljevs, ...
TAUS MT Showcace, MT Applications in the EU Public Sector, Adrejs Vasiljevs, ...TAUS MT Showcace, MT Applications in the EU Public Sector, Adrejs Vasiljevs, ...
TAUS MT Showcace, MT Applications in the EU Public Sector, Adrejs Vasiljevs, ...
 
FrameNet development for Latvian
FrameNet development for LatvianFrameNet development for Latvian
FrameNet development for Latvian
 
Sala+ Presentation Cali Cartagena Octubre 2008 V0.0
Sala+ Presentation Cali Cartagena Octubre 2008 V0.0Sala+ Presentation Cali Cartagena Octubre 2008 V0.0
Sala+ Presentation Cali Cartagena Octubre 2008 V0.0
 
Workflow Development for OCR (and beyond)
Workflow Development for OCR (and beyond)Workflow Development for OCR (and beyond)
Workflow Development for OCR (and beyond)
 
Apertium: Free/open-source rule-based machine translation and language proces...
Apertium: Free/open-source rule-based machine translation and language proces...Apertium: Free/open-source rule-based machine translation and language proces...
Apertium: Free/open-source rule-based machine translation and language proces...
 
Davor Sostaric - IPv6 in Slovenia
Davor Sostaric - IPv6 in SloveniaDavor Sostaric - IPv6 in Slovenia
Davor Sostaric - IPv6 in Slovenia
 
Centre of Competence in digitisation. Clemens Neudecker
Centre of Competence in digitisation. Clemens NeudeckerCentre of Competence in digitisation. Clemens Neudecker
Centre of Competence in digitisation. Clemens Neudecker
 
Experimental Workflow Development in Digitisation
Experimental Workflow Development in DigitisationExperimental Workflow Development in Digitisation
Experimental Workflow Development in Digitisation
 
Spanish Language Technology Plan. David Pérez Fernández, Cabinet of State Sec...
Spanish Language Technology Plan. David Pérez Fernández, Cabinet of State Sec...Spanish Language Technology Plan. David Pérez Fernández, Cabinet of State Sec...
Spanish Language Technology Plan. David Pérez Fernández, Cabinet of State Sec...
 

More from Georg Rehm

More from Georg Rehm (18)

QURATOR: A Flexible AI Platform for the Adaptive Analysis and Creative Genera...
QURATOR: A Flexible AI Platform for the Adaptive Analysis and Creative Genera...QURATOR: A Flexible AI Platform for the Adaptive Analysis and Creative Genera...
QURATOR: A Flexible AI Platform for the Adaptive Analysis and Creative Genera...
 
Observations on Annotations – From Computational Linguistics and the World Wi...
Observations on Annotations – From Computational Linguistics and the World Wi...Observations on Annotations – From Computational Linguistics and the World Wi...
Observations on Annotations – From Computational Linguistics and the World Wi...
 
The Preparation, Impact and Future of the META-NET White Paper Series “Europe...
The Preparation, Impact and Future of the META-NET White Paper Series “Europe...The Preparation, Impact and Future of the META-NET White Paper Series “Europe...
The Preparation, Impact and Future of the META-NET White Paper Series “Europe...
 
Künstliche Intelligenz beim Dolmetschen und Übersetzen
Künstliche Intelligenz beim Dolmetschen und ÜbersetzenKünstliche Intelligenz beim Dolmetschen und Übersetzen
Künstliche Intelligenz beim Dolmetschen und Übersetzen
 
Herausforderungen und Lösungen für die europäische Sprachtechnologie- Forschu...
Herausforderungen und Lösungen für die europäische Sprachtechnologie- Forschu...Herausforderungen und Lösungen für die europäische Sprachtechnologie- Forschu...
Herausforderungen und Lösungen für die europäische Sprachtechnologie- Forschu...
 
KI, Sprachtechnologie und Digital Humanities: Ein (unvollständiger) Überblick
KI, Sprachtechnologie und Digital Humanities: Ein (unvollständiger) ÜberblickKI, Sprachtechnologie und Digital Humanities: Ein (unvollständiger) Überblick
KI, Sprachtechnologie und Digital Humanities: Ein (unvollständiger) Überblick
 
Kuratieren im Zeitalter der KI
Kuratieren im Zeitalter der KIKuratieren im Zeitalter der KI
Kuratieren im Zeitalter der KI
 
Artificial Intelligence for the Film Industry
Artificial Intelligence for the Film IndustryArtificial Intelligence for the Film Industry
Artificial Intelligence for the Film Industry
 
KI für die Kundenkommunikation
KI für die KundenkommunikationKI für die Kundenkommunikation
KI für die Kundenkommunikation
 
Transformieren, Manipulieren, Kuratieren: Technologien für die Wissensarbeit ...
Transformieren, Manipulieren, Kuratieren: Technologien für die Wissensarbeit ...Transformieren, Manipulieren, Kuratieren: Technologien für die Wissensarbeit ...
Transformieren, Manipulieren, Kuratieren: Technologien für die Wissensarbeit ...
 
Digitale Kuratierungstechnologien: Anwendungsfälle in Digitalen Bibliotheken
Digitale Kuratierungstechnologien: Anwendungsfälle in Digitalen BibliothekenDigitale Kuratierungstechnologien: Anwendungsfälle in Digitalen Bibliotheken
Digitale Kuratierungstechnologien: Anwendungsfälle in Digitalen Bibliotheken
 
EPUB, quo vadis? Publishing im W3C
EPUB, quo vadis? Publishing im W3CEPUB, quo vadis? Publishing im W3C
EPUB, quo vadis? Publishing im W3C
 
Digitale Kuratierungstechnologien für verschiedene Branchen und Anwendungssze...
Digitale Kuratierungstechnologien für verschiedene Branchen und Anwendungssze...Digitale Kuratierungstechnologien für verschiedene Branchen und Anwendungssze...
Digitale Kuratierungstechnologien für verschiedene Branchen und Anwendungssze...
 
Generische Kuratierungstechnologien für spezifische Anwendungsfälle: Hintergr...
Generische Kuratierungstechnologien für spezifische Anwendungsfälle: Hintergr...Generische Kuratierungstechnologien für spezifische Anwendungsfälle: Hintergr...
Generische Kuratierungstechnologien für spezifische Anwendungsfälle: Hintergr...
 
Curation Technologies for Multilingual Europe
Curation Technologies for Multilingual EuropeCuration Technologies for Multilingual Europe
Curation Technologies for Multilingual Europe
 
Web Annotations – A Game Changer for Language Technology?
Web Annotations – A Game Changer for Language Technology?Web Annotations – A Game Changer for Language Technology?
Web Annotations – A Game Changer for Language Technology?
 
Globale Standards im Web of Things
Globale Standards im Web of ThingsGlobale Standards im Web of Things
Globale Standards im Web of Things
 
W3C/DFKI Automotive Workshop
W3C/DFKI Automotive WorkshopW3C/DFKI Automotive Workshop
W3C/DFKI Automotive Workshop
 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Recently uploaded (20)

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 

META-NET and META-SHARE: Language Technology for Europe

  • 1. META-NET and META-SHARE: Language Technology for Europe. Georg Rehm, Network Manager META-NET, DFKI, Germany, georg.rehm@dfki.de. CHAT 2012 Workshop, Madrid, Spain, June 22, 2012. Co-funded by the 7th Framework Programme and the ICT Policy Support Programme of the European Commission through the contracts T4ME, CESAR, METANET4U, META-NORD (grant agreements no. 249119, 271022, 270893, 270899).
  • 2. Outline
      - Introduction
      - META-SHARE
      - Language White Paper Series
      - Strategic Research Agenda
      http://www.meta-net.eu
  • 3. Multilingual Europe
      - Challenge: Providing each language community with the most advanced technologies for communication and information so that maintaining their mother tongue does not turn into a disadvantage.
      - While research has made considerable progress in recent years, the pace of progress is not fast enough to meet the challenge within the next 10-20 years.
      - All stakeholders (researchers, LT user and provider industries, language communities, funding programmes, policy makers) should team up for a major dedicated push.
  • 4. Objectives: META-NET is a network of excellence dedicated to fostering the technological foundations of the European multilingual information society.
  • 5. Four Funded Projects
      - Initial project: T4ME (FP7; 13 partners, 10 countries).
      - Three additional consortia (ICT-PSP) since Feb. 2011: CESAR, METANET4U, META-NORD.
      - All EU member states and several non-member states covered.
      - META-NET in June 2012: 60 members in 34 countries. http://www.meta-net.eu/members
  • 7. META-SHARE at a Glance
      - Open exchange infrastructure for language resources and tools.
      - Language resources and tools are documented, uploaded, stored in repositories, catalogued, and can be downloaded, shared and discussed.
      - Improve their visibility, documentation, identification, availability, preservation, interoperability.
      - Long-term goal: boost research, technology and innovation through wide availability, pooling, openness and sharing of resources.
      - Repositories store and maintain inventories of resources and tools.
      - Metadata inventories are exported and harvested in the network.
      - Currently 13 repositories up and running; about 1,300 language resources (LRs) available.
  • 9. Metadata Descriptions
      - Unit of description: LRs/LTs and related objects.
      - Metadata descriptions for: sets of text/audio/video etc. files (corpora); sets of lexical entries (lexical/conceptual resources); tools or services etc.
      - Components group together elements and relations: general components common to all LR types; LR type-specific components.
      - Profiles for each LR type are built upon components.
  • 14. Current Contents [Charts: resource types (corpus, lexicalConceptualResource, toolService, languageDescription); distribution per language (English, Spanish, French, German, Italian, Chinese, Dutch, Portuguese, Greek, Swedish, Catalan, Arabic, Japanese, Romanian, Danish, Polish, Other)]
  • 15. Recent Developments
      - Several organizations outside META-NET want to set up their own META-SHARE repositories (e.g., JRC Optima Activity).
      - Incoming resources from CESAR, METANET4U and META-NORD and also from additional initiatives (collaboration agreements with >45 EU-funded projects).
      - ELRA will adopt META-SHARE as the central model for LR sharing and will provide services in its operation.
      - META-SHARE Version 3.0 to be released in September 2012.
  • 16. META-VISION: Language White Paper Series
  • 17. Language White Papers
      - Key communication instruments to address decision makers and journalists.
      - Cover all EU languages (30 volumes).
      - Report on the state of a language (general, social, strategic and technological aspects) and the level of support through language technology.
      - Inform target group about societal and technological problems and challenges as well as economic opportunities.
      - Published by Springer.
      - PDF versions available for free.
  • 19. A Few Numbers
      - >160 national experts contributed as authors or co-authors (ca. 5 per language on average).
      - >50 additional experts have contributed data and information.
      - >8,000 copies will be printed and distributed by META-NET.
  • 20. Cross-Lingual Ranking
      - Journalists and politicians need simple and clear messages.
      - For four application areas, each language is assigned to one of five clusters, ranging from excellent LT support to weak/no support: (1) Machine Translation, (2) Speech Processing, (3) Text Analysis, (4) Resources.
      - Finalised at a meeting with representatives of all 30 languages (Oct. 21/22, 2011).
  • 21. MT (top) & Speech Processing (bottom)
      Machine Translation
      - excellent: (none)
      - good: English
      - moderate: French, Spanish
      - fragmentary: Catalan, Dutch, German, Hungarian, Italian, Polish, Romanian
      - weak or no support: Basque, Bulgarian, Croatian, Czech, Danish, Estonian, Finnish, Galician, Greek, Icelandic, Irish, Latvian, Lithuanian, Maltese, Norwegian, Portuguese, Serbian, Slovak, Slovene, Swedish
      Speech Processing
      - excellent: (none)
      - good: English
      - moderate: Czech, Dutch, Finnish, French, German, Italian, Portuguese, Spanish
      - fragmentary: Basque, Bulgarian, Catalan, Danish, Estonian, Galician, Greek, Hungarian, Irish, Norwegian, Polish, Serbian, Slovak, Slovene, Swedish
      - weak or no support: Croatian, Icelandic, Latvian, Lithuanian, Maltese, Romanian
  • 22. Text Analysis (top) & Resources (bottom)
      Text Analysis
      - excellent: (none)
      - good: English
      - moderate: Dutch, French, German, Italian, Spanish
      - fragmentary: Basque, Bulgarian, Catalan, Czech, Danish, Finnish, Galician, Greek, Hungarian, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovene, Swedish
      - weak or no support: Croatian, Estonian, Icelandic, Irish, Latvian, Lithuanian, Maltese, Serbian
      Resources
      - excellent: (none)
      - good: English
      - moderate: Czech, Dutch, French, German, Hungarian, Italian, Polish, Spanish, Swedish
      - fragmentary: Basque, Bulgarian, Catalan, Croatian, Danish, Estonian, Finnish, Galician, Greek, Norwegian, Portuguese, Romanian, Serbian, Slovak, Slovene
      - weak or no support: Icelandic, Irish, Latvian, Lithuanian, Maltese
  • 23. Europe’s Languages and LT [Chart: languages arranged from good support through Language Technology to weak or no support: English; Dutch, French, German, Italian, Spanish; Catalan, Czech, Finnish, Hungarian, Polish, Portuguese, Swedish; Basque, Bulgarian, Danish, Galician, Greek, Norwegian, Romanian, Slovak, Slovene; Croatian, Estonian, Icelandic, Irish, Latvian, Lithuanian, Maltese, Serbian]
  • 24. Key Results
      - When it comes to Language Technology support, there are massive differences between Europe’s languages and technology areas.
      - Language Technology support for English is ahead of any other language, but far from perfect.
      - For 16 of 30 languages, Language Technology support is only fragmentary, very weak or non-existent!
      - We now have a list of gaps and needs for all 30 languages which need to be addressed in the years to come.
  • 25. Recent Developments
      - On June 20 we launched a new version of the Language White Paper website: http://www.meta-net.eu/whitepapers.
      - Ca. 20 of the 30 white papers online (PDF).
      - First batch of printed copies (published by Springer) will be available soon.
      - New volumes upcoming (such as “European Sign Languages in the Digital Age”).
      - Dissemination of printed volumes to politicians and journalists to start in July.
  • 27. Shared Vision and SRA
      - Mobilize researchers, users and providers of LT for cooperation and collaboration to initiate and support a large-scale joint action by:
        - building a community around Language Technology in Europe (META),
        - creating a shared vision,
        - preparing a Strategic Research Agenda for Multilingual Europe 2020, to be presented to national/international politicians, funding agencies, decision makers.
      [Diagram labels: Appropriate Actors, Appropriate Programme, Commercialisation & Research, Vision & Agenda, Appropriate Support, Funding]
  • 28. From Visions to the SRA
      - Three Vision Groups brought together researchers, developers, integrators and (corporate or professional) users of LT-based products, services and applications (ca. 25 members each).
      - Collected domain-specific visions and prepared individual reports.
      - Vision Group Translation and Localisation: July 23, 2010, Berlin, Germany; September 28, 2010, Brussels, Belgium; April 7/8, 2011, Prague, Czech Republic.
      - Vision Group Media and Information Services: September 10, 2010, Paris, France; October 15, 2010, Barcelona, Spain; April 1, 2011, Vienna, Austria.
      - Vision Group Interactive Systems: September 10, 2010, Paris, France; October 5, 2010, Prague, Czech Republic; March 28, 2011, Utrecht, The Netherlands.
  • 29. Steps Taken Towards the SRA
      - Many suggestions by: Vision Group members; META-NET; META.
      - Additional input in meetings, workshops, discussions etc.
      - Language White Papers: research gaps; funding situation etc.
      - We screened the Strategic Research Agendas of other initiatives.
      - We discussed procedures, input, structure and content of the SRA in four meetings of the META Technology Council:
        - Brussels, Belgium, November 16, 2010
        - Venice, Italy, May 25, 2011
        - Berlin, Germany, September 30, 2011
        - Brussels, Belgium, June 19, 2012
  • 30. From Visions to the SRA [Timeline diagram, 2010 to 2012: expert meeting minutes; reports of the Vision Groups Translation and Localisation, Media and Information Services, and Interactive Systems; Vision Paper; Priority Themes Paper; Strategic Research Agenda]
  • 31. Priority Research Themes
      - LT 2020: Vision and Priority Themes for Language Technology Research in Europe until the Year 2020. Towards the META-NET Strategic Research Agenda.
      - Do you have comments, ideas or suggestions with regard to the content of this document? Please send them to office@meta-net.eu or discuss them online: http://www.meta-net.eu/sra.
      - The development of this paper has been funded by the Seventh Framework Programme and the ICT Policy Support Programme of the European Commission under contracts T4ME (Grant Agreement 249119), CESAR (Grant Agreement 271022), METANET4U (Grant Agreement 270893) and META-NORD (Grant Agreement 270899).
  • 32. META
      - META-NET is a network of excellence.
      - META is an open and growing strategic technology alliance: Multilingual Europe Technology Alliance.
        - 630+ members, including W3C, Google, Microsoft, GALA, research centres, LT companies, many EC-funded projects etc.
        - Main goal: to support our Strategic Research Agenda.
        - Join us! http://www.meta-net.eu/join
  • 33. Next Steps
      - Solid first version of the SRA to be available online on July 2. We need as much feedback as possible, so check our website!
      - Disseminate the Language White Papers widely.
      - Make META-SHARE sustainable.
      - Disseminate the SRA widely: meet with national research planners, funders and policy makers and inform them about the strategy.
      - Roll out META-SHARE for the whole LT community in Sep. 2012.
      - Upcoming opportunities (Horizon 2020 and the Connecting Europe Facility, CEF) can provide sufficient resources to make our visions for Europe’s citizens and economy, as described in the SRA, a reality.
  • 34. Q/A. Thank you very much! office@meta-net.eu, http://www.meta-net.eu, http://www.facebook.com/META.Alliance