Language Technology for Multilingual Europe


Georg Rehm. Language Technology for Multilingual Europe. Invited talk, EFNIL: 10th Annual Conference of the European Federation of National Institutions for Language, Budapest, Hungary, October 25, 2012.

Published in: Technology, Education

  1. Language Technology for Multilingual Europe
     Georg Rehm, Network Manager, META-NET, German Research Center for Artificial Intelligence (DFKI), Germany
     EFNIL 10th Annual Conference, Budapest, Hungary, October 25, 2012
     Co-funded by the 7th Framework Programme and the ICT Policy Support Programme of the European Commission through the contracts T4ME, CESAR, METANET4U, META-NORD (grant agreements no. 249119, 271022, 270893, 270899).
  2. Multilingual Europe
     • Challenge: to provide each language community with the most advanced technologies for communication and information so that maintaining their mother tongue does not turn into a disadvantage.
     • All stakeholders (researchers, LT user and provider industries, language communities, funding programmes, policy makers) should team up for a major dedicated push.
     • META-NET is a network of excellence dedicated to fostering the technological foundations of the European multilingual information society.
     • META-NET in October 2012: 60 members in 34 countries.
     • Supported through a total of four EU-funded projects.
  3. META-SHARE
     • Open exchange infrastructure for language resources and tools.
     • Language resources and tools are documented, uploaded, stored in repositories, catalogued, and can be downloaded, shared and discussed.
     • Improves their visibility, documentation, availability, preservation and interoperability.
     • Goal: boost research, technology and innovation through pooling, openness and sharing of resources.
     • 1,300+ language resources (LRs) available in 13 repositories.
  4. Language White Papers
     • Reports on the state of our languages in the digital age and the level of support through language technology.
     • Series covers 30 languages.
     • Communication instruments to address decision makers and journalists.
     • More than 2 years in the making.
     • More than 200 national experts as contributors.
     • More than 8,000 copies printed and distributed to politicians and journalists.
  5. Level of support through language technology, by area
     • Machine Translation (MT)
       - Excellent: (none)
       - Good: English
       - Moderate: French, Spanish
       - Fragmentary: Catalan, Dutch, German, Hungarian, Italian, Polish, Romanian
       - Weak or no support: Basque, Bulgarian, Croatian, Czech, Danish, Estonian, Finnish, Galician, Greek, Icelandic, Irish, Latvian, Lithuanian, Maltese, Norwegian, Portuguese, Serbian, Slovak, Slovene, Swedish
     • Text Analysis
       - Excellent: (none)
       - Good: English
       - Moderate: Dutch, French, German, Italian, Spanish
       - Fragmentary: Basque, Bulgarian, Catalan, Czech, Danish, Finnish, Galician, Greek, Hungarian, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovene, Swedish
       - Weak or no support: Croatian, Estonian, Icelandic, Irish, Latvian, Lithuanian, Maltese, Serbian
     • Speech
       - Excellent: (none)
       - Good: English
       - Moderate: Czech, Dutch, Finnish, French, German, Italian, Portuguese, Spanish
       - Fragmentary: Basque, Bulgarian, Catalan, Danish, Estonian, Galician, Greek, Hungarian, Irish, Norwegian, Polish, Serbian, Slovak, Slovene, Swedish
       - Weak or no support: Croatian, Icelandic, Latvian, Lithuanian, Maltese, Romanian
     • Resources
       - Excellent: (none)
       - Good: English
       - Moderate: Czech, Dutch, French, German, Hungarian, Italian, Polish, Spanish, Swedish
       - Fragmentary: Basque, Bulgarian, Catalan, Croatian, Danish, Estonian, Finnish, Galician, Greek, Norwegian, Portuguese, Romanian, Serbian, Slovak, Slovene
       - Weak or no support: Icelandic, Irish, Latvian, Lithuanian, Maltese
  6. White Paper Press Campaign
     • Headline of press release: "At Least 21 European Languages in Danger of Digital Extinction. Good News and Bad News on the European Day of Languages."
     • Sent out to journalists, politicians and other stakeholder groups before the European Day of Languages (September 26).
     • Overwhelmed by the huge interest in the topic and our key findings!
     • 470+ mentions in the online and traditional press.
     • 40+ interviews with META-NET representatives (television, radio).
     • News came in from 41 countries in 35 different languages.
  7. Response: Examples
     • Austria: Der Standard.
     • Denmark: Politiken, Berlingske Tidende.
     • Finland: Tiede.
     • Germany: Heise Newsticker, Süddeutsche Zeitung.
     • Greece: Πρώτο Θέμα, Prosilipsis.
     • Iceland: Fréttablaðið, Morgunblaðið.
     • Italy: Wired.
     • Norway: Computerworld.
     • Slovenia: Delo, Dnevnik, Demokracija.
     • Spain: El Mundo.
     • UK: Huffington Post.
     • USA: Mashable, NBC News, Reddit.
  8. Website: Visitors Overview
     [Chart of website visits over time, annotated: "began sending out press release", "European Day of Languages", "unusually high traffic".]
  9. Website: Visitors' Cities
     City with the most visits: Brussels!
  10. Strategic Research Agenda
     • META-NET Strategic Research Agenda (SRA) for Multilingual Europe 2020.
     • Three priority research themes and application/innovation scenarios.
     • Can put Europe ahead of its competitors in this technology area.
     • Addresses the problems we found when preparing the white papers.
     • 180+ contributors (research, industry).
     • Final version to be ready in November 2012.
     • SRA will be presented to the EC and national bodies.
  11. Strategic Research Agenda
  12. Recent News and Next Steps
     • META-SHARE Version 3.0 released in September 2012.
     • Incoming resources from many EU-funded projects (40+ CAs).
     • Launch event in January 2013.
     • META-TRUST AISBL, an international non-profit organisation founded on September 12, 2012, is the legal person of META-NET.
     • Strategic Research Agenda press campaign, with a focus on social media (videos, infographics etc.).
     • Meet with national research planners, funders and policy makers and inform them about the Strategic Research Agenda.
     • Data Liberation campaign.
  13. Data Liberation Campaign
     • Many valuable resources, among others at the National Institutions for Language, are not available for research.
     • We know that you'd like to free up the data sets that you've worked on so hard and to make them available for research purposes.
     • Many reasons:
       - Situation in Europe is bad (White Paper Series; press campaign impact).
       - Google, Facebook and others are overtaking us left, right and centre.
       - Two options: either declare defeat or do something about it.
     • We want to help you liberate your data sets.
     • Next week you'll receive a letter from us with further information.
  14. Thank you very much!