Georg Rehm. AI and Conference Interpretation - From Smart Assistants for the Human Interpreter to Automatic Solutions. DG Interpretation Lunchtime Session on Digital Transformation. European Commission, Brussels, November 2018. November 12, 2018. Invited talk.
AI for Translation Technologies and Multilingual EuropeGeorg Rehm
Georg Rehm. AI for Translation Technologies and Multilingual Europe. DG TRAD Conference - Translation Services in the Digital World: A Sneak Peek into the (near) Future. Luxembourg. October 16/17, 2017.
Towards a Human Language Project for Multilingual Europe: AI and InterpretationGeorg Rehm
Georg Rehm. Towards a Human Language Project for Multilingual Europe: AI and Interpretation. DG Interpretation Conference - Interpretation: Sharing Knowledge & Fostering Communities. European Commission, Brussels, April 2018. April 19/20, 2018. Invited talk.
Language Technologies for Multilingual Europe - Towards a Human Language Proj...Georg Rehm
META-NET has received funding from the EU for several projects related to language technologies, most recently the CRACKER project. The document outlines the history and development of META-NET's Strategic Research and Innovation Agenda (SRIA), including versions 0.5, 0.9, and the current version 1.0 beta, which endorses the establishment of a Human Language Project to help overcome language barriers in Europe. A recent survey of over 600 language technology experts found strong support for a large-scale Human Language Project to achieve deep natural language understanding by 2030.
Apertium: a unique free/open-source MT system for related languages [but not ...Gema Ramirez-Sanchez
The document provides an overview of Apertium, an open-source machine translation system. It describes Apertium's main components, including its engine, data in XML formats, and tools. It also discusses Apertium's ready-to-use products, licensing as free/open-source, active community of hundreds of developers, research uses, over 40 supported language pairs including smaller languages, and some success stories in localization.
Human Language Technologies in a Multilingual EuropeGeorg Rehm
The document summarizes a presentation on human language technologies in a multilingual Europe. Some key points:
- There are 24 official EU languages and many regional/minority languages that have equal status but most are under-supported by language technologies and face digital extinction.
- The META-NET alliance coordinates language technology research across Europe but the field remains fragmented. There is a need for high-quality, deployable language technologies to support applications like translation, conversational interfaces, and a multilingual digital single market.
- A proposed "Multilingual Value Programme" would help enable the multilingual digital single market through technologies for translating, analyzing, processing and curating natural language content.
- A long-term
Language Technologies for Big Data – A Strategic Agenda for the Multilingual ...Georg Rehm
Georg Rehm. Language Technologies for Big Data – A Strategic Agenda for the Multilingual Digital Single Market. BDVA Summit (Big Data Value Association), Valencia, Spain, December 2016. December 1, 2016.
AI for Translation Technologies and Multilingual EuropeGeorg Rehm
Georg Rehm. AI for Translation Technologies and Multilingual Europe. DG TRAD Conference - Translation Services in the Digital World: A Sneak Peek into the (near) Future. Luxembourg. October 16/17, 2017.
Towards a Human Language Project for Multilingual Europe: AI and InterpretationGeorg Rehm
Georg Rehm. Towards a Human Language Project for Multilingual Europe: AI and Interpretation. DG Interpretation Conference - Interpretation: Sharing Knowledge & Fostering Communities. European Commission, Brussels, April 2018. April 19/20, 2018. Invited talk.
Language Technologies for Multilingual Europe - Towards a Human Language Proj...Georg Rehm
META-NET has received funding from the EU for several projects related to language technologies, most recently the CRACKER project. The document outlines the history and development of META-NET's Strategic Research and Innovation Agenda (SRIA), including versions 0.5, 0.9, and the current version 1.0 beta, which endorses the establishment of a Human Language Project to help overcome language barriers in Europe. A recent survey of over 600 language technology experts found strong support for a large-scale Human Language Project to achieve deep natural language understanding by 2030.
Apertium: a unique free/open-source MT system for related languages [but not ...Gema Ramirez-Sanchez
The document provides an overview of Apertium, an open-source machine translation system. It describes Apertium's main components, including its engine, data in XML formats, and tools. It also discusses Apertium's ready-to-use products, licensing as free/open-source, active community of hundreds of developers, research uses, over 40 supported language pairs including smaller languages, and some success stories in localization.
Human Language Technologies in a Multilingual EuropeGeorg Rehm
The document summarizes a presentation on human language technologies in a multilingual Europe. Some key points:
- There are 24 official EU languages and many regional/minority languages that have equal status but most are under-supported by language technologies and face digital extinction.
- The META-NET alliance coordinates language technology research across Europe but the field remains fragmented. There is a need for high-quality, deployable language technologies to support applications like translation, conversational interfaces, and a multilingual digital single market.
- A proposed "Multilingual Value Programme" would help enable the multilingual digital single market through technologies for translating, analyzing, processing and curating natural language content.
- A long-term
Language Technologies for Big Data – A Strategic Agenda for the Multilingual ...Georg Rehm
Georg Rehm. Language Technologies for Big Data – A Strategic Agenda for the Multilingual Digital Single Market. BDVA Summit (Big Data Value Association), Valencia, Spain, December 2016. December 1, 2016.
Multilingual Europe in late 2016 – A Strategic Research and Innovation Agenda...Georg Rehm
Georg Rehm. Multilingual Europe in late 2016 – A Strategic Research and Innovation Agenda for the Multilingual Digital Single Market. Future and Emerging Trends in Language Technologies, Machine Learning and Big Data (FETLT 2016), Seville, Spain, November 2016. November 30, 2016.
Language Technology for Multilingual EuropeGeorg Rehm
Georg Rehm. Language Technology for Multilingual Europe. EFNIL - 10th Annual Conference of the European Federation of National Institutions for Language, Budapest, Hungary, October 2012. October 25, 2012. Invited talk.
This document discusses the relationship between language, culture, and software. It covers several key points:
1) Language and culture are closely intertwined, with language representing the dominant sign of any given culture. Different languages reflect different worldviews.
2) Cultural variables like date formats, number formatting, and colors must be taken into account for software to be successfully localized for other cultures. Internationalization involves developing a cultural model and using international variables.
3) Localization requires adapting aspects of software like graphics, text directionality, and shortcuts to be appropriate for a given language and culture. This helps ensure software usability and understanding across cultures.
Apertium is a free/open-source platform for rule-based machine translation that supports over 40 language pairs. It has a modular pipeline architecture and is collaboratively developed by hundreds of researchers and developers. Apertium can be used for translation between related languages as well as in applications like gisting. It also produces monolingual language resources during rule development.
META-NET and META-SHARE: Language Technology for EuropeGeorg Rehm
Georg Rehm. META-NET and META-SHARE: Language Technology for Europe. The Second Workshop on Creation, Harmonization and Application of Terminology Resources (CHAT 2012), Madrid, Spain, June 2012. June 22, 2012. Invited keynote talk.
The document discusses machine translation (MT) applications in the public sector of the EU. It outlines various types of government-to-stakeholder communication that can benefit from MT, such as government-to-citizens, government-to-businesses, and government-to-governments. It also summarizes case studies on using MT to support multilingualism in the public sectors of Latvia and Lithuania. The document advocates for building an EU-wide public MT infrastructure to fulfill the promises of e-government and language diversity.
European Language Technologies – Past, Present and FutureGeorg Rehm
Georg Rehm. European Language Technologies – Past, Present and Future. Language Equality in the Digital Age. Conference on language technologies and digital equality in a multilingual Europe, European Parliament, Brussels, Belgium, September 2018. September 27, 2018. Invited talk
Cracking the Language Barrier for a Multilingual EuropeGeorg Rehm
Georg Rehm. Cracking the Language Barrier for a Multilingual Europe. EFNIL – 13th Annual Conference of the European Federation of National Institutions for Language, Helsinki, Finland, October 2015. October 08, 2015. Invited talk.
Computational Morphology and the META-NET Strategic Research Agenda for Multi...Georg Rehm
Georg Rehm. Computational Morphology and the META-NET Strategic Research Agenda for Multilingual Europe 2020. 3rd. International Workshop on Systems and Frameworks for Computational Morphology (SFCM 2013), Berlin, Germany, September 2013. September 06, 2013. Invited keynote talk.
Georg Rehm. Mehrsprachigkeit für das Digitale Europa. Ringvorlesung Digitale Lebenswelten, University of Hildesheim, Germany, November 2016. November 15, 2016.
META-NET: Language Technology for EuropeGeorg Rehm
Georg Rehm. META-NET: Language Technology for Europe. Development of Human Language Technologies and Resources in Slovakia and in the world (10 Years of the Slovak National Corpus), Bratislava, Slovakia, June 2012. June 7, 2012. Invited talk.
A Strategic Research and Innovation Agenda for the Multilingual Digital Singl...Georg Rehm
The document discusses META-NET, a network that received EU funding to research overcoming language barriers in Europe. It summarizes that META-NET published the first strategic research agenda for language technology in 2013 and is now drafting an updated version. The presentation also outlines META-NET's work developing strategies and roadmaps to achieve a truly multilingual European Digital Single Market.
The META-NET Language White Paper SeriesGeorg Rehm
Georg Rehm. The META-NET Language White Paper Series. EFNIL - 9th Annual Conference of the European Federation of National Institutions for Language, London, UK, October 2011. October 26, 2011. Invited talk.
META-NET: Towards a Strategic Research Agenda for Multilingual EuropeGeorg Rehm
Georg Rehm. META-NET: Towards a Strategic Research Agenda for Multilingual Europe. Multilingual Web Workshop, Limerick, Ireland, September 2011. September 21, 2011. Talk.
The META-NET Strategic Research Agenda for Multilingual Europe 2020Georg Rehm
Georg Rehm. The META-NET Strategic Research Agenda for Multilingual Europe 2020. The Hungarian Language in the Digital Age - Human Language Technology Day, Budapest, Hungary, January 2013. January 18, 2013. Invited talk.
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...Europeana
Here are a few approaches to address the context demand challenge for machine translation of cultural heritage content:
- Leverage knowledge graphs and ontologies to disambiguate terms based on conceptual relationships
- Train domain-specific models on large cultural heritage corpora to capture nuances of language use in different contexts
- Perform multi-task learning to optimize models for both translation accuracy and conceptual mapping between languages
- Allow users to provide feedback to iteratively improve disambiguation of ambiguous terms over time
- Develop specialized interfaces that surface contextual clues from objects to help machine translation
The goal is to mimic how humans understand intended meaning based on surrounding context clues. Combining linguistic and conceptual techniques can help machines do the same.
The music-loving Baltic countries are a multilingual hotspot in Europe, with the majority of citizens speaking (and singing) three languages on a daily basis. At the same time, the melodious Baltic languages are famously complex and morphologically rich, containing lots of ambiguity and intricate word agreements. Taken together, these factors make the region a prime spot for driving innovation in language technologies. Tilde, a language technology company specializing in custom MT and terminology services, has leveraged its extensive linguistic experience in the Baltic region to create custom MT systems for a wide variety of languages and domains, helping EU and global companies to boost translation productivity and make their applications multilingual. Tilde recently embarked on the challenging task of building a large-scale MT service for the Latvian government, Hugo.lv. This service was adapted to create a communication tool for the 2015 EU Presidency. The presentation will introduce the audience to languages and MT in the Baltic region and highlight these two case studies, which showcased the crucial role of language technology in enabling multilingual communication in the digital age.
Protecting Minority Languages from Digital ExtinctionTeresa Lynn
This document discusses protecting minority languages from digital extinction, using Irish as a case study. It provides background on the Irish language and its decline. It then summarizes the current status of Irish language technology, finding it lacks support in areas like machine translation, speech recognition and text analysis. The role of social media in supporting Irish is discussed. Government initiatives to develop Irish language technology and share resources are outlined as ways to ensure the language's continued usage in a digital world.
The document discusses the potential for machine translation (MT) to help address the language diversity challenges of e-government in Europe. It provides background on the multilingual nature of the EU and goals of e-government, before examining how MT could bridge language barriers and allow governments to serve citizens in their own languages. The talk uses Latvia as a case study and addresses the data challenges in developing high-quality MT for smaller European languages.
ELSE IF 2019: What’s next for Multilingual Europe?PretaLLOD
The document introduces the European Language Grid (ELG), a new language technology platform for Europe. It summarizes the history and context around the need for such a platform, including the fragmentation of the language technology landscape in Europe. The ELG is then introduced as a new EU-funded project running from 2019-2021 that aims to address this need by creating an open infrastructure for language data, tools, services and technologies.
Multilingual Europe in late 2016 – A Strategic Research and Innovation Agenda...Georg Rehm
Georg Rehm. Multilingual Europe in late 2016 – A Strategic Research and Innovation Agenda for the Multilingual Digital Single Market. Future and Emerging Trends in Language Technologies, Machine Learning and Big Data (FETLT 2016), Seville, Spain, November 2016. November 30, 2016.
Language Technology for Multilingual EuropeGeorg Rehm
Georg Rehm. Language Technology for Multilingual Europe. EFNIL - 10th Annual Conference of the European Federation of National Institutions for Language, Budapest, Hungary, October 2012. October 25, 2012. Invited talk.
This document discusses the relationship between language, culture, and software. It covers several key points:
1) Language and culture are closely intertwined, with language representing the dominant sign of any given culture. Different languages reflect different worldviews.
2) Cultural variables like date formats, number formatting, and colors must be taken into account for software to be successfully localized for other cultures. Internationalization involves developing a cultural model and using international variables.
3) Localization requires adapting aspects of software like graphics, text directionality, and shortcuts to be appropriate for a given language and culture. This helps ensure software usability and understanding across cultures.
Apertium is a free/open-source platform for rule-based machine translation that supports over 40 language pairs. It has a modular pipeline architecture and is collaboratively developed by hundreds of researchers and developers. Apertium can be used for translation between related languages as well as in applications like gisting. It also produces monolingual language resources during rule development.
META-NET and META-SHARE: Language Technology for EuropeGeorg Rehm
Georg Rehm. META-NET and META-SHARE: Language Technology for Europe. The Second Workshop on Creation, Harmonization and Application of Terminology Resources (CHAT 2012), Madrid, Spain, June 2012. June 22, 2012. Invited keynote talk.
The document discusses machine translation (MT) applications in the public sector of the EU. It outlines various types of government-to-stakeholder communication that can benefit from MT, such as government-to-citizens, government-to-businesses, and government-to-governments. It also summarizes case studies on using MT to support multilingualism in the public sectors of Latvia and Lithuania. The document advocates for building an EU-wide public MT infrastructure to fulfill the promises of e-government and language diversity.
European Language Technologies – Past, Present and FutureGeorg Rehm
Georg Rehm. European Language Technologies – Past, Present and Future. Language Equality in the Digital Age. Conference on language technologies and digital equality in a multilingual Europe, European Parliament, Brussels, Belgium, September 2018. September 27, 2018. Invited talk
Cracking the Language Barrier for a Multilingual EuropeGeorg Rehm
Georg Rehm. Cracking the Language Barrier for a Multilingual Europe. EFNIL – 13th Annual Conference of the European Federation of National Institutions for Language, Helsinki, Finland, October 2015. October 08, 2015. Invited talk.
Computational Morphology and the META-NET Strategic Research Agenda for Multi...Georg Rehm
Georg Rehm. Computational Morphology and the META-NET Strategic Research Agenda for Multilingual Europe 2020. 3rd. International Workshop on Systems and Frameworks for Computational Morphology (SFCM 2013), Berlin, Germany, September 2013. September 06, 2013. Invited keynote talk.
Georg Rehm. Mehrsprachigkeit für das Digitale Europa. Ringvorlesung Digitale Lebenswelten, University of Hildesheim, Germany, November 2016. November 15, 2016.
META-NET: Language Technology for EuropeGeorg Rehm
Georg Rehm. META-NET: Language Technology for Europe. Development of Human Language Technologies and Resources in Slovakia and in the world (10 Years of the Slovak National Corpus), Bratislava, Slovakia, June 2012. June 7, 2012. Invited talk.
A Strategic Research and Innovation Agenda for the Multilingual Digital Singl...Georg Rehm
The document discusses META-NET, a network that received EU funding to research overcoming language barriers in Europe. It summarizes that META-NET published the first strategic research agenda for language technology in 2013 and is now drafting an updated version. The presentation also outlines META-NET's work developing strategies and roadmaps to achieve a truly multilingual European Digital Single Market.
The META-NET Language White Paper SeriesGeorg Rehm
Georg Rehm. The META-NET Language White Paper Series. EFNIL - 9th Annual Conference of the European Federation of National Institutions for Language, London, UK, October 2011. October 26, 2011. Invited talk.
META-NET: Towards a Strategic Research Agenda for Multilingual EuropeGeorg Rehm
Georg Rehm. META-NET: Towards a Strategic Research Agenda for Multilingual Europe. Multilingual Web Workshop, Limerick, Ireland, September 2011. September 21, 2011. Talk.
The META-NET Strategic Research Agenda for Multilingual Europe 2020Georg Rehm
Georg Rehm. The META-NET Strategic Research Agenda for Multilingual Europe 2020. The Hungarian Language in the Digital Age - Human Language Technology Day, Budapest, Hungary, January 2013. January 18, 2013. Invited talk.
Europeana meeting under Finland’s Presidency of the Council of the EU - Day 2...Europeana
Here are a few approaches to address the context demand challenge for machine translation of cultural heritage content:
- Leverage knowledge graphs and ontologies to disambiguate terms based on conceptual relationships
- Train domain-specific models on large cultural heritage corpora to capture nuances of language use in different contexts
- Perform multi-task learning to optimize models for both translation accuracy and conceptual mapping between languages
- Allow users to provide feedback to iteratively improve disambiguation of ambiguous terms over time
- Develop specialized interfaces that surface contextual clues from objects to help machine translation
The goal is to mimic how humans understand intended meaning based on surrounding context clues. Combining linguistic and conceptual techniques can help machines do the same.
The music-loving Baltic countries are a multilingual hotspot in Europe, with the majority of citizens speaking (and singing) three languages on a daily basis. At the same time, the melodious Baltic languages are famously complex and morphologically rich, containing lots of ambiguity and intricate word agreements. Taken together, these factors make the region a prime spot for driving innovation in language technologies. Tilde, a language technology company specializing in custom MT and terminology services, has leveraged its extensive linguistic experience in the Baltic region to create custom MT systems for a wide variety of languages and domains, helping EU and global companies to boost translation productivity and make their applications multilingual. Tilde recently embarked on the challenging task of building a large-scale MT service for the Latvian government, Hugo.lv. This service was adapted to create a communication tool for the 2015 EU Presidency. The presentation will introduce the audience to languages and MT in the Baltic region and highlight these two case studies, which showcased the crucial role of language technology in enabling multilingual communication in the digital age.
Protecting Minority Languages from Digital ExtinctionTeresa Lynn
This document discusses protecting minority languages from digital extinction, using Irish as a case study. It provides background on the Irish language and its decline. It then summarizes the current status of Irish language technology, finding it lacks support in areas like machine translation, speech recognition and text analysis. The role of social media in supporting Irish is discussed. Government initiatives to develop Irish language technology and share resources are outlined as ways to ensure the language's continued usage in a digital world.
The document discusses the potential for machine translation (MT) to help address the language diversity challenges of e-government in Europe. It provides background on the multilingual nature of the EU and goals of e-government, before examining how MT could bridge language barriers and allow governments to serve citizens in their own languages. The talk uses Latvia as a case study and addresses the data challenges in developing high-quality MT for smaller European languages.
ELSE IF 2019: What’s next for Multilingual Europe?PretaLLOD
The document introduces the European Language Grid (ELG), a new language technology platform for Europe. It summarizes the history and context around the need for such a platform, including the fragmentation of the language technology landscape in Europe. The ELG is then introduced as a new EU-funded project running from 2019-2021 that aims to address this need by creating an open infrastructure for language data, tools, services and technologies.
Language Resources for Multilingual EuropeGeorg Rehm
META-NET has received funding from the EU to support several language technology projects, including CRACKER, T4ME, CESAR, METANET4U, and META-NORD. It brings together over 60 research centers across 34 countries to build infrastructure for sharing language resources and tools. The goal is to improve the visibility, documentation, identification, availability, and interoperability of language resources in order to support both academic and commercial language technology research and development across Europe.
The Strategic Agenda for the Multilingual Digital Single Market V0.9Georg Rehm
Georg Rehm. The Strategic Agenda for the Multilingual Digital Single Market Version 0.9. META-FORUM 2016, Lisbon, Portugal, July 2016. July 04/05, 2016.
Practical implementation and discussion of the user experience: Governments face challenges to meet the needs of multilingual communities. MT is promising, but can it deliver? Indra will illustrate how SMT is sufficiently mature for a significant role in e-government. She will analyze and present for discussion a pilot implementation for Latvia.
QURATOR: A Flexible AI Platform for the Adaptive Analysis and Creative Genera...Georg Rehm
Georg Rehm. QURATOR: Developing a Flexible AI Platform for Digital Content Curation. QURATOR 2020 – Conference on Digital Curation Technologies., 1 2020. Fraunhofer FOKUS, January 20/21, 2020. Invited keynote talk.
Observations on Annotations – From Computational Linguistics and the World Wi...Georg Rehm
Georg Rehm. Observations on Annotations – From Computational Linguistics and the World Wide Web to Artificial Intelligence and back again. Annotation in Scholarly Editions and Research: Function – Differentiation – Systematization, University of Wuppertal, Germany. February 20-22, 2019. Invited keynote talk.
The Preparation, Impact and Future of the META-NET White Paper Series “Europe...Georg Rehm
Georg Rehm. The Preparation, Impact and Future of the META-NET White Paper Series “Europe’s Languages in the Digital Age”. Sanskrit and Other Indian Languages Technology (SOIL-Tech), Jawaharlal Nehru University, New Delhi, India, February 2019. February 15, 2019. Invited keynote talk.
Künstliche Intelligenz beim Dolmetschen und ÜbersetzenGeorg Rehm
Georg Rehm. Künstliche Intelligenz beim Dolmetschen und Übersetzen. Institut für Angewandte Linguistik und Translatologie, Universität Leipzig, November 2018. November 1, 2018. Invited presentation.
Herausforderungen und Lösungen für die europäische Sprachtechnologie- Forschu...Georg Rehm
Georg Rehm. Herausforderungen und Lösungen für die europäische Sprachtechnologie-Forschung und -Entwicklung. Deutsches Forschungszentrum für Künstliche Intelligenz GmbH, Berlin, Germany, October 2018. October 30, 2018. Presentation on the occasion of being awarded the appointment as a DFKI Research Fellow.
KI, Sprachtechnologie und Digital Humanities: Ein (unvollständiger) ÜberblickGeorg Rehm
Georg Rehm. KI, Sprachtechnologie und Digital Humanities: Ein (unvollständiger) Überblick. Interdisziplinärer Forschungsverbund Digital Humanities in Berlin (ifDHb), 23. Berliner DH-Rundgang im Deutschen Forschungszentrum für Künstliche Intelligenz, Berlin, Germany, February 05, 2018.
Georg Rehm. Kuratieren im Zeitalter der KI. #DKT17 - Kuratieren im Zeitalter der KI, Berlin, Germany, October 2017. October 12, 2017. Invited keynote talk.
Transformieren, Manipulieren, Kuratieren: Technologien für die Wissensarbeit ...Georg Rehm
Georg Rehm. Transformieren, Manipulieren, Kuratieren? Technologien für die Wissensarbeit im Netz. KOOP-LITERA International. Konferenz 2017, Berlin, Germany, June 2017. June 20, 2017. Invited talk.
Digitale Kuratierungstechnologien: Anwendungsfälle in Digitalen BibliothekenGeorg Rehm
Georg Rehm and Clemens Neudecker. Digitale Kuratierungstechnologien: Anwendungsfälle in Digitalen Bibliotheken . Berliner Bibliothekswissenschaftliches Kolloqium (BBK), Humboldt-Universität zu Berlin, Berlin, Germany, June 2017. June 06, 2017. Invited talk.
Georg Rehm. EPUB, quo vadis? ePublishing im W3C. Jahrestagung der IG Digital. Im Rahmen der Buchtage, Jahreskongress des Börsenvereins, Berlin, Germany, June 2017. June 14, 2017. Invited talk.
Digitale Kuratierungstechnologien für verschiedene Branchen und Anwendungssze...Georg Rehm
Georg Rehm. Digitale Kuratierungstechnologien für verschiedene Branchen und Anwendungsszenarien. EVA Berlin 2016 – Elektronische Medien & Kunst, Kultur, Historie, Berlin, Germany, November 2016. November 09, 2016.
Web Annotations – A Game Changer for Language Technology?Georg Rehm
Georg Rehm, Felix Sasaki, and Aljoscha Burchardt. Web Annotations - A Game Changer for Language Technologies? I Annotate 2016, Berlin, Germany, May 2016. May 19/20, 2016.
Georg Rehm. "Globale Standards im Web of Things". Bitkom Akademie Workshop “Die Dinge im Internet-der-Dinge kommen”, Cologne, Germany, December 2015. December 09, 2015.
Digitale Kuratierungstechnologien – Beispiele aus ausgewählten BranchenGeorg Rehm
Felix Sasaki and Georg Rehm. Digitale Kuratierungstechnologien: Beispiele aus ausgewählten Branchen. #DKT15 - Digitale Kuratierungstechnologien, Berlin, Germany, October 2015. October 06, 2015
How to Get CNIC Information System with Paksim Ga.pptxdanishmna97
Pakdata Cf is a groundbreaking system designed to streamline and facilitate access to CNIC information. This innovative platform leverages advanced technology to provide users with efficient and secure access to their CNIC details.
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slackshyamraj55
Discover the seamless integration of RPA (Robotic Process Automation), COMPOSER, and APM with AWS IDP enhanced with Slack notifications. Explore how these technologies converge to streamline workflows, optimize performance, and ensure secure access, all while leveraging the power of AWS IDP and real-time communication via Slack notifications.
Full-RAG: A modern architecture for hyper-personalizationZilliz
Mike Del Balso, CEO & Co-Founder at Tecton, presents "Full RAG," a novel approach to AI recommendation systems, aiming to push beyond the limitations of traditional models through a deep integration of contextual insights and real-time data, leveraging the Retrieval-Augmented Generation architecture. This talk will outline Full RAG's potential to significantly enhance personalization, address engineering challenges such as data management and model training, and introduce data enrichment with reranking as a key solution. Attendees will gain crucial insights into the importance of hyperpersonalization in AI, the capabilities of Full RAG for advanced personalization, and strategies for managing complex data integrations for deploying cutting-edge AI solutions.
GraphRAG for Life Science to increase LLM accuracyTomaz Bratanic
GraphRAG for life science domain, where you retriever information from biomedical knowledge graphs using LLMs to increase the accuracy and performance of generated answers
Removing Uninteresting Bytes in Software FuzzingAftab Hussain
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux libraries -- Libxml's xmllint, a tool for parsing xml documents, and Binutil's readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format). Our preliminary results show that AFL+DIAR does not only discover new paths more quickly but also achieves higher coverage overall. This work thus showcases how starting with lean and optimized seeds can lead to faster, more comprehensive fuzzing campaigns -- and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
HCL Notes and Domino License Cost Reduction in the World of DLAUpanagenda
Webinar Recording: https://www.panagenda.com/webinars/hcl-notes-and-domino-license-cost-reduction-in-the-world-of-dlau/
The introduction of DLAU and the CCB & CCX licensing model caused quite a stir in the HCL community. As a Notes and Domino customer, you may have faced challenges with unexpected user counts and license costs. You probably have questions on how this new licensing approach works and how to benefit from it. Most importantly, you likely have budget constraints and want to save money where possible. Don’t worry, we can help with all of this!
We’ll show you how to fix common misconfigurations that cause higher-than-expected user counts, and how to identify accounts which you can deactivate to save money. There are also frequent patterns that can cause unnecessary cost, like using a person document instead of a mail-in for shared mailboxes. We’ll provide examples and solutions for those as well. And naturally we’ll explain the new licensing model.
Join HCL Ambassador Marc Thomas in this webinar with a special guest appearance from Franz Walder. It will give you the tools and know-how to stay on top of what is going on with Domino licensing. You will be able lower your cost through an optimized configuration and keep it low going forward.
These topics will be covered
- Reducing license cost by finding and fixing misconfigurations and superfluous accounts
- How do CCB and CCX licenses really work?
- Understanding the DLAU tool and how to best utilize it
- Tips for common problem areas, like team mailboxes, functional/test users, etc
- Practical examples and best practices to implement right away
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfMalak Abu Hammad
Discover how MongoDB Atlas and vector search technology can revolutionize your application's search capabilities. This comprehensive presentation covers:
* What is Vector Search?
* Importance and benefits of vector search
* Practical use cases across various industries
* Step-by-step implementation guide
* Live demos with code snippets
* Enhancing LLM capabilities with vector search
* Best practices and optimization strategies
Perfect for developers, AI enthusiasts, and tech leaders. Learn how to leverage MongoDB Atlas to deliver highly relevant, context-aware search results, transforming your data retrieval process. Stay ahead in tech innovation and maximize the potential of your applications.
#MongoDB #VectorSearch #AI #SemanticSearch #TechInnovation #DataScience #LLM #MachineLearning #SearchTechnology
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...SOFTTECHHUB
The choice of an operating system plays a pivotal role in shaping our computing experience. For decades, Microsoft's Windows has dominated the market, offering a familiar and widely adopted platform for personal and professional use. However, as technological advancements continue to push the boundaries of innovation, alternative operating systems have emerged, challenging the status quo and offering users a fresh perspective on computing.
One such alternative that has garnered significant attention and acclaim is Nitrux Linux 3.5.0, a sleek, powerful, and user-friendly Linux distribution that promises to redefine the way we interact with our devices. With its focus on performance, security, and customization, Nitrux Linux presents a compelling case for those seeking to break free from the constraints of proprietary software and embrace the freedom and flexibility of open-source computing.
In the rapidly evolving landscape of technologies, XML continues to play a vital role in structuring, storing, and transporting data across diverse systems. The recent advancements in artificial intelligence (AI) present new methodologies for enhancing XML development workflows, introducing efficiency, automation, and intelligent capabilities. This presentation will outline the scope and perspective of utilizing AI in XML development. The potential benefits and the possible pitfalls will be highlighted, providing a balanced view of the subject.
We will explore the capabilities of AI in understanding XML markup languages and autonomously creating structured XML content. Additionally, we will examine the capacity of AI to enrich plain text with appropriate XML markup. Practical examples and methodological guidelines will be provided to elucidate how AI can be effectively prompted to interpret and generate accurate XML markup.
Further emphasis will be placed on the role of AI in developing XSLT, or schemas such as XSD and Schematron. We will address the techniques and strategies adopted to create prompts for generating code, explaining code, or refactoring the code, and the results achieved.
The discussion will extend to how AI can be used to transform XML content. In particular, the focus will be on the use of AI XPath extension functions in XSLT, Schematron, Schematron Quick Fixes, or for XML content refactoring.
The presentation aims to deliver a comprehensive overview of AI usage in XML development, providing attendees with the necessary knowledge to make informed decisions. Whether you’re at the early stages of adopting AI or considering integrating it in advanced XML development, this presentation will cover all levels of expertise.
By highlighting the potential advantages and challenges of integrating AI with XML development tools and languages, the presentation seeks to inspire thoughtful conversation around the future of XML development. We’ll not only delve into the technical aspects of AI-powered XML development but also discuss practical implications and possible future directions.
Best 20 SEO Techniques To Improve Website Visibility In SERPPixlogix Infotech
Boost your website's visibility with proven SEO techniques! Our latest blog dives into essential strategies to enhance your online presence, increase traffic, and rank higher on search engines. From keyword optimization to quality content creation, learn how to make your site stand out in the crowded digital landscape. Discover actionable tips and expert insights to elevate your SEO game.
Programming Foundation Models with DSPy - Meetup SlidesZilliz
Prompting language models is hard, while programming language models is easy. In this talk, I will discuss the state-of-the-art framework DSPy for programming foundation models with its powerful optimizers and runtime constraint system.
Pushing the limits of ePRTC: 100ns holdover for 100 daysAdtran
At WSTS 2024, Alon Stern explored the topic of parametric holdover and explained how recent research findings can be implemented in real-world PNT networks to achieve 100 nanoseconds of accuracy for up to 100 days.
Maruthi Prithivirajan, Head of ASEAN & IN Solution Architecture, Neo4j
Get an inside look at the latest Neo4j innovations that enable relationship-driven intelligence at scale. Learn more about the newest cloud integrations and product enhancements that make Neo4j an essential choice for developers building apps with interconnected data and generative AI.
Dr. Sean Tan, Head of Data Science, Changi Airport Group
Discover how Changi Airport Group (CAG) leverages graph technologies and generative AI to revolutionize their search capabilities. This session delves into the unique search needs of CAG’s diverse passengers and customers, showcasing how graph data structures enhance the accuracy and relevance of AI-generated search results, mitigating the risk of “hallucinations” and improving the overall customer journey.
AI and Conference Interpretation – From Smart Assistants for the Human Interpreter to Automatic Solutions
1. Georg Rehm
DFKI GmbH
Language Technology Lab – Berlin, Germany
georg.rehm@dfki.de
Artificial Intelligence and
Conference Interpretation
From Smart Assistants for the Human
Interpreter to Automatic Solutions
2. Outline
• Artificial Intelligence
• Language Technology
• Multilingualism in Europe
• AI and Interpretation
• Recommendations
SCIC Lunchtime Session on Digital Transformation – 12 Nov. 2018 2
5. SCIC Lunchtime Session on Digital Transformation – 12 Nov. 2018
Data Intelligence
Current breakthroughs based on Machine Learning (“Deep Learning”)
Also still in use: symbolic, rule-based methods and expert systems
Artificial Intelligence
• Huge data sets + powerful learning algorithms + very fast hardware
• Enormous potential for disruptions in all sectors and areas
• Important: “The AI” doesn’t exist – there’s only single-task systems!
5
6. Language Technology
• Language Technology transfers theoretical results from
language-oriented research into technologies and
applications that are ready for production use
• Uses results from, e.g.:
– Artificial Intelligence
– Computer Science
– Computational Linguistics
– Natural Language Processing
– Psychology, Psycholinguistics
– Cognitive Science
SCIC Lunchtime Session on Digital Transformation – 12 Nov. 2018 6
Example Applications
• Spell checkers
• Dictation systems
• Translation systems
• Search engines
• Report generation
• Expert systems
• Dialogue systems
• Text summarisers
8. • Multilingualism is at the heart of the European idea
• 24 EU languages – all have the same status
• Dozens of regional and minority languages as well as
languages of immigrants and trade partners
• Many economic and social challenges:
– The Digital Single Market needs to be multilingual
– Cross-border, cross-lingual, cross-cultural
communication
• This presentation is about very general application
scenarios and use cases: day-to-day communication,
ecommerce, mobility, health, tourism etc.
9. 60 research centres in 34 countries (founded in 2010)
Chair of Executive Board: Jan Hajic (CUNI)
Dep.: J. van Genabith (DFKI), A. Vasiljevs (Tilde)
General Secretary: Georg Rehm (DFKI)
Multilingual Europe
Technology Alliance.
900+ members in
67 countries
T4ME (META-NET) CESAR METANET4UMETA-NORD
Multilingual Europe Technology AllianceNET
(published in 2013) (31 volumes; published in 2012)
16. META-NET SRA, published in early 2013
• First strategic research agenda of our field
• Complex process for collecting and structuring technology visions
• Approx. 200 researchers have participated
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
D
RAFT
Strategic Agenda for the
Multilingual Digital Single Market
Technologies for Overcoming Language Barriers towards
a truly integrated European Online Market
D
RAFT
Version 0.5 – April 22, 2015
SRIA V0.5 presented at
• Based on strategy papers and roadmaps prepared by several EU
projects including the META-NET SRA (see above)
Strategic Research and Innovation Agenda
Language as a Data Type and
Key Challenge for Big Data
Enabling the Multilingual Digital Single Market
through technologies for translating, analysing, processing
and curating natural language content
SRIA Editorial Team
Version 0.9 – July 2016
SRIA V0.9 presented at
• Prepared, presented and endorsed by the
Cracking the Language Barrier federation
• Explains how the LT community can realise the
Multilingual Digital Single Market
Strategic Research and Innovation Agenda
Language Technologies for
Multilingual Europe
Towards a Human Language Project
SRIA Editorial Team
Version 1.0 beta – November 2017
SRIA V1.0 presented at
• Complements and supports the STOA study
• Crucial recommendation: Kickstart the Human Language Project!
Georg Rehm and Hans Uszkoreit, editors. The META-NET Strategic Research Agenda for Multilingual Europe 2020. Springer, Heidelberg, New York, Dordrecht, London, 2013.
Georg Rehm, editor. Language Technologies for Multilingual Europe: Towards a Human Language Project. Strategic Research and Innovation Agenda. Dec. 2017. Version 1.0.
Unveiled at META-FORUM 2017 in Brussels, Belgium, on Nov. 13/14, 2017. Prepared by the Cracking the Language Barrier federation, supported by CRACKER.
Georg Rehm, editor. Language as a Data Type and Key Challenge for Big Data. Strategic Research and Innovation Agenda for the Multilingual Digital Single Market. CRACKER and
Cracking the Language Barrier federation, July 2016. Version 0.9. 04 July 2016. Supported by CRACKER and LT_Observatory.
Georg Rehm, editor. Strategic Agenda for the Multilingual Digital Single Market – Technologies for Overcoming Language Barriers towards a truly integrated European Online Market.
CRACKER and LT_Observatory, April 2015. Version 0.5. 22 April 2015. Prepared by the EU-funded projects CRACKER and LT_Observatory.
17. STUDY
EPRS | European Parliamentary Research Service
Scientific Foresight Unit (STOA)
PE 581.621
Science and Technology Options Assessment
STOA Workshop
European Parliament
10 January 2017
• STOA study – published in March 2017
• Recommends to the EC to initiate the Human Language Project (HLP)
• Three important research policy recommendations:
– Strengthen research and focus upon the HLP
– Support a European LT platform for data and services
– Bridge the technology gap between Europe‘s languages
18. “Language equality” Resolution
• EP resolution “Language equality in the digital age”
P8_TA(2018)0332 – partially based on the STOA study
• Voting in the EP on 11 September 2018:
592 votes in favour vs. 45 votes against!
• Three of the 45 recommendations:
– 25. Establish a large-scale, long-term
LT funding programme
– 27. Europe has to secure its leadership
in language-centric AI
– 29. Create a European LT platform
for sharing of services
SCIC Lunchtime Session on Digital Transformation – 12 Nov. 2018 18
European Parliament
2014-2019
TEXTS ADOPTED
Provisional edition
P8_TA-PROV(2018)0332
Language equality in the digital age
European Parliament resolution of 11 September 2018 on language equality in the
digital age (2018/2028(INI))
The European Parliament,
– having regard to Articles 2 and 3(3) of the Treaty on the Functioning of the European
Union (TFEU),
– having regard to Articles 21(1) and 22 of the Charter of Fundamental Rights of the
European Union,
– having regard to the 2003 UNESCO Convention for the Safeguarding of the Intangible
Cultural Heritage,
– having regard to Directive 2003/98/EC of the European Parliament and of the Council of
17 November 2003 on the re-use of public sector information1
,
– having regard to Directive 2013/37/EU of the European Parliament and of the Council of
26 June 2013 amending Directive 2003/98/EC on the re-use of public sector information2
,
– having regard to Decision (EU) 2015/2240 of the European Parliament and of the Council
of 25 November 2015 establishing a programme on interoperability solutions and
common frameworks for European public administrations, businesses and citizens (ISA2
programme) as a means for modernising the public sector3
,
– having regard to the Council resolution of 21 November 2008 on a European strategy for
multilingualism (2008/C 320/01)4
,
– having regard to the Council decision of 3 December 2013 establishing the specific
programme implementing Horizon 2020 – the Framework Programme for Research and
1
OJ L 345, 31.12.2003, p. 90.
2
OJ L 175, 27.6.2013, p. 1.
3
OJ L 318, 4.12.2015, p. 1.
4
OJ C 320, 16.12.2008, p. 1.
20. SCIC Lunchtime Session on Digital Transformation – 12 Nov. 2018 20
ELGELG – The Primary Platform for
Language Technology in Europe
• Development of a functional language technology cloud platform for Europe
• Market place for European LT business space (directory of stakeholders)
• Hundreds of LT services and resources – easy-to-use and easy-to-integrate
• Evaluation through 15-20 pilot projects feeding back into the platform
• 30+ national competence centres for a strong European network
• Services can be made available by the community
• Boosting the emerging Multilingual Digital Single Market
• Towards a thriving and flourishing European LT community
Consortium
• DFKI GmbH (Coordinator) (DE)
• ILSP, R.C. “Athena“ (GR)
• University of Sheffield (UK)
• Charles University (CZ)
• ELDA (FR)
• Tilde (LV)
• SAIL LABS GmbH (AT)
• Expert System Iberia (ES)
• University of Edinburgh (UK)
1 Jan. 2019 – 31 Dec. 2021
• ICT-29-2018: Multilingual Next Generation Internet
Ø Two sub topics, budget 25M€
• ICT-29 a) European Language Grid
Ø One Innovation Action, 7M€
• ICT-29 b) Domain-specific/challenge-oriented HLT
Ø Six Research and Innovation Actions, je 3M€
Web Interface APIs
European Language Grid – Content Catalogue
LT Services, Tools, Components, Technologies
Language Resources and Data Sets
Organisations, Languages, Service Types etc.
Cloud Infrastructure
21. • Goal: Deep Natural Language Understanding by 2030
• All official European and many additional languages
• Broad coverage, high quality, high precision
• Create new approaches, algorithms, data sets
• Across modalities: text, text types, speech, video etc.
• Across platforms: messaging, telephony, social, mobile,
IoT, robots, smart devices, conversational technologies etc.
• Across cultures: knowledge, customs, formalities, humour,
emotion, subjectivity, biases, opinions, filter bubble etc.
• How? As the next EU FET Flagship Project!
SCIC Lunchtime Session on Digital Transformation – 12 Nov. 2018
Human Language Project
21
22. Proposal
• FET Flagship Projects: Up to 1B€ of funding for up to ten years
• Flagships: Human Brain Project, Graphene, Quantum (2019)
• H2020 FETFLAG-01-2018 called for preparatory actions, i.e.,
small (1M€) projects to prepare the full flagship proposal
• Our proposal: “Human Language Project Preparation”
• Consortium of 16 partners (coordinated by DFKI)
• 375+ letters of support including 16 ministries and 24 national
language institutions
• More information: http://human-language-project.eu
SCIC Lunchtime Session on Digital Transformation – 12 Nov. 2018 22
23. HLP Partnering Projects
• Language-specific and/or regional
consortia doing research
on their own languages
• Close collaboration with the Core Project
• Overlap between CP and PP in terms of partners
HLP
Partnering
Project:
Spanish
PP
PP
HLP
Partnering
Project:
Italian
HLP
Partnering
Project:
Greek
HLP
Partnering
Project:
German
HLP
Partnering
Project:
Polish
HLP
Partnering
Project:
Baltic
languages
PP
HLP
Core Project
HLP
Partnering
Project:
Dutch
PP
Important dates and next steps
• 11. Sept 2018: EP “Language equality” resolution
• 18. Sept 2018: HLP Prep proposal submitted
• 27. Sept 2018: “Language equality” conference (EP)
• 29. Nov 2018: Results to be circulated to the consortia
• 04. Dec 2018: Official announcement of winning proposals at ICT 2018
• 01. March 2019: Start of the up to six FET Flagship preparation projects
• 29. Feb 2020: End of the up to six projects
23
HLP Core Project
• Coordination of the flagship (CP and PPs)
• Continuous roadmap development
• General technology and algorithm development
• Digital data, resources, computing and
collaboration infrastructure
25. • Since approx. 2015, with breakthroughs in neural technolo-
gies, Machine Translation has been getting better and better.
• All areas of AI look for “super-human performance” but
language is fundamentally different and much more complex.
• Neural AI approaches cannot understand language,
they process it according to huge collected data sets.
• In many use cases, mistakes can be tolerated.
• But: translation and interpretation are often mission-critical!
• Mistakes can have serious consequences (politics, medicine).
Translation and Interpretation
SCIC Lunchtime Session on Digital Transformation – 12 Nov. 2018 25
26. • Example: Lecture Translator
– University lectures are automatically transcribed and translated,
in near-real time, into several languages
– Students can follow the translation – or only the transcription –
through a web interface
• Example: Presentation Translator
– Presenter can have the speech automatically translated
– Translations are displayed as subtitles
• Example: Call Translator
– Internet telephony provider offers automatic voice translation
Speech Translation
SCIC Lunchtime Session on Digital Transformation – 12 Nov. 2018 26
27. • Lexifone – http://www.lexifone.com
– Israeli startup, launched a
telephone-based service in 2013
– Enable real-time translation and/or
transcription for any conference call.
• VoiceTra – http://voicetra.nict.go.jp
– The Nara Institute of Science
and Technology’s translation app
covers 27 languages.
• Mymanu – https://www.mymanu.com
– Mymanu is deploying “smart” earbuds
to make conversations in multiple
languages easier.
• TYWI – http://www.translateyourworld.com/
– Enables across-language
communication at full speed.
SCIC Lunchtime Session on Digital Transformation – 12 Nov. 2018 27
5-Year-Vision
28. • The three example applications work surprisingly well for
their respective domains and registers. But:
– They are far from being perfect.
– They aren’t robust.
– They cannot cope with unforeseen situations.
– They cannot understand language as humans do.
– They are not (yet?) suited for conference interpretation.
Ø Limitations as regards their fields of application.
Ø Interpretation is often mission-critical.
Ø Human interpreters won’t be replaced anytime soon.
Issues and Limitations
SCIC Lunchtime Session on Digital Transformation – 12 Nov. 2018 28
32. SCIC Lunchtime Session on Digital Transformation – 12 Nov. 2018 32
https://slator.com/features/ai-interpreter-fail-at-china-summit-sparks-debate-about-future-of-profession/
33. Summary & Conclusions
Ø AI is disrupting all industries – including translation and,
increasingly, also interpretation.
Ø But: perfect, robust, precise language technologies (incl.
written/spoken MT and interpretation) are still far away.
Ø The Human Language Project is still a vision. Its goal:
develop new breakthroughs in Language Technology.
Ø More and more demand for LT – including interpretation.
Ø The machine will support human interpreters and help
them become more efficient – it will not replace them.
Ø At least not in important or critical use cases.
Ø In other use cases the machine opens up interpretation.
SCIC Lunchtime Session on Digital Transformation – 12 Nov. 2018 33
34. Augmented Interpretation
• SCIC repo, 4,000 speeches (3,000 public, 1,000 private)
• Many R&D groups currently work on TED talk data sets
• Suggestion 1: establish bridges between SCIC and
LT research – share your data set (maybe in ELITR?).
Maybe organise a shared task around the data set?
• Help build the next generation of AI tools for interpreters
• Tailored to the needs, topics, domains and established
workflows of conference interpreters in the EC/EP
• Both for preparing a meeting and for use in the booth
• Glossary production, summarization, ASR, translation etc.
• Suggestion 2: make use of services provided by the
ELG to build smart tools for human interpreters (2020).
SCIC Lunchtime Session on Digital Transformation – 12 Nov. 2018 34
35. Thank you very much!
Dr. Georg Rehm
Senior Researcher
DFKI Research Fellow
DFKI Berlin
! georg.rehm@dfki.de
! http://georg-re.hm
! http://de.linkedin.com/in/georgrehm
! https://www.slideshare.net/georgrehm
SCIC Lunchtime Session on Digital Transformation – 12 Nov. 2018 35