SlideShare a Scribd company logo
UMD Crowdsourcing and Translation Workshop, June 10-11 Language Grid: Service-OrientedInfrastructurefor Multilingual Society Donghui Lin National Institute of Information and Communications Technology (NICT), Japan
Introduction of the Language Grid http://langrid.nict.go.jp 2
Language Grid Architecture Education Medical Care Disaster Management more more Sharing Multilingual Information Translation Services at Hospital Receptions Universal Playground Language Support for Multicultural Societies Sharing language resources such as dictionaries and machine translators around the world German Research Center for Artificial Intelligence Kookmin University Stuttgart University National Institute of Informatics Princeton University National Research Council, Italy Chinese Academy of Sciences Google Inc. NICT NTT Research Labs Asian Disaster Reduction Center NECTEC Univ. of Indonesia 3
From Language Resources to Language Services Dictionary Service 避難場所 Wrapping disaster shelter Dictionary Return translated word Dictionary Service Parallel Text Service 避難場所は、家から近い学校です。 Wrapping In the case of disaster, people should be evacuated to a school nearby their house. Return similar translated text Parallel Text Parallel Text Service Machine Translation Service 避難場所は、家から近い学校です。 Wrapping Disaster shelter is school close from a house. Translate  by machine Machine Translator Machine Translation Service Human Translation Service 避難場所は、家から近い学校です。 Wrapping Your disaster shelter is the school closest to your house. Translate with high quality Human Translator Human Translation Service
From Atomic Services to Composite Services 授業料は無償ですが、教材費や給食費は必要です。 (Original Text in English:Tuition is free,but cost of textbooks and school meals is required) Multilingual Service Infrastructure Multilingual Communication Support System Over60 language services are  available currently (April 2010) AutoComplete Service Multilingual Communication Support System for a middle school in Kawasaki City, Japan School Guidance for Foreign Guardians Mie Prefecture, JapanParallel Text As aulas referentes ao ensino obrigatório são gratuitas. Porém, serão cobrados os valores referentes à refeição escolar e os materiais a serem usados. (Translation Result in English: Tuition of compulsory education is free,but cost of textbooks and school meals is required) If original text is registered  as parallel text item, perfect  translation can be acquired Composite Translation Service A taxa escolar é gratuita, mas a um custo de material educativo e um gasto de merenda escolar são necessários. (Translation Result in English: Although tuition is free,cost of textbooks and waste of school meals is necessary.) Display area for conversationlogs  J-Server, Kodensha (Rule-based Translation) Translation(ja-en) Multi-hop Translation Web-transer, CrossLanguage (Rule-based Translation) Translation(en-pt) Translation quality is similar to  that of free translation on the Web ・Multi-hop translation ・Composite translation    with dictionary  ・Best translation selection A taxa escolar é gratuita, mas as taxas de materiais didáticos e merenda escolar são as taxas necessárias. (Translation Result in English: Although tuition is free,the rate that learns materials and school meals is the necessary cost. ) Composite  Translation with Dictionary Mecab, NTT Morphological Analysis Professional translation  can be realized by combining dictionaries fortechnical terminology (jargon) TreeTagger, University of Stuttgart Morphological Analysis Education Dictionary,  The Toyota Foundation Dictionary Best translation result can  be selected from multiple  translation services AutoComplete Service Composite  Translation with Dictionary Best Translation Selection Service O ensino é livre, as taxas de materiais de aprendizagem taxas de merenda escolar e é necessário. (Translation Result in English: Tuition is free,but cost of textbooks and school feeding is required.) Translation quality is possible to be further improved by learning from  examples (parallel texts). (Under development by Kyoto Univ.) GoogleTranslate,  Google (Statistical Translation) Translation(ja-pt) Best Translation Selection
Participants and Language Resources 74% of participants are from Japan. ,[object Object]
University / Research Institute
Kyoto Univ. (Japan), Shanghai Jiao Tong Univ. (China), Univ. of Stuttgart (Germany), IT Univ. of Copenhagen (Denmark), Princeton Univ. (USA), DFKI (Germany), CNR (Italy), Chinese Academy of Sciences (China), NECTEC (Thailand), and more.
NPO/NGO/Public Sector
NGOs for disaster reduction, NGOs for Intercultural exchange, Public Junior-high schools, City Boards of Education, and more.
Corporate (CSR activities / language resource providers)
Google, NTT, Toshiba, Oki, Kodensha, Translution, and more.
Language Resources  (60+ language resources)
Machine Translator
J-Server (ja/en/ko/zh), Web-Transer (ja/en/ko/zh/fr/de/it/es), Toshiba (en/zh), Parsit (en>th), Google Translate (40+ languages), and more.
Bilingual Dictionary, Concept Dictionary
EDR , Wordnet, Life Science Dictionary, Multi-language Glossary on Natural Disasters, and more.
Parallel Text
Morphological Analyzer, Dependency Parser
Voice Recognition6
Participatory Design Approach Users create and share their own language services, combine their services with services on the Language Grid to support their multilingual activities Multilingual medical support system in a hospital Multilingual chatting system in a middle school Multilingual BBS for international staffs in an NPO
Participatory Design Approach Users customize how to use language services on the Language Grid to support their multilingual activities User can customize translation services  and dictionary services Multilingual BBS
Using/Customizing Language Services ,[object Object]
http://langrid.org/playground
Try language services on the Language Grid
Machine Translation
Dictionary / Concept Dictionary
Parallel Text
Morphological Analysis
…
Language Grid Toolbox
http://langrid-tool.nict.go.jp
Try customization of language services on the Language Grid
Text Translation (Back-translation, Composite translation)
Language Resource Creation (Dictionary, Parallel Text…)
Multilingual BBS
…,[object Object]
Open Smart Classroom (IEEE TKED 2009) Language Grid YueSuo, Naoki Miyata, Hiroki Morikawa, Toru Ishida and Yuanchun Shi. Open Smart Classroom: Extensible and Scalable Learning System in Smart Space using Web Service Technology. IEEE Transactions on Knowledge and Data Engineering, Vol.21, No.6, pp. 814-828, 2009. Gold Prize (1st place) in "Microsoft Cup" IEEE China Student Paper Contest.
Analysis of Multicultural Communication (CSCW 2006, CHI 2009) Problems of MT-mediated communication Translation Asymmetry Echoing for ratification does not work. Naomi Yamashita, Reiko Inaba, Hideaki Kuzuoka and Toru Ishida. Difficulties in Establishing Common Ground in Multiparty Groups using Machine Translation. International Conference on Human Factors in Computing Systems (CHI-09), pp. 679-688, 2009. Naomi Yamashita and Toru Ishida. Effects of Machine Translation on Collaborative Work. International Conference on Computer Supported Cooperative Work (CSCW-06), pp. 515-523, 2006.

More Related Content

What's hot

A New Approach: Automatically Identify Naming Word from Bengali Sentence for ...
A New Approach: Automatically Identify Naming Word from Bengali Sentence for ...A New Approach: Automatically Identify Naming Word from Bengali Sentence for ...
A New Approach: Automatically Identify Naming Word from Bengali Sentence for ...
Syeful Islam
 
Multi-modal Neural Machine Translation - Iacer Calixto
Multi-modal Neural Machine Translation - Iacer CalixtoMulti-modal Neural Machine Translation - Iacer Calixto
Multi-modal Neural Machine Translation - Iacer Calixto
Sebastian Ruder
 
The main-principles-of-text-to-speech-synthesis-system
The main-principles-of-text-to-speech-synthesis-systemThe main-principles-of-text-to-speech-synthesis-system
The main-principles-of-text-to-speech-synthesis-systemCemal Ardil
 
Lecture 1: Semantic Analysis in Language Technology
Lecture 1: Semantic Analysis in Language TechnologyLecture 1: Semantic Analysis in Language Technology
Lecture 1: Semantic Analysis in Language Technology
Marina Santini
 
Word Segmentation and Lexical Normalization for Unsegmented Languages
Word Segmentation and Lexical Normalization for Unsegmented LanguagesWord Segmentation and Lexical Normalization for Unsegmented Languages
Word Segmentation and Lexical Normalization for Unsegmented Languages
hs0041
 
Nlp presentation
Nlp presentationNlp presentation
Nlp presentation
Surya Sg
 
Human Evaluation: Why do we need it? - Dr. Sheila Castilho
Human Evaluation: Why do we need it? - Dr. Sheila CastilhoHuman Evaluation: Why do we need it? - Dr. Sheila Castilho
Human Evaluation: Why do we need it? - Dr. Sheila Castilho
Sebastian Ruder
 
Design Analysis Rules to Identify Proper Noun from Bengali Sentence for Univ...
Design Analysis Rules to Identify Proper Noun  from Bengali Sentence for Univ...Design Analysis Rules to Identify Proper Noun  from Bengali Sentence for Univ...
Design Analysis Rules to Identify Proper Noun from Bengali Sentence for Univ...
Syeful Islam
 
Detection of Verbal Multi-Word Expressions via Conditional Random Fields with...
Detection of Verbal Multi-Word Expressions via Conditional Random Fields with...Detection of Verbal Multi-Word Expressions via Conditional Random Fields with...
Detection of Verbal Multi-Word Expressions via Conditional Random Fields with...
Lifeng (Aaron) Han
 
Deep learning for natural language embeddings
Deep learning for natural language embeddingsDeep learning for natural language embeddings
Deep learning for natural language embeddings
Roelof Pieters
 
A prior case study of natural language processing on different domain
A prior case study of natural language processing  on different domain A prior case study of natural language processing  on different domain
A prior case study of natural language processing on different domain
IJECEIAES
 
1 computational linguistics an introduction
1 computational linguistics   an introduction1 computational linguistics   an introduction
1 computational linguistics an introduction
ThennarasuSakkan
 
How can text-mining leverage developments in Deep Learning? Presentation at ...
How can text-mining leverage developments in Deep Learning?  Presentation at ...How can text-mining leverage developments in Deep Learning?  Presentation at ...
How can text-mining leverage developments in Deep Learning? Presentation at ...
jcscholtes
 
A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...
A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...
A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...
Syeful Islam
 
The NLP Muppets revolution!
The NLP Muppets revolution!The NLP Muppets revolution!
The NLP Muppets revolution!
Fabio Petroni, PhD
 
Frontiers of Natural Language Processing
Frontiers of Natural Language ProcessingFrontiers of Natural Language Processing
Frontiers of Natural Language Processing
Sebastian Ruder
 
Nonparametric Bayesian Word Discovery for Symbol Emergence in Robotics
Nonparametric Bayesian Word Discovery for Symbol Emergence in RoboticsNonparametric Bayesian Word Discovery for Symbol Emergence in Robotics
Nonparametric Bayesian Word Discovery for Symbol Emergence in Robotics
Tadahiro Taniguchi
 
download
downloaddownload
downloadbutest
 
TAAI 2016 Keynote Talk: Intercultural Collaboration as a Multi‐Agent System
TAAI 2016 Keynote Talk: Intercultural Collaboration as a Multi‐Agent SystemTAAI 2016 Keynote Talk: Intercultural Collaboration as a Multi‐Agent System
TAAI 2016 Keynote Talk: Intercultural Collaboration as a Multi‐Agent System
Yi-Shin Chen
 

What's hot (20)

A New Approach: Automatically Identify Naming Word from Bengali Sentence for ...
A New Approach: Automatically Identify Naming Word from Bengali Sentence for ...A New Approach: Automatically Identify Naming Word from Bengali Sentence for ...
A New Approach: Automatically Identify Naming Word from Bengali Sentence for ...
 
Multi-modal Neural Machine Translation - Iacer Calixto
Multi-modal Neural Machine Translation - Iacer CalixtoMulti-modal Neural Machine Translation - Iacer Calixto
Multi-modal Neural Machine Translation - Iacer Calixto
 
Language models
Language modelsLanguage models
Language models
 
The main-principles-of-text-to-speech-synthesis-system
The main-principles-of-text-to-speech-synthesis-systemThe main-principles-of-text-to-speech-synthesis-system
The main-principles-of-text-to-speech-synthesis-system
 
Lecture 1: Semantic Analysis in Language Technology
Lecture 1: Semantic Analysis in Language TechnologyLecture 1: Semantic Analysis in Language Technology
Lecture 1: Semantic Analysis in Language Technology
 
Word Segmentation and Lexical Normalization for Unsegmented Languages
Word Segmentation and Lexical Normalization for Unsegmented LanguagesWord Segmentation and Lexical Normalization for Unsegmented Languages
Word Segmentation and Lexical Normalization for Unsegmented Languages
 
Nlp presentation
Nlp presentationNlp presentation
Nlp presentation
 
Human Evaluation: Why do we need it? - Dr. Sheila Castilho
Human Evaluation: Why do we need it? - Dr. Sheila CastilhoHuman Evaluation: Why do we need it? - Dr. Sheila Castilho
Human Evaluation: Why do we need it? - Dr. Sheila Castilho
 
Design Analysis Rules to Identify Proper Noun from Bengali Sentence for Univ...
Design Analysis Rules to Identify Proper Noun  from Bengali Sentence for Univ...Design Analysis Rules to Identify Proper Noun  from Bengali Sentence for Univ...
Design Analysis Rules to Identify Proper Noun from Bengali Sentence for Univ...
 
Detection of Verbal Multi-Word Expressions via Conditional Random Fields with...
Detection of Verbal Multi-Word Expressions via Conditional Random Fields with...Detection of Verbal Multi-Word Expressions via Conditional Random Fields with...
Detection of Verbal Multi-Word Expressions via Conditional Random Fields with...
 
Deep learning for natural language embeddings
Deep learning for natural language embeddingsDeep learning for natural language embeddings
Deep learning for natural language embeddings
 
A prior case study of natural language processing on different domain
A prior case study of natural language processing  on different domain A prior case study of natural language processing  on different domain
A prior case study of natural language processing on different domain
 
1 computational linguistics an introduction
1 computational linguistics   an introduction1 computational linguistics   an introduction
1 computational linguistics an introduction
 
How can text-mining leverage developments in Deep Learning? Presentation at ...
How can text-mining leverage developments in Deep Learning?  Presentation at ...How can text-mining leverage developments in Deep Learning?  Presentation at ...
How can text-mining leverage developments in Deep Learning? Presentation at ...
 
A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...
A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...
A New Approach: Automatically Identify Proper Noun from Bengali Sentence for ...
 
The NLP Muppets revolution!
The NLP Muppets revolution!The NLP Muppets revolution!
The NLP Muppets revolution!
 
Frontiers of Natural Language Processing
Frontiers of Natural Language ProcessingFrontiers of Natural Language Processing
Frontiers of Natural Language Processing
 
Nonparametric Bayesian Word Discovery for Symbol Emergence in Robotics
Nonparametric Bayesian Word Discovery for Symbol Emergence in RoboticsNonparametric Bayesian Word Discovery for Symbol Emergence in Robotics
Nonparametric Bayesian Word Discovery for Symbol Emergence in Robotics
 
download
downloaddownload
download
 
TAAI 2016 Keynote Talk: Intercultural Collaboration as a Multi‐Agent System
TAAI 2016 Keynote Talk: Intercultural Collaboration as a Multi‐Agent SystemTAAI 2016 Keynote Talk: Intercultural Collaboration as a Multi‐Agent System
TAAI 2016 Keynote Talk: Intercultural Collaboration as a Multi‐Agent System
 

Similar to Language Grid

Machine translation from English to Hindi
Machine translation from English to HindiMachine translation from English to Hindi
Machine translation from English to Hindi
Rajat Jain
 
visH (fin).pptx
visH (fin).pptxvisH (fin).pptx
visH (fin).pptx
tefflontrolegdy
 
Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...
John Tinsley
 
Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...
Iconic Translation Machines
 
Conversational AI:An Overview of Techniques, Applications & Future Scope - Ph...
Conversational AI:An Overview of Techniques, Applications & Future Scope - Ph...Conversational AI:An Overview of Techniques, Applications & Future Scope - Ph...
Conversational AI:An Overview of Techniques, Applications & Future Scope - Ph...
PhD Assistance
 
An Overview Of Natural Language Processing
An Overview Of Natural Language ProcessingAn Overview Of Natural Language Processing
An Overview Of Natural Language Processing
Scott Faria
 
Wreck a nice beach: adventures in speech recognition
Wreck a nice beach: adventures in speech recognitionWreck a nice beach: adventures in speech recognition
Wreck a nice beach: adventures in speech recognition
Stephen Marquard
 
Efficient Intralingual Text To Speech Web Podcasting And Recording
Efficient Intralingual Text To Speech Web Podcasting And RecordingEfficient Intralingual Text To Speech Web Podcasting And Recording
Efficient Intralingual Text To Speech Web Podcasting And Recording
IOSR Journals
 
Introduction to development of lexical databases
Introduction to development of lexical databasesIntroduction to development of lexical databases
Introduction to development of lexical databases
Muhammad Shoaib Chaudhary
 
NLP unit-VI.pptx
NLP unit-VI.pptxNLP unit-VI.pptx
NLP unit-VI.pptx
aishuchemate01
 
APznzaalselifJKjGQdTCA51cF7bldYdFMvDcshM8opKFZ_ZaIV-dqkiLoIKIfhz2tS6Fw5UBk25u...
APznzaalselifJKjGQdTCA51cF7bldYdFMvDcshM8opKFZ_ZaIV-dqkiLoIKIfhz2tS6Fw5UBk25u...APznzaalselifJKjGQdTCA51cF7bldYdFMvDcshM8opKFZ_ZaIV-dqkiLoIKIfhz2tS6Fw5UBk25u...
APznzaalselifJKjGQdTCA51cF7bldYdFMvDcshM8opKFZ_ZaIV-dqkiLoIKIfhz2tS6Fw5UBk25u...
AishwaryaChemate
 
SpokenMedia Project: Media-Linked Transcripts and Rich Media Notebooks for Le...
SpokenMedia Project: Media-Linked Transcripts and Rich Media Notebooks for Le...SpokenMedia Project: Media-Linked Transcripts and Rich Media Notebooks for Le...
SpokenMedia Project: Media-Linked Transcripts and Rich Media Notebooks for Le...
Brandon Muramatsu
 
Designing the Workflow of a Language Interpretation Device Using Artificial I...
Designing the Workflow of a Language Interpretation Device Using Artificial I...Designing the Workflow of a Language Interpretation Device Using Artificial I...
Designing the Workflow of a Language Interpretation Device Using Artificial I...
IOSR Journals
 
Pangeanic Cor-ActivaTM-Neural machine translation Taus Tokyo 2017
Pangeanic Cor-ActivaTM-Neural machine translation Taus Tokyo 2017Pangeanic Cor-ActivaTM-Neural machine translation Taus Tokyo 2017
Pangeanic Cor-ActivaTM-Neural machine translation Taus Tokyo 2017
Manuel Herranz
 
An Application for Performing Real Time Speech Translation in Mobile Environment
An Application for Performing Real Time Speech Translation in Mobile EnvironmentAn Application for Performing Real Time Speech Translation in Mobile Environment
An Application for Performing Real Time Speech Translation in Mobile Environment
Association of Scientists, Developers and Faculties
 
download
downloaddownload
downloadbutest
 
download
downloaddownload
downloadbutest
 
Computational linguistics
Computational linguisticsComputational linguistics
Computational linguistics
AdnanBaloch15
 
Developing an architecture for translation engine using ontology
Developing an architecture for translation engine using ontologyDeveloping an architecture for translation engine using ontology
Developing an architecture for translation engine using ontology
Alexander Decker
 

Similar to Language Grid (20)

Machine translation from English to Hindi
Machine translation from English to HindiMachine translation from English to Hindi
Machine translation from English to Hindi
 
visH (fin).pptx
visH (fin).pptxvisH (fin).pptx
visH (fin).pptx
 
Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...
 
Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...
 
Conversational AI:An Overview of Techniques, Applications & Future Scope - Ph...
Conversational AI:An Overview of Techniques, Applications & Future Scope - Ph...Conversational AI:An Overview of Techniques, Applications & Future Scope - Ph...
Conversational AI:An Overview of Techniques, Applications & Future Scope - Ph...
 
An Overview Of Natural Language Processing
An Overview Of Natural Language ProcessingAn Overview Of Natural Language Processing
An Overview Of Natural Language Processing
 
machine transaltion
machine transaltionmachine transaltion
machine transaltion
 
Wreck a nice beach: adventures in speech recognition
Wreck a nice beach: adventures in speech recognitionWreck a nice beach: adventures in speech recognition
Wreck a nice beach: adventures in speech recognition
 
Efficient Intralingual Text To Speech Web Podcasting And Recording
Efficient Intralingual Text To Speech Web Podcasting And RecordingEfficient Intralingual Text To Speech Web Podcasting And Recording
Efficient Intralingual Text To Speech Web Podcasting And Recording
 
Introduction to development of lexical databases
Introduction to development of lexical databasesIntroduction to development of lexical databases
Introduction to development of lexical databases
 
NLP unit-VI.pptx
NLP unit-VI.pptxNLP unit-VI.pptx
NLP unit-VI.pptx
 
APznzaalselifJKjGQdTCA51cF7bldYdFMvDcshM8opKFZ_ZaIV-dqkiLoIKIfhz2tS6Fw5UBk25u...
APznzaalselifJKjGQdTCA51cF7bldYdFMvDcshM8opKFZ_ZaIV-dqkiLoIKIfhz2tS6Fw5UBk25u...APznzaalselifJKjGQdTCA51cF7bldYdFMvDcshM8opKFZ_ZaIV-dqkiLoIKIfhz2tS6Fw5UBk25u...
APznzaalselifJKjGQdTCA51cF7bldYdFMvDcshM8opKFZ_ZaIV-dqkiLoIKIfhz2tS6Fw5UBk25u...
 
SpokenMedia Project: Media-Linked Transcripts and Rich Media Notebooks for Le...
SpokenMedia Project: Media-Linked Transcripts and Rich Media Notebooks for Le...SpokenMedia Project: Media-Linked Transcripts and Rich Media Notebooks for Le...
SpokenMedia Project: Media-Linked Transcripts and Rich Media Notebooks for Le...
 
Designing the Workflow of a Language Interpretation Device Using Artificial I...
Designing the Workflow of a Language Interpretation Device Using Artificial I...Designing the Workflow of a Language Interpretation Device Using Artificial I...
Designing the Workflow of a Language Interpretation Device Using Artificial I...
 
Pangeanic Cor-ActivaTM-Neural machine translation Taus Tokyo 2017
Pangeanic Cor-ActivaTM-Neural machine translation Taus Tokyo 2017Pangeanic Cor-ActivaTM-Neural machine translation Taus Tokyo 2017
Pangeanic Cor-ActivaTM-Neural machine translation Taus Tokyo 2017
 
An Application for Performing Real Time Speech Translation in Mobile Environment
An Application for Performing Real Time Speech Translation in Mobile EnvironmentAn Application for Performing Real Time Speech Translation in Mobile Environment
An Application for Performing Real Time Speech Translation in Mobile Environment
 
download
downloaddownload
download
 
download
downloaddownload
download
 
Computational linguistics
Computational linguisticsComputational linguistics
Computational linguistics
 
Developing an architecture for translation engine using ontology
Developing an architecture for translation engine using ontologyDeveloping an architecture for translation engine using ontology
Developing an architecture for translation engine using ontology
 

Recently uploaded

GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Zilliz
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website
Pixlogix Infotech
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
Zilliz
 

Recently uploaded (20)

GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
 

Language Grid

  • 1. UMD Crowdsourcing and Translation Workshop, June 10-11 Language Grid: Service-OrientedInfrastructurefor Multilingual Society Donghui Lin National Institute of Information and Communications Technology (NICT), Japan
  • 2. Introduction of the Language Grid http://langrid.nict.go.jp 2
  • 3. Language Grid Architecture Education Medical Care Disaster Management more more Sharing Multilingual Information Translation Services at Hospital Receptions Universal Playground Language Support for Multicultural Societies Sharing language resources such as dictionaries and machine translators around the world German Research Center for Artificial Intelligence Kookmin University Stuttgart University National Institute of Informatics Princeton University National Research Council, Italy Chinese Academy of Sciences Google Inc. NICT NTT Research Labs Asian Disaster Reduction Center NECTEC Univ. of Indonesia 3
  • 4. From Language Resources to Language Services Dictionary Service 避難場所 Wrapping disaster shelter Dictionary Return translated word Dictionary Service Parallel Text Service 避難場所は、家から近い学校です。 Wrapping In the case of disaster, people should be evacuated to a school nearby their house. Return similar translated text Parallel Text Parallel Text Service Machine Translation Service 避難場所は、家から近い学校です。 Wrapping Disaster shelter is school close from a house. Translate by machine Machine Translator Machine Translation Service Human Translation Service 避難場所は、家から近い学校です。 Wrapping Your disaster shelter is the school closest to your house. Translate with high quality Human Translator Human Translation Service
  • 5. From Atomic Services to Composite Services 授業料は無償ですが、教材費や給食費は必要です。 (Original Text in English:Tuition is free,but cost of textbooks and school meals is required) Multilingual Service Infrastructure Multilingual Communication Support System Over60 language services are available currently (April 2010) AutoComplete Service Multilingual Communication Support System for a middle school in Kawasaki City, Japan School Guidance for Foreign Guardians Mie Prefecture, JapanParallel Text As aulas referentes ao ensino obrigatório são gratuitas. Porém, serão cobrados os valores referentes à refeição escolar e os materiais a serem usados. (Translation Result in English: Tuition of compulsory education is free,but cost of textbooks and school meals is required) If original text is registered as parallel text item, perfect translation can be acquired Composite Translation Service A taxa escolar é gratuita, mas a um custo de material educativo e um gasto de merenda escolar são necessários. (Translation Result in English: Although tuition is free,cost of textbooks and waste of school meals is necessary.) Display area for conversationlogs J-Server, Kodensha (Rule-based Translation) Translation(ja-en) Multi-hop Translation Web-transer, CrossLanguage (Rule-based Translation) Translation(en-pt) Translation quality is similar to that of free translation on the Web ・Multi-hop translation ・Composite translation with dictionary ・Best translation selection A taxa escolar é gratuita, mas as taxas de materiais didáticos e merenda escolar são as taxas necessárias. (Translation Result in English: Although tuition is free,the rate that learns materials and school meals is the necessary cost. ) Composite Translation with Dictionary Mecab, NTT Morphological Analysis Professional translation can be realized by combining dictionaries fortechnical terminology (jargon) TreeTagger, University of Stuttgart Morphological Analysis Education Dictionary, The Toyota Foundation Dictionary Best translation result can be selected from multiple translation services AutoComplete Service Composite Translation with Dictionary Best Translation Selection Service O ensino é livre, as taxas de materiais de aprendizagem taxas de merenda escolar e é necessário. (Translation Result in English: Tuition is free,but cost of textbooks and school feeding is required.) Translation quality is possible to be further improved by learning from examples (parallel texts). (Under development by Kyoto Univ.) GoogleTranslate, Google (Statistical Translation) Translation(ja-pt) Best Translation Selection
  • 6.
  • 8. Kyoto Univ. (Japan), Shanghai Jiao Tong Univ. (China), Univ. of Stuttgart (Germany), IT Univ. of Copenhagen (Denmark), Princeton Univ. (USA), DFKI (Germany), CNR (Italy), Chinese Academy of Sciences (China), NECTEC (Thailand), and more.
  • 10. NGOs for disaster reduction, NGOs for Intercultural exchange, Public Junior-high schools, City Boards of Education, and more.
  • 11. Corporate (CSR activities / language resource providers)
  • 12. Google, NTT, Toshiba, Oki, Kodensha, Translution, and more.
  • 13. Language Resources (60+ language resources)
  • 15. J-Server (ja/en/ko/zh), Web-Transer (ja/en/ko/zh/fr/de/it/es), Toshiba (en/zh), Parsit (en>th), Google Translate (40+ languages), and more.
  • 17. EDR , Wordnet, Life Science Dictionary, Multi-language Glossary on Natural Disasters, and more.
  • 21. Participatory Design Approach Users create and share their own language services, combine their services with services on the Language Grid to support their multilingual activities Multilingual medical support system in a hospital Multilingual chatting system in a middle school Multilingual BBS for international staffs in an NPO
  • 22. Participatory Design Approach Users customize how to use language services on the Language Grid to support their multilingual activities User can customize translation services and dictionary services Multilingual BBS
  • 23.
  • 25. Try language services on the Language Grid
  • 27. Dictionary / Concept Dictionary
  • 30.
  • 33. Try customization of language services on the Language Grid
  • 34. Text Translation (Back-translation, Composite translation)
  • 35. Language Resource Creation (Dictionary, Parallel Text…)
  • 37.
  • 38. Open Smart Classroom (IEEE TKED 2009) Language Grid YueSuo, Naoki Miyata, Hiroki Morikawa, Toru Ishida and Yuanchun Shi. Open Smart Classroom: Extensible and Scalable Learning System in Smart Space using Web Service Technology. IEEE Transactions on Knowledge and Data Engineering, Vol.21, No.6, pp. 814-828, 2009. Gold Prize (1st place) in "Microsoft Cup" IEEE China Student Paper Contest.
  • 39. Analysis of Multicultural Communication (CSCW 2006, CHI 2009) Problems of MT-mediated communication Translation Asymmetry Echoing for ratification does not work. Naomi Yamashita, Reiko Inaba, Hideaki Kuzuoka and Toru Ishida. Difficulties in Establishing Common Ground in Multiparty Groups using Machine Translation. International Conference on Human Factors in Computing Systems (CHI-09), pp. 679-688, 2009. Naomi Yamashita and Toru Ishida. Effects of Machine Translation on Collaborative Work. International Conference on Computer Supported Cooperative Work (CSCW-06), pp. 515-523, 2006.
  • 40. Cross Language News Analysis (ongoing) Translation andDictionary Creation バラク・オバマ 巴拉克·奥巴马 Japan China Korea USA World News Analysis Prof. Yoshioka, Hokkaido University, Japan
  • 41.
  • 42. Wiki-to-Wiki Translation (just started) Number of Wikipedia articles by language Support Wikipedia communities to create multilingual articles. Number of Wiktionary entries by language 15
  • 43. Research on Developing the Language Grid
  • 44.
  • 45. Morphological Analysis Technical Term Extraction Technical Term Bilingual Dictionary Technical Term Bilingual Dictionary Term Replacement Machine Translation Term Replacement + Any remaining terms Any remaining terms Constraint-basedHorizontal Service Composition (ISWC 2006) Original sentence X1 Vertical service composition is to create a workflow. Horizontal service composition is to select atomic services for a given workflow. Satisfy hard constraints (required functions), while maximizing soft constraints (QoS for example). Set of morphemes X2 Set of technical terms included in the sentence Set of technical terms included in the sentence No No yes yes X3 Intermediate code of technical term Translation of technical term Original sentence Set of technical terms Set of intermediate code X4 Sentence including intermediate code. X5 Translated sentence Set of intermediate code Set of term translations X6 Ahlem Ben Hassine, Matsubara Shigeo and Toru Ishida. Constraint-based Approach for Web Service Composition. International Semantic Web Conference (ISWC-06), pp. 130-143, 2006.
  • 46. Summary: Research Issues Interaction analysis Creating conversational common ground with inconsistent, asymmetric, and intransitive machine translations. (CSCW2006, CHI2009) Service Composition Horizontal service composition. (ISWC2006, ICWS2008) Service supervision. (ICWS2009, SCC2010) Context aware service composition for pivot translations. (IJCAI2009) Provider-centered trust for autonomic service composition. Pricing composite/atomic language services. User-centered QoS for composite language services. Customizing statistical translations with community dictionaries. Extending the Language Grid Emotions and pictograms in language services (ESWC2008) e-learning grid (IEEE Transactions on KDE, 2009) 19
  • 47. International Conference Initiated by the Language Grid
  • 48.
  • 49. Research Example 1: MT-Mediated Communication, Collaborative Translation
  • 50. 23 MT-Mediated Communication (CSCW06, CHI09) To observe and analyze the effects of machine translation on communication among monolinguals Participants: 5 Chinese-Japanesepairs, 3 Japanese-Korean pairs They have never met before the experiments Tasks: Each pair of users try to indentify the same avatars by using their mother tongues and machine translations Communication medias: MT-based multilingual chatting systems
  • 51. 24 Experiment Design Indentify the same avatars < China Side> < Japan Side> Multilingual Chatting System Based on the Language Grid
  • 52. Analysis of MT-Mediated Communication Problem of MT-mediated communication (1) Translation Asymmetry Echoing for ratification does not work.
  • 53. Analysis of MT-Mediated Communication Problem of MT-mediated communication (2) Translation Inconsistency Different expressions of a same sentence can get totally different translation results.
  • 54. MT-Mediated Collaborative Translation Protocols (IUI2009) Language Grid Basic concept of collaborative translation DaisukeMorita and Toru Ishida. Collaborative Translation by Monolinguals with Machine Translators. International Conference on Intelligent User Interfaces (IUI-09), pp. 361-365, 2009.
  • 55. Examples of Collaborative Translation Protocols
  • 56. Experiment Result Result of Japanese-English Translation Result of Japanese-Chinese Translation
  • 57. Research Example 2: Multilingual Localization
  • 58. Language Grid for Multilingual Localization (ICSOC2009, LREC2010) Machine translation VS Human translation Human translations are of high costs with long durations Machine translation have limited translation quality (in dimensions of fluency and adequacy) Language Grid improves traditional machine translation service by combining community dictionaries, but still has limitation in translation quality Language Grid for Localization Processes Localization processes requires perfect translation quality that machine translation services cannot provide 31 Let’s try to combine machine translation services and human activities using the Language Grid!!
  • 59.
  • 60. Experiment Design Localization Contents Business, University, Temple… Languages JA, EN, ZH-CN, ZH-TW, KO, PT, ES, DE, FR… Monolingual Foreign students in Kyoto University Bilingual Professional bilingual translator in Translation Company Machine Translation Services Composite translation with bilingual dictionary For each content, there is a dictionary covers 15%-25% of the total contents Comparison Localization Process by using Language Grid Common Localization Process
  • 62. Monolingual Modification Rate and Length of Original Sentences
  • 63. Monolingual Modification Rate and Translation Time Monolingual modification time keeps stable for different modification rate Bilingual translation/confirmation time becomes less when monolingual modification rate increases Bilingual Time Monolingual Time
  • 64. Monolingual Modification Rate and Translation Time Bilingual Time Monolingual Time
  • 65. Monolingual Modification Rate and Bilingual Time Reduction Rate The bilingual time reduction rate increases with the monolingual modification rate
  • 66. Challenging Issues Requirements of composing machine translation and human Methodologies of balancing the cost and quality Control of human translation qualities and machine translation quality Interaction design of human-human and human-machine translation