Cloud Robotics for Human-Robot Dialogues
Komei Sugiura
Senior Researcher,
National Institute of Information and Communications Technology
Trustee,
RoboCup Federation
NICT:
Japan’s National Research Institute for ICT
Possible collaborations
• human-robot communication, scene understanding, multimodal dialogues, IoT data mining, and other robotics/machine-learning/CV topics
Annual budget: ¥29.7B (£168M)
Researchers / staff: 434 / 937
Research topics
Spoken language processing, natural language processing, machine translation, databases, data mining, etc. @Kyoto
Photonic network, wireless network, cybersecurity, time standard, neuroscience, space weather, etc. @Tokyo, Osaka
VoiceTra: >1M downloads since July 2010
Multimodal dialogues with robots: Language processing using non-linguistic information is challenging
Smartphones and other consumer devices
Language processing using non-linguistic information is beneficial
cf. Market size of speech recognition: ¥88B@2013 → ¥170B@2018 (£1B)*
* Estimate by NEDO, TSC Foresight Vol.8, 2015
"Show me today's schedule"
"Sushi restaurants around here"
Benefit for QA/search: GPS, contacts, other context info.
Current communication with robots
Limited multimodality and scalability in robot intelligence
"Throw them away." → ??
"Is there any milk in the fridge?" → ??
cf. [Steels 2003, Roy 2005, Iwahashi 2007, Kollar+ 2010, Yu+ 2013]
Key Question:
How can we make robot intelligence scalable and multimodal?
Major speech recognition engines are trained with large-scale corpora (>1000 hours ≒ 100M utterances), and continuously improved as cloud services
RoboCup@Home: Target user scenario with service robots
XIMERA 3 (by NICT)
Voice talent
cf. [Sugiura+ ICRA2014]
Can we make such innovations in robotics?
• Training with large-scale datasets and continuous improvements in e.g. dialogues, object recognition, grasping, simulation, …
(1) Rospeex: We built a cloud robotics platform for multilingual dialogues*
• 30,000 unique users since Sep. 2013
• Non-monologue speech synthesis designed for robots [Sugiura+ 2014]
• Word Error Rate = 7.9% for IWSLT tst2011 (1st-place winners in IWSLT12, 13, 14)
Python & C++ samples are available (search for "rospeex")
* Research/development use only
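The word error rate quoted on this slide is conventionally defined as the word-level edit distance (substitutions + deletions + insertions) between the recognizer output and a reference transcript, divided by the number of reference words. The following is a minimal sketch of that standard definition, not the IWSLT scoring tool or anything from rospeex; the function name `word_error_rate` and the example sentences are assumptions made for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing in hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example: the recognizer drops one of seven words.
wer = word_error_rate("is there any milk in the fridge",
                      "is there any milk in fridge")
print(f"WER = {wer:.3f}")
```

Because insertions count as errors, this ratio can exceed 1.0 when the hypothesis is much longer than the reference.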
Rospeex’s positioning in robot dialogue quadrants
Cloud APIs (Google, Microsoft, Nuance, NTT docomo, Wit.ai, …)
Free software
Commercial software
OpenHRI, PocketSphinx, Festival
Cloud-based
Stand-alone
Robot middleware-compatible
Incompatible
Does not work with very low-spec PCs
Robotics-specific logs are lost
Authentication
Low quality
Expensive
IP distribution of rospeex users
Rospeex has been applied to:
Humanoids, web agents, conversational robots for elderly people, automotive navigation systems, smart-home interfaces
(2) Building domestic service robots
(1st place in 2008 & 2010, 2nd place in 2009 & 2012)
• RoboCup@Home: The largest competition for domestic service robots
– Focuses on human-robot interaction and mobile manipulation
• Challenges
– Navigation in unknown environments (e.g. real shop), handling everyday objects, spoken dialogues in very noisy environments (70dBA), …
• cf. Social impacts of other RoboCup leagues
– RoboCupRescue: Fukushima power plant investigation
– Aldebaran sold >1000 NAO robots and was acquired by SoftBank for US$100M, …
by Channel 5
(3) Machine learning for environments: Air pollution prediction can reduce social cost
• Losses from PM2.5 and other air pollutants
– 3.3M premature deaths per year [Lelieveld, Nature, 2015]
• Prediction can prevent possible exposure, but prediction accuracy is quite low
– Standard approach gave only 42% accuracy in Fukuoka@2013-14*
• Applying the DPT-DRNN method to open weather data outperforms a standard weather model-based approach
*threat score = TP/(TP+FP+FN)
Premature deaths in London ≒ 2,800 @2010
[Ong & Sugiura, IEEE BigData 2014]
Hefei, China 2015 (not fog, not cloudy)
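The threat score in the footnote of this slide, TP/(TP+FP+FN), can be computed directly from a confusion matrix. The sketch below only illustrates that formula; the function name `threat_score` and the counts are hypothetical and are not taken from the Fukuoka experiment.

```python
def threat_score(tp: int, fp: int, fn: int) -> float:
    """Threat score: TP / (TP + FP + FN).

    True negatives are deliberately absent from the formula, so correctly
    predicting the many "no event" periods does not inflate the score.
    """
    denominator = tp + fp + fn
    if denominator == 0:
        raise ValueError("threat score is undefined when TP + FP + FN == 0")
    return tp / denominator


# Hypothetical counts, chosen only to show the arithmetic:
# 3 correctly predicted events, 1 false alarm, 1 missed event.
print(threat_score(tp=3, fp=1, fn=1))
```

A forecaster that never predicts an event scores 0 whenever at least one event occurs, however many quiet periods it gets right, which is why this measure is a stricter yardstick than plain accuracy for rare events.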
