Cloud Robotics for Human-Robot Dialogues


This document discusses cloud robotics platforms for human-robot dialogues. It introduces Rospeex, a cloud robotics platform developed by NICT for multilingual dialogues. Rospeex has had 30,000 unique users since September 2013, and its speech recognition reports a word error rate of 7.9% on IWSLT tst2011. The document also covers NICT's work on domestic service robots through the RoboCup@Home competition, and its research on using machine learning to predict air pollution and reduce the associated social cost.


NICT: Japan’s National Research Institute for ICT
Possible collaborations
• Human-robot communication, scene understanding, multimodal dialogues, IoT data mining, and other robotics/machine-learning/CV topics
Annual budget: ¥29.7B (£168M)
Researchers / staff: 434 / 937
Research topics
• Spoken language processing, natural language processing, machine translation, databases, data mining, etc. @Kyoto
• Photonic networks, wireless networks, cybersecurity, time standards, neuroscience, space weather, etc. @Tokyo, Osaka
VoiceTra: >1M downloads since July 2010

Multimodal dialogues with robots: Language processing using non-linguistic information is challenging
Smartphones and other consumer devices
• Language processing using non-linguistic information gives benefit
• Example queries: "Show me today’s schedule", "Sushi restaurants around here"
• Benefit for QA/search from GPS, contacts, and other context info.
• cf. Market size of speech recognition: ¥88B@2013 → ¥170B@2018 (£1B)*
Current communication with robots
• Limited multi-modality and scalability in robot intelligence
• Example utterances: "Throw them away.", "Is there any milk in the fridge?" (robot: "??")
• cf. [Steels 2003, Roy 2005, Iwahashi 2007, Kollar+ 2010, Yu+ 2013]
* Estimation by NEDO, TSC Foresight Vol.8, 2015

Key Question:
How can we make robot intelligence scalable and multimodal?
Major speech recognition engines are trained with large-scale corpora (>1000 hours ≒ 100M utterances) and continuously improved as cloud services
Figure: RoboCup@Home, target user scenario with service robots
Figure: XIMERA 3 (by NICT), voice talent, cf. [Sugiura+ ICRA2014]
Can we make such innovations in robotics?
• Training with large-scale datasets and continuous improvements in e.g. dialogues, object recognition, grasping, simulation, …

(1) Rospeex:
We built a cloud robotics platform for multilingual dialogues*
• 30,000 unique users since Sep. 2013
• Non-monologue speech synthesis designed for robots [Sugiura+ 2014]
• Word Error Rate = 7.9% for IWSLT tst2011 (1st-place winners in IWSLT12, 13, 14)
• Python & C++ samples are available (search for "rospeex")
* Research/development-use only
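
The word error rate quoted above is conventionally the word-level edit distance between a reference transcript and the recognizer output, divided by the number of reference words. The sketch below is only an illustration of that general definition; it is not Rospeex code and not the IWSLT scoring tool, and the function name `word_error_rate` and the whitespace tokenization are assumptions made for this example.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return (substitutions + deletions + insertions) / number of reference words.

    Assumption for this sketch: words are separated by whitespace and
    compared case-sensitively, with no text normalization.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the reference words processed
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One deleted word ("any") and one substitution ("fridge" -> "bridge"):
    # 2 errors over 7 reference words.
    print(word_error_rate("is there any milk in the fridge",
                          "is there milk in the bridge"))
```

Because the denominator is the reference length, a hypothesis with many insertions can score above 100%.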

Rospeex’s positioning in robot dialogue quadrants
Axes: cloud-based vs. stand-alone; robot middleware-compatible vs. incompatible; free software vs. commercial software
• Cloud APIs (Google, Microsoft, Nuance, NTT docomo, Wit.ai, …)
• OpenHRI, PocketSphinx, Festival
Issues marked in the figure: does not work with very low-spec PCs; robotics-specific logs are lost; authentication; low quality; expensive
Figure: IP distributions of rospeex users
Rospeex has been applied to:
• Humanoids, web agents, conversational robots for elderly people, automotive navigation systems, smart-home interfaces

(2) Building domestic service robots
(1st places in 2008 & 2010, 2nd places in 2009 & 2012)
• RoboCup@Home: The largest competition for domestic service robots
– Focuses on human-robot interaction and mobile manipulation
• Challenges
– Navigation in unknown environments (e.g. a real shop), handling everyday objects, spoken dialogues in very noisy environments (70 dBA), …
• cf. Social impacts of other RoboCup leagues
– RoboCupRescue: Fukushima power plant investigation
– Aldebaran sold >1000 NAO robots and was bought by Softbank @US$ 100M, …
Credit: Channel 5

(3) Machine learning for environments:
Air pollution prediction can reduce social cost
• Loss by PM2.5 and air pollutants
– 3.3M premature deaths per year [Lelieveld, Nature, 2015]
• Prediction can prevent possible exposure, but prediction accuracy is quite low
– Standard approach gave only 42% accuracy in Fukuoka@2013-14*
• Applying the DPT-DRNN method to weather open-data outperforms a standard weather model-based approach
* Threat score = TP/(TP+FP+FN)
Premature deaths in London ≒ 2,800 @2010
[Ong & Sugiura, IEEE BigData 2014]
Photo: Hefei, China 2015 (not fog, not cloudy)
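
The threat score in the footnote above counts hits against all cases where an event was either predicted or observed, so correct "no event" predictions do not inflate it. The sketch below only illustrates the formula TP/(TP+FP+FN); the function name `threat_score`, the boolean encoding of a high-pollution event, and the sample data are hypothetical and are not taken from the DPT-DRNN work.

```python
from typing import Sequence


def threat_score(predicted: Sequence[bool], observed: Sequence[bool]) -> float:
    """Return TP / (TP + FP + FN) for paired event predictions and observations.

    Assumption for this sketch: each element is True when a high-pollution
    event is predicted (or observed) for that time slot.
    """
    if len(predicted) != len(observed):
        raise ValueError("predicted and observed must have the same length")

    tp = fp = fn = 0
    for p, o in zip(predicted, observed):
        if p and o:
            tp += 1   # event predicted and observed (hit)
        elif p and not o:
            fp += 1   # event predicted but not observed (false alarm)
        elif o and not p:
            fn += 1   # event observed but not predicted (miss)
        # True negatives are deliberately ignored by this metric.

    denominator = tp + fp + fn
    if denominator == 0:
        raise ValueError("threat score is undefined when no event is predicted or observed")
    return tp / denominator


if __name__ == "__main__":
    # Hypothetical data: 2 hits, 1 false alarm, 1 miss, 1 correct negative.
    predicted = [True, True, False, False, True]
    observed = [True, False, True, False, True]
    print(threat_score(predicted, observed))  # 2 / (2 + 1 + 1) = 0.5
```

Ignoring true negatives matters here because high-pollution events are rare, so plain accuracy would look high even for a predictor that never forecasts an event.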


Presenter: Komei Sugiura, Senior Researcher, National Institute of Information and Communications Technology; Trustee, RoboCup Federation

