SlideShare a Scribd company logo
Online Spatial Concept and
Lexical Acquisition with
Simultaneous Localization and
Mapping
IROS2017@Vancouver
1
Akira Taniguchi *, Yoshinobu Hagiwara *, Tadahiro Taniguchi *,
Tetsunari Inamura **
* Ritsumeikan University, Japan. (E-mail: a.taniguchi@em.ci.ritsumei.ac.jp)
** National Institute of Informatics / The Graduate University for Advanced
Studies, Japan.
Research background
Robots coexisting with humans and operating in various
environments are required to adaptively learn the
spatial concepts (place categories and a lexicon) while
incrementally generating an environmental map.
◦ Spatial concepts are such that their target domain may be
unclear and may differ according to the user and environment.
◦ Therefore, it is difficult to manually design spatial concepts in
advance, and it is desirable for robots to autonomously learn
spatial concepts based on their own experiences.
2
Which area is
the same place?
What scenery
can I see?
What is the name
of this place?
Spatial concept
Spatial concept based on multimodal information
◦ Word information (Place names)
◦ Place information (Position distribution)
◦ Image information (Visual features)
Meething room
Laboratory
Elevator hall
3
[Taniguchi 16] Taniguchi, A et.al. Spatial Concept Acquisition for a Mobile Robot that Integrates Self-Localization and
Unsupervised Word Discovery from Spoken Sentences, IEEE TCDS, Vol. 8, No. 4, pp. 285–297 (2016)
Previous method : SpCoA [Taniguchi 16]
Nonparametric Bayesian spatial concept acquisition method
The main features
• This model can learn unknown words from continuous speech signals.
• This model can learn an appropriate number of spatial concepts,
depending on the data. (using nonparametric Beysian approach)
• This model can learn many-to-many correspondences between names and
places by relating several places to several names via spatial concepts.
/sofamae//hoNdana//qgeNkaN/
/kiqchiN/
/daidokoro/ /terebimae//gomibakoo/
/tereburunoatari/
4
Learning result in Japanese
Previous method : SpCoA [Taniguchi 16]
Nonparametric Bayesian spatial concept acquisition method
◦ Batch learning
◦ The robot learns the spatial concepts after getting sufficient data while
moving the environment.
◦ Environmental map
◦ This method cannot learn spatial concepts from unknown
environments without a map.
◦ Over-segmentation problem
◦ It’s is caused by word segmentation of phoneme-recognition results
including errors.
[Taniguchi 16] Taniguchi, A et.al. Spatial Concept Acquisition for a Mobile Robot that Integrates Self-Localization and
Unsupervised Word Discovery from Spoken Sentences, IEEE TCDS, Vol. 8, No. 4, pp. 285–297 (2016)
This place is the
laboratory.
|dis|pu|rai|su|iz
a|ra|bora|to|ri|
???
5
Research purpose
Mobile robots learn spatial concepts, a lexicon, and an environmental
map incrementally from interaction with an environment and human,
even in an unknown environment without prior knowledge.
6
The proposed method : SpCoSLAM
This model integrates multimodal place categorization,
lexical acquisition and SLAM as one Bayesian generative model.
7Gray nodes indicate observation variables.
1tx

xt

xt1
tz
tu
tC
ti

l 

00 ,m
00 ,V
1tz
1tu
1tz
1tu

LMtyAM
tS

ltf 
∞
∞
m
lW
k
k
The proposed method : SpCoSLAM
This model integrates multimodal place categorization,
lexical acquisition and SLAM as one Bayesian generative model.
8Gray nodes indicate observation variables.
1tx

xt

xt1
tz
tu
tC
ti

l 

00 ,m
00 ,V
1tz
1tu
1tz
1tu

LMtyAM
tS

ltf 
∞
∞
m
lW
k
k
Simultaneous localization and mapping (SLAM)
Position
distribution
(Gaussian
distribution)
Nonparametric Bayesian
multimodal place categorization
Index of place
Image feature
Words
Lexical acquisition (Speech recognition and word segmentation)
FastSLAM and SpCoSLAM
Simultaneous Localization And Mapping (SLAM)
◦ FastSLAM has realized an on-line algorithm for efficient
self-localization and mapping using a Rao-Blackwellized
particle filter (RBPF) [Grisetti 05] .
Online learning algorithm of SpCoSLAM
◦ The online learning algorithm can be derived by
introducing sequential update equations for estimating
the parameters of the spatial concepts into the
formulation of FastSLAM based on RBPF.
9
[Grisetti 05] G. Grisetti, C. Stachniss, and W. Burgard, “Improving grid-based SLAM with Rao-Blackwellized
particle filters by adaptive proposals and selective resampling,” in Proceedings of ICRA, 2005.
FastSLAM and SpCoSLAM
FastSLAM
SpCoSLAM
Self-
position map
Control
data
Sensor
data
Latent variables
Model parameters
Hyperprameters
Language model
Acoustic model
Speech signal
Image feature
LM is updated. Model parameters
are updated.
10
Rao-Blackwellized
particle filter (RBPF)
Online learning algorithm
of SpCoSLAM
2. Calculating the proposal
distribution of FastSLAM 2.0
3. Word segmentation, sampling
latent variables, and calculating
weights
6. Updating a language model
7. Resampling of particles
4. mapping
5. Estimation of parameters of
spatial concepts
1. Speech recognition
11
Experiment I : Online learning
We performed experiments for online learning of spatial
concepts in a novel environment.
[1] latticelm: http://www.phontron.com/latticelm/
[2] The robotics data set repository (radish): http://radish.sourceforge.net/
Conditions
12
Middleware Robot Operating System (ROS) indigo
Speech recognition
system
Julius dictation-kit-v4.3.1-linux (GMM-HMM decoding),
Japanese syllable dictionary
Word segmentation
system
latticelm [1]
(WFST-based word segmentation system)
Image feature extractor Caffe (CNN model of Places205-AlexNet)
Dataset Robotics Data Set Repository (Radish) [2]
albert-b-laser-vision by Cyrill Stachniss
• Rosbag file (odometry, depth, image data)
Speech data 50 sentences including 10 types of various phrases
WFST:Weighted Finite-State Transducer
Experiment I : Online learning
13
Video: https://youtu.be/hVKQCdbRQVM
Experiment I : Online learning
1
2
3 4
5
6
Step 15
1
23
4
5
6
7
8
Step 30
1
23
4
5
6
78
9
10
Step 50
Position distribution: 6 Position distribution: 1 Position distribution: 8
14
Correct: /ikidomari/
(The end of corridor)
Estimated word:
/ikidomaekidayao/
Estimated word:
/kyooyuusehi/
Correct: /kyouyuuseki/
(Sharing desk)
Estimated words:
/upuriNpabeyatarero/
/izaridokourodayo/
Correct1: /puriNtaabeya/
Correct2: /daidokoro/
(Printer room, kitchen)
Words are estimated by
Experiment I : Online learning
We compare the performance of four methods as
follows:
(A) SpCoSLAM (The proposed method)
(B) Online SpCoA based on RBPF
(C) Online SpCoA
(D) SpCoA (Batch learning) [Taniguchi 16]
[Taniguchi 16] Taniguchi, A et.al. Spatial Concept Acquisition for a Mobile Robot that Integrates
Self-Localization and Unsupervised Word Discovery from Spoken Sentences, IEEE Transactions on Cognitive
and Developmental Systems, Vol. 8, No. 4, pp. 285–297 (2016)
The number of
particles:30
15
Methods (B), (C), and (D) based on SpCoA did not perform
the update of a language model and did not use image
features.
Experiment I : Online learning
16
We compare the performance of SpCoSLAM and SpCoA-based methods.
1tx

xt

xt1
tz
tu
tC
ti

l 

00 ,m
00 ,V
1tz
1tu
1tz
1tu

LMtyAM
tS

ltf 
∞
∞
m
lW
k
k
1tx

xt

xt1
tz
tu
tC
ti

l 

00,m
00 ,V
1tz
1tu
1tz
1tu

tS
∞
∞
m
lW
k
k
SpCoA-based methodSpCoSLAM
SpCoA did not perform the update of a language model and did not use image features.
Evaluation I :
The estimated number of spatial concepts
Figures show the number of spatial concepts and the number
of position distributions by online learning.
True data was determined by an user based on teaching data.
SpCoSLAM was closer to the true data than other methods.
17
Evaluation II :
Word segmentation in the lexical acquisition
Figure shows the number of segmented words.
SpCoSLAM improved the over-segmentation problem by
updating the language model sequentially.
SpCoSLAM was
closer to the phrase
segmentation.
Over-segmentation
Morpheme: The morphological segmentation (using MeCab)
Phrase: The phrase segmentation (segmenting words only before and after
the name of the place.)
18
Evaluation II :
Word segmentation in the lexical acquisition
SpCoSLAM
SpCoA 19
SpCoSLAM
SpCoA
SpCoSLAM
SpCoA
(in English)
(in Japanese)
Experiment II :
Place recognition using a speech signal
When the user says “Go to **.”, the estimation of a target
position was calculated as follows:
SpCoSLAM showed the
highest overall evaluation
values of the online methods.
We calculated the place recognition rate (PRR) that the rate
of positions estimated within the correct area in the test data.
SpCoA (0.5)
20
Conclusion
 We proposed an online learning method of spatial
concepts and an environmental map by a mobile
robot.
 The proposed method integrated the spatial concept
acquisition into SLAM by an RBPF-based approach.
 In the experiments, we conducted online learning in
a novel environment by the robot without a pre-
existing lexicon and map.
 SpCoSLAM improved the performance of place recognition
using a speech signal in online learning methods.
 SpCoSLAM improved over-segmentation problem in lexical
acquisition by updating the language model sequentially.
21THANK YOU FOR YOUR KIND ATTENTION.

More Related Content

What's hot

A survey on moving object tracking in video
A survey on moving object tracking in videoA survey on moving object tracking in video
A survey on moving object tracking in video
ijitjournal
 
Dr.Kawewong Ph.D Thesis
Dr.Kawewong Ph.D ThesisDr.Kawewong Ph.D Thesis
Dr.Kawewong Ph.D Thesis
SOINN Inc.
 
Object detection
Object detectionObject detection
Object detection
Jksuryawanshi
 
פוסטר דר פרידמן
פוסטר דר פרידמןפוסטר דר פרידמן
Robotics: Modelling, Planning and Control
Robotics: Modelling, Planning and ControlRobotics: Modelling, Planning and Control
Robotics: Modelling, Planning and Control
Cody Ray
 
Image Translation with GAN
Image Translation with GANImage Translation with GAN
Image Translation with GAN
Junho Cho
 
mihara_iccp16_presentation
mihara_iccp16_presentationmihara_iccp16_presentation
mihara_iccp16_presentation
Hajime Mihara
 

What's hot (7)

A survey on moving object tracking in video
A survey on moving object tracking in videoA survey on moving object tracking in video
A survey on moving object tracking in video
 
Dr.Kawewong Ph.D Thesis
Dr.Kawewong Ph.D ThesisDr.Kawewong Ph.D Thesis
Dr.Kawewong Ph.D Thesis
 
Object detection
Object detectionObject detection
Object detection
 
פוסטר דר פרידמן
פוסטר דר פרידמןפוסטר דר פרידמן
פוסטר דר פרידמן
 
Robotics: Modelling, Planning and Control
Robotics: Modelling, Planning and ControlRobotics: Modelling, Planning and Control
Robotics: Modelling, Planning and Control
 
Image Translation with GAN
Image Translation with GANImage Translation with GAN
Image Translation with GAN
 
mihara_iccp16_presentation
mihara_iccp16_presentationmihara_iccp16_presentation
mihara_iccp16_presentation
 

Similar to [IROS2017] Online Spatial Concept and Lexical Acquisition with Simultaneous Localization and Mapping

Tool Use Learning for a Real Robot
Tool Use Learning for a Real RobotTool Use Learning for a Real Robot
Tool Use Learning for a Real Robot
IJECEIAES
 
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
kevig
 
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
ijnlc
 
SENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORK
SENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORKSENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORK
SENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORK
ijnlc
 
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Network
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural NetworkSentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Network
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Network
kevig
 
liu2019.pdf
liu2019.pdfliu2019.pdf
liu2019.pdf
suhasappu4
 
PROLOGUSED TO REPRESENT AND REASON QUALITATIVELYOVER A SPACE DOMAIN
PROLOGUSED TO REPRESENT AND REASON QUALITATIVELYOVER A SPACE DOMAINPROLOGUSED TO REPRESENT AND REASON QUALITATIVELYOVER A SPACE DOMAIN
PROLOGUSED TO REPRESENT AND REASON QUALITATIVELYOVER A SPACE DOMAIN
ijaia
 
IRJET - Automatic Lip Reading: Classification of Words and Phrases using Conv...
IRJET - Automatic Lip Reading: Classification of Words and Phrases using Conv...IRJET - Automatic Lip Reading: Classification of Words and Phrases using Conv...
IRJET - Automatic Lip Reading: Classification of Words and Phrases using Conv...
IRJET Journal
 
Advanced Robotics Projects For Undergraduate Students
Advanced Robotics Projects For Undergraduate StudentsAdvanced Robotics Projects For Undergraduate Students
Advanced Robotics Projects For Undergraduate Students
Emily Smith
 
[IJET-V1I5P9] Author: Prutha Gandhi, Dhanashri Dalvi, Pallavi Gaikwad, Shubha...
[IJET-V1I5P9] Author: Prutha Gandhi, Dhanashri Dalvi, Pallavi Gaikwad, Shubha...[IJET-V1I5P9] Author: Prutha Gandhi, Dhanashri Dalvi, Pallavi Gaikwad, Shubha...
[IJET-V1I5P9] Author: Prutha Gandhi, Dhanashri Dalvi, Pallavi Gaikwad, Shubha...
IJET - International Journal of Engineering and Techniques
 
IRJET- ASL Language Translation using ML
IRJET- ASL Language Translation using MLIRJET- ASL Language Translation using ML
IRJET- ASL Language Translation using ML
IRJET Journal
 
Language-Based Actions for Self-Driving Robot
 	  Language-Based Actions for Self-Driving Robot 	  Language-Based Actions for Self-Driving Robot
Language-Based Actions for Self-Driving Robot
IRJET Journal
 
INTELLIGENT QUERY PROCESSING IN MALAYALAM
INTELLIGENT QUERY PROCESSING IN MALAYALAMINTELLIGENT QUERY PROCESSING IN MALAYALAM
INTELLIGENT QUERY PROCESSING IN MALAYALAM
ijcsa
 
Arabic named entity recognition using deep learning approach
Arabic named entity recognition using deep learning approachArabic named entity recognition using deep learning approach
Arabic named entity recognition using deep learning approach
IJECEIAES
 
Feature Extraction and Analysis of Natural Language Processing for Deep Learn...
Feature Extraction and Analysis of Natural Language Processing for Deep Learn...Feature Extraction and Analysis of Natural Language Processing for Deep Learn...
Feature Extraction and Analysis of Natural Language Processing for Deep Learn...
Sharmila Sathish
 
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUECOMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
Journal For Research
 
French machine reading for question answering
French machine reading for question answeringFrench machine reading for question answering
French machine reading for question answering
Ali Kabbadj
 
final-day1-july2.pptx
final-day1-july2.pptxfinal-day1-july2.pptx
final-day1-july2.pptx
KartikGulati16
 
Anatomical Survey Based Feature Vector for Text Pattern Detection
Anatomical Survey Based Feature Vector for Text Pattern DetectionAnatomical Survey Based Feature Vector for Text Pattern Detection
Anatomical Survey Based Feature Vector for Text Pattern Detection
IJEACS
 
Self Automated Rovers
Self Automated RoversSelf Automated Rovers
Self Automated Rovers
RutikBhoyar
 

Similar to [IROS2017] Online Spatial Concept and Lexical Acquisition with Simultaneous Localization and Mapping (20)

Tool Use Learning for a Real Robot
Tool Use Learning for a Real RobotTool Use Learning for a Real Robot
Tool Use Learning for a Real Robot
 
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
 
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
 
SENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORK
SENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORKSENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORK
SENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORK
 
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Network
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural NetworkSentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Network
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Network
 
liu2019.pdf
liu2019.pdfliu2019.pdf
liu2019.pdf
 
PROLOGUSED TO REPRESENT AND REASON QUALITATIVELYOVER A SPACE DOMAIN
PROLOGUSED TO REPRESENT AND REASON QUALITATIVELYOVER A SPACE DOMAINPROLOGUSED TO REPRESENT AND REASON QUALITATIVELYOVER A SPACE DOMAIN
PROLOGUSED TO REPRESENT AND REASON QUALITATIVELYOVER A SPACE DOMAIN
 
IRJET - Automatic Lip Reading: Classification of Words and Phrases using Conv...
IRJET - Automatic Lip Reading: Classification of Words and Phrases using Conv...IRJET - Automatic Lip Reading: Classification of Words and Phrases using Conv...
IRJET - Automatic Lip Reading: Classification of Words and Phrases using Conv...
 
Advanced Robotics Projects For Undergraduate Students
Advanced Robotics Projects For Undergraduate StudentsAdvanced Robotics Projects For Undergraduate Students
Advanced Robotics Projects For Undergraduate Students
 
[IJET-V1I5P9] Author: Prutha Gandhi, Dhanashri Dalvi, Pallavi Gaikwad, Shubha...
[IJET-V1I5P9] Author: Prutha Gandhi, Dhanashri Dalvi, Pallavi Gaikwad, Shubha...[IJET-V1I5P9] Author: Prutha Gandhi, Dhanashri Dalvi, Pallavi Gaikwad, Shubha...
[IJET-V1I5P9] Author: Prutha Gandhi, Dhanashri Dalvi, Pallavi Gaikwad, Shubha...
 
IRJET- ASL Language Translation using ML
IRJET- ASL Language Translation using MLIRJET- ASL Language Translation using ML
IRJET- ASL Language Translation using ML
 
Language-Based Actions for Self-Driving Robot
 	  Language-Based Actions for Self-Driving Robot 	  Language-Based Actions for Self-Driving Robot
Language-Based Actions for Self-Driving Robot
 
INTELLIGENT QUERY PROCESSING IN MALAYALAM
INTELLIGENT QUERY PROCESSING IN MALAYALAMINTELLIGENT QUERY PROCESSING IN MALAYALAM
INTELLIGENT QUERY PROCESSING IN MALAYALAM
 
Arabic named entity recognition using deep learning approach
Arabic named entity recognition using deep learning approachArabic named entity recognition using deep learning approach
Arabic named entity recognition using deep learning approach
 
Feature Extraction and Analysis of Natural Language Processing for Deep Learn...
Feature Extraction and Analysis of Natural Language Processing for Deep Learn...Feature Extraction and Analysis of Natural Language Processing for Deep Learn...
Feature Extraction and Analysis of Natural Language Processing for Deep Learn...
 
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUECOMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
 
French machine reading for question answering
French machine reading for question answeringFrench machine reading for question answering
French machine reading for question answering
 
final-day1-july2.pptx
final-day1-july2.pptxfinal-day1-july2.pptx
final-day1-july2.pptx
 
Anatomical Survey Based Feature Vector for Text Pattern Detection
Anatomical Survey Based Feature Vector for Text Pattern DetectionAnatomical Survey Based Feature Vector for Text Pattern Detection
Anatomical Survey Based Feature Vector for Text Pattern Detection
 
Self Automated Rovers
Self Automated RoversSelf Automated Rovers
Self Automated Rovers
 

More from Akira Taniguchi

第8回Language and Robotics研究会20221010_AkiraTaniguchi
第8回Language and Robotics研究会20221010_AkiraTaniguchi第8回Language and Robotics研究会20221010_AkiraTaniguchi
第8回Language and Robotics研究会20221010_AkiraTaniguchi
Akira Taniguchi
 
論文紹介 LexToMap: lexical-based topological mapping
論文紹介 LexToMap: lexical-based topological mapping論文紹介 LexToMap: lexical-based topological mapping
論文紹介 LexToMap: lexical-based topological mapping
Akira Taniguchi
 
論文紹介 Semantic Mapping for Mobile Robotics Tasks: A Survey
論文紹介 Semantic Mapping for Mobile Robotics Tasks: A Survey論文紹介 Semantic Mapping for Mobile Robotics Tasks: A Survey
論文紹介 Semantic Mapping for Mobile Robotics Tasks: A Survey
Akira Taniguchi
 
論文紹介 Grounded Spatial Symbols for Task Planning Based on Experience
論文紹介 Grounded Spatial Symbols for Task Planning Based on Experience論文紹介 Grounded Spatial Symbols for Task Planning Based on Experience
論文紹介 Grounded Spatial Symbols for Task Planning Based on Experience
Akira Taniguchi
 
論文紹介  Communication between Lingodroids with different cognitive capabilities
論文紹介  Communication between Lingodroids with different cognitive capabilities論文紹介  Communication between Lingodroids with different cognitive capabilities
論文紹介  Communication between Lingodroids with different cognitive capabilities
Akira Taniguchi
 
Multiple Categorization by iCub: Learning Relationships between Multiple Moda...
Multiple Categorization by iCub: Learning Relationships between Multiple Moda...Multiple Categorization by iCub: Learning Relationships between Multiple Moda...
Multiple Categorization by iCub: Learning Relationships between Multiple Moda...
Akira Taniguchi
 
Simultaneous Estimation of Self-position and Word from Noisy Utterances and S...
Simultaneous Estimation of Self-position and Word from Noisy Utterances and S...Simultaneous Estimation of Self-position and Word from Noisy Utterances and S...
Simultaneous Estimation of Self-position and Word from Noisy Utterances and S...
Akira Taniguchi
 
Simultaneous Localization, Mapping and Self-body Shape Estimation by a Mobile...
Simultaneous Localization, Mapping and Self-body Shape Estimation by a Mobile...Simultaneous Localization, Mapping and Self-body Shape Estimation by a Mobile...
Simultaneous Localization, Mapping and Self-body Shape Estimation by a Mobile...
Akira Taniguchi
 
論文紹介 A Bayesian framework for word segmentation: Exploring the effects of con...
論文紹介 A Bayesian framework for word segmentation: Exploring the effects of con...論文紹介 A Bayesian framework for word segmentation: Exploring the effects of con...
論文紹介 A Bayesian framework for word segmentation: Exploring the effects of con...
Akira Taniguchi
 

More from Akira Taniguchi (9)

第8回Language and Robotics研究会20221010_AkiraTaniguchi
第8回Language and Robotics研究会20221010_AkiraTaniguchi第8回Language and Robotics研究会20221010_AkiraTaniguchi
第8回Language and Robotics研究会20221010_AkiraTaniguchi
 
論文紹介 LexToMap: lexical-based topological mapping
論文紹介 LexToMap: lexical-based topological mapping論文紹介 LexToMap: lexical-based topological mapping
論文紹介 LexToMap: lexical-based topological mapping
 
論文紹介 Semantic Mapping for Mobile Robotics Tasks: A Survey
論文紹介 Semantic Mapping for Mobile Robotics Tasks: A Survey論文紹介 Semantic Mapping for Mobile Robotics Tasks: A Survey
論文紹介 Semantic Mapping for Mobile Robotics Tasks: A Survey
 
論文紹介 Grounded Spatial Symbols for Task Planning Based on Experience
論文紹介 Grounded Spatial Symbols for Task Planning Based on Experience論文紹介 Grounded Spatial Symbols for Task Planning Based on Experience
論文紹介 Grounded Spatial Symbols for Task Planning Based on Experience
 
論文紹介  Communication between Lingodroids with different cognitive capabilities
論文紹介  Communication between Lingodroids with different cognitive capabilities論文紹介  Communication between Lingodroids with different cognitive capabilities
論文紹介  Communication between Lingodroids with different cognitive capabilities
 
Multiple Categorization by iCub: Learning Relationships between Multiple Moda...
Multiple Categorization by iCub: Learning Relationships between Multiple Moda...Multiple Categorization by iCub: Learning Relationships between Multiple Moda...
Multiple Categorization by iCub: Learning Relationships between Multiple Moda...
 
Simultaneous Estimation of Self-position and Word from Noisy Utterances and S...
Simultaneous Estimation of Self-position and Word from Noisy Utterances and S...Simultaneous Estimation of Self-position and Word from Noisy Utterances and S...
Simultaneous Estimation of Self-position and Word from Noisy Utterances and S...
 
Simultaneous Localization, Mapping and Self-body Shape Estimation by a Mobile...
Simultaneous Localization, Mapping and Self-body Shape Estimation by a Mobile...Simultaneous Localization, Mapping and Self-body Shape Estimation by a Mobile...
Simultaneous Localization, Mapping and Self-body Shape Estimation by a Mobile...
 
論文紹介 A Bayesian framework for word segmentation: Exploring the effects of con...
論文紹介 A Bayesian framework for word segmentation: Exploring the effects of con...論文紹介 A Bayesian framework for word segmentation: Exploring the effects of con...
論文紹介 A Bayesian framework for word segmentation: Exploring the effects of con...
 

Recently uploaded

Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStrDeep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
saastr
 
Azure API Management to expose backend services securely
Azure API Management to expose backend services securelyAzure API Management to expose backend services securely
Azure API Management to expose backend services securely
Dinusha Kumarasiri
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
SitimaJohn
 
Operating System Used by Users in day-to-day life.pptx
Operating System Used by Users in day-to-day life.pptxOperating System Used by Users in day-to-day life.pptx
Operating System Used by Users in day-to-day life.pptx
Pravash Chandra Das
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - HiikeSystem Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
Hiike
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
Finale of the Year: Apply for Next One!
Finale of the Year: Apply for Next One!Finale of the Year: Apply for Next One!
Finale of the Year: Apply for Next One!
GDSC PJATK
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
fredae14
 
A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024
Intelisync
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
saastr
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
MichaelKnudsen27
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
Wouter Lemaire
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 

Recently uploaded (20)

Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStrDeep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
 
Azure API Management to expose backend services securely
Azure API Management to expose backend services securelyAzure API Management to expose backend services securely
Azure API Management to expose backend services securely
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
 
Operating System Used by Users in day-to-day life.pptx
Operating System Used by Users in day-to-day life.pptxOperating System Used by Users in day-to-day life.pptx
Operating System Used by Users in day-to-day life.pptx
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - HiikeSystem Design Case Study: Building a Scalable E-Commerce Platform - Hiike
System Design Case Study: Building a Scalable E-Commerce Platform - Hiike
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
Finale of the Year: Apply for Next One!
Finale of the Year: Apply for Next One!Finale of the Year: Apply for Next One!
Finale of the Year: Apply for Next One!
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
 
A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024A Comprehensive Guide to DeFi Development Services in 2024
A Comprehensive Guide to DeFi Development Services in 2024
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 

[IROS2017] Online Spatial Concept and Lexical Acquisition with Simultaneous Localization and Mapping

  • 1. Online Spatial Concept and Lexical Acquisition with Simultaneous Localization and Mapping IROS2017@Vancouver 1 Akira Taniguchi *, Yoshinobu Hagiwara *, Tadahiro Taniguchi *, Tetsunari Inamura ** * Ritsumeikan University, Japan. (E-mail: a.taniguchi@em.ci.ritsumei.ac.jp) ** National Institute of Informatics / The Graduate University for Advanced Studies, Japan.
  • 2. Research background Robots coexisting with humans and operating in various environments are required to adaptively learn the spatial concepts (place categories and a lexicon) while incrementally generating an environmental map. ◦ Spatial concepts are such that their target domain may be unclear and may differ according to the user and environment. ◦ Therefore, it is difficult to manually design spatial concepts in advance, and it is desirable for robots to autonomously learn spatial concepts based on their own experiences. 2 Which area is the same place? What scenery can I see? What is the name of this place?
  • 3. Spatial concept Spatial concept based on multimodal information ◦ Word information (Place names) ◦ Place information (Position distribution) ◦ Image information (Visual features) Meething room Laboratory Elevator hall 3
  • 4. [Taniguchi 16] Taniguchi, A et.al. Spatial Concept Acquisition for a Mobile Robot that Integrates Self-Localization and Unsupervised Word Discovery from Spoken Sentences, IEEE TCDS, Vol. 8, No. 4, pp. 285–297 (2016) Previous method : SpCoA [Taniguchi 16] Nonparametric Bayesian spatial concept acquisition method The main features • This model can learn unknown words from continuous speech signals. • This model can learn an appropriate number of spatial concepts, depending on the data. (using nonparametric Beysian approach) • This model can learn many-to-many correspondences between names and places by relating several places to several names via spatial concepts. /sofamae//hoNdana//qgeNkaN/ /kiqchiN/ /daidokoro/ /terebimae//gomibakoo/ /tereburunoatari/ 4 Learning result in Japanese
  • 5. Previous method : SpCoA [Taniguchi 16] Nonparametric Bayesian spatial concept acquisition method ◦ Batch learning ◦ The robot learns the spatial concepts after getting sufficient data while moving the environment. ◦ Environmental map ◦ This method cannot learn spatial concepts from unknown environments without a map. ◦ Over-segmentation problem ◦ It’s is caused by word segmentation of phoneme-recognition results including errors. [Taniguchi 16] Taniguchi, A et.al. Spatial Concept Acquisition for a Mobile Robot that Integrates Self-Localization and Unsupervised Word Discovery from Spoken Sentences, IEEE TCDS, Vol. 8, No. 4, pp. 285–297 (2016) This place is the laboratory. |dis|pu|rai|su|iz a|ra|bora|to|ri| ??? 5
  • 6. Research purpose Mobile robots learn spatial concepts, a lexicon, and an environmental map incrementally from interaction with an environment and human, even in an unknown environment without prior knowledge. 6
  • 7. The proposed method : SpCoSLAM This model integrates multimodal place categorization, lexical acquisition and SLAM as one Bayesian generative model. 7Gray nodes indicate observation variables. 1tx  xt  xt1 tz tu tC ti  l   00 ,m 00 ,V 1tz 1tu 1tz 1tu  LMtyAM tS  ltf  ∞ ∞ m lW k k
  • 8. The proposed method : SpCoSLAM This model integrates multimodal place categorization, lexical acquisition and SLAM as one Bayesian generative model. 8Gray nodes indicate observation variables. 1tx  xt  xt1 tz tu tC ti  l   00 ,m 00 ,V 1tz 1tu 1tz 1tu  LMtyAM tS  ltf  ∞ ∞ m lW k k Simultaneous localization and mapping (SLAM) Position distribution (Gaussian distribution) Nonparametric Bayesian multimodal place categorization Index of place Image feature Words Lexical acquisition (Speech recognition and word segmentation)
  • 9. FastSLAM and SpCoSLAM Simultaneous Localization And Mapping (SLAM) ◦ FastSLAM has realized an on-line algorithm for efficient self-localization and mapping using a Rao-Blackwellized particle filter (RBPF) [Grisetti 05] . Online learning algorithm of SpCoSLAM ◦ The online learning algorithm can be derived by introducing sequential update equations for estimating the parameters of the spatial concepts into the formulation of FastSLAM based on RBPF. 9 [Grisetti 05] G. Grisetti, C. Stachniss, and W. Burgard, “Improving grid-based SLAM with Rao-Blackwellized particle filters by adaptive proposals and selective resampling,” in Proceedings of ICRA, 2005.
  • 10. FastSLAM and SpCoSLAM FastSLAM SpCoSLAM Self- position map Control data Sensor data Latent variables Model parameters Hyperprameters Language model Acoustic model Speech signal Image feature LM is updated. Model parameters are updated. 10 Rao-Blackwellized particle filter (RBPF)
  • 11. Online learning algorithm of SpCoSLAM 2. Calculating the proposal distribution of FastSLAM 2.0 3. Word segmentation, sampling latent variables, and calculating weights 6. Updating a language model 7. Resampling of particles 4. mapping 5. Estimation of parameters of spatial concepts 1. Speech recognition 11
  • 12. Experiment I : Online learning We performed experiments for online learning of spatial concepts in a novel environment. [1] latticelm: http://www.phontron.com/latticelm/ [2] The robotics data set repository (radish): http://radish.sourceforge.net/ Conditions 12 Middleware Robot Operating System (ROS) indigo Speech recognition system Julius dictation-kit-v4.3.1-linux (GMM-HMM decoding), Japanese syllable dictionary Word segmentation system latticelm [1] (WFST-based word segmentation system) Image feature extractor Caffe (CNN model of Places205-AlexNet) Dataset Robotics Data Set Repository (Radish) [2] albert-b-laser-vision by Cyrill Stachniss • Rosbag file (odometry, depth, image data) Speech data 50 sentences including 10 types of various phrases WFST:Weighted Finite-State Transducer
  • 13. Experiment I : Online learning 13 Video: https://youtu.be/hVKQCdbRQVM
  • 14. Experiment I : Online learning 1 2 3 4 5 6 Step 15 1 23 4 5 6 7 8 Step 30 1 23 4 5 6 78 9 10 Step 50 Position distribution: 6 Position distribution: 1 Position distribution: 8 14 Correct: /ikidomari/ (The end of corridor) Estimated word: /ikidomaekidayao/ Estimated word: /kyooyuusehi/ Correct: /kyouyuuseki/ (Sharing desk) Estimated words: /upuriNpabeyatarero/ /izaridokourodayo/ Correct1: /puriNtaabeya/ Correct2: /daidokoro/ (Printer room, kitchen) Words are estimated by
  • 15. Experiment I : Online learning We compare the performance of four methods as follows: (A) SpCoSLAM (The proposed method) (B) Online SpCoA based on RBPF (C) Online SpCoA (D) SpCoA (Batch learning) [Taniguchi 16] [Taniguchi 16] Taniguchi, A et.al. Spatial Concept Acquisition for a Mobile Robot that Integrates Self-Localization and Unsupervised Word Discovery from Spoken Sentences, IEEE Transactions on Cognitive and Developmental Systems, Vol. 8, No. 4, pp. 285–297 (2016) The number of particles:30 15 Methods (B), (C), and (D) based on SpCoA did not perform the update of a language model and did not use image features.
  • 16. Experiment I : Online learning 16 We compare the performance of SpCoSLAM and SpCoA-based methods. 1tx  xt  xt1 tz tu tC ti  l   00 ,m 00 ,V 1tz 1tu 1tz 1tu  LMtyAM tS  ltf  ∞ ∞ m lW k k 1tx  xt  xt1 tz tu tC ti  l   00,m 00 ,V 1tz 1tu 1tz 1tu  tS ∞ ∞ m lW k k SpCoA-based methodSpCoSLAM SpCoA did not perform the update of a language model and did not use image features.
  • 17. Evaluation I : The estimated number of spatial concepts Figures show the number of spatial concepts and the number of position distributions by online learning. True data was determined by an user based on teaching data. SpCoSLAM was closer to the true data than other methods. 17
  • 18. Evaluation II : Word segmentation in the lexical acquisition Figure shows the number of segmented words. SpCoSLAM improved the over-segmentation problem by updating the language model sequentially. SpCoSLAM was closer to the phrase segmentation. Over-segmentation Morpheme: The morphological segmentation (using MeCab) Phrase: The phrase segmentation (segmenting words only before and after the name of the place.) 18
  • 19. Evaluation II : Word segmentation in the lexical acquisition SpCoSLAM SpCoA 19 SpCoSLAM SpCoA SpCoSLAM SpCoA (in English) (in Japanese)
  • 20. Experiment II : Place recognition using a speech signal When the user says “Go to **.”, the estimation of a target position was calculated as follows: SpCoSLAM showed the highest overall evaluation values of the online methods. We calculated the place recognition rate (PRR) that the rate of positions estimated within the correct area in the test data. SpCoA (0.5) 20
  • 21. Conclusion  We proposed an online learning method of spatial concepts and an environmental map by a mobile robot.  The proposed method integrated the spatial concept acquisition into SLAM by an RBPF-based approach.  In the experiments, we conducted online learning in a novel environment by the robot without a pre- existing lexicon and map.  SpCoSLAM improved the performance of place recognition using a speech signal in online learning methods.  SpCoSLAM improved over-segmentation problem in lexical acquisition by updating the language model sequentially. 21THANK YOU FOR YOUR KIND ATTENTION.

Editor's Notes

  1. Each talk is 15 minutes: 12 minutes of presentation + 3 minutes of discussion. I’m Akira Taniguchi, in Ritsumeikan University, Japan. I’d like to present about our research, “Online Spatial Concept and Lexical Acquisition with Simultaneous Localization and Mapping”. Win+P
  2. First, research background. Robots coexisting with humans and operating in various environments are required to adaptively learn the spatial concepts (place categories and a lexicon) while incrementally generating an environmental map. However, spatial concepts are such that their target domain may be unclear and may differ according to the user and environment. Therefore, it is difficult to manually design spatial concepts in advance, and it is desirable for robots to autonomously learn spatial concepts based on their own experiences.
  3. We define spatial concept as the place category based on multimodal information. Spatial concept includes word, place, and image information, like this.
  4. Next, I will introduce our previous method Nonparametric Bayesian spatial concept acquisition method. As the main features, (読む) However, this method has some problems. As a problem, this method cannot learn spatial concepts from unknown environments without a map. The robot needs to have a map generated by SLAM.
  5. However, this method has some problems. First is batch learning. This method cannot learn spatial concepts from unknown environments without a map. The robot needs to have a map generated by SLAM. Second is Over-segmentation problem. It is caused by word segmentation of phoneme-recognition results including errors.
  6. Next is Research purpose. The goal of this study is to develop Mobile robots learn spatial concepts, a lexicon, and an environmental map incrementally from interaction with an environment and human, even in an unknown environment without prior knowledge.
  7. We propose an unsupervised Bayesian generative model and an online learning algorithm that can perform simultaneous learning of the spatial concepts and an environmental map from multimodal information. This model integrates multimodal place categorization, lexical acquisition and SLAM as one Bayesian generative model. This figure shows the graphical model representation of SpCoSLAM. Blue part is SLAM, red part is position distribution represented by Gaussian mixture, Green part is multimodal place categorization of place, image feature, and words. Orange part is lexical acquisition. speech recognition and word segmentation. --- SpCoSLAM Integrating SpCoA (place categorization and lexical acquisition) and SLAM (mapping) as one model Using scene-image features Updating the language model based on place information It can learn incremental spatial concepts for unknown environments and unsearched regions without maps. It can mutually complement the uncertainty of information by using multimodal information. SLAM、Multimodal LDA, GMM、speech recognition, word segmentationをone model で表現した。
  8. We propose an unsupervised Bayesian generative model and an online learning algorithm that can perform simultaneous learning of the spatial concepts and an environmental map from multimodal information. This model integrates multimodal place categorization, lexical acquisition and SLAM as one Bayesian generative model. This figure shows the graphical model representation of SpCoSLAM. Blue part is SLAM, red part is position distribution represented by Gaussian mixture, Green part is multimodal place categorization of place, image feature, and words. Orange part is lexical acquisition. speech recognition and word segmentation. --- SpCoSLAM Integrating SpCoA (place categorization and lexical acquisition) and SLAM (mapping) as one model Using scene-image features Updating the language model based on place information It can learn incremental spatial concepts for unknown environments and unsearched regions without maps. It can mutually complement the uncertainty of information by using multimodal information. SLAM、Multimodal LDA, GMM、speech recognition, word segmentationをone model で表現した。
  9. (次のスライドを映しながらこれを読むのもいいかも)
  10. The formulation of SLAM is the probability distribution of self-position x and map m given control data u and sensor data z. FastSLAM has realized an online algorithm for efficient self-localization and mapping using a Rao-Blackwellized particle filter (RBPF). And, this is the formulation of SpCoSLAM. The online learning algorithm introduced sequential update formulation for estimating the parameters of the spatial concepts into the formulation of FastSLAM based on RBPF. The joint posterior distribution can be factorized to the probability distributions of updating a language model, mapping, updating model parameters and the joint distribution of self-position and latent variables. This part is estimated by the particle filter.
  11. I introduce the overview of the online learning algorithm. First is speech recognition. Second is Calculating the proposal distribution of FastSLAM. 3rd is Word segmentation, sampling latent variables, and calculating weights. 4th is mapping. 5th is Estimation of parameters of spatial concepts. 6th is Updating a language model. 7th is Resampling of particles. Blue areas are same to FastSLAM. Orange areas are original part in this work. --- Please check proceedings for details of algorithm and formulation.
  12. In the Experiment I : Online learning, We performed experiments for online learning of spatial concepts in a novel environment. This table shows experimental conditions. (表を読む)
  13. This video is visualization of online learning. The lower right is the robot camera image. The black dot is the robot position. These circles represent the position distributions of the spatial concepts. This result shows the robot can learn the spatial concepts while mapping. --- maximum 1:15
  14. This is a summary of online learning results, Figure shows the position distributions in the map (at steps 15, 30, and 50). And, Bottom is examples of the estimated words on each position distribution. For example, in position distribution 1, the correct word is kyouyuuseki in Japanese. It’s sharing desk in English. The estimated word is kyooyuusehi. --- The upper part of this figure shows an example of the image corresponding to each position distribution, the correct phoneme sequence of the name of the place, and the best word of the probability value estimated by the probability distribution. at step t. As a result, figure shows how the spatial concepts are acquired while sequentially mapping.
  15. We compare the performance of four methods as follows: (A) SpCoSLAM (The proposed method) (B) Online SpCoA based on RBPF (C) Online SpCoA (D) SpCoA (Batch learning) Methods (B), (C), and (D) based on SpCoA did not perform the update of a language model and did not use image features.
  16. We compare the performance of SpCoSLAM and SpCoA-based methods. SpCoA did not perform the update of a language model and did not use image features.
  17. Evaluation I Figures show the average of the number of spatial concepts and the number of position distributions in 10 trials by online learning. True data was determined by a user based on teaching data. SpCoSLAM was closer to the true data than other methods.
  18. Evaluation II Figure shows the average value of the number of segmented words. SpCoA-based methods are over-segmentation. However, SpCoSLAM was closer to the phrase segmentation. Phrase segmentation is segmenting words only before and after the name of the place. --- The morphological segmentation (purple line) was suitably segmented into Japanese morphemes using MeCab, which is an off-the-shelf Japanese morphological analyzer that is widely used for natural language processing. The phrase segmentation (yellow line) was the number of words in the case of segmenting words only before and after the name of the place, i.e., we assume that a phrase other than the name of the place is one word.
  19. This table shows the example of word segmentation results. slash is a word segment point. SpCoSLAM improved the over-segmentation problem by updating the language model sequentially.
  20. Experiment II : Place recognition using a speech signal When the user says “Go to **.”, the estimation of a target position was calculated by this equation. We calculated the place recognition rate (PRR) that the rate of positions estimated within the correct area in the test data. SpCoSLAM showed the highest overall evaluation values of the online methods. --- The experimental results show that the robot was able to more accurately learn the relationships between words and the position in the map incrementally by using SpCoSLAM.
  21. (時間がない場合) This is conclusion. Thank you for king attention. (時間がある場合) This is conclusion. (読む) Thank you for king attention.