2. Tadahiro Taniguchi @tanichu
Associate professor, Emergent system laboratory
College of information science and technology,
Ritsumeikan University, Japan
2006, PhD Eng. , Kyoto university
2008‐ Assistant professor
2010‐ Associate professor
Machine
learning
Cognitive
robotics
Decentralized
autonomous
system
Human
communication
Symbol Emergence in Robotics
4. Symbol grounding problem
How can a robot “ground” his/her symbol to the real
physical world? [Harnad ‘90]
The robot has to give some meaning to a symbol in its
symbolic system designed by a human designer through
sensor‐motor interaction with its environment.
Arbitrary nature of symbol (in semiotics)
Arbitrariness of labeling/naming
Arbitrariness of categorization/segmentation
SGP implicitly assumes that human “arbitrarily
SGP is
missing
evolved” symbolic system is a “true” symbolic system.
Should a symbolic system for a robot be same as human’s ?
5. Understanding that symbolic system is an emergent
property of human cognitive and social system
“Worlds of robots”
“Worlds of animals” (Uexküll 1934)
Umwelt (self‐centered world)
Animals can receive information only from
their sensor‐motor system.
A human has to obtain various behaviors, concepts
and language on the basis of experiences in his/her
umwelt (closed cognitive system).
Human Symbolic system should be understand
on the basis of human embodiment (sensor‐motor
system).
Concepts have to be formed
on the basis of sensor‐motor information
in a bottom‐up way
Ernst Mach’s famous picture
"Early Scheme for a circular
Feedback Circle by Uxkull
13. Modular learning architecture
for segmenting sensor‐motor time series
MOSAIC [Kawato et al.]
Mixture of experts [Jacobs et al.]
HAMMER [Yiannis et al.]
Dual schemata model [Taniguchi et al.]
“Segment” corresponds to
a linear system
local information
a short‐term event
How can we grasp long‐term context?
http://www.cns.atr.jp/cnb/HarunoG/
harunoG.ja.html
T. Taniguchi, T. Sawaragi, "Incremental acquisition of multiple nonlinear forward models based on
differentiation process of schema model“, Neural Networks, Vol.21 (1), pp.13‐27 .(2008)
16. Double Articulation Analyzer
to estimate latent double articulation structure
Unsupervised learning
Estimating
NPYLM
[Moachihashi ‘09]
iHMM
[Fox ‘07]
Language model
Emission distribution
Segments and chunks
Conditions
Unknown number of
words and letters
Unknown emission
distributions
Inference
Approximate Inference
Procedure of
Double Articulation
Analyzer [Taniguchi ‘11]
Nonparametric Bayesian approach
Tadahiro Taniguchi, Shogo Nagasaka, Double Articulation Analyzer for Unsegmented Human Motion using Pitman‐
Yor Language model and Infinite Hidden Markov Model, 2011 IEEE/SICE SII.(2011)
17. iHMM
[Fox ‘07]
sticky HDP‐HMM [Fox ‘07]
An infinite HMM is an HMM
which can estimate the number of
hidden state flexibly (potentially
infinite)[Beal ‘02] [Teh ‘06]
Sticky HDP‐HMM is a generative
model for iHMM with a stickiness
parameter [Fox ‘07].
Fox et al. developed fast inference
algorithm with weak‐limit
approximation.
Gaussian emission distribution
γ
β
κ
α
πk
z1
λ
z2
z3
zT
y1
y2
y3
yT
θk
∞
E.B. Fox, E.B. Sudderth, M.I. Jordan, A.S. Willsky, "A Sticky HDP‐HMM with Application to Speaker
Diarization," Annals of Applied Statistics, June 2011. (First appeared as MIT LIDS Technical Report P‐2777, November
2007.)
18. NPYLM
[Moachihashi ‘09]
Unsupervised morphological analysis
Thisisanapple ‐> This is an apple
わたしはたなかです. ‐> わたし は たなか です
Morphological analysis
To segment sentences into
words(morpheme).
This usually requires dictionary
(preexisting knowledge of language
model).
Unsupervised morphological
analysis
It does not assume preexisting
dictionary.
Mochihashi proposed an
unsupervised morphological
analysis method based on Nested
Pitman‐Yor language model
(NPYLM)[Mochihashi ‘09].
http://chasen.org/~daiti‐m/paper/nl190segment‐slides.pdf
19. NPYLM [Mochihashi ‘09]
(Nested Pitman‐Yor Language Model)
Mochihashi developed NPYLM for unsupervised
morphological analysis.
NPYLM has word n‐gram model and letter ngram model,
hierarchically. Each adopts hierarchical Pitman‐Yor
language model as a language model (smoothing method).
Bayesian nonparametric model
Efficient blocked Gibbs sampler
Daichi Mochihashi, Takeshi Yamada, Naonori Ueda."Bayesian Unsupervised Word Segmentation
with Nested Pitman‐Yor Language Modeling". ACL‐IJCNLP 2009, pp.100‐108, 2009.
25. Conclusion
Summary
Defining the symbol emergence system
Introducing symbol emergence in robotics
Introducing double articulation analyzer
Current challenge
Unsupervised lexicon acquisition using double
articulation analyzer and multi‐modal categorization
Discussion topic
What is the important feature of language which we
have to model to obtain computational understanding
of human language.