Akira Taniguchi, Tadahiro Taniguchi, and Tetsunari Inamura, "Spatial Concept Acquisition for a Mobile Robot that Integrates Self-Localization and Unsupervised Word Discovery from Spoken Sentences", IEEE Transactions on Cognitive and Developmental Systems, Vol. 8, No. 4, pp285-297, 2016.
Simultaneous Estimation of Self-position and Word from Noisy Utterances and S...Akira Taniguchi
Akira Taniguchi, Tadahiro Taniguchi, and Tetsunari Inamura, "Simultaneous Estimation of Self-position and Word from Noisy Utterances and Sensory Information", 13th IFAC/IFIP/IFORS/IEA Symposium on Analysis, Design, and Evaluation of Human-Machine Systems (IFAC HMS2016), Aug, 2016. in Kyoto, Japan.
The document discusses strategies for mid-level robot control and learning. It proposes using a common language called teleo-reactive (T-R) programs to represent robot control programs that were explicitly programmed, planned, or learned. The document describes T-R programs and some preliminary experiments on learning T-R programs through reinforcement learning. It found that perceptual imperfections like noise and aliasing pose challenges for learning but that T-R programs can still be learned to achieve robot tasks. Further experimentation is needed to develop the approach.
[IROS2017] Online Spatial Concept and Lexical Acquisition with Simultaneous L...Akira Taniguchi
○Akira Taniguchi, Yoshinobu Hagiwara, Tadahiro Taniguchi, and Tetsunari Inamura, "Online Spatial Concept and Lexical Acquisition with Simultaneous Localization and Mapping", IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS2017), 2017.
Video: https://youtu.be/hVKQCdbRQVM
A robot may need to use a tool to solve a complex problem. Currently, tool use must be pre-programmed by a human. However, this is a difficult task and can be helped if the robot is able to learn how to use a tool by itself. Most of the work in tool use learning by a robot is done using a feature-based representation. Despite many successful results, this representation is limited in the types of tools and tasks that can be handled. Furthermore, the complex relationship between a tool and other world objects cannot be captured easily. Relational learning methods have been proposed to overcome these weaknesses [1, 2]. However, they have only been evaluated in a sensor-less simulation to avoid the complexities and uncertainties of the real world. We present a real world implementation of a relational tool use learning system for a robot. In our experiment, a robot requires around ten examples to learn to use a hook-like tool to pull a cube from a narrow tube.
Design of Mobile Robot Navigation system using SLAM and Adaptive Tracking Con...iosrjce
IOSR Journal of Computer Engineering (IOSR-JCE) is a double blind peer reviewed International Journal that provides rapid publication (within a month) of articles in all areas of computer engineering and its applications. The journal welcomes publications of high quality papers on theoretical developments and practical applications in computer technology. Original research papers, state-of-the-art reviews, and high quality technical notes are invited for publications.
The document provides an overview of robotics, including definitions of robots, a brief history of robots, common robot structures and applications, challenges in robot navigation, human-robot interaction, research areas in robotics like evolutionary robots, and emerging techniques like robotic augmentation of humans. It discusses what robots can and cannot easily do compared to humans and covers topics like kinematics, dynamics, control systems, sensors for navigation, and generations of robots per Professor Hans Moravec's predictions.
LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...IRJET Journal
This document presents a lip reading system that uses 3D convolutional neural networks for audio-video speech recognition. The system recognizes phrases and sentences from silent video input with or without audio. It extracts lip movements from video frames using face tracking and processes audio files separately. 3D CNNs are used to extract spatial and temporal features from the audio and video streams. The system is evaluated on the MIRACLE-VC1 dataset and achieves accuracy comparable to other recent work on visual speech recognition. Future applications of this lip reading system include improved hearing aids, voice generation for those unable to speak, and biometric authentication.
Symbol Emergence in Robotics: Language Acquisition via Real-world Sensorimoto...Tadahiro Taniguchi
This document summarizes a talk given by Tadahiro Taniguchi on symbol emergence in robotics through language acquisition via real-world sensorimotor information. The talk discusses several lexical acquisition tasks performed by robots, including direct phoneme and word discovery from speech signals, simultaneous acquisition of word units and multimodal categories, and online spatial concept acquisition. It outlines Taniguchi's research interests and background, and describes computational models and experiments for language acquisition through self-organizational learning based on real-world sensorimotor information.
Simultaneous Estimation of Self-position and Word from Noisy Utterances and S...Akira Taniguchi
Akira Taniguchi, Tadahiro Taniguchi, and Tetsunari Inamura, "Simultaneous Estimation of Self-position and Word from Noisy Utterances and Sensory Information", 13th IFAC/IFIP/IFORS/IEA Symposium on Analysis, Design, and Evaluation of Human-Machine Systems (IFAC HMS2016), Aug, 2016. in Kyoto, Japan.
The document discusses strategies for mid-level robot control and learning. It proposes using a common language called teleo-reactive (T-R) programs to represent robot control programs that were explicitly programmed, planned, or learned. The document describes T-R programs and some preliminary experiments on learning T-R programs through reinforcement learning. It found that perceptual imperfections like noise and aliasing pose challenges for learning but that T-R programs can still be learned to achieve robot tasks. Further experimentation is needed to develop the approach.
[IROS2017] Online Spatial Concept and Lexical Acquisition with Simultaneous L...Akira Taniguchi
○Akira Taniguchi, Yoshinobu Hagiwara, Tadahiro Taniguchi, and Tetsunari Inamura, "Online Spatial Concept and Lexical Acquisition with Simultaneous Localization and Mapping", IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS2017), 2017.
Video: https://youtu.be/hVKQCdbRQVM
A robot may need to use a tool to solve a complex problem. Currently, tool use must be pre-programmed by a human. However, this is a difficult task and can be helped if the robot is able to learn how to use a tool by itself. Most of the work in tool use learning by a robot is done using a feature-based representation. Despite many successful results, this representation is limited in the types of tools and tasks that can be handled. Furthermore, the complex relationship between a tool and other world objects cannot be captured easily. Relational learning methods have been proposed to overcome these weaknesses [1, 2]. However, they have only been evaluated in a sensor-less simulation to avoid the complexities and uncertainties of the real world. We present a real world implementation of a relational tool use learning system for a robot. In our experiment, a robot requires around ten examples to learn to use a hook-like tool to pull a cube from a narrow tube.
Design of Mobile Robot Navigation system using SLAM and Adaptive Tracking Con...iosrjce
IOSR Journal of Computer Engineering (IOSR-JCE) is a double blind peer reviewed International Journal that provides rapid publication (within a month) of articles in all areas of computer engineering and its applications. The journal welcomes publications of high quality papers on theoretical developments and practical applications in computer technology. Original research papers, state-of-the-art reviews, and high quality technical notes are invited for publications.
The document provides an overview of robotics, including definitions of robots, a brief history of robots, common robot structures and applications, challenges in robot navigation, human-robot interaction, research areas in robotics like evolutionary robots, and emerging techniques like robotic augmentation of humans. It discusses what robots can and cannot easily do compared to humans and covers topics like kinematics, dynamics, control systems, sensors for navigation, and generations of robots per Professor Hans Moravec's predictions.
LIP READING - AN EFFICIENT CROSS AUDIO-VIDEO RECOGNITION USING 3D CONVOLUTION...IRJET Journal
This document presents a lip reading system that uses 3D convolutional neural networks for audio-video speech recognition. The system recognizes phrases and sentences from silent video input with or without audio. It extracts lip movements from video frames using face tracking and processes audio files separately. 3D CNNs are used to extract spatial and temporal features from the audio and video streams. The system is evaluated on the MIRACLE-VC1 dataset and achieves accuracy comparable to other recent work on visual speech recognition. Future applications of this lip reading system include improved hearing aids, voice generation for those unable to speak, and biometric authentication.
Symbol Emergence in Robotics: Language Acquisition via Real-world Sensorimoto...Tadahiro Taniguchi
This document summarizes a talk given by Tadahiro Taniguchi on symbol emergence in robotics through language acquisition via real-world sensorimotor information. The talk discusses several lexical acquisition tasks performed by robots, including direct phoneme and word discovery from speech signals, simultaneous acquisition of word units and multimodal categories, and online spatial concept acquisition. It outlines Taniguchi's research interests and background, and describes computational models and experiments for language acquisition through self-organizational learning based on real-world sensorimotor information.
This document discusses communication techniques in swarm robotics. It summarizes three studies on this topic. The first two studies found that using chemical pheromones for communication between swarm robots resulted in better performance than not using pheromones. The third study used a heterogeneous swarm of footbots and eyebots that communicated via telecommunication; this approach worked well due to synchronization between the bot types. In conclusion, chemical communication works best due to the self-learning nature of the robots, but telecommunication can also be effective if different robot types are synchronized properly. Future work could explore new types of chemical trails that are not limited to land or use combinations of communication approaches.
Robotics involves the design, manufacture, and application of robots. It is an interdisciplinary field that includes aspects of electronics, mechanics, and software. Some key branches of robotics include android science, which studies human-robot interaction; artificial intelligence, which aims to create machine intelligence; and nanorobotics, which focuses on controlling nanoscale objects. Other branches discussed are robot surgery, laboratory robotics, robot locomotion, telepresence, swarm robotics, and speech processing.
Robot operating system based autonomous navigation platform with human robot ...TELKOMNIKA JOURNAL
In emerging technologies, indoor service robots are playing a vital role for people who are physically challenged and visually impaired. The service robots are efficient and beneficial for people to overcome the challenges faced during their regular chores. This paper proposes the implementation of autonomous navigation platforms with human-robot interaction which can be used in service robots to avoid the difficulties faced in daily activities. We used the robot operating system (ROS) framework for the implementation of algorithms used in auto navigation, speech processing and recognition, and object detection and recognition. A suitable robot model was designed and tested in the Gazebo environment to evaluate the algorithms. The confusion matrix that was created from 125 different cases points to the decent correctness of the model.
AN APPROACH OF IR-BASED SHORT-RANGE CORRESPONDENCE SYSTEMS FOR SWARM ROBOT BA...ijaia
This paper exhibits a short-run correspondence method appropriate for swarm versatile robots application.
Infrared is utilized for transmitting and accepting information and obstruction location. The infrared
correspondence code based swarm signaling is utilized for an independent versatile robot communication
system in this research. A code based signaling system is developed for transmitting information between
different entities of robot. The reflected infrared sign is additionally utilized for separation estimation for
obstruction evasion. Investigation of robot demonstrates the possibility of utilizing infrared signs to get a
solid nearby correspondence between swarm portable robots. This paper exhibits a basic decentralized
control for swarm of self-collecting robots. Every robot in the code based swarm signaling is completely
self-governing and controlled utilizing a conduct based methodology with just infrared-based nearby
detecting and correspondences. The viability of the methodology has been checked with simulation, for a
set of swarm robots.
Swarm intelligence is an artificial intelligence technique inspired by the collective behavior of decentralized and self-organized systems found in nature, such as ant colonies and bird flocks. Two common swarm intelligence algorithms are ant colony optimization and particle swarm optimization. Ant colony optimization is based on the behavior of real ant colonies and can be used to find approximate solutions to difficult optimization problems. Particle swarm optimization is a population-based stochastic optimization technique inspired by swarming behavior in nature, such as bird flocking. It searches for optimal solutions within a problem space by updating the movement of individual particles based on their own experiences and those of neighboring particles.
Language-Based Actions for Self-Driving RobotIRJET Journal
This document describes a framework for a self-driving robot to follow natural language commands. The framework uses sequence modeling to learn the meaning of sentences describing a path and identify relevant objects and prepositions. It then uses this information in the cognizance phase to generate a path and move the robot to accomplish the navigational goal described in the input sentence. The researchers created a virtual environment using Unity game engine to simulate the robot and collect training data on floor plans, sentences, and robot paths. They preprocessed this data and used hidden Markov models and a probabilistic graphical model to represent the temporal segments of sentences and learn the relationships between objects for sequence modeling.
In this report, one of the main applications of fuzzy logic is proposed i.e in robotic navigation.
Starting from scratch to building up the fuzzy logic and its validation using the MATLAB fuzzy logic toolbox , everything is covered in this report. If you find it helpful do like and share it with your friends. Fuzzy logic finds its application in AGVs and autonomous vehicles etc. Nowadays it is employed to find out the instantaneous power split ratio between the Engine and battery in the parallel hybrid EV.
This paper presents an expressive gesture model for generating communicative gestures with speech for the humanoid robot Nao. The model extends an existing virtual agent platform called GRETA to control gestures of both virtual and physical agents. Gestures are stored symbolically in a lexicon and selected based on intentions and emotions. Parameters of gestural expressivity like temporal extension and fluidity are applied when gestures are instantiated on the robot while considering its physical limitations.
EFFECTIVE REDIRECTING OF THE MOBILE ROBOT IN A MESSED ENVIRONMENT BASED ON TH...ijfls
The use of fuzzy logic in redirecting mobile robot is based on two sets of received information. First set is the instantaneous distance of the robot from the obstacle and second set is the instantaneous information of the robot's position. For this purpose, the fuzzy rules base consists of forty-two bases, which is extracted based on the robot's distance from obstacles, and the target position relative to the instantaneous orientation of the robot. In the structure of fuzzy systems, minimal inference engine are considered. Also, Extended Kalman filter is used for localization in a noisy environment. Accordingly, the inputs of the fuzzy systems are determined based on the estimation of the localization process, the information of the obstacles center and the target position. Also, the linear acceleration and instantaneous orientation of the mobile robot are determined by the desired fuzzy structures which are applied to its kinematic model.
EFFECTIVE REDIRECTING OF THE MOBILE ROBOT IN A MESSED ENVIRONMENT BASED ON TH...Wireilla
The use of fuzzy logic in redirecting mobile robot is based on two sets of received information. First set is
the instantaneous distance of the robot from the obstacle and second set is the instantaneous information of
the robot's position. For this purpose, the fuzzy rules base consists of forty-two bases, which is extracted
based on the robot's distance from obstacles, and the target position relative to the instantaneous
orientation of the robot. In the structure of fuzzy systems, minimal inference engine are considered. Also,
Extended Kalman filter is used for localization in a noisy environment. Accordingly, the inputs of the fuzzy
systems are determined based on the estimation of the localization process, the information of the obstacles
center and the target position. Also, the linear acceleration and instantaneous orientation of the mobile
robot are determined by the desired fuzzy structures which are applied to its kinematic model.
This document describes a design for mobile robot navigation using simultaneous localization and mapping (SLAM) and an adaptive tracking controller with particle swarm optimization in indoor environments. An adaptive fuzzy tracking controller is designed using 9 fuzzy rules to calculate a reference path for navigation between a starting and goal point. Particle swarm optimization is then used to optimize and reduce the time required for the calculated path. The controller is simulated in two indoor environments containing obstacles, and particle swarm optimization is shown to reduce navigation time compared to without its use. This approach allows for efficient mobile robot navigation in indoor monitoring applications.
This is the final year project on the topic of "Humanoid Robot" it consist of Abstract, Introduction, Literature Survey, System analysis, system requirements, Implementation conclusion and all.
With the development of robotics and artificial intelligence field unceasingly thorough, path planning for avoid
obstacles as an important field of robot calculation has been widespread concern. This paper analyzes the
current development of robot and path planning algorithm for path planning to avoid obstacles in practice. We
tried to find a good way in mobile robot path planning by using ant colony algorithm, and it also provides some
solving methods.
The document discusses natural semantics for a robot, where the robot can acquire and maintain meanings for itself through interaction with its environment, rather than just providing pre-programmed responses. It notes that reinforcement learning may allow a robot to develop a rudimentary natural semantic system. The document also describes Cohen's robot project, which uses clustering of sensor experiences to develop prototypes of activities and can plan goals, but is still missing an understanding of roles and representing experiences as interactions between entities.
This document provides a summary of a technical seminar presentation on multi-robot systems for space applications. The presentation covered topics including what space robots are, why they are used, examples of multi-robot systems, how path planning algorithms work, the key technologies used in space robots, a proposed multi-robot system architecture, the importance of space robots, types of space robots, and future space missions that will utilize robotics. The presentation provided information on space robots through diagrams, flow charts, and explanations of concepts like sensing, planning, control and execution in multi-robot systems.
This document provides an introduction to speech recognition and signal processing. It discusses how speech recognition systems like Google use deep learning and natural language processing to understand spoken queries and responses. It then gives a brief history of speech recognition, including early systems from Bell Labs in the 1950s and advances driven by DARPA, IBM and the introduction of hidden Markov models. Finally, it defines key concepts for audio signals like amplitude, wavelength, frequency, and explains how analog signals are converted to digital through the sampling process to allow for storage and processing.
Incremental Difference as Feature for LipreadingIDES Editor
This paper represents a method of computing
incremental dif ference f eatures on the basis of scan line
projection and scan converting lines for the lipreading problem
on a set of isolated word utterances. These features are affine
invariants and found to be eff ective in identification of
similarity between utterances by the speaker in spatial domain
Recognition of Facial Emotions Based on Sparse CodingIJERA Editor
This paper deals with acknowledgment of characteristic feelings from human countenances is a fascinating subject with an extensive variety of potential applications like human-PC communication, robotized mentoring frameworks, picture and video recovery, brilliant situations, what's more, driver cautioning frameworks. Generally, facial feeling acknowledgment frameworks have been assessed on lab controlled information, which is not illustrative of the earth confronted in genuine applications. To vigorously perceive facial feelings in genuine regular circumstances, this paper proposes a methodology called Extreme Sparse Learning (ESL), which can mutually take in a word reference (set of premise) and a non-direct grouping model. The proposed approach consolidates the discriminative force of Extreme Learning Machine (ELM) with the reproduction property of meager representation to empower exact arrangement when given uproarious signs and blemished information recorded in common settings. Moreover, this work exhibits another neighborhood spatioworldly descriptor that is particular what's more, posture invariant. The proposed structure can accomplish best in class acknowledgment precision on both acted what's more, unconstrained facial feeling databases.
Robotics has evolved significantly from early remote-controlled devices to modern intelligent machines. Early pioneers like Tesla developed remote-controlled boats in the late 1800s. The term "robot" was coined in a 1920 play. During WWII, militaries developed early autonomous systems for tasks like bomb disposal. Today robots are used widely for industrial manufacturing, space exploration, surgery, entertainment and more. Researchers are developing humanoid robots and drawing inspiration from biological systems like spider locomotion to create versatile machines. Evolutionary algorithms and other techniques allow robots to adapt their behavior through experience.
This document summarizes research on self-supervised multimodal representation learning and foundation models. It discusses how self-supervised learning can learn representations from data without direct human supervision across various media like language, vision, audio, and robotics. The document also examines combining these modalities to create more effective AI agents and highlights recent successes in multimodal representation learning while identifying ongoing challenges and opportunities for future work.
This document discusses communication techniques in swarm robotics. It summarizes three studies on this topic. The first two studies found that using chemical pheromones for communication between swarm robots resulted in better performance than not using pheromones. The third study used a heterogeneous swarm of footbots and eyebots that communicated via telecommunication; this approach worked well due to synchronization between the bot types. In conclusion, chemical communication works best due to the self-learning nature of the robots, but telecommunication can also be effective if different robot types are synchronized properly. Future work could explore new types of chemical trails that are not limited to land or use combinations of communication approaches.
Robotics involves the design, manufacture, and application of robots. It is an interdisciplinary field that includes aspects of electronics, mechanics, and software. Some key branches of robotics include android science, which studies human-robot interaction; artificial intelligence, which aims to create machine intelligence; and nanorobotics, which focuses on controlling nanoscale objects. Other branches discussed are robot surgery, laboratory robotics, robot locomotion, telepresence, swarm robotics, and speech processing.
Robot operating system based autonomous navigation platform with human robot ...TELKOMNIKA JOURNAL
In emerging technologies, indoor service robots are playing a vital role for people who are physically challenged and visually impaired. The service robots are efficient and beneficial for people to overcome the challenges faced during their regular chores. This paper proposes the implementation of autonomous navigation platforms with human-robot interaction which can be used in service robots to avoid the difficulties faced in daily activities. We used the robot operating system (ROS) framework for the implementation of algorithms used in auto navigation, speech processing and recognition, and object detection and recognition. A suitable robot model was designed and tested in the Gazebo environment to evaluate the algorithms. The confusion matrix that was created from 125 different cases points to the decent correctness of the model.
AN APPROACH OF IR-BASED SHORT-RANGE CORRESPONDENCE SYSTEMS FOR SWARM ROBOT BA...ijaia
This paper exhibits a short-run correspondence method appropriate for swarm versatile robots application.
Infrared is utilized for transmitting and accepting information and obstruction location. The infrared
correspondence code based swarm signaling is utilized for an independent versatile robot communication
system in this research. A code based signaling system is developed for transmitting information between
different entities of robot. The reflected infrared sign is additionally utilized for separation estimation for
obstruction evasion. Investigation of robot demonstrates the possibility of utilizing infrared signs to get a
solid nearby correspondence between swarm portable robots. This paper exhibits a basic decentralized
control for swarm of self-collecting robots. Every robot in the code based swarm signaling is completely
self-governing and controlled utilizing a conduct based methodology with just infrared-based nearby
detecting and correspondences. The viability of the methodology has been checked with simulation, for a
set of swarm robots.
Swarm intelligence is an artificial intelligence technique inspired by the collective behavior of decentralized and self-organized systems found in nature, such as ant colonies and bird flocks. Two common swarm intelligence algorithms are ant colony optimization and particle swarm optimization. Ant colony optimization is based on the behavior of real ant colonies and can be used to find approximate solutions to difficult optimization problems. Particle swarm optimization is a population-based stochastic optimization technique inspired by swarming behavior in nature, such as bird flocking. It searches for optimal solutions within a problem space by updating the movement of individual particles based on their own experiences and those of neighboring particles.
Language-Based Actions for Self-Driving RobotIRJET Journal
This document describes a framework for a self-driving robot to follow natural language commands. The framework uses sequence modeling to learn the meaning of sentences describing a path and identify relevant objects and prepositions. It then uses this information in the cognizance phase to generate a path and move the robot to accomplish the navigational goal described in the input sentence. The researchers created a virtual environment using Unity game engine to simulate the robot and collect training data on floor plans, sentences, and robot paths. They preprocessed this data and used hidden Markov models and a probabilistic graphical model to represent the temporal segments of sentences and learn the relationships between objects for sequence modeling.
In this report, one of the main applications of fuzzy logic is proposed i.e in robotic navigation.
Starting from scratch to building up the fuzzy logic and its validation using the MATLAB fuzzy logic toolbox , everything is covered in this report. If you find it helpful do like and share it with your friends. Fuzzy logic finds its application in AGVs and autonomous vehicles etc. Nowadays it is employed to find out the instantaneous power split ratio between the Engine and battery in the parallel hybrid EV.
This paper presents an expressive gesture model for generating communicative gestures with speech for the humanoid robot Nao. The model extends an existing virtual agent platform called GRETA to control gestures of both virtual and physical agents. Gestures are stored symbolically in a lexicon and selected based on intentions and emotions. Parameters of gestural expressivity like temporal extension and fluidity are applied when gestures are instantiated on the robot while considering its physical limitations.
EFFECTIVE REDIRECTING OF THE MOBILE ROBOT IN A MESSED ENVIRONMENT BASED ON TH...ijfls
The use of fuzzy logic in redirecting mobile robot is based on two sets of received information. First set is the instantaneous distance of the robot from the obstacle and second set is the instantaneous information of the robot's position. For this purpose, the fuzzy rules base consists of forty-two bases, which is extracted based on the robot's distance from obstacles, and the target position relative to the instantaneous orientation of the robot. In the structure of fuzzy systems, minimal inference engine are considered. Also, Extended Kalman filter is used for localization in a noisy environment. Accordingly, the inputs of the fuzzy systems are determined based on the estimation of the localization process, the information of the obstacles center and the target position. Also, the linear acceleration and instantaneous orientation of the mobile robot are determined by the desired fuzzy structures which are applied to its kinematic model.
EFFECTIVE REDIRECTING OF THE MOBILE ROBOT IN A MESSED ENVIRONMENT BASED ON TH...Wireilla
The use of fuzzy logic in redirecting mobile robot is based on two sets of received information. First set is
the instantaneous distance of the robot from the obstacle and second set is the instantaneous information of
the robot's position. For this purpose, the fuzzy rules base consists of forty-two bases, which is extracted
based on the robot's distance from obstacles, and the target position relative to the instantaneous
orientation of the robot. In the structure of fuzzy systems, minimal inference engine are considered. Also,
Extended Kalman filter is used for localization in a noisy environment. Accordingly, the inputs of the fuzzy
systems are determined based on the estimation of the localization process, the information of the obstacles
center and the target position. Also, the linear acceleration and instantaneous orientation of the mobile
robot are determined by the desired fuzzy structures which are applied to its kinematic model.
This document describes a design for mobile robot navigation using simultaneous localization and mapping (SLAM) and an adaptive tracking controller with particle swarm optimization in indoor environments. An adaptive fuzzy tracking controller is designed using 9 fuzzy rules to calculate a reference path for navigation between a starting and goal point. Particle swarm optimization is then used to optimize and reduce the time required for the calculated path. The controller is simulated in two indoor environments containing obstacles, and particle swarm optimization is shown to reduce navigation time compared to without its use. This approach allows for efficient mobile robot navigation in indoor monitoring applications.
This is the final year project on the topic of "Humanoid Robot" it consist of Abstract, Introduction, Literature Survey, System analysis, system requirements, Implementation conclusion and all.
With the development of robotics and artificial intelligence field unceasingly thorough, path planning for avoid
obstacles as an important field of robot calculation has been widespread concern. This paper analyzes the
current development of robot and path planning algorithm for path planning to avoid obstacles in practice. We
tried to find a good way in mobile robot path planning by using ant colony algorithm, and it also provides some
solving methods.
The document discusses natural semantics for a robot, where the robot can acquire and maintain meanings for itself through interaction with its environment, rather than just providing pre-programmed responses. It notes that reinforcement learning may allow a robot to develop a rudimentary natural semantic system. The document also describes Cohen's robot project, which uses clustering of sensor experiences to develop prototypes of activities and can plan goals, but is still missing an understanding of roles and representing experiences as interactions between entities.
This document provides a summary of a technical seminar presentation on multi-robot systems for space applications. The presentation covered topics including what space robots are, why they are used, examples of multi-robot systems, how path planning algorithms work, the key technologies used in space robots, a proposed multi-robot system architecture, the importance of space robots, types of space robots, and future space missions that will utilize robotics. The presentation provided information on space robots through diagrams, flow charts, and explanations of concepts like sensing, planning, control and execution in multi-robot systems.
This document provides an introduction to speech recognition and signal processing. It discusses how speech recognition systems like Google use deep learning and natural language processing to understand spoken queries and responses. It then gives a brief history of speech recognition, including early systems from Bell Labs in the 1950s and advances driven by DARPA, IBM and the introduction of hidden Markov models. Finally, it defines key concepts for audio signals like amplitude, wavelength, frequency, and explains how analog signals are converted to digital through the sampling process to allow for storage and processing.
Incremental Difference as Feature for LipreadingIDES Editor
This paper represents a method of computing
incremental dif ference f eatures on the basis of scan line
projection and scan converting lines for the lipreading problem
on a set of isolated word utterances. These features are affine
invariants and found to be eff ective in identification of
similarity between utterances by the speaker in spatial domain
Recognition of Facial Emotions Based on Sparse CodingIJERA Editor
This paper deals with acknowledgment of characteristic feelings from human countenances is a fascinating subject with an extensive variety of potential applications like human-PC communication, robotized mentoring frameworks, picture and video recovery, brilliant situations, what's more, driver cautioning frameworks. Generally, facial feeling acknowledgment frameworks have been assessed on lab controlled information, which is not illustrative of the earth confronted in genuine applications. To vigorously perceive facial feelings in genuine regular circumstances, this paper proposes a methodology called Extreme Sparse Learning (ESL), which can mutually take in a word reference (set of premise) and a non-direct grouping model. The proposed approach consolidates the discriminative force of Extreme Learning Machine (ELM) with the reproduction property of meager representation to empower exact arrangement when given uproarious signs and blemished information recorded in common settings. Moreover, this work exhibits another neighborhood spatioworldly descriptor that is particular what's more, posture invariant. The proposed structure can accomplish best in class acknowledgment precision on both acted what's more, unconstrained facial feeling databases.
Robotics has evolved significantly from early remote-controlled devices to modern intelligent machines. Early pioneers like Tesla developed remote-controlled boats in the late 1800s. The term "robot" was coined in a 1920 play. During WWII, militaries developed early autonomous systems for tasks like bomb disposal. Today robots are used widely for industrial manufacturing, space exploration, surgery, entertainment and more. Researchers are developing humanoid robots and drawing inspiration from biological systems like spider locomotion to create versatile machines. Evolutionary algorithms and other techniques allow robots to adapt their behavior through experience.
This document summarizes research on self-supervised multimodal representation learning and foundation models. It discusses how self-supervised learning can learn representations from data without direct human supervision across various media like language, vision, audio, and robotics. The document also examines combining these modalities to create more effective AI agents and highlights recent successes in multimodal representation learning while identifying ongoing challenges and opportunities for future work.
Similar to SpCoA: Nonparametric Bayesian Spatial Concept Acquisition (20)
Simultaneous Localization, Mapping and Self-body Shape Estimation by a Mobile...Akira Taniguchi
The document proposes a method called SLAM-SBE that allows a mobile robot to simultaneously estimate its location, map its environment, and determine its own body shape using only sensory-motor information. The method extends existing SLAM techniques by including a node for self-body information represented by an occupancy grid. An experiment shows that a robot is able to recursively update its estimated body shape over time based on its position in the environmental map. The estimated shapes were similar but not identical to the robot's actual shapes, with some errors due to sensor limitations. Future work could improve accuracy and apply the approach to 3D or multi-joint robotic systems.
This presentation provides valuable insights into effective cost-saving techniques on AWS. Learn how to optimize your AWS resources by rightsizing, increasing elasticity, picking the right storage class, and choosing the best pricing model. Additionally, discover essential governance mechanisms to ensure continuous cost efficiency. Whether you are new to AWS or an experienced user, this presentation provides clear and practical tips to help you reduce your cloud costs and get the most out of your budget.
Building Production Ready Search Pipelines with Spark and MilvusZilliz
Spark is the widely used ETL tool for processing, indexing and ingesting data to serving stack for search. Milvus is the production-ready open-source vector database. In this talk we will show how to use Spark to process unstructured data to extract vector representations, and push the vectors to Milvus vector database for search serving.
Generating privacy-protected synthetic data using Secludy and MilvusZilliz
During this demo, the founders of Secludy will demonstrate how their system utilizes Milvus to store and manipulate embeddings for generating privacy-protected synthetic data. Their approach not only maintains the confidentiality of the original data but also enhances the utility and scalability of LLMs under privacy constraints. Attendees, including machine learning engineers, data scientists, and data managers, will witness first-hand how Secludy's integration with Milvus empowers organizations to harness the power of LLMs securely and efficiently.
Digital Marketing Trends in 2024 | Guide for Staying AheadWask
https://www.wask.co/ebooks/digital-marketing-trends-in-2024
Feeling lost in the digital marketing whirlwind of 2024? Technology is changing, consumer habits are evolving, and staying ahead of the curve feels like a never-ending pursuit. This e-book is your compass. Dive into actionable insights to handle the complexities of modern marketing. From hyper-personalization to the power of user-generated content, learn how to build long-term relationships with your audience and unlock the secrets to success in the ever-shifting digital landscape.
Main news related to the CCS TSI 2023 (2023/1695)Jakub Marek
An English 🇬🇧 translation of a presentation to the speech I gave about the main changes brought by CCS TSI 2023 at the biggest Czech conference on Communications and signalling systems on Railways, which was held in Clarion Hotel Olomouc from 7th to 9th November 2023 (konferenceszt.cz). Attended by around 500 participants and 200 on-line followers.
The original Czech 🇨🇿 version of the presentation can be found here: https://www.slideshare.net/slideshow/hlavni-novinky-souvisejici-s-ccs-tsi-2023-2023-1695/269688092 .
The videorecording (in Czech) from the presentation is available here: https://youtu.be/WzjJWm4IyPk?si=SImb06tuXGb30BEH .
For the full video of this presentation, please visit: https://www.edge-ai-vision.com/2024/06/temporal-event-neural-networks-a-more-efficient-alternative-to-the-transformer-a-presentation-from-brainchip/
Chris Jones, Director of Product Management at BrainChip , presents the “Temporal Event Neural Networks: A More Efficient Alternative to the Transformer” tutorial at the May 2024 Embedded Vision Summit.
The expansion of AI services necessitates enhanced computational capabilities on edge devices. Temporal Event Neural Networks (TENNs), developed by BrainChip, represent a novel and highly efficient state-space network. TENNs demonstrate exceptional proficiency in handling multi-dimensional streaming data, facilitating advancements in object detection, action recognition, speech enhancement and language model/sequence generation. Through the utilization of polynomial-based continuous convolutions, TENNs streamline models, expedite training processes and significantly diminish memory requirements, achieving notable reductions of up to 50x in parameters and 5,000x in energy consumption compared to prevailing methodologies like transformers.
Integration with BrainChip’s Akida neuromorphic hardware IP further enhances TENNs’ capabilities, enabling the realization of highly capable, portable and passively cooled edge devices. This presentation delves into the technical innovations underlying TENNs, presents real-world benchmarks, and elucidates how this cutting-edge approach is positioned to revolutionize edge AI across diverse applications.
Your One-Stop Shop for Python Success: Top 10 US Python Development Providersakankshawande
Simplify your search for a reliable Python development partner! This list presents the top 10 trusted US providers offering comprehensive Python development services, ensuring your project's success from conception to completion.
GraphRAG for Life Science to increase LLM accuracyTomaz Bratanic
GraphRAG for life science domain, where you retriever information from biomedical knowledge graphs using LLMs to increase the accuracy and performance of generated answers
A Comprehensive Guide to DeFi Development Services in 2024Intelisync
DeFi represents a paradigm shift in the financial industry. Instead of relying on traditional, centralized institutions like banks, DeFi leverages blockchain technology to create a decentralized network of financial services. This means that financial transactions can occur directly between parties, without intermediaries, using smart contracts on platforms like Ethereum.
In 2024, we are witnessing an explosion of new DeFi projects and protocols, each pushing the boundaries of what’s possible in finance.
In summary, DeFi in 2024 is not just a trend; it’s a revolution that democratizes finance, enhances security and transparency, and fosters continuous innovation. As we proceed through this presentation, we'll explore the various components and services of DeFi in detail, shedding light on how they are transforming the financial landscape.
At Intelisync, we specialize in providing comprehensive DeFi development services tailored to meet the unique needs of our clients. From smart contract development to dApp creation and security audits, we ensure that your DeFi project is built with innovation, security, and scalability in mind. Trust Intelisync to guide you through the intricate landscape of decentralized finance and unlock the full potential of blockchain technology.
Ready to take your DeFi project to the next level? Partner with Intelisync for expert DeFi development services today!
Skybuffer AI: Advanced Conversational and Generative AI Solution on SAP Busin...Tatiana Kojar
Skybuffer AI, built on the robust SAP Business Technology Platform (SAP BTP), is the latest and most advanced version of our AI development, reaffirming our commitment to delivering top-tier AI solutions. Skybuffer AI harnesses all the innovative capabilities of the SAP BTP in the AI domain, from Conversational AI to cutting-edge Generative AI and Retrieval-Augmented Generation (RAG). It also helps SAP customers safeguard their investments into SAP Conversational AI and ensure a seamless, one-click transition to SAP Business AI.
With Skybuffer AI, various AI models can be integrated into a single communication channel such as Microsoft Teams. This integration empowers business users with insights drawn from SAP backend systems, enterprise documents, and the expansive knowledge of Generative AI. And the best part of it is that it is all managed through our intuitive no-code Action Server interface, requiring no extensive coding knowledge and making the advanced AI accessible to more users.
Have you ever been confused by the myriad of choices offered by AWS for hosting a website or an API?
Lambda, Elastic Beanstalk, Lightsail, Amplify, S3 (and more!) can each host websites + APIs. But which one should we choose?
Which one is cheapest? Which one is fastest? Which one will scale to meet our needs?
Join me in this session as we dive into each AWS hosting service to determine which one is best for your scenario and explain why!
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyScyllaDB
Freshworks creates AI-boosted business software that helps employees work more efficiently and effectively. Managing data across multiple RDBMS and NoSQL databases was already a challenge at their current scale. To prepare for 10X growth, they knew it was time to rethink their database strategy. Learn how they architected a solution that would simplify scaling while keeping costs under control.
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...Alex Pruden
Folding is a recent technique for building efficient recursive SNARKs. Several elegant folding protocols have been proposed, such as Nova, Supernova, Hypernova, Protostar, and others. However, all of them rely on an additively homomorphic commitment scheme based on discrete log, and are therefore not post-quantum secure. In this work we present LatticeFold, the first lattice-based folding protocol based on the Module SIS problem. This folding protocol naturally leads to an efficient recursive lattice-based SNARK and an efficient PCD scheme. LatticeFold supports folding low-degree relations, such as R1CS, as well as high-degree relations, such as CCS. The key challenge is to construct a secure folding protocol that works with the Ajtai commitment scheme. The difficulty, is ensuring that extracted witnesses are low norm through many rounds of folding. We present a novel technique using the sumcheck protocol to ensure that extracted witnesses are always low norm no matter how many rounds of folding are used. Our evaluation of the final proof system suggests that it is as performant as Hypernova, while providing post-quantum security.
Paper Link: https://eprint.iacr.org/2024/257
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...alexjohnson7307
Predictive maintenance is a proactive approach that anticipates equipment failures before they happen. At the forefront of this innovative strategy is Artificial Intelligence (AI), which brings unprecedented precision and efficiency. AI in predictive maintenance is transforming industries by reducing downtime, minimizing costs, and enhancing productivity.
1. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Spatial Concept Acquisition for a Mobile
Robot that Integrates Self-Localization and
Unsupervised Word Discovery from Spoken
Sentences
Akira Taniguchi, Tadahiro Taniguchi,
*Ritsumeikan University, Japan
Tetsunari Inamura
* National Institute of Informatics
* The Graduate University for Advanced Studies
1
IEEE Transactions on Cognitive and Developmental Systems, 2016. (accepted)
2. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
• Abstract
In this paper, we propose a novel unsupervised learning
method for the lexical acquisition of words related to
places visited by robots, from human continuous speech
signals.
We address the problem of learning novel words by a
robot that has no prior knowledge of these words
except for a primitive acoustic model.
Further, we propose a method that allows a robot to
effectively use the learned words and their meanings for
self-localization tasks.
The proposed method is nonparametric Bayesian spatial
concept acquisition method (SpCoA) that integrates the
generative model for self-localization and the
unsupervised word segmentation in uttered sentences
via latent variables related to the spatial concept.
The experimental results showed that SpCoA enabled
the robot to acquire the names of places from speech
sentences. They also revealed that the robot could
effectively utilize the acquired spatial concepts and
reduce the uncertainty in self-localization.
2Akira Taniguchi, Tadahiro Taniguchi, and Tetsunari Inamura, "Spatial Concept Acquisition for a Mobile Robot that Integrates Self-Localization
and Unsupervised Word Discovery from Spoken Sentences", IEEE Transactions on Cognitive and Developmental Systems, 2016.
Journal paper PDF
[arXiv]
[IEEE Xplore]
3. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Introduction
• The robot needs to learn knowledge related to the place
and space.
– For estimation of self-position
– For moving the environment
– For performing many tasks
– For communication with human
• The robot autonomously learns knowledge related to
places because the shape and place names of the space are
different in each environment.
– Generate the environmental map from sensorimotor
information (SLAM)
– Acquire novel words from human speech sentences
– Associate words to the parts of the environmental map
3
4. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Background of our research
• Language acquisition
– The robot does not have word knowledge in advance.
– The robot learns words from human speech signals.
– Estimating the identity of the speech recognition results
with errors is a very difficult problem.
• Self-localization
– The robot estimates self-position using sensor information
and an environment map.
– Probabilistic self-localization method
• We adopt Monte-Carlo Localization (MCL)
– If the robot uses local sensor only, global localization is a
very difficult problem.
4
5. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
• Uncertain verbal information
• Phoneme/Syllable recognition
• Unsupervised word segmentation
Language acquisition
• Uncertain position information
• It is estimated by Monte-Carlo
Localization (MCL)
Purpose of our research
5
Self-localization
Lexical acquisition
related to places
Mutually effective utilization
“Here is the front of
TV.”
/hia/, /iz/, /afroqtabu/, /tibe/??
/hiaiz/, /a/, /frontobutibi/??
Monte-Carlo Localization
/afroqtabutibe/
/bigbuqkkais/
/waitosherfu/
6. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Research outline 1/3
6
1. Teaching
The mobile robot can
recognize phonemes or
syllables.
7. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Research outline 1/3
7
1. Teaching
The mobile robot can
recognize phonemes or
syllables.
“Here is
Emergent
System Lab.”
/hiaizemajentoshitemurabo/ ?
Phoneme recognition
8. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Research outline 1/3
8
1. Teaching
The mobile robot can
recognize phonemes or
syllables.
“Here is
Emergent
System Lab.”
/hiaizemajentoshitemurabo/ ?
“This place is
a front of
elevator.”
/dispreisuizafroqtabueredeta/?
Multiple wordings
Multiple teachings
9. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Research outline 1/3
9
1. Teaching
The mobile robot can
recognize phonemes or
syllables.
“Here is
Emergent
System Lab.”
/hiaizemajentoshitemurabo/ ?|hia|iz|emajentoshitemurabo | ?
“This place is
a front of
elevator.”
/dispreisuizafroqtabueredeta/?
|dis|preisu|iz|
afroqtabu|eredeta|?
Unsupervised
word segmentation
10. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Research outline 2/3
10
2. Learning spatial concepts
• Place names
• Spatial areas
(position distribution)
Spatial concept
/eredeta//suteasu/
/emajentoshi
temurabo/
11. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Research outline 3/3
11
Before
3. Self-localization using spatial concepts
“Where is this
place?”
12. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Research outline 3/3
12
“This is a front
of elevator.”
Before
3. Self-localization using spatial concepts
/dis/, /iz/, /afroqtabu/, /eredeta / ?
“Where is this
place?”
13. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Research outline 3/3
13
“This is a front
of elevator.”
Before
After
3. Self-localization using spatial concepts
/dis/, /iz/, /afroqtabu/, /eredeta / ?
“Where is this
place?”
The robot can narrow down the hypothesis of self-position.
14. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Nonparametric Bayesian Spatial Concept
Acquisition method (SpCoA)
14
• This model can learn spatial concepts from continuous speech signals
• This model can learn an appropriate number of spatial concepts, depending
on the data (using nonparametric Beysian approach)
• This model can relate several places to several names.
(many-to-many correspondences between names and places)
/emajentoshitemurabo/,
/taniguchizurabo/
“Here is
Emergent
System Lab.”
/konekutiNgukorida/
(connecting corridor)
15. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
• The recognized phoneme sequence includes errors of phoneme because
the robot do not have vocabulary in advance.
• For learning words, the robot segments the phoneme sequence to words.
• We adopt an unsupervised word segmentation method from WFSTs [1]
• WFST (Weighted Finite-State Transducer): more compact representation
of N-best speech recognition
• Word segmentation using WFSTs can reduce the variability and errors in
phonemes
Language acquisition based on uncertain
speech recognition
15
Learning words from spoken sentences
[1] Graham Neubig, Masato Mimura, and Tatsuya Kawahara. Bayesian learning of a language model from
continuous speech. IEICE TRANSACTIONS on Information and Systems, Vol. 95, No. 2, pp. 614-625, 2012.
N
e
a
p
q
u
o
r
u
ex) WFST speech recognition result, “This is an apple.”
d i s
u
i z a
u
16. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
1. Registering words to the dictionary of speech recognition system
2. Recognizing a user’s speech using the dictionary
3. Modification of self-localization by using the speech recognition
result and spatial concepts
1. Speech recognition in the weighted finite-state transducer (WFST)
2. Unsupervised word segmentation from WFSTs
3. Estimating model parameters from self-positions and word
segmentation results
• The robot performs self-localization (MCL).
• The user says a sentence, including the name of the current place.
3. Self-localization using spatial concepts
2. Learning
1. Teaching
Procedure of SpCoA
16
17. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
𝑥 𝑡 ロボットの自己位置
𝑢 𝑡 制御値
𝑧𝑡 計測値
W 場所の名前の多項分布
O 𝑡,𝑏 𝑏番目の分割単語
C 𝑡 場所概念のindex
𝜋 場所概念のindexの多項分布
μ,Σ 位置分布(ガウス分布)
𝑖 𝑡 位置分布のindex
𝜙𝑙 位置分布のindexの多項分布
𝛼 𝜙𝑙のハイパーパラメータ
𝛾 𝜋のハイパーパラメータ
𝛽0 ディリクレ事前分布のハイパー
パラメータ
𝑚0, 𝜅0
𝑉0, 𝜈0
ガウス-ウィシャート事前分布
のハイパーパラメータ
1tx
xt
xt1
tz
tu
tC
btO , W
Σ
μ ti
l
K L
L𝐵𝑡
Overview of the graphical model of SpCoA
17
0
00,m
00,V
1tz
1tu
1tz
1tu
Self-localization
(Monte-Carlo Localization)
Clustering of
position distribution
(mixture of Gaussian
distributions)
Clustering of
place names
(word distributions)
This model can estimate both of spatial concepts and self-location
18. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Graphical model of SpCoA
18
19. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Generative model of SpCoA
19
The stick breaking process (SBP) is denoted as GEM(·)
The multinomial distribution as Mult(·)
The Dirichlet distribution as Dir(·)
The inverse–Wishart distribution as IW(·)
The multivariate Gaussian distribution as N(·)
The motion model and the sensor model of self-localization
are denoted as 𝑝(𝑥 𝑡|𝑥 𝑡−1, 𝑢 𝑡) and 𝑝(𝑧𝑡|𝑥 𝑡) respectively.
20. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Learning Algorithm
using Gibbs sampling
The sampling values of the model
parameters from the following joint
posterior distribution are obtained by
performing Gibbs sampling.
20
21. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
21
22. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Experiment:Learning spatial concepts
Conditions
• Environment: Creation-Core, Ritsumeikan University
• Robot:Turtlebot2
• Sensor device: Kinect (depth sensor)
• Speech recognition system:Julius *1
• Unsupervised word segmentation system:latticelm *2
22
*1:http://julius.sourceforge.jp/index.php
*2:http://www.phontron.com/latticelm/index-ja.html
23. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Experiment:Learning spatial concepts
Conditions
• Mapping using SLAM
• Teaching places : 19
• The number of teaching words :16
23
/kyouiNbeya/
/toire/
/tsubokeN/
/kitanokeN/
/nishikawakeN/
/raqkukeN//kameikuupaakeN/
/gomibako/
/puriNtaabeya/
/watarirouka/
/kaigishitsu/
/shinodaseyakeN/
/hagiwarakeN//gomibako/
/gomibako/
/watarirouka/
/kaidaNmae//kaidaNmae/
/souhatsukeN/
/taniguchikeN/
Same name, different places
Same place, different names
24. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Learning results of spatial concepts
• A point group of each color denoting each position
distribution was drawn on the map.
24
k=0
k=1
k=7
k=8k=10
k=11
k=12
k=13
k=14
k=23
k=29
k=35k=36
k=42
k=50
k=55
k=58
k=60
k=69
k=59
k is the index of the position distribution
25. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Learning results of spatial concepts
25
k=0
k=1
k=7
k=8k=10
k=11
k=12
k=13
k=14
k=23
k=29
k=35k=36
k=42
k=50
k=55
k=58
k=60
k=69
k=59
26. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Learning results of spatial concepts
26
k=0
k=1
k=7
k=8k=10
k=11
k=12
k=13
k=14
k=23
k=29
k=35k=36
k=42
k=50
k=55
k=58
k=60
k=69
k=59
27. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Learning results of spatial concepts
27
k=0
k=1
k=7
k=8k=10
k=11
k=12
k=13
k=14
k=23
k=29
k=35k=36
k=42
k=50
k=55
k=58
k=60
k=69
k=59
28. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Experiment:self-localization task
Conditions
• The flow of experiments
1. The initial particles were uniformly distributed on the entire floor.
2. The robot begins to move from a little distance away to the target place.
3. When the robot reached the target place, the user spoke the sentence
including the place name for the robot.
• The evaluation method
– Comparing the state of particles before and after a user’s speech
28
The state of initial particles
(Red arrows)
Speech recognition by
the word dictionary of
learned words
The robot performs
MCL during move.
29. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
29
Before
After
• Red arrows:particles (self-localization results)
• Green dotted arrows:the robot moving trajectory
The proposed method can modify self-localization by using spatial concepts.
Speech sentence
“koko wa souhatsukeN dayo”
(Here is Emergent system Lab.)
Speech recognition result
|koko|wa|sohatsuke|N|dayo|
30. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
• The proposed method can learn place names and areas
from continuous speech signals
• Learning novel words by the robot that has no word knowledge
• Learning relationship of names and places
Learning spatial concepts
• The robot can effectively utilize the spatial concepts for
the global self-localization
• Reducing estimation errors by user’s speech
• Enable even if learned words included phoneme errors
Self-localization using spatial concepts
Conclusion
30
31. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
The simulator results of
learning spatial concepts
31
32. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Comparison of the phoneme accuracy rates of uttered
sentences for different word segmentation methods
We compared the performance of three types of word segmentation
methods for all the considered uttered sentences.
32
Examples of word segmentation results of uttered sentences
33. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Comparison of the accuracy rates of the
estimation results of spatial concepts
We compared the matching rate with the estimation results of index Ct of the
spatial concepts of each teaching utterance and the classification results of
the correct answer given by humans.
33
ARI : the adjusted Rand index
DPM : the Dirichlet process mixture of the unigram model of an SBP representation
34. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
The phoneme accuracy rate (PAR) scores for the
word considered the name of a place
We evaluated whether the names of places were properly learned for
the considered teaching places.
34
35. A. TANIGUCHI et al.: SPATIAL CONCEPT ACQUISITION FOR A MOBILE ROBOT
Quantitative evaluation of estimation errors and the
estimation accuracy rate (EAR) of self-localization
We compare the estimation accuracy of localization for the
proposed method (SpCoA MCL) and the conventional MCL.
35