Train Your Dog

Learning from how dogs learn Prof. Bruce Blumberg The Media Lab, MIT [email_address] www.media.mit.edu/~bruce

Practical & compelling real-time learning ,[object Object],[object Object],[object Object],[object Object]

My bias & focus ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

sheep|dog:trial by eire See sheep|dog video on my website

Object persistence See object persistence video on my website

Temporal representation See temporal representation (aka Goatzilla) video on my website

Alpha Wolf See alpha wolf video on my website

[email_address] See rover@home video on my website or go to Scientific American Frontiers website

Dobie T. Coyote Goes to School See Dobie video on my website

Why look at Dog Training? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Invaluable resources ,[object Object],[object Object],[object Object],[object Object]

The problem facing dogs (real and synthetic) Set of all possible actions Set of all motivational goals Set of all possible stimuli What do I do, when, in order to best satisfy my motivational goals?

The space of possible stimuli is wicked big Time of Occurence State Space Set of all possible stimuli Smells Motion Sounds Dog sounds Speech Whistles Modality of Stimuli

The space of possible actions is also very big Set of all possible actions Action Time of Performance Action Space Figure -8 Shake Low shake High -5 Beg Down Left ear twitch

Who gets credit for good things happening? Yumm.. Action Figure -8 Shake High -5 Beg Down Left ear twitch Modality of Stimuli Low shake Motion Sounds Dog sounds Speech Whistles

Who gets credit for good things happening? Yumm.. Time stalk grab-bite eye orient kill-bite chase

Conventional idea: back propagation from goal stalk grab-bite eye orient kill-bite chase Yumm.. Time Credit flows backward

The problem ,[object Object],[object Object],[object Object],[object Object]

Leyhausen’s suggestion… stalk grab-bite eye orient kill-bite chase Time Each element is innately self-motivating and has innate reward metric motivation & reward motivation & reward motivation & reward motivation & reward motivation & reward motivation & reward

Coppinger’s suggestion… stalk grab-bite eye orient kill-bite chase Time Varying innate tendency to follow behavior with “next” in sequence

Functional goal plays incidental role stalk grab-bite eye orient kill-bite chase Time Propagated value from functional goal plays incidental role Yumm..

Big idea: innate biases make learning possible ,[object Object],[object Object],[object Object],[object Object],[object Object]

Good trainers actively guide dog’s exploration ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Dogs constrain search for causal agents Time Consequences Window: Trainer “clicks” signaling reward is coming. When reward is actually received Attention Window: Cue given immediately before or as dog is moving into desired pose Sit Approach Eat Dogs make the problem tractable by constraining search for causal agents to narrow temporal windows

Dogs use implicit feedback to guide perceptual learning Sit Time “ sit-utterance” perceived. Approach Eat “ click” perceived. Dog decides to sit Build & update perceptual model of “sit-utterance” Dogs use rewarded action to identify potentially promising state to explore and to guide formation of perceptual models

Dogs give credit where credit is due… ,[object Object],[object Object],[object Object],[object Object]

Observation: dogs give credit where credit is due Sit Time “ sit-utterance” perceived. Approach Eat “ click” perceived. Dog decides to sit ,[object Object],[object Object]

D.L.: Take Advantage of Predictable Regularities ,[object Object],[object Object],[object Object],[object Object],[object Object]

D.L.: Make Use of All Feedback: Explicit & Implicit ,[object Object],[object Object],[object Object],[object Object]

D.L.: Make Them Easy to Train ,[object Object],[object Object],[object Object],[object Object],[object Object]

Dobie T. Coyote… See dobie video on my website

Limitations and Future Work ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Useful Insights ,[object Object],[object Object],[object Object],[object Object],[object Object]

Acknowledgements ,[object Object],[object Object],[object Object]

Train Your Dog

Recommended

Recommended

More Related Content

Similar to Train Your Dog

Similar to Train Your Dog (20)

Recently uploaded

Recently uploaded (20)

Train Your Dog