Locations in the Neocortex: A Theory of Sensorimotor Prediction Using Cortical Grid Cells

Workshop on Integrative Theories
of Cortical Function
July 18, 2018
Subutai Ahmad
sahmad@numenta.com
@SubutaiAhmad
Locations in the Neocortex:
A Theory of Sensorimotor Prediction Using
Cortical Grid Cells
Collaborators:
-Jeff Hawkins, Marcus Lewis,
Scott Purdy, Mirko Klukas

Observation:
The neocortex is constantly predicting its inputs.
“the most important and also the most neglected problem
of cerebral physiology” (Lashley, 1951)
How can networks of neurons learn predictive models of
the world and objects through movement?
Question:
The neocortex uses “cortical grid cells” and path
integration to model an object’s structure.
Proposal:

1) Properties of grid cells
3) Mapping to biology
- Suggestive biological evidence
- Cortical columns revisited
- Model predictions
- Encoding location using grid cells
- Path integration and location spaces
- Sensory cues and recalling locations
- Learns predictive models of objects using locations and sensory cues
- Simulation results
2) A network model of sensorimotor prediction using cortical grid cells

Single Grid Cell
(Moser & Moser, 2013)

Grid Cell Module Contains Multiple Cells That Are Offset

Multiple Modules At Different Scales and Orientations
Environment

Multiple Modules Can Represent Locations Uniquely
(Fiete et al, 2008; Sreenivasan and Fiete, 2011)

Path Integration
(Hafting et al, 2005;
McNaughton et al., 2006; Ocko et al., 2018)
- As animal moves, grid cells update their location
- This can happen in the dark, using efference copy of motion signals
- Path integration: regardless of path trajectory, the same location in
environment will activate a consistent grid code
- Imprecise, so sensory cues are used to “anchor” grid cells

Unique Location Spaces For Each Environment
- Each module randomly initialized in a new room, codes for each room will be unique
(e.g. 20 modules, 100 cells each = 10020 possible codes)
- Initial point implicitly defines a location space for each environment
- Modules update independently, so path integration still works
- Sensory cues can be used to anchor or re-activate the location space
(Rowland & Moser, 2014)

Summary: Grid Cells Represent the Structure of Environments
Entorhinal Cortex
Body in environments
Location
- Encoded by grid cells
- Unique to location in room AND room
- Location is updated by movement
A Room is:
- A set of locations that are connected
by movement (via path integration).
- Some locations have associated
features.

Proposal: Cortical Grid Cells Represent the Structure of Objects
Location
- Encoded by grid cells
- Unique to location in room AND room
A Room is:
features.
Entorhinal Cortex
Cortical Column
Sensor patch relative to objects
Location
- Encoded by grid-like cells
- Unique to location on object AND object
An Object is:
features.

Network Model
Grid cell modules
use movement to
update their location
Sensory predictions
(forward model)Predictions + sensation =>
sensory representation
- Sequence of discrete time steps
- Each time step consists of 4 stages
representing a movement followed by a
sensation
New sensation updates
location

Updating Grid Cell Module Based On Motion
- Given a motion 𝒅 𝑡
, phase is shifted according to:
- Threshold activity to get a binary location representation
- Cells in each module represented by 2D phase Φ
- Activity at time t is a bump centered around a
phase Φ 𝑡
𝑖
- Each module has a different scale and orientation,
represented by a transform matrix:
𝑴𝑖 =
𝑠𝑖 cos 𝜃𝑖 − 𝑠𝑖 sin 𝜃𝑖
𝑠𝑖 sin 𝜃𝑖 𝑠𝑖 cos 𝜃𝑖
−1
Φ 𝑡,move
𝑖
= 𝜑 + 𝑴𝑖 𝒅 𝑡 mod 1.0 𝜑 ∈ Φ 𝑡−1
𝑖

Location Layer Forms Predictions in Sensory Layer
Sensory predictions
(forward model)

Sensory Layer is a Sequence Memory Layer
- Neurons in a mini-column learn same FF receptive field.
- Distal dendritic segments form connections to cells in location layer.
- Active segments act as predictions and bias cells.
- With sensory input these cells fire first, and inhibit other cells within
mini-column.
(Hawkins & Ahmad, 2016; Hawkins et al, 2017)
No prediction
t=0
t=1
Sensory input
Predicted input
t=0
Predicted cells
inhibit neighbors
t=1
Sensory input
Sensory layer
Very specific sparse representation
that encodes the current sensory
input at the current location.
Dense representation that activates
the codes for this sensory input at
any location.

Multiple Simultaneous Locations Represents Uncertainty
- Sensory representation activates grid cell locations
- If this sensory representation is not unique, we activate
a union of grid cells in each module
- With sufficiently large modules, the union can represent
several locations without confusion
- Movement shifts all active grid cells
(Ahmad & Hawkins, 2016)

Learning
- For a new object, we first activate a random
cell in each module and move to first feature
- Select random subset of active sensory cells for
this sensory input. Store location representation
on independent dendritic segment in sensory
cells.
- Store this sensory representation on an
independent dendritic segment in each active
location cell
- Move to next feature and repeat
- This process invokes a new unique location
space and sequentially stores specific location
representation with sensory cues, and specific
sensory code with locations.

An Example Recognition Walkthrough
Sense f1

Move to f2

Sense f2

Move to f1

Simulation: Network Convergence

Network Convergence Improves With More Modules
100 objects with 10 points each, pool of 100 unique features

Network Convergence vs Ideal Observer
100 objects, 10 unique features, 20 modules

Capacity of Single Cortical Column
100 unique features, 20 modules

Two Layer Network Model For Sensorimotor Inference
- Structure of objects are defined by the
relative locations of sensory features
- Grid cell code enables a powerful
predictive sensorimotor network
- Objects are recognized through a
sequence of movements and sensations
- Simulation results demonstrate
convergence and capacity

Sensorimotor Inference With Multiple Columns And Long-
Range Lateral Connections
(Hawkins et al, 2017)

Sensorimotor Inference With Multiple Columns And Long-
Range Lateral Connections
- Each column has partial knowledge of object.
- Columns use lateral context and vote through long range lateral connections.
- Inference is much faster with multiple columns.
(Hawkins et al, 2017)

L2/3
L4
L6
L5
Input
Anatomy of a Cortical Column
1) Cortical columns are really complex! The function of a cortical
column must also be complex.
2) Large reciprocal projections from L6a to L4
3) Motor signals drive activity in L6
Simple
Output L2
L3a
L3b
L4
L6a
L6b
L6 ip
L6 mp
L6 bp
L5 tt
L5 cc
L5 cc-ns
L5: Calloway et. al, 2015
L6: Zhang and Deschenes, 1997
Binzegger et al., 2004
Nelson et al., 2013
Output, via thalamus
50%10%
Cortex
Thalamus
Output, direct
L5 CTC: Guillery, 1995
Constantinople and Bruno, 2013
Long range lateral connections

1) Border ownership cells:
Cells fire only if feature is present at object-centric location on object.
Detected even in primary sensory areas (V1 and V2).
(Zhou et al., 2000; Willford & von der Heydt, 2015)
2) Grid cell signatures in cortex:
Cortical areas in humans show grid cell like signatures (fMRI and single cell recordings)
Seen while subjects navigate conceptual object spaces and virtual environments.
(Doeller et al., 2010; Jacobs et al. 2013; Constantinescu et al., 2016; )
3) Sensorimotor prediction in sensory regions:
Cells predict their activity before a saccade.
Predictions during saccades are important for invariant object recognition.
(Duhamel et al., 1992; Nakamura and Colby, 2002; Li and DiCarlo, 2008)
4) Hippocampal functionality may have been conserved in neocortex:
Six-layer neocortex evolved by stacking 3-layer hippocampus and piriform cortex
(Jarvis et al., 2005; Luzatti, 2015)
Other Biological Evidence
33

Experimentally Testable Predictions
34
1) Object coding:
Every sensory region will contain layers that are stable while sensing a familiar object.
The set of cells will be sparse but specific to object identity.
Ambiguous information will lead to denser activity in upper layers.
Each region will contain cells tuned to locations of features in the object’s reference frame.
(Zhou et al., 2000; Zheng & Kwon, 2018)
2) Cortical columns:
Cortical cols can learn complete object models
Complexity of objects tied to span of long-range lateral connections
Activity within stable layers will converge slower with long–range connections disabled
Subgranular layers of primary sensory regions (Layer 6) will be driven by motor signals
Grid-like cells in Layer 6a
(Nelson et al., 2013; Sutter et Shepherd, 2015; Lee et al, 2008; Leinweber et al, 2017)

Summary: Cortical Grid Cells Represent the Structure of Objects
Entorhinal Cortex
Cortical Column
Sensor patch relative to objects
1. Grid cell code provides a neural mechanism for representing locations
2. Cortical grid cells can build predictive sensorimotor models of objects
3. Suggestive anatomical and physiological mapping to biology

Numenta Team
Jeff Hawkins Marcus Lewis
Contact: sahmad@numenta.com
Scott Purdy
Mirko Klukas Luiz Scheinkman
Max Schwarzer

Locations in the Neocortex: A Theory of Sensorimotor Prediction Using Cortical Grid Cells

Locations in the Neocortex: A Theory of Sensorimotor Prediction Using Cortical Grid Cells

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Locations in the Neocortex: A Theory of Sensorimotor Prediction Using Cortical Grid Cells

Similar to Locations in the Neocortex: A Theory of Sensorimotor Prediction Using Cortical Grid Cells (20)

More from Numenta

More from Numenta (15)

Recently uploaded

Recently uploaded (20)

Locations in the Neocortex: A Theory of Sensorimotor Prediction Using Cortical Grid Cells