@Article{Granger2006, author = {Granger, Richard}, title = {Engines of the brain: The computational instruction set of human cognition}, journal = {AI Magazine}, year = {2006}, volume = {27}, number = {2}, pages = {15-33}, }
Argument: AI's are underconstrained because they are not informed enough by neurology. The same behaviours can be output by a variety of different algorithms; we need to focus on those functions we can derive from brain anatomy. Functions emerge from bottom-up analysis: hierarchical structure, embedded structures, hash coding. Anatomical layout establishes how individual operators are composed into larger routines = "the 'instruction set' of the brain, constitut[ing] the basic mental procedures from which all major behavioural and cognitive operations are assembled."
Sensory input: posterior cortext receives input from diencephalic thalamic nuclei
Motor output: interactions between anterior cortex and the striatal complex or basal ganglia
Mammalian brains scale well. Those aspects of brains that grow disproportionately large as brain-to-body-size ratio increases are likely to be very important: they are the "key to human intelligence." Sites that are candidates for this:
Consequences: for instance, if you damage a mouse's cortical motor areas, it doesn't have much effect, because it uses more brain-stem regions for this function; on the other hand, damage a human's cortical motor area and the result is paralysis. Reason: the striatal region has become a "subroutine" that humans' motor cortex can call.
Basal ganglia/striatal complex is the primary brain system in reptiles, second-largest brain structure in humans. Modules within the basal ganglia are connected through excitatory (activating), modulatory (strength-changing) and inhibitory (suppressing) neuro-transmitter pathways.
There are four parts of this system:
In sum, the striatal complex is "an adaptive controller, beginning with preset responses to inputs, tracking the outcomesw of those responses, and altering behaviour to continually improve those outcomes, as in reinforcement learning algorithms." [p 19]
2 parallel circuit types:
1. Core Loop: Superficial cells that initially respond to a particular input pattern become increasingly responsive to that input and similar inputs = clustering or categorization. Superficial layer responses activate deep layers, retaining topography; is then sent to the nucleus reticularis, which then inhibits the core thalamic nucleus. "Thus it is hypothesized that the predominant component of the next input to cortex is only the uninhibited remainder of the input, whereupon the same operations as before are performed. Thus the second cortical response will consist of a quite distinct set of neurons from the initial response, since many of the input components giving rise to that first response are now inhibited." This gets us category -> subcategory -> subsubcategory activation of the cortex.
Simplified Thalamocortical Core Algorithm: for input X for C &element; win(X,W) W <= W1 + k(X-C) end_for X <= X- mean(win(X,W)) end_for where X = input activity pattern (vector); W = layer I synaptic weight matrix; C = responding superficial layer cells (col vector); k = learning rate parameter win(X,W) = column vector in W most responsive to X before lateral inhibition [p 20]
2. Thalamocortical "Matrix" Circuits:
Diffuse projections from layer V to matrix nuclei, and from matrix nuclei back to cortex sparsify and orthogonalize their inputs. Structural relationships that may obtain among inputs are not retained in resulting projections. The effect of this layer is to "chain" elements in the input sequence through "links" created due to coincident layer V activity. "The implicit data structures created are trees in which initial sequence elements branch to their multiple possible continuations." [p 23] This is conceptualized as "scatter storage" or a hash function.
Simplified Thalamocortical Matrix Algorithm: for input sequence X(L) for C &element; TopographicSuperficialResponse(X(L)) for V(s) &element; NNtResponse (X(L-1)) Potentiate (V(s)) NNt(L) = NontopographicDeepResponse(V) end_for end_for end_for where L = length of input sequence; C = columnar modules activated at step X(L); V(s) = synaptic vector of responding layer V cell, NNt(L) = response of nonspecific thalamic nucleus to feedback from layer V. [p 22]
The thalamo-cortico-stiatal system gives rise to reinforcement learning of similarity based clusters, and brief sequences (and by extension, brief sequences of similarity based clusters.) The output of any region becomes input to another region, and each region in the thalamo-cortico-striatal pathway performs the same sort of analysis on inputs, "generating learned nested sequences of clusters of sequences of clusters." [p 23]
Auditory processing: statistical filters note a sequence of features and choose a best partial match from prior input to determine what a word is. Visual Image Processing: Beyond specialized cortical areas, auditory and visual processing are similar neurologically, at the thalamocortical circuitry level. "It is here hypothesized that although primary cortical regions perform specialized processing, subsequent cortical regions treat all inputs teh same, regardless of modality of origin." Curve-and-line information from primary visual cortex is later analysed similarly to auditory information, statistically segmented into objects etc using partial best match. Auditory parsing (because it is only left-right, time delimited) can be thought of as a subroutine of visual processing (which is more-dimensional). Though all "regions" are identical in structure, they receive different inputs, and after exposure to many inputs, regional specializations of function arise due to lateral competition among areas [pp 23-24].
Hierarchical Grammatical Structure: "It is notable that the emergent data structure of the thalamo-cortico-striatal model, nested sequences of clusters, is a superset of the structures that constitute formal grammars, that is, ordered sequences of 'proto-grammatical' elements, such that each element represents either a category (in this case a cluster), or expands to another such element (nesting), just as rewrite rules establish new relations among grammatical elements." [p 26] Data structures can grow new rules with more loops added to the thalamo-cortico-striatal system. Higher brain-to-body-size-ratio = more loops, more complex grammars. "Growth of grammars need not be linear; grammars have the property of exhibiting apparently new behaviours due to the addition of just a few rules." [p 27] Processing of linguistic inputs is the same as processing of other sensory inputs, but has become more complex with our brain size's growth.