On Intelligence, by Jeff Hawkins (2004)

What a book. I don’t know where to start.

I had hoped to come away with an understanding of the overall theory and had expected that the details would be mostly irrelevant to me as soon as they had served their purpose of explaining some higher-level concept, but fuck me, the details are interesting.

The way the problem is broken down makes me desperately want to understand more of it. I know nothing about neuroscience, but the whole book reads like an invitation; here’s the goal, and here’s what we know, connect the two.

A few things stood out as excellent while reading; clearly drawn boundaries between established science and the author’s thinking, elegant abstractions of the physical machinery of the brain, and attempts to answer nearly every question I thought of as I worked through it.

I can’t wait to read the follow up, A Thousand Brains, which is said to have solutions to some of the problems introduced in this book.

Useful definitions

Some very rough notes

The neocortex is the outermost layer of the brain. It is 2mm thick, has six layers, no discernable separation between different areas, and it houses intelligence.

In 1978 Vernon Mountcastle suggested that because the neocortex looks the same throughout, it could all be doing the same thing. Regardless of whether it’s vision, hearing, touch, etc., it could be doing the same processing with different sensory inputs. A common cortical algorithm. The book runs with this assumption.

The neocortex remembers patterns by storing invariant representations of them. Taking vision as an example, whenever a face is anywhere in your view, a certain set of cells in the visual part of your cortex lights up. It doesn’t matter who’s face it is or what it looks like, or even if it’s half of face instead of a full one, as soon as a pattern partially matching that of your internal representation of a face is present in your line of sight, those cells become active and stay active until the face disappears from view.

This happens automatically. You don’t need to dig around in your brain for the memory of what a face looks like and match it to the sensory inputs you’re receiving; it activates automatically as the result of a face being in view.

These representations have a temporal dimension as well as a spatial one. You can’t recognise a song from a single beat for the same reason that you can’t remember an entire song at once. Your invariant representation of it has a temporal dimension, it’s a pattern that plays out over time.

V1 V2 V4 IT region L1 L2 L3 L4 L5 L6 neurons cortical column layers within region hierarchy of cortical regions
Figure 1. A mash-up of figures in the book to help visualise this summary. Opens in a new window.

Figure 1 depicts the visual pathway in the neocortex on the left.

There are four hierarchical levels in the visual pathway; V1, V2, V4, and IT. V1 is at the bottom and receives sensory input first. There is no physical basis for the hierarchy (no ā€œaboveā€ or ā€œbelowā€ in the brain) other than the manner in which the regions communicate to each other, although lower regions are much larger than higher regions.

The conventional view sees each level in the hierarchy as a distinct region performing a similar task throughout. V1 would perform low level recognition on its inputs, such as the detection of lines, then pass it up the to V2 which might in turn recognise shapes, and so on. Higher level regions, such as IT, are called ā€œassociationā€ areas. These combine the information from lower regions and have a bird’s eye view over the pathway below. In this view, invariant representations are formed only when the input reaches the top, at IT.

The book challenges that traditional view by proposing that each traditional region (e.g. all of V2) should be divided into many smaller subregions, as shown on the left of Figure 1. Applying the common cortical algorithm assumption leads the book to propose that each of these subregions performs the same task as any other. That means that invariant representations exist in all of them, not just in the higher association areas like IT. Every region forms memories of patterns in the inputs coming from the regions below it.

Inside each sub-region (shown on the right of Figure 1), information flows up and down cortical columns. A cortical column represents the smallest unit of computation in the neocortex. Information in each cortical column behaves similarly, for example, converging inputs from lower regions arrive at Layer 4 before being passed up to Layers 2 and 3.

In the flow of information up and down columns and between regions, the book attempts to identify the mechanisms that enable the predictive activation of invariant representations that are necessary to explain our experience.

Taking imagination as an example. The book says that if you close your eyes and imagine a hippopotamus, the visual area of your brain associated with seeing a hippopotamus lights up. It then shows how the connections between neurons in a cortical column allow for this by turning its predictions into its own inputs, essentially feeding back into itself.

Highlights

005

Most scientists say that because the brain is so complicated, it will take a very long time for us to understand it. I disagree. Complexity is a symptom of confusion, not a cause.

012

The biggest reason I thought computers would not be intelligent is that I understood how computers worked, down to the level of the transistor physics, and this knowledge gave me a strong intuitive sense that brains and computers were fundamentally different. I couldn't prove it, but I knew it as much as one can intuitively know anything.

017

Computers could play checkers at expert skill levels and eventually IBM's Deep Blue famously beat Gary Kasparov, the world chess champion, at his own game. But these successes were hollow. Deep Blue didn't win by being smarter than a human; it won by being millions of times faster than a human. Deep Blue had no intuition. An expert human player looks at a board position and immediately sees what areas of play are most likely to be fruitful or dangerous, whereas a computer has no innate sense of what is important and must explore many more options. Deep Blue also had no sense of the history of the game, and didn't know anything about its opponent. It played chess yet didn't understand chess, in the same way that a calculator performs arithmetic but doesn't understand mathematics.

025

[...] for every fiber feeding information forward into the neocortex, there are ten fibers feeding information back toward the senses. Feedback dominates most connections throughout the neocortex as well.

029

But intelligence is not just a matter of acting or behaving intelligently. Behavior is a manifestation of intelligence, but not the central characteristic or primary definition of being intelligent.

042

[...] the neocortex is about 2 millimeters thick and has six layers.

042

Humans are smarter [than other mammals] because our cortex, relative to body size, covers a larger area, not because our layers are thicker or contain some special class of "smart" cells.

042

Your neocortex is loaded with nerve cells, or neurons. They are so tightly packed that no one knows precisely how many cells it contains. If you draw a tiny square, one millimeter on a side [...], you are marking the position of an estimated one hundred thousand (100,000) neurons. Imagine trying to count the exact number in such a tiny space; it is virtually impossible. Nevertheless, some anatomists have estimated that the typical human neocortex contains around thirty billion neurons (30,000,000,000), but no one would be surprised if the figure was significantly higher or lower.

Those thirty billion cells are you. They contain almost all your memories, knowledge, skills, and accumulated life experience. After twenty-five years of thinking about brains, I still find this fact astounding. That a shin sheet of cells sees, feels, and creates our worldview is just short of incredible. The warmth of a summer day and the dreams we have for a better world are somehow the creation of these cells.

045

All functional areas of the cortex reside in the same convoluted cortical sheet. What makes one region "higher" or "lower" than another is how they are connected to one another. In the cortex, lower areas feed information up to higher areas by way of a certain neural pattern of connectivity, while higher areas send feedback down to lower areas using a different connection pattern.

046

Eventually, sensory information passes into "association areas," which is the name sometimes used for the regions of the cortex that receive inputs from more than one sense. For example, your cortex has areas that receive input from both vision and touch. It is thanks to association regions that you are able to be aware that the sight of a fly crawling up your arm and the tickling sensation you feel there share the same cause. Most of these areas receive highly processed input from several senses, and their functions remain unclear.

Out of respect for the author's work, this content is truncated. To view it, please enter the code below.