We come to know about reality through our perceptions: reality impresses the senses, and perception emerges as an abstraction of that reality. Perception is itself reflected upon and analyzed through the conscious and subconscious mind. Associations between stimuli and various abstract representations get imprinted in memory. These associations get referenced and updated continuously as new stimuli occur and thoughts ruminate. How much information is in this narrative? How much understanding can be squeezed from this process of perception and reflection? What are the mechanisms of our understanding?
Abstraction depends intimately on the notion of information, and to meaningfully talk about the issues given above requires that these concepts be well-defined and thought-out. That shall be the purpose of the next two sections. Once we have a good grasp of the natures of abstraction and information, we can tackle the problem of how organisms abstract information from the external environment through the senses. After that will be a shallow description of my conception of low-level perception, followed by an explanation of how we might recursively abstract information from more generalized perceptual data sets. I will not offer a computational model of human abstraction, only a rough outline of what one might expect the mechanisms to be like.
II. What is information?
While information at one point in time may have been inadequately defined and fraught with confusion, it is now very well-defined and quantifiable thanks to Claude E. Shannon (1948), the father of Information Theory. To help explain the theory, I will start with an intuition pump or thought experiment. Suppose we have a string of events, a string of coin flips using a supposedly unbiased coin. The outcome of a single event is either a head or a tail. Statistically, as the number of events grows, we should expect the number of heads outcomes and tails outcomes to be the same. If the coin is truly unbiased, we should not be able to predict any outcome or subset of outcomes in the entire string of outcomes no matter how much we know about past outcomes. Essentially, very little can be learned from any flip of a truly unbiased coin. All we can do is estimate the bias of the coin based on the past outcomes, and this estimation should approach zero as we accumulate more and more outcomes. Such a system is said to have an information entropy [1] of one bit, i.e. it takes one bit per outcome to represent the entire string of outcomes. Since there are only two possible states for an outcome of a coin flip and only two possible states of a bit, the strings of outcomes and bits in this example are isomorphic and may as well be identical. Thus there is virtually no information content in a string of unbiased coin flips; if randomness has a coherent definition, it must surely be this, the lacking of information. Conversely, if we flip a coin that has two heads, then the outcome will always be the same, and we can predict the outcomes of all future coin flips. The information entropy in this case is zero, and we can represent an unbounded number of coin flips with zero bits: the outcome of each coin flip can be taken for granted, and thus representation is unnecessary.
[1] Information entropy is a metric that represents the lack of information in a string of data.
Information Theory characterizes the limits of abstract representation of data sets. Any arbitrary finite data set can be represented by a string of bits of at least the same dimensionality as the data set. If a data set has structure or statistical correlations between the data points, then there exists a pattern within the data that can be represented by a string of bits that is of a lower dimensionality than the data set. For instance, most electrical data sets acquired in America have a very typical and predictable time varying component with a frequency of around 60 Hz due to the Electro-Magnetic Interference (EMI) from the power grid infrastructure. This 60 Hz structure can symbolically be represented by a trigonometric function of two numbers (of arbitrary but finite precision), amplitude and frequency, regardless of the size of the data set. The data set can be reduced or compressed from N dimensions to 2 dimensions without theoretically losing any information about the 60 Hz structure. In practice, however, no simple function is able to precisely represent most real data sets, so some information is inevitably lost through data compression. The loss of information through abstraction is known as lossy data compression. It may sound bad to lose relevant information, but usually the loss is so minor in effect as to be unnoticeable in most applications. When a data set can be perfectly recovered from an abstract representation, this is known as lossless data compression, which can normally only be achieved in a digital environment.
III. What is abstraction?
I mentioned abstraction and representation a few times in the previous section, and now it's time to explain what I mean by those terms. There have been centuries-worth of debates over what exactly abstraction is. For my part, I tend to view abstraction from a Nominalist perspective, meaning that I don't privilege abstraction with its own special ontological status separate from physical reality like Plato, Descartes, or Frege. Given the potential for confusion over terms, I will offer my own definition of abstraction.
Abstraction is the process of paring down larger-dimensional data sets to smaller-dimensional data sets while preserving the information of interest. A representation or abstractum is the end result of the process of abstraction; it is a smaller-dimensional problem derived from a larger-dimensional problem. There is in general a lot of information in raw sets of data, but at any given moment only a small subset of that information is relevant for a particular end or goal. The information that is not relevant can be ignored or destroyed by reducing or compressing the dimensionality of the problem at hand. Interestingly, the smaller problem implies a set or class of larger problems through an inverse process or relation. Thus the smaller problem is a generalization of a class of larger problems that can potentially be reduced to the smaller problem. This generalization increases the power and speed of data processing, takes less memory to store, and allows decisions to be made more quickly and easily. It also enables the breaking-up of very difficult problems into much smaller and easier chunks.
It is possible to reduce the dimensionality of a problem without losing information, but in that case, the information that would normally be lost must be encoded in the process of abstraction somehow. Computer lossless compression algorithms have this feature of being able to reduce data sets without losing information due to the fact that the information is partially encoded in the algorithm itself. The compressed data might possibly be considered a generalization of a class of larger data sets, but the decompression algorithm will only ever produce one unique larger data set.
In order to have an abstractum, there must be something or set of somethings being abstracted from. Consider a specific person, such as Abraham Lincoln. What I really mean to say is "consider the referent or exemplification of the name or denotation Abraham Lincoln." The real person being referred to is a concrete object, whereas the image or idea or knowledge of the real person is an abstract object, abstractum, or representation. There is much information about the concrete Abraham Lincoln that we can abstract into more general traits. For instance, based on the definitions of the following terms, we know that he was a US President, a person, a mammal, an animal, and a living organism. Each successive term is a respectively more general trait of Abraham Lincoln, containing less and less distinctive information. If we remove all information pertaining to Abraham Lincoln, perhaps the only thing we can say is that he existed, if even that. The important thing to note is that these traits can be recursively abstracted from each other: living organism is an abstraction of animal, which is an abstraction of mammal, and so on all the way down to the concrete Abraham Lincoln. [2]
[2] On a semi-related note, doesn't it seem that cladistic phylogeny is a taxonomy of abstracta in the field of biology?
IV. Abstraction by the senses: how neurons work.
Neurons are cells within the nervous system that transmit and store information throughout the body (see Image 1 near the end of the essay for a detailed picture of a neuron). They accomplish this feat in a rather elegant and interesting way. A neuron has dendrites for the input of stimuli, a cell body or soma for the production of action potential signals to be transmitted, an axon to propagate signals along, and synapses at the end of the axon that interface with dendrites of other neurons over synaptic clefts. The neuron maintains a voltage potential difference across its membranes through the use of ion pumps/channels embedded in the cell membrane. This voltage potential changes when the dendrites of the cell are stimulated and ion channels are opened in the cell membrane. When this voltage potential is increased beyond a certain threshold, the soma or body of the neuron produces what is called an action potential, which is simply a narrow voltage spike. This spike in voltage results in a spiked electric field which induces ion channels in the membrane of the axon to open up and allow ions to flow through. As the ions flow through the ion channels in the axon, the voltage potential in that localized area spikes and induces more ion channels further down the axon to open. Thus the action potential propagates down the axon through this process of induced ion channel opening. When the action potential reaches the synapses, an amount of chemical neurotransmitter is released across the synaptic clefts which binds to and activates receptors in the post-synaptic membrane of the dendrites of other neurons. The more stimulated the first neuron is, the faster it generates action potentials to send to the next neurons. As a neuron transmits action potentials more and more over the course of its life, its synapses become better and more efficient at producing and releasing neural transmitter. This effectively reinforces the neural pathway, making a particular set of neurons more likely to fire, and is the basis for memory. (Kandel, Schwartz 2000)
There are different kinds of neurons that have different geometries and purposes within the nervous system, but they all operate by the same principles given above. Sense and perception start with the stimulation of sensory neurons by analog signals in the immediate external environment. For instance, light (in the visible spectrum) stimulates the neural photoreceptors in the eye, starting a chain reaction of neurons generating action potentials which eventually makes its way to the visual cortex in the occipital lobe of the brain for neural image processing. The first level of abstraction in the body occurs at these sensory neurons; sense is the most basic mechanism of abstraction. Analog signals from the environment get converted into essentially digital pulse-trains of action potentials. It is important to note that these action potentials don't represent anything in particular absent a context within a neural framework. In other words, the action potentials don't have any semantic meaning except to say that a particular neuron got stimulated. It is not possible to infer the external stimulus or even the type of external stimulus from an action potential. We cannot say that an action potential represents the light reflecting off of the surface of some object, for instance. We cannot even say that a set of action potentials or neural firings represents light reflecting off of some object unless we have a mechanism that can interpret this set of neural firings as such. So where is the abstraction? The conversion of analog environmental stimulus into digital neural firings is the very first level of abstraction. All we can say at this point is that some neurons fired due to some kind of external stimulus that the body sensed.
Let's look at how computers work as a comparison. Computers organize, store, and transmit information in the form of bits. Bits, like action potentials, are implemented with voltages: if a voltage at a given time and place is above a certain threshold, it is interpreted (by us, the designers) as "high" or "one", and as "low" or "zero" otherwise. The voltages are manipulated to produce predictable strings of abstract ones and zeroes that don't necessarily represent anything in particular. Computer programs, which are themselves abstract strings of ones and zeroes, provide a context and mechanism of interpretation for other abstract strings. Computers are designed to abstract electricity into binary symbols, binary symbols into algorithms, and algorithms into programs. The output or results of these programs are then communicated (which is another abstraction) out of the system to people and other devices through monitors and speakers and other kinds of media. The first level of abstraction for a computer is turning analog user inputs, such as key presses, into digital bit streams. The computer is essentially sensing the user input through its user interfaces, which are analogous to sensory organs. But none of these abstractions have any meaning absent a context and mechanism of interpretation. How could it be otherwise? A single bit has as much inherent meaning as a single action potential, that is to say no inherent meaning at all. A single bit or action potential could in principle represent anything and everything there is to represent as long as there was a mechanism to interpret it thus. What is the mechanism that abstracts and interprets information from sensory data?
V. Perceptual abstraction from sensory data.
Senses are imperfect, which is to say that they offer incomplete and imprecise reports of the external world. There is a certain maximum and minimum resolution of external phenomena that the senses can reliably detect. For instance, human eyes cannot see light or other RF (Radio Frequency) waveforms outside of the (human) visible spectrum, human ears cannot hear frequencies outside of the (human) audible spectrum, and human smell and taste cannot discern all types of molecules. These limitations are not fundamental, however, as other animals clearly respond to external stimulus undetectable by humans. Human senses are thus incomplete in their reporting of physical reality. But even the data that is reported by the senses is not always reliable. The senses in general have non-linear operating ranges where the data reported is confounded by the channel characteristics of the senses, i.e. the senses report certain ranges of stimuli differently than other ranges, thus leading to an imprecise representation of the physical phenomena. Even if the channel characteristics of the senses were perfectly linear in their operating ranges, the fact that there are only finite sensory neurons which are unevenly distributed in unknowable locations that can only digitize the external phenomena at a fairly slow rate (relative to electronic digitizers) means that the senses are severely limited in their precision. Given the limitations of the data reported by the senses, how can information be abstracted from them? And what kinds of information can be abstracted from the senses?
Once we have all of this sensory data abstracted from the external environment, what happens? What do we do with it? At the most basic level, some sensory data gets used in its raw state and never gets abstracted further. The patellar or knee-jerk reflex is a tendon reflex that controls increasing muscle tension by causing muscle relaxation before tension force becomes so great it may damage the muscle (Wikipedia). It is the result of a sensory neural stimulation traveling from the point of impact below the knee cap to the base of the spinal cord and then back to the quadriceps muscle, which flexes as a result of the neural stimulus, kicking the leg out. In this case, we have an analog external stimulus being converted to digital neural firings which travel a little bit before getting turned into an analog muscle contraction. In this very narrow context, there is no further abstraction of the sensory data. Organisms without complex nervous systems likely don't do any abstraction beyond this simple sensory abstraction, with all their actions being purely reflexive. As the complexity of the nervous system increases, there is a greater potential to abstract information from the senses, allowing the organism to reflexively react to specific patterns in the information rather than just the raw sensory data. This second level of abstraction I will define as (low-level) Perception.
Perception is the abstraction that emerges directly from the senses; it encompasses the mechanisms by which information is extracted from sensory data. This definition is a bit different from other definitions in the philosophical literature which rely on the notion of awareness, so perhaps a different term should be used to avoid confusion. Above I mentioned organisms that might reflexively react to certain patterns detected in the perceptual information abstracted from the sensory data. As a thought experiment, let's imagine that there is an organism that has a sense that reports a small change in the environment at a certain time. As time goes by, more and more neurons of the sense start reporting a change and the originally stimulated neurons start reporting ever greater changes. From this raw sensory data, the organism abstracts information and effectively perceives that there is an object approaching from the rough direction of the perpendicular gradient of the center of the stimulated neurons [3]. The perception of this kind of pattern in the sensory data triggers a reflexive response of the organism which might either be equivalent to avoiding or seeking, depending on the organism and the reports from the other senses. The organism could be said to be aware of motion in the external environment, but this awareness is not necessarily conscious or reflected upon; the organism is perhaps as aware of the environment as a similarly structured robot would be.
[3] The perpendicular of the gradient is not actually calculated by the organism, rather I imagine it sort of emerges as a property of the stimulated neurons through the mechanism of perception.
Up to now, I've claimed that there is such a thing as perception without explaining where it comes from. Quite simply, once we have data in the realm of abstraction thanks to the senses, the data can be manipulated and interpreted in the same way a finite-state machine (read: computer) would do it. In other words, once the information has passed from the realm of the concrete to the realm of the abstract, all we need is a neural network to perceive the patterns in the data and to do further abstractions. We already know we can artificially simulate biological neurons with computers, to a degree, in order to abstract complex bits of information from the environment. And we already know how to make robots with senses that can use this information in useful, natural, and interesting ways to interact with the environment. In fact, many of the techniques we use in robotics and computer science were inspired by models of how our own minds might work. So it's not a great leap to suppose that perception should naturally emerge from a collection of neurons in a network (or several networks), assuming that the networks capable of such a feet are evolutionarily viable.
VI. Thoughts on recursive abstraction and feedback.
Now we're approaching the limits of what we know and can usefully describe. Sense is a direct abstraction of physical reality, and perception is a direct abstraction of sensory data. As long as there is sufficient enough information in the perceptual data, it should be possible to recursively abstract bits and pieces of that information over many iterations. For the uninformed, recursion is the process of iteratively transforming a data set by repeatedly using the same function or process. The basic idea is this: 1) get sensory data; 2) abstract information from sensory data through the mechanism of perception; 3) abstract more information from the perceptual data; 4) recursively apply 3). In order for this process to work, there has to be a feedback mechanism to allow the output of a neural network to influence its own behavior and perhaps the behaviors of neural networks at other points in the chain of abstractions. In other words, the outputs of some neural networks must partially function as their own inputs. There are a couple of ways to accomplish this: 1) the neural network can have its output connected to its input, either directly or via other neurons; or 2) the neural network can have a built in mechanism of memory where each neuron keeps a running and exponentially decaying tally of how often it gets stimulated. Both of these mechanisms are used in brains. Not only are these mechanisms necessary for recursive abstraction, but I daresay they are sufficient as well, i.e. recursive abstraction will naturally occur given these mechanisms of neural network feedback.
Here is a just-so story of human abstraction. We acquire information about the world first through the senses. There are some reflexive reactions at this point, but we are largely unconscious of them. As the sensory information reaches specialized processing centers, the process of perception starts to occur, though still not necessarily anything we are consciously aware of. The transmission of this information through the nervous system partially gets stored in the efficiency of the synapses of the stimulated neurons. This memory combined with new external stimuli and the stimuli still ruminating in higher brain functions dynamically alters the behavior of the neural networks. New and perhaps more general abstractions emerge from the interplay of the cognitive faculties as manifest in the dynamically changing neural networks. The genesis of consciousness lies in these recursive and dynamic abstractions.
VII. Concluding remarks.
To understand how our minds work requires us to understand the ontological nature of abstraction and information, the ways in which these concepts are bound to reality. In this analysis, I have taken the Nominalist stance of supposing the abstract to have an ontology that is unified with that of the physical realm. Many have disputed this position, and many would still dispute this position today, but I fail to see a viable alternative that allows us as much expressive explanatory power. That being the case, I have lobbied that the neuron should be considered the physical interface between the concrete and the abstract, and that further simple levels of abstraction are emergent from perceptual mechanisms. From this, it is not difficult to suppose that memory and feedback lead to recursive levels of abstraction that are more complex and interesting while ironically being concerned with less and less detailed information. If there is a viable alternative to this viewpoint, it is not clear to me what it could possibly be.
A. Images.
Image 1: Anatomy of a neuron.
http://upload.wikimedia.org/wikipedia/commons/a/a9/Complete_neuron_cell_diagram_en.svg
http://upload.wikimedia.org/wikipedia/commons/a/a9/Complete_neuron_cell_diagram_en.svg
B. Bibliography.
- D'Ambrose, Chris (2003), "Frequency Range of Human Hearing", The Physics Factbook, http://hypertextbook.com/facts/2003/ChrisDAmbrose.shtml
- Kandel ER, Schwartz JH, Jessell TM (2000), "Principles of Neural Science", 4th ed. McGraw-Hill, New York
- Mahoney, Matt (Aug. 20, 2006), "Rationale for a Large Text Compression Benchmark", http://cs.fit.edu/~mmahoney/compression/rationale.html
- Rosen, Gideon (2001), "Abstract Objects", Stanford Encyclopedia of Philosophy, http://plato.stanford.edu/entries/abstract-objects/
- Shannon, Claude E., (1948), "A Mathematical Theory of Communication", Bell System Technical Journal, Vol. 27, pp. 379–423, 623–656.
- Stufflebeam, Robert (2008), "Neurons, Synapses, Action Potentials, and Neurotransmission", http://www.mind.ilstu.edu/curriculum/neurons_intro/neurons_intro.php
- Wikipedia, (2010), "Patellar Reflex", http://en.wikipedia.org/wiki/Patellar_reflex