This book may be purchased
from Amazon.Com

Do Seated Souls Experience Slumberous Sensations?
Review of The Engine of Reason, The Seat of the Soul by Paul Churchland

Luciano da Fontoura Costa
Cybernetic Vision Research Group
IFSC - University of Sao Paulo
Caixa Postal 369
13560-970 Sao Carlos, SP
BRAZIL

luciano@ifqsc.sc.usp.br

Copyright (c) Luciano da Fontoura Costa 1996
Received: November 21, 1995; Accepted: April 18, 1996; Published: June 18, 1996

PSYCHE, 2(29), June 1996
http://psyche.cs.monash.edu.au/v2/psyche-2-29-dacosta.html

KEYWORDS: neurocomputing, consciousness, brain research, recurrent networks, vector coding

REVIEW OF: Paul Churchland (1995) The Engine of Reason, The Seat of the Soul. A Bradford Book/The MIT Press. i-xii + 330 pp., 89 illus. Price: $US29.95 pbk. ISBN: 0-262-03224-4.

1. Introduction

'I feel, therefore I can appreciate ice cream.' (Folk Psychology)

'My personal computer has recurrent networks, therefore it can appreciate ice cream.' (New Folk Psychology)
The Engine of Reason, The Seat of the Soul (Churchland, 1995), EORSOS for short, is composed of two main parts. Whereas the first part presents a well motivated and accessible introduction to important issues in neurocomputing, the following part mainly dwells on controversial philosophical issues concerning consciousness, morality and society. The following sections present a critical overview of Paul Churchland's new book.

2. A Tasteful Introduction to Neurocomputing

One of the main purposes of the first part of EORSOS is to acquaint the general reader with the emerging area of neurocomputation, and especially with the concepts of vector coding, vector completion and recurrent neural networks, which play an important role throughout the book. The first chapter introduces some of the elementary concepts in neuroscience, including a whimsical analogy between TV sets and the retinal and cortical neural surfaces, as well as a description of the benefits of PDP (parallel distributed processing -- i.e., neural net computation). After an 'heroic-age-of-science' section, in which Churchland points out that seemingly controversial hypotheses have proven correct in the past and so might be expected to be proven correct in the future as well, the main objectives of the book are outlined. These are, generally speaking, introducing the general reader to the new theories and results from neurocomputational research (both natural and artificial), and, he hopes, motivating him or her to rethink our mental lives and to participate in the debates surrounding these new developments.

The second chapter deals with the important concept of vector coding, which is illustrated through examples such as the encoding of smells and faces. By not addressing the differences between vector coding and local coding, Paul Churchland has avoided a tricky question (for a discussion see da F. Costa, 1995). There is little doubt that vector coding (the representation of information in multi-dimensional state spaces) is an important computational and representational concept, as it has been known by researchers in statistic pattern recognition for a long time.

Chapter 3 addresses the issue of vector processing. It starts by presenting a very simple example of how a neuron can be configured to perform simple pattern recognition and follows this by discussing Cottrell's neural system for face recognition. The important concepts of synaptic adjustment (represented here by the backpropagation paradigm), internal coding and distributed representations, the emergence of categories, and inductive inference are also presented and illustrated in a didactic and well-motivated fashion. Two observations, however. The first relates to the fact that the network for face recognition has produced receptive fields (an interesting concept from neuroscience of the sensory region which activates a neuron, and which has not been introduced in EORSOS) that extend throughout the whole image area, thus enhancing fault-tolerance. By not emphasizing that such an emergent feature is to a large extent determined by the way in which the image space is covered by the first neural layer, Churchland seems to imply that such extensive receptive fields are common in neural systems. However, had a more localized mapping been used instead, more localized receptive fields would have been obtained which, in addition to being biologically more realistic, would also provide fault-tolerance and account for more effective (less redundant) and compact hierarchical representations of the visual information (e.g., in terms of straight line segments). Paul Churchland does observe that the neural structure for face recognition under consideration is unlikely to be faithful to nature -- however, ironically, only with respect to its third and fourth layers. The second observation regarding this chapter is that Cottrell's neural structure has been presented in EORSOS as a kind of major novel breakthrough (see the second paragraph in page 40). There is no doubt that Cottrell's is an interesting application, but it does not depart substantially from the traditional feedforward model. In addition, there are a number of alternative neural approaches to face recognition capable of reasonable performance, such as those developed by I. Alexander and T. Poggio, which have been around for a long time.

Neural vector processing is further explored in Chapter 4, again from the exclusive perspective of feedforward neural networks, as a means of imitating parts of the brain. The first example addressed is Paul Churchland's model for stereo cortical processing. This model, which relies upon a specific geometry of synaptic connections (greatly simplifying the problem), is used to introduce the concepts of excitatory and inhibitory neural interconnections. Churchland's 'Lilliputian effect' illustrating stereoscopic vision (a pair of images of Manhattan island as seen by a giant) is remarkable and is itself worth the stereoscope which accompanies the book. The second example consists of Sejnowski and Gorman's network for sonar signature classification, a nice illustration of a situation where an artificial neural solution has beaten human expertise. NETtalk is taken for the third case study, whose principal merits reside in illustrating the potential of artificial neural nets for learning (although in supervised fashion) and in building categories. However, it should be noted that its initial problem (getting a computer to read printed text aloud) is in fact not as complex as it may at first appear, for this is a problem that could have been solved straightforwardly by the simple alternative of table look-up, i.e., by adopting a database containing the correct pronunciation for each possible English word. The fourth example, namely sensorimotor integration through vector transformation, is nicely illustrated with the aid of Paul Churchland's famous crab. Although the CalTech bias exhibited by the selection of examples does support a nice didactic illustration of several important aspects of neural nets, I felt that the inclusion of some alternative models -- such as Kohonen's self-organizing maps and the more recently introduced radial-basis neural networks -- would have served quite as well, while also introducing some of promising biological features by which they have been inspired.

Recurrent networks are presented in Chapter 5 mainly as a means of implementing the temporal dimension missing in the feedforward paradigm. The potential of such nets for generating and recognizing sequential patterns as well as their relationship with the ambiguous figure/background phenomenon are neatly described. This chapter also presents some exciting ideas relating recurrent networks to recognition, understanding and scientific progress, particularly from the perspective of vector coding and completion.

The title of Chapter 6, namely 'The Neural Representation of the Social World,' is self-explanatory. Here it is argued that recurrent networks as well as the related processing paradigms (e.g. vector processing, vector completion) are the principal mechanisms through which higher level activities such as language and social integration are achieved by humans. This chapter starts by briefly presenting and discussing Cottrell's EMPATH network, a variant of the face recognition system discussed above, that is dedicated to emotion recognition. Churchland uses EMPATH to illustrate a situation in which a neural network has proven to perform in a behavioral task (narrowly construed) almost as well as a human. However, the net is in fact doing nothing other than computing the spatial correlations between sets of pixels in images that are nicely registered (i.e., spatially aligned) and are devoid of any substantial geometrical transformations -- limitations which human performance considerably outstrips. In addition to performing expert classification of emotions, and even of more sophisticated patterns, humans can precisely characterize such emotions and their contexts. It could be argued that a system for face recognition does not need to do anything more complex than to classify facial patterns by outputting a code number. Such a point of view, one that is in fact shared by a large part of the pattern recognition community, may itself be one of the reasons for the current lack of powerful and general recognition systems. For how can a hysterical laugh be distinguished of a common laugh if not by analyzing its whole context? Indeed, pattern recognition should not be limited to trivial object classification, but should also involve the perception of the multi-dimensional universe of related aspects, such as the function of the object, its typical context, its relationship with the rest of the world, its category, temporal dimension, and so on. Considering that such comprehensive knowledge is essential as a means of enhancing the recognition system performance, it may take some time before neural networks are developed which are fully able to perform in complex situations such as dealing with social representations.

The remainder of Chapter 6 deals with human linguistic ability, a discussion of Elman's neural approach to grammatical discrimination and the issue of moral perception and understanding. Making a long story short, this section is dedicated to the hypothesis that linguistic and moral skills are not rule-based, but are related to hierarchical learned prototypes stored in the neural system. The truth may, however, prove to lie somewhere in between rules and prototypes, for we do employ rules from time to time.

Chapter 7 treats many problems which can affect the human brain and behavior. This chapter starts by presenting in an uncomplicated manner some of the techniques which have been used as means of unveiling the anatomical and functional structure of the human brain, namely CAT, PET and MRI scans. Then, a number of diseases such as apraxia, MS, ALS, the striking cases of blindness denial and hemi-neglect, schizophrenia, manic-depression and depressive illnesses are characterized and discussed. It is stated that a better knowledge of biological neural networks can help us to develop improved treatments for such disorders, a prospect that no reasonable person would dare deny nowadays.

3. Seated Souls and Their Social and Moral Implications

The first chapter in the second part of EORSOS is fittingly entitled 'The Puzzle of Consciousness.' Not so fittingly, a variant of the 'heroic-age-of-science' argument -- in this case with respect to the fact that vitalism in biology has been replaced in surprising fashion by physical processes such as DNA -- is invoked as the means of introducing the controversies of consciousness research to the general reader. The great shortcoming of this approach, as expressed by Paul Churchland himself, is that such parallels provide no implications at all for the nature of consciousness. So, why to use them? In the following four sections of EORSOS, Leibnitz's mite example (imagine yourself as small as a mite wandering around inside your brain; presumably you will fail to find anything answering to conscious experience), Nagel's bat, Jackson's neuroscientist Mary and Searle's Chinese room are all briefly described and dismissed (in different counterarguments). The inclusion of the relatively less known and simple Leibnitz mite argument is used as a kind of basic metaphor to undermine the subsequent philosophical positions. Although Paul Churchland's dismissal of such positions is reasonable from the perspective of purely mechanical aspects of consciousness, I feel that the key point of consciousness, namely its qualitative experiential aspect (e.g. Chalmers, 1993, 1996), has been almost completely missed. Having bypassed that tricky question, the subsequent sections in Churchland's book become constrained to the classical scheme identified by Chalmers (1996), i.e., to showing that the mechanical aspects of consciousness can be physically explained and even artificially implemented, in this specific case by using recurrent neural networks.

The issue whether an electronic machine can be conscious or not provides the theme for the next chapter, which starts with an interesting report regarding Paul Churchland's participation as an observer in the hidden room of Loebner's annual Turing Test Competition in 1993. The fact that five judges took one of the human participants to be a machine is understood by Paul Churchland as one of the indications that the Turing test should be reformulated in order to incorporate other cognitive features such as learning, recognition and conceptual change. Perhaps the cleverest feature of the Turing test is that it relies entirely on the fact that human intelligence is a relative concept and a concept that can best be assessed by humans. The next issue addressed in Chapter 9 consists of Mead's silicon retina, which is presented as a successful attempt to emulate a natural neural network, in this case the primate retina, in a dedicated and highly compact microchip. Some traditional objections to machine intelligence, namely how to provide machine states with meaning, the capacity for doing mathematics and the qualitative characters of conscious experience, are discussed subsequently. While the first two issues are represented as a critical discussion of Searle's and Penrose's positions respectively, Churchland approaches the third issue from the perspective of the 'out of fashion' functionalist approach to consciousness and, since according to Churchland this endeavor seems to have failed (cf. p. 251), the conclusion is (presumably) that the qualitative aspects, as well as other major features of human consciousness, are nothing else but a direct consequence of the physical configuration of the nervous system.

The next chapter -- 'Language, Science, Politics, and Art' -- ranges over all of the broad-brush themes. Strictly speaking, religion is also addressed. Issues such as intelligence differences between humans and other animal species, the question whether language is unique to humans and Dennett's language-oriented theory of consciousness are discussed in an enlightening fashion. The rest of the chapter addresses the hypothesis that scientific creativity, cognitive and moral abilities and even the arts can be explained from the perspective of neurocomputation.

The last chapter, entitled 'Neurotechnology and Human Life,' addresses the implications that the advances from neuroscience and neurocomputation may have on our lives, including their medical, legal and scientific impact as well as that upon our own self-concept. Churchland reasonably forecasts that applications of artificial recurrent networks will include medical diagnosis and treatment as well as the assessment of criminals. The controversial problems of abortion, corrective policies, and scientific advancement are also discussed from the neurocomputational perspective. Although many of the positions expressed in this chapter are provocative, there is little doubt that advances from neurocomputational research, not to mention the progresses in genetic engineering, are bound to change the way in which we see the world and ourselves. Whether this might also lead to an improved moral and social standards is considerably less clear. Indeed, to judge from previous interference of science in such areas (e.g., phrenology, polygraphs, Bell curves and other freaks from the scientific menagerie), the prospects here may not be so rosy.

4. Concluding Remarks

Churchland's EORSOS is undeniably a well motivated and clearly written book. While its first part provides a nice introductory survey to neuroscience and neurocomputing, including the important concepts of vector coding, vector completion, PDP and recurrent networks, its second part covers the implications that such neurocomputing concepts the author believes will have on consciousness research, morality, science and society. Through the extensive use of graphics, metaphors, case studies and an excellent didactic technique Paul Churchland has successfully accomplished his main objectives in writing this exciting and provocative book. However, there are occasions throughout EORSOS in which artificial neural networks, and particularly those that goes under the name of recurrent networks, have been presented in a somewhat overenthusiastic fashion. Indeed, one of the positions defended in this book is that the whole plethora of remarkable behavioral and intellectual abilities exhibited by humans can be explained in terms of (artificial and natural) recurrent neural networks. This point seems to be supported by the following line of thought: (i) the recurrent nets in the human brain are responsible for virtually all of our intellectual abilities; (ii) recurrent networks are defined to be similar to those in the human brain; (iii) artificial recurrent networks are consequently capable of emulating the brain. The problem resides in hypothesis (ii), since there is an enormous gap between biological networks and the current artificial neural models, recurrent or not. (And, indeed, in connectionist research recurrent nets are not at all defined as being similar to the brain: recurrent nets simply add to artificial feedforward neural nets a feedback mechanism for retaining context, in a kind of 'short term memory'; cf. Ellman, 1990.) Consider the following facts:

(A) If the current neural paradigm is so good, why do we have no artificial neural structures capable of addressing actually challenging problems such as versatile pattern recognition and vision? Whereas it is relatively easy to design dedicated pattern recognition systems capable of 90% or even 98% correct classifications, it becomes exceedingly difficult to improve on such performance. It is likely that near 100% performance can only be achieved by incorporating additional information such as the spatio-temporal context (see above) of the objects under analysis, a task that is generally accomplished with expertise by human recurrent networks. This is not to say that artificial recurrent networks will never be able to perform such complex tasks, but only that the current nets do not.

(B) Most of the neural network techniques are not well understood in mathematical terms. The major problem is that we need such a knowledge in order to validate each neural network with respect to specific applications, particularly the most critical ones (e.g., criminal analysis and medical care), since artificial neural nets are plagued by shortcomings such as the plasticity/stability compromise, inherent noise, and many other problems. It is true that our recurrent nets are also not completely reliable, but we need at least to determine that the error rates of such artificial neural systems are not worse than ours.

(C) There has been a general lack of effort towards assessing the performance of artificial neural networks comparatively with alternative approaches to pattern recognition and classification, and Paul Churchland provides no exception to the trend. The fact of the matter is that alternative processing strategies such as statistical and probabilistic classification (in, for example, the computer programs: AUTOCLASS, Cheeseman et al., 1988; C4.5, Quinlan, 1993; and SNOB, Wallace, 1990; finally see Michie et al., 1994, for an experimental comparison of statistical, AI and neural network techniques), an area which is itself a subject of intense research, do provide effective solutions to many, if not all, of the problems to which artificial neural networks have been applied. Is it sensible to overlook such a great asset? It is true that statistical approaches are not inherently parallel, but the area of concurrent processing (e.g. pipelining, systolic arrays, vector processing) and fault-tolerant systems seems to have come of age (the ubiquitous Intel processors used in personal computers present in themselves a definite trend toward becoming truly parallel systems). It is a pity that statistical artificial intelligence has been overlooked in EORSOS, for there is a strong tendency towards hybrid systems in which the advantages and disadvantages of the neural and statistical approaches could be used complementary fashion.

In brief, although there is little doubt that biological recurrent networks are capable of producing human abilities, this endeavor is beyond any doubt far beyond the capabilities of the currently available artificial recurrent neural networks.

As far as consciousness is concerned, I feel that its most tricky feature, namely qualia or qualitative experience, has been overlooked, perhaps due to the same reasoning as that outlined above: (i) recurrent networks in the human brain are capable of sensation; (ii) artificial recurrent networks are defined to be analogous to those in the human brain; (iii) artificial recurrent networks are consequently capable of sensation. While this is possibly true as far as the mechanical aspects of consciousness are concerned, such a reasoning appears to rely on the very functionalism which Churchland elsewhere rejects, since it presupposes that all aspects of consciousness depend upon the functional features of the brain, which can be recapitulated in the hardware of artificial recurrent neural networks. The 'hard problem' of Chalmers (1996), of spanning the gap between function and feel, is hardly even addressed. But it is precisely here that the controversies about consciousness, swirling around Searle, Penrose and others, have found their energy.

I conclude that, in spite of all the well-intended approaches to the problems of consciousness such as Churchland's here, the qualitative aspect of consciousness remains as puzzling as ever. Despite all the scientific-technological advances over the centuries, perhaps we have reached a point of being forced to simply acknowledge our ignorance without prospect of any quick improvement in that state. This is by no means accepting 'mysterianism' or supernatural concepts. On the contrary, it is only by precisely characterizing our lack of knowledge that solid basis for further scientific advances can be achieved. After all, there are worse things than not knowing the answers to our questions.

Acknowledgements

The author is indebted to FAPESP and CNPq for financial help and to Dr K. Korb (Monash University, Australia) for invaluable suggestions and enhancements to the present text.

References

Chalmers, D. J. (1993). Toward a theory of consciousness. PhD thesis, Indiana University.

Chalmers, D. J. (1996). 'Facing up to the problem of consciousness.' In Proceedings: Toward a scientific basis for consciousness. MIT Press.

Cheeseman, P., Self, M., Stutz, J., Kelly, J., Taylor, W. and D. Freeman (1988) 'Autoclass: a Bayesian classification system,' 5th Int. Conf. on Machine Learning.

Churchland, P.M. (1995). The engine of reason, the seat of the soul. MIT Press.

da F. Costa, L. (1995). 'On the most efficient and versatile real-time image processing system - A review of The computational brain,' Real-Time Imaging, 1, 87-90.

Ellman, J. (1990) 'Finding structure in time,' Cognitive Science, 14, 179-211.

Michie, D., Spieghalter,D.J. and Taylor, C.C. (eds.) (1994) Machine learning, neural and statistical classification. Ellis Horwood.

Quinlan, J.R. (1993) C4.5: programs for machine learning. San Mateo, Calif.: Morgan Kaufmann.

Searle, J. R. (1992). The rediscovery of the mind. MIT Press.

Wallace, C.S. (1990) 'Classification by minimum-message-length encoding,' Advances in Computing and Information - ICCI '90 (pp. 72-81). Springer Verlag.


Return to PSYCHE home page