Working Draft -- Somewhat incomplete and unpolished
Comments/criticisms are most welcome!

SYNTAX, CONTENT AND FUNCTIONALISM:
WHAT IS WRONG WITH THE SYNTACTIC THEORY OF MIND

MURAT AYDEDE

The University of Chicago
Department of Philosophy
1010 East 59th Street
Chicago, IL 60637
(E-MAIL: m-aydede@uchicago.edu)
1993/97

ABSTRACT. I argue that Stich's Syntactic Theory of Mind (STM) and a naturalistic narrow content functionalism run on a Language of Though story have the same exact structure. I elaborate on the argument that narrow content functionalism is either irremediably holistic in a rather destructive sense, or else doesn't have the resources for individuating contents interpersonally. So I show that, contrary to his own advertisement, Stich's STM has exactly the same problems (like holism, vagueness, observer-relativity, etc.) that he claims plague content-based psychologies. So STM can't be any better than the Representational Theory of Mind (RTM) in its prospects for forming the foundations of a scientifically respectable psychology, whether or not RTM has the problems that Stich claims it does.


TABLE OF CONTENTS

1. Introduction
2. The Problem Space according to Stich
3. What Is STM?
4. Some Curious Aspects of STM
5. The Alleged Superiority of STM

5.1 How STM Is Supposed to Be Superior
5.2 The Parallel Disadvantages of STM-Style Theories

5.2.1 The Referential Dimension and Narrow Content
5.2.2 Doxastic Similarity Dimension and the Holism Problem

5.2.2.1 Stich's "Holism"
5.2.2.2 Holism and STM

5.2.3 Dimension of Narrow Causal Pattern Similarity

6. The NCA and the Type Individuation of Brain Sentences

6.1 Troubles with Content Functionalism
6.2 Connections to Stimuli and Behavior

7. Why a Purely Syntactic Psychology Cannot Get off the Ground
8. If Cognition Is Computational, How Can Psychological Laws be Intentional?
9. REFERENCES
10. NOTES


1. Introduction

There is a thesis often aired by some philosophers of psychology that syntax is all we need and there is no need to advert to intentional/semantic properties of symbols for purposes of psychological explanation. Indeed, the worry has been present since the first explicit articulation of the so-called Computational Theory of Mind (CTM). Even Fodor, who is the most ardent defender of the Language of Thought Hypothesis (which requires the CTM), has raised worries about its apparent consequences. The worry can be put in the form of a question, which Fodor called the "Eponymous Question" alluding to the title of a chapter in his recent book (1994): If cognition is computational, how can psychological laws be intentional? This question has been haunting people working in the field at least since the publication of a paper in 1978 by Stich in which he gave his celebrated "autonomy argument". Then, as everybody knows, came Fodor's notorious "Methodological Solipsism" in 1980, in which he argued for the formality condition: namely that thought processes are causal sequences of symbol tokenings in one's language of thought (LOT) and the causal processes are sensitive only to the syntactic/formal properties of its symbols. Hence, he argued against a "naturalistic psychology," i.e. a psychology whose laws essentially advert to broad semantic properties of mental states they cover. The alternative, rationalist psychology, according to Fodor, was to advert only to formal characteristics of symbols, of which Fodor conceived as narrow computational roles of LOT symbols.

Stich's 1983 book, From Folk Psychology to Cognitive Science, was the culmination of the worries presented in fact not as so much worries but rather as a sustained argument against the possibility of a scientific intentional psychology (along with the common sense belief-desire psychology) and for a syntactic way of doing psychology, i.e. for his much discussed Syntactic Theory of Mind (STM). He defended an eliminativist stance: STM involves the elimination of all intentional idioms proposed to be used in a scientific enterprise, hence envisions a scientific psychology free of a semantics. STM has been around and very influential for over a decade now, since it has usually been taken to articulate the paradox alleged to underlie the LOTH, which was to vindicate intentional folk psychology through computationalism. For this reason, I will concentrate on Stich's book in what follows, and argue that the worries are altogether baseless, that a computational theory needs a semantic individuative scheme to get off the ground, and that the envisioned alternative, i.e. a pure CTM, or STM is a non-starter, cannot do the job. Although there are probably few actual adherents of STM nowadays or the vision it has of a scientific psychology, to my mind, no one so far has been able to conclusively refute the theory. Indeed, as recently as 1994, Fodor raised the worry and tried to answer it by showing the feasibility of its alternative, and not by directly attacking the syntacticalist claim. In what follows, I intend to refute once and for all not only the STM but also the kind of semantic free psychology it envisions, thus answering Fodor's Eponymous Question. Along the way, I will also attack narrow content functionalism as a naturalistic semantic program, since, as we'll see, it will turn out to have a very close relation to STM!

In his book, Stich has argued basically for two claims. First, the application conditions of such intentional common sense predicates as `believes that P' and `desires that Q' are essentially vague, context-sensitive, observer-relative, and thus are not suitable to be used as stable projectible predicates in the vocabulary of a scientific psychology. In particular, since, according to Stich, observer relativity partly stems from the fact that content ascriptions are essentially based on similarity judgments along different dimensions (see below) between the ascriber and the ascribee, a consequence of observer relativity of ordinary content ascriptions is that a certain form of parochialism will profusely infect our psychological theories if we insist on having a content-based psychology which, according to Stich, essentially relies on ascribing such contents to agents covered by its generalizations. This means that content-based psychologies are bound to miss many important generalizations about the psychology of children, exotics, perceptually or cognitively handicapped people, higher animals, etc., since any content ascribed to these will necessarily reflect how cognitively similar the ascribee is to the ascriber.[1] In short, Stich thinks that content-based psychologies won't make respectable science.

His second general claim is that we don't need to advert to the content of mental states in doing psychology, "syntax" will be enough, and we had better advert only to the syntactic properties of mental states if we don't want to miss any psychological generalizations: STM offers a paradigm that has all the virtues of content-based psychologies and none of its vices.

These two claims are relatively independent of each other: in particular, the truth of the latter does not depend on the truth of the former. If Stich is right in his second claim, then the falsity of his former claim, i.e. his characterization of contentful mental states as scientifically problematic posits, would imply that content vocabulary is at best otiose in doing scientific psychology. It is therefore important to see whether Stich is right in his second claim. So my attack will be on his second claim, and I will leave the discussion of the first aside in this paper. In what follows I will argue for three basic claims.

The first is that when we see what STM is supposed to be, as presented by Stich, the claimed superiority of STM over content based psychologies totally disappears. Put differently, I will be arguing for a conditional claim: if Stich is right in his claim that content-based psychologies have the disadvantages he enumerates, then STM-style theories have exactly the parallel problems; so it is false that the STM framework is scientifically superior to content-based psychologies as conceived by Stich. Therefore, Stich will loose his primary motivation to promote STM. This is the topic of § 5. Although I think Stich is wrong about his first claim, I will say very little about it in what follows.

Secondly and more boldly, I will argue in § 6 that STM can't do the required job: it lacks the necessary resources to type individuate particular psychological states qua mapped onto particular "syntactic objects" as Stich puts it.

In this connection, let me bring out a curious and prima facie puzzling aspect of Stich's second claim, i.e., the claim that the only scientifically viable paradigm in psychology is and ought to be STM, by relating it to some worries that Fodor has about the nature of semantics. Fodor, as is well known, has opted for a purely denotational theory of meaning. He has various reasons for this, but the most important one is that the alternative, the functional role semantics or content functionalism in general, seems to have consequences that are destructive of scientific intentional psychology. The problem is supposed to be that any bit of functionalism in semantics inevitably leads to holism, or else has theoretical commitments that are very dubious on independent grounds.

Now, Stich, on the other hand, does think that one of the reasons why intentionally characterized mental states like beliefs and desires are ill-suited for scientific purposes is that their individuation conditions are inherently holistic, and therefore, intentionally characterized states are a mess beyond repair to serve as stable law-instantiating theoretical posits. So Stich offers his STM within which scientific psychology ought to be developed. But his STM, as I will present in a moment, is committed to what he calls the Narrow Causal Account (NCA) of typing brain states, which makes it a purely functionalist theory.

So, in a nutshell, here is the puzzle: Whereas Fodor thinks that it is because functionalism cannot individuate contents we have to develop denotational approaches in semantics if we want to have a scientific intentional psychology, Stich, on the other hand, thinks that it is because content is inherently holistic (among its other defects) that we have to develop purely functionalist but semantic free psychological theories if we want psychology to be scientifically respectable, hence his STM!

Intuitively, we may ask: If Fodor is generally right about what he thinks of functionalism in semantics, how can Stich think that his purely functionalist STM can be free of similar or parallel defects? Or, conversely, if Stich is right about what he thinks of functionalism in its capabilities to type brain states in a scientifically acceptable way, why can Fodor not utilize functionalism in his story about fixing the semantics of mental states?

Putting the question in this way might seem misleading to many. For one might object: Fodor's worry is about the role and place of functionalism in fixing the semantic content of Mentalese expressions, but Stich couldn't care less about it because his brand of functionalism has nothing to do with semantics; rather, Stich's claim is that the syntax of "sentences" realized in the brain should be fixed purely functionally and syntax is all we need for a scientific psychology.

My second claim will take up exactly this issue. What I'll argue is that given a certain picture of the cognitive mind in broad outline, a certain brand of functionalism, namely the one that is committed to the NCA, is equally problematic both in fixing the content of psychological states and in fixing their "syntax," especially as understood by Stich and, in certain of his moods, by Fodor as well. In arguing for my claim, my strategy will be as follows. I will show that STM is a purely functionalist theory whose structure is exactly parallel to content functionalism of the NCA variety (§ 3). Then I will show that such a content functionalism cannot type-individuate the particular semantic contents that are to be assigned to central psychological states (like beliefs and desires) on the basis of their narrow causal role (§ 6). I will then argue that the burden of the proof is on Stich to show how STM is supposed to accomplish the exactly parallel feat of assigning particular syntactic objects to psychologically relevant brain states on the basis of their narrow causal profile. I will conclude that STM can't type-individuate the particular brain states qua mapped to particular syntactic objects. So, STM, contrary to Stich's advertisement, will turn out to be incapable of providing a solid framework within which a scientifically respectable psychology can be pursued.

Thirdly, I will argue in § 7 that the STM-theorist is, at any rate, committed to intentional vocabulary at some stage of theory construction. Put differently, if the STM strategy is taken to claim, as Stich seems to intend, that it is possible and advisable to develop psychological theories without using any intentional scheme whatsoever no matter what the stage of theory construction is, then STM is false: psychological theory construction cannot get off the ground if the strictures of STM are firmly complied. Admittedly, this is an issue that belongs to the "context of discovery", but I believe it is still instructive to see why STM has to rely on intentional vocabulary at least in the initial stages of theory construction. For, this will reveal how wrong-headed Stich is both in his conception of syntax and the place of functionalism in psychology.

I will end (§ 8) by moralizing on Stich's failure, and point out that if STM (= Narrow Causal Account of typing symbols over which computation is defined, as I will show) and type-type idenity theory are false, as I will argue, then a content-based psychology (= intentional psychology) is practically mandatory. Hence, if cognition is computational, psychological laws have got to be intentional!

Since all my arguments crucially depend on what exactly STM is, I will present it in a way that its purely functionalist structure becomes explicit. This is the job of § 3. In particular, I will use a procedure similar to the one developed by Brian Loar (1982a) in his presentation of his own content functionalism. Indeed this is how I want to establish the exact structural parallelism between STM and a narrow content functionalism.

However, before embarking on my criticism, I need to say a little about how Stich views the problem space within which he criticizes content-based psychologies and thus motivates his own alternative, STM. In particular, it will turn out that the exact way in which Stich motivates his STM is very important, since my arguments against STM partly rely on his own strategy. So we will need this general background. This is the job of the next § 2.

In § 4, I will comment on some curious aspects of STM, especially on what apparently makes STM a "synactic" theory. I will argue that in one reading, STM has nothing to do with "syntax". In particular, I will present the reasons for why the notion of syntax Stich has in mind is not the notion of syntax that the Language of Thought Hypothesis (LOTH) requires. Hence, we will see that, despite the widely held opinion to the contrary, STM has very little to do with the Computational Theory of Mind (CTM).

2. The Problem Space according to Stich

Stich takes what he calls the Mental Sentence Theories as his starting point, and assumes their basic framework throughout his discussion. After a lengthy presentation of a Fodor-style LOTH he raises the following problem:

for a Fodor-style account of belief sentences to hang together, we must have some workable notion of what it is for two distinct people, speaking different languages, to have in their heads distinct tokens of the same sentence type. (Stich, 1983:43-4)

On behalf of Fodor, he offers three possible solutions: One of them is the Narrow Causal Account (NCA), according to which two sentence tokens count as type identical iff they have the same narrow causal/functional role. Since this is going to be of some importance, let me elaborate on it a bit. According to Stich, "[t]o adopt this view of... psychology is to exclude any reference to noncausal relations... There can be no mention of a subject's social setting, natural environment, or personal history, nor of the psychological characteristics of other people" (1983, p.22). This is what makes this kind of individuation narrow causal. It is narrow because the causal role in question is defined in terms of generalizations that detail nomological connections among proximal stimuli, behavior (like motor commands) and other central cognitive states. Moreover, the causal relations are given by a set of counterfactual supporting generalizations. Thus, for a mental state of an individual to count, say, as the belief that P, it is not necessary that the state actually play a causal role in the individual's mental economy; all that is required to be true of the state is that it would play a certain causal role if some other conditions specified in the generalizations were to obtain. So the notion of functional/causal role of a mental state should be so understood as to include the potential causal interactions that the state would enter. Accordingly, two mental states of two distinct organisms count as of the same type if their potential causal interactions are the same, namely, if they are covered by more or less the same generalizations, despite the fact they may differ quite radically in their actual etiologies. Finally, the generalizations in question are hedged by ceteris paribus clauses.

The second account is what Stich calls the Semantic Account (SA) according to which two sentence tokens count as type identical iff they have the same semantic content. The third one is what might be called the Quasi-Physical Account (QPA) according to which the sentence tokens are of the same sentence type iff their quasi-physical properties, their shape, so to speak, are the same.[2] After quickly dismissing the QPA as hopeless, he makes the following remark:

[The] interesting question is how causal accounts and content accounts compare with each other. Do they categorize mental tokens differently, or do they inevitably come out with the same categorization? On this issue, opinions divide. According to Fodor the two sorts of classification schemes coincide, "plus or minus a bit." Indeed Fodor sees this as "the basic idea of modern cognitive science." Any thoroughgoing [i.e., content] functionalist in the philosophy of mind will also end up on this side of the divide. On the other side, denying that causal and content accounts converge, are Field, Lycan, Perry, McDowell, and the truth. (1983:48-9)

Here Stich conceives the Semantic Account as fixing the type identity of mental sentences according to their broad content. Twin-earth cases show that functionally identical twins may differ in the broad content of their mental states. However, this is not going to be very important for what follows. Since, for many people in the field, narrow content is a construct out of broad content, Stich has the same line of argument against narrow content.[3]

Here is briefly Stich's argumentative strategy. Stich thinks that if folk psychology is to be scientifically vindicated through some version of a mental sentence theory, the Semantic Account of typing mental sentence tokens is indispensable. He then proceeds to show that the NCA and SA come up with radically different taxonomies. The way he does this is idiosyncratic. He constructs a series of thought experiments that are supposed to intuitively show that folk judgments about how to classify certain mental states do radically differ from the way the NCA would type them. Then relying on what these thought experiments seem to show, he proceeds to give an account, or rather a "descriptive analysis" of folk conception of belief as a paradigm case of a contentful mental state, i.e. as a paradigm case of mental state typed according to SA.

According to Stich's analysis, the "content identity" of beliefs that is thought to be assumed by folk psychology is a myth. On the basis of the evidence he claims to have collected through his thought experiments, he claims that the notion of content according to folk psychology is such that it is only a similarity measure along three different dimensions that the folk implicitly assume. One dimension of similarity between contents is the functional or causal-pattern of contentful mental states: "A pair of belief states count as similar along this dimension if they have similar patterns of potential causal interaction with (actual or possible) stimuli, with other (actual or possible) mental states, and with (actual or possible) behavior" (Stich, 1983, pp.88-9). The second dimension draws on the ideological (doxastic) background of the agents. Since these can greatly vary from person to person, the relation between two beliefs in two different people can only be a matter of similarity: "The ideological similarity of a pair of beliefs is a measure of the extent to which the beliefs are embedded in similar networks of belief" (p.89). The third dimension of similarity measure concerns the reference or truth-conditions of beliefs. Since they are dependent on the speakers linguistic community, social imbeddings, the causal history of the use of terms, physical as well as cultural environments, etc., the reference will vary as these vary without necessarily affecting the functional role of a mental state. To the extent that these factors are similar, to that extent the contents of beliefs will be similar. Stich thinks that this is essentially what the SA of typing mental sentences comes down to. Stich, of course, needs such an analysis because he has to know what it is exactly that a mature cognitive science threatens to eliminate and why.

It is now relatively easy to see how the two taxonomic schemes diverge. The NCA can capture only the causal pattern similarity dimension assumed in the SA. It can't be sensitive to the other dimensions. Stich concludes that "the mental sentence theory of belief, if fleshed out with a narrow causal account of belief, just does not comport with our workaday folk psychological notion of belief -- it is not an account of belief, as the term is ordinarily used" (1983, p.49).

If the two taxonomic schemes differ, what scheme should a scientific psychology adopt? Stich argues that adopting the SA is ill-advised, because mental states typed according to the SA will make bad science since a semantic taxonomy would only provide the psychologist with a theoretical vocabulary whose application is vague, unstable, context-sensitive, and observer relative. (I will return to this issue in § 5.1) Who would want such a science, Stich argues, especially if there is a clear alternative that is free of such defects? According to Stich, the alternative is a psychology whose taxonomic scheme is based on the NCA. This is the STM paradigm. Hence Stich's main conclusion: if a mature cognitive science is and ought to be committed to the NCA (STM), then folk psychological intentional notions like beliefs and desires are likely to be eliminated.

This is how Stich motivates and argues for his STM. It is therefore very important to see whether Stich is right in his claim that the STM paradigm is really superior in any of the respects in which he criticizes content based psychologies. As I advertised, I will argue that Stich is wrong (§ 5).

There are a number of curious features in Stich's discussion. In particular, Stich does not distinguish between having a revisionist intentional psychology (hence, one committed to some version of SA) and scientifically vindicating folk psychology in its entirety. This is especially apparent in his identification of SA with the folk conception of contentful states. I will take up this issue in § 5.2.

Stich's indictment of intentional psychology is essentially based on his "descriptive analysis" of the folk notion of a belief. His discussion, however, seems to contain some curious confusions. In particular, as I said, Stich does not distinguish between beliefs and belief ascription. From a very idiosyncratic account of the latter he draws an account of the former. I don't think that there is any good reason to buy his account, but I won't argue against it here.

Let's now see what sort of approach STM is.

3. What Is STM?

According to Stich, the core idea of STM can be captured in the following way:

the cognitive states whose interaction is (in part) responsible for behavior can be systematically mapped to abstract syntactic objects in such a way that causal interactions among cognitive states, as well as causal links with [proximal] stimuli and behavioral events, can be described in terms of the syntactic properties and relations of the abstract objects to which the cognitive states are mapped. More briefly, the idea is that the causal relations among cognitive states mirror formal relations among syntactic objects. (1983:149)

Stich here considers two networks, one of which is the network consisting of the causal relations among brain state types, proximal stimuli and behavioral events. This network is supposed to be mirrored by another network expressed by a syntactic psychological theory T. This theory consists of at least three kinds of generalizations: (1) the ones that nomologically connect proximal stimuli to B-states (belief-like states) with particular syntactic objects mapped to them, (2) the ones that describe causal relations among B-states and D-states (desire-like states), and (3) the ones that nomologically connects B- and D-states to motor-gestures. Following Michael Devitt (1990), I will call these kinds of generalizations: I-T, T-T, and T-O generalizations respectively.[4]

If we want to put T into some canonical form, we may write out T as a single conjunctive sentence, replacing all the occurrences of the theoretical predicates such as "x has a B-state mapped to s 1" and "x has a D-state mapped to (1" with expressions of the form:

x is in (some member of) B(s 1),
x is in (some member of) D(s 1).

B and D are functions (in the set theoretic sense) that map a particular syntactic object, which the theorist had already specified for the job at hand, onto the set of x's first order physical state types that have the functional role that T associates with that syntactic object. We may now express T in the following way:

(i) T[s1, s2,..., B(s 1), B(s 2),..., D(s 1), D(s 2),..., b1, b2,...]

where si's are proximal stimulus types and bi's behavioral event types (motor gestures), and s i's are specific syntactic objects.

Roughly, this is the form an STM theory would take. Let us now see how STM is committed to the NCA of typing brain states hypothesized by the theorist, i.e., how we can get their explicit functional definitions.

From (i) it is easy to get the Ramsey sentence of T by quantifying over the functions B and D:

(ii) (Ef1)(Ef2) T[s1, s2,..., f1(s 1), f1(s 2),... f2(s 1), f2(s 2),..., b1, b2,...]

We can now get the explicit functional definition of B:

B =df. The function f1, such that there is a function f2, such that the two uniquely satisfy `T[s1, s2,..., x1(s 1), x1(s 2),... x2(s 1), x2(s 2),..., b1, b2,...]'.

Similarly for the definition of D.

Although this is the formal procedure to get the explicit functional definitions of B and D, what we really want is explicit functional definitions of `B(s i)' and `D(s i)' for each i. The intuitive idea is this. Notice that in this formalism the existential quantification in getting the Ramsey sentence is over certain functions that map distinct syntactic objects to distinct sets of an organism's first order physical states. Here, in a certain sense, syntactic objects are exploited as external indices that pick up certain states of an organism that have distinct functional roles as specified by theory T. Each specific syntactic object in virtue of its distinctive place in T's generalizations specifies a unique functional role that the two functions B and D then map onto the underlying physical states of the organism. The syntactic objects may be viewed to be indices that are external to the underlying states (but see below). They only function to pick up certain states with certain functional roles. Intuitively, we may extract the functional definition of B(s i) for each i in the following way: since s i in the domain of B, in virtue of its place in T, is supposed to pick out a unique functional role that may be indexed by Fi,

B(s i) =df. the set of first order states that have Fi as determined by T.

Similarly for D and for each particular s i.

Now Stich does not present his STM in this way. Here I have used a procedure very similar to the one developed by Brian Loar (1982a) in his presentation of his own content functionalism. This is not accidental of course. In fact, this is the point. For, as should be obvious, Stich's STM, structurally at least, is nothing but a de-intentionalized version of Loar's content functionalism, except that Loar takes the causal role of "observational" beliefs to be fixed on the basis of distal stimuli. Where Stich uses abstract syntactic objects, Loar uses ("fine-grained") propositions, intentional objects par excellence. The type identity of specific abstract syntactic objects is given by their place in the theory. This is the way they are purely functionally defined according to their narrow causal profile.

In fact, the similarities between STM and Loar's content functionalism are, in one respect, stronger than that. Loar uses propositions in the initial stage of getting the functional theory first. (And, for good reasons, see § 7.) He then proposes a procedure by which all the propositions are replaced by purely formal expressions. The theory in this ultimate form structurally is almost an STM! Loar, of course, is no eliminativist. His aim is to naturalize intentionality by offering a sophisticated functionalist theory. So he thinks at some stage he should get rid of the intentional objects like propositions he initially used. Once the theory is completed, it is supposed to provide sufficient (and, necessary?) conditions for a mental state to have a semantic content, which can ultimately be specified without using any intentional terminology. This is his strategy, and as far as it goes it is perfectly kosher. But if I am right in what I am going to say, it does not go very far at least in its narrow version (see § 6).

4. Some Curious Aspects of STM

My presentation of Stich's STM may be taken to be tendentious. I presented it as a purely functionalist theory and said that the abstract syntactic objects, which the brain states are mapped onto, may be viewed as indices that are totally external to the underlying first order brain states. But, STM is supposed to be a formal/syntactic theory very similar to the Computational Theory of Mind (CTM) Fodor has developed and defended. STM is supposed to be a de-intentionalized version of what Stich calls Mental Sentence Theories. Indeed STM has been taken in this way in the literature by its friends and foes. But if my presentation is right, STM is not in fact theoretically committed to there being syntactically complex "sentences" literally realized in the brain. If so, how could it be very similar to Fodor's CTM?

In fact, there is no mystery here. STM as presented by Stich is indeed not committed to there being sentences realized in the brain, as Stich himself acknowledges:[5]

It is not, strictly speaking, required for an STM theorist to view hypothesized neurological state tokens as mental sentence tokens, though talking of them in this way is often an all but unavoidable shorthand. (1983:152)

This is curious but actually quite understandable. Remember Stich's question about how the tokens in different heads can be individuated as of the same sentence type. His solution is the NCA. But the NCA requires a theory first in which syntactic expressions figure as theoretical terms in the generalizations. However, once we have such a theory, it is easy to define the syntactic expressions functionally à la Loar. But once we do that, the question whether the referents of such expressions do really have syntactic structure somehow realized in the brain becomes secondary and at best an open empirical question. For, if the functional theory is a true one, we can do everything we want that the Mental Sentence version of the theory can do.[6]

So STM as a purely functionalist theory is not committed to a semantic free LOT. On the other hand, of course, whatever CTM is, it cannot be neutral with respect to the question of whether there are syntactically complex sentences realized in the brain. CTM should be so formulated that it essentially entails a positive answer to this question. The problem in fact stems from the widely shared conviction that the type identity of brain sentences can and should be given in terms of the NCA (for some, as well as in terms of other ways like the SA).[7] In § 6, I will argue that this can't be done. So there is at least this dissimilarity between STM and CTM: whereas STM is non-committal about there being brain sentences, CTM, whatever it is, is essentially committed to it.

Having made the point, however, I want to talk of STM most of the time as if it's concerned with the functional individuation of syntactically complex brain sentences. Not only because, as Stich says, this is an all but unavoidable shorthand, but also because I want to see whether Mentalese expressions can be individuated on the basis of the NCA if the LOTH is true. So, in what follows, I will assume the framework of Mental Sentence Theories, and often treat STM in this form.

So far we have been talking about the functionalist nature of STM, and thus its commitment to the NCA of typing brain states. But what does this have to do with syntax? Where is the place of syntax in all this? More particularly, how does Stich conceive of syntax when he talks about the syntactic type identity of brain sentences? Or, what makes his theory a "syntactic" theory? To this last question he answers in the following way:[8]

We would have no reason to view brain states as syntactically structured unless that structure can be exploited in capturing generalizations about the workings of mind/brain's mechanisms. Attributing syntactic structure to brain state tokens -- assigning them to syntactic types -- is justified only if some interesting set of causal interactions among those tokens is isomorphic to formal relations among abstract syntactic objects. (1991:244)

Notice that if Stich is right about this, Fodor can't have any reason for postulating a separate computational level in which intentional laws of psychology are implemented.[9] In particular, what is puzzling about Stich's answer is that he doesn't mention at all the Turing legacy which is the main driving force behind Fodor's insistence that the computational story, according to which thought processes are defined over the formal/syntactic properties of representations, is our only plausible story about how semantically coherent processes can be physically/mechanically possible. Stich's interest seems not to be in computationalism. This is understandable to a certain extent. For Stich doesn't think that there are any semantically coherent thought processes that need the attention of science because he doesn't think that there are any states with semantic content. Put this aside. He has a different line of answer.

When he talks about the syntactic type identity of brain sentences, he has a "rich" notion of syntax, according to which mere difference in lexical items (e.g. "Tully was bald" versus "Cicero was bald", or "Fa" versus "Fb") is enough to make the sentence tokens belong to different syntactic types.[10] In particular, for Stich, the criterion according to which two sentence tokens in two different heads count as of the same type is a syntactic criterion. But since this criterion is captured by the NCA, the syntactic type identity of brain sentences is a matter of functional identity:

when mental states are viewed as tokens of syntactic types, the functional profile exhibited by a mental state can be equated what we have been calling its formal or syntactic properties. (Stich, 1983: 190)

So it seems that, according to Stich, the very postulation of complex semantic-free sentences realized in the brain whose "syntactic" type identities are given purely functionally is what makes Stich's theory a syntactic theory. I would like to point out that such an understanding of the notion of syntax is very different from the one that is needed for a Fodorian Computational Theory of Mind: what is required for the LOTH is a combinatorial syntax that fixes the logical form of expressions.[11] The important question I will address in § 6, however, is whether the type identity of brain sentences can be given in terms of their narrow causal profile, whatever it is called.

Now that we have set the stage, let us see whether the STM paradigm is any superior over content-based psychologies. In the following section I will present my argument for my conditional claim.

5. The Alleged Superiority of STM

From Stich's analysis of folk conception of belief individuation it follows that predicates like `believes that P' (1) are vague and unstable, (2) depend on a (observer-relative) similarity matrix along three different dimensions for their applicability, and (3) their application involves many unnecessary "fine-grained distinctions which contribute nothing in the explanation [and prediction] of behavior." From (2), it also follows that there are likely to be many important cognitive generalizations that will not be stateable in terms of such predicates. So a content-based psychology will inherit all of these limitations. In contrast, the STM style theories, Stich claims, will have none of these.

In this section, I will argue that if Stich's criticism of content-based psychologies is right then exactly parallel problems equally plague STM. But for this, we first need to see, exactly, how Stich argues for the superiority of STM. In other words, since my claim is conditional, we need to see in some detail what makes its antecedent true according to Stich and why he thinks that STM is free of similar problems.

5.1 How STM Is Supposed to Be Superior

In discussing how STM theories will succeed where the content theories fail, Stich again uses the thought experiments he has considered in showing how the content taxonomies radically differ from the ones based on the NCA. Much of the difference stems from the fact that whereas the individuation of content essentially depends on three different dimensions, the NCA is only committed to individuating mental states according to their narrow causal pattern. The other two dimensions, ideological and referential (or, truth conditional) similarity are to be amputated. First, these last two are unnecessary and therefore contribute nothing to the explanation and prediction of behavior. Second, by getting rid of them, context-sensitivity is eliminated. That is because, as in every multi-dimensional similarity judgment, it is the context that decides which dimension is to be emphasized in deciding whether a given state in a particular situation counts as the belief that P. Sometimes referential similarity will count more, sometimes ideological similarity, or simply causal pattern similarity depending on the demands of the particular situation in which the question arises.

Stich puts the greatest emphasis on the problems created by the ideological similarity dimension. This is what he calls the holism problem in the folk conception of belief. In order to bring out the problem vividly, let's focus on his most celebrated thought experiment: the case of Mrs. T. Mrs. T is an elderly woman who suffers from a progressive loss of memory. At the end, she does not "know" what an assassination is, what dying is, who McKinley was, etc. Nonetheless, she appears to remember/believe that McKinley was assassinated, because that is what she persistently says when asked "What happened to McKinley?" According to Stich, the folk psychology's clear verdict is that she does not believe that McKinley was assassinated. Stich's diagnosis is that when she ceased to have a certain set of relevant beliefs, she ceased to believe that McKinley was assassinated, despite the fact that she appears to respond correctly to the question. This, Stich says, shows that folk conception of belief attribution attends to the doxastic background of an agent. From this he seems to infer that the type-identity of someone's belief is partly constituted by what other actual beliefs the individual happens to have. This is the notorious problem of content holism, according to Stich.

On the other hand, an individuating scheme based on a NCA, he claims, is and ought to be nonsensitive to the actual doxastic surrounding of a mental state it individuates. That is because the NCA taxonomizes her state underlying her utterance on the basis of its potential (narrow) causal interactions. Thus STM is able to account for her ability to infer, for instance, "McKinley was buried in Ohio" from her "acknowledgment" of "McKinley was assassinated" and "if McKinley was assassinated then he is buried in Ohio." So whereas content psychologies miss such important generalizations as those that cover Mrs. T STM theories will be able to take such agents under their scope (for more on this, see below).

With respect to the reference similarity dimension, the situation is similar. For instance, there are many cases in which the causal history of certain subjects' use of some terms are so tangled that there is simply no saying what they refer to. And this makes it difficult to find comfortable characterizations of the content of the beliefs the subjects express using those terms. Hence, the difficulty in subsuming them under content generalizations. But for STM theories there is no such difficulty. The underlying states that cause those subjects to utter a sentence whose truth conditions are moot and the ones that we utter can be categorized as tokens of the same B-state mapped to the same syntactic object on the basis of their narrow causal potential. Similarly for those utterances whose referential properties are sensitive to the linguistic community of the subjects. The NCA is insensitive to such links outside of the agents' skin, and in being so, it is perfectly apt to capturing the regularities that ought to be under the proper domain of cognitive psychological theories.

The case with the causal pattern similarity is different. Those who are sufficiently causal pattern dissimilar to us will be beyond the reach of content generalizations because it will often be very difficult to find content sentences to characterize what intentional states they are in. For an STM style theory, however, this is no problem, or so claims Stich. All the STM theorist needs is a different set of generalizations, if the case at hand warrants it. Since the type identity of mental sentences depends substantially on which generalizations cover it, and since the causal patterns are different, different generalizations will be needed. Therefore, the abstract syntactic objects that the brain states are mapped onto will not be comparable to those of us. So there is a certain parallel here between the content and syntactic theories. However, unlike content theories, STM theories have no difficulty in stating these new generalizations, since all it has to be done is to postulate new syntactic objects in terms of which new generalizations are stated without any regard to what other actual brain states are mapped to which syntactic objects and to what referential connections obtain between the states and the objects outside the skin of the subject.

So, Stich claims, only the NCA, and hence STM, is what we ought to have for a successful and scientifically respectable cognitive psychology, if we don't want to miss important generalizations and cripple with a pervasive vagueness. If content based psychologies as analyzed by Stich were to be on a part with STM with respect to their virtues and vices, there would be very little point on the part of Stich to insist that we have to couch our generalization syntactically and forget about content. But, according to Stich, content and syntactic taxonomies radically diverge, so we have to choose. And it is the STM we have to choose because it is explanatorily superior to content psychologies: it has all the virtues of content psychologies and none of their aforementioned limitations. Let us now see whether Stich is right about his claim.

5.2 The Parallel Disadvantages of STM-Style Theories

In this subsection, I will take up all the three dimensions one by one, and show that either STM is no better off with respect to each of them or the criticisms leveled against content-based psychologies are unwarranted. However, Stich's emphasis is understandably on the "holistic" nature of beliefs, and so will be mine. I will start with the discussion of the referential dimension and argue that Stich is wrong in assuming without any argument that a scientifically respectable intentional psychology must be committed to broad content. I will then take up the doxastic similarity dimension. In fact, Stich's most important criticisms concern this dimension. I will show that Stich's STM has exactly parallel problems. Finally, I will briefly discuss the causal pattern similarity, and argue that this dimension does not pose any special problems for an intentional psychology, and STM has parallel problems anyway.

5.2.1 The Referential Dimension and Narrow Content

Stich rightly argues that commonsense individuation of propositional attitudes attends to reference or truth conditions. In other words, the folk individuate beliefs (inter alia) according to their broad content. This is one of the three dimensions mentioned above. In arguing against the prospects of incorporating folk notion of belief (and other propositional attitudes) into a respectable science of psychology, it is this notion of belief with broad content that Stich has in mind. Stich, of course, is entirely right in thinking that broad content does not supervene on what is inside the head, and therefore, you can, and sometimes do, have causal pattern identity without reference identity, and this seems to show that the broad taxonomy of attitudes plays no essential role in psychological explanation. Hence, one of Stich's reasons for his case against belief: as far as commonsense individuation is committed to a broad taxonomy, it is likely to be eliminated.[12]

Stich does not consider the possibility of a certain form of revisionism vis-à-vis folk psychology. Granted that folk individuate beliefs broadly and what a scientific psychology needs is a taxonomic scheme that supervenes only on what is inside the skin, it simply does not follow that the notion of content has no place in cognitive science, unless there is no other notion of content but a broad one. But, many people in the field think that there is a notion of content that is narrow, and thus supervenes only on what is inside the head. Fodor, for instance, thinks that narrow contents are functions from contexts to truth conditions.[13] These functions are implemented in the brain. Two thoughts are identical in their narrow content if: for all contexts C, one has the truth conditions TC in C iff the other has TC in C. In other words, context plus narrow content gives broad content. Intuitively put, narrow content is what you get when you substract the contribution of the context from broad content.

There are many complicated and subtle issues about the notion of narrow content. I will not go into discussing them here. The important point I want to make against Stich is to distinguish between two strategies in arguing for the prospects of having a scientifically respectable intentional psychology. One is simply to adopt all the intentional notions of folk psychology with their taxonomic scheme without any filtering attempt to press them into scientific use. Fodor calls this strategy "die-hard".[14] The other is to go revisionist. This strategy is ready to give up many things that the folk assume in so far as what remains uses an apparatus that is essentially and in some form recognizably intentional. Unfortunately, Stich does not distinguish between these two roads. Throughout his discussion what he seems to have in mind is the die-hard strategy.[15]

The notion of narrow content is a perfectly intentional notion: its identity conditions are given in terms of the identity of truth conditions across contexts. Put differently, narrow content is semantically evaluable given a context. So, as observed by Devitt (1990), a revisionist that opts for the notion of narrow content is not necessarily jeopardizing the prospects of scientifically vindicating folk psychology.

In the absence of an argument on the part of Stich to show that there can be no such thing as narrow content, I will simply drop the issue of reference similarity dimension for what follows as a red-herring. As we will see shortly, however, this will make little difference for what I am going to say against Stich. Stich, in fact, himself seems to have dropped the issue in criticizing Fodor's notion of narrow content in his (1991) article. He seems to grant the possibility of an intentional narrow psychology that is not committed to reference or truth-conditions. However, he has the same line of criticism against narrow content anyway: since narrow content, in some sense, is a construct out of broad content, the former has all the limitations that the latter has, except, of course, those that stem from the dimension of reference (or, truth conditions) similarity. This is not surprising, because, as I said, Stich views the holistic nature of intentional states (narrow or wide) as the most important argument against any sort of intentional psychology.

So we are left with the other two dimensions and their alleged trouble causing features for content based psychologies. Let us now see whether a taxonomy based on NCA has any superiority with respect to other two dimensions over content taxonomies.

5.2.2 Doxastic Similarity Dimension and the Holism Problem

Most of Stich's arguments for his case against belief turns on the "holistic" nature of commonsense individuation of beliefs. It is unfortunate that he does not elaborate on what, exactly, holistic nature of content comes down to. Much of what he has to say on the matter is provided through a handful of examples like the case of Mrs. T. This vagueness is typical on those who think that semantic content is holistic.

In what follows, I will first try to explain as clearly as possible the sense in which Stich thinks that commonsense individuation of beliefs is holistic. My discussion of Stich on holism of folk psychology will show that he is obscure and vague about what he thinks the "holism problem" is. I will then indicate exactly why Stich thinks that the holistic nature of belief is a problem for content based-psychologies, and why he thinks that an STM-style theory is totally free of it. We need to be as precise as possible about this, because my argument against Stich depends on his own premises.

5.2.2.1 Stich's "Holism"

According to Stich, the identity of a particular belief, say, the belief that McKinley was assassinated, depends on what other actual beliefs a person happens to have. The doxastic surround of the belief that McKinley was assassinated is constitutive of its content. But Stich does not say what this doxastic surround is, how it is determined, nor how it is supposed to be constitutive of a given content. For instance, at any moment, a person who believes that McKinley was assassinated has also a very large stock of other beliefs. Do all of them contribute to the content of the belief in question, or only some of it? From the way Stich writes and uses the expression `doxastic surround of a belief' and its cognates, it seems that only some portion of a person's entire belief system, he thinks, is relevant to the determination of the content. Unfortunately, he gives almost no clue about how big the portion is supposed to be. At any rate, we have no idea about what, exactly, makes commonsense content individuation holistic. Where does the holism come from? The paradoxical aspect of Stich's discussion lies in here. Stich claims that commonsense taxonomy of beliefs is holistic, but all he may be said to show is that, at most, the identity of the belief that P is (partly) determined by some other beliefs a person actually has. If we reflect on his examples, this is obvious.

The case of Mrs. T is typical. She ceases to have many beliefs among which, for instance, is the belief that if someone is assassinated then she is dead. In fact, she no longer "knows" what dying is, what a presidency is, what presidents do, etc. In all of Stich's similar examples, the beliefs that make up the doxastic surround of a particular belief have rather certain "direct relations" to the belief. They are not only semantically close, but also, in some loose sense (to be discussed later), "conceptually" tied to the belief. In all such examples, the fact we are invited to observe is that when someone ceases to have those kinds of beliefs then someone ceases to have a particular belief.

When Stich presents his own descriptive analysis of what a content taxonomy comes to, he is equally unhelpful. Again, he does not give any clue about what portion of one's belief system, or how much of it, is relevant in determining any particular content. Just for the same reason, he does not discuss what, if only some portion is relevant, determines its relevancy. But he concludes that belief individuation is holistic!

I think, Stich is crucially vague and not particularly careful in his discussion. If his claim is that the content of a particular belief is (partly) determined by the set of all beliefs one has, which I take it what holism at its extreme comes down to, then he has not provided any single reason, let alone a relatively elaborate argument, for his claim. On the other hand, if his claim is that only some beliefs determine content, as he seems to intend, then the identity conditions for belief are not holistic.

I think that part of the reason why Stich is so vague and careless is that he does not care about this distinction, some or all. According to Stich, it seems, the very fact that the content of a belief depends for its identity on at least some other actual beliefs the agent has is enough to make serious trouble for any psychology that hopes to essentially advert to content in its generalizations. For one thing, given Quine's influence on him, Stich clearly thinks that the distinction between those beliefs that determine content and those that don't can be anything but sharp and principled. If so, content can at best be a matter of degree. And this is enough to make trouble: a content psychology is possible, at best, for those who are doxastically similar. But even for such a psychology, vagueness will still continue to be a serious problem, since it is almost certain that doxastic similarity never actually achieves doxastic identity among people. Whatever the case is with Stich's analysis of belief, however, he clearly thinks that his alternative paradigm of doing psychology, STM, does not have any such problem.

Let us see why and how Stich thinks that the framework provided by STM has no such "holistic" problem. Here is a typical remark by Stich:[16]

In chapter 7, section 3 our focus was on ideological similarity, and the persistent problem was that as subjects became increasingly ideologically distant from ourselves, we lost our folk psychological grip on how to characterize their beliefs. For a syntactic theory, however, ideological similarity poses no problem, since the characterization of a B-state does not depend on the other B-states that the subject happens to have. A B-state will count as a token of a wff if its potential causal links fit the pattern detailed in the theorist's generalizations, regardless of the further B-states the subject may have or lack. (1983:158)

Stich, then, goes on to clarify how this can be so by working on the example provided by Mrs. T:[17]

If we assume that before the onset of her disease the B-state which commonly caused her to say "McKinley was assassinated" obeyed generalizations like (4)-(6), then if the illness simply destroys B-states... without affecting the causal potential of the tokens which remain, the very same generalizations will be true of her after the illness has become quite severe. In chapter 7 we imagined a little experiment in which, shortly before her death, we tell Mrs. T, "If McKinley was assassinated, then he is buried in Ohio," and she replies, "Well, then, he is buried in Ohio." This is readily explainable by (5) [the syntactic version of psychologized Modus Ponens]... So if the generalization is there, it can be captured by a syntactic theory. But as we saw, there is no comfortable way to capture this generalization in the language of folk psychology... Thus a cognitive science that adopts the STM paradigm can aspire to broadly applicable developmental, clinical, and comparative theories, all of which are problematic for a content-based theory because of the constraints of ideological similarity. (1983: 158-9)

Is it true that ideological similarity poses no parallel problems for STM-style theories? I think not. It is time to see why.

5.2.2.2 Holism and STM

Here is the structure of the argument for my claim that STM, contrary to Stich's advertisement, has exactly the parallel problems.

(1) The STM framework is committed to the NCA of type individuating B-states qua mapped to particular syntactic objects like, say, `Fa', through the generalizations that cover them.

(2) The NCA is capable of individuating such states only if it has enough generalizations of a certain sort, which I will call, S-generalizations.

(3) If STM has S-generalizations among its stock of generalizations then it has all the parallel problems that Stich complains about content-based psychologies regarding the dimension of ideological similarity.

In the remainder of this section, I will make this argument stick. I take it that (1) is common ground (see §§ 3-4). Let me first argue for premise (2).

All the T-T generalizations Stich ever considers, by way of giving examples or otherwise, have rather a certain character: They all quantify over particular syntactic objects, i.e., they all use meta-variables to refer to classes of actually specified sentences that have a certain common "logical form". Even from his passage above, it is apparent that when he talks about the causal interaction of the token that underlies Mrs. T's utterance of `McKinley was assassinated', the generalizations Stich has in mind are of this kind. Let me call the generalizations that quantify over particular brain sentences "L-generalizations", since these apply to any sentences that have a certain "logical" form. L-generalizations are all blind to the primitive non-logical vocabulary that the STM-theorist specifies.

It should be obvious that if all the T-T generalizations that go into the specification of the causal role F in the individuation of B(s i) for any i (see above) are of this type, i.e., if they are all L-generalizations, then there cannot be a unique causal role for each particular B(s i), which means that there can be no type individuation of B-states with particular syntactic objects mapped onto them. Here is why: with only L-generalizations in force, any sentence token has potential inferential (causal) connections to any other one. Put differently, since, on Stich's own admission, the generalizations in the theory detail not only the actual but also the potential causal interactions of any particular B-state, and since any sentence token can potentially be "inferred" from any other (i.e., causally connected through L-generalizations to any other), L-generalizations all by themselves cannot type individuate particular B-states.[18] All they can specify is at most the "logical" form or syntactic type of sentence tokens. As we will see in the next section, this situation does not change even when we add the I-T and T-O generalizations to L-generalizations: together they are still incapable of providing unique causal roles for particular B-states.[19] For one thing, as I will argue in the next section, there can be no such (narrow) I-T/T-O generalizations. But, for our purposes here, more importantly, even if there are such generalizations they can at most help to identify a very small subset of particular B-states whose character is rather "observational". However, Stich himself is pessimistic of there being any such subset (see below). My point is that S-generalizations are necessary (not sufficient) for type individuating at least some B-states, and this will do for premise (2).

What is needed, of course, is a different kind of T-T generalizations in addition to L-generalizations, T-T generalizations that are not blind, so to speak, to the primitive non-logical vocabulary of the STM-theorist, generalizations that detail (part of) the causal role that is unique to, say, the B-state mapped to `Fa'. It is obvious that such "low-level" generalizations will typically be the syntactic parallels of such "content generalizations" (C-generalizations) as[20]

(i) For all subjects S and for all x, if S comes to believe that x is a cow, then S will typically come to believe that x is an animal,

(ii) For all subjects S and for all x, if S comes to believe that x is a bachelor, then S will typically come to believe that x is unmarried,

and so on.[21] Let me call the syntactic parallel of this kind of C-generalizations "S-generalizations". Stich is committed to such generalizations, otherwise there is no individuation of particular B-states. Hence, premise (2).

Let's now take up premise (3): If Stich is committed to S-generalizations, then his STM framework has exactly the same "holism" problem that he claims plague content-based psychologies. There are different ways of showing this, but at the end they all come to the same thing. Let me begin with the obvious version.

S-generalizations are low-level generalizations. What makes them low level is the following fact. Subjects that are covered by such generalizations are also covered by L-generalizations if the subjects have certain actual B-states. For instance,[22]

IF S has the belief* that #for all x, if x is a cow, then x is an animal#,

AND

S comes to believe* that #Samantha is a cow#,

THEN

S will typically come to have the belief* that #Samantha is an animal#.

What might license this inference* is, of course, the existence of high-level L-generalizations that Stich (mutatis mutandis) specifies among his examples:

(5) For all subjects S, and all wffs A and B, if S has a B-state mapped to A->B and if S comes to have a B-state mapped to A, then S will come to have a B-state mapped to B. (1983:155)

As we may recall, according to Stich the "holism" problem that plagues the content-based theories consists in the fact that the type identity of a particular belief (partly) depends on what other actual beliefs the subject has. And Stich thinks that this fact is the source of the problem. In contrast, he claims, the NCA of typing particular B-states has no such commitment to there being any actual B-states surrounding a particular B-state in terms of which its type-identity is determined.

But, if every subject who is covered by S-generalizations is also covered by the relevant L-generalizations, in the way I've just indicated, then the STM-theorist is committed to their being actual B-states for determining the type-identity of particular B-states, and thus committed to construct syntactic theories only for those who more or less share their doxastic* background. In other words, in the STM paradigm, the "syntactic" type identity of sentence tokens is, contrary to Stich's advertisement, acutely sensitive to the actual particular B-states that surround them. This is a problem that is exactly parallel to what Stich calls the "holism" problem of belief individuation. And so, STM must incur all the parallel problems that Stich claims seriously bother content psychologies: Sharing a particular B-state can only be a matter of degree, therefore, those that are doxastically* dissimilar to us cannot be covered by STM-theories. What are we to do with the children, exotics, cognitively handicapped, higher animals, etc.? Furthermore, unless Stich can come up with a principled distinction between those B-states that contribute to the syntactic type identity of a sentence token and those that don't, the vagueness that is already existent in the conditions that type identify sentence tokens will be greatly aggravated. Again, we have exactly the parallel problem here. If Stich is right in his criticism of content-based theories regarding "holism" problem, it is false that STM theories are any superior in just that respect.

However, one might object: It is not necessary that for any subject who is covered by S-generalizations is also covered by the relevant L-generalizations in the way I have just indicated, and therefore, it is not necessary for an STM theory to be committed to there being actual B-states, which a subject must have, for the type individuation of sentence tokens. It may be that the S-generalization (i) above may be true of a subject even though she may not have any actual belief* that #all cows are animals#. In such a case, the syntactic type identity of a sentence token may be given in terms of such dispositions as the likes of (i) and (ii) specify without any recourse to high-level L-generalizations.[23] How does this evade the problem? Well, let me show that it doesn't.

Although I don't have to make my point in the way I will do, I think it is important to cast the issue in terms of that perspective. But nothing important will hang on it. STM has usually been brought up as a de-intentionalized version of a language of thought story, or CTM. We have seen that STM is not in fact committed to there being (semantic-free) sentences literally realized in the brain. But it may be taken in this way, and this is the assumption we are now operating under.

Anyone who is sympathetic to the computational paradigm must keep in mind that CTM is a "rules and representations" framework: any relatively higher level mental processes consist in transformation of syntactically structured representations according to rules that are causally sensitive only to the formal properties of representations over which they are defined. In other words, the typical computational treatment of such inferences as expressed by (i) or (ii) will take the form of applying some relatively high level rule like Modus Ponens to actually tokened complex sentences.

Of course, this is only one possible implementation story that can be given for such generalizations like (i) or (ii) at the computational level. Another possibility is that the rules that govern the inference from #x is a cow# to #x is an animal# is rather more specific and low-level, rather like the syntactic analogues of Carnap's "meaning postulates" implemented as rules.[24] But, either way, according to CTM, you need rules to manage inferential processes defined over data structures.[25]

My point is simply this. On a computational paradigm S-generalizations can be cashed out either by postulating high-level laws and actual beliefs* upon which they operate or by postulating the syntactic analogues of meaning postulates in the form of low-level rules. So, once it is obvious that an STM-theorist is committed to such S-generalizations, the STM-theorist is no longer in the position that Stich claims is free of the problems confronting the content psychologist. Let me illustrate.

As we may remember, Stich claims that unlike content psychologies, the STM framework is capable of covering Mrs. T's mental states in its generalizations. That is because, he says, according to STM, the type identity of the state underlying her utterance "McKinley was assassinated" does not depend on further actual doxastic* states she has. In fact, as the example is constructed, she has almost none. The type individuation of Mrs. T's state proceeds according to its inferential* potential, not according to what it actually inferentially* interacts with. So far so good. For instance, all the L-generalizations that cover her state detail just this potential. But, of course, with only L-generalizations, the STM-theorist cannot individuate the state. It is obvious that in addition to L-generalizations, the STM-theorist needs some such S-generalizations as

(iii) For all subjects S, if S comes to believe* that #someone is assassinated#, then S will typically come to believe* that #someone is dead#

in order to type individuate the state underlying Mrs. T's utterance of "McKinley was assassinated". But, of course, we see that it is precisely this kind of generalizations that become inapplicable to Mrs. T when we come to see that she ceases to have many relevant beliefs. This can easily be explained on the version of the computational story that derives the S-generalizations from actual beliefs* and high-level L-generalizations. But Stich would probably insist that this is the wrong version. Well, then, let us look at the other version where S-generalizations are implemented as specific "dedicated" rules rather like the syntactic analogues of "meaning postulates".

The question now is whether there are any such rules intact in Mrs. T's case. As we may remember, it becomes apparent under questioning that she does not "know" whether an assassinated person is dead, what dying is, who McKinley was, etc. When she is asked whether McKinley was dead, she answers "I don't know". What better evidence can there be for the fact that the above S-generalization is broken? In the case of S-generalizations, appeal to potential causal profile doesn't even begin to help since it is precisely this potential that is lost in her case. But then, if such generalizations do no longer cover the mental states of people like Mrs. T, of course, we can't tell the computational story along the lines we have been assuming given that the other version is out. But either way, the important point is that the S-generalizations do simply not hold in Mrs. T's case. If so, however, the STM theorist is in exactly the same boat as the content psychologist: there is simply no saying what "syntactic" state Mrs. T is in, since the STM-theorist is no longer able to type individuate her state.

The same is true, similarly, for people who are doxastically* dissimilar to us like children, exotics, cognitively handicapped, higher animals, etc. In so far as the S-generalizations that are true of them are not available or non-existent, there is no type individuation of their syntactic mental states, hence they are beyond the reach of STM-theories.

So here is the score. Contrary to Stich's claim and advertisement, because of their commitment to NCA, STM theories are committed to the type individuation of particular B-states either (depending on the computational story preferred) according to what other actual B-states the subjects have, or according to what S-generalizations are true of them. The first option makes STM equally sensitive to the actual doxastic background of subjects. The second option restricts the scope of STM-theories to those for whom S-generalizations exist, or are specifiable, thus, again, to those who are doxastically/disposiotionally similar. But the consequences of both options are just the same for the prospects of STM if the prospects of content psychologies are as Stich claims them to be.

Before ending this section, we have to take care of one more thing. Stich's analysis of belief includes, remember, one more dimension in the similarity matrix that goes into the commonsense individuation of beliefs: the causal pattern similarity. Let us now see whether this dimension poses any special problems for content-based psychologies in a way STM-theories are free from.

5.2.3 Dimension of Narrow Causal Pattern Similarity

According to Stich, when certain subjects are sufficiently dissimilar to us with respect to the causal pattern dimension (call it cp-dissimilar), there is no saying what they believe. But since STM style theories do not traffic in intentional terminology, they can come up with different generalizations, hence with different syntactic objects, that detail these subjects' functional/causal organization. Cp-dissimilarities can be attributed to three different sources:

[1] dissimilarities that stem from different doxastic background (as in the case of exotics, etc.),

[2] dissimilarities that stem from different or broken down ratiocinative mechanisms (as in cognitively handicapped people -- different L-generalizations), and

[3] dissimilarities that stem from perceptual or behavioral differences (as in the case of perceptually or behaviorally handicapped people -- different I-T or T-O generalizations).

With respect to [1], Stich is no better off than the content-based psychologists. Here is why. First, Stich thinks that the cp-similarity dimension is distinct from the other two dimensions. But this can't be quite right. In particular, since Stich is committed to S-generalizations, the cp-similarity cannot be an entirely separate dimension different from doxastic similarity dimension. S-generalizations are supposed to detail nomological connections among beliefs* with particular #content#, hence they are rather specific and dedicated generalizations that hold among doxastically* similar subjects. So, in a certain sense, the doxastic similarity issue turns out to be an aspect of cp-similarity issue. They collapse into one dimension. To put it differently, the class of subjects who are sufficiently cp-dissimilar to us include those who, Stich calls, are doxastically dissimilar. So if exotics and the like pose a problem for intentional psychology, by parity of the cases, they should pose a parallel problem for STM theories. If, on the other hand, the STM-theorist can come up with different S-generalizations for doxastically dissimilar subjects, as Stich might claim, why can a content psychologist not do the same? In fact, given that the reference dimension is put aside, there is every reason to believe that this is in fact what they do. Anthropologists routinely describe in intentional terms the doxastic network of such culturally remote groups: they seem to detail, in other words, the network of C-generalizations that hold for such groups.

Stich similarly claims that where differences stem from [2] and [3], the STM theorist can always state new generalizations. But again why can't the content psychologist do the same, if this is the proper way to go?

Stich would probably say that (broad) content attribution is necessarily parochial, whereas attribution of syntactic objects on the basis of new generalizations is not. But we are assuming that a scientific intentional psychology will not advert to broad content but to a narrow one. Also, given that the cases are exactly parallel, it should be the case that either the two types of theories both can't cover causally dissimilar subjects or they both can.

I think, Stich is forcing us again to be die-hard conservatives in defending a content psychology, and he's been wrong on this. The essential question is whether the dissimilar subjects are intentional agents, and not whether we can attribute broad content to them. In cases like [2] and [3], as long as the subjects are intentional, much of what matters will be whether identity or difference of the narrow content of their mental state is defined. And, as long as the answer to that question is positive, the content psychologist can perfectly well quantify over the content of their states.[26] And there is no reason to think that the answer is negative. Or if it is, then, similarly, specifying the identity or difference for abstract syntactic objects Stich wants to operate with is equally problematic in the STM paradigm.

My present point however is that, given the cases are exactly parallel, Stich needs an argument for his claim that STM theories are superior in the aforementioned respects to content based theories. And he doesn't have one. I suspect that the reason for that is his confusion about what STM theories are or are not committed to.

In fact, as we will see in the last section of this paper, an intentional psychology has even an important advantage over STM theories in their ability to discover and state new set of generalizations for those that are sufficiently cp-dissimilar to us. I will argue that especially in the context of discovery, an STM-theorist has no chance but to become an intentional psychologist.

In conclusion, I claim to have established in this section the following conditional claim: if content based psychologies have the problems Stich enumarates then STM-theories have exactly the same problems. So Stich looses all his motivation to promote STM as a superior paradigm of doing scientifically respectable psychology.

Now it is time to see whether the NCA can type-individuate the particular syntactic states at all, as Stich assumes. In the following section, as advertised, I will argue directly against STM by showing that it can't.

6. The NCA and the Type Individuation of Brain Sentences

[[Dear reader: The following subsections §§ 6.1 & 6.2 were written after a considerable period of time had passed since the first draft of this paper was written. I hope the difference in style and some (mostly terminological) dispcrepencies between the material in this section and the others aren't big enough to cause serious problems in following the thread of the argument in the paper. I also hope that some overlap and repetitions would be excused.]]

As we may remember, Stich had raised a problem when he was discussing Fodorian mental sentence theories:

for a Fodor-style account of belief sentences to hang together, we must have some workable notion of what it is for two distinct people, speaking different languages, to have in their heads distinct tokens of the same sentence type. (1983:43-4)

Among the three possible accounts he offered, he rejects two, the Quasi-Physical Account and the Semantic Account (SA), because he thinks that either they can't do what they are supposed to do or they are ill-suited for serious scientific purposes. What remains is the NCA. Since Stich intends to take his STM rather in the spirit of mental sentence theories, he makes the NCA the basis of his allegedly superior STM in type identifying the brain sentence tokens.

In this section, I will first show what was supposed to be wrong with narrow content functionalism. I will then argue that Stich's STM is identical to narrow content functionalism, and as such suffers exactly from the same problems: namely, NCA-cum-STM either doesn't have the resources to type-individuate brain states with particular syntactic objects across different organisms in a way appropriate for the purposes of scientific psychology, or else suffers from the same radically interactable "holism problem" that Stich says plagues content based psychologies. We will see that these two problems are at bottom different aspect of a single problem, so at the end, it doesn't really matter which horn of the dilemma Stich's theory will get stuck to.

6.1 Troubles with Content Functionalism

Let us begin by reminding ourselves what was supposed to be wrong with functionalism in fixing mental content. Fodor has been the most vocal person in the last ten years or so in his attack on semantic functionalism. Partly becasue of this, I will loosely take Fodor's attack on semantic functionalism to guide us through the steps of the argument that will show what the troubles are with content functionalism. Where necessary, I will clarify, supplement and correct Fodor. In a way, here I am siding with Fodor about the the place of functionalism in fixing the type-identity of particular Mentalese symbols against Stich. I think that Stich has misunderstood the role of functionalism in cognitive science.

Fodor's sustained attack on content functionalism has changed in emphasis and argument over the years, and there are many obscurities in the details. In the beginning, the emphasis was on the holistic consequences of content functionalism, which were thought to be destructive of a scientifically respectable intentional psychology.[27] Later on, he argued that functional role semantics either violates semantic compositionality or else is committed to there being a principled analytic/synthetic distinction, and neither option was thought to be palatable.[28] Although his earlier and later criticisms are very intimately connected, I will concentrate in what follows only on his earlier views, since they are the ones that are relevant to criticism of STM.

According to narrow content functionalism, the content of a Mentalese symbol token is (metaphysically) determined or constituted (partly) by some of the inferential relations it has.[29] It is crucially important to be clear about the notion of inferential relations of a token. I want to make six points (in no particular order) to clarify the notion.

First, a content functionalist aspiring to naturalize intentionality, or at least conceiving functionalism as part and parcel of the project of naturalizing semantics, cannot, unquestion-beggingly, appeal to a token's "inferential relations" as such, since the very notion of inference is an intentional one. As noted by Fodor,[30] the usual solution to this problem has been to combine functional role semantics with LOTH or computationalism in general. The idea is that inferential relations are to be cashed out by computational/functional/causal relations among non-intentionally characterized symbol tokens.[31] This point is very important. Indeed this is the first point where content functionalism and the NCA of typing tokens comes into contact, since a content functionalist is now in need of non-intentionally individuating symbols so that she can assign semantic content to them on the basis of certain of their computational relations. I will come back to this point below.

Second, as I pointed out previously, a symbol token, in a certain obvious sense, has indefinitely many potential inferential relations.[32] For instance, #Fa#[33] can be inferentially connected to any other token provided that the agent has the further appropriate tokens and the "syntactic" (or, proof-theoretic) version of some of the basic logical generalizations (L-generalizations) like Modus Ponens, Conjunction Elimination, etc., in her computational repertoire. We called them L-generalizations. So, for example, #Fa# is inferentially connected to #Ys #, through Modus Ponens* and the expression #Fa-->Ys # for any predicate Y and any constant s . L-generalizations achieve generality by quantifying over specific symbols. Modus Ponens, for instance, says that given any conditional and its antecedent, its consequent may be inferred. No particular conditional needs be mentioned. This makes it clear that the meaning-constitutive inferential relations of a token cannot be any such potential relations secured by L-generalizations. Put differently, since L-generalizations make any given token potentially connected to any other actual or possible tokens, Mentalese tokens cannot be type-individuated (intra or interpersonally) with only L-generalizations.[34] Rather, some of the individuating generalizations must be more restricted, immediate and specific. Above, we called such generalizations S-generalizations. Such generalizations must not quantify over specific symbols. Their quasi-canonical form can perhaps be given as such:

(C) For all S, if S comes to have #Fx# in her B-box, then, ceteris paribus, S will come to have #Gx# in her B-box.[35]

Here are some putative examples, some of which are perhaps more likely to be true than others:

(1) For all S, if S comes to believe* that #x is a bachelor#, then, ceteris paribus, S will come to believe* that #x is unmarried#;

(2) For all S, if S comes to believe* that #x is assassinated#, then, ceteris paribus, S will come to believe* that #x is dead#;

(3) For all S, if S comes to believe* that #x is a cat#, then, ceteris paribus, S will come to believe* that #x is an animal#;

(4) For all S, if S comes to believe* that #x is a star#, then, ceteris paribus, S will come to believe* that #x is a celestial object#;

(5) For all S, if S comes to believe* that #x is a tiger#, then, ceteris paribus, S will come to believe* that #x is dangerous#.

It is on the basis of such generalizations (in addition to L-generalizations) true of a token that its inferential relations are specified.

Third, although I specified the S-generalizations as quantifying over all subjects, S, it remains to be seen what the domain of `S' will actually be for each generalization. At one extreme, the generalizations may be true of one organism; at the other, they may range over an entire population, or even species. True enough, content functionalism, if it is to vindicate intentional realism and a scientifically respectable intentional psychology, must not make the domain of such generalizations restricted to individuals or indeed to subcultures or perhaps even to cultures. Intentional psychology is meant to be the psychology of the folk. Hence content functionalists must mean their S-generalizations to cover at least the folk.

Fourth, as I mentioned in characterizing NCA above, the canonical way of specifying the narrow functional role of a token is by way of the Ramsey sentence of the generalizations covering that token. Informally, the (narrow) meaning of a token is to reduce to its functional role. This functional role is to be fixed on the basis of generalizations that cover it through the standard Ramsey-Lewis-Loar method. However, not all S-generalizations that cover a token might go into the Ramsey sentence that would determine its semantic content. Content functionalism typically assumes that only some of all available generalizations will go into determining content. Hence, we can talk of a "holist" functional role of a token with its Ramsey sentence containing all the available generalizations covering it, and we can also talk of a "localist" functional role with its Ramsey sentence containing only some of all the available generalizations covering the token.[36] The way I initially characterized content functionalism above is to be understood in this latter sense. (See below.)

Fifth, since, according to functionalism, the content of a token is meant to be determined by its localist functional role, and since distinct functional role means distinct content, and vice versa, it is assumed that the generalizations will fix unique roles for each intuitively distinct content. Thus it is assumed that there are enough generalizations in the relevant "localist" Ramsey sentence to fix a unique role for each distinct content.

Sixth, the meaning-constitutive generalizations must in some intuitive sense be lawlike. I am not sure what this means exactly. But at a minimum it is supposed to ensure that generalizations must go beyond describing statistical summaries of what has actually caused what, that they support counterfactuals in appropriate ways.[37] (I will come back to this point in a moment.)

Now given these clarifications and distinctions, we may characterize narrow content functionalism as follows. There is a theory T consisting of a set of S-generalizations[38] such that via Ramsefication they secure a unique functional role for any contentful token and this role is the (functional role -- narrow) content/meaning of the token. The generalizations of T are true of all those with a reasonably common intentional psychology (more or less assumed to be the Folk with a certain common conceptual repertoire). Thus the lawlike generalizations are interpersonally applicable.

Let me emphasize again that the generalizations in T are not assumed to exhaust all the generalizations covering any particular token whose content is secured by T. In particular, there may be, in the first case, other equally interpersonally applicable (and, perhaps, lawlike -- see below) generalizations covering all (or most, or some) tokens covered by T, but they need (or perhaps must) not be put in T. At least, this is a question left open. Intuitively, they may not be meaning-constitutive S-generalizations. Perhaps, (4)-(5) may be given as examples here. Call such interpersonally applicable S-generalizations not meant to be in T (hence not meaning-constitutive), pluralist.[39] In the second case, there may be many other S-generalizations only true of single individuals' tokens otherwise covered by T, an example of which may be:

(6) For all S, if S comes to believe* that #x is a bachelor#, then, ceteris paribus, S will come to believe* that #x is a neurotic#.

Suppose this is true only of me; so the domain of `S' is the unit set {Murat}. Intuitively, this would be the case, for instance, if I were to strongly believe that all bachelors are neurotic. Then it is true of only me that if E were the tokening of #x is a bachelor# in me, then E would tend to cause the tokening of #x is a neurotic# in me. Call such S-generalizations singularist. Then my #bachelor# token is covered both by (1) presumably in T and by (6) not in T.

Content functionalists' informal claim that the meaning of a token is constituted only by some of its inferential relations is to be therefore taken in the sense that only some interpersonally applicable S-generalizations covering a token can be meaning-constitutive, i.e., can go into T which is, via Ramsefication, the sole basis of securing a unique functional role for the token that is to be identified with its (narrow) meaning.

Now, what is wrong with content functionalism so characterized? Surprisingly, it is very hard to tell from the way people write on and complain about functionalism. Fodor is typical in this regard. Part of the reason, I think, is that Fodor ignores many of the above clarificatory points and distinctions in his discussion. But it is possible to discern his bottom line. He accuses the content functionalist of being holist.

Semantic holism is the doctrine that the meaning of a token is constituted by all of its inferential relations. But this statement carries with it all the obscurities of the original phrase `inferential relation.' Given that the functional role of a token is fixed by the generalizations covering it, semantic holism, in the context of Fodor's accusation, must be the doctrine that the content of a token is constituted by all the generalizations that cover it. But what is supposed to be wrong with this?

The official word for why holism is to be avoided is that it is destructive of intentional psychology: it is supposed to make meanings not publicly sharable. As can be seen from our discussion, a destructive sort of holism follows from functionalism only if by `all' we mean all the available generalizations covering a token including not only the pluralist but also the singularist ones. It must be obvious that even if we decide to make the pluralist S-generalizations meaning-constitutive by incorporating them into T, a destructive sort of holism still doesn't follow, not unless without some further assumptions. Call this sort of holist functionalism Non-Destructive (ND-holism for short). So Fodor's worry about holistic functionalism must be a worry about a version of the theory that also takes the singularist S-generalizations into account in fixing a functional role, hence (narrow) meaning. Only then does functionalism become destructive.

But why take functionalism in this version? As Fodor put it once, in this form functionalism, as a naturalistic semantics, becomes suicidal: it ends up naturalizing meanings that no one seems to share. This wasn't at all the notion of `meaning' that functionalism had envisioned to naturalize to begin with. Indeed why take functionalism in this way?[40]

Fodor seems to think that a localist or even a ND-holist functionalist is somehow forced to become a holist of a destructive sort. But what are the grounds for thinking that? Why is a localist functionalist forced to become a destructive sort of holist? Why can't the content functionalist remain localist, or even a ND-holist, admitting as meaning-constitutive only those generalizations true of all those organisms with, intuitively, a reasonably common intentional psychology and discarding all those generalizations with intolerably narrow scope? Indeed, this way seems quite natural for a content functionalist who wants to avoid the destructive consequences of holism.

Unfortunately, there is no clear answer to this question in Fodor. His official line is to be found in an argument he reconstructs on behalf of holists, which he calls the "Ur-Argument":[41]

(i) The semantic content of a Mentalese symbol is constituted by some of its inferential relations.

(ii) No principled distinction can be drawn between those inferential relations that constitute the content of a symbol and those that don't.

(iii) Therefore, the content of a symbol is constituted by all its inferential relations.

(i) is apparently the position of localist functionalists. (ii) is supposed to have been established by Quine's argument against drawing a principled distinction between analytic and synthetic statements. Hence (iii), which is the position of holists. As I said, in order for Fodor's worry about holism to be warranted, (iii) must be read as referring to inferential relations captured by S-generalizations including the singularist ones. I will be charitable and assume that this is what Fodor has in mind.

But what does the analytic/synthetic (a/s) distinction have to do with functionalism? When reconstructed in our terminology, (ii) seems to say that there is no principled distinction between those generalizations covering a token that are in T (meaning-constitutive) from those that aren't (non-meaning-constitutive). Is this claim the same with Quine's claim on the untenability of the a/s distinction? And is it true?

The answer to the former question is, strictly speaking, negative, although as we will see in a moment the two claims are intimately related. Functionalists (local or holist) cannot appeal to intentional idioms like the a/s distinction in their attempt to explicate what meanings are. Being naturalists all there is in their disposal are causal/functional/computational relations. At best, they have to reconstruct the a/s distinction, if they deem it worth doing, in terms of such naturalistically acceptable relations as specified by causal/functional/computational generalizations.

But is it true that there is no principled distinction between those causal generalizations covering a token that are meaning-constitutive from those that aren't? Here is one suggestion about how to draw such a distinction that seems natural given the dire consequences of destructive holism. A generalization is meaning-constitutive (hence in T) only if its scope is sufficiently large, intuitively, to accommodate all who have a common psychology.[42] For instance, the very reason why we shouldn't count (6) as meaning-constitutive is precisely because its scope is not large enough: it applies only to me. So the difference between (1) and (6) lies precisely in the fact that (6) is idiosyncratic in its application only to me, whereas (1) seems to be true of all those who have, intuitively, the concept of a bachelor. This move would establish a difference in such a way that avoids holism's destructive consequences. The suggestion tries to establish a difference between those generalizations that are in T from those that aren't in terms of their application scope or domain.

But, whatever other problems the proposal may have (and there are plenty), unless the localist functionalist moves into a ND-holist position, it falls short of its intended target, because the question then becomes: What decides on which ones go into T from among all those interpersonally applicable S-generalizations (i.e., generalizations with a sufficiently large scope)? Intuitively, suppose (1)-(5) are of equally large scope. What is to distinguish between them -- if anything -- as to which ones go into T? Is there, for instance, a difference between (1) and (2) on the one hand, and (4) and (5) on the other in that regard?

It is at this point one is tempted to appeal to a de-intentionalized version of a/s distinction. The problem here isn't that Quine was right and so there isn't a principled a/s distinction. For even if there were it wouldn't be available, as such, to a localist content functionalist on pain of violating the strictures of naturalism. So the functionalist must come up with a distinction, which would perhaps reconstruct the semantic a/s distinction in non-semantic (causal/computational) terms. What could the basis of such a distinction be?

There are at least two options available to the localist at this juncture. One is to admit that the job of finding such a naturalistically acceptable distinction is hopeless and move into a ND-holist position, to which I will come back below. The other is to try what seems to be an intuitively natural proposal: the difference between the S-generalizations that are in T and those that aren't (pluralist ones) is that whereas the former are truly lawlike, the latter are not. So far we haven't emphasized this aspect of functionalism, and assumed that all the generalizations we have talked about are equally lawlike. But perhaps this assumption isn't true. As I said, I am not sure what exactly lawlikeness comes down to in the case of S-generalizations, and so am not sure whether the following rough and ready characterization of the proposal can be further elaborated and successfully defended. I just want to put it on the table on behalf of the localist functionalist.

A generalization is lawlike only if it supports counterfactuals. (But not every counterfactual supporting generalization may be lawlike.) So, for instance, the generalization `in C, (x)(Fx-->Gx)' is lawlike only if it is true that in C if this were (had been) an F, it would be (would have been) G. Given some of the most popular accounts for evaluating counterfactuals, e.g. Lewis' (1973) and Stalnaker's (1968), the S-generalizations (1) through (6) and the likes (with perhaps their differing domains and to various degrees), if true, turn out to be counterfactual supporting. This is hardly surprising. Such generalizations describe the causal regularities among state transitions of the systems under their domain. These regularities are described at the level of state types. Intuitively, a Ramsefied functional theory consisting of such generalizations describe the abstract functional/causal structure and organization of individual systems/organisms under its scope. If the theory applies only to me at a time, then it describes the causal potential of my state types at that time. So, for instance, suppose (6) is part of a functional theory of me, and only me, at t. Then it is counterfactually true of me at t that if I were to token #John is bachelor#, I would, ceteris paribus, tend to token #John is neurotic#.

For all that, on the other hand, (6) may not be lawlike if its application domain is only me and it is restricted to particular times. Unfortunately, there doesn't seem to be any consensus on a recipe of what needs to be added to a counterfactual supporting generalization to turn it into a law. But it is clear, at least intuitively, that laws are not the kind of things that are parameterized for particular times and populations who are relevantly homogeneous. If this doesn't seem intuitively clear with respect to laws or lawlikeness, we can perhaps introduce a term, "strongly counterfactual supporting" (strongly-CS in short), to capture this aspect of being relatively free from temporal and regional restrictions (in addition to whatever laws are beyond simply being counterfactual supportive), and define it thus:

a generalization is strongly-CS if an only if in all the nearby (otherwise nomologically) possible worlds, where the antecedent is true of the same population over a given period of time, its consequent is also true.[43]

So for instance, it may be said that (1) and (2) are strongly-CS in this sense (or else, intuitively, there is no token with the contents [bachelor] and [assassinated], hence no concepts of bachelor and assassination), whereas (4) and (5) are not.[44] Take, for instance, (5). There seem to be indefinitely many nearby nomologically possible worlds in which no organisms tend to think that something is dangerous upon thinking that it is a tiger, and this seems to hold independently of whether in those worlds tigers, if they exist, are dangerous. Similarly for many others. In fact, it appears that we don't even have to look at other worlds. To make the point, the actual world will do. For instance, (4) is said to be false of ancient Greeks (it may even be false of some of our contemporaries) of this world: they apparently believed that stars were holes in the celestial spheres that the cosmic fire shows through (cf. Fodor 1987:88-9). That such generalizations are not strongly-CS in the sense defined, a localist functionalist might say, should hardly come as a surprise in fact. For, intuitively, their truth depends on what actual beliefs* people have in common at a time, or what common content*-specific inferences* they are actually disposed to engage at a time, and to a very large extent this doesn't seem to be a matter expressible in terms of nomological necessities.[45]

We may put the problem slightly differently as follows. Among such generalizations, many do not seem to be necessary for a subject to have a particular concept*. For instance, for a subject to have tiger-thoughts*, it does not seem necessary that the generalization (5) should hold for her, even though (5) may hold for everybody else.[46] But this does not seem to be true of, at least, (1)-(2). There indeed seems to be a difference between (1)-(2) and (4)-(5). That, say, (1) should hold seems to be necessary for someone to have any bachelor-thought* at all. It seems that this is another way of saying that (1), if anything, is a strongly-CS generalization. Anyone for whom (1) is false ipso facto can't have bachelor-thoughts*, or so it seems. In other words, in all the nearby nomologically possible worlds in which bachelor-thoughts* are defined, anyone who comes to think* that #x is a bachelor# tends, ceteris paribus, to think* that #x is unmarried#.

Perhaps, this is indeed the way to reconstruct the a/s distinction: if there are strongly-CS content generalizations, they seem to detail the conceptual or analytic relations among contents, and vice versa. One is even tempted to give a demonstration: Suppose that `B*#F#-->B*#G#' is such an S-generalization, then in every (nomologically) possible world, whenever someone comes to have an #F#-thought she tends to have a #G#-thought. Now suppose that `F-->G' is not a conceptual/analytic truth. Then it is possible for someone to have an F-thought without ever being disposed to have a G-thought. But this seems to contradict the original assumption that `B*#F#-->B*#G#' is such a strongly-CS generalization, supposing that `B*#F#-->B*#G#' implements the intentional generalization, `B(F)-->B(G).'

Conversely, if there are conceptual/analytic relations among contents then they can be specified in terms of strongly-CS content generalizations. Intuitive quasi-proof: Suppose that `F-->G' is a conceptual truth, then it follows that it is not possible for someone to have an F-thought without being disposed to have a G-thought. But this just means that `B*#F#-->B*#G#' is a strongly-CS generalization, supposing, again, that `B*#F#-->B*#G#' implements the intentional generalization, `B(F) --> B(G).'

If this is right, then, a localist functionalist might claim that the attempt to specify functional roles of tokens in terms of strongly-CS S-generalizations is the attempt to specify tokens' analytic connections. Analyticity reduces to strong counterfactual support (or, perhaps, to lawlikeness, if you like).[47]

As I said, I am not sure whether the proposal can be further clarified and sustained. My point, at this stage, is this: even if the proposal to distinguish between S-generalizations in terms of strong counterfactual support can be successfully elaborated and defended, and even if the distinction is shown to reconstruct the a/s distinction, and even if we assume, contra Fodor, that Quine's criticism of the a/s distinction can be successfully met, there is the following obvious problem.

Remember that there must be enough interpersonally applicable and lawlike (= strongly-CS -- in what follows I will use them interchangeably for the sake of brevity) generalizations to secure unique functional roles for each intuitively distinct content. Are there enough S-generalizations like that? If we assume, along with the localist functionalist, that many such proposed S-generalizations will mirror the conceptual/analytic connections among concepts as traditionally/historically understood, such as the intentional versions of the likes of (1) and (2), it is obvious that there aren't enough of them to secure unique functional roles for almost any content. Interestingly enough, this point is acknowledged by some functional role semanticists. Block, for instance, writes, in a recent polemical article written against Fodor and Lepore's criticism, thus:

Fodor and Lepore seem to assume... that... the inferential role theorist has the option of appealing to analyticity as a way of discriminating the inferential liaisons that are in inferential roles from those that are out. But if we stick to traditional ideas about the extension of `analytic', there aren't enough analyticities. Consider the putative analytic truths involving `cat' -- `Cats are animals', `Cats are living beings', `Cats are grown up kittens', etc. The problem is that abstracting from the words `cat', `kitten', etc., appearing in these sentences, there is nothing here to distinguish `cat' from `dog'. Corresponding to `Cats are grown up kittens', we have `Dogs are grown up puppies'. Sure, `nothing is both a cat and a dog' can be used, but so can `nothing is both a dog and a cat'. Even if `Cats are feline', and `Dogs are canine' are analytic, this is of no help without other analytic truths that distinguish `feline' and `canine'... (Block 1993: 3-4)[48]

Trying to secure a unique functional role for every token on the basis of S-generalizations that detail "conceptual" relations among them is like trying to define every concept or term. But even if it were possible to give a few necessary/analytic connections for some concepts or even for each concept, it would be overly optimistic, to put it mildly, to claim that each concept could be given both necessary and sufficient conditions for its application, especially after observing, as Fodor once put it, the failure of philosophy to define any term of any significance after two millennia. Similarly, there is no reason to expect that each token (with a distinct meaning) can be secured a unique functional role in terms of S-generalizations like (1) and (2).

It is, then, the pressure to secure unique functional roles that forces the local functionalist to a holistic position. To secure uniqueness, there must be enough generalizations. The scarcity of "analytic" generalizations forces the functionalist to look for other sorts of generalizations that pretty commonly hold among the folk but specify causal transitions between states whose representational contents, intuitively put, are only contingently connected. (4), (5) (and, even 3) may be examples to these kinds of S-generalizations. Hence the move to a ND-holist functionalism.

So let us take up ND-holism again. There are now three problems that such a functionalist must solve. One is that T now will contain S-generalizations that don't seem to be lawlike or strongly-CS. As I said, such generalizations as (4) and (5) seem to detail the causal profile of tokens whose representational contents are only contingently connected. As such they are like commonly held ordinary empirical beliefs. And the inferential relations among such beliefs don't seem to be a matter expressible in terms of nomological necessities. Put this worry temporarily aside. It may be that the functionalists' initial requirement that the generalizations that go into T be lawlike was just too strong. Perhaps mere counterfactual support is enough for them to go beyond being descriptions of mere statistical summaries of what typically causes what. If so, it was perhaps a mistake to demand that they be lawlike.

The second problem, however, is more serious. As I characterized ND-holism, what makes it non-destructive is that the S-generalizations are applicable interpersonally. But how do we determine how big their application domain actually is or ought to be. I said that their domain should be sufficiently large to cover all those with a reasonably common intentional psychology. Even if we ignore the potential circularity in specifying the domain in such intentional terms as `common intentional psychology,' it is the problem of vagueness which seems to be so bothersome and intractable. How many people are enough for an S-generalization to count as meaning-constitutive? Surely there is no principled boundary to be drawn by just counting heads here. Furthermore, even if we come up with a set of S-generalizations whose scope is sufficiently large to secure unique roles, is there any guarantee that this will be the case for each concept* at any given time?

Relativization to particular times is also a serious problem here since there is no guarantee that the set of such generalizations will not exhibit variations over time, both in terms of the number of heads they cover and in terms of admitting "new" and dropping "old" generalizations in the set. In fact, given that we dropped the requirement of lawlikeness as characterized above, there is every reason to believe that the S-generalizations in T, even though may have a sufficiently large population as their domain, will exhibit variations over time. This is intuitively clear if we think of the situation before, suppose, psychologists' discovery that all bachelors are neurotic, and after. Consider also the period during which the belief* that #all bachelors are neurotic# becomes popular, getting out of psychologists' labs and spreading to an entire population. Consider also skeptics, disbelievers, etc.

This brings us to the third problem, which is that even with all such pluralist S-generalizations (with sufficiently large scope for a particular time period) included in T, there is still no guarantee that a unique functional role will be secured for each distinct concept. It is obvious that the main pressure here is to secure uniqueness: as far as this is the sole non-negotiable constraint on individuation of functional roles, there is no guarantee that the S-generalizations covering a token will not become increasingly narrower in scope of application, and at the end, restricted perhaps only to single individuals. In fact, there is every reason to believe that they will.

This worry is exactly what lies at the source of Fodor's pessimism that such content functionalism inevitably leads to a destructive sort of holism, since there doesn't seem to be a robust and stable way of individuating functional roles interpersonally and in such a way that would also guarantee their projectibility (their appropriateness for genuine lawlike generalizations). With the move to accepting S-generalizations into T that are not lawlike and the desire to secure uniqueness, the functionalist moves into a position into which a serious vagueness is essentially built. This is already disturbing holistic terrain, since it is the first and apparently inevitable step taken towards specifying functional roles in terms of all the available S-generalizations including the singularist ones. So the slippery slope to a destructive holism is grounded on the constraint for uniqueness.[49]

The trouble, then, arises in trying to simultaneously satisfy the three constraints on the S-generalizations: lawlikeness (or, being strongly-CS), uniqueness and interpersonal applicability. The solution, under the pressure of securing unique functional roles, is to appeal increasingly more to those S-generalizations whose scope is increasingly narrower and at the end restricted to single individuals. As we have seen, this also results in forcing the functionalist to sacrifice the constraint that generalizations used in the fixation of a functional role be strongly-CS or lawlike. This is, then, how you get a destructive sort of holism from localist (as well as ND-holist) functionalism. Localist functionalism all by itself doesn't seem to suffice to specify unique functional roles for content that are robustly applicable across people. But if localist functionalism is abandonned, lawlikeness is abandonned with the move to ND-holism, and then the pluralist scope of S-generalizations is given up, which leads exactly to holism of the sort Stich was accusing the content theorist of.

Now, as I said in the beginning, this is not quite the way Fodor, or for that matter any one else, characterizes the discontent with functionalism. Fodor's early attacks against holism assumed the truth of the second premise of the Ur-argument. But instead of directly arguing for it Fodor left the issue as if it were already established by Quine's attack on a/s distinction. As I tried to show above, defending (ii) on Quinean grounds is not available to Fodor, nor to a (holist) functionalist. Its defense must be conducted on non-intentional grounds. And this is what I've tried to do above.

Now with this characterization of content functionalism and of its troubles at hand, let us go back to my claim that the grounds for rejecting content functionalism are the same with rejecting functionalism in individuating Mentalese tokens across systems, hence rejecting Stich's STM.

If what I have said so far is correct, it is relatively easy to see how exactly parallel problems plague NCA of typing Mentalese tokens (=brain states with particular syntactic objects mapped onto them). Indeed the two enterprises are in fact identical as far as S-generalizations are concerned. The reason is that according to the content functionalist the (narrow) semantic content of a token is metaphysically constituted by the functional role it plays, and the job of specifying this functional role is exactly the job of type-individuating the token on the basis of its functional role as uniquely specified by S-generalizations. In fact, as I hinted above in my first clarificatory point about inferential relations, the content-functionalist needs a vocabulary to state the S-generalizations. I said that this vocabulary cannot be intentional for fear of violating the demands of naturalism. So the vocabulary in stating S-generalizations ought to pick up symbols as they are non-intentionally characterized. The device with `#'s and `*'s I used above in giving examples for S-generalizations in a way conceals the urgency of the problem by implicitly using the semantics of English. Take, for instance, the S-generalization (1) above:

(1) For all S, if S comes to believe* that #x is a bachelor#, then, ceteris paribus, S will come to believe* that #x is unmarried#;

Putting aside the problem of computationally specifying what believing* is supposed to be, the functionalist needs to answer the question of how `#x is a bachelor#' and `#x is unmarried#' refer to what they are supposed to refer to. Their referents are supposed to be Mentalese symbol types that are individuated across systems on the basis of their non-intentional, and presumably, non-(quasi)-physical, properties that obey a certain set of causal/computational regularities. Such a vocabulary is not generally available to a functionalist prior to having already established these regularities on the basis of which she proposes to type-individuate Mentalese tokens, i.e. on the basis of their narrow functional profile across systems -- see the next section.

The trouble with content functionalism is the trouble of getting a robust and stable individuation criterion for functional roles that are interpersonally applicable. If this is right, it is obvious that exactly the same worry must attach to the project of individuating syntactic Mentalese tokens across systems on the basis of their functional roles. Indeed, the projects are identical: while a content functionalist wants to assign a content, say, [water is wet], to a Mentalese token on the basis of its interpersonally applicable functional role, an STM-like functionalist wants to assign to it the syntactic type #water is wet# on exactly the same basis.

I think it must now be transparent why the job of assigning a content to a token on the basis of its functional role is exactly the same as the job of type-individuating it on the basis of its functional role. Indeed, the content is supposed to metaphysically reduce to the functional role (non-intentionally characterized). This is after all what naturalism is supposed to be on a functionalist research program. I conclude, therefore, that to the extent to which narrow content functionalism has the problems I claimed it does to that extent the NCA-cum-STM has exactly the same problems. Stich is promoting a scientific framework for psychology that lacks the resources to type-individuate the relevant mental/brain states in a robust and stable manner in a way that respects interpersonal applicability of its theories.

6.2 Connections to Stimuli and Behavior

Before ending this section, we need to address one more issue. Although I characterized NCA as involving three kinds of generalizations detailing causal relations of a symbol token to stimuli (I-T laws), other symbol tokens (T-T laws), and a proprietary set of behaviors like basic motor-gestures (T-O laws), in my discussion of content functionalism I addressed only the S-ge