Procflyctoon" putting words 5 mi one's moyth We are quick to recognize the exceptional precocity of talented writers, artists, or athletes, but we often fail to appreciate the gifts underlying so many of our everyday activities. It is only through loss or injury that we suddenly realize how much we take them for granted. The skill involved in such a literally pedestrian activity as walking down a flight of stairs is immediately recognized after one has sprained an ankle. It is only then that we begin to appreciate the marvelous manner in which the visual input from our eyes and the tactile information from our feet transmit complementary information to our brain's sensory cortex. There it is immediately synthesized and fed to corresponding areas of the motor cortex which, in turn, feeds the cerebellum, the part of the brain devoted to the programming, timing, and coordination of all voluntary muscular movements. From the cerebellum radiate hundreds of simultaneous messages along the nerve pathways which go to the appropriate muscles involved in the head and neck (to focus the face and eyes downward toward the stairwell), in the back (to keep the posture erect and tilted slightly backward to compensate for the downward motion of the body), in the arms and hands (to slide down the banister for continual support and feedback), in the legs (to maintain a lifting and dropping motion quite different from normal walking), and in the feet (to angle the foot in just the right manner so that the ball of the foot catches the stair). Even this elaborate description is a gross oversimplification of the neurosensory and neuromuscular processes that are involved at any single moment of a descent down a staircase. But all of this is taken for granted and considered uninteresting, until we stumble and injure ourselves. Loss of what we consider the -i (\ qiipvcv simple and common gives us renewed appreciation of life's uncommon complexity. The production of speech is neurologically and psychologically far more complicated than negotiating a flight of stairs, but its intricacy also goes unappreciated until we suffer some linguistic disability or commit a slip of the tongue. In daily conversations, we remain generally unaware of the complexity of our achievement. Again, it is only through disability that our marvelous ability is made manifest. We have already seen in the previous chapter that psycholinguists tend to divide linguistic phenomena into stages. One of the most influential psycholinguists models for speech production, developed by Levelt, views it as a linear progression of four successive stages: (i) conceptualization, (2) formulation, (3) articulation, and (4) self-monitoring. We will look at each of these in turn, not forgetting that viewing speech phenomena as a step-by-step sequential process is only one way of investigating production. Alternative approaches exist; for example, characterizing the production of speech as a holistic activity where several simultaneous and parallel activities are taking place to create the utterances we intend to produce. Conceptualization Where does the very beginning of any spoken utterance come from? What sparks speech? These are difficult questions to answer, partly because we still don't know enough about how language is produced, but partly because they deal with mental abstractions so vague that they elude empirical investigation. The American psycholinguist David McNeill, however, has gone on record with an interesting mentalistic account of how speech is first conceptualized in the human mind. His theory is that primitive linguistic concepts are formed as two concurrent and parallel modes of thought. These are syntactic thinking, which spawns the sequence of words which we typically think of when we talk about how language is initiated, and imagistic thinking, which creates a more holistic and visual mode of communication. The former is segmented and linear and creates the strings of syllables, words, phrases, and sentences that together make up speech. The latter is global and synthetic and tends to develop the gestures which we naturally use to punctuate and illustrate our conversations. McNeill's claim, that syntactic thought and imagistic thought collaborate to conceptualize conversation, is quite convincingly demonstrated by the way in which speech utterances and ordi- 1 nary gestures seem to be tied and timed together in any conversation. Consider the following very simple example. Two people are holding a short discussion over the whereabouts of a lost object. Visualize in your mind how they gesture as they interact in the following two dialogues. You might even try reading these aloud, acting out Person B's role by pointing at the appropriate moment. First dialogue Person A: Where's my briefcase? Person B: There's your briefcase! Person B points to the briefcase the same moment he says There's. Second dialogue Person A: Where's my coat and briefcase? Person B: There's your briefcasel Person B points to the briefcase the same moment he says briefcase. What are the very first things that are going through Person B's mind when she is responding to Person A's questions in these two dialogues? Of course we cannot be too mentalistic and pretend we know what B is thinking. After all, we are often unsure of what we are thinking ourselves when we think about what we think, if we think about thinking at all. This is the problem with mentalism. But McNeill offers some plausible evidence for this bimodal view of how speech is produced. It seems likely that after B hears A's query in the first example, her syntacric thought might generate something that begins with the demonstrative, 'there' while, simultaneously, her imagistic thought might be of someone pointing toward an object, in this case, a briefcase. Evidence that these two modes are operating concurrently at the conceptualization stage is found in the simultaneous timing of the pointing gestures with the stressed words in each of these two scenes. In the first dialogue, B points to the briefcase (manifesting the imagistic part of her attempt to communicate) just as she stresses the word 'there' in her speech (illustrating the syntactic component of her communicative intent). Again, in the second dialogue, we see the synchrony of image and speech; at the end of the phrase B points to the briefcase just as she stresses the word in her articulation. If you read this last example out loud, you will also note a slight change in B's intonation—the voice trails off a bit as if to say 'There's your briefcase ...' Were B suddenly to spot the coat, she could continue with 'and there's your coat\ with a more decisive, falling intonation on 'coat' and, of course, another pointing gesture to show A where his coat was located. Appealing as McNeill's hypothesis might appear, and convincing as these examples might be, it is difficult to use his model to explain this first stage of production. For one thing, his attempts to describe how imagistic and syntactic thought are initially conceptualized are unclear. For another, the illustrations he uses to describe how gestures synchronize with important syntactic breaks in spoken language are difficult to follow. Perhaps this form of research, like studies of American Sign Language, can only be adequately illustrated by a videotape and not by drawings. Levelt's initial stage of conceptualization seems justified. After all, speech does not start from nothing, and if it does not start with concepts, how else could it possibly begin? At the same time, we realize how difficult it is to actually define this stage in non-mental-istic terms, and despite the plausibility of McNeill's binary model of language and gestures being birthed together, like twins, it is difficult to muster any hard evidence to support this, or any other theory for the embryonic development of speech. Although we know very little about how speech is initiated at this first stage of conceptualization, we have psycholinguistic evidence to help us understand the successive stages of production, so it is easier for us to describe and to understand Levelt's second stage, formulation. Formulation Introduction We have seen that the initial stage of conceptualization is so far removed from the words we actually speak and write that it is difficult to delineate this phase of production. But at the second stage of speech production, formulation, we move close enough to the eventual output of the process to allow us to be more precise in our terminology and more convincing in our use of empirical data. Conceptualization is hard to conceptualize, but formulation is much easier to formulate. Well over three decades ago, the psychologist Karl Lashley published one of the first attempts to account for the way speakers sequence strings of sounds, words, and phrases together so rapidly and accurately, and his essay was influential enough to be included in the first book ever published in English which focused exclusively on the then very new field of the psychology of language. His essay was first presented as an oral address, and it is intriguing to see how Lashley organized it to demonstrate some of the very concepts about speech production which he was writing about. For example, he talked about how common it is to commit spelling errors when one is typing, and he mentioned how he misspelled 'wrapid' with a w, while typing 'rapid writing', most probably because as he was about to type 'rapid', he anticipated the 'silent w' in the following word. These slips of the tongue, or pen, or computer keyboard, are of keen interest to us in this chapter on production. A moment later in his talk, to illustrate several of the themes that were central to his presentation, Lashley gave the following utterance as an example of how we comprehend spoken sentences. Rapid righting with his uninjured hand saved from loss the contents of the capsized canoe. Remember that this sentence was beard, not seen, so having been primed by the earlier phrase 'rapid writing', it was natural for the audience to hear 'Rapid writing with his uninjured hand!' Of course, like all native speakers of any language, the listeners were able to readjust their comprehension of this sentence. After they recognized they had initially wandered down the wrong garden path of comprehension, they were forced to retrace their steps, and to choose the proper path toward complete understanding. Thus Lashley was able to demonstrate many of the themes which were central to this seminal essay on speech production. First, he showed how slips of the tongue (or the computer keyboard) provide vivid insights into our understanding of how speech is formulated. Second, he illustrated the power of priming in guiding the direction of speech production and comprehension. Because Lashley first talked about 'writing', his audience was primed to hear the phrase again, and this is what confused them initially when they heard 'rapid righting' as part of an utterance about a canoe. Note also that it is possible that priming works in the production process as well. A critical insight from this example, and from Lashley's essay as a whole, is the way it demonstrates how both the production and the comprehension of speech is largely a linear process. The audience didn't know that Lashley had purposely misled their comprehension until they suddenly heard something about 'saving the contents of the capsized canoe'. People tend to produce and comprehend sentences in a linear way, and for comprehension, each additional piece of information we receive has the potential to force us to revamp our understanding of what we have already heard. The comprehension of the 'canoe' example, taken from Lashley's lecture and subsequently published paper, lends credence to the notion that, in several ways, production and comprehension are similar. Both are largely sequential, both are affected by priming, and they both depend, to a large degree, on the constant winnowing of implausible alternatives at each juncture in our stream of speech. Slips of the tongue Think back to the example at the beginning of this chapter of how we tend to ignore the complexity of strolling down a flight of stairs until we trip. Over the past few decades, psycholinguists have become excited about a new way of discovering how we put words into our mouths: they look at what happens when we trip over our tongues. Unlike stammering or aphasia (linguistic loss due to brain damage), slips of the tongue, or typographical mistakes, are normal, everyday occurrences which pervade our speaking and our writing. And because of this, as soon as our friends spot our mistake, or we happen to catch the goof ourselves, we can immediately backtrack and correct. However, when speech and language disintegrate into clear pathologies, as they do in stammering and aphasia (to be discussed in Chapter 5), there appears to be no recourse, and the error remains uncorrectable and uncorrected. So we can see why slips of the tongue provide the data that delight psycholinguists; they allow us to peek in on the production process because we know what the speaker intended to say, but the unintentional mistake freezes the production process momentarily and catches the linguistic mechanism in one instance of production. The use of linguistic deviations as data for scientific investigation is a new phenomenon, but the recognition of speech errors goes back more than a century. Spoonerisms, like the unfortunate use of 'the breast in bed' instead of 'the best in bread', are named after the Victorian cleric and teacher, William Spooner, who reputedly blundered through many a lecture or sermon with infamous slips in speech production. He called a group of Welsh miners 'you noble tons of soil' and supposedly scolded an errant student by saying, 'you have hissed all my mystery lectures; in fact, you have tasted the whole worm!' Spoonerisms then are slips of the tongue in which an actual word or phrase is created, often with a humorous twist to the meaning which was intended. The mention of verbal miscues, especially ones like the 'bread' example just cited, evokes a discussion of Freudian slips. In one of his earliest treatises, Psycbopathology of Everyday Life, Sigmund Freud hypothesized that slips of the tongue were important because, like dreams, they help to reveal the unconscious mind. But most psycholinguists have ignored Freudian interpretations of speech errors for a variety of reasons. For one thing, although slips like 'the breast in bed' appear to be embarrassingly indicative of an unconscious desire, a coldly empirical approach to this mistake would propose at least two explanations for the source of this illicit feeling: either the speaker was sexually provoked (and was unconsciously thinking about the first noun in the phrase), or he was actually fixating on the second noun because he was so exhausted! Here, we run into exactly the same problem we faced earlier when we tried to define what the conceptualization stage for speech consisted of; we are in danger of becoming too mentalistic—that is, relying on logic and intuition rather than experimental evidence—in our attempts to fathom what exactly puts words into people's mouths. A more important reason why psycholinguists tend to ignore Freudian analyses of why who said what is because, irrespective of their meaning, the formulation of slips of the tongue reveals important linguistic patterns—patterns that also pervade normal and natural speech. That is, what is relevant to psycholinguistics is not what is being said, but how it is being said, or, to be precise, misspoken. But before leaving Freud, it might be illuminating to examine some recent work in the psychology of language that attempts to wed the Freudian, mentalistic tradition of psychology with the experimental school which so strongly colors contemporary psycholinguistics. This was the goal of an experiment in which university students were asked to read aloud two, unrelated words flashed quickly in front of them on a computer screen. For example, the subjects might have seen 'barn door' instantaneously flashed in front of them, and they either read them correctly, or, as was often the case, because of the pressure of time, came up with a slip of the tongue, such as 'darn bore'. There were three different situations: a control—or normal—situation, where the subjects had no distractions; a second situation where subjects knew that they might receive a small electrical shock at any moment, and a third situation, where they performed the task in a state of slight sexual arousal (the subjects were all male, and the experimenter was an attractive and well-dressed female). Here are examples of two stimuli phrases which were flashed to all three groups. (i)sham dock (2) past fashion Although most of the subjects were accurate most of the time under all three conditions, the slips of the tongue which did occur differed significantly among the three groups. The control group tended to make arbitrary errors, such as 'darn bore', but the two experimental conditions tended to elicit two different kinds of slips. When (1) was flashed to the subjects who were in the group that was threatened with a potential electrical discharge, they, much more frequently than the other two groups, came up with the slip 'damn shock'. And when (2) was shown to the group with the attractive female experimenter, they, as you have already anticipated, came up with the phrase 'fast passion' much more frequently than the others. This experiment comes about as close as we can expect to get to testing Freud's ideas under laboratory SURVEY conditions, or to catching a glimpse of the conceptualization stage of speech production. Be that as it may, our focus here is on formulation, and from all of the examples cited so far, we can readily see that slips of the tongue are not a random, haphazard zigzagging of the production mechanism, like quarks in a cloud chamber. Sounds and words are not thrown together arbitrarily; there is a clear, linear and hierarchical order to the way in which we put words into our mouths. There has been a long and rich tradition of examining speech errors in psycholinguistics as a window to the formulation process and not as a reflection of some Freudian motivation. Based on examples gleaned over the years, researchers have been able to demonstrate that these superficially trivial quirks of communication are quite useful in offering insights about how speech is formulated. For one thing, there is sure evidence from this data that the units of speech, such as 'phoneme' and 'morpheme', which linguists have proposed and discussed for many years are psychologically real. This means that when we misspeak, we make errors within the boundaries and the framework of a certain language structure, as if we had intentionally planned our slips to fit an appropriate linguistic slot. Mistakes do not pop out just anywhere when we speak, they occur at predictable points and follow predictable patterns. It is almost as if we think about syllables, words, and phrases as we are formulating what we are going to say, and this is why psycholinguists find slips of the tongue insightful. Let us review some of this evidence. Linguists divide sounds into vowels and consonants and sub-categorize each of these into various phonetic groupings. Speech errors seem to follow the phonetic classifications established by linguists and rarely, if ever, cross over these linguistic boundaries. Consider the following examples. (1) a reading list a /eading list (2) big and fat pig and fat (3) fill the pool fool the p/'ll (4) drop a bomb bop a dromb As trivial and silly as these mistakes may appear initially, they actually tell us a great deal about the organization of the English language. The anticipation of [1J in the third word in (1), creates the substitution of [1] for [r] in the second word. Phoneticians point out that [1] and [r] are two consonants which share many-phonetic features for example, both are pronounced in the same part of the mouth, so that this type of substitution would always be likely. There is no such phonetic explanation for a substitution of [1] for [sh] {e.g. 'shopping list' becoming 7opping list'), and, in fact, whereas flip-flops of [1] and [r] for each other pervade the miscue data, transpositions of [1J or [r] for sounds like [sh] are exceedingly rare. The second example is a bit more subtle, because at first sight it seems that [p] is randomly introduced into the phrase from nowhere. But again, linguistic analysis gives a clear explanation. The phonetic feature of voicelessness of the following [f] in fat seems to be anticipated when the speaker is about to produce the [b] in the first word of phrase (i). As it turns out, the voiceless equivalent of the consonant [b] is [p]. Although we are focusing on sound structure in these first few examples, it is also possible that speakers are simultaneously being influenced by other linguistic factors, so that the person who misspoke 'pig and fat' may have also been gently nudged by the semantic association between these words. In (3) we see the psychological reality of the contrast between vowels and consonants in the minds of speakers. The vowels in the two words replace each other. It is theoretically possible for vowels to substitute for consonants and vice versa, but again, this rarely occurs because they are so distinct linguistically. Finally, in (4) we see how speech errors follow rules about what consonants can go together to form clusters. We can't just put any two consonants together in English, or in any other language for that matter: although we have [dr] in 'drop' and [st] in 'stick', we have no words that begin with [sr] like 'srop' or even worse, [dr| as in 'dtick'. So even though there is no word 'drom' in English, the [dr] cluster that the slip in (4) creates is permissible, and it tells linguists that people who come up with odd expressions like these still follow the sound patterns of English. Slips of the tongue also reveal that when we formulate speech, we are not only influenced by the sound system of the language we are speaking, we are also conditioned by the way words are put together in that language. Consider the following examples as evidence of the psychological reality of morphology—the way words are organized and structured in a language. (5) sesame seed crackers sesame street crackers (6) rules of word formation words of rule formation (7) a New Yorker a New Yorkan (8) the derivation of the derival of Unlike the first set of examples, these slips do not involve individual sounds; rather, they seem to reflect a higher level of linguistic organization because they are associated with complete words, or with significant parts of words. (5) and (6) are very common examples, and they remind us of the spoonerisms discussed earlier. Notice how the misspoken forms still adhere to normal patterns of word usage. For example 'Sesame Street crackers' might be a brand of cracker named after the children's TV show. Note too the manner in which (6) adheres to a regular word pattern in English. A 'four-door sedan' has four doors, an 'apple pie' is made of apples. A common error by learners of English is to call these objects a 'four-doors sedan' and 'apples pie', following the logical, but non-English pattern of extending the plural to the formation of noun phrases. But native speakers, who follow the rules of word formation, do not simply swap the two words that are reversed in (6) and say 'word of rules formation'. Even during the micromomentary process of formulating their speech, they follow the regular and established pattern. Examples (7) and (8) are further elaborations of this same theme, but in this case, the suffix slots are exchanged while the original words remain the same. The person who misspoke (7) might have been thinking, if an 'American' is someone who lives in America, why isn't a resident of New York a 'New Yorkan?' And by the same logic, if 'arrival' is the noun form of the verb 'arrive', why isn't the noun form of 'to derive', 'derival?' Once again we witness the way slips of the tongue provide psycholinguists insights into the production of speech; they help us see how speakers arrive at derivations. Speech errors are also helpful in revealing a third level of language processing at the formulation stage; they give support to the notion that utterances are not just strings of sounds and linear sequences of words, but are formed into larger structural units. This is demonstrated in examples (9) and (10). (9) he swam in the pool he swimmed in the pool (10) the children are in the park the cbilds are in the park These mistakes are much less common than the swapping of words and parts of words that we find in spoonerisms and similar constructions, but their occurrence, however rare, tells us something about the way grammar affects the formulation process. Those familiar with the speech and writing of non-native users of English will recognize these goofs as learner errors, but the big difference between learner errors and the slips exemplified by (9) and (10) is that native speakers almost always correct themselves when they err; learners of English, on the other hand, experience great difficulty recognizing exactly what was wrong and how to rectify it. Almost immediately after saying (10}, for example, a native speaker might stop and say 'I mean children''. Learners, upon recognizing that they said something wrong in a sentence like (10), or, more commonly, having it pointed out to them, will often miscor-rect the original error and come out with something like 'I mean cbildrens\ The fact that native speakers correct themselves shows that they are also paying attention to grammar, in addition to concentrating at the sound and word levels of the language. It is no accident that these last examples all involve irregular words. It looks very much as if the speaker has chosen the words and the slots which they fill, and at the last moment, forgotten to choose the right verb or noun form. The errors suggest that speakers organize their utterances into smaller groups of words, like noun phrases, or clauses with a main verb, and having filled these groups with the appropriate lexical items which express the intended meaning, the speakers finally add the appropriate grammatical inflections. Almost always, this complicated process is completed fluently and accurately, and only occasionally, as in these examples, does the formulation of speech slip up. But when it does, it provides us with a glimpse of the production process. The planning of higher levels of speech Another way of trying to understand the process of producing language is to analyze the steps we have to take and the decisions we have to make in order to produce an intended utterance. 3