| HOME | ARCHIVE | SEARCH | TABLE OF CONTENTS |
|---|
| ||||||||||||||||||||||||
RESEARCH ARTICLE |
Gerontology Center, University of Kansas, Lawrence.
Address correspondence to Susan Kemper, 3090 Dole, Gerontology Center, 1000 Sunnyside, University of Kansas, Lawrence, KS 66045. E-mail: Skemper{at}KU.edu
| Abstract |
|---|
|
|
|---|
STUDIES of elicited speech have shown that older adults produce shorter utterances using simpler syntactic structures and reduced propositional content than young adults (Kemper, Thompson, & Marquis, 2001
). However, uncontrolled age-related differences in discourse pragmatics may contribute to these findings (James, Burke, Austin, & Hulme, 1998
). To date, to our knowledge, few studies have examined these issues of aging and speech production by using experimental methods that control for, for example, lexical choice, discourse style, or the reliance on nonlinguistic gestures or paralinguistic devices to convey meaning. Experimental methods that constrain these aspects of speech production have recently gained acceptance (Bock, 1996
). Constrained production tasks require the speaker to formulate and utter a sentence by using words or phrases presented on a computer screen or in response to stimulus pictures. Sentence-formulation time can be assessed, as well as various aspects of the utterance such as syntactic form and prosodic structure (Dell & O'Seaghdha, 1992
; F. Ferreira, 1991
, 1994
; F. Ferreira & Swets, 2002
; V. Ferreira, 1996
; V. Ferreira & Dell, 2000
; Lindsley, 1975
; Roelofs, 1998
; Stallings, MacDonald, & O'Seaghdha, 1998
; Wheeldon & Lahiri, 1997
).
Two previous studies using constrained production tasks have suggested that sentence production is well preserved in older adults: Davidson, Zacks, and Ferreira (1996)
reported that younger and older adults produce similar patterns of responses and disfluencies on a task requiring participants to complete a sentence stem with either a dative construction (I told ... a story to the manager) or a double-object construction (I told ... the manager a story). Spieler and Griffin (2001)
used a picture description task. They suggested that older adults adopt a more "deliberate" response style in that older adults take longer to begin and complete sentences but show similar effects of picture codability and name frequency.
The constrained production task used in the present experiments required participants to formulate a sentence by using a set of words presented on a computer screen. The words were displayed until the participants spoke, forcing participants to plan their utterances by using the entire list of target words. Most prior studies of constrained sentence production have focused on response latency as an indication of sentence planning. The present experiments also compare the length, grammatical complexity, and propositional content of the responses. Experiment 1 varied the number of words presented to the participants. The length, grammatical complexity, and propositional content of the participants' responses were expected to vary with the number of words presented. Older adults were expected to produce shorter, less complex, and less informative sentences than young adults, thereby replicating findings from the analysis of elicited speech samples.
| Experiment 1: Methods |
|---|
|
|
|---|
The participants were given a battery of cognitive tests including the Shipley (1940)
vocabulary test; the Digits Forward, Digits Backwards, and Digit Symbol tests (Wechsler, 1958
); the Daneman and Carpenter (1980)
Reading Span test; and a Stroop test requiring participants to name the color of blocks of Xs printed in colored inks or to name the color of color words printed in contrasting colored inks, such as RED printed in blue ink. Table 1 summarizes the performances of the participants on these tests. An alpha level of.05 was set for these and all subsequent t and F tests.
|
Procedure
E-Prime software (Schneider, Eschman, & Zuccolotto, 2002
) was used to present the stimuli and collect responses. Participants were seated in front of a computer workstation equipped with a voice-activated response box connected to a microphone. The participants were instructed to "produce a sentence, as quickly as possible, using the words presented on the computer screen." They were instructed to "use all of the words presented" and encouraged to "add other words to make a complete, grammatical sentence." After a block of 10 practice trials, two blocks of 54 experimental trials were administered. Each trial consisted of a fixation point presented for 2 s followed by the presentation of two, three, or four words in a vertical column. The words remained on the screen until the participant spoke. As soon as the response box detected a vocal response, the words were removed from the computer screen. Response latencies were recorded from the onset of the response. The participant's response was audiorecorded and later transcribed. The participant initiated each trial by pressing a computer key.
The words were randomly selected and randomly ordered for presentation such that each set included at least one human character. Each participant was tested on 36 different two-word combinations, 36 different three-word combinations, and 36 different four-word combinations.
Coding
Each response was initially classified as a response error or a valid response. Response errors were subcategorized as (a) nonfluent responses, trials on which the response box was triggered by a cough or a nonlexical response such as "hum"; (b) false starts, trials on which participants started fluently but then repeated one or more words; (c) responses that were sentence fragments, missing one or more obligatory constituents; and (d) memory errors, responses that failed to use all of the stimulus words presented. Sentence fragments and memory errors were assumed to reflect a breakdown in sentence planning, whereas nonfluent responses and false starts were assumed to reflect articulation problems. Multiple errors could occur on a single trial.
With the use of the procedures described by Turner and Greene (1977)
, each valid response was decomposed into its constituent propositions, which represent basic ideas and the relations between them. The number of propositions expressed in a sentence is a measure of how informative it is. Developmental Level, or DLevel, is an index of grammatical complexity, based on a scale originally developed by Rosenberg and Abbeduto (1987)
. Each valid response was assigned to one of eight levels of complexity: Level 0, simple, one-clause sentences; 1, complex sentences with embedded infinitival complements; 2, complex sentences with wh-predicate complements, conjoined clauses, and compound subjects; 3, complex sentences with relative clauses modifying the object noun phrase or with predicate noun phrase complements; 4, complex sentences with gerundive complements or comparative constructions; 5, complex sentences with relative clauses modifying the subject noun phrase, subject noun phrase complements, and subject nominalizations; 6, complex sentences with subordinate clauses; and 7, complex sentences with multiple forms of embedding and subordination. As a way to provide a more detailed comparison of the responses, Developmental Sentence scoring (DSS; Lee, 1974
) was applied to each valid response. Eight different categories of grammatical forms are scored for each sentence: indefinite pronouns, personal pronouns, main verbs, secondary (embedded) verbs, conjunctions, negatives, and two types of questions. Within each category, variants are assigned different points to reflect the developmental order of appearance in children's speech. A total score is derived for each sentence by summing the points for each category plus 1 point if the sentence is fully grammatical. Sentence length in words was also determined. Intercoder reliability was assessed for each level of coding. Reliability averaged better than 90% for all levels of coding. Coded examples of the participants' responses are presented in Table 2.
|
| Experiment 1: Results |
|---|
|
|
|---|
Errors
Table 3 summarizes the error results. Young adults made one or more errors on 10% of the trials, whereas older adults made one or more errors on 16% of the trials. False starts occurred on less than 1% of all trials and were not analyzed.
|
2 =.11. The main effect of set size was significant, F(2,57) = 50.52, p <.01, and
2 =.47, as was the Age Group x Set Size interaction, F(2,57) = 11.43, p <.01, and
2 =.16. Older adults produced more nonfluent responses than young adults, and nonfluent responses by older adults increased with set size.
Sentence fragments
The percentage of trials on which the participant produced a sentence fragment missing one or more obligatory constituents was analyzed with a 2 Age Group x 3 Set Size (two, three, or four words presented) ANOVA. The main effect of age group was significant, F(1,58) = 6.92, p =.01, and
2 =.11. The main effect of set size was significant, F(2,57) = 30.59, p <.00, and
2 =.35, as was the Age Group x Set Size interaction, F(2,57) = 10.07, p <.01, and
2 =.15. Older adults produced more sentence fragments than young adults. As set size increased, older adults produced more fragments.
Memory errors
The percentage of trials on which the participant failed to use all words presented was analyzed with a 2 Age Group x 3 Set Size (two, three, or four words presented) ANOVA. The main effect of age group was significant, F(1,58) = 13.63, p <.01, and
2 =.19. The main effect of set size was significant, F(2,57) = 41.15, p <.01, and
2 =.42, as was the Age Group x Set Size interaction, F(2,57) = 11.88, p <.01, and
2 =.17. Older adults committed memory errors more often than did young adults, particularly when set size = 4.
Valid Responses
All valid responses were subjected to analysis of their length, propositional content, and grammatical complexity as well as response latency. The responses were classified by set size (two, three, or four words presented). Initially, word order was included as a design factor, contrasting trials on which a human character was the first word presented with locative-first and object-first trials, reflecting the strong bias in English for animate-first sentences (Bock, Loebell, & Morey, 1992
; McDonald, Bock, & Kelly, 1993
). The word order factor was not significant in any of the analyses reported in what follows, nor did it interact with set size or age group.
Sentence length
The overall main effect of age group was not significant for the sentence length measure, that is, F(1,58) < 1.0; sentence length differed as a function of set size, that is, F(2,57) = 187.79, p <.01, and
2 =.77, and the Set Size x Age Group interaction was significant, F(2,57) = 4.30, p =.02, and
2=.07. Figure 1A summarizes these findings. The length of young adults' responses increased monotonically with set size. Older adults' responses were similar in length to those of young adults for set size = 2 or 3; young adults produced longer responses than the older adults when set size = 4.
|
2 =.30, and the Set Size x Age Group interaction was significant, F(2,57) = 4.26, p =.03, and
2 =.07. Figure 1B summarizes these findings. Young adults tended to produce more propositions as set size increased; older adults tended to limit their responses to two or three propositions. Older adults produced as many propositions as young adults with set size = 2 or 3; young adults produced more propositions with set size = 4.
DLevel
The overall main effect of age group was not significant for the DLevel measure, that is, F(1,58) < 1.0. DLevel did differ as a function of set size, F(2,57) = 32.89, p <.01, and
2 =.36, and the Set Size x Age Group interaction was significant, F(2,57) = 4.85, p <.01, and
2 =.08. Figure 1C summarizes these findings. DLevel scores for young and older adults were similar when set size = 2 or 3; older adults produced sentences with lower DLevel scores when set size = 4.
DSS
The overall main effect of age group was not significant for the DSS measure, that is, F(1,58) < 1.0. DSS did vary as a function of set size, F(2,57) = 53.04, p <.01, and
2 =.72, and the Set Size x Age Group interaction was significant, F(2,57) = 6.30, p <.01, and
2 =.10. Figure 1D summarizes these findings. DSS scores for young adults increased monotonically with set size. DSS scores for young and older adults were similar when set size = 2 or 3; older adults produced sentences with lower DSS scores when set size = 4.
Response latency
The latency to produce a fluent sentence using all words did vary with age group, F(1,58) = 12.33, p <.01, and
2 =.17. Response latency increased with set size, F(2,57) = 146.556, p <.01, and
2 =.72, and the Set Size x Age Group interaction was significant, that is, F(2,57) = 7.32, p <.01, and
2 =.11. Figure 1E summarizes these findings. Older adults responded more slowly than young adults, the latency to respond increased with set size, and this increase was greater for older adults than for young adults.
| Experiment 1: Discussion |
|---|
|
|
|---|
The length, grammatical complexity, and propositional density of the valid responses by older adults provide evidence of an effect of memory load on sentence production. Even on trials on which the older adults were able to retain and use all of the words presented in a fluent response, their responses when set size = 4 were shorter, less complex, and less informative than those produced by young adults. However, when set size = 2 or 3, older adults were able to produce sentences that matched those of young adults in length, grammatical complexity, and content.
Response latencies for both young and older adults increased with set size. This pattern suggests that the speakers attempted to preplan their utterances before speaking; hence, as set size increased, they preplanned more segments or longer segments incorporating additional words.
| Experiment 2 |
|---|
|
|
|---|
| Experiment 2: Methods |
|---|
|
|
|---|
Materials
The word stimuli consisted of five sets of words, matched for word frequency by use of the Francis and Kucera (1982)
norms. The sets included 12 human characters, such as pianist, nurse, and witch; 12 locations, such as canyon, meadow, and cliff; and 18 verbs. The verbs included 6 intransitive verbs, such as laughed, smiled, and looked, which typically do not occur with direct objects; 6 transitive verbs, such as copied, examined, and replaced, which typically require direct objects; and 6 complement-taking verbs, such as guessed, suggested, and realized, which preferentially require sentential complements (Ferreira & Dell, 2000
). All were regular past tense verbs ending in -ed.
Procedure
E-Prime was used to present the stimuli and collect responses by using a procedure similar to that followed in Experiment 1. After a block of 10 practice trials, two blocks of 54 experimental trials were administered. Each trial consisted of a presentation of one agent plus a verb and optionally a location. The words were randomly selected from each stimulus set and randomly ordered for presentation. Each participant was tested on 18 two-word combinations with each type of verb and 18 agentlocativeverb combinations with each type of verb.
Coding
The participants' responses were transcribed and coded by using the same procedures followed in Experiment 1 to classify response errors and valid responses. Response errors included nonfluent responses, false starts, memory errors, and responses that were sentence fragments. In addition, a new error response category was used: substitutions. On some trials, participants substituted a different form of the verb, for example, substituting progressives such as laughing, or nominals such as laughter, for the correct form, laughed. Valid responses were coded for the number of words in the sentence, the number of propositions, DLevel, and DSS. Intercoder reliability was assessed for each level of coding. Reliability averaged better than 90% for all levels of coding. Coded examples of the participants' responses are presented in Table 2.
| Experiment 2: Results |
|---|
|
|
|---|
Errors
Table 4 summarizes the error findings. Young adults made one or more errors on 5% of the trials, whereas older adults made one or more errors on 12% of the trials. False starts occurred on less than 1% of all trials and were not analyzed. Substitutions of adjectives and other parts of speech for the verbs occurred on less than 1% of responses from young adults and 4% of responses from older adults; they were not analyzed.
|
The main effect of age group was significant, F(1,58) = 11.38, p <.01, and
2 =.16. The main effect of set size was significant, at F(2,57) = 134.02, p <.01, and
2 =.69, as was that for verb type, F(2,57) = 13.23, p <.01, and
2 =.19. In addition, the Set Size x Age Group interaction, F(2,57) = 20.63, p <.01, and
2 =.26, and the three-way interaction of Set Size x Verb Type x Age Group, F(2,57) = 5.12 p <.01, and
2 =.08, were significant. Older adults produced more nonfluent responses overall than young adults. For older adults, nonfluent responses increased with verb type, but only when an agent and location were presented along with the verb. Older adults produced nonfluent responses on 17% of the trials when two words were presented along with a complement-taking verb.
Sentence fragments
The percentage of trials on which the participant produced a sentence fragment missing one or more obligatory constituents was analyzed with a 2 Age Group x 2 Set Size (two or three words presented) x 3 Verb Type (intransitive, transitive, or complement-taking) ANOVA.
The main effect of age group was significant, at F(1,58) = 4.14, p =.05, and
2 =.07. All main effects and interactions were significant, including the three-way interaction of Set Size x Verb Type x Age Group at F(2,57) = 3.13, p =.05, and
2 =.05. Older adults produced more fragments than young adults, especially when set size = 3. Fragments varied with verb type, particularly for older adults who produced fragments on 12% of the trials when an agent and a location were presented along with a complement-taking verb.
Memory errors
The percentage of trials on which the participant failed to use all words presented was analyzed with a 2 Age Group x 2 Set Size (two or three words presented) x 3 Verb Type (intransitive, transitive, or complement-taking) ANOVA.
The main effect of age group was significant at F(1,58) = 23.05, p <.01, and
2 =.28. All main effects and interactions were significant, including the three-way interaction of Set Size x Verb Type x Age Group, F(2,57) = 4.04, p =.02, and
2 =.06. Young adults failed to use all of the words presented on 3% of the trials. Older adults failed to use all of the words presented on 15% of the trials, and such memory errors increased with set size, particularly when complement-taking verbs were presented.
Valid Responses
All valid responses were subjected to an analysis of their length, propositional content, and grammatical complexity as well as response latency. The responses were classified by set size (two or three words presented) and verb type (intransitive, transitive, or complement-taking). Initially, word order was included as a design factor, contrasting trials on which a human character was the first word presented with locative-first and object-first trials. The word order factor was not significant in any of the analyses reported in what follows, nor did it interact with set size, verb type, or age group.
Sentence length
The overall main effect of age group was not significant for the sentence length measure, F(1,58) < 1.53, p =.22, and
2 =.01; sentence length did differ as a function of set size, F(2,57) = 204.32, p <.01, and
2 =.78, and verb type, F(2,57) = 26.38, p <.01, and
2 =.31. Figure 2A summarizes these findings. The participants produced longer sentences when they were given two words in addition to a verb to incorporate into a sentence than when they were given only an agent. They produced longer sentences when they were given complement-taking verbs than when they were given intransitive or transitive verbs.
|
2 =.45, and the Set Size x Age Group interaction was significant, F(2,57) = 25.26, p <.01, and
2 =.30. Propositional content varied with verb type, F(2,57) = 31.52, p <.01, and
2 =.35, and the Verb Type x Age Group interaction was significant, F(2,57) = 21.63, p <.01, and
2 =.27. In addition, the three-way interaction of Verb Type x Set Size x Age Group was significant at F(2,57) = 4.08, p =.02, and
2 =.07. Figure 2B summarizes these findings. Young adults produced more propositions for complement-taking verbs than for intransitive and transitive verbs, particularly when they were given two words in addition to the verb. Older adults limited their sentences to approximately three propositions, regardless of verb type or set size.
DLevel
Although the overall main effect of age group was not significant for the DLevel measure, F(1,58) < 1.0, DLevel did differ as a function of set size, F(2,57) = 26.26, p <.01, and
2 =.31, and verb type, F(2,57) = 167.79, p <.01, and
2 =.74. Figure 2C summarizes these findings. DLevel scores for all participants increased with set size and with verb type. When intransitive or transitive verbs were presented, DLevel scores averaged around 1, indicating the participants produced mostly simple, one-clause sentences. DLevel scores for complement-taking verbs were higher, averaging 3 to 4 points, indicating that the participants were in fact producing sentences with predicate complements.
DSS
Again, the overall main effect of age group was not significant for the DSS measure, F(1,58) = 1.90, p =.17, and
2 =.01. DSS did vary as a function of set size, F(2,57) = 110.36, p <.01, and
2 =.66, but the Set Size x Age Group interaction was not significant at F(2,57) = 1.50, p =.22, and
2 =.01. DSS varied with verb type, F(2,57) = 13.38, p <.01, and
2 =.19; the Verb Type x Set Size interaction was significant, F(2,57) = 7.98, p =.01, and
2 =.12; and the Verb Type x Age Group interaction was significant, F(2,57) = 8.76, p <.01, and
2 =.13. The three-way interaction was significant at F(2,57) = 13.37, p <.01, and
2 =.19. Figure 2D summarizes these findings. DSS scores for young adults increased with set size, particularly when complement-taking verbs were presented. Older adults' responses tended to be limited to 7 to 10 DSS total points, even when complement-taking verbs were presented.
Response latency
The latency to produce a fluent sentence using all words did vary with age group, F(1,58) = 5.51, p =.02, and
2 =.09. Response latency increased with set size, F(2,57) = 95.76, p <.01, and
2 =.62, and the Set Size x Age Group interaction was significant, F(2,57) = 5.82, p =.02, and
2 =.09. Response latency increased with verb type, F(2,57) = 18.54, p <.01, and
2 =.24, and the Verb Type x Age Group interaction was significant, F(2,57) = 5.06, p <.01,
2 =.08. The three-way interaction was not significant at F(2,57) = 1.53, p =.22, and
2 =.01. Figure 2E summarizes these findings. Older adults responded more slowly than young adults; the latency to respond increased with the number of words to be incorporated into the sentence, and this increase was greater for older adults than for young adults. Response latency was longer when complement-taking verbs were presented than when intransitive and transitive verbs were presented, particularly for older adults when set size = 3.
| Experiment 2: Discussion |
|---|
|
|
|---|
Conclusions
Previous studies using constrained production tasks have found basic syntactic processing to be well preserved in older adults (Davidson et al., 1996
; Spieler & Griffin, 2001
). The current findings, in part, are consistent with these studies. Error rates and measures of syntactic complexity, sentence length, and propositional content were similar for older and young adults when they were given two or three words to use in a sentence in Experiment 1 or when verbs were limited to intransitive or transitive forms in Experiment 2.
These experiments using a constrained production task demonstrate two manipulations that affect older adults' sentence production: (a) directly manipulating memory load by increasing the number of words to be used in a sentence and (b) manipulating the linguistic characteristics of the words presented while holding memory load constant. In Experiment 1, when they were given four words to use in a sentence, older adults produced sentences that were shorter, simpler, and less informative than those produced by young adults. In Experiment 2, older adults' responses using a complement-taking verb were shorter, simpler, and less informative than those produced by young adults.
These results support earlier findings, derived from the analysis of elicited speech samples, that aging affects sentence production in response to explicit manipulation of memory load or the manipulation of key linguistic factors such as verb type. Verb type, particularly complement-taking verbs, appears to impose an implicit memory load on older adults, forcing them to use shorter, simpler sentences. Other linguistic manipulations, such as the dativedouble-object alternation examined by Davidson and colleagues (1996)
, for example, I told a story to the manager/the manager a story, may not increase memory load and hence not differentially affect young and older adults' ability to plan and produce fluent, grammatical sentences. Although constrained production tasks impose artificial requirements on sentence production, they provide a degree of experimental control over pragmatic factors that is lacking in the analysis of spontaneous or elicited speech samples. By examining how aging affects the time course of sentence production under controlled conditions, it may be possible to distinguish other cognitive and linguistic factors that affect older adults' ability to generate long, fluent, complex, and informative sentences.
| Acknowledgments |
|---|
Received for publication May 28, 2002. Accepted for publication April 29, 2003.
| References |
|---|
|
|
|---|
| ||||||||||||||||||||||||
| HOME | ARCHIVE | SEARCH | TABLE OF CONTENTS |
|---|