| HOME | ARCHIVE | SEARCH | TABLE OF CONTENTS |
|---|
| ||||||||||||||||||||||||
RESEARCH ARTICLE |
1 Department of Psychology, University of Pennsylvania, Philadelphia.
2 Volen Center for Complex Systems, Brandeis University, Waltham, Massachusetts.
Address correspondence to M. J. Kahana, Department of Psychology, University of Pennsylvania, 3401 Walnut Street, Philadelphia, PA 19104 or to A. Wingfield, Volen National Center for Complex Systems, MS 013, Brandeis University, Waltham, MA 02454-9110. E-mail: kahana{at}psych.upenn.edu or wingfield{at}brandeis.edu
| Abstract |
|---|
|
|
|---|
TO UNDERSTAND the mechanisms underlying episodic memory, we must understand both correct responses and memory errors. In word-list recall tasks, errors reflect both omissions from a to-be-recalled memory set plus intrusions in the form of erroneous recall of items that had not appeared in the target list. To experimentally induce intrusions, Deese (1959)
presented participants with word lists comprising items that were semantically related to a nonpresented critical item. Following study of these lists, participants typically recall the nonpresented critical item with a probability as high (or higher) than that of the target list items. Significant new attention has been directed toward understanding the factors underlying semantic intrusions in this "false-memory" paradigm (Roediger & McDermott, 1995
). Among the findings are that older adults are more likely than their younger counterparts to show false memories for words semantically associated with target items (Norman & Schacter, 1997
; Tun, Wingfield, Rosen, & Blanchard, 1998
).
Our focus here is on intrusions committed during standard free-recall tasks, in which lists are not specifically designed to evoke false memories. In such lists, participants do occasionally make intrusions, though they are very few in number. Older adults, who recall fewer correct items on average than their younger counterparts, have also been reported to make a greater number of intrusions than younger adults (Hartley & Walsh, 1980
; Kahana, Howard, Zaromb, & Wingfield, 2002
; Stine & Wingfield, 1987
). Although many studies have been done to help researchers understand older adults' recall deficit (e.g., Kahana et al., 2002
; Kausler, 1994
; Naveh-Benjamin, 2000
; Wingfield, Lindfield, & Kahana, 1998
), relatively little is known about the age-related increase in intrusions observed in standard recall tasks. This is because there are often too few intrusions available for analysis (Kahana & Wingfield, 2000
; Keppel, 1968
; Wingfield et al., 1998
).
Process models of episodic recall (e.g., the search of associate memory, or SAM, model of Raaijmakers & Shiffrin, 1981
; see also Raaijmakers, 2003
) posit that overt responses reflect an initial process of sampling candidates for recalla process that is largely associativefollowed by a subsequent editing process that limits responses to those items that were in the target list. SAM thus embraces the classic generaterecognize view that participants do in fact think of many items that were not on the target list, but that they reject those responses prior to producing them for overt recall (Anderson & Bower, 1972
; Keppel 1968
; Raaijmakers & Shiffrin, 1980
; Wixted & Rohrer, 1994). Thus, older adults' increased tendency to produce intrusions may reflect either a tendency to generate more intrusions or an inability to recognize that a generated intrusion did not come from the target list. Theoretically, such recognition failures may reflect a deficit in source monitoring insofar as recognition judgments rely, in part, on an ability to remember the context in which an item occurred (Henkel, Jonhson, & De Leonardis, 1998
; Johnson, Hashtroudi, & Lindsay, 1993
; Kelley & Sahakyan, 2003
). On the basis of studies of list discrimination, it would seem that older adults do indeed have difficulty remembering the source of a presented item (e.g., McCormack, 1984
), but it is not known whether this difficulty also applies to items that are being overtly produced. One may also view older adults' tendency to commit intrusions as a consequence of a deficit in inhibitory processes (Hasher & Zacks, 1988
; Hasher, Stoltzfus, Zacks, & Rypma, 1991
). In the case of recall from episodic memory, inhibition may be viewed as a set of mechanisms or control processes that help to focus retrieval on target information by inhibiting the activation of extraneous thoughts or the retrieval of associated information that was experienced in a different context. Effective use of these control processes may depend on the retrieval of information indicating the source of a generated item.
Despite considerable theoretical and empirical research on free recall, and a general acknowledgment that some type of internal editing process must be operating, almost nothing is known about how often and under what circumstances participants censor their recalls. We devised an externalized free recall (EFR) task to potentially reveal this hypothesized editing process. Our method builds on that of Bousfield and Rosner (1970
; see also Roediger & Payne, 1985
), who asked participants to say aloud any words that came to mind during the recall phase of a free-recall task. In our variant of this technique, participants were further instructed to press a key immediately following any response that they knew to be incorrect. Thus, the EFR task attempts to open to experimental scrutiny the dual processes of generating and editing potential recall responses.
The existing literature, in demonstrating striking age-related deficits in episodic memory, suggests that older adults are impaired in their ability to remember an item's temporal context or to inhibit items that have inappropriate contextual information. Such a deficit in inhibition, source monitoring, or the formationretrieval of episodic associations would predict that older adults, after generating an intrusion, will have difficulty rejecting it as a nontarget item. This would also explain the age-related increase in intrusions committed during episodic recall tasks. Our primary interest, therefore, is in how often participants were able to correctly reject intrusions in the EFR procedure.
The EFR procedure also provides information on the total number of extralist words generated by young and older participants, whether from prior lists (prior-list intrusions, henceforth PLIs) or words that had not appeared in the experiment (extralist intrusions, henceforth XLIs). To the extent that older adults generally produce fewer items in verbal fluency tests than young adults (e.g., Kempler, Teng, Dick, Taussig, & Davis, 1998
; Tombaugh, Kozak, & Rees, 1999
), older adults might produce fewer intrusions than young adults. One might equally imagine scenarios in which older adults might produce more words under EFR instructions than young adults.
We report on two experiments. The first experiment is intended to demonstrate the traditional finding that, in free recall, older adults recall fewer items and make more intrusions (both PLIs and XLIs) than their younger education-matched counterparts. The second experiment uses EFR to induce greater numbers of intrusions in both age groups, but especially among younger adults who rarely make intrusions in ordinary free recall. By examining both the number of intrusions committed and the probability of accepting intrusions, we wished to determine whether older adults might be less able than their younger counterparts to correctly reject these intrusions.
| Experiment 1 |
|---|
|
|
|---|
| METHODS |
|---|
|
|
|---|
Because vocabulary grows with adult aging, it is not uncommon in research studies for the older participants to have higher vocabulary scores than the young participants (Nicholas, Barth, Obler, Au, & Albert, 1997
). In our case, the two age groups did not differ significantly in vocabulary. We conducted a regression analysis and confirmed that, for these participants, there was no relation between vocabulary score and the intrusion rates to be described. All participants reported themselves to be in good health and to have no difficulty reading the words as they would be presented on the computer screen.
Stimuli and Procedures
Participants studied a total of 10 lists of common, two-syllable nouns, chosen at random without replacement from a subset of the Toronto Word Pool (Friendly, Franklin, Hoffman, & Rubin, 1982
), with each list followed by a delayed free-recall test. Half of the lists were composed of 10 nouns (short lists) and half were composed of 20 nouns (long lists), with the order of short and long lists counterbalanced between participants.
At the start of each trial, the computer displayed each list item in capital letters for 1.4 s, followed by a 100-ms blank interstimulus interval (ISI). During list presentation, participants were required to say each word aloud. Immediately following the presentation of list items, participants performed a 16-s arithmetic distracter task consisting of addition in the form "A + B + C = ... ," where A, B, and C were positive, single-digit integers. Participants were instructed to work quickly while maintaining accuracy. Errors were rare. A row of asterisks, accompanied by a tone (lasting 300 ms), then signaled that participants should begin recalling the list items aloud, in any order. Participants were allowed 90 s to recall list items; on the basis of previous research with young and older adults (e.g., Kahana et al., 2002
), we have found that 90 s provides ample time for recall, with virtually all correct responses being produced during the first 60 s of this recall period. During the 90-s recall period, participants were to recall as many items as possible from the list, in any order. We considered the first four lists to be practice and we excluded them from the subsequent analyses. This served both to eliminate warm-up effects, which can differentially affect young and older participants, and to mirror the analyses to be used in Experiment 2.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
|
In addition to looking at correct responses, we were also interested in characterizing the behavior of intrusions, both PLIs and XLIs. We submitted the number of PLIs produced by the young and older participants for the long and short lists shown in Table 1 to a 2 (list length) x 2 (age) mixed design ANOVA with list length as a within-participants variable and age as a between-participants variable. The ANOVA confirmed significant main effects of list length, F(1, 22) = 4.84, MSE = 0.06, p <.05, and of age, F(1, 22) = 5.16, MSE = 0.42, p <.05. The interaction between age and list length did not quite reach our significance threshold, F(1, 22) = 3.64, MSE = 0.06, p =.07.
Older adult's increased tendency to commit PLIs is consistent with several prior reports (e.g., Hartley & Walsh, 1980
; Kahana et al., 2002
). The finding that PLIs are more frequent in longer lists is not surprising, given the greater number of total recalls in long lists and the tendency for intrusions to be associatively evoked by list items (Roediger & McDermott, 1995
). We observed a similar qualitative pattern for XLIs insofar as there was a marginally significant effect of age on number of intrusions F(1, 22) = 3.56, MSE = 0.16, p =.07. For XLIs, the effect of list length, and the Age x List length interaction did not approach significance. Aggregating XLIs and PLIs together, we found significant effects of both age, F(1, 22) = 6.06, MSE = 0.83, p <.05, and of list length, F(1, 22) = 12.04, MSE = 0.01, p <.01, on the total number of intrusions committed. The interaction between age and list length was not statistically significant, F(1, 22) = 1.93, MSE = 0.01, ns.
Repetitions of already recalled items represent another category of memory errors. Although repetition errors are not the focus of this article, we report them here for completeness. Young and older adults rarely repeated items; the mean number of repetitions per list was 0.10 (SD = 0.13) and 0.21 (SD = 0.19) for young and older adults, respectively; this difference did not reach significance, t(22) = 1.7, ns. Younger adults never repeated intrusions (M = 0), and older adults did so very rarely (M = 0.03, SD = 0.07).
| Experiment 2 |
|---|
|
|
|---|
| METHODS |
|---|
|
|
|---|
Procedures
The method for Experiment 2 was the same as for Experiment 1 with one key addition: On trials 410 participants were instructed, in addition to recalling words from the just-presented list, to recall aloud all of the words that came to mind as they were attempting to give their list recall. Participants were told, however, that if a word they had just given had not been on the just-presented list, then they were to press the spacebar on the computer keyboard immediately after recalling the word but prior to recalling the next word.
Participants were given 90 s to recall the list items. During this time they were to recall as many items as possible from the list, in any order they wished. As in Experiment 1, participants were tested on a total of 10 lists (half of which were short, i.e., 10 items, and other half long, i.e., 20 items). We did not include performance on the first three lists, on which participants performed standard delayed free recall, and on the first list of EFR in the analyses reported. Thus, as in Experiment 1, we conducted the analyses of correct recall and intrusions on the data from lists 510.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
|
A two-way mixed design ANOVA that we conducted on the number of correct items that were not rejected revealed significant main effects of list length, F(1, 78) = 116.09, MSE = 1.04, p <.001, and of age, F(1, 78) = 38.97, MSE = 3.84, p <.001, as well as a significant List length x Age interaction F(1, 78) = 14.23, MSE = 1.04, p <.001. This replicates with EFR instructions the basic effects of list length and age on correct recall demonstrated in Experiment 1 using standard free-recall procedures.
Our specific question pertains to the editing process in recall: The probability that, when a participant produces an intrusion, it will be erroneously accepted as a correct response. Assuming that participants' behavior in EFR at least partially mirrors the generation and editing of responses hypothesized to underlie successful episodic recall, we would expect to find that older adults are impaired at editing their responses. Such impairment predicts that older adults would be more likely than their younger counterparts to accept as correct their intrusion errors.
We separately analyzed young and older adults probabilities of accepting PLIs and XLIs (see Table 2). For the probability of accepting PLIs, there was a significant main effect of age, F(1, 70) = 11.59, MSE = 0.12, p <.01, but the effects of list length, F(1, 70) = 1.02, MSE = 0.05, ns, and the List length x Age interaction, F(1, 70) = 1.29, MSE = 0.05, ns, did not approach significance. Thus, older adults were more likely than younger adults to accept a PLI as correct, regardless of list length.
For the probability of accepting XLIs, there was a significant main effect of age, with older adults exhibiting a greater tendency to accept XLIs as correct, F(1, 64) = 7.32, MSE = 0.12, p <.01. Here, however, the main effect of list length was also significant, F(1, 64) = 4.15, MSE = 0.04, p <.05, indicating that participants were more likely to accept intrusions as correct in long lists than they were in shorter lists. There was no significant Age x List length interaction, F(1, 64) = 2.63, MSE = 0.04, p =.11.
Treating PLIs and XLIs together, we obtained significant main effects of both age, F(1, 76) = 18.86, MSE = 0.08, p <.001, and list length, F(1, 76) = 8.44, MSE = 0.02, p <.01, but no significant Age x List length interaction, F(1, 76) < 1, ns. Thus, older adults are more likely than younger adults to accept an intrusion as correct, and both groups were more likely to accept intrusions in recalling longer lists.
Although our interest was primarily in the probability that young and older adults would accept intrusions as correct, in Table 2 we also give the mean raw numbers of accepted and rejected intrusions per list. For the total number of accepted intrusions (PLIs and XLIs combined), there was a significant main effect of age, F(1, 78) = 8.10, MSE = 1.60, p <.01, and there was no main effect of list length, F(1, 78) = 0.08, MSE = 0.30, ns, but there was a significant Age x List length interaction, F(1, 78) = 6.79, MSE = 0.30, p <.05. For the total number of rejected intrusions (PLIs and XLIs together), there was a significant main effect of age, F(1, 78) = 3.87, MSE = 19.09, p =.05, and of list length, F(1, 78) = 9.37, MSE = 1.00, p <.05, but the Age x List length interaction did not approach significance, F(1, 78) = 1.01, MSE = 1.00, ns.
In addition to considering correct responses, PLIs and XLIs, we also report the number of times participants repeated items during recall. As in Experiment 1, young and older adults did not differ significantly in the number of times they repeated correct list items: younger, M = 0.56, SD = 1.23; older, M = 0.44, SD = 0.45; t(78) = 0.6, ns. However, older adults did show a trend toward repeating intrusions more frequently than the young: younger, M = 0.11, SD = 0.23; older, M = 0.25, SD = 0.42; t(78) = 2.0, p =.05.
| GENERAL DISCUSSION |
|---|
|
|
|---|
There are several possible causes of older adults' increased tendency to commit intrusions during recall in standard recall tests such as demonstrated in Experiment 1. Theories of memory (e.g., Raaijmakers & Shiffrin, 1980
, 1981
) traditionally assume that, during recall, participants internally generate items as they search memory but censor their responses when they believe that the internally generated item had not occurred on the list, perhaps because of a mismatch in source information (e.g., Johnson et al., 1993
). Within this framework, older adults' increased tendency to commit intrusions could result from either a greater tendency to internally generate nonlist items during recall or a diminished ability to censor the internally generated nonlist items, perhaps because of a failure of retrieving the source of the retrieved items (e.g., Henkel et al., 1998
). Finally, a third possibility is that older adults' increased tendency to commit intrusions arises simply because they are more willing to "think out loud," producing responses that they know were not on the list because, in standard recall, there is no penalty for doing so. According to this last possibility, older adults' increased tendency to overtly commit intrusions does not necessarily reflect an increased tendency to internally generate intrusions or a lack of knowledge concerning the source of these internally generated intrusions. Clearly these factors are not mutually exclusive: Older adults' tendency to commit intrusions in the standard recall task could result from some combination of a conscious think-out-loud strategy, a reduced ability to distinguish between intrusions versus items on the target list, or an inability to inhibit these intrusions from response.
In standard free recall, all of the aforementioned factors are obscured from experimental scrutinyone can only measure the frequency of participants' intrusions. We therefore designed a new method, called externalized free recall, to unveil the process of editing. EFR provides participants with the opportunity to speak aloud any items that they may think of and to overtly reject any responses they deem to be errors.
Experiment 2 demonstrated that older adults are impaired in their ability to edit intrusions, both those that came from prior lists and those that had not appeared on any list. That is, older adults are more likely to accept intrusions as list items than their younger counterparts. This finding is consistent with an age-related deficit in the ability to inhibit from response nonlist items that come to mind during recall, perhaps because of a failure to remember the source of those items.
The EFR method can only approximate the covert editing process hypothesized to take place during recall. Whereas some participants may not be comfortable responding overtly, others may excessively free associate during recall in order to be compliant with the experimenter. Another potential concern is that instructing participants to press a key to reject responses may interfere with the normal processes operating during recall. Further research using the EFR procedure is needed to determine how well it approximates the normal recall process.
Albeit with these caveats and cautions, data generated using the EFR method may be particularly suitable for testing computationally explicit models of episodic recall. Many such models (e.g., Becker & Lim, 2002; Kahana, 1996
; Raaijmakers, 2003
; Raaijmakers & Shiffrin, 1981
) posit that, during recall, participants generate candidate responses for recall and then edit these responses to reduce or eliminate intrusion errors. For example, one could extend the classic SAM model to use the item-to-context strength in the model to "recognize" sampled items (e.g., Sirotin, Kimball & Kahana, 2004
). On the basis of this postsampling recognition process, items may be rejected or accepted. Data from EFR could then be used to test such enhanced versions of SAM.
Using the EFR method, our data suggest that older adults are less effective in editing their intrusions than their younger counterparts, with older adults accepting 36% of their overt intrusions and younger adults accepting only 21%. Although an inability to edit may underlie older adults' tendency to make intrusions, this is only one of multiple mechanisms that are likely to account for the large age-related deficits seen in episodic retrieval.
| Acknowledgments |
|---|
| Footnotes |
|---|
Received for publication March 30, 2004. Accepted for publication October 13, 2004.
| References |
|---|
|
|
|---|
| ||||||||||||||||||||||||
| HOME | ARCHIVE | SEARCH | TABLE OF CONTENTS |
|---|