Abstract

Objectives.

Stereotype threat can impair older adults’ performance on clinical assessments for cognitive decline. We examined why this occurs. Based upon the regulatory focus account of stereotype threat, we predicted that the effects of stereotype threat should depend upon the assessments’ reward structure. Stereotype threat should be associated with poor performance when the assessment emphasizes gaining correct answers, but not when it emphasizes avoiding mistakes.

Method.

Healthy older adults completed a series of mental status examinations. Half of the participants completed these examinations under stereotype threat about their cognitive abilities. Monetary incentives were also manipulated. For half of the participants correct responding led to gains. For the remaining participants incorrect responding/forgetting led to losses.

Results.

Consistent with the regulatory focus account, stereotype threat was associated with poor performance when the mental status examinations had a gains-based structure, but not when they had a losses-based structure.

Discussion.

Older adults respond to stereotype threat by becoming vigilant to avoid the losses that will make them their worst. Researchers and clinicians can capitalize on this motivational change to combat stereotype threat’s negative effects. By using a loss-avoidance frame, stereotype threat’s negative effects can be attenuated or even eliminated.

Despite the fact that dementia is prevalent among older adults, clinicians often fail to detect it. In fact, between 29 to 75 percent of dementia cases are overlooked by primary care physicians ( Chodosh, et al., 2004 ; Olafsdóttir, Skoog, & Marcusson, 2000 ; Valcour, Masaki, Curb, & Blanchette, 2000 ). Although controversial ( Boustani et al., 2003 ), to combat this problem, clinicians could administer routine mental status examinations to all older adult patients; doing this increases the detection of cognitive impairment from 59 to 83 percent ( Borson, Scanlan, Watanabe, Tu, & Lessig, 2006 ). The practice of administering routine mental status examinations to healthy older adults will also become more common in the United States due to recent policy changes. As of 2011, the Patient Protection and Affordable Care Act added the Annual Wellness Visit as a Medicare benefit. As part of this visit, patients receive both a routine physical exam as well as a screening for cognitive impairment.

The current study examines the impact of stereotype threat on healthy older adults’ performance on such mental status examinations. Stereotype threat occurs when people realize that a stigmatized aspect of their identity may be used to evaluate their performance. As a result of this, people often underperform compared to their potential ( Steele, 1997 ). For example, when older adults take memory tests they may fear that poor performance on their part will confirm the stereotype that older adults have memory problems. This in turn often leads older adults to underperform on memory tests compared to their potential (for a review, see Barber & Mather, 2014 ).

Stereotype threat can also affect mental status examination performance. In one study, healthy older adults who self-categorized as “older” and who expected cognitive abilities to decline with age underperformed on a mental status examination. This in turn had clinical implications; 70% of these older adults met diagnostic screening criteria for dementia compared to about 14% of the older adults in the lower-threat conditions ( Haslam et al., 2012 ). This is similar to research examining “diagnosis threat.” In these studies, people with a history of mild head injuries were told that their medical history increased the likelihood that they would perform poorly on a mental status examination. In response to this, these patients underperformed compared with a control group who also had a history of mild head injuries but were not told that this would impact their performance ( Suhr & Gunstad, 2002 ; 2005 ).

Thus stereotype threat can impact mental status examination performance. The current study examines why this occurs. Although a large variety of mechanisms have been put forth to explain older adults’ stereotype threat effects ( Barber & Mather, 2014 ), the current study focuses solely on testing a motivational-based explanation of stereotype threat based upon regulatory focus theory. According to regulatory focus theory ( Higgins, 1997 ; 1998 ), people differ in their motivational orientations. People with a promotion focus are concerned with achieving their ideals; they work towards their goals with eagerness and are sensitive to the presence or absence of gains in their environments. In contrast, people with a prevention focus are concerned with fulfilling their obligations; they work towards their goals with vigilance and are sensitive to the presence or absence of losses in their environment. Although people differ in their dispositional focus, situational context can determine whether a person is in a short-term situational promotion or prevention state. It has been argued that stereotype threat is one such context ( Seibt & Förster, 2004 ). When people encounter stereotype threat they abandon the approach of eagerly approaching the gains that will make them their best and rather become vigilant to avoid the losses that will make them their worst. Thus, under stereotype threat people adopt a short-term prevention focus, in which they become sensitive to the presence or absence of losses in their environment.

Situational fluctuations in regulatory focus can in turn affect performance. People perform optimally when the tasks’ reward structure matches their current regulatory focus, that is, when there is a regulatory fit ( Higgins, 2000 ; Higgins, Idson, Freitas, Spiegel, & Molden, 2003 ). People with a promotion focus perform best on tasks that emphasize gains; people with a prevention focus perform best on tasks that emphasize losses. Importantly, research demonstrating stereotype-threat-related performance deficits can be reinterpreted as representing a lack of regulatory fit. As noted by Grimm and colleagues (2009) , most cognitive tasks have an implicit gains-based structure. For example, the assumed goal of a free-recall memory test is to gain as many correct answers as possible. Thus, under stereotype threat there is a poor fit between the individuals’ stereotype-threat-induced prevention focus and the free-recall tests’ implicit gains-based-reward structure. However, according to the regulatory focus account, the performance deficits typically associated with stereotype threat should be ameliorated when regulatory fit is obtained. That is, stereotype threat should not impair performance when the task emphasizes losses. Research with younger adults has supported this conclusion ( Chalabaev, Major, Sarrazin, & Cury, 2012 ; Grimm, et al., 2009 ).

Studies with older adults have also supported the regulatory focus account of stereotype threat. Although stereotype threat impairs older adults’ memory performance when gains are emphasized, it does not impair (and sometimes improves) their performance when losses are emphasized. This is true both for working memory performance ( Barber & Mather, 2013a ), and also free recall performance ( Barber & Mather, 2013b ). Furthermore, consistent with a loss-averse prevention focus, older adults under threat adopt a more conservative response criterion during a recognition test ( Barber & Mather, 2013b ), make fewer recall intrusions ( Barber & Mather, 2013b ), and respond more slowly but with higher accuracy during a letter-canceling task ( Popham & Hess, 2013 ).

In this study we examined the clinical relevance of these findings. Based on the regulatory focus account, we predicted that changing the framing of a clinical mental status examination from focusing on gains towards focusing on losses would ameliorate older adults’ stereotype threat performance deficits.

Method

Participants

We tested 80 older adults from the Los Angeles area. One participant had difficulty understanding task instructions and extremely poor performance. We excluded this participant and recruited a new older adult to replace him. However, reported results are unchanged when retaining this participant. Our final set of 80 participants (53.7% women) were on average 69.54 years old ( SD = 5.33; range = 61–80). Participants were highly educated having completed an average of 17.77 years of education ( SD = 2.13; range = 12–24). For the three participants who indicated they had “16+ years” of education we assumed 17 years of education. For the participant who indicated they had completed trade school we assumed 14 years of education. One participant did not answer this question. Stereotype threat effects are typically greater for older adults with high levels of education ( Hess, Hinson, & Hodges, 2009 ), thus this was expected to increase our observed effects. For more demographic information, see Table 1 .

Table 1.

Participant Characteristics as a Function of Stereotype Threat and Reward Structure Condition

Gains-relatedLosses-related
No-threatStereotype threatNo threatStereotype threat
Age69.2069.5069.3070.15
Sex9 women, 11 men12 women, 8 men10 women, 10 men13 women, 7 men
Education (in years) a16.7017.7518.0518.58
Self-reported health7.557.407.858.05
Self-reported stress3.204.082.903.48
Reappraisal score30.0527.5528.9529.10
Suppression score11.4012.6512.2011.50
Age-group identification26.4522.4525.9023.65
Gains-relatedLosses-related
No-threatStereotype threatNo threatStereotype threat
Age69.2069.5069.3070.15
Sex9 women, 11 men12 women, 8 men10 women, 10 men13 women, 7 men
Education (in years) a16.7017.7518.0518.58
Self-reported health7.557.407.858.05
Self-reported stress3.204.082.903.48
Reappraisal score30.0527.5528.9529.10
Suppression score11.4012.6512.2011.50
Age-group identification26.4522.4525.9023.65

Note. Self-reported health was provided on a 1 (very poor health) to 9 (excellent health) scale. Self-reported stress was provided on 1 (very low) to 9 (very high) scale. Reappraisal and suppression scores came from the Emotion Regulation Questionnaire (Gross & John, 2003). The age-group identification score came from a questionnaire previously used by Garstka and colleagues (2004).

a For the three participants who indicated they had “16+ years” of education we assumed 17 years of education. For the participant who indicated they had completed trade school we assumed 14 years of education. One participant did not answer this question.

Table 1.

Participant Characteristics as a Function of Stereotype Threat and Reward Structure Condition

Gains-relatedLosses-related
No-threatStereotype threatNo threatStereotype threat
Age69.2069.5069.3070.15
Sex9 women, 11 men12 women, 8 men10 women, 10 men13 women, 7 men
Education (in years) a16.7017.7518.0518.58
Self-reported health7.557.407.858.05
Self-reported stress3.204.082.903.48
Reappraisal score30.0527.5528.9529.10
Suppression score11.4012.6512.2011.50
Age-group identification26.4522.4525.9023.65
Gains-relatedLosses-related
No-threatStereotype threatNo threatStereotype threat
Age69.2069.5069.3070.15
Sex9 women, 11 men12 women, 8 men10 women, 10 men13 women, 7 men
Education (in years) a16.7017.7518.0518.58
Self-reported health7.557.407.858.05
Self-reported stress3.204.082.903.48
Reappraisal score30.0527.5528.9529.10
Suppression score11.4012.6512.2011.50
Age-group identification26.4522.4525.9023.65

Note. Self-reported health was provided on a 1 (very poor health) to 9 (excellent health) scale. Self-reported stress was provided on 1 (very low) to 9 (very high) scale. Reappraisal and suppression scores came from the Emotion Regulation Questionnaire (Gross & John, 2003). The age-group identification score came from a questionnaire previously used by Garstka and colleagues (2004).

a For the three participants who indicated they had “16+ years” of education we assumed 17 years of education. For the participant who indicated they had completed trade school we assumed 14 years of education. One participant did not answer this question.

Participants were recruited through a list of research volunteers that was obtained via letters to University of Southern California alumni, newspaper and online ads, and fliers at senior centers and public places. No mental status examinations were administered during recruitment. Participants were compensated $15 per hr in addition to a performance-based monetary reward.

Design

Stereotype threat (threat vs. no-threat) and reward structure of the dependent measures (gains-based vs. losses-based) were manipulated between-subjects. As described below, there were three critical mental status examinations: The Word List Memory Test from the Consortium to Establish a Registry on Alzheimer’s Disease (CERAD; Welsh, Butters, Hughes, Mohs & Heyman, 1991 ), the Addenbrooke’s Cognitive Examination-Revised (ACE-R; Mioshi, Dawson, Mitchell, Arnold, & Hodges, 2006 ), and the Mini Mental State Examination (MMSE; Folstein, Folstein, & McHugh, 1975 ). Also described below, we included two baseline mental status examinations: A Control Word List Memory Test (which was formatted to be identical to the one from the CERAD) and the MMSE.

Critical Mental Status Examinations

Word List Memory Test from the CERAD.

The Word List Memory Test is part of a larger battery of tests that assess the cognitive processes affected by Alzheimer’s disease ( Morris, Mohs, Rogers, Fillenbaum, & Heyman, 1988 ; Welsh et al., 1991 ). During this test, participants view 10 concrete nouns, presented at the rate of one word every 2 s. They immediately recall aloud as many of the words as possible. This cycle is repeated two more times; words are shown in a different order each cycle. After a filled delay of 10min, participants again recall aloud as many of the 10 words as possible. Finally, participants complete a recognition memory test in which the 10 critical words are interspersed with 10 new words. There are many ways to score performance; we assumed a total of 10 points on each of the 5 sub-tests—the 10 words that could be recalled on each of the three initial study-test cycles, the 10 words that could be recalled on the delayed recall test, and the 10 hits on the delayed recognition test.

ACE-R.

This is a brief test of cognitive functions sensitive to the early stages of dementia. There are a total of 100 possible points that can be obtained on the ACE-R; 30 of which correspond to the MMSE ( Folstein, Folstein, & McHugh, 1975 ; Mioshi et al., 2006 ). Five cognitive domains are assessed. The first domain is verbal fluency. Here, participants complete a letter fluency task (generate words beginning with the letter “P” for 1min) and a category fluency task (generate names of animals for 1min). The second domain is orientation and attention (e.g., “What is the date today?”). The third domain is language abilities. Here, participants are asked to name line drawings, to comprehend sentences, to repeat words and phrases, and to write a sentence. The fourth domain is memory abilities. Participants are asked to learn three words as well as a name and address, and then recall that information after a delay. They are also asked to recall the names of famous individuals. We made two modifications to the ACE-R questions assessing memory abilities to accommodate our American participants. First, rather than ask participants to name the current Prime Minister of the U.K., we instead asked them to name the current Vice President of the United States. Second, we edited the address that participants were asked to learn and remember from being located in England to being located in the United States. The final domain is visuospatial abilities. Participants copy overlapping pentagons, copy a cube, and draw a clock face.

MMSE.

This is a brief measure of cognitive status, with a maximum score of 30 points ( Folstein et al., 1975 ). It includes questions assessing orientation, word recall, attention and calculation, language abilities, and visuospatial abilities ( Tombaugh & McIntyre, 1992 ). As noted above, all of the MMSE questions were embedded in the ACE-R.

Baseline Mental Status Examinations

Control Word List Memory Test.

Using the MRC Psycholinguistic Database ( Wilson, 1988 ), we constructed two sets of 10 words that were approximately matched in number of letters, word frequency, familiarity, imagibility, and concreteness to the 10 critical words, and to the 10 recognition lures, in the Word List Memory Test from the CERAD ( Welsh et al., 1991 ). The procedure of the Control Word List Memory Test was identical to that of the Word List Memory Test from the CERAD (described above) using these new critical words and recognition test lures.

MMSE.

Participants completed the MMSE both at baseline and later as part of the ACE-R. The only difference in the questions between the two administrations was the identity of the three words that participants were asked to repeat and recall.

Procedure

Participants first completed the three study-test free-recall cycles of the Control Word List Memory Test (described above). There was then a 10-minute filled delay during which participants completed the baseline MMSE (described above), and spent remaining time completing puzzles. Participants then completed the delayed recall and recognition tests of the Control Word List Memory Test.

We next manipulated stereotype threat by asking participants to read a fictitious news article taken from Hess, Auman, Colcombe, and Rahhal (2003) . In the stereotype threat condition the article described research confirming that memory declines with age. In the no-threat condition the article described preservation of memory for words and increased language abilities with age. The experimenter stated that the current study’s purpose was either to confirm that “memory inevitably declines with age” or that “verbal abilities and memory for words improves with age.”

Participants then completed the Word List Memory Test from the CERAD (described above). While doing this, participants were given a performance-based monetary reward. For half of the participants the reward was gains-based. Participants received one poker chip for each word correctly recalled. For the other half of the participants the reward was losses-based. Participants were initially given 60 poker chips, but lost one chip for each word forgotten or false alarm made. In both reward-conditions, each chip was worth five cents, and corresponded to an actual monetary payment given to participants at the end of the experiment. Participants’ chips (that they had accumulated or not lost) were placed beside them and were clearly visible.

After completing the three study-test-free-recall cycles of the Word List Memory Test from the CERAD there was a 10-minute delay period. During this time participants completed a demographics questionnaire, the Emotion Regulation Questionnaire ( Gross & John, 2003 ) and the Positive and Negative Affective Scale (PANAS; Watson, Clark, & Tellegen, 1988 ). Remaining time (if any) was spent doing puzzles. The experimenter next reminded participants about the performance-based reward and restated the threat/no-threat manipulation. Participants then completed the delayed recall and recognition tests of the Word List Memory Test from the CERAD (with either a gains-based or losses-based reward).

We next manipulated stereotype threat again by having participants read a second fictitious news article. This time the articles focused on how aging affects general cognitive abilities (copies of these fictitious news articles may be obtained from any of the authors.). In the stereotype threat condition the article stated that general cognitive decline begins as early as age 45, rapidly accelerates after age 60, and causes older adults to score lower on tests of intelligence, reasoning, decision making, and verbal abilities. The experimenter also stated that the participant was at the older end of the age-spectrum being tested. In the no-threat condition, the article stated that cognitive declines are minimal before the age of 85, and that people generally preserve their intelligence, reasoning, and verbal abilities as they age. The experimenter also stated that the participant was at the younger end of the age-spectrum being tested. Assignment to the stereotype threat or no-threat conditions was consistent throughout the experiment.

Participants then completed the ACE-R (described above, it includes all the questions from the MMSE). It was coupled with a performance-based monetary reward. When the reward was gains-based participants received one poker chip for each point earned (for a maximum of 100 chips). When the reward was losses-based, participants were initially given 100 poker chips, and lost one chip for each point lost on the ACE-R. As before, each chip was worth five cents, corresponded to an actual monetary payment, and chips were always clearly visible to participants. As with the stereotype threat manipulation, assignment to the gains-based versus losses-based reward structure conditions was consistent throughout the entire experiment.

At the end of the experiment, participants completed a series of questionnaires. As a manipulation check, participants completed an adaptation of the Perceived Stereotype Threat Questionnaire ( Kang & Chasteen, 2009 ). We included five-items assessing situational perceptions of threat (e.g., “The researcher will think that older adults, as a whole, have poorer memory and cognitive abilities if I did not do well on the tests”) and three-items assessing dispositional perceptions of threat (e.g., “I often feel that I have to prove to others that their perceptions of my memory abilities are wrong”). We also included four-items assessing trait perceptions of age-group discrimination (e.g., “I feel like I am personally a victim of society because of my age”; Garstka, Schmitt, Branscombe, & Hummert, 2004 ). Our stereotype threat manipulation was expected to affect situational perceptions of threat, but not dispositional levels of threat or perceived age-group discrimination.

Within this final series of questionnaires, participants also noted how well they thought they had done on the tests (“I did well on the memory tests”), and indicated the amount of anxiety they experienced while reading the news articles (using a scale from Osborne, 2001 ). Participants also completed an age-group identification questionnaire (“I like being a member of my age group”; from Garstka, et al., 2004 ).

Results

Baseline Measures

We ran a series of 2 (stereotype threat condition) × 2 (reward structure condition) analysis of variance (ANOVA) to ensure that participants across the conditions did not differ on our assessed demographic variables ( Table 1 ), or in baseline mental status examination performance. Results revealed that the participants randomly assigned to the losses-based reward structure conditions were more highly educated ( M = 18.31 years) than those assigned to the gains-based reward structure conditions ( M = 17.23 years), F (1, 75) = 5.56, MSE = 4.21, p = .02, η p2 = .07.

Results also revealed an interaction between stereotype threat and reward structure conditions on baseline MMSE performance, F (1, 76) = 6.40, MSE = 1.76, p = .01, η p2 = .08. This interaction occurred because participants randomly assigned to the stereotype-threat gains-related condition ( M = 28.95) had higher baseline MMSE performance than participants assigned to the no-threat gains-related condition ( M = 28.10), F (1, 38) = 4.67, MSE = 1.55, p = .04, η p2 = .11. Importantly, this pattern is opposite to the one expected on the critical measures, and was also absent on the baseline Control Word List Memory Test. Thus, if stereotype threat impairs critical mental status examination performance when gaining rewards is emphasized but not when avoiding losses is emphasized, this cannot readily be explained by these baseline differences.

Manipulation Checks

Our stereotype threat manipulations were effective. Participants in the stereotype threat conditions ( M = 14.15) self-reported higher situational perceptions of threat (on the Perceived Stereotype Threat Questionnaire adapted from Kang & Chasteen, 2009 ; observed Cronbach’s alpha = 0.65) than participants in the no-threat conditions ( M = 12.00), F (1, 76) = 8.17, MSE = 11.32, p = .005, η p2 = .10. This replicates previous results ( Chasteen, Bhattacharyya, Horhota, Tam, & Hasher, 2005 ; Kang & Chasteen, 2009 ), and did not vary by the reward-structure manipulation, F < 1. In contrast, participants in the stereotype threat conditions ( M = 8.68) and no-threat conditions ( M = 8.88) did not vary in dispositional levels of stereotype threat, F < 1, or in perceived age-discrimination, F < 1.

Our stereotype threat manipulations also affected anxiety (observed Cronbach’s alpha = 0.51). Participants in the stereotype threat conditions ( M = 1.43) self-reported higher anxiety while reading the news articles than did participants in the no-threat conditions ( M = 0.83), F (1, 76) = 4.60, MSE = 1.56, p = .04, η p2 = .06. This replicates previous results ( Abrams, Eller, & Bryant, 2006 ), and did not vary by the reward-structure manipulation, F (1, 76) = 1.57, MSE = 1.56, p = .22, η p2 = .02. In contrast, there were no significant differences amongst the conditions in self-reported positive and negative affective states (assessed via the PANAS) or in self-perceptions of performance (in answer to the question “I did well on the memory tests”).

Performance on the Word List Memory Test From the CERAD and on the MMSE

To examine the impact of stereotype threat on mental status examination performance, we first examined performance on the Word List Memory Test from the CERAD (defined here as the total score across the sub-tests) and on the MMSE. As described above, all participants completed baseline versions of these tests (i.e., the baseline Control Word List Memory test or the baseline MMSE). We then manipulated stereotype threat before asking participants to complete critical versions of these tests, which included either a gains-based or losses-based monetary incentive. Because of this design, we used change scores as our units of analysis. We predicted that threat would impair performance when gaining rewards were emphasized but not when avoiding losses were emphasized. To test this, we conducted a 2 (stereotype threat condition) × 2 (reward structure condition) multivariate analysis of covariance (MANCOVA) on performance change scores and included three covariates. First, we included years of education because the reward-structure conditions differed on this measure (For the one participant who did not specify her years of education, we imputed the average educational obtainment from all other participants [i.e., 17.8 years] for use in these analyses.). We also included chronological age and sex as covariates because these variables were associated with performance. The older the participant the worse they tended to perform. Women tended to have higher mental status examination performance than men.

Consistent with our predictions, within this MANCOVA there was a significant multivariate interaction between the stereotype threat and reward structure conditions, F (2, 72) = 4.71, p = .01, η p2 = .11. As can be seen in Figure 1 and 2 , follow-up multivariate analyses showed that stereotype threat led participants to underperform when the assessments had a gains-based structure, F (2, 34) = 7.68, p < .01, η p2 = .31, but not when they had a losses-based structure, F (2, 34) = 1.11, p = .34, η p2 = .06.

Figure 1.

Adjusted performance on the critical mini mental state examination (MMSE; embedded in the Addenbrooke’s Cognitive Examination-Revised [ACE-R]) as a function of stereotype threat and task reward structure. Means are adjusted for the covariates of educational attainment, chronological age, sex, and baseline MMSE performance. Error bars represent the adjusted standard errors of the means.

Figure 2.

Adjusted performance on the critical Word List Memory Test from the Consortium to Establish a Registry on Alzheimer’s disease (CERAD) as a function of stereotype threat and task reward structure. Means are adjusted for the covariates of educational attainment, chronological age, sex, and baseline Control Word List Memory Test performance. Error bars represent the adjusted standard errors of the means.

The prior analyses show that stereotype threat can impact overall mental status examination performance. Does this vary across the cognitive domains included in such examinations? To answer this, we next conducted a 2 (stereotype threat condition) × 2 (reward structure condition) × 5 (sub-test of the Word List Memory Test from the CERAD change score: Immediate recall study-test cycle 1, Immediate recall study-test cycle 2, Immediate recall study-test cycle 3, Delayed recall, Delayed hits) ANCOVA. We again included education, age, and sex as covariates. Results revealed a significant interaction between the stereotype threat and reward structure conditions, F (1, 73) = 3.88, MSE = 3.18, p = .05, η p2 = .05. This interaction did not vary across the sub-tests, F < 1. As shown in Table 2 , across the sub-tests, stereotype threat led to performance impairments when correct responding led to gains, F (1, 35) = 8.34, MSE = 2.68, p = .007, η p2 = .19, but not when incorrect responding led to losses, F < 1.

Table 2.

Unadjusted Mean Performance on the Baseline Control Word List Memory Test and on the Critical Word List Memory Test From the Consortium to Establish a Registry on Alzheimer’s Disease (CERAD) as a Function of Stereotype Threat and Reward Structure Condition. The Maximum Possible Score on Each of the Sub-Tests is 10 Points

Gains-relatedLosses-related
No threatStereotype threatNo threatStereotype threat
Baseline Word List Memory Test
 Immediate recall study-test cycle 14.554.755.054.75
 Immediate recall study-test cycle 26.456.756.956.60
 Immediate recall study-test cycle 37.557.608.207.65
 Delayed recall5.806.155.855.90
 Delayed hits9.809.759.809.50
Critical Word List Memory Test
 Immediate recall study-test cycle 14.504.404.504.75
 Immediate recall study-test cycle 26.456.306.856.75
 Immediate recall study-test cycle 38.007.158.057.20
 Delayed recall5.504.504.854.75
 Delayed hits9.759.659.809.50
Change in performance (critical test minus baseline test performance )
 Immediate recall study-test cycle 1−0.05−0.35−0.550.00
 Immediate recall study-test cycle 20.00−0.45−0.100.15
 Immediate recall study-test cycle 30.45−0.45−0.15−0.45
 Delayed recall−0.30−1.65−1.00−1.15
 Delayed hits−0.05−0.100.000.00
Gains-relatedLosses-related
No threatStereotype threatNo threatStereotype threat
Baseline Word List Memory Test
 Immediate recall study-test cycle 14.554.755.054.75
 Immediate recall study-test cycle 26.456.756.956.60
 Immediate recall study-test cycle 37.557.608.207.65
 Delayed recall5.806.155.855.90
 Delayed hits9.809.759.809.50
Critical Word List Memory Test
 Immediate recall study-test cycle 14.504.404.504.75
 Immediate recall study-test cycle 26.456.306.856.75
 Immediate recall study-test cycle 38.007.158.057.20
 Delayed recall5.504.504.854.75
 Delayed hits9.759.659.809.50
Change in performance (critical test minus baseline test performance )
 Immediate recall study-test cycle 1−0.05−0.35−0.550.00
 Immediate recall study-test cycle 20.00−0.45−0.100.15
 Immediate recall study-test cycle 30.45−0.45−0.15−0.45
 Delayed recall−0.30−1.65−1.00−1.15
 Delayed hits−0.05−0.100.000.00
Table 2.

Unadjusted Mean Performance on the Baseline Control Word List Memory Test and on the Critical Word List Memory Test From the Consortium to Establish a Registry on Alzheimer’s Disease (CERAD) as a Function of Stereotype Threat and Reward Structure Condition. The Maximum Possible Score on Each of the Sub-Tests is 10 Points

Gains-relatedLosses-related
No threatStereotype threatNo threatStereotype threat
Baseline Word List Memory Test
 Immediate recall study-test cycle 14.554.755.054.75
 Immediate recall study-test cycle 26.456.756.956.60
 Immediate recall study-test cycle 37.557.608.207.65
 Delayed recall5.806.155.855.90
 Delayed hits9.809.759.809.50
Critical Word List Memory Test
 Immediate recall study-test cycle 14.504.404.504.75
 Immediate recall study-test cycle 26.456.306.856.75
 Immediate recall study-test cycle 38.007.158.057.20
 Delayed recall5.504.504.854.75
 Delayed hits9.759.659.809.50
Change in performance (critical test minus baseline test performance )
 Immediate recall study-test cycle 1−0.05−0.35−0.550.00
 Immediate recall study-test cycle 20.00−0.45−0.100.15
 Immediate recall study-test cycle 30.45−0.45−0.15−0.45
 Delayed recall−0.30−1.65−1.00−1.15
 Delayed hits−0.05−0.100.000.00
Gains-relatedLosses-related
No threatStereotype threatNo threatStereotype threat
Baseline Word List Memory Test
 Immediate recall study-test cycle 14.554.755.054.75
 Immediate recall study-test cycle 26.456.756.956.60
 Immediate recall study-test cycle 37.557.608.207.65
 Delayed recall5.806.155.855.90
 Delayed hits9.809.759.809.50
Critical Word List Memory Test
 Immediate recall study-test cycle 14.504.404.504.75
 Immediate recall study-test cycle 26.456.306.856.75
 Immediate recall study-test cycle 38.007.158.057.20
 Delayed recall5.504.504.854.75
 Delayed hits9.759.659.809.50
Change in performance (critical test minus baseline test performance )
 Immediate recall study-test cycle 1−0.05−0.35−0.550.00
 Immediate recall study-test cycle 20.00−0.45−0.100.15
 Immediate recall study-test cycle 30.45−0.45−0.15−0.45
 Delayed recall−0.30−1.65−1.00−1.15
 Delayed hits−0.05−0.100.000.00

Performance on the ACE-R

Participants also completed the ACE-R. In contrast to the Word List Memory Test and the MMSE this was done only after our manipulation of stereotype threat. Therefore, to examine ACE-R performance we next conducted a 2 (stereotype threat condition) × 2 (reward structure condition) × 5 (ACE-R sub-test: orientation, memory, fluency, language, or visuospatial abilities) ANCOVA on performance (scores on the sub-tests were standardized). In addition to education, age, and sex, we included baseline MMSE score as a covariate to account for individual differences in baseline cognitive performance; baseline MMSE scores ranged from 25 to 30 ( M = 28.65, SD = 1.36).

As shown in Table 3 , the pattern of results was consistent with our predictions; there was an interaction between the stereotype threat and reward structure conditions, F (1, 72) = 2.90, MSE = 1.03, p = .047 (one-tailed), η p2 = .04. This interaction did not vary across the sub-tests, F < 1. Follow-up analyses revealed no significant effects. However, the interaction between stereotype threat and reward structure condition was driven by the fact that stereotype threat was associated with numerically poorer performance when correct responding led to gains, F (1, 34) = 1.67, MSE = 1.00, p = .10 (one-tailed), η p2 = .05 .04, but did not affect performance when incorrect responding led to losses, F < 1. However, as noted above, neither of these follow-up analyses approached conventional levels of statistical significance.

Table 3.

Mean Performance on the Addenbrooke’s Cognitive Examination-Revised (ACE-R; Adjusted Only for Baseline Mini Mental State Examination [MMSE] Performance) as a Function of Stereotype Threat and Reward Structure Condition. The Maximum Possible Score on Each of the Sub-Tests is Provided in Parentheses Next to the Name of the Test

Gains-relatedLosses-related
No threatStereotype threatNo threatStereotype threat
Orientation and attention (18)17.8317.5417.7117.63
Memory abilities (26)22.1821.1021.6022.12
Verbal fluency (14)12.4212.0711.9811.98
Language abilities (26)25.0024.6024.6225.34
Visuospatial abilities (16)15.3415.5815.4015.63
Gains-relatedLosses-related
No threatStereotype threatNo threatStereotype threat
Orientation and attention (18)17.8317.5417.7117.63
Memory abilities (26)22.1821.1021.6022.12
Verbal fluency (14)12.4212.0711.9811.98
Language abilities (26)25.0024.6024.6225.34
Visuospatial abilities (16)15.3415.5815.4015.63
Table 3.

Mean Performance on the Addenbrooke’s Cognitive Examination-Revised (ACE-R; Adjusted Only for Baseline Mini Mental State Examination [MMSE] Performance) as a Function of Stereotype Threat and Reward Structure Condition. The Maximum Possible Score on Each of the Sub-Tests is Provided in Parentheses Next to the Name of the Test

Gains-relatedLosses-related
No threatStereotype threatNo threatStereotype threat
Orientation and attention (18)17.8317.5417.7117.63
Memory abilities (26)22.1821.1021.6022.12
Verbal fluency (14)12.4212.0711.9811.98
Language abilities (26)25.0024.6024.6225.34
Visuospatial abilities (16)15.3415.5815.4015.63
Gains-relatedLosses-related
No threatStereotype threatNo threatStereotype threat
Orientation and attention (18)17.8317.5417.7117.63
Memory abilities (26)22.1821.1021.6022.12
Verbal fluency (14)12.4212.0711.9811.98
Language abilities (26)25.0024.6024.6225.34
Visuospatial abilities (16)15.3415.5815.4015.63

Of note, the current results replicate the previous findings reported by Haslam and colleagues (2012) ; stereotype threat exerted a negative effect on ACE-R performance when the examination emphasized gaining correct answers. However, in contrast to Haslam and colleagues, 2012 , stereotype threat did not affect the proportion of participants meeting diagnostic screening criteria for dementia [i.e., the number of participants falling below the cut-off score of 82 on the ACE-R]. In this study, extremely few participants fell below this cut-off. In fact, there were fewer than two participants per conditions, that met this criterion. Thus, although stereotype threat impacts mental status examination performance such effects may not always lead to changes in diagnostic outcomes.

The Roles of Perceived Stereotype Threat and Anxiety

Participants in the stereotype threat conditions self-reported higher situational levels of perceived threat. We next examined whether this mediated the effect of stereotype threat on performance as a function of the examinations’ reward structure. To do this, we conducted a moderated mediation analysis, using Hayes’ (2012 , model 14) PROCESS macro for SPSS. As shown in Figure 3 , the model estimated the effect of stereotype threat on performance directly, as well as indirectly through situational perceptions of threat. The mediating effect of situational perceived threat was also modeled as being moderated by the reward structure of the examinations. A bootstrapping procedure was used to generate a sample size of 5,000 to assess the regression models. The mediator and interaction terms were mean-centered. We used a 95% bias-corrected and accelerated confidence intervals.

Figure 3.

Conceptual diagram of the moderated-mediation analysis.

For the outcome variable we created a composite score that reflected performance on all three of our critical measures: The ACE-R (which includes the critical MMSE score) and the Word List Memory Test from the CERAD. This was done by adding the standardized z -scores from the ACE-R and the Word List Memory Test from the CERAD. We again included education, age, and sex as covariates. We also included standardized performance on both the baseline MMSE and on the Control Word List Memory Test as covariates to account for individual baseline performance differences. We also included self-reported anxiety as a covariate in order to examine the unique contribution of perceived threat, independent of anxiety’s effects. Covariates were allowed to load on both the mediator and outcome variable.

The first part of this analysis assessed the relationship between stereotype threat condition and levels of situational perceived threat, and was significant, F (7, 72) = 3.04, p = .01, R2 = .23. Consistent with our previous analyses, our stereotype threat manipulation was associated with significantly higher levels of situational perceived threat, t (72) = 2.35, p = .02.

The second part of the analysis assessed the relationship between stereotype threat condition and mental status examination performance directly, as well as indirectly through levels of situational perceived threat. In addition, it tested whether the indirect path was moderated by the tasks’ reward structure. Overall, this model was significant and accounted for 65% of the observed variance, F (10, 69) = 12.74, R2 = .65. 11.90, R2 = .63. However, levels of situational perceived threat did exert a significant impact on performance, t (69) = 1.30, p = .20, regardless of the tests’ reward structure, t (69) = 1.16, p = .25.

We repeated the above analysis using anxiety (rather than perceived threat) as the mediating factor (and included situational perception of stereotype threat as a covariate). Within this analysis there were no significant effects. After accounting for the covariates there was no relationship between stereotype threat condition and self-reported anxiety, t (72) = 1.28, p = .21. Furthermore, anxiety did not directly affect performance, t (69) = 1.59, p = .12, regardless of the tests’ reward structure, t (69) = .05, p = .96.

Discussion

Of the many negative stereotypes that exist about older adults, one of the most common is that they are forgetful and prone to “senior moments.” In fact, although cognitive processes do decline with age, simply reminding older adults about ageist ideas often exacerbates their cognitive problems via stereotype threat ( Barber & Mather, 2014 ). These stereotype threat-related impairments have clinical implications for older adults. In one study 70% of older adults scored below the cut-off for dementia on a mental status examination when assessed under stereotype threat, compared to approximately 14% when not assessed under threat ( Haslam et al., 2012 ).

In the current study, we tested whether such stereotype threat effects arise due to changes in regulatory focus in older adults. Results were generally consistent with the regulatory focus account. Stereotype threat led older adults to underperform on the Word List Memory Test of the CERAD and on the MMSE when providing correct answers was associated with monetary gains. Numerically the same pattern emerged on the ACE-R, although this failed to reach statistical significance. In contrast, the negative effects of stereotype threat were eliminated when providing incorrect answers led to monetary losses. These results extend our prior research showing analogous patterns within the domains of working memory ( Barber & Mather, 2013a ) and free recall ( Barber & Mather, 2013b ). However, the effects of stereotype threat on performance were not mediated by either levels of perceived stereotype threat or anxiety, regardless of the exam’s reward structure.

There are several limitations to this study that should be addressed in future research. First, we did not include a true control condition in which no information about how aging affects memory and cognition was presented. Rather, participants were placed in either a stereotype threat condition, which presumably increased their perceptions of threat, or in a “no-threat” condition, which may have decreased their perceptions of threat. This was done to clearly separate the conditions in terms of the amount of threat they experienced. However, it leaves open the question of how much participants naturally feel threat under “neutral” conditions, and also how task framing affects performance for individuals who are not experiencing strong levels of either stereotype threat or threat-alleviation. Our use of a “no-threat” condition may also explain why these participants tended to do better when the task had a gains-based, rather than losses-based, frame. It is possible that the “no-threat” manipulation evoked a promotion motivational focus in which participants became eager to perform their best.

A second limitation is that our losses-based reward structure focused primarily on errors of omission rather than errors of commission. Within the loss-based reward-structure conditions, all errors of omission were associated with point losses. In contrast, the only penalized error of commission was false alarms during the Word List Memory Tests; other errors of commission (such as free-recall intrusions during the Word List Memory Tests or incorrect responses during the MMSE or ACE-R) were not penalized. This loss-based framing is more consistent with a prevention focus than with a promotion focus, and thus we expected stereotype threat effects to be eliminated. However, as noted by a reviewer, stronger regulatory fit effects would likely have emerged if participants had lost points for all errors of commission. Thus, the current results may under-estimate the benefits of loss-based framing for ameliorating stereotype threat effects.

A final limitation of the current study is that we manipulated the reward structure via feedback that was tied to a monetary incentive after every response. This does not readily translate to clinical practice. Future research is needed to determine whether the observed pattern of results would also be obtained by varying whether the verbal instructions emphasize gaining correct answers or avoiding incorrect answers. This in turn could be used as a simple and no-cost method to alleviate older adults’ stereotype threat impairments in clinical settings.

This study provides additional insight into why stereotype threat affects older adults and how this can affect clinically-relevant outcomes. Results suggest that older adults respond to stereotype threat by becoming vigilant to avoid confirming the conclusion that they have experienced cognitive declines. However, researchers and clinicians can capitalize on this motivational change to combat stereotype threat’s negative effects. By using a loss-avoidance frame to the task, stereotype threat effects can be ameliorated or even eliminated.

Funding

This work was supported by grants from the National Institute on Aging (grant numbers R01-AG046464, R01-AG038043, and K02-AG032309).

Acknowledgments

Thanks are due to Lauran Evans, Josh Faskowitz, and Rico Velasco for research assistance. Thanks are also due to Dr. Tom Hess for providing us with part of the stereotype threat manipulation.

References

Abrams
D.
Eller
A.
Bryant
J
. (
2006
).
An age apart: The effects of intergenerational contact and stereotype threat on performance and intergroup bias
.
Psychology and Aging
,
21
,
691
702
. doi:
10.1037/0882-7974.21.4.691

Barber
S. J.
Mather
M
. (
2013a
).
Stereotype threat can both enhance and impair older adults’ memory
.
Psychological Science
,
24
,
2522
2529
. doi:
10.1177/0956797613497023

Barber
S. J.
Mather
M
. (
2013b
).
Stereotype threat can reduce older adults’ memory errors
.
The Quarterly Journal of Experimental Psychology
,
66
,
1888
1895
. doi:
10.1080/17470218.2013.840656

Barber
S. J.
, &
Mather
M
. (
2014
).
Stereotype threat in older adults: When and why does it occur, and who is most affected?
In
Verhaeghen
P.
Hertzog
C.
(Eds.),
The Oxford handbook of emotion, social cognition, and problem solving during adulthood
(pp.
302
320
).
Oxford, UK
:
Oxford University Press
.

Borson
S.
Scanlan
J. M.
Watanabe
J.
Tu
S. P.
Lessig
M
. (
2006
).
Improving identification of cognitive impairment in primary care
.
International Journal of Geriatric Psychiatry
,
21
,
349
355
. doi:
10.1002/gps.1470

Boustani
M.
Peterson
B.
Hanson
L.
Harris
R.
, &
Lohr
K. N
. (
2003
).
Screening for dementia in primary care: A summary of the evidence for the US Preventive Services Task Force
.
Annals of Internal Medicine
,
138
,
927
937
. doi:
10.7326/0003-4819-138-11-200306030-00015

Chalabaev
A.
Major
B.
Sarrazin
P.
, &
Cury
F
. (
2012
).
When avoiding failure improves performance: Stereotype threat and the impact of performance goals
.
Motivation and Emotion
,
36
,
130
142
. doi:
10.1007/s11031-011-9241-x

Chasteen
A. L.
Bhattacharyya
S.
Horhota
M.
Tam
R.
Hasher
L
. (
2005
).
How feelings of stereotype threat influence older adults’ memory performance
.
Experimental Aging Research
,
31
,
235
260
. doi:
10.1080/03610730590948177

Chodosh
J.
Petitti
D. B.
Elliott
M.
Hays
R. D.
Crooks
V. C.
Reuben
D. B.
,…
Wenger
N
. (
2004
).
Physician recognition of cognitive impairment: Evaluating the need for improvement
.
Journal of the American Geriatrics Society
,
52
,
1051
1059
. doi:
10.1111/j.1532-5415.2004.52301.x

Folstein
M. F.
Folstein
S. E.
McHugh
P. R
. (
1975
).
“Mini-mental state”. A practical method for grading the cognitive state of patients for the clinician
.
Journal of Psychiatric Research
,
12
,
189
198
. doi:
10.1016/0022-3956(75)90026-6

Garstka
T. A.
Schmitt
M. T.
Branscombe
N. R.
, &
Hummert
M. L
. (
2004
).
How young and older adults differ in their responses to perceived age discrimination
.
Psychology and Aging
,
19
,
326
335
. doi:
10.1037/0882-7974.19.2.326

Grimm
L. R.
Markman
A. B.
Maddox
W. T.
Baldwin
G. C
. (
2009
).
Stereotype threat reinterpreted as a regulatory mismatch
.
Journal of Personality and Social Psychology
,
96
,
288
304
. doi:
10.1037/a0013463

Gross
J. J.
John
O. P
. (
2003
).
Individual differences in two emotion regulation processes: Implications for affect, relationships, and well-being
.
Journal of Personality and Social Psychology
,
85
,
348
362
. doi:
10.1037/0022-3514.85.2.348

Haslam
C.
Morton
T. A.
Haslam
S. A.
Varnes
L.
Graham
R.
Gamaz
L
. (
2012
).
“When the age is in, the wit is out”: Age-related self-categorization and deficit expectations reduce performance on clinical tests used in dementia assessment
.
Psychology and Aging
,
27
,
778
784
. doi:
10.1037/a0027754

Hayes
A. F
. (
2012
).
PROCESS: A versatile computational tool for observed variable mediation, moderation, and conditional process modeling
. Manuscript submitted for publication.

Hess
T. M.
Auman
C.
Colcombe
S. J.
Rahhal
T. A
. (
2003
).
The impact of stereotype threat on age differences in memory performance
.
The Journals of Gerontology, Series B: Psychological Sciences and Social Sciences
,
58
,
P3
11
. doi:
10.1093/geronb/58.1.P3

Hess
T. M.
Hinson
J. T.
Hodges
E. A
. (
2009
).
Moderators of and mechanisms underlying stereotype threat effects on older adults’ memory performance
.
Experimental Aging Research
,
35
,
153
177
. doi:
10.1080/03610730802716413

Higgins
E. T
. (
1997
).
Beyond pleasure and pain
.
American Psychologist
,
52
,
1280
1300
. doi:
10.1037/0003-066X.52.12.1280

Higgins
E. T
. (
1998
).
Promotion and prevention: Regulatory focus as a motivational principle
.
Advances in Experimental Social Psychology
,
30
,
1
46
.

Higgins
E. T
. (
2000
).
Making a good decision: Value from fit
.
American Psychologist
,
55
,
1217
1230
. doi:
10.1037/0003-066X.55.11.1217

Higgins
E. T.
Idson
L. C.
Freitas
A. L.
Spiegel
S.
Molden
D. C
. (
2003
).
Transfer of value from fit
.
Journal of Personality and Social Psychology
,
84
,
1140
1153
. doi:
10.1037/0022-3514.84.6.1140

Kang
S. K.
Chasteen
A. L
. (
2009
).
The moderating role of age-group identification and perceived threat on stereotype threat among older adults
.
International Journal of Aging & Human Development
,
69
,
201
220
. doi:
10.2190/AG.69.3.c

Mioshi
E.
Dawson
K.
Mitchell
J.
Arnold
R.
Hodges
J. R
. (
2006
).
The Addenbrooke’s Cognitive Examination Revised (ACE-R): A brief cognitive test battery for dementia screening
.
International Journal of Geriatric Psychiatry
,
21
,
1078
1085
. doi:
10.1002/gps.1610

Morris
J. C.
Mohs
R. C.
Rogers
H.
Fillenbaum
G.
Heyman
A
. (
1988
).
Consortium to establish a registry for Alzheimer’s disease (CERAD) clinical and neuropsychological assessment of Alzheimer’s disease
.
Psychopharmacology Bulletin
,
24
,
641
652
.

Olafsdóttir
M.
Skoog
I.
Marcusson
J
. (
2000
).
Detection of dementia in primary care: the Linköping study
.
Dementia and Geriatric Cognitive Disorders
,
11
,
223
229
. doi:
10.1159/000017241

Osborne
J. W
. (
2001
).
Testing stereotype threat: Does anxiety explain race and sex differences in achievement?
Contemporary Educational Psychology
,
26
,
291
310
. doi:
10.1006/ceps.2000.1052

Popham
L. E.
, &
Hess
T. M
. (
2013
).
Age differences in the underlying mechanisms of stereotype threat effects
.
The Journals of Gerontology, Series B: Psychological Sciences and Social Sciences
. doi:
10.1093/geronb/gbt093

Seibt
B.
Förster
J
. (
2004
).
Stereotype threat and performance: how self-stereotypes influence processing by inducing regulatory foci
.
Journal of Personality and Social Psychology
,
87
,
38
56
. doi:
10.1037/0022-3514.87.1.38

Steele
C. M
. (
1997
).
A threat in the air. How stereotypes shape intellectual identity and performance
.
American psychologist
,
52
,
613
629
. doi:10.1037/0003-066X.52.6.613

Suhr
J. A.
, &
Gunstad
J
. (
2002
).
“Diagnosis threat”: The effect of negative expectations on cognitive performance in head injury
.
Journal of Clinical and Experimental Neuropsychology
,
24
,
448
457
. doi:
10.1076/jcen.24.4.448.1039

Suhr
J. A.
Gunstad
J
. (
2005
).
Further exploration of the effect of “diagnosis threat” on cognitive performance in individuals with mild head injury
.
Journal of the International Neuropsychological Society
,
11
,
23
29
. doi:
10.1017/S1355617705050010

Tombaugh
T. N.
McIntyre
N. J
. (
1992
).
The mini-mental state examination: a comprehensive review
.
Journal of the American Geriatrics Society
,
40
,
922
935
.

Valcour
V. G.
Masaki
K. H.
Curb
J. D.
Blanchette
P. L
. (
2000
).
The detection of dementia in the primary care setting
.
Archives of Internal Medicine
,
160
,
2964
2968
. doi:
10.1001/archinte.160.19.2964

Watson
D.
Clark
L. A.
Tellegen
A
. (
1988
).
Development and validation of brief measures of positive and negative affect: The PANAS scales
.
Journal of Personality and Social Psychology
,
54
,
1063
1070
. doi:
10.1037/0022-3514.54.6.1063

Welsh
K.
Butters
N.
Hughes
J.
Mohs
R.
Heyman
A
. (
1991
).
Detection of abnormal memory decline in mild cases of Alzheimer’s disease using CERAD neuropsychological measures
.
Archives of Neurology
,
48
,
278
281
. doi:
10.1001/archneur.1991.00530150046016

Wilson
M
. (
1988
).
MRC Psycholinguistic Database: Machine-usable dictionary, version 2.00
.
Behavior Research Methods, Instruments, & Computers
,
20
,
6
10
. doi:
10.3758/BF03202594

Author notes

Decision Editor: Shevaun Neupert, PhD