Abstract

Objectives: Although numerous studies have measured behaviors among individuals in congregate settings, few have focused on resident-to-resident elder mistreatment (R-REM). To our knowledge, there is no psychometrically developed measure of R-REM extant. The quantitative development of a measure of staff-reported R-REM is described. Methods: The design was a prevalent cohort study of residents of 5 long-term care facilities. The primary certified nursing assistant was interviewed about R-REM. Advanced measurement methods were used to develop a measure of R-REM. Results: The loadings on the general factor for the final 11-item scale were greater than those on the group factor except for the item “other physical behavior” (0.63 vs. 0.74), suggesting essential unidimensionality. Although the bifactor model fit was slightly better than that of the unidimensional model, the difference was trivial (bifactor comparative fit index [CFI] = 0.997, root mean square error of approximation [RMSEA] = 0.013, unidimensional CFI = 0.979, and RMSEA = 0.030). However, modest support was provided for use of verbal and physical subscales. The explained common variance statistics were 0.76 for the bifactor model compared with 0.63 for the unidimensional model. Discussion: The development of this R-REM measure will help to advance the measurement and ultimately evaluation of interventions associated with this important and under recognized problem facing residents in long-term care settings.

Although numerous studies have identified, classified, and measured behaviors among individuals in congregate settings and among those with cognitive impairment (Burgio et al., 2002; Cohen-Mansfield, Marx, & Rosenthal, 1989; Logsdon & Teri, 1997; Rose & Pruchno, 1999; Talerico & Evans, 2000; Teresi et al., 1997), practically none have focused on resident-to-resident elder mistreatment (R-REM). To our knowledge, there is no psychometrically developed measure of R-REM extant.

Behaviors directed by residents against other residents are potentially serious because they can be sudden and unexpected, leaving the recipients fearful in anticipation of the next occurrence. The prevalence of R-REM has not been demonstrated fully; however, Lachs, Bachman, Williams, and O’Leary (2007) found that almost 6% of community residents had police contact after being placed in a long-term care institution, the majority of which were due to R-REM.

A recent study of combative behaviors of residents (Morgan et al., 2012), although not focused on R-REM, used diaries and questionnaires completed by frontline staff to document incidents of aggression directed toward them. The majority of staff reported occurrence of incidents within the past month, providing evidence regarding the ubiquity of such aggressive behaviors. Although there are many factors that contribute to resident-to-resident aggression, chief among them is the commingling of residents with a mixture of cognitive impairment, comorbidity, and differing lifestyles and personalities in small places (Teresi, Holmes, & Monaco, 1993). For example, in one of the few studies of R-REM extant, Shinoda-Tagawa and colleagues (2004) found that cognitive impairment, aggressive behavior, and physical independence were associated with physical abuse of recipients. Measurement in the study by Shinoda-Tagawa and colleagues (2004) was from reports to the Department of Health and abstraction of police reports. Most recently, Castle (2012), building on the conceptual work of Lachs and Pillemer (2004) and Stiles, Koren, and Walsh (2002), assessed resident-to-resident abuse using a mailed questionnaire completed by certified nursing assistants (CNAs), with 35 items measuring five domains: verbal, physical, psychological, sexual abuse, and material exploitation (taking other peoples belongings). Although the study was large, with facilities sampled from 10 states, the authors discuss that more refined measures focused on specific residents rather than aggregated reports are needed to determine prevalence and better understand the mechanisms of R-REM.

The first study of abuse of patients in nursing homes (Pillemer & Moore, 1989) resulted in a taxonomy of abuse, based on earlier work in the community (Pillemer & Finkelhor, 1988) using an instrument designed to measure forms of family violence, on which the instrument derived for this study was partially based. The methodology has been replicated in several other countries and has produced remarkably stable estimates of self-reported community elder abuse (Lachs & Pillemer, 2004). There is no “gold standard” in violence research because researchers or anyone outside the violent dyad rarely witnesses actual episodes. Self-report is widely accepted as the primary methodology for case finding in violence research, and this method has been used in this study. Given the high prevalence of cognitive impairment in long-term care settings, staff reports are critical in the assessment of R-REM.

Aims

The purpose of this article is to describe the quantitative development of a measure of staff-reported R-REM (R-REM-staff version; R-REM-S). The qualitative development is described elsewhere (Ramirez et al., in press).

Conceptual Model

The conceptual model is derived from earlier work by Pillemer and Moore (1989) and by qualitative research using focus groups (Rosen et al., 2008). In the latter study, R-REM was categorized as physical, verbal, and sexual and 35 types were identified within these classifications. Later, Pillemer and colleagues (2011) characterized R-REM in terms of typologies, identifying five categories as follows: invasion of privacy, roommate altercations, hostile interpersonal interactions, unprovoked actions, and inappropriate sexual behaviors. Qualitative work (Ramirez et al., in press) identified four broad domains of R-REM: verbal, physical, sexual, and other (e.g., wandering into others’ rooms). These domains form the basis for the measure presented here. For the purpose of this study, resident-to-resident mistreatment is defined as negative and aggressive physical, sexual, or verbal interactions between residents in long-term care settings, which in a community setting would likely be considered as unwelcome.

The general approach taken was derived from a long-standing and well-established measurement strategy in the field of interpersonal aggression. The best known of the instruments based on this strategy is the Conflict Tactics Scale (CTS; Straus, Hamby, Boney-McCoy, & Sugarman, 1996). Measures based on the CTS model have been successfully used with nursing home staff (Pillemer & Moore, 1989). The CTS conforms to a major finding regarding effective measurement of mistreatment, that is, violence measures must be specific. Behaviorally specific questions have a number of advantages: They require fewer definitions of terminology (as would a general question such as “Have you ever been abused?”), they are less subject to a respondent’s interpretation of terms, and they are less dependent on educational level.

Most important, behavioral specificity reduces the effects of potential personal and cultural differences in interpretation of terms such as “abuse” or “neglect” (Acierno, 2003). A further strength of the CTS approach is separating the focus on behaviors from outcomes. The approach is consistent with legal definitions, which negatively sanction assault regardless of whether it results in serious injury. R-REM may also be serious but does not cause obvious physical harm. Shinoda-Tagawa and colleagues (2004) included only individuals who had suffered a documented injury in their study. This rule is likely to exclude individuals who may be psychologically distressed by an incident or who may have been injured in ways that are not obvious (Herbert & Bradshaw, 2004). Thus, the present measure moves beyond the focus only on R-REM that has resulted in injury and includes a wide range of actions, regardless of outcome.

One central feature of R-REM may separate it from nearly every other form of interpersonal violence—it occurs in an environment that is both a residence as well as a health care facility. Accordingly, many health care providers are potentially “stationed” to detect R-REM either directly or indirectly. To not make systematic use of these observers would be to lose an opportunity for more comprehensive case finding and description. Thus, this study used staff reports via a staff version of the R-REM measure, after brief in-service training that is part of customary periodic facility education (e.g., infection control).

Although few systematic studies of staff reporting of R-REM exist, it is noted that nursing home staff have been “violence and conflict reporters” in other research studies not involving R-REM. Remarkably, nursing home staff members have actually reported their own acts of mistreatment (anonymously) toward residents as well as witnessed acts of abuse by other staff (Pillemer & Moore, 1989). These studies would suggest that nursing home staff could be reliable informants for detecting and describing R-REM, particularly because it is not as personally stigmatizing as self-promulgated abuse.

Methods

Design

The design was a prevalent cohort study of all residents who consented or were consented by designated proxies (families or legal guardians). The study was approved by Institutional Review Board, protocol 0803009718.

Sample

Selection of facilities.

Using a pseudo-random number generator procedure, 6 nursing homes were selected from among the population of 21 nursing homes with 250 or more beds in two New York regions. Large facilities were selected to ensure a sufficient number of units to permit randomization within facilities. Two facilities with severe survey deficiencies were excluded. Agreement to participate was obtained from five of the six facilities. The final sample represented 24% of large facilities (250 plus beds) in the two regions.

Procedures

The CNA who was most involved with the resident’s care (usually the primary care CNA) was interviewed by research staff using a structured interview schedule via a computerized assisted interview.

Measure

In addition to the CTS, the staff R-REM instrument was developed after extensive modifications to some items from the Cohen-Mansfield Agitation Instrument (CMAI; Cohen-Mansfield et al., 1989), an instrument that measures behavioral disruption in nursing home residents. The qualitative work in modification and translation is described elsewhere (Ramirez et al., in press). After focus groups with nursing home staff, more R-REM items were added so as to create a new instrument: the resident-to-resident elder mistreatment-staff version (R-REM-S). The items range from those low on aggressiveness (such as screaming at another resident) to increasingly aggressive tactics involving physical violence.

In following in the tradition of elder mistreatment reporting in community studies, both a global dimension and subtypes of R-REM (e.g., physical, verbal, and so on) were posited. The original item pool contained 22 items across four domains: verbal (five items, e.g., “cursing,” “intimidation,” “ethnic slurs,” and so on), physical (seven items, e.g., “hitting,” “kicking,” “scratching,” and so on), sexual (three items, e.g., “saying sexual things,” “inappropriate touching,” and so on), and others (seven items, e.g., “unwanted help,” “threatening gestures,” “wandering,” and so on).

The R-REM-S instrument was designed to collect data on staff recollection of resident-to-resident mistreatment during the past 2-week period. A 2-week period was selected because this time period is often used in psychiatric and behavioral measures addressing underlying latent constructs that may be rare. A list of all potential R-REM behaviors is provided and the number of distinct incidents involving these behaviors is requested. For each incident (up to five), the behaviors involved, the location (e.g., dining area, hallway, resident’s room), and the time (e.g., morning, noon meal, afternoon) it occurred are recorded. In addition, a description of other participant(s) (sex and relationship) and who started it are also recorded. The CNA reports about whether he or she witnessed the incident, and if so, what he or she did about this (e.g., separated residents, redirected residents). Excluding incidents that occurred within the past 2 weeks, the inquiry then focused on the two most serious occurrences within the past year. Finally, the staff were asked to describe the most serious incident, if any that was directed toward staff.

Statistical Procedures

Increasingly, advanced psychometric methods are being used to assess gerontological and health-related measures (Fang et al., 2012; Reeve et al., 2007; Teresi, Stewart, Morales, & Stahl, 2006). According to the conceptual model discussed previously, the items are effect indicators in that the underlying latent variable R-REM gives rise to the indicators and accounts for the variance in the item set. Further, it is posited that the measure is essentially unidimensional with one single underlying attribute but that additional variance arises due to factors that may reflect R-REM or nuisance variables. First, traditional methods of examining dimensionality were performed. This is the method similar to that described in Asparouhov and Muthén (2009), in which a merged exploratory factor analysis (EFA) and confirmatory factor analysis (CFA) are performed fitting a unidimensional model with polychoric correlations. This method uses the exploratory analyses to permit cross-loadings on variables, followed by a confirmatory model with loadings specified on only one factor. The exploratory analyses used principal components estimation and examined tests of scree with cross-loadings permitted. This was followed by the confirmatory analyses of the unidimensional model. Additionally, a two-factor model was tested against a unidimensional model to determine if the fit improved (results not reported).

However, to further investigate unidimensionality, a bifactor model was tested. The bifactor model assumes that a single general trait explains most of the common variance but that group traits explain additional common variance for item subsets (Reise, Moore, & Haviland, 2010). Thus, a general and two or more group factors are posited. Following the methods of Reise and colleagues (2010) and that of Revelle and Zinbarg (2009), a Schmid-Leiman (S-L; Schmid & Leiman, 1957) transformation (using the “psych” R package; Rizopoulus, 2006, 2009) was performed to find an alternative set of group factors for the restricted bifactor (Gibbons et al., 2007) model without cross-loadings. A three-factor model was specified with uncorrelated factors. The S-L procedure is as follows (Jennrich & Bentler, 2011; Reise et al., 2010). First, an oblique rotation factor analysis using the covariance matrix is performed to produce factor loadings, correlations, and unique variances. Then, a single factor analysis of the correlations is performed to extract second-order factors, followed by the S-L orthogonalization of the oblique results. The purpose is to obtain loadings for each item on uncorrelated general and group factors. However, it is noted that the S-L solution is used to guide the final bifactor estimation of loadings using MPLUS because the loadings produced by the S-L solution have been found to be biased (Reise et al. [2010]; see Jennrich and Bentler [2012] for alternative bifactor methods). This procedure permits inspection of the final loadings on the general as contrasted with the group factors and for comparison with a single factor solution in sensitivity analyses to determine if essential unidimensionality is achieved and if subscales such as physical and verbal mistreatment are of any value.

All items were specified to load on the general factor, and the loadings on the group factors were specified following the S–L solution shown in Supplementary Figure 2. The bifactor model was run with the group factors as described previously using MPLUS (version 6.1; Muthén & Muthén, 1998–2010). A robust parameter estimation procedure, based on a mean- and variance-adjusted weighted least squares procedure (WLSMV) contained in MPLUS was used.

Summary statistics from the various models are presented in Table 1. Factor loadings are presented along with communalities (h2) and general factor variance. Both eigenvalues and fit statistics (root mean square error of approximation [RMSEA] and comparative fit index [CFI]) were examined but were not central in decision making because of the desire to find alternative approaches to the use of fit statistics for evaluating model adequacy.

Table 1.

R-REM Item Loadings (λ) From the Unidimensional Exploratory Factor Analysis (MPLUS, Column 2), Schmid–Leiman (S-L) Bifactor Model With Three- and Two-Group Factors (performed with R, and allowing cross-loadings), and MPLUS Bifactor Two-Group Factor Solution Without Cross-Loadings

Item descriptionOne factor, λ (SE)aS–L bifactor solutionsMPLUS bifactor solutions
Three factorTwo factorS–L-based modelbFinal model
Gλ (var.)F1λF2λF3λh2Gλ (var.)F1λF2λh2Gλ (SE)F1λ (SE)F2λ (SE)Gλ (SE)F1λ (SE)F2λ (SE)
Use bad words toward another resident0.91 (0.02)0.78 (0.73)0.420.220.830.71 (0.61)0.570.830.88 (0.05)0.24 (0.16)0.85 (0.05)0.33 (0.10)
Scream at another resident0.95 (0.02)0.80 (0.69)0.520.920.75 (0.62)0.580.890.87 (0.07)0.47 (0.14)0.83 (0.05)0.53 (0.09)
Try to scare, frighten, or threaten (another resident) with words0.85 (0.03)0.72 (0.65)0.480.200.790.64 (0.49)0.640.820.79 (0.08)0.34 (0.17)0.74 (0.07)0.43 (0.13)
Boss around/tell another resident what to do0.79 (0.03)0.62 (0.50)0.610.780.58 (0.55)0.510.600.71 (0.08)0.40 (0.16)0.66 (0.07)0.47 (0.13)
Hit another resident0.81 (0.04)0.68 (0.62)0.520.750.71 (0.68)0.470.750.74 (0.06)0.49 (0.10)0.77 (0.06)0.44 (0.11)
Grab or yank another resident0.75 (0.06)0.62 (0.56)0.540.680.66 (0.64)0.500.690.62 (0.08)0.58 (0.12)0.64 (0.08)0.55 (0.13)
Push or shove another resident0.74 (0.06)0.61 (0.69)0.230.340.550.63 (0.74)0.250.290.540.71 (0.07)0.34 (0.14)0.74 (0.07)0.28 (0.14)
Throw things at another resident0.82 (0.05)0.79 (0.64)0.590.990.70 (0.72)0.380.210.670.92 (0.06)−0.20 (0.22)0.89 (0.06)
Threaten another resident with a cane, fist, or other object0.81 (0.05)0.74 (0.78)0.250.250.700.70 (0.71)0.420.700.85 (0.07)0.02 (0.16)0.88 (0.06)
Other physical behavior like kicking, biting, scratching, or spitting at another resident0.80 (0.06)0.63 (0.40)0.771.000.72 (0.54)0.660.960.61 (0.08)0.74 (0.11)0.63 (0.08)0.74 (0.13)
Going into another resident’s room without asking/taking/touching/ damaging/breaking other residents “personal” things0.51 (0.06)0.50 (0.71)0.270.350.47 (0.74)0.210.300.59 (0.08)−0.23 (0.18)0.55 (0.07)
Eigenvalues5.21.21.40.64.91.81.1
Correlation of scores with factors0.880.780.930.850.770.79
Item descriptionOne factor, λ (SE)aS–L bifactor solutionsMPLUS bifactor solutions
Three factorTwo factorS–L-based modelbFinal model
Gλ (var.)F1λF2λF3λh2Gλ (var.)F1λF2λh2Gλ (SE)F1λ (SE)F2λ (SE)Gλ (SE)F1λ (SE)F2λ (SE)
Use bad words toward another resident0.91 (0.02)0.78 (0.73)0.420.220.830.71 (0.61)0.570.830.88 (0.05)0.24 (0.16)0.85 (0.05)0.33 (0.10)
Scream at another resident0.95 (0.02)0.80 (0.69)0.520.920.75 (0.62)0.580.890.87 (0.07)0.47 (0.14)0.83 (0.05)0.53 (0.09)
Try to scare, frighten, or threaten (another resident) with words0.85 (0.03)0.72 (0.65)0.480.200.790.64 (0.49)0.640.820.79 (0.08)0.34 (0.17)0.74 (0.07)0.43 (0.13)
Boss around/tell another resident what to do0.79 (0.03)0.62 (0.50)0.610.780.58 (0.55)0.510.600.71 (0.08)0.40 (0.16)0.66 (0.07)0.47 (0.13)
Hit another resident0.81 (0.04)0.68 (0.62)0.520.750.71 (0.68)0.470.750.74 (0.06)0.49 (0.10)0.77 (0.06)0.44 (0.11)
Grab or yank another resident0.75 (0.06)0.62 (0.56)0.540.680.66 (0.64)0.500.690.62 (0.08)0.58 (0.12)0.64 (0.08)0.55 (0.13)
Push or shove another resident0.74 (0.06)0.61 (0.69)0.230.340.550.63 (0.74)0.250.290.540.71 (0.07)0.34 (0.14)0.74 (0.07)0.28 (0.14)
Throw things at another resident0.82 (0.05)0.79 (0.64)0.590.990.70 (0.72)0.380.210.670.92 (0.06)−0.20 (0.22)0.89 (0.06)
Threaten another resident with a cane, fist, or other object0.81 (0.05)0.74 (0.78)0.250.250.700.70 (0.71)0.420.700.85 (0.07)0.02 (0.16)0.88 (0.06)
Other physical behavior like kicking, biting, scratching, or spitting at another resident0.80 (0.06)0.63 (0.40)0.771.000.72 (0.54)0.660.960.61 (0.08)0.74 (0.11)0.63 (0.08)0.74 (0.13)
Going into another resident’s room without asking/taking/touching/ damaging/breaking other residents “personal” things0.51 (0.06)0.50 (0.71)0.270.350.47 (0.74)0.210.300.59 (0.08)−0.23 (0.18)0.55 (0.07)
Eigenvalues5.21.21.40.64.91.81.1
Correlation of scores with factors0.880.780.930.850.770.79

Notes: G = general factor; F1, F2, and F3 = group factors; h2 = communality; (uniqueness u2 = 1–communality).

aGeomin (oblique) rotation.

bS–L PC = S–L principal components.

Table 1.

R-REM Item Loadings (λ) From the Unidimensional Exploratory Factor Analysis (MPLUS, Column 2), Schmid–Leiman (S-L) Bifactor Model With Three- and Two-Group Factors (performed with R, and allowing cross-loadings), and MPLUS Bifactor Two-Group Factor Solution Without Cross-Loadings

Item descriptionOne factor, λ (SE)aS–L bifactor solutionsMPLUS bifactor solutions
Three factorTwo factorS–L-based modelbFinal model
Gλ (var.)F1λF2λF3λh2Gλ (var.)F1λF2λh2Gλ (SE)F1λ (SE)F2λ (SE)Gλ (SE)F1λ (SE)F2λ (SE)
Use bad words toward another resident0.91 (0.02)0.78 (0.73)0.420.220.830.71 (0.61)0.570.830.88 (0.05)0.24 (0.16)0.85 (0.05)0.33 (0.10)
Scream at another resident0.95 (0.02)0.80 (0.69)0.520.920.75 (0.62)0.580.890.87 (0.07)0.47 (0.14)0.83 (0.05)0.53 (0.09)
Try to scare, frighten, or threaten (another resident) with words0.85 (0.03)0.72 (0.65)0.480.200.790.64 (0.49)0.640.820.79 (0.08)0.34 (0.17)0.74 (0.07)0.43 (0.13)
Boss around/tell another resident what to do0.79 (0.03)0.62 (0.50)0.610.780.58 (0.55)0.510.600.71 (0.08)0.40 (0.16)0.66 (0.07)0.47 (0.13)
Hit another resident0.81 (0.04)0.68 (0.62)0.520.750.71 (0.68)0.470.750.74 (0.06)0.49 (0.10)0.77 (0.06)0.44 (0.11)
Grab or yank another resident0.75 (0.06)0.62 (0.56)0.540.680.66 (0.64)0.500.690.62 (0.08)0.58 (0.12)0.64 (0.08)0.55 (0.13)
Push or shove another resident0.74 (0.06)0.61 (0.69)0.230.340.550.63 (0.74)0.250.290.540.71 (0.07)0.34 (0.14)0.74 (0.07)0.28 (0.14)
Throw things at another resident0.82 (0.05)0.79 (0.64)0.590.990.70 (0.72)0.380.210.670.92 (0.06)−0.20 (0.22)0.89 (0.06)
Threaten another resident with a cane, fist, or other object0.81 (0.05)0.74 (0.78)0.250.250.700.70 (0.71)0.420.700.85 (0.07)0.02 (0.16)0.88 (0.06)
Other physical behavior like kicking, biting, scratching, or spitting at another resident0.80 (0.06)0.63 (0.40)0.771.000.72 (0.54)0.660.960.61 (0.08)0.74 (0.11)0.63 (0.08)0.74 (0.13)
Going into another resident’s room without asking/taking/touching/ damaging/breaking other residents “personal” things0.51 (0.06)0.50 (0.71)0.270.350.47 (0.74)0.210.300.59 (0.08)−0.23 (0.18)0.55 (0.07)
Eigenvalues5.21.21.40.64.91.81.1
Correlation of scores with factors0.880.780.930.850.770.79
Item descriptionOne factor, λ (SE)aS–L bifactor solutionsMPLUS bifactor solutions
Three factorTwo factorS–L-based modelbFinal model
Gλ (var.)F1λF2λF3λh2Gλ (var.)F1λF2λh2Gλ (SE)F1λ (SE)F2λ (SE)Gλ (SE)F1λ (SE)F2λ (SE)
Use bad words toward another resident0.91 (0.02)0.78 (0.73)0.420.220.830.71 (0.61)0.570.830.88 (0.05)0.24 (0.16)0.85 (0.05)0.33 (0.10)
Scream at another resident0.95 (0.02)0.80 (0.69)0.520.920.75 (0.62)0.580.890.87 (0.07)0.47 (0.14)0.83 (0.05)0.53 (0.09)
Try to scare, frighten, or threaten (another resident) with words0.85 (0.03)0.72 (0.65)0.480.200.790.64 (0.49)0.640.820.79 (0.08)0.34 (0.17)0.74 (0.07)0.43 (0.13)
Boss around/tell another resident what to do0.79 (0.03)0.62 (0.50)0.610.780.58 (0.55)0.510.600.71 (0.08)0.40 (0.16)0.66 (0.07)0.47 (0.13)
Hit another resident0.81 (0.04)0.68 (0.62)0.520.750.71 (0.68)0.470.750.74 (0.06)0.49 (0.10)0.77 (0.06)0.44 (0.11)
Grab or yank another resident0.75 (0.06)0.62 (0.56)0.540.680.66 (0.64)0.500.690.62 (0.08)0.58 (0.12)0.64 (0.08)0.55 (0.13)
Push or shove another resident0.74 (0.06)0.61 (0.69)0.230.340.550.63 (0.74)0.250.290.540.71 (0.07)0.34 (0.14)0.74 (0.07)0.28 (0.14)
Throw things at another resident0.82 (0.05)0.79 (0.64)0.590.990.70 (0.72)0.380.210.670.92 (0.06)−0.20 (0.22)0.89 (0.06)
Threaten another resident with a cane, fist, or other object0.81 (0.05)0.74 (0.78)0.250.250.700.70 (0.71)0.420.700.85 (0.07)0.02 (0.16)0.88 (0.06)
Other physical behavior like kicking, biting, scratching, or spitting at another resident0.80 (0.06)0.63 (0.40)0.771.000.72 (0.54)0.660.960.61 (0.08)0.74 (0.11)0.63 (0.08)0.74 (0.13)
Going into another resident’s room without asking/taking/touching/ damaging/breaking other residents “personal” things0.51 (0.06)0.50 (0.71)0.270.350.47 (0.74)0.210.300.59 (0.08)−0.23 (0.18)0.55 (0.07)
Eigenvalues5.21.21.40.64.91.81.1
Correlation of scores with factors0.880.780.930.850.770.79

Notes: G = general factor; F1, F2, and F3 = group factors; h2 = communality; (uniqueness u2 = 1–communality).

aGeomin (oblique) rotation.

bS–L PC = S–L principal components.

Tests of unidimensionality.

The explained common variance (ECV) provides information about whether the observed variance covariance matrix is close to unidimensionality (Sijtsma, 2009). The ECV can be estimated as the percent of observed variance explained. It is the ratio of the first eigenvalue to the sum of all eigenvalues extracted from a bifactor model analysis and was calculated as the eigenvalue for the general factor divided by the sum of the eigenvalues for the general and group factors from a bifactor model (Reise et al., 2010):

Reise and colleagues also suggest examining the difference in the loadings between the unidimensional model and the general factor loadings (λ1) in the bifactor model as an indication of the degree of distortion that would occur by fitting a unidimensional model to data that are multidimensional.

Tests of reliability.

The methods for assessing reliability that have emerged from the psychometric literature as the preferred statistics include McDonald’s (1970, 1999) omega total and hierarchical (ωt and ωh) derived from linear structural equation modeling (SEM) and those derived from nonlinear SEM. Under the single common factor model, reliability can be evaluated by decomposing the scale score into the sum of the item scores, and the contribution of the common term (λFj) or communality. Known as McDonald’s (1999) omega total (ωt), this reliability estimate is based on the proportion of total common variance explained. Alternatively, this can be expressed in terms of the unique variance, where the unique variance contains both specific and error variance:

(Revelle and Zinbarg, 2009; formula 19). Omega hierarchical (ωh; McDonald, 1970) is calculated as the sum of the squared loadings on the general factor divided by the total scale score variance (Vx; Revelle & Zinbarg, 2009): For the one- or two-factor model, ωh is not meaningful because three factors are required for identification in the calculation of ωh. However, omega total (ωt) is interpretable for a unidimensional solution.

Item response theory parameter estimation.

Additionally, item response theory (IRT), applying the two-parameter logistic model (Lord & Novick, 1968) was used to evaluate the measure. IRT has been used to model gerontological data sets. See Teresi, Cross, & Golden, 1989; Teresi, Kleinman, & Ocepek-Welikson, 2000 for didactic explications of the models in that context. The estimates for the discrimination and severity parameters (a and b, respectively) were evaluated, the item and test information functions were graphed, and the reliability estimates were calculated for points along the dimension of the underlying construct, denoted as θ (theta).

Tests of IRT model fit and local dependencies.

Various goodness of fit statistics are available to test IRT model fit (Cai, Maydeu-Olivares, Coffman, & Thissen, 2006). Residual covariances can be examined in the context of violation of the assumption of local independence. The local dependency (LD) statistics (Chen & Thissen, 1997) compare the observed and expected frequencies associated with cross-tabulations of item pairs. Standardized (z) scores are used for comparison across items with different response categories: graphic (Thissen, 2011). If the observed covariation between responses to a pair of items is greater than that predicted by the model, the values are flagged indicating that a cluster of items may measure an unmodeled dimension. Values that are larger than 10 are considered problematic (Thissen, 2011).

Software.

The ECV and McDonald’s omega were estimated using both exploratory and confirmatory factor analyses. R software (R Development Core Team, 2008) was used to calculate McDonald’s omega statistics, as well as several others recommended by Revelle and Zinbarg (2009), contained in the “psych” package that they developed in R (www.R-project.org). Polychoric and polyserial correlations based on the underlying continuous normal variables (Zi) were estimated using a SEM package, MPLUS (Muthén & Muthén, 1998–2010). The S–L solution was obtained in the R “psych” package and loadings were estimated using MPLUS software. Item Response Theory Patient Reported Outcomes (IRTPRO) software (Cai, Thissen, & du Toit, 2011; Thissen, 2011) was used for IRT parameter estimation and tests of model fit. IRTPRO incorporates various tests of LD, for example, SS-χ2 (Orlando & Thissen, 2003). The IBM Statistical Package for the Social Sciences (IBM SPSS, 2010) reliability module calculates the corrected item-total correlations and overall classical test theory estimate of reliability of the item set.

Results

The analytic sample of residents whose behavior was evaluated by the staff at baseline was 1,812. The majority (72.1%) were women; the mean age was 84.1 (SD = 10.0) years; the mean educational level was 11.7 (SD =4.1) years; and 19.1% were Black, 16.6% were non-White Hispanic, 61.9% were non-Latino White, and 2.7% other.

Preliminary Analyses

Only the resident-to-resident mistreatment items were included in the psychometric analysis (Table 1). The frequency of specific incidents was low (Table 3). Therefore, the decision was made to combine any occurrence of a specific type of R-REM within the past 2 weeks or past year. Among the items in the final analysis, the highest frequency was 0.09 for the item “scream” and the next highest was 0.08 for the item “use bad words.” Five items with prevalence rates of 0.01 were excluded: “insulting race or ethnic group,” “saying sexual things,” “doing sexual things in front of another resident,” “touching in a sexual manner,” and “getting help when didn’t ask or want help.” Prior to combining items into testlets, there were some instances of local dependencies, for example, one pair of items showed very high LD statistics: “going to another resident’s room without asking” and “touch, damage, or break another resident’s personal things” (61.0). Thus, four items were combined in two testlets based on the LD results: “kicking,” “biting,” “scratching,” and “spitting” were combined and “going into room without asking,” “taking/touching things,” and “damaging things” were combined. The final analytic item set included 11 items (Table 1). After the items were combined, all item pair LD statistics were within the normal range.

EFA Analyses

The simple (4 to 1) eigenvalue rule provided some evidence for essential unidimensionality (eigenvalues of 6.943 to 1.264 or 5.5 to 1). The first component (eigenvalue) explained 63% of the variance (Supplementary Table 1). The scree plot showed a clear dominance of the first component (factor) relative to the other (Supplementary Figure 1). The fit indices for the unidimensional and two-factor models performed by MPLUS were CFI: one factor = 0.979 and two factor = 0.995. The RMSEAs were as follows: one factor = 0.030 and two factor = 0.017 (Supplementary Table 1). Because the eigenvalue ratio was moderate, further tests were performed using the bifactor model.

Bifactor Model

Prior to analyses, the inter-item tetrachoric correlation matrix was reviewed; correlations range from 0.314 to 0.878 (Supplementary Table 2). The graphic depiction of the bifactor model tested is shown in Supplementary Figure 2. The RMSEA statistic produced by “psych” R was 0.04 for the three-factor solution, 0.05 for the two-factor S–L solution, and 0.01 for the MPLUS bifactor model, applied using the general and two-group factor results from the S–L solution (Table 2). The MPLUS bifactor models with three-group factors did not converge. Table 1 shows the factor loadings from the S–L solution and from the MPLUS solutions for both the unidimensional and bifactor models. The relative size of the general and group factor loadings (MPLUS two-group factors) indicates a very strong general factor. Loadings for only one item, “other physical behavior like kicking, biting, scratching, or spitting” was higher on a group factor (the second) than the general factor (0.74 vs. 0.63). Two bifactor models were run in MPLUS because two items, “throw things at another resident” and the testlet item “going to another resident’s rooms without asking or touching, damaging, or breaking another residents’ personal things” had negative loadings on the first group factor (–0.20, –0.23), however, not on the general factor, and the item “threaten another resident with a cane, fist, or other object” evidenced a low loading on the group factor (0.02). These three items were specified not to load on any group factors. For the remaining items, the loadings on the general factor were greater than on any of the loadings on the group factors, suggesting essential unidimensionality. Although the fit of the bifactor model was slightly better than the unidimensional model (bifactor: CFI = 0.997, RMSEA = 0.014; unidimensional: CFI = 0.979, RMSEA = 0.030), the difference was trivial. ECV statistics were 0.76 for the bifactor model compared with 0.63 for the unidimensional MPLUS models. However, it is noted that the group factor results did provide some modest support for the use of two subscales: verbal and physical, with the remaining items forming a potential third subset. The highest loadings on the first group factor were for verbal items: “use bad words toward another resident”; “scream at another resident”; “try to scare, frighten, or threaten with words”; and “boss around, tell another resident what to do.” The second group factor comprises physical items: “hit another resident”; “grab or yank another resident”; “push or shove another resident; and “other physical behavior such as kicking, biting, scratching, or spitting at another resident.” Finally, the remaining items loaded on a third S–L group factor: “threaten another resident with cane, fist, or other object”; “throw things at another resident”; and the testlet item “going to another resident’s room without asking or taking, touching, damaging, or breaking another resident’s personal things.” However, as stated earlier, the MPLUS three-group factor solution did not converge; thus, there is robust support for at most two-group factors.

Table 2.

Reliability, Dimensionality, and Model Fit Statistics Obtained From R as the Three-Factor Schmid–Leiman (S-L) Solution, MPLUS One-Factor Solution, and Bifactor Solutions

StatisticsS–L three-group factorsS–L two-group factorsMPLUS (one factor)MPLUS bifactor (general and two-group factors)
Reliability (alpha coefficient for general factor)0.940.94N/AN/A
Omega hierarchical (ωh)0.76N/AN/AN/A
Omega total (ωt)0.960.97N/AN/A
ECV0.640.640.630.76
RMSEA0.040.050.030.01
StatisticsS–L three-group factorsS–L two-group factorsMPLUS (one factor)MPLUS bifactor (general and two-group factors)
Reliability (alpha coefficient for general factor)0.940.94N/AN/A
Omega hierarchical (ωh)0.76N/AN/AN/A
Omega total (ωt)0.960.97N/AN/A
ECV0.640.640.630.76
RMSEA0.040.050.030.01

Notes: ECV = explained common variance; RMSEA = root mean square error of approximation; N/A = not applicable.

Table 2.

Reliability, Dimensionality, and Model Fit Statistics Obtained From R as the Three-Factor Schmid–Leiman (S-L) Solution, MPLUS One-Factor Solution, and Bifactor Solutions

StatisticsS–L three-group factorsS–L two-group factorsMPLUS (one factor)MPLUS bifactor (general and two-group factors)
Reliability (alpha coefficient for general factor)0.940.94N/AN/A
Omega hierarchical (ωh)0.76N/AN/AN/A
Omega total (ωt)0.960.97N/AN/A
ECV0.640.640.630.76
RMSEA0.040.050.030.01
StatisticsS–L three-group factorsS–L two-group factorsMPLUS (one factor)MPLUS bifactor (general and two-group factors)
Reliability (alpha coefficient for general factor)0.940.94N/AN/A
Omega hierarchical (ωh)0.76N/AN/AN/A
Omega total (ωt)0.960.97N/AN/A
ECV0.640.640.630.76
RMSEA0.040.050.030.01

Notes: ECV = explained common variance; RMSEA = root mean square error of approximation; N/A = not applicable.

The IRT discrimination (a) parameters estimated with IRTPRO, ranged from the lowest 1.18 (going to another resident’s room without asking or take, touch, damage, or break another resident’s personal things) to the highest 4.95 (scream; Table 3). Additional items with high discrimination parameters greater than 3.0 were as follows: “use bad words” (4.16); “threaten another resident with cane, fist, or other objects” (3.78); “try to scare, frighten, or threaten with words” (3.63); and “throw things at another resident” (3.52). The severity (difficulty) parameters (b) ranged from 1.43 (scream), the least severe item, to 3.30 (going to another resident’s room without asking/take, touch, damage, or break another resident’s personal things; Table 3). As expected, the verbal items were less severe indicators of R-REM.

Table 3.

Item Response Theory (IRT) Item Parameters and Standard Error Estimates (IRTPRO)

Item descriptionBase rate (% positive)aSE of ab1SE of b1
Use bad words toward another resident7.74.160.501.570.06
Scream at another resident9.34.950.681.430.05
Try to scare, frighten, or threaten (another resident) with words1.93.630.582.290.12
Boss around/tell another resident what to do4.52.650.302.060.11
Hit another resident2.02.600.382.480.16
Grab or yank another resident1.32.240.392.880.26
Push or shove another resident1.52.320.382.760.22
Throw things at another resident1.03.520.692.580.17
Threaten another resident with a cane, fist, or other object0.83.780.842.630.19
Other physical behavior like kicking, biting, scratching, or spitting at another resident1.02.340.442.920.27
Going into another resident’s room without asking or taking/touching/damaging or breaking other residents “personal” things3.61.180.193.300.39
Item descriptionBase rate (% positive)aSE of ab1SE of b1
Use bad words toward another resident7.74.160.501.570.06
Scream at another resident9.34.950.681.430.05
Try to scare, frighten, or threaten (another resident) with words1.93.630.582.290.12
Boss around/tell another resident what to do4.52.650.302.060.11
Hit another resident2.02.600.382.480.16
Grab or yank another resident1.32.240.392.880.26
Push or shove another resident1.52.320.382.760.22
Throw things at another resident1.03.520.692.580.17
Threaten another resident with a cane, fist, or other object0.83.780.842.630.19
Other physical behavior like kicking, biting, scratching, or spitting at another resident1.02.340.442.920.27
Going into another resident’s room without asking or taking/touching/damaging or breaking other residents “personal” things3.61.180.193.300.39
Table 3.

Item Response Theory (IRT) Item Parameters and Standard Error Estimates (IRTPRO)

Item descriptionBase rate (% positive)aSE of ab1SE of b1
Use bad words toward another resident7.74.160.501.570.06
Scream at another resident9.34.950.681.430.05
Try to scare, frighten, or threaten (another resident) with words1.93.630.582.290.12
Boss around/tell another resident what to do4.52.650.302.060.11
Hit another resident2.02.600.382.480.16
Grab or yank another resident1.32.240.392.880.26
Push or shove another resident1.52.320.382.760.22
Throw things at another resident1.03.520.692.580.17
Threaten another resident with a cane, fist, or other object0.83.780.842.630.19
Other physical behavior like kicking, biting, scratching, or spitting at another resident1.02.340.442.920.27
Going into another resident’s room without asking or taking/touching/damaging or breaking other residents “personal” things3.61.180.193.300.39
Item descriptionBase rate (% positive)aSE of ab1SE of b1
Use bad words toward another resident7.74.160.501.570.06
Scream at another resident9.34.950.681.430.05
Try to scare, frighten, or threaten (another resident) with words1.93.630.582.290.12
Boss around/tell another resident what to do4.52.650.302.060.11
Hit another resident2.02.600.382.480.16
Grab or yank another resident1.32.240.392.880.26
Push or shove another resident1.52.320.382.760.22
Throw things at another resident1.03.520.692.580.17
Threaten another resident with a cane, fist, or other object0.83.780.842.630.19
Other physical behavior like kicking, biting, scratching, or spitting at another resident1.02.340.442.920.27
Going into another resident’s room without asking or taking/touching/damaging or breaking other residents “personal” things3.61.180.193.300.39

The information functions (estimated using IRTPRO) for all the items are located at the upper half of the theta continuum; there was little information contributed at the lower half of the underlying construct distribution (Figure 1). The highest information function (5.2) was for the item “scream at another resident,” which peaked at the theta level of 1.6 followed by the item “use bad words” (4.3) at theta 1.6. The items “threaten another resident with cane, fist, or other objects”; “try to scare, frighten, or threaten with words”; and “throw things at another resident” were more informative at even higher theta levels (the highest information points: 3.2 at theta level 2.4, and 3.2, 2.7 all at theta level 2.8). The testlet item “going to another resident’s room without asking or take, touch, damage, or break another resident’s personal things” provides minimum information (highest information point: 0.3 at theta level 2.8; Figure 1 and Supplementary Figure 3).

Figure 1.

Resident-to-resident elder mistreatment (R-REM) item information functions (IRTPRO).

The essential dimensionality estimates were high; the ECV from the “psych” R program was 0.64 and 0.76 from the MPLUS bifactor model. Different reliability estimates were obtained from several methods. The alpha estimate from the “psych” R package was 0.94, omega hierarchical (ωh; three-group solution) was 0.76, and omega total (ωt; two-group solution) was 0.97 (Table 2); thus, adding support for a two- rather than three-group solution. The Cronbach’s alpha reliability estimate was 0.74 and standardized alpha was 0.75. Corrected item-total correlations ranged from 0.30 (grab or yank another resident) to 0.61 (scream; Supplementary Table 3). Based on the IRTPRO results, the estimates of reliability were calculated along the theta continuum and ranged from 0.50 (lower end of theta) to 0.95 at theta 2.4 (Supplementary Table 4). Reliabilities were adequate (≥ 0.90) at θ ≥ 1.2.

Verbal and physical subscales were examined using classical test theory reliability estimates. The combined verbal items (use bad words toward another resident, scream at another resident, try to scare another resident with words, and boss around, tell people what to do) resulted in a Cronbach’s alpha estimate of 0.73 and 0.74 for unstandardized and standardized versions, respectively, with corrected item-total correlations ranging from 0.48 to 0.66. The six physical items (hit another resident; grab another resident; push another resident; throw things at another resident; threaten another resident with cane, fist, or other object; and other physical behavior like kicking, biting, scratching, or spitting at another resident) resulted in a Cronbach’s alpha estimate of 0.65 (for both unstandardized and standardized alphas). The corrected item-total correlations ranged from 0.31 to 0.48. It is noted that the factor analyses placed the item, “threaten another resident with cane, fist, or other object” in the verbal as well as the third factor along with the items “throw things at another resident” and “going to another resident’s room without asking”/“take, touch, damage, or break another resident’s personal things” in the three-group S–L solution, even though the content was verbal threatening of physical actions. Placing the threatening item in the verbal scale reduces slightly the internal consistency estimate of the scale (from 0.73 to 0.71) but increases the alpha estimate if the item is added to the physical subscale (from 0.63 to 0.65). The internal consistency estimate did not change whether the item, “throw things at another resident” was included in the physical subscale (0.63). Thus, one could plausibly use a four-, five-, or six-item subscale for each, if desired. However, the final confirmatory bifactor results shown in Table 1 suggested the possibility of four item measures of physical and verbal mistreatment. It is emphasized that the bottom-line result is that the factor analyses supported most strongly an essentially unidimensional measure.

Table 4.

Distribution of the Resident-to-Resident Elder Mistreatment (R-REM) Summary Score Mapped to the R-REM Estimate (Theta)

Summed scoreN%Cumulative %EAPa [θ/x]SD [θ/x]Modeled %Modeled Cumulative %Reliabilityb
01,52684.384.3–0.2550.83483.9083.90.52
11096.090.30.9020.5698.3592.20.80
2874.895.11.4660.3263.2795.50.92
3442.497.51.7360.2831.8697.40.93
4201.198.61.9520.2731.0698.40.93
5120.799.32.1530.2670.6199.00.93
690.599.82.3450.2620.3799.40.94
720.199.92.5310.2630.2499.70.95
810.199.92.7190.2740.1699.80.94
910.11002.9230.2990.1099.90.94
1000.01003.1590.3450.06100
1100.01003.4260.4020.02100
Summed scoreN%Cumulative %EAPa [θ/x]SD [θ/x]Modeled %Modeled Cumulative %Reliabilityb
01,52684.384.3–0.2550.83483.9083.90.52
11096.090.30.9020.5698.3592.20.80
2874.895.11.4660.3263.2795.50.92
3442.497.51.7360.2831.8697.40.93
4201.198.61.9520.2731.0698.40.93
5120.799.32.1530.2670.6199.00.93
690.599.82.3450.2620.3799.40.94
720.199.92.5310.2630.2499.70.95
810.199.92.7190.2740.1699.80.94
910.11002.9230.2990.1099.90.94
1000.01003.1590.3450.06100
1100.01003.4260.4020.02100

Notes:aEAP is the expected a posteriori estimate (mean of the posterior distribution of theta [θ], conditional on x [the observed response pattern]).

bThe average reliability across all measured levels of theta (θ) is 0.67.

Table 4.

Distribution of the Resident-to-Resident Elder Mistreatment (R-REM) Summary Score Mapped to the R-REM Estimate (Theta)

Summed scoreN%Cumulative %EAPa [θ/x]SD [θ/x]Modeled %Modeled Cumulative %Reliabilityb
01,52684.384.3–0.2550.83483.9083.90.52
11096.090.30.9020.5698.3592.20.80
2874.895.11.4660.3263.2795.50.92
3442.497.51.7360.2831.8697.40.93
4201.198.61.9520.2731.0698.40.93
5120.799.32.1530.2670.6199.00.93
690.599.82.3450.2620.3799.40.94
720.199.92.5310.2630.2499.70.95
810.199.92.7190.2740.1699.80.94
910.11002.9230.2990.1099.90.94
1000.01003.1590.3450.06100
1100.01003.4260.4020.02100
Summed scoreN%Cumulative %EAPa [θ/x]SD [θ/x]Modeled %Modeled Cumulative %Reliabilityb
01,52684.384.3–0.2550.83483.9083.90.52
11096.090.30.9020.5698.3592.20.80
2874.895.11.4660.3263.2795.50.92
3442.497.51.7360.2831.8697.40.93
4201.198.61.9520.2731.0698.40.93
5120.799.32.1530.2670.6199.00.93
690.599.82.3450.2620.3799.40.94
720.199.92.5310.2630.2499.70.95
810.199.92.7190.2740.1699.80.94
910.11002.9230.2990.1099.90.94
1000.01003.1590.3450.06100
1100.01003.4260.4020.02100

Notes:aEAP is the expected a posteriori estimate (mean of the posterior distribution of theta [θ], conditional on x [the observed response pattern]).

bThe average reliability across all measured levels of theta (θ) is 0.67.

Examining the distributional characteristics of the measure, the mean sum score was 0.35 (SD = 0.99) with the median of 0.0; the mean theta was 0.002 (0.62) with the median of −0.26. Skewness statistics were 3.84 (0.06) and 2.27 (0.06) and kurtosis was 17.95 (0.12) and 3.88 (0.12). The Kolmogorow–Smirnov test of normality provided evidence of highly skewed distributions (20.36, p < .001 and 21.38, p < .001; Supplementary Table 5). The majority of the cases had the sum score of zero (84.3%), which is equivalent to the theta value −0.26. Examining the expected item scores mapped onto theta, the highest score for the sample was 9, which is equivalent to a theta value of 2.92. The highest theoretical value for the sum score is 11 corresponding to an estimated theta value of 3.4; however, no cases had the sum score 10 or 11. There is almost a complete overlap of the cumulative proportion for the sum score and the modeled proportion (Table 4).

Discussion

The study supports the unidimensional assessment of R-REM using the 11-item scale; however, modest support was provided for the use of separate subscales for verbal and physical constructs. The S–L bifactor model identified three factors: verbal, physical, and a less differentiated factor including items on room invasion and throwing and threatening gestures; however, this result was not supported by the final estimation procedures.

Limitations

A limitation of these analyses is the low prevalence of many items, resulting in omission of items that may be potentially salient and stressful, for example, sexual encounters. Thus, insertion of a combined item is recommended: “saying sexual things and inappropriate touching of another resident.” The proposed 12-item measure is appended. Additionally, items were combined into presence in the past 2 weeks or year. Although this method may be best for analyses of data and for constructing global summary scales, for individual assessments, it may be desired to collect data separately on recent and past R-REM events to obtain a frequency of occurrences measure to inform appropriate care planning and individual interventions. Thus, the items for up to five incidents occurring within the past 2 weeks and two additional incidents occurring within the past 1 year are included in the measure shown in Supplementary Material. Finally, the distributions of R-REM were skewed and higher reliability estimates were observed at the more severe tail of the R-REM distribution. Reliability estimates are lower at the less severe tail of the distribution where noncases are prevalent. In part, this is due to the low base rate of R-REM. It is not surprising that greater accuracy is observed at the “caseness” tail of the distribution, where reporting data are available in contrast to the “noncase” tail, where the potential for error is most likely due to failure to observe or report R-REM. Nonetheless, it is important to have high reliability among cases because these are the ones singled out for adjudication and appropriate actions. Moreover, the phenomenon of higher reliability in the middle and upper section of the theta distribution is not uncommon in the field of patient-reported outcome data. For example, this phenomenon is commonly observed in data from the National Institutes of Health Patient Reported Outcome Measurement Information System (PROMIS; Reeve et al., 2007; Teresi et al., 2009).

Conclusion

Most studies of aggressive behavior have used staff informant reports or observations. Creative methodologies have typically been developed and employed in studies of behavioral disturbance in nursing home residents. Some of the observational methods that have been used for this purpose include direct videography of groups of residents for brief proscribed periods (Kihlgren et al., 1993), bar-coding strategies in which researchers tally behaviors of interest with a light pen system (Bridges-Parlet, Knopman, & Thompson, 1994; Holmes & Teresi, 1996), and computer-assisted observations in which researchers use a laptop computer to keystroke appropriate tabs when residents are observed in specific activities (e.g., during morning care; Rogers, Holm, & Burgio, 1999). It was determined that none of these methods were practical or feasible for four reasons. First, episodes of R-REM are likely to be too brief and too intermittent to make any of these methods useful. A large research staff would have to be positioned virtually everywhere in the facility for extensive periods to make meaningful observations; this would be costly and potentially invasive for staff and residents, and most importantly, not sustainable beyond the life of a research project. Second, episodes of R-REM are unpredictable. In arriving at this methodology, the investigators also considered oversampling suspected high-risk times and environments (e.g., congregate meals), but no data exist on the epidemiology of when R-REM occurs on which to base such a sampling strategy. Third, violent episodes that are volitional are probably less likely to occur with research observers monitoring residents directly; their presence could actually influence the prevalence of R-REM. Finally, the use of videography was ruled out as too invasive for sustained use, particularly in resident’s rooms. Although the experience with bar-code technology has been positive, it is not likely that this method can be implemented on a wide scale basis without extensive support. However, the increased use of electronic medical records could make this approach viable in the future.

Given the previous considerations, this project adopted a strategy of R-REM case identification that makes use of the best potential reporters—residents themselves and the staff who care for them. Analogously, the instrument proposed to measure the phenomenon derives its methodology from two fields (and representative scales reflective of them) that have proven strategies appropriate for this study—the CTS (derived from the interpersonal aggression literature) and the CMAI (derived from the nursing home behavioral disturbance literature).

A challenge in long-term care settings is recognition and documentation of resident-to-resident mistreatment. Because of the ubiquity of aggressive behaviors in general as well as toward residents (Castle, 2012) and staff (Lachs et al., 2012; Morgan et al., 2012), R-REM may often be ignored. This may be due in part to desensitization because such incidents may be perceived as daily routine events. Although staff training has resulted in increased recognition and reporting (Teresi et al., 2012), documentation is still a challenge. Documentation of aggressive behaviors toward residents is not only important in and of itself for record keeping, but it can also be crucial as a preventive measure at an individual level for risk management and care planning. Additionally, such documentation is important at the institutional level to avoid financial and or licensing penalties (Soreff, 2012). Thus, methods for increasing recognition, reporting, and documentation benefit residents, direct care, and administrative staff.

Castle (2012) has called for measurement approaches at the individual resident level that can provide representative estimates of incidence and prevalence of R-REM. Additionally, such a measure could be incorporated into routine data collection used for care planning. Collectively, it is hoped that the extensive qualitative and quantitative work that resulted in the development of this measure will help to advance the measurement and ultimately interventions associated with this important and underrecognized problem facing residents of long-term care settings.

Funding

This work was supported in part by National Institute on Aging (AG014299-06A2), National Institute of Justice (FYO 42USC3721), New York State Department of Health Dementia Grant Program (contract # C-022657). This work does not necessarily reflect the opinions of the funding organizations.

References

Acierno
R
.
(2003)
.
Elder mistreatment: Epidemiological assessment methodology
. In
Bonnie
R. J.
Wallace
R. B
. (Eds.),
Elder mistreatment: Abuse, neglect, and exploitation in an aging America
(pp.
261
302
).
Washington, DC
:
National Academies Press
.

Asparouhov
T.
Muthén
B
.
(2009)
.
Exploratory structural equation modeling
.
Structural Equation Modeling
,
16
,
397
438
.

Bridges-Parlet
S.
Knopman
D.
Thompson
T
.
(1994)
.
A descriptive study of physically aggressive behaviors by direct observation
.
Journal of the American Geriatrics Society
,
42
,
192
197
.

Burgio
L. D.
Stevens
A.
Burgio
K. L.
Roth
D. L.
Paul
P.
Gerstle
J
.
(2002)
.
Teaching and maintenance behavior management skills in the nursing home
.
The Gerontologist
,
42
,
487
496
.

Cai
L.
Maydeu-Olivares
A.
Coffman
D. L.
Thissen
D
.
(2006)
.
Limited-information goodness-of-fit testing of item response theory models for sparse 2 tables
.
British Journal of Mathematical and Statistical Psychology
,
59
(
Pt 1
)
173
194
.

Cai
L.
Thissen
D.
du Toit
S. H. C
.
(2011)
.
IRTPRO: Flexible, multidimensional, multiple categorical IRT modeling [Computer software]
.
Chicago, IL
:
Scientific Software International
.

Castle
N. G
.
(2012)
.
Resident-to-resident abuse in nursing homes as reported by nurse aides
.
Journal of Elder Abuse & Neglect
,
24
,
340
356
.

Chen
W. H.
Thissen
D
.
(1997)
.
Local dependence indices for item pairs using item response theory
.
Journal of Educational and Behavioral Statistics
,
22
,
265
289
.

Cohen-Mansfield
J.
Marx
M. S.
Rosenthal
A. S
.
(1989)
.
A description of agitation in a nursing home
.
Journal of Gerontology: Social Sciences
,
44
,
577
584
.

Fang
J.
Power
M.
Lin
Y.
Zhang
J.
Hao
Y.
Chatterji
S
.
(2012)
.
Development of short versions for the WHOQOL-OLD module
.
The Gerontologist
,
52
(
1
),
66
78
.

Gibbons
R. D.
Bock
R. D.
Hedeker
D.
Weiss
D. J.
Segawa
E.
Bhaumik
D. K.
, …
Stover
A
.
(2007)
.
Full-information item bi-factor analysis of graded response data
.
Applied Psychological Measurement
,
31
,
4
19
.

Herbert
B.
Bradshaw
Y. S
.
(2004)
.
Comment: Aggressive behaviors and injuries among nursing home residents
.
Journal of the American Medical Association
,
291
,
2074
2075
.

Holmes
D.
Teresi
J
.
(1996)
.
Using technology in behavioral approaches to Alzheimer’s disease
.
International Psychogeriatrics/IPA
,
8
(Suppl
1
),
67
71
.

IBM SPSS
. (
2010
).
Statistics for Windows, Version 19.0
.
Armonk, NY
:
IBM
.

Jennrich
R. I.
Bentler
P. M
.
(2011)
.
Exploratory bi-factor analysis
.
Psychometrika
,
76
(
4
),
537
549
.

Kihlgren
H.
Kuremyr
D.
Norberg
A.
Brane
G.
Karlson
I.
Engstrom
B.
Melin
E
.
(1993)
.
Nurse-patient interaction after training in integrity promoting care at a long-term ward: Analysis of video-recorded morning care sessions
.
International Journal of Nursing Research
,
30
,
1
13
.

Lachs
M.
Bachman
R.
Williams
C. S.
O’Leary
J. R
.
(2007)
.
Resident-to-resident elder mistreatment and police contact in nursing homes: Findings from a population-based cohort
.
Journal of the American Geriatrics Society
,
55
(
6
),
840
845
.

Lachs
M. S.
Pillemer
K. A
.
(2004)
.
Elder abuse
.
Lancet
,
304
,
1236
1272
.

Lachs
M. S.
Rosen
T.
Teresi
J. A.
Eimicke
J. P.
Ramirez
M.
Silver
S.
Pillemer
K
.
(2012)
.
Verbal and physical aggression directed at nursing home staff by residents
.
Journal of General Internal Medicine
.

Logsdon
R. G.
Teri
L
.
(1997)
.
The Pleasant Events Schedule-AD: Psychometric properties and relationship to depression and cognition in Alzheimer’s disease patients
.
The Gerontologist
,
37
(
1
),
40
45
.

Lord
F. M.
Novick
M. R
.
(1968)
.
Statistical theories of mental test scores
.
Reading, MA
:
Addison-Wesley
.

McDonald
R. P
.
(1970)
.
Theoretical foundations of principal factor analysis and alpha factor analysis
.
British Journal of Mathematical and Statistical Psychology
,
23
,
1
21
.

McDonald
R. P
.
(1999)
.
Test theory: A unified treatment
.
Mahwah, NJ
:
L. Erlbaum Associates
.

Morgan
D. G.
Cammer
A.
Stewart
N. J.
Crossley
M.
D’Arcy
C.
Forbes
D. A.
Karunanayake
C
.
(2012)
.
Nursing aide reports of combative behavior by residents with dementia: Results from a detailed prospective incident diary
.
Journal of the American Medical Directors Association
,
13
,
220
227
.

Muthén
L. K.
Muthén
B. O
. (
1998
2010
).
MPLUS users guide
, 6th ed.
Los Angeles, CA
:
Muthén and Muthén
.

Orlando
M.
Thissen
D
.
(2003)
.
Further investigation of the performance of S-X2: An item fit index for use with dichotomous item response theory models
.
Applied Psychological Measurement
,
27
,
289
298
.

Pillemer
K.
Chen
E. K.
Van Haitsma
K. S.
Teresi
J.
Ramirez
M.
Silver
S.
, …
Lachs
M. S
.
(2011)
.
Resident-to-resident aggression in nursing homes: Results from a qualitative event reconstruction study
.
The Gerontologist
,
52
(
1
),
24
33
. doi:
10.1093/gnr107

Pillemer
K.
Finkelhor
D
.
(1988)
.
The prevalence of elder abuse: A random sample survey
.
The Gerontologist
,
28
(
1
),
51
57
.

Pillemer
K.
Moore
D. W
.
(1989)
.
Abuse of patients in nursing homes: Findings from a survey of staff
.
The Gerontologist
,
29
(
3
),
314
320
.

R Development Core Team
.
(2008)
.
R: A language and environment for statistical computing
. R Foundation for Statistical Computing.
Vienna, Austria
(ISBN:3-900051-07-0).

Ramirez
M.
Watkins
B.
Teresi
J. A.
Silver
S.
Sukha
G.
Boratgis
G.
, …
Pillemer
K.
(in press).
Using qualitative methods to develop a measure of resident-to-resident elder mistreatment in nursing homes
.
International Psychogeriatrics
.

Reeve
B. B.
Hays
R. D.
Bjorner
J. B.
Cook
K. F.
Crane
P. K.
Teresi
J. A.
, …
Cella
D
.
(2007)
.
Psychometric evaluation and calibration of health-related quality of life item banks: Plans for the Patient-Reported Outcome Measurement Information System (PROMIS)
.
Medical Care
,
45
, (
5
Suppl
1
),
S22
S31
.

Reise
S. P.
Moore
T. M.
Haviland
M. G
.
(2010)
.
Bifactor models and rotations: Exploring the extent to which multidimensional data yield univocal scale scores
.
Journal of Personality Assessment
,
92
(
6
),
544
559
.

Revelle
W.
Zinbarg
R. E
.
(2009)
.
Coefficient alpha, beta, omega, and the GLB: Comments on Sijtsma
.
Psychometrika
,
74
,
145
154
.

Rizopoulus
D
.
(2006)
.
ltm: An R package for latent variable modeling and item response theory analyses
.
Journal of Statistical Software
,
17
,
1
25
.

Rizopoulus
D
.
(2009)
.
ltm: Latent Trait Models under IRT
. Retrieved from http://cran.rproject.org/web/packages/ltm/index.html

Rogers
J. C.
Holm
M. B.
Burgio
L. D
.
(1999)
.
Improving morning care routines of nursing home residents with dementia
.
Journal of the American Geriatrics Society
,
47
,
1049
1057
.

Rose
M. S.
Pruchno
R. A
.
(1999)
.
Behavior sequences of long-term care residents and their social partners
.
Journal of Gerontology: Social Sciences
,
54B
,
S75
S83
, doi:
10:1093/geronb/54b;2.S75

Rosen
T.
Lachs
M. S.
Bharucha
A. J.
Stevens
S. M.
Teresi
J. A.
Nebres
F.
Pillemer
K
.
(2008)
.
Resident-to-resident aggression in long-term care facilities: Insights from focus groups of nursing home residents and staff
.
Journal of the American Geriatrics Society
,
56
,
1398
1408
. doi:
10.1111/j.1532-5415.2008.01808.x

Schmid
L.
Leiman
J
.
(1957)
.
The development of hierarchical factor solutions
.
Psychometrika
,
22
,
53
61
.

Shinoda-Tagawa
T.
Leonard
R.
Pontikas
J.
McDonough
J. E.
Allen
D.
Dreyer
P. I
.
(2004)
.
Resident-to-resident violent incidents in nursing homes
.
Journal of the American Medical Association
,
4
,
591
598
.

Sijtsma
K
.
(2009)
.
On the Use, the Misuse, and the Very Limited Usefulness of
Cronbach’s
Alpha
.
Psychometrika
,
74
(
1
),
107
120
.

Soreff
S
.
(2012)
.
Violence in the nursing homes: Understandings, management, documentation and impact of resident to resident aggression
In
Olisah
V
. (Ed.),
Essential notes in psychiatry
. Retrieved from http://www.intechopen.com/books/howtoreference/essential-notes-in-psychiatry/violence-in-the-long-term-care-facilities-resident-to-resident-aggression-understandings-managemen

Stiles
M.
Koren
C.
Walsh
K
.
(2002)
.
Identifying elder abuse in a primary care setting
.
Clinical Geriatrics
,
10
,
33
41
.

Straus
M. A.
Hamby
S. L.
Boney-McCoy
S.
Sugarman
D. B
.
(1996)
.
The revised conflict tactics scales (CTS2: Development and preliminary psychometric data)
.
Journal of Family Issues
,
17
,
283
316
.

Talerico
K. A.
Evans
L. K
.
(2000)
.
Making sense of aggressive/protective behaviors in persons with dementia
.
Alzheimer’s Care Quarterly
,
1
,
77
88
.

Teresi
J.
Cross
P.
Golden
R
.
(1989)
.
Some applications of latent trait analysis to the measurement of ADL
.
Journal of Gerontology: Social Sciences
,
44
,
S196
S204
.

Teresi
J. A.
Holmes
D.
Dichter
E.
Koren
M. J.
Ramirez
M.
Fairchild
S
.
(1997)
.
Prevalence of behavior disorder and disturbance to family and staff in a sample of adult day health care clients
.
The Gerontologist
,
37
(
5
),
629
639
.

Teresi
J. A.
Holmes
D.
Monaco
C
.
(1993)
.
An evaluation of the effects of commingling cognitively and noncognitively impaired individuals in long-term care facilities
.
The Gerontologist
,
33
(
3
),
350
358
.

Teresi
J.
Kleinman
M.
Ocepek-Welikson
K
.
(2000)
.
Modern psychometric methods for detection of differential item functioning: Application to cognitive assessment measures
.
Statistics in Medicine
,
19
,
1651
1683
.

Teresi
J.
Ocepek-Welikson
K.
Kleinman
M.
Eimicke
J. E.
Crane
P. K.
Jones
R. N.
, …
Cella
D
.
(2009)
.
Analysis of differential item functioning in the depression item bank from the Patient Reported Outcome Measurement Information System (PROMIS): An item response theory approach
.
Psychology Science Quarterly
,
51
,
148
180
.

Teresi
J. A.
Ramirez
M.
Ellis
J.
Silver
S.
Boratgis
G.
Kong
J.
, …
Lachs
M. S
.
(2012)
.
A staff intervention targeting resident-to-resident elder mistreatment (R-REM) in long-term care increased staff knowledge, recognition and reporting: Results from a cluster randomized trial
.
International Journal of Nursing Studies
.

Teresi
J. A.
Stewart
A. L.
Morales
L. S.
Stahl
S. M
.
(2006)
.
Measurement in a multi-ethnic society. Overview to the special issue
.
Medical Care
,
44
(
11
Suppl
3
),
S3
S4
.

Thissen
D
.
(2011)
.
IRTPRO: Beta features and operation
.

Author notes

Decision Editor: Rachel Pruchno, PhD

Supplementary data