Development of an Instrument to Measure Staff-Reported Resident-to-Resident Elder Mistreatment (R-REM) Using Item Response Theory and Other Latent Variable Models

Table 1.

R-REM Item Loadings (λ) From the Unidimensional Exploratory Factor Analysis (MPLUS, Column 2), Schmid–Leiman (S-L) Bifactor Model With Three- and Two-Group Factors (performed with R, and allowing cross-loadings), and MPLUS Bifactor Two-Group Factor Solution Without Cross-Loadings

Item description	One factor, λ (SE)^a	S–L bifactor solutions									MPLUS bifactor solutions
		Three factor					Two factor				S–L-based model^b			Final model
		Gλ (var.)	F1λ	F2λ	F3λ	h²	Gλ (var.)	F1λ	F2λ	h²	Gλ (SE)	F1λ (SE)	F2λ (SE)	Gλ (SE)	F1λ (SE)	F2λ (SE)
Use bad words toward another resident	0.91 (0.02)	0.78 (0.73)	0.42		0.22	0.83	0.71 (0.61)	0.57		0.83	0.88 (0.05)	0.24 (0.16)		0.85 (0.05)	0.33 (0.10)
Scream at another resident	0.95 (0.02)	0.80 (0.69)	0.52			0.92	0.75 (0.62)	0.58		0.89	0.87 (0.07)	0.47 (0.14)		0.83 (0.05)	0.53 (0.09)
Try to scare, frighten, or threaten (another resident) with words	0.85 (0.03)	0.72 (0.65)	0.48		0.20	0.79	0.64 (0.49)	0.64		0.82	0.79 (0.08)	0.34 (0.17)		0.74 (0.07)	0.43 (0.13)
Boss around/tell another resident what to do	0.79 (0.03)	0.62 (0.50)	0.61			0.78	0.58 (0.55)	0.51		0.60	0.71 (0.08)	0.40 (0.16)		0.66 (0.07)	0.47 (0.13)
Hit another resident	0.81 (0.04)	0.68 (0.62)		0.52		0.75	0.71 (0.68)		0.47	0.75	0.74 (0.06)		0.49 (0.10)	0.77 (0.06)		0.44 (0.11)
Grab or yank another resident	0.75 (0.06)	0.62 (0.56)		0.54		0.68	0.66 (0.64)		0.50	0.69	0.62 (0.08)		0.58 (0.12)	0.64 (0.08)		0.55 (0.13)
Push or shove another resident	0.74 (0.06)	0.61 (0.69)	0.23	0.34		0.55	0.63 (0.74)	0.25	0.29	0.54	0.71 (0.07)		0.34 (0.14)	0.74 (0.07)		0.28 (0.14)
Throw things at another resident	0.82 (0.05)	0.79 (0.64)			0.59	0.99	0.70 (0.72)	0.38	0.21	0.67	0.92 (0.06)	−0.20 (0.22)		0.89 (0.06)
Threaten another resident with a cane, fist, or other object	0.81 (0.05)	0.74 (0.78)	0.25		0.25	0.70	0.70 (0.71)	0.42		0.70	0.85 (0.07)	0.02 (0.16)		0.88 (0.06)
Other physical behavior like kicking, biting, scratching, or spitting at another resident	0.80 (0.06)	0.63 (0.40)		0.77		1.00	0.72 (0.54)		0.66	0.96	0.61 (0.08)		0.74 (0.11)	0.63 (0.08)		0.74 (0.13)
Going into another resident’s room without asking/taking/touching/ damaging/breaking other residents “personal” things	0.51 (0.06)	0.50 (0.71)			0.27	0.35	0.47 (0.74)	0.21		0.30	0.59 (0.08)	−0.23 (0.18)		0.55 (0.07)
Eigenvalues		5.2	1.2	1.4	0.6		4.9	1.8	1.1
Correlation of scores with factors		0.88	0.78	0.93			0.85	0.77	0.79

Item description	One factor, λ (SE)^a	S–L bifactor solutions									MPLUS bifactor solutions
		Three factor					Two factor				S–L-based model^b			Final model
		Gλ (var.)	F1λ	F2λ	F3λ	h²	Gλ (var.)	F1λ	F2λ	h²	Gλ (SE)	F1λ (SE)	F2λ (SE)	Gλ (SE)	F1λ (SE)	F2λ (SE)
Use bad words toward another resident	0.91 (0.02)	0.78 (0.73)	0.42		0.22	0.83	0.71 (0.61)	0.57		0.83	0.88 (0.05)	0.24 (0.16)		0.85 (0.05)	0.33 (0.10)
Scream at another resident	0.95 (0.02)	0.80 (0.69)	0.52			0.92	0.75 (0.62)	0.58		0.89	0.87 (0.07)	0.47 (0.14)		0.83 (0.05)	0.53 (0.09)
Try to scare, frighten, or threaten (another resident) with words	0.85 (0.03)	0.72 (0.65)	0.48		0.20	0.79	0.64 (0.49)	0.64		0.82	0.79 (0.08)	0.34 (0.17)		0.74 (0.07)	0.43 (0.13)
Boss around/tell another resident what to do	0.79 (0.03)	0.62 (0.50)	0.61			0.78	0.58 (0.55)	0.51		0.60	0.71 (0.08)	0.40 (0.16)		0.66 (0.07)	0.47 (0.13)
Hit another resident	0.81 (0.04)	0.68 (0.62)		0.52		0.75	0.71 (0.68)		0.47	0.75	0.74 (0.06)		0.49 (0.10)	0.77 (0.06)		0.44 (0.11)
Grab or yank another resident	0.75 (0.06)	0.62 (0.56)		0.54		0.68	0.66 (0.64)		0.50	0.69	0.62 (0.08)		0.58 (0.12)	0.64 (0.08)		0.55 (0.13)
Push or shove another resident	0.74 (0.06)	0.61 (0.69)	0.23	0.34		0.55	0.63 (0.74)	0.25	0.29	0.54	0.71 (0.07)		0.34 (0.14)	0.74 (0.07)		0.28 (0.14)
Throw things at another resident	0.82 (0.05)	0.79 (0.64)			0.59	0.99	0.70 (0.72)	0.38	0.21	0.67	0.92 (0.06)	−0.20 (0.22)		0.89 (0.06)
Threaten another resident with a cane, fist, or other object	0.81 (0.05)	0.74 (0.78)	0.25		0.25	0.70	0.70 (0.71)	0.42		0.70	0.85 (0.07)	0.02 (0.16)		0.88 (0.06)
Other physical behavior like kicking, biting, scratching, or spitting at another resident	0.80 (0.06)	0.63 (0.40)		0.77		1.00	0.72 (0.54)		0.66	0.96	0.61 (0.08)		0.74 (0.11)	0.63 (0.08)		0.74 (0.13)
Going into another resident’s room without asking/taking/touching/ damaging/breaking other residents “personal” things	0.51 (0.06)	0.50 (0.71)			0.27	0.35	0.47 (0.74)	0.21		0.30	0.59 (0.08)	−0.23 (0.18)		0.55 (0.07)
Eigenvalues		5.2	1.2	1.4	0.6		4.9	1.8	1.1
Correlation of scores with factors		0.88	0.78	0.93			0.85	0.77	0.79

Notes: G = general factor; F1, F2, and F3 = group factors; h² = communality; (uniqueness u² = 1–communality).

^aGeomin (oblique) rotation.

^bS–L PC = S–L principal components.

Tests of unidimensionality.

The explained common variance (ECV) provides information about whether the observed variance covariance matrix is close to unidimensionality (Sijtsma, 2009). The ECV can be estimated as the percent of observed variance explained. It is the ratio of the first eigenvalue to the sum of all eigenvalues extracted from a bifactor model analysis and was calculated as the eigenvalue for the general factor divided by the sum of the eigenvalues for the general and group factors from a bifactor model (Reise et al., 2010):

Reise and colleagues also suggest examining the difference in the loadings between the unidimensional model and the general factor loadings (λ₁) in the bifactor model as an indication of the degree of distortion that would occur by fitting a unidimensional model to data that are multidimensional.

Tests of reliability.

The methods for assessing reliability that have emerged from the psychometric literature as the preferred statistics include McDonald’s (1970, 1999) omega total and hierarchical (ω_t and ω_h) derived from linear structural equation modeling (SEM) and those derived from nonlinear SEM. Under the single common factor model, reliability can be evaluated by decomposing the scale score into the sum of the item scores, and the contribution of the common term (λF_j) or communality. Known as McDonald’s (1999) omega total (ω_t), this reliability estimate is based on the proportion of total common variance explained. Alternatively, this can be expressed in terms of the unique variance, where the unique variance contains both specific and error variance:

(Revelle and Zinbarg, 2009; formula 19). Omega hierarchical (ω_h; McDonald, 1970) is calculated as the sum of the squared loadings on the general factor divided by the total scale score variance (V_x; Revelle & Zinbarg, 2009): For the one- or two-factor model, ω_h is not meaningful because three factors are required for identification in the calculation of ω_h. However, omega total (ω_t) is interpretable for a unidimensional solution.

Item response theory parameter estimation.

Additionally, item response theory (IRT), applying the two-parameter logistic model (Lord & Novick, 1968) was used to evaluate the measure. IRT has been used to model gerontological data sets. See Teresi, Cross, & Golden, 1989; Teresi, Kleinman, & Ocepek-Welikson, 2000 for didactic explications of the models in that context. The estimates for the discrimination and severity parameters (a and b, respectively) were evaluated, the item and test information functions were graphed, and the reliability estimates were calculated for points along the dimension of the underlying construct, denoted as θ (theta).

Tests of IRT model fit and local dependencies.

Various goodness of fit statistics are available to test IRT model fit (Cai, Maydeu-Olivares, Coffman, & Thissen, 2006). Residual covariances can be examined in the context of violation of the assumption of local independence. The local dependency (LD) statistics (Chen & Thissen, 1997) compare the observed and expected frequencies associated with cross-tabulations of item pairs. Standardized (z) scores are used for comparison across items with different response categories: (Thissen, 2011). If the observed covariation between responses to a pair of items is greater than that predicted by the model, the values are flagged indicating that a cluster of items may measure an unmodeled dimension. Values that are larger than 10 are considered problematic (Thissen, 2011).

Software.

The ECV and McDonald’s omega were estimated using both exploratory and confirmatory factor analyses. R software (R Development Core Team, 2008) was used to calculate McDonald’s omega statistics, as well as several others recommended by Revelle and Zinbarg (2009), contained in the “psych” package that they developed in R (www.R-project.org). Polychoric and polyserial correlations based on the underlying continuous normal variables (Zi) were estimated using a SEM package, MPLUS (Muthén & Muthén, 1998–2010). The S–L solution was obtained in the R “psych” package and loadings were estimated using MPLUS software. Item Response Theory Patient Reported Outcomes (IRTPRO) software (Cai, Thissen, & du Toit, 2011; Thissen, 2011) was used for IRT parameter estimation and tests of model fit. IRTPRO incorporates various tests of LD, for example, SS-χ² (Orlando & Thissen, 2003). The IBM Statistical Package for the Social Sciences (IBM SPSS, 2010) reliability module calculates the corrected item-total correlations and overall classical test theory estimate of reliability of the item set.

Results

The analytic sample of residents whose behavior was evaluated by the staff at baseline was 1,812. The majority (72.1%) were women; the mean age was 84.1 (SD = 10.0) years; the mean educational level was 11.7 (SD =4.1) years; and 19.1% were Black, 16.6% were non-White Hispanic, 61.9% were non-Latino White, and 2.7% other.

Preliminary Analyses

Only the resident-to-resident mistreatment items were included in the psychometric analysis (Table 1). The frequency of specific incidents was low (Table 3). Therefore, the decision was made to combine any occurrence of a specific type of R-REM within the past 2 weeks or past year. Among the items in the final analysis, the highest frequency was 0.09 for the item “scream” and the next highest was 0.08 for the item “use bad words.” Five items with prevalence rates of 0.01 were excluded: “insulting race or ethnic group,” “saying sexual things,” “doing sexual things in front of another resident,” “touching in a sexual manner,” and “getting help when didn’t ask or want help.” Prior to combining items into testlets, there were some instances of local dependencies, for example, one pair of items showed very high LD statistics: “going to another resident’s room without asking” and “touch, damage, or break another resident’s personal things” (61.0). Thus, four items were combined in two testlets based on the LD results: “kicking,” “biting,” “scratching,” and “spitting” were combined and “going into room without asking,” “taking/touching things,” and “damaging things” were combined. The final analytic item set included 11 items (Table 1). After the items were combined, all item pair LD statistics were within the normal range.

EFA Analyses

The simple (4 to 1) eigenvalue rule provided some evidence for essential unidimensionality (eigenvalues of 6.943 to 1.264 or 5.5 to 1). The first component (eigenvalue) explained 63% of the variance (Supplementary Table 1). The scree plot showed a clear dominance of the first component (factor) relative to the other (Supplementary Figure 1). The fit indices for the unidimensional and two-factor models performed by MPLUS were CFI: one factor = 0.979 and two factor = 0.995. The RMSEAs were as follows: one factor = 0.030 and two factor = 0.017 (Supplementary Table 1). Because the eigenvalue ratio was moderate, further tests were performed using the bifactor model.

Bifactor Model

Prior to analyses, the inter-item tetrachoric correlation matrix was reviewed; correlations range from 0.314 to 0.878 (Supplementary Table 2). The graphic depiction of the bifactor model tested is shown in Supplementary Figure 2. The RMSEA statistic produced by “psych” R was 0.04 for the three-factor solution, 0.05 for the two-factor S–L solution, and 0.01 for the MPLUS bifactor model, applied using the general and two-group factor results from the S–L solution (Table 2). The MPLUS bifactor models with three-group factors did not converge. Table 1 shows the factor loadings from the S–L solution and from the MPLUS solutions for both the unidimensional and bifactor models. The relative size of the general and group factor loadings (MPLUS two-group factors) indicates a very strong general factor. Loadings for only one item, “other physical behavior like kicking, biting, scratching, or spitting” was higher on a group factor (the second) than the general factor (0.74 vs. 0.63). Two bifactor models were run in MPLUS because two items, “throw things at another resident” and the testlet item “going to another resident’s rooms without asking or touching, damaging, or breaking another residents’ personal things” had negative loadings on the first group factor (–0.20, –0.23), however, not on the general factor, and the item “threaten another resident with a cane, fist, or other object” evidenced a low loading on the group factor (0.02). These three items were specified not to load on any group factors. For the remaining items, the loadings on the general factor were greater than on any of the loadings on the group factors, suggesting essential unidimensionality. Although the fit of the bifactor model was slightly better than the unidimensional model (bifactor: CFI = 0.997, RMSEA = 0.014; unidimensional: CFI = 0.979, RMSEA = 0.030), the difference was trivial. ECV statistics were 0.76 for the bifactor model compared with 0.63 for the unidimensional MPLUS models. However, it is noted that the group factor results did provide some modest support for the use of two subscales: verbal and physical, with the remaining items forming a potential third subset. The highest loadings on the first group factor were for verbal items: “use bad words toward another resident”; “scream at another resident”; “try to scare, frighten, or threaten with words”; and “boss around, tell another resident what to do.” The second group factor comprises physical items: “hit another resident”; “grab or yank another resident”; “push or shove another resident; and “other physical behavior such as kicking, biting, scratching, or spitting at another resident.” Finally, the remaining items loaded on a third S–L group factor: “threaten another resident with cane, fist, or other object”; “throw things at another resident”; and the testlet item “going to another resident’s room without asking or taking, touching, damaging, or breaking another resident’s personal things.” However, as stated earlier, the MPLUS three-group factor solution did not converge; thus, there is robust support for at most two-group factors.

Table 2.

Reliability, Dimensionality, and Model Fit Statistics Obtained From R as the Three-Factor Schmid–Leiman (S-L) Solution, MPLUS One-Factor Solution, and Bifactor Solutions

Statistics	S–L three-group factors	S–L two-group factors	MPLUS (one factor)	MPLUS bifactor (general and two-group factors)
Reliability (alpha coefficient for general factor)	0.94	0.94	N/A	N/A
Omega hierarchical (ω_h)	0.76	N/A	N/A	N/A
Omega total (ω_t)	0.96	0.97	N/A	N/A
ECV	0.64	0.64	0.63	0.76
RMSEA	0.04	0.05	0.03	0.01

Statistics	S–L three-group factors	S–L two-group factors	MPLUS (one factor)	MPLUS bifactor (general and two-group factors)
Reliability (alpha coefficient for general factor)	0.94	0.94	N/A	N/A
Omega hierarchical (ω_h)	0.76	N/A	N/A	N/A
Omega total (ω_t)	0.96	0.97	N/A	N/A
ECV	0.64	0.64	0.63	0.76
RMSEA	0.04	0.05	0.03	0.01

Notes: ECV = explained common variance; RMSEA = root mean square error of approximation; N/A = not applicable.

Table 2.

Reliability, Dimensionality, and Model Fit Statistics Obtained From R as the Three-Factor Schmid–Leiman (S-L) Solution, MPLUS One-Factor Solution, and Bifactor Solutions

Statistics	S–L three-group factors	S–L two-group factors	MPLUS (one factor)	MPLUS bifactor (general and two-group factors)
Reliability (alpha coefficient for general factor)	0.94	0.94	N/A	N/A
Omega hierarchical (ω_h)	0.76	N/A	N/A	N/A
Omega total (ω_t)	0.96	0.97	N/A	N/A
ECV	0.64	0.64	0.63	0.76
RMSEA	0.04	0.05	0.03	0.01

Statistics	S–L three-group factors	S–L two-group factors	MPLUS (one factor)	MPLUS bifactor (general and two-group factors)
Reliability (alpha coefficient for general factor)	0.94	0.94	N/A	N/A
Omega hierarchical (ω_h)	0.76	N/A	N/A	N/A
Omega total (ω_t)	0.96	0.97	N/A	N/A
ECV	0.64	0.64	0.63	0.76
RMSEA	0.04	0.05	0.03	0.01

Notes: ECV = explained common variance; RMSEA = root mean square error of approximation; N/A = not applicable.

The IRT discrimination (a) parameters estimated with IRTPRO, ranged from the lowest 1.18 (going to another resident’s room without asking or take, touch, damage, or break another resident’s personal things) to the highest 4.95 (scream; Table 3). Additional items with high discrimination parameters greater than 3.0 were as follows: “use bad words” (4.16); “threaten another resident with cane, fist, or other objects” (3.78); “try to scare, frighten, or threaten with words” (3.63); and “throw things at another resident” (3.52). The severity (difficulty) parameters (b) ranged from 1.43 (scream), the least severe item, to 3.30 (going to another resident’s room without asking/take, touch, damage, or break another resident’s personal things; Table 3). As expected, the verbal items were less severe indicators of R-REM.

Table 3.

Item Response Theory (IRT) Item Parameters and Standard Error Estimates (IRTPRO)

Item description	Base rate (% positive)	a	SE of a	b₁	SE of b₁
Use bad words toward another resident	7.7	4.16	0.50	1.57	0.06
Scream at another resident	9.3	4.95	0.68	1.43	0.05
Try to scare, frighten, or threaten (another resident) with words	1.9	3.63	0.58	2.29	0.12
Boss around/tell another resident what to do	4.5	2.65	0.30	2.06	0.11
Hit another resident	2.0	2.60	0.38	2.48	0.16
Grab or yank another resident	1.3	2.24	0.39	2.88	0.26
Push or shove another resident	1.5	2.32	0.38	2.76	0.22
Throw things at another resident	1.0	3.52	0.69	2.58	0.17
Threaten another resident with a cane, fist, or other object	0.8	3.78	0.84	2.63	0.19
Other physical behavior like kicking, biting, scratching, or spitting at another resident	1.0	2.34	0.44	2.92	0.27
Going into another resident’s room without asking or taking/touching/damaging or breaking other residents “personal” things	3.6	1.18	0.19	3.30	0.39

Item description	Base rate (% positive)	a	SE of a	b₁	SE of b₁
Use bad words toward another resident	7.7	4.16	0.50	1.57	0.06
Scream at another resident	9.3	4.95	0.68	1.43	0.05
Try to scare, frighten, or threaten (another resident) with words	1.9	3.63	0.58	2.29	0.12
Boss around/tell another resident what to do	4.5	2.65	0.30	2.06	0.11
Hit another resident	2.0	2.60	0.38	2.48	0.16
Grab or yank another resident	1.3	2.24	0.39	2.88	0.26
Push or shove another resident	1.5	2.32	0.38	2.76	0.22
Throw things at another resident	1.0	3.52	0.69	2.58	0.17
Threaten another resident with a cane, fist, or other object	0.8	3.78	0.84	2.63	0.19
Other physical behavior like kicking, biting, scratching, or spitting at another resident	1.0	2.34	0.44	2.92	0.27
Going into another resident’s room without asking or taking/touching/damaging or breaking other residents “personal” things	3.6	1.18	0.19	3.30	0.39

Table 3.

Item Response Theory (IRT) Item Parameters and Standard Error Estimates (IRTPRO)

Item description	Base rate (% positive)	a	SE of a	b₁	SE of b₁
Use bad words toward another resident	7.7	4.16	0.50	1.57	0.06
Scream at another resident	9.3	4.95	0.68	1.43	0.05
Try to scare, frighten, or threaten (another resident) with words	1.9	3.63	0.58	2.29	0.12
Boss around/tell another resident what to do	4.5	2.65	0.30	2.06	0.11
Hit another resident	2.0	2.60	0.38	2.48	0.16
Grab or yank another resident	1.3	2.24	0.39	2.88	0.26
Push or shove another resident	1.5	2.32	0.38	2.76	0.22
Throw things at another resident	1.0	3.52	0.69	2.58	0.17
Threaten another resident with a cane, fist, or other object	0.8	3.78	0.84	2.63	0.19
Other physical behavior like kicking, biting, scratching, or spitting at another resident	1.0	2.34	0.44	2.92	0.27
Going into another resident’s room without asking or taking/touching/damaging or breaking other residents “personal” things	3.6	1.18	0.19	3.30	0.39

Item description	Base rate (% positive)	a	SE of a	b₁	SE of b₁
Use bad words toward another resident	7.7	4.16	0.50	1.57	0.06
Scream at another resident	9.3	4.95	0.68	1.43	0.05
Try to scare, frighten, or threaten (another resident) with words	1.9	3.63	0.58	2.29	0.12
Boss around/tell another resident what to do	4.5	2.65	0.30	2.06	0.11
Hit another resident	2.0	2.60	0.38	2.48	0.16
Grab or yank another resident	1.3	2.24	0.39	2.88	0.26
Push or shove another resident	1.5	2.32	0.38	2.76	0.22
Throw things at another resident	1.0	3.52	0.69	2.58	0.17
Threaten another resident with a cane, fist, or other object	0.8	3.78	0.84	2.63	0.19
Other physical behavior like kicking, biting, scratching, or spitting at another resident	1.0	2.34	0.44	2.92	0.27
Going into another resident’s room without asking or taking/touching/damaging or breaking other residents “personal” things	3.6	1.18	0.19	3.30	0.39

Open in new tab Download slide

The information functions (estimated using IRTPRO) for all the items are located at the upper half of the theta continuum; there was little information contributed at the lower half of the underlying construct distribution (Figure 1). The highest information function (5.2) was for the item “scream at another resident,” which peaked at the theta level of 1.6 followed by the item “use bad words” (4.3) at theta 1.6. The items “threaten another resident with cane, fist, or other objects”; “try to scare, frighten, or threaten with words”; and “throw things at another resident” were more informative at even higher theta levels (the highest information points: 3.2 at theta level 2.4, and 3.2, 2.7 all at theta level 2.8). The testlet item “going to another resident’s room without asking or take, touch, damage, or break another resident’s personal things” provides minimum information (highest information point: 0.3 at theta level 2.8; Figure 1 and Supplementary Figure 3).

Figure 1.

Resident-to-resident elder mistreatment (R-REM) item information functions (IRTPRO).

The essential dimensionality estimates were high; the ECV from the “psych” R program was 0.64 and 0.76 from the MPLUS bifactor model. Different reliability estimates were obtained from several methods. The alpha estimate from the “psych” R package was 0.94, omega hierarchical (ω_h; three-group solution) was 0.76, and omega total (ω_t; two-group solution) was 0.97 (Table 2); thus, adding support for a two- rather than three-group solution. The Cronbach’s alpha reliability estimate was 0.74 and standardized alpha was 0.75. Corrected item-total correlations ranged from 0.30 (grab or yank another resident) to 0.61 (scream; Supplementary Table 3). Based on the IRTPRO results, the estimates of reliability were calculated along the theta continuum and ranged from 0.50 (lower end of theta) to 0.95 at theta 2.4 (Supplementary Table 4). Reliabilities were adequate (≥ 0.90) at θ ≥ 1.2.

Verbal and physical subscales were examined using classical test theory reliability estimates. The combined verbal items (use bad words toward another resident, scream at another resident, try to scare another resident with words, and boss around, tell people what to do) resulted in a Cronbach’s alpha estimate of 0.73 and 0.74 for unstandardized and standardized versions, respectively, with corrected item-total correlations ranging from 0.48 to 0.66. The six physical items (hit another resident; grab another resident; push another resident; throw things at another resident; threaten another resident with cane, fist, or other object; and other physical behavior like kicking, biting, scratching, or spitting at another resident) resulted in a Cronbach’s alpha estimate of 0.65 (for both unstandardized and standardized alphas). The corrected item-total correlations ranged from 0.31 to 0.48. It is noted that the factor analyses placed the item, “threaten another resident with cane, fist, or other object” in the verbal as well as the third factor along with the items “throw things at another resident” and “going to another resident’s room without asking”/“take, touch, damage, or break another resident’s personal things” in the three-group S–L solution, even though the content was verbal threatening of physical actions. Placing the threatening item in the verbal scale reduces slightly the internal consistency estimate of the scale (from 0.73 to 0.71) but increases the alpha estimate if the item is added to the physical subscale (from 0.63 to 0.65). The internal consistency estimate did not change whether the item, “throw things at another resident” was included in the physical subscale (0.63). Thus, one could plausibly use a four-, five-, or six-item subscale for each, if desired. However, the final confirmatory bifactor results shown in Table 1 suggested the possibility of four item measures of physical and verbal mistreatment. It is emphasized that the bottom-line result is that the factor analyses supported most strongly an essentially unidimensional measure.

Table 4.

Distribution of the Resident-to-Resident Elder Mistreatment (R-REM) Summary Score Mapped to the R-REM Estimate (Theta)

Summed score	N	%	Cumulative %	EAP^a [θ/x]	SD [θ/x]	Modeled %	Modeled Cumulative %	Reliability^b
0	1,526	84.3	84.3	–0.255	0.834	83.90	83.9	0.52
1	109	6.0	90.3	0.902	0.569	8.35	92.2	0.80
2	87	4.8	95.1	1.466	0.326	3.27	95.5	0.92
3	44	2.4	97.5	1.736	0.283	1.86	97.4	0.93
4	20	1.1	98.6	1.952	0.273	1.06	98.4	0.93
5	12	0.7	99.3	2.153	0.267	0.61	99.0	0.93
6	9	0.5	99.8	2.345	0.262	0.37	99.4	0.94
7	2	0.1	99.9	2.531	0.263	0.24	99.7	0.95
8	1	0.1	99.9	2.719	0.274	0.16	99.8	0.94
9	1	0.1	100	2.923	0.299	0.10	99.9	0.94
10	0	0.0	100	3.159	0.345	0.06	100
11	0	0.0	100	3.426	0.402	0.02	100

Summed score	N	%	Cumulative %	EAP^a [θ/x]	SD [θ/x]	Modeled %	Modeled Cumulative %	Reliability^b
0	1,526	84.3	84.3	–0.255	0.834	83.90	83.9	0.52
1	109	6.0	90.3	0.902	0.569	8.35	92.2	0.80
2	87	4.8	95.1	1.466	0.326	3.27	95.5	0.92
3	44	2.4	97.5	1.736	0.283	1.86	97.4	0.93
4	20	1.1	98.6	1.952	0.273	1.06	98.4	0.93
5	12	0.7	99.3	2.153	0.267	0.61	99.0	0.93
6	9	0.5	99.8	2.345	0.262	0.37	99.4	0.94
7	2	0.1	99.9	2.531	0.263	0.24	99.7	0.95
8	1	0.1	99.9	2.719	0.274	0.16	99.8	0.94
9	1	0.1	100	2.923	0.299	0.10	99.9	0.94
10	0	0.0	100	3.159	0.345	0.06	100
11	0	0.0	100	3.426	0.402	0.02	100

Notes:^aEAP is the expected a posteriori estimate (mean of the posterior distribution of theta [θ], conditional on x [the observed response pattern]).

^bThe average reliability across all measured levels of theta (θ) is 0.67.

Table 4.

Distribution of the Resident-to-Resident Elder Mistreatment (R-REM) Summary Score Mapped to the R-REM Estimate (Theta)

Summed score	N	%	Cumulative %	EAP^a [θ/x]	SD [θ/x]	Modeled %	Modeled Cumulative %	Reliability^b
0	1,526	84.3	84.3	–0.255	0.834	83.90	83.9	0.52
1	109	6.0	90.3	0.902	0.569	8.35	92.2	0.80
2	87	4.8	95.1	1.466	0.326	3.27	95.5	0.92
3	44	2.4	97.5	1.736	0.283	1.86	97.4	0.93
4	20	1.1	98.6	1.952	0.273	1.06	98.4	0.93
5	12	0.7	99.3	2.153	0.267	0.61	99.0	0.93
6	9	0.5	99.8	2.345	0.262	0.37	99.4	0.94
7	2	0.1	99.9	2.531	0.263	0.24	99.7	0.95
8	1	0.1	99.9	2.719	0.274	0.16	99.8	0.94
9	1	0.1	100	2.923	0.299	0.10	99.9	0.94
10	0	0.0	100	3.159	0.345	0.06	100
11	0	0.0	100	3.426	0.402	0.02	100

Summed score	N	%	Cumulative %	EAP^a [θ/x]	SD [θ/x]	Modeled %	Modeled Cumulative %	Reliability^b
0	1,526	84.3	84.3	–0.255	0.834	83.90	83.9	0.52
1	109	6.0	90.3	0.902	0.569	8.35	92.2	0.80
2	87	4.8	95.1	1.466	0.326	3.27	95.5	0.92
3	44	2.4	97.5	1.736	0.283	1.86	97.4	0.93
4	20	1.1	98.6	1.952	0.273	1.06	98.4	0.93
5	12	0.7	99.3	2.153	0.267	0.61	99.0	0.93
6	9	0.5	99.8	2.345	0.262	0.37	99.4	0.94
7	2	0.1	99.9	2.531	0.263	0.24	99.7	0.95
8	1	0.1	99.9	2.719	0.274	0.16	99.8	0.94
9	1	0.1	100	2.923	0.299	0.10	99.9	0.94
10	0	0.0	100	3.159	0.345	0.06	100
11	0	0.0	100	3.426	0.402	0.02	100

Notes:^aEAP is the expected a posteriori estimate (mean of the posterior distribution of theta [θ], conditional on x [the observed response pattern]).

^bThe average reliability across all measured levels of theta (θ) is 0.67.

Examining the distributional characteristics of the measure, the mean sum score was 0.35 (SD = 0.99) with the median of 0.0; the mean theta was 0.002 (0.62) with the median of −0.26. Skewness statistics were 3.84 (0.06) and 2.27 (0.06) and kurtosis was 17.95 (0.12) and 3.88 (0.12). The Kolmogorow–Smirnov test of normality provided evidence of highly skewed distributions (20.36, p < .001 and 21.38, p < .001; Supplementary Table 5). The majority of the cases had the sum score of zero (84.3%), which is equivalent to the theta value −0.26. Examining the expected item scores mapped onto theta, the highest score for the sample was 9, which is equivalent to a theta value of 2.92. The highest theoretical value for the sum score is 11 corresponding to an estimated theta value of 3.4; however, no cases had the sum score 10 or 11. There is almost a complete overlap of the cumulative proportion for the sum score and the modeled proportion (Table 4).

Discussion

The study supports the unidimensional assessment of R-REM using the 11-item scale; however, modest support was provided for the use of separate subscales for verbal and physical constructs. The S–L bifactor model identified three factors: verbal, physical, and a less differentiated factor including items on room invasion and throwing and threatening gestures; however, this result was not supported by the final estimation procedures.

Limitations

A limitation of these analyses is the low prevalence of many items, resulting in omission of items that may be potentially salient and stressful, for example, sexual encounters. Thus, insertion of a combined item is recommended: “saying sexual things and inappropriate touching of another resident.” The proposed 12-item measure is appended. Additionally, items were combined into presence in the past 2 weeks or year. Although this method may be best for analyses of data and for constructing global summary scales, for individual assessments, it may be desired to collect data separately on recent and past R-REM events to obtain a frequency of occurrences measure to inform appropriate care planning and individual interventions. Thus, the items for up to five incidents occurring within the past 2 weeks and two additional incidents occurring within the past 1 year are included in the measure shown in Supplementary Material. Finally, the distributions of R-REM were skewed and higher reliability estimates were observed at the more severe tail of the R-REM distribution. Reliability estimates are lower at the less severe tail of the distribution where noncases are prevalent. In part, this is due to the low base rate of R-REM. It is not surprising that greater accuracy is observed at the “caseness” tail of the distribution, where reporting data are available in contrast to the “noncase” tail, where the potential for error is most likely due to failure to observe or report R-REM. Nonetheless, it is important to have high reliability among cases because these are the ones singled out for adjudication and appropriate actions. Moreover, the phenomenon of higher reliability in the middle and upper section of the theta distribution is not uncommon in the field of patient-reported outcome data. For example, this phenomenon is commonly observed in data from the National Institutes of Health Patient Reported Outcome Measurement Information System (PROMIS; Reeve et al., 2007; Teresi et al., 2009).

Conclusion

Most studies of aggressive behavior have used staff informant reports or observations. Creative methodologies have typically been developed and employed in studies of behavioral disturbance in nursing home residents. Some of the observational methods that have been used for this purpose include direct videography of groups of residents for brief proscribed periods (Kihlgren et al., 1993), bar-coding strategies in which researchers tally behaviors of interest with a light pen system (Bridges-Parlet, Knopman, & Thompson, 1994; Holmes & Teresi, 1996), and computer-assisted observations in which researchers use a laptop computer to keystroke appropriate tabs when residents are observed in specific activities (e.g., during morning care; Rogers, Holm, & Burgio, 1999). It was determined that none of these methods were practical or feasible for four reasons. First, episodes of R-REM are likely to be too brief and too intermittent to make any of these methods useful. A large research staff would have to be positioned virtually everywhere in the facility for extensive periods to make meaningful observations; this would be costly and potentially invasive for staff and residents, and most importantly, not sustainable beyond the life of a research project. Second, episodes of R-REM are unpredictable. In arriving at this methodology, the investigators also considered oversampling suspected high-risk times and environments (e.g., congregate meals), but no data exist on the epidemiology of when R-REM occurs on which to base such a sampling strategy. Third, violent episodes that are volitional are probably less likely to occur with research observers monitoring residents directly; their presence could actually influence the prevalence of R-REM. Finally, the use of videography was ruled out as too invasive for sustained use, particularly in resident’s rooms. Although the experience with bar-code technology has been positive, it is not likely that this method can be implemented on a wide scale basis without extensive support. However, the increased use of electronic medical records could make this approach viable in the future.

Given the previous considerations, this project adopted a strategy of R-REM case identification that makes use of the best potential reporters—residents themselves and the staff who care for them. Analogously, the instrument proposed to measure the phenomenon derives its methodology from two fields (and representative scales reflective of them) that have proven strategies appropriate for this study—the CTS (derived from the interpersonal aggression literature) and the CMAI (derived from the nursing home behavioral disturbance literature).

A challenge in long-term care settings is recognition and documentation of resident-to-resident mistreatment. Because of the ubiquity of aggressive behaviors in general as well as toward residents (Castle, 2012) and staff (Lachs et al., 2012; Morgan et al., 2012), R-REM may often be ignored. This may be due in part to desensitization because such incidents may be perceived as daily routine events. Although staff training has resulted in increased recognition and reporting (Teresi et al., 2012), documentation is still a challenge. Documentation of aggressive behaviors toward residents is not only important in and of itself for record keeping, but it can also be crucial as a preventive measure at an individual level for risk management and care planning. Additionally, such documentation is important at the institutional level to avoid financial and or licensing penalties (Soreff, 2012). Thus, methods for increasing recognition, reporting, and documentation benefit residents, direct care, and administrative staff.

Castle (2012) has called for measurement approaches at the individual resident level that can provide representative estimates of incidence and prevalence of R-REM. Additionally, such a measure could be incorporated into routine data collection used for care planning. Collectively, it is hoped that the extensive qualitative and quantitative work that resulted in the development of this measure will help to advance the measurement and ultimately interventions associated with this important and underrecognized problem facing residents of long-term care settings.

Funding

This work was supported in part by National Institute on Aging (AG014299-06A2), National Institute of Justice (FYO 42USC3721), New York State Department of Health Dementia Grant Program (contract # C-022657). This work does not necessarily reflect the opinions of the funding organizations.

References

Acierno

R

.

(2003)

.

Elder mistreatment: Epidemiological assessment methodology

. In

Bonnie

R. J.

Wallace

R. B

. (Eds.),

Elder mistreatment: Abuse, neglect, and exploitation in an aging America

(pp.

261

–

302

).

Washington, DC

:

National Academies Press

.

Asparouhov

T.

Muthén

B

.

(2009)

.

Exploratory structural equation modeling

.

Structural Equation Modeling

,

16

,

397

–

438

.

Bridges-Parlet

S.

Knopman

D.

Thompson

T

.

(1994)

.

A descriptive study of physically aggressive behaviors by direct observation

.

Journal of the American Geriatrics Society

,

42

,

192

–

197

.

Burgio

L. D.

Stevens

A.

Burgio

K. L.

Roth

D. L.

Paul

P.

Gerstle

J

.

(2002)

.

Teaching and maintenance behavior management skills in the nursing home

.

The Gerontologist

,

42

,

487

–

496

.

Cai

L.

Maydeu-Olivares

A.

Coffman

D. L.

Thissen

D

.

(2006)

.

Limited-information goodness-of-fit testing of item response theory models for sparse 2 tables

.

British Journal of Mathematical and Statistical Psychology

,

59

(

Pt 1

)

173

–

194

.

Cai

L.

Thissen

D.

du Toit

S. H. C

.

(2011)

.

IRTPRO: Flexible, multidimensional, multiple categorical IRT modeling [Computer software]

.

Chicago, IL

:

Scientific Software International

.

Castle

N. G

.

(2012)

.

Resident-to-resident abuse in nursing homes as reported by nurse aides

.

Journal of Elder Abuse & Neglect

,

24

,

340

–

356

.

Chen

W. H.

Thissen

D

.

(1997)

.

Local dependence indices for item pairs using item response theory

.

Journal of Educational and Behavioral Statistics

,

22

,

265

–

289

.

Cohen-Mansfield

J.

Marx

M. S.

Rosenthal

A. S

.

(1989)

.

A description of agitation in a nursing home

.

Journal of Gerontology: Social Sciences

,

44

,

577

–

584

.

Fang

J.

Power

M.

Lin

Y.

Zhang

J.

Hao

Y.

Chatterji

S

.

(2012)

.

Development of short versions for the WHOQOL-OLD module

.

The Gerontologist

,

52

(

1

),

66

–

78

.

Gibbons

R. D.

Bock

R. D.

Hedeker

D.

Weiss

D. J.

Segawa

E.

Bhaumik

D. K.

, …

Stover

A

.

(2007)

.

Full-information item bi-factor analysis of graded response data

.

Applied Psychological Measurement

,

31

,

4

–

19

.

Herbert

B.

Bradshaw

Y. S

.

(2004)

.

Comment: Aggressive behaviors and injuries among nursing home residents

.

Journal of the American Medical Association

,

291

,

2074

–

2075

.

PubMed

Holmes

D.

Teresi

J

.

(1996)

.

Using technology in behavioral approaches to Alzheimer’s disease

.

International Psychogeriatrics/IPA

,

8

(Suppl

1

),

67

–

71

.

IBM SPSS

. (

2010

).

Statistics for Windows, Version 19.0

.

Armonk, NY

:

IBM

.

Jennrich

R. I.

Bentler

P. M

.

(2011)

.

Exploratory bi-factor analysis

.

Psychometrika

,

76

(

4

),

537

–

549

.

Kihlgren

H.

Kuremyr

D.

Norberg

A.

Brane

G.

Karlson

I.

Engstrom

B.

Melin

E

.

(1993)

.

Nurse-patient interaction after training in integrity promoting care at a long-term ward: Analysis of video-recorded morning care sessions

.

International Journal of Nursing Research

,

30

,

1

–

13

.

Lachs

M.

Bachman

R.

Williams

C. S.

O’Leary

J. R

.

(2007)

.

Resident-to-resident elder mistreatment and police contact in nursing homes: Findings from a population-based cohort

.

Journal of the American Geriatrics Society

,

55

(

6

),

840

–

845

.

Lachs

M. S.

Pillemer

K. A

.

(2004)

.

Elder abuse

.

Lancet

,

304

,

1236

–

1272

.

Lachs

M. S.

Rosen

T.

Teresi

J. A.

Eimicke

J. P.

Ramirez

M.

Silver

S.

Pillemer

K

.

(2012)

.

Verbal and physical aggression directed at nursing home staff by residents

.

Journal of General Internal Medicine

.

Logsdon

R. G.

Teri

L

.

(1997)

.

The Pleasant Events Schedule-AD: Psychometric properties and relationship to depression and cognition in Alzheimer’s disease patients

.

The Gerontologist

,

37

(

1

),

40

–

45

.

Lord

F. M.

Novick

M. R

.

(1968)

.

Statistical theories of mental test scores

.

Reading, MA

:

Addison-Wesley

.

McDonald

R. P

.

(1970)

.

Theoretical foundations of principal factor analysis and alpha factor analysis

.

British Journal of Mathematical and Statistical Psychology

,

23

,

1

–

21

.

McDonald

R. P

.

(1999)

.

Test theory: A unified treatment

.

Mahwah, NJ

:

L. Erlbaum Associates

.

Morgan

D. G.

Cammer

A.

Stewart

N. J.

Crossley

M.

D’Arcy

C.

Forbes

D. A.

Karunanayake

C

.

(2012)

.

Nursing aide reports of combative behavior by residents with dementia: Results from a detailed prospective incident diary

.

Journal of the American Medical Directors Association

,

13

,

220

–

227

.

Muthén

L. K.

Muthén

B. O

. (

1998

–

2010

).

MPLUS users guide

, 6th ed.

Los Angeles, CA

:

Muthén and Muthén

.

Orlando

M.

Thissen

D

.

(2003)

.

Further investigation of the performance of S-X2: An item fit index for use with dichotomous item response theory models

.

Applied Psychological Measurement

,

27

,

289

–

298

.

Pillemer

K.

Chen

E. K.

Van Haitsma

K. S.

Teresi

J.

Ramirez

M.

Silver

S.

, …

Lachs

M. S

.

(2011)

.

Resident-to-resident aggression in nursing homes: Results from a qualitative event reconstruction study

.

The Gerontologist

,

52

(

1

),

24

–

33

. doi:

10.1093/gnr107

Pillemer

K.

Finkelhor

D

.

(1988)

.

The prevalence of elder abuse: A random sample survey

.

The Gerontologist

,

28

(

1

),

51

–

57

.

Pillemer

K.

Moore

D. W

.

(1989)

.

Abuse of patients in nursing homes: Findings from a survey of staff

.

The Gerontologist

,

29

(

3

),

314

–

320

.

R Development Core Team

.

(2008)

.

R: A language and environment for statistical computing

. R Foundation for Statistical Computing.

Vienna, Austria

(ISBN:3-900051-07-0).

Ramirez

M.

Watkins

B.

Teresi

J. A.

Silver

S.

Sukha

G.

Boratgis

G.

, …

Pillemer

K.

(in press).

Using qualitative methods to develop a measure of resident-to-resident elder mistreatment in nursing homes

.

International Psychogeriatrics

.

Reeve

B. B.

Hays

R. D.

Bjorner

J. B.

Cook

K. F.

Crane

P. K.

Teresi

J. A.

, …

Cella

D

.

(2007)

.

Psychometric evaluation and calibration of health-related quality of life item banks: Plans for the Patient-Reported Outcome Measurement Information System (PROMIS)

.

Medical Care

,

45

, (

5

Suppl

1

),

S22

–

S31

.

Reise

S. P.

Moore

T. M.

Haviland

M. G

.

(2010)

.

Bifactor models and rotations: Exploring the extent to which multidimensional data yield univocal scale scores

.

Journal of Personality Assessment

,

92

(

6

),

544

–

559

.

Revelle

W.

Zinbarg

R. E

.

(2009)

.

Coefficient alpha, beta, omega, and the GLB: Comments on Sijtsma

.

Psychometrika

,

74

,

145

–

154

.

Rizopoulus

D

.

(2006)

.

ltm: An R package for latent variable modeling and item response theory analyses

.

Journal of Statistical Software

,

17

,

1

–

25

.

Rizopoulus

D

.

(2009)

.

ltm: Latent Trait Models under IRT

. Retrieved from http://cran.rproject.org/web/packages/ltm/index.html

Rogers

J. C.

Holm

M. B.

Burgio

L. D

.

(1999)

.

Improving morning care routines of nursing home residents with dementia

.

Journal of the American Geriatrics Society

,

47

,

1049

–

1057

.

Rose

M. S.

Pruchno

R. A

.

(1999)

.

Behavior sequences of long-term care residents and their social partners

.

Journal of Gerontology: Social Sciences

,

54B

,

S75

–

S83

, doi:

10:1093/geronb/54b;2.S75

Rosen

T.

Lachs

M. S.

Bharucha

A. J.

Stevens

S. M.

Teresi

J. A.

Nebres

F.

Pillemer

K

.

(2008)

.

Resident-to-resident aggression in long-term care facilities: Insights from focus groups of nursing home residents and staff

.

Journal of the American Geriatrics Society

,

56

,

1398

–

1408

. doi:

10.1111/j.1532-5415.2008.01808.x

Schmid

L.

Leiman

J

.

(1957)

.

The development of hierarchical factor solutions

.

Psychometrika

,

22

,

53

–

61

.

Shinoda-Tagawa

T.

Leonard

R.

Pontikas

J.

McDonough

J. E.

Allen

D.

Dreyer

P. I

.

(2004)

.

Resident-to-resident violent incidents in nursing homes

.

Journal of the American Medical Association

,

4

,

591

–

598

.

. Retrieved from http://www.intechopen.com/books/howtoreference/essential-notes-in-psychiatry/violence-in-the-long-term-care-facilities-resident-to-resident-aggression-understandings-managemen

Sijtsma

K

.

(2009)

.

On the Use, the Misuse, and the Very Limited Usefulness of

Cronbach’s

Alpha

.

Psychometrika

,

74

(

1

),

107

–

120

.

Soreff

S

.

(2012)

.

Violence in the nursing homes: Understandings, management, documentation and impact of resident to resident aggression

In

Olisah

V

. (Ed.),

Essential notes in psychiatry

Stiles

M.

Koren

C.

Walsh

K

.

(2002)

.

Identifying elder abuse in a primary care setting

.

Clinical Geriatrics

,

10

,

33

–

41

.

Straus

M. A.

Hamby

S. L.

Boney-McCoy

S.

Sugarman

D. B

.

(1996)

.

The revised conflict tactics scales (CTS2: Development and preliminary psychometric data)

.

Journal of Family Issues

,

17

,

283

–

316

.

Talerico

K. A.

Evans

L. K

.

(2000)

.

Making sense of aggressive/protective behaviors in persons with dementia

.

Alzheimer’s Care Quarterly

,

1

,

77

–

88

.

Teresi

J.

Cross

P.

Golden

R

.

(1989)

.

Some applications of latent trait analysis to the measurement of ADL

.

Journal of Gerontology: Social Sciences

,

44

,

S196

–

S204

.

Teresi

J. A.

Holmes

D.

Dichter

E.

Koren

M. J.

Ramirez

M.

Fairchild

S

.

(1997)

.

Prevalence of behavior disorder and disturbance to family and staff in a sample of adult day health care clients

.

The Gerontologist

,

37

(

5

),

629

–

639

.

Teresi

J. A.

Holmes

D.

Monaco

C

.

(1993)

.

An evaluation of the effects of commingling cognitively and noncognitively impaired individuals in long-term care facilities

.

The Gerontologist

,

33

(

3

),

350

–

358

.

Teresi

J.

Kleinman

M.

Ocepek-Welikson

K

.

(2000)

.

Modern psychometric methods for detection of differential item functioning: Application to cognitive assessment measures

.

Statistics in Medicine

,

19

,

1651

–

1683

.

Teresi

J.

Ocepek-Welikson

K.

Kleinman

M.

Eimicke

J. E.

Crane

P. K.

Jones

R. N.

, …

Cella

D

.

(2009)

.

Analysis of differential item functioning in the depression item bank from the Patient Reported Outcome Measurement Information System (PROMIS): An item response theory approach

.

Psychology Science Quarterly

,

51

,

148

–

180

.

PubMed

Teresi

J. A.

Ramirez

M.

Ellis

J.

Silver

S.

Boratgis

G.

Kong

J.

, …

Lachs

M. S

.

(2012)

.

A staff intervention targeting resident-to-resident elder mistreatment (R-REM) in long-term care increased staff knowledge, recognition and reporting: Results from a cluster randomized trial

.

International Journal of Nursing Studies

.