Abstract

Background: Prospective cohort studies recruit relatively healthy population samples, resulting in lower morbidity and mortality rates than in the source population. This is known as the healthy volunteer effect. The aim of this study was to define the magnitude and the development over time of the healthy volunteer effect in the EPIC-NL cohort. Methods: We studied mortality rates in the EPIC-NL cohort, which comprises 37 551 men and women aged 20–70 years at recruitment in 1993–97. The date and cause of death of deceased participants until 2010 were obtained through linkage with the municipal registry and Statistics Netherlands. Standardized mortality ratios (SMRs) were computed by dividing the observed number of deaths by the number of deaths expected from the general Dutch population. Additionally, standardized incidence ratios were calculated to compare cancer incidence. Results: After an average follow-up of 14.9 years, 3029 deaths were documented. Overall mortality in men [SMR 73.5%, 95% confidence interval (CI): 68.1–79.3] and women (SMR 65.9%, 95% CI: 63.2–68.6) was lower compared with the general population for the whole follow-up period. The SMRs clearly increased over the follow-up period. Among women, the SMR was lower for death due to cardiovascular diseases than death due to cancer. Cancer incidence was also lower in EPIC-NL than in the general population (SMR 78.3 and 82.7% for men and women, respectively). Conclusion: The results show a healthy volunteer effect in the EPIC-NL cohort, which tapers off with longer follow-up. Therefore, in the first years of follow-up, power might not be sufficient to detect small associations.

Introduction

Participants of cohort studies tend to be healthier than the overall population they are sampled from.1,2 This is known as the healthy volunteer effect. Self-selection of healthy and usually higher-educated persons, together with self-exclusion of persons in poor health who meet the study criteria, are most likely causes of the healthy volunteer effect in cohort studies.3 Furthermore, persons might not feel well long before death and may, therefore, be less likely to participate.

Because of the healthy volunteer effect, morbidity and mortality rates in the first years after the study entry might be much lower than the morbidity and mortality rates from the general population.1 This leads to selection bias when comparisons are made with an external population. When associations are made between different exposure groups and the outcome of interest within a cohort, the result is less affected. However, follow-up needs to be long enough to generate sufficient endpoints and power. Moreover, the healthy volunteer effect might result in less variation in the exposure status of cohort members, resulting in small effects that remain undetected.

The extent of the healthy volunteer effect can be evaluated by comparing the mortality and morbidity with that of the general population using standardized mortality ratios (SMRs) and standardized incidence ratios (SIRs). Previous studies that showed a healthy volunteer effect by calculating SMRs were mostly screening trials4–6 and cohorts of vegetarian populations.7,8 Having a better insight into the healthy volunteer effect might be helpful in interpreting results from epidemiological cohort studies. The aim of this study was, therefore, to define the magnitude and the development over time of the healthy volunteer effect by calculating SMRs and SIRs in a cohort study with a large sample size, including a substantial general population sample of men and women, and a long follow-up.

Methods

The EPIC-NL cohort

The EPIC-NL study consists of the two Dutch contributions to the European Prospective Investigation into Cancer and Nutrition (EPIC), Prospect-EPIC and MORGEN-EPIC.9 EPIC is an ongoing multicentre prospective cohort study, designed to investigate the relationship of nutrition and lifestyle with the development of cancer and other chronic diseases. Prospect-EPIC and MORGEN-EPIC were set up simultaneously between 1993 and 1997. For Prospect-EPIC, 50 313 Dutch women aged 49–70 years living in the city of Utrecht and vicinity who were scheduled for the nationwide Dutch breast cancer screening, received a mailed invitation to join the Prospect-EPIC study, along with their invitations for a routine mammography. Of them, 17 357 (34.5%) agreed to participate. The participation rate was somewhat higher for the younger women and for women from less urbanized areas. Those who agreed to participate in the Prospect-EPIC project received questionnaires by mail.10 For MORGEN-EPIC, a random sample of 50 766 men and women aged 20–64 years was drawn from the general Dutch population in three different towns (Doetinchem, Amsterdam and Maastricht), stratified for age and sex. This sample received an invitation to participate in the MORGEN cohort. Those who returned a reply card received questionnaires by mail and were asked to visit the local public health service for a physical examination. From the general sample, 22 654 (45%) individuals participated in the study. The response percentages were higher among women and among older persons.11

At baseline, a general questionnaire was administered containing questions on lifestyle such as smoking status. A physical examination was performed for anthropometry and blood pressure measurements. All 40 011 participants provided informed consents before study inclusion. The study complies with the Declaration of Helsinki and was approved by the institutional review board of the University Medical Center Utrecht (Prospect-EPIC) and the medical ethical committee of TNO Nutrition and Food Research (MORGEN-EPIC). Further details of the study design have previously been published.9

EPIC-NL participants are followed for vital status, cause of death and disease occurrence by regular linkage to several disease registries. Vital status of all EPIC-NL participants is obtained through linkage with the municipal registries. Subsequently, causes of death for the deceased persons are obtained through linkage with Statistics Netherlands. Information on vital status and causes of death is complete until 31 December 2010. Information on cancer incidence (overall, breast, prostate, colorectal and lung cancer) was available through the Dutch cancer registration and was complete until 31 December 2007.

For comparing mortality rates, subjects who did not give permission for linkage with the municipal registry and Statistics Netherlands (n = 2460) were excluded, leaving 37 551 men and women for analysis. For comparing cancer morbidity, we excluded participants who did not give permission for linkage with municipal registries and the Dutch cancer registration (n = 2306), leaving 37 705 participants for analysis.

Risk factors

Baseline characteristics, i.e. age, high educational level (those with vocational education or university), current smoking status, average number of cigarettes a day and overweight [body mass index (BMI) ≥25 kg/m2], are presented as means with standard deviations or percentages and stratified by age and sex. The distribution of educational level and risk factors in EPIC-NL is compared with the distribution in the general reference population, which is obtained from the Permanent Onderzoek LeefSituatie (POLS) questionnaire from Statistics Netherlands.12,13 We used the chi-square test to test whether this difference is significant. The POLS questionnaires are yearly questionnaires filled out by a random sample of ∼10 000 persons of the general Dutch population (response rate 35–40%). The goal of POLS is to gather information on living conditions in the Netherlands.14 The health survey of POLS aims to give an overview of developments in health, medical consumption, lifestyle and preventive behaviour of the Dutch population.15 We used the data from 2001 as a reference because data from 1993 to 1997 were not available.

Standardized Mortality Ratios

To compare mortality in EPIC-NL with mortality in the general Dutch population, we calculated SMRs for overall mortality and for the two most frequent causes of death, i.e. cardiovascular diseases (CVDs; ICD-10 I00-I99) and cancer (ICD-10 C00-C97). The SMR is the ratio between observed and expected deaths, expressed as a percentage. Expected deaths were calculated by multiplying the sex-, age- and calendar year-specific national mortality rates by the number of person-years in EPIC-NL in the same category. Information on national mortality rates of the reference population was retrieved from Statistics Netherlands.16 EPIC-NL person-years at risk is the time between recruitment and the censor date (date of death, loss to follow-up or end of follow-up). To compute person-years for each category, numbers of living participants were calculated by counting the number of living participants per age and sex on July first of every calendar year, assuming the enrolment as well as the number of deaths was evenly distributed over the calendar year. For women from Prospect-EPIC who entered the study in 1993, ages were determined on 1 October because the recruitment started in July 1993. Thus, for example, we multiplied the mortality rate for 40-year-old persons in 2000 with the number of persons in the study who were 40-year-old in the year 2000, etc. Observed deaths were also categorized according to sex, age at death and calendar year. Observed and expected deaths were summed into specific age and calendar year groups and subsequently divided by each other to obtain the SMRs. To compare cancer incidence in the EPIC-NL study with cancer incidence in the general population, we additionally calculated SIRs in the same way as we did for SMRs. Information on national cancer incidence rates of the reference population was retrieved from the Dutch cancer registration.17 Reference CVD incidence rates were unfortunately not available.

SMRs and SIRs were calculated for the total follow-up period and additionally for three subsequent calendar years of the study follow-up. Because of the different recruitments years, the number of follow-up years of participants at a given moment in time can vary 5 years. Therefore, we additionally calculated the SMRs and SIRs per 3-year person-time on the study to assess whether the healthy volunteer effect changes over time. The calculations were done for the total population and per 10-year age groups. All analyses were stratified by sex. We additionally stratified the analysis by cohort. CIs were computed assuming a Poisson’s distribution when the number of observed deaths was below 100. Above 100 observed deaths, Byar’s approximation was used.1

Results

During a median follow-up of 14.9 years, 3029 EPIC-NL participants died. The baseline table shows that EPIC-NL consists of ∼75% women who were older than the men in the cohort (table 1). Compared with the general population, EPIC-NL participants did not seem to smoke less in all age categories. Compared with the general population, EPIC-NL participants were not only more often overweight but also more often highly educated in all age categories.

Table 1

Baseline characteristics of 37 551 EPIC-NL participants and the reference population (general Dutch population in 2001)

CategoriesWomen
Men
EPIC-NLThe NetherlandsPEPIC-NLThe NetherlandsP
1993–9720011993–972001
N28 0499502
Age, mean ± SD51.5 ± 11.443.3 ± 11.1
Current smokers (%) (cigarettes a day, mean)
    15–24a31.7 (13.1)36.1 (9.7)<0.0141.0 (13.0)38.8 (11.8)0.24
    25–4437.7 (15.1)35.3 (12.4)<0.0139.3 (16.6)44.5 (12.7)<0.01
    45–6426.1 (14.3)31.0 (14.9)<0.0136.7 (17.2)38.1 (13.5)0.04
    65+b15.6 (11.0)13.5 (10.9)<0.01
Overweightc (%)
    15–2424.518.2<0.0122.520.10.11
    25–4433.430.6<0.0147.946.50.06
    45–6458.347.2<0.0169.159.6<0.01
    65+b70.254.9<0.01
High educational leveld (%)
    15–2412.06.7<0.019.73.3<0.01
    25–4427.624.6<0.0132.527.4<0.01
    45–64e16.615.4<0.0124.324.8<0.01
CategoriesWomen
Men
EPIC-NLThe NetherlandsPEPIC-NLThe NetherlandsP
1993–9720011993–972001
N28 0499502
Age, mean ± SD51.5 ± 11.443.3 ± 11.1
Current smokers (%) (cigarettes a day, mean)
    15–24a31.7 (13.1)36.1 (9.7)<0.0141.0 (13.0)38.8 (11.8)0.24
    25–4437.7 (15.1)35.3 (12.4)<0.0139.3 (16.6)44.5 (12.7)<0.01
    45–6426.1 (14.3)31.0 (14.9)<0.0136.7 (17.2)38.1 (13.5)0.04
    65+b15.6 (11.0)13.5 (10.9)<0.01
Overweightc (%)
    15–2424.518.2<0.0122.520.10.11
    25–4433.430.6<0.0147.946.50.06
    45–6458.347.2<0.0169.159.6<0.01
    65+b70.254.9<0.01
High educational leveld (%)
    15–2412.06.7<0.019.73.3<0.01
    25–4427.624.6<0.0132.527.4<0.01
    45–64e16.615.4<0.0124.324.8<0.01

a: 20–24 in EPIC-NL.

b: Only part of the women from Prospect were 65+ at baseline.

c: BMI ≥25.

d: Those with higher vocational education and university.

e: The reference data for 65+ are not available.

Table 1

Baseline characteristics of 37 551 EPIC-NL participants and the reference population (general Dutch population in 2001)

CategoriesWomen
Men
EPIC-NLThe NetherlandsPEPIC-NLThe NetherlandsP
1993–9720011993–972001
N28 0499502
Age, mean ± SD51.5 ± 11.443.3 ± 11.1
Current smokers (%) (cigarettes a day, mean)
    15–24a31.7 (13.1)36.1 (9.7)<0.0141.0 (13.0)38.8 (11.8)0.24
    25–4437.7 (15.1)35.3 (12.4)<0.0139.3 (16.6)44.5 (12.7)<0.01
    45–6426.1 (14.3)31.0 (14.9)<0.0136.7 (17.2)38.1 (13.5)0.04
    65+b15.6 (11.0)13.5 (10.9)<0.01
Overweightc (%)
    15–2424.518.2<0.0122.520.10.11
    25–4433.430.6<0.0147.946.50.06
    45–6458.347.2<0.0169.159.6<0.01
    65+b70.254.9<0.01
High educational leveld (%)
    15–2412.06.7<0.019.73.3<0.01
    25–4427.624.6<0.0132.527.4<0.01
    45–64e16.615.4<0.0124.324.8<0.01
CategoriesWomen
Men
EPIC-NLThe NetherlandsPEPIC-NLThe NetherlandsP
1993–9720011993–972001
N28 0499502
Age, mean ± SD51.5 ± 11.443.3 ± 11.1
Current smokers (%) (cigarettes a day, mean)
    15–24a31.7 (13.1)36.1 (9.7)<0.0141.0 (13.0)38.8 (11.8)0.24
    25–4437.7 (15.1)35.3 (12.4)<0.0139.3 (16.6)44.5 (12.7)<0.01
    45–6426.1 (14.3)31.0 (14.9)<0.0136.7 (17.2)38.1 (13.5)0.04
    65+b15.6 (11.0)13.5 (10.9)<0.01
Overweightc (%)
    15–2424.518.2<0.0122.520.10.11
    25–4433.430.6<0.0147.946.50.06
    45–6458.347.2<0.0169.159.6<0.01
    65+b70.254.9<0.01
High educational leveld (%)
    15–2412.06.7<0.019.73.3<0.01
    25–4427.624.6<0.0132.527.4<0.01
    45–64e16.615.4<0.0124.324.8<0.01

a: 20–24 in EPIC-NL.

b: Only part of the women from Prospect were 65+ at baseline.

c: BMI ≥25.

d: Those with higher vocational education and university.

e: The reference data for 65+ are not available.

For women, the total SMR calculated over total follow-up until 2010 was 65.9 (95% CI: 63.2–68.6) (table 2). This means that the mortality of female EPIC-NL participants was ∼34% lower than the mortality in the general female population. With increasing study follow-up, this healthy volunteer effect decreased from an SMR of 28.4% (95% CI: 19.2–40.6) in follow-up years 1993–95 to an SMR of 78.1% (95% CI: 72.7–83.7) for the period 2008–10, but all SMRs remained statistically significantly at <100%. Also with increasing participants’ follow-up time on the study, the healthy volunteer effect decreases from an SMR of 39.9% (95% CI: 33.4–47.2) for a follow-up of 3 years to an SMR of 65.9% (95% CI: 63.2–68.6) after a maximum follow-up of 18 years. For men, the observed healthy cohort effect was somewhat smaller (SMR 73.5%, 95% CI: 68.1–79.3). A similar trend with follow-up time was observed starting at an SMR of 52.8% (95% CI: 31.8–82.5) in 1993–95 and ending at 87.0% (95% CI: 75.8–99.4) in the period 2008–10. The healthy volunteer effect per participants time on the study also decreased from an SMR of 54.8% (95% CI: 40.7–72.3) after 3 years in the study to an SMR of 73.5% (95% CI: 68.1–79.3) after a maximum of 18 years in the study. The healthy volunteer effect was more pronounced in younger participants compared with older participants, both in men and women. When the analyses for women were stratified for cohort, we observed a somewhat lower SMR for the women from Prospect-EPIC (64.7%, 95% CI: 61.9–67.7) than for the women from the MORGEN-EPIC cohort (70.9%, 95% CI: 64.6–77.6) (data not shown in tables).

Table 2

SMRs among 37 551 participants of the EPIC-NL cohort stratified by sex

CategoriesWomen
Men
Observed deathsPerson- years at riskMortality ratea reference populationExpected deathbSMRCI of SMRObserved deathsPerson- years at riskMortality ratea reference populationExpected deathbSMRCI of SMR
nNn%%Nnn%%
Overall mortality2347417 60419.5356365.963.2–68.6682141 54832.192773.568.1–79.3
Calendar years
    1993–953022 89321.210628.419.2–40.619908536.43652.831.8–82.5
    1996–9820474,72820.639551.644.7–59.26725 08835.210862.348.3–79.1
    1999–200133982 56620.554861.955.5–68.89927 85433.914866.954.4–81.5
    2002–443381 19419.768463.357.5–69.513227 30632.118073.261.2–86.8
    2005–755179 22618.081867.461.9–73.714926 45329.020771.960.8–84.4
    2008–1079076 99716.8101278.172.7–83.721625 76226.324887.075.8–99.4
Follow-up periodc (years)
    313470 63619.533639.933.4–47.25023 87132.19154.840.7–72.3
    6417153 65619.583449.845.1–54.813951 90332.122661.551.8–72.7
    9820235,41619.5147055.852.0–59.726179,41832.139266.658.7–75.2
    121325315 37919.5223659.356.1–62.5397106 18032.158867.561.0–74.5
    151996389 30119.5313063.861.0–66.6559130 76532.180669.463.7–75.4
    182347417 60419.5356965.963.2–68.6682141 54832.192773.568.1–79.3
Age categories (years)
    20–29296440.3364.27.8–231.9366970.6468.314.1–199.6
    30–391829 7730.617103.761.4–163.81422 2270.82071.439.0–119.8
    40–494642 7881.67065.948.3–88.04435 3692.07459.243.0–79.5
    50–59255114 0143.947353.947.5–61.017242 2505.823972.161.7–83.7
    60–69812147 7399.1127163.959.6–68.529229 10616.840671.963.9–80.6
    70–7994267 03725.9139467.663.3–72.0153583449.917985.372.3–99.9
    80–89272660995.233581.171.8–91.4465149.0585.623.3–219.2
Cause-specific SMR
    CVD529417 6047.398253.949.4–58.6201141 54811.228271.361.8–81.9
    Cancer1219417 6044.4159076.772.4–81.1295141 5488.440073.765.5–82.6
CategoriesWomen
Men
Observed deathsPerson- years at riskMortality ratea reference populationExpected deathbSMRCI of SMRObserved deathsPerson- years at riskMortality ratea reference populationExpected deathbSMRCI of SMR
nNn%%Nnn%%
Overall mortality2347417 60419.5356365.963.2–68.6682141 54832.192773.568.1–79.3
Calendar years
    1993–953022 89321.210628.419.2–40.619908536.43652.831.8–82.5
    1996–9820474,72820.639551.644.7–59.26725 08835.210862.348.3–79.1
    1999–200133982 56620.554861.955.5–68.89927 85433.914866.954.4–81.5
    2002–443381 19419.768463.357.5–69.513227 30632.118073.261.2–86.8
    2005–755179 22618.081867.461.9–73.714926 45329.020771.960.8–84.4
    2008–1079076 99716.8101278.172.7–83.721625 76226.324887.075.8–99.4
Follow-up periodc (years)
    313470 63619.533639.933.4–47.25023 87132.19154.840.7–72.3
    6417153 65619.583449.845.1–54.813951 90332.122661.551.8–72.7
    9820235,41619.5147055.852.0–59.726179,41832.139266.658.7–75.2
    121325315 37919.5223659.356.1–62.5397106 18032.158867.561.0–74.5
    151996389 30119.5313063.861.0–66.6559130 76532.180669.463.7–75.4
    182347417 60419.5356965.963.2–68.6682141 54832.192773.568.1–79.3
Age categories (years)
    20–29296440.3364.27.8–231.9366970.6468.314.1–199.6
    30–391829 7730.617103.761.4–163.81422 2270.82071.439.0–119.8
    40–494642 7881.67065.948.3–88.04435 3692.07459.243.0–79.5
    50–59255114 0143.947353.947.5–61.017242 2505.823972.161.7–83.7
    60–69812147 7399.1127163.959.6–68.529229 10616.840671.963.9–80.6
    70–7994267 03725.9139467.663.3–72.0153583449.917985.372.3–99.9
    80–89272660995.233581.171.8–91.4465149.0585.623.3–219.2
Cause-specific SMR
    CVD529417 6047.398253.949.4–58.6201141 54811.228271.361.8–81.9
    Cancer1219417 6044.4159076.772.4–81.1295141 5488.440073.765.5–82.6

a: The mortality rate (cases/1000) is the average age- and calendar year-specific mortality rate of the reference population in that category.

b: The number of expected deaths cannot be recalculated from this table because it has been calculated from age- and calendar year-specific rates.

c: The follow-up period represents the maximum period participants are on the study.

Table 2

SMRs among 37 551 participants of the EPIC-NL cohort stratified by sex

CategoriesWomen
Men
Observed deathsPerson- years at riskMortality ratea reference populationExpected deathbSMRCI of SMRObserved deathsPerson- years at riskMortality ratea reference populationExpected deathbSMRCI of SMR
nNn%%Nnn%%
Overall mortality2347417 60419.5356365.963.2–68.6682141 54832.192773.568.1–79.3
Calendar years
    1993–953022 89321.210628.419.2–40.619908536.43652.831.8–82.5
    1996–9820474,72820.639551.644.7–59.26725 08835.210862.348.3–79.1
    1999–200133982 56620.554861.955.5–68.89927 85433.914866.954.4–81.5
    2002–443381 19419.768463.357.5–69.513227 30632.118073.261.2–86.8
    2005–755179 22618.081867.461.9–73.714926 45329.020771.960.8–84.4
    2008–1079076 99716.8101278.172.7–83.721625 76226.324887.075.8–99.4
Follow-up periodc (years)
    313470 63619.533639.933.4–47.25023 87132.19154.840.7–72.3
    6417153 65619.583449.845.1–54.813951 90332.122661.551.8–72.7
    9820235,41619.5147055.852.0–59.726179,41832.139266.658.7–75.2
    121325315 37919.5223659.356.1–62.5397106 18032.158867.561.0–74.5
    151996389 30119.5313063.861.0–66.6559130 76532.180669.463.7–75.4
    182347417 60419.5356965.963.2–68.6682141 54832.192773.568.1–79.3
Age categories (years)
    20–29296440.3364.27.8–231.9366970.6468.314.1–199.6
    30–391829 7730.617103.761.4–163.81422 2270.82071.439.0–119.8
    40–494642 7881.67065.948.3–88.04435 3692.07459.243.0–79.5
    50–59255114 0143.947353.947.5–61.017242 2505.823972.161.7–83.7
    60–69812147 7399.1127163.959.6–68.529229 10616.840671.963.9–80.6
    70–7994267 03725.9139467.663.3–72.0153583449.917985.372.3–99.9
    80–89272660995.233581.171.8–91.4465149.0585.623.3–219.2
Cause-specific SMR
    CVD529417 6047.398253.949.4–58.6201141 54811.228271.361.8–81.9
    Cancer1219417 6044.4159076.772.4–81.1295141 5488.440073.765.5–82.6
CategoriesWomen
Men
Observed deathsPerson- years at riskMortality ratea reference populationExpected deathbSMRCI of SMRObserved deathsPerson- years at riskMortality ratea reference populationExpected deathbSMRCI of SMR
nNn%%Nnn%%
Overall mortality2347417 60419.5356365.963.2–68.6682141 54832.192773.568.1–79.3
Calendar years
    1993–953022 89321.210628.419.2–40.619908536.43652.831.8–82.5
    1996–9820474,72820.639551.644.7–59.26725 08835.210862.348.3–79.1
    1999–200133982 56620.554861.955.5–68.89927 85433.914866.954.4–81.5
    2002–443381 19419.768463.357.5–69.513227 30632.118073.261.2–86.8
    2005–755179 22618.081867.461.9–73.714926 45329.020771.960.8–84.4
    2008–1079076 99716.8101278.172.7–83.721625 76226.324887.075.8–99.4
Follow-up periodc (years)
    313470 63619.533639.933.4–47.25023 87132.19154.840.7–72.3
    6417153 65619.583449.845.1–54.813951 90332.122661.551.8–72.7
    9820235,41619.5147055.852.0–59.726179,41832.139266.658.7–75.2
    121325315 37919.5223659.356.1–62.5397106 18032.158867.561.0–74.5
    151996389 30119.5313063.861.0–66.6559130 76532.180669.463.7–75.4
    182347417 60419.5356965.963.2–68.6682141 54832.192773.568.1–79.3
Age categories (years)
    20–29296440.3364.27.8–231.9366970.6468.314.1–199.6
    30–391829 7730.617103.761.4–163.81422 2270.82071.439.0–119.8
    40–494642 7881.67065.948.3–88.04435 3692.07459.243.0–79.5
    50–59255114 0143.947353.947.5–61.017242 2505.823972.161.7–83.7
    60–69812147 7399.1127163.959.6–68.529229 10616.840671.963.9–80.6
    70–7994267 03725.9139467.663.3–72.0153583449.917985.372.3–99.9
    80–89272660995.233581.171.8–91.4465149.0585.623.3–219.2
Cause-specific SMR
    CVD529417 6047.398253.949.4–58.6201141 54811.228271.361.8–81.9
    Cancer1219417 6044.4159076.772.4–81.1295141 5488.440073.765.5–82.6

a: The mortality rate (cases/1000) is the average age- and calendar year-specific mortality rate of the reference population in that category.

b: The number of expected deaths cannot be recalculated from this table because it has been calculated from age- and calendar year-specific rates.

c: The follow-up period represents the maximum period participants are on the study.

The healthy volunteer effect among women was stronger for mortality due to CVD (SMR 53.9%, 95% CI: 49.4–58.6) than for mortality due to cancer (SMR 76.7%, 95% CI: 72.4–81.1). In men, the healthy volunteer effect was approximately the same for CVD and cancer mortality (SMR CVD 71.3%, 95% CI: 61.8–81.9, SMR cancer 73.7%, 95% CI: 65.5–82.6) and comparable with the SMR of overall mortality.

Cancer incidence rates in EPIC-NL were also lower than that in the reference population. The SIR for women was 82.7% (95% CI: 79.5–86.1) and 78.3% (95% CI: 71.8–85.2) for men (table 3). Although the SMRs clearly decrease over follow-up time, this is not observed for SIRs. Additionally, for cancer incidence, the healthy volunteer effect was not more pronounced in younger participants as was the case for the SMRs.

Table 3

SIRs of cancer incidence among 37 705 participants of the EPIC-NL cohort stratified for sex

CategoriesWomen
Men
Observed casesPerson- years at riskIncidence ratea reference populationExpected casesbSIRCI of SIRObserved casesPerson- years at riskIncidence ratea reference populationExpected casesbSIRCI of SIR
nnn%%nnn%%
Overall cancer2470331 6958.5298582.779.5–86.1538114 50113.968778.371.8–85.2
Calendar years
    1993–9512722 8847.914587.773.1–104.328909013.93092.661.5–133.8
    1996–9846274 1558.052388.380.4–96.76825 06213.89869.754.1–88.4
    1999–200152980 9548.466979.172.5–86.211127 65713.413880.266.0–96.6
    2002–465278 4358.876785.078.6–91.713826 93113.918574.562.6–88.0
    2005–770075 2679.488179.473.7–85.519325 76114.423681.770.6–94.1
Follow-up periodc (years)
    341066 5048.544292.483.9–102.17022 42213.97574.5
    6908152 1788.5110382.377.0–87.816451 81913.920679.667.8–92.7
    91536231 5328.5183983.579.4–87.829479 04113.937279.170.3–88.7
    122224304 0548.5264084.280.8–87.8444103 79613.957477.370.3–84.9
    152470331 6958.5298582.779.5–86.1538114 50113.968778.371.8–85.2
Age categories (years)
    20–29596560.45109.535.6–255.5167430.4334.90.9–194.5
    30–393426 3711.43791.963.7–128.41720 0160.715115.567.3–184.9
    40–4910935 9873.713680.065.7–96.53830 3561.85668.148.2–93.5
    50–59683103 5357.477887.781.3–94.616334 5636.121077.566.1–90.4
    60–691087112 91811.5130783.278.3–88.327820 62917.233682.673.2–92.9
    70–7953241 77816.169176.970.5–83.841219432.66761.043.8–82.8
    80–8920145019.23165.540.0–101.2
Cancer types
    Breast cancer928336 9192.4100692.286.4–98.4
    Lung cancer189341 6900.628167.857.9–77.473116 1352.3712160.647.5–76.2
    Colorectal cancer334340 7051.236990.681.1–100.986115 9701.789095.676.5–118.1
    Prostate cancer147115 6942.84125117.499.2–138.0
CategoriesWomen
Men
Observed casesPerson- years at riskIncidence ratea reference populationExpected casesbSIRCI of SIRObserved casesPerson- years at riskIncidence ratea reference populationExpected casesbSIRCI of SIR
nnn%%nnn%%
Overall cancer2470331 6958.5298582.779.5–86.1538114 50113.968778.371.8–85.2
Calendar years
    1993–9512722 8847.914587.773.1–104.328909013.93092.661.5–133.8
    1996–9846274 1558.052388.380.4–96.76825 06213.89869.754.1–88.4
    1999–200152980 9548.466979.172.5–86.211127 65713.413880.266.0–96.6
    2002–465278 4358.876785.078.6–91.713826 93113.918574.562.6–88.0
    2005–770075 2679.488179.473.7–85.519325 76114.423681.770.6–94.1
Follow-up periodc (years)
    341066 5048.544292.483.9–102.17022 42213.97574.5
    6908152 1788.5110382.377.0–87.816451 81913.920679.667.8–92.7
    91536231 5328.5183983.579.4–87.829479 04113.937279.170.3–88.7
    122224304 0548.5264084.280.8–87.8444103 79613.957477.370.3–84.9
    152470331 6958.5298582.779.5–86.1538114 50113.968778.371.8–85.2
Age categories (years)
    20–29596560.45109.535.6–255.5167430.4334.90.9–194.5
    30–393426 3711.43791.963.7–128.41720 0160.715115.567.3–184.9
    40–4910935 9873.713680.065.7–96.53830 3561.85668.148.2–93.5
    50–59683103 5357.477887.781.3–94.616334 5636.121077.566.1–90.4
    60–691087112 91811.5130783.278.3–88.327820 62917.233682.673.2–92.9
    70–7953241 77816.169176.970.5–83.841219432.66761.043.8–82.8
    80–8920145019.23165.540.0–101.2
Cancer types
    Breast cancer928336 9192.4100692.286.4–98.4
    Lung cancer189341 6900.628167.857.9–77.473116 1352.3712160.647.5–76.2
    Colorectal cancer334340 7051.236990.681.1–100.986115 9701.789095.676.5–118.1
    Prostate cancer147115 6942.84125117.499.2–138.0

a: The incidence rate (cases/1000) is the average age- and calendar year-specific incidence rate of the reference population in that category.

b: The number of expected deaths cannot be recalculated from this table because it has been calculated from age- and calendar year-specific rates.

c: The follow-up period represents the maximum period participants are on the study.

Table 3

SIRs of cancer incidence among 37 705 participants of the EPIC-NL cohort stratified for sex

CategoriesWomen
Men
Observed casesPerson- years at riskIncidence ratea reference populationExpected casesbSIRCI of SIRObserved casesPerson- years at riskIncidence ratea reference populationExpected casesbSIRCI of SIR
nnn%%nnn%%
Overall cancer2470331 6958.5298582.779.5–86.1538114 50113.968778.371.8–85.2
Calendar years
    1993–9512722 8847.914587.773.1–104.328909013.93092.661.5–133.8
    1996–9846274 1558.052388.380.4–96.76825 06213.89869.754.1–88.4
    1999–200152980 9548.466979.172.5–86.211127 65713.413880.266.0–96.6
    2002–465278 4358.876785.078.6–91.713826 93113.918574.562.6–88.0
    2005–770075 2679.488179.473.7–85.519325 76114.423681.770.6–94.1
Follow-up periodc (years)
    341066 5048.544292.483.9–102.17022 42213.97574.5
    6908152 1788.5110382.377.0–87.816451 81913.920679.667.8–92.7
    91536231 5328.5183983.579.4–87.829479 04113.937279.170.3–88.7
    122224304 0548.5264084.280.8–87.8444103 79613.957477.370.3–84.9
    152470331 6958.5298582.779.5–86.1538114 50113.968778.371.8–85.2
Age categories (years)
    20–29596560.45109.535.6–255.5167430.4334.90.9–194.5
    30–393426 3711.43791.963.7–128.41720 0160.715115.567.3–184.9
    40–4910935 9873.713680.065.7–96.53830 3561.85668.148.2–93.5
    50–59683103 5357.477887.781.3–94.616334 5636.121077.566.1–90.4
    60–691087112 91811.5130783.278.3–88.327820 62917.233682.673.2–92.9
    70–7953241 77816.169176.970.5–83.841219432.66761.043.8–82.8
    80–8920145019.23165.540.0–101.2
Cancer types
    Breast cancer928336 9192.4100692.286.4–98.4
    Lung cancer189341 6900.628167.857.9–77.473116 1352.3712160.647.5–76.2
    Colorectal cancer334340 7051.236990.681.1–100.986115 9701.789095.676.5–118.1
    Prostate cancer147115 6942.84125117.499.2–138.0
CategoriesWomen
Men
Observed casesPerson- years at riskIncidence ratea reference populationExpected casesbSIRCI of SIRObserved casesPerson- years at riskIncidence ratea reference populationExpected casesbSIRCI of SIR
nnn%%nnn%%
Overall cancer2470331 6958.5298582.779.5–86.1538114 50113.968778.371.8–85.2
Calendar years
    1993–9512722 8847.914587.773.1–104.328909013.93092.661.5–133.8
    1996–9846274 1558.052388.380.4–96.76825 06213.89869.754.1–88.4
    1999–200152980 9548.466979.172.5–86.211127 65713.413880.266.0–96.6
    2002–465278 4358.876785.078.6–91.713826 93113.918574.562.6–88.0
    2005–770075 2679.488179.473.7–85.519325 76114.423681.770.6–94.1
Follow-up periodc (years)
    341066 5048.544292.483.9–102.17022 42213.97574.5
    6908152 1788.5110382.377.0–87.816451 81913.920679.667.8–92.7
    91536231 5328.5183983.579.4–87.829479 04113.937279.170.3–88.7
    122224304 0548.5264084.280.8–87.8444103 79613.957477.370.3–84.9
    152470331 6958.5298582.779.5–86.1538114 50113.968778.371.8–85.2
Age categories (years)
    20–29596560.45109.535.6–255.5167430.4334.90.9–194.5
    30–393426 3711.43791.963.7–128.41720 0160.715115.567.3–184.9
    40–4910935 9873.713680.065.7–96.53830 3561.85668.148.2–93.5
    50–59683103 5357.477887.781.3–94.616334 5636.121077.566.1–90.4
    60–691087112 91811.5130783.278.3–88.327820 62917.233682.673.2–92.9
    70–7953241 77816.169176.970.5–83.841219432.66761.043.8–82.8
    80–8920145019.23165.540.0–101.2
Cancer types
    Breast cancer928336 9192.4100692.286.4–98.4
    Lung cancer189341 6900.628167.857.9–77.473116 1352.3712160.647.5–76.2
    Colorectal cancer334340 7051.236990.681.1–100.986115 9701.789095.676.5–118.1
    Prostate cancer147115 6942.84125117.499.2–138.0

a: The incidence rate (cases/1000) is the average age- and calendar year-specific incidence rate of the reference population in that category.

b: The number of expected deaths cannot be recalculated from this table because it has been calculated from age- and calendar year-specific rates.

c: The follow-up period represents the maximum period participants are on the study.

We also stratified for different cancer types. Especially, lung cancer incidence was much lower than expected in the EPIC-NL cohort for both men and women (table 3). Compared with the general population, the incidence of breast cancer in women was also lower. Colorectal cancer incidence and prostate cancer incidence among men did not differ significantly compared with the general population.

Discussion

The results demonstrate a healthy volunteer effect for men and women at all ages of the EPIC-NL cohort. After a median follow-up of 15 years, overall mortality among men and women was 26 and 34% lower, respectively, compared with the general population (SMR 73.5 and 65.9%, respectively). The healthy volunteer effect was strongest during the first years of follow-up and declined over time but was still observable after 15 years of follow-up. Cancer incidence was somewhat less affected by the healthy cohort effect (SMR 78.3 and 82.7% for men and women, respectively) and remained stable over time. The results indicate that when causal relationships are of interest, the power in the first years of follow-up is lower than expected from the general population. Together with a possible lower degree of variation in the exposure of interest among the more healthy population, this might lead to small effects remaining undetected. When cohorts like EPIC-NL, where a healthy volunteer effect is present, are used, for example, for the estimation of absolute risks or prediction models, it is important to realize that the cohort is not completely representative of the target population.

Other studies reported similar healthy volunteer effects. Froom et al. reported an SMR among male volunteers of 71% after 8 years of follow-up.2 In the EPIC-Oxford cohort, the SMR was lower (52%).7 Two other prospective cohorts also reported low SMRs of 59 and 52%.8 Participants from the latter three studies were mainly vegetarians who were selected via vegetarian societies, health food shops and magazines. This has likely resulted in an even more health-conscious study population and might explain the lower SMRs. Screening trials also seem to attract more health-conscious participants. Among volunteers of a US cancer screening trial, SMRs of 46% for men and 38% for women were observed over 7 years of follow-up.4 This strong healthy volunteer effect in this screening trial might also be caused by the fact that persons who were already diagnosed with prostate, lung, colorectal or ovarian cancer were excluded from the trial, leaving a healthier group for analysis. A similar effect was reported in an ovarian cancer screening trial that reported an overall SMR of 37% after an average of 5.6 years of follow-up.5 However, in the Rotterdam part of the European Randomized Study of Screening for Prostate Cancer, the SMR was 86.7% after an average follow-up of only 2.8 years.6 When we stratified our analysis for women recruited via breast cancer screening (Prospect-EPIC) and those recruited from the general population (MORGEN-EPIC), we observed a somewhat stronger reduced overall mortality among the women from Prospect-EPIC, i.e. 64.7 vs. 70.9%, respectively, suggesting that the healthy cohort effect might be stronger for cohorts based on the screening. However, it should be taken into account that the woman from the MORGEN cohort are on average much younger, and morbidity and mortality are still lower compared with the Prospect cohort.

Another way of exploring the healthy volunteer effect is by comparing the mortality of study participants with non-participants.18–20 However, in many situations, like in our cohort follow-up of non-participants, it is not feasible because of the absence of informed consent. The response rate was 34.5% for the Prospect-EPIC cohort and 45% for the MORGEN-EPIC cohort, resulting in an overall response rate of 40% for the whole EPIC-NL cohort. Van Loon et al.11 compared characteristics of MORGEN-EPIC participants with those of non-responders who gave limited information, showing only minor differences in the level of education and proportion of smokers. However, ∼34% of the invitees did not respond at all, and it cannot be excluded that those non-responders have different educational level or smoking habits. In addition, we compared the characteristics of participants who are included in the EPIC-NL study but did not give informed consent for linkage (n = 2460; 6.2%) with the municipal registry and Statistics Netherlands with those who gave permission. Participants who did not give informed consent were on average younger, more often current smokers and less active than those who gave permission (data not shown). Exclusion of these participants might contribute to the healthy volunteer effect.

In women, the healthy volunteer effect was stronger for mortality due to CVD than for mortality due to cancer. This was also observed in other studies.2, 4–8 It is possible that a healthier lifestyle reduces CVD mortality more than cancer mortality. It might also implicate that people who eventually die from CVD experience more complaints of their disease long before, which reduces willingness to participate in studies. Why we did not observe a similar healthy cohort effect in men and women is unclear. Possibly, the number of deaths was still too low.

Over time, the healthy volunteer effect was fading, which has also been observed in other studies.4,5 This seems intuitive, as persons with a disease are less likely to participate and lifestyle takes decades before impacting on major chronic diseases. Thus, the effect of a healthy selection at start is likely to slowly disappear over time, although not completely, as after even >10 years of follow-up a lower mortality is observed in the EPIC-NL cohort. The SIRs for cancer incidence remained more stable over time in our study because participants are probably more health conscious and less likely to develop cancer and remain more health conscious, resulting in lower cancer incidence in all years of the study. This has also been observed in the cancer screening trial of Pinsky et al.4 In the paper of Burnell et al., the SIR increased over time, but follow-up was only 5 years.5

Participants of the EPIC-NL cohort are higher educated compared with the general population. A higher socio-economic status has been associated with earlier cancer treatment and therefore better survival and lower mortality.21 Furthermore, higher-educated persons tend to have a healthier lifestyle; this might partly explain the healthy volunteer effect. However, the significantly lower mortality and cancer incidence in our study cannot be easily explained by more favourable distributions of the risk factors—overweight and smoking. The percentage of overweight was higher in EPIC-NL participants than in the reference population. However, unlike the measured EPIC-NL data, the reference data for overweight were based on self-report. The proportion of overweight in the general reference population might therefore be underestimated; it is known that people tend to underestimate their weight and that the level of underreporting increases with increasing overweight.22 EPIC-NL women did not seem to smoke less often than the general population. The reference data used to compare the lifestyle of the participants were available for 2001, whereas the information on smoking status for the EPIC-NL participants is from 1993–97. Therefore, the percentage of smokers in the reference data may have been too low because the percentage of smokers has decreased over this time. Moreover, the reference data on smoking were also based on surveys among the general population. Respondents to these surveys might also be more health conscious and have a more favourable distribution of risk factors than the total population. Therefore, we cannot exclude the possibility that smoking and overweight explains some of the healthy volunteer effect in our cohort. Unfortunately, there were no good reference data available to compare physical activity and diet.

We conclude that there is a healthy volunteer effect in the EPIC-NL study causing lower mortality rates compared with the general population, which tapers off during follow-up. Also, cancer incidences are lower compared with the general population. When the cohort is used to obtain absolute risks or information for prediction models, it is important to realize that cohort studies like EPIC-NL comprise a subgroup of the population with some different characteristics than the general population where the results are used for. The consequence of the healthy volunteer effect when studying etiological associations is that small effects might remain undetected when comparisons are made within the cohort, especially in the first years of follow-up. Therefore, follow-up needs to be long enough to generate sufficient power.

Acknowledgements

The authors thank the National Cancer registry and Statistics Netherlands for follow-up data. None of the study sponsors had a role in the study design, data collection, analysis or interpretation, report writing or the decision to submit the report for publication. This work has previously been presented orally at the following conference: WEON 2012, Rotterdam, 14 June. The authors' responsibilities were as follows: E.A.S. conducted the statistical analyses and drafted the article; A.M.M., J.W.J.B. and P.H.M.P. supervised data analysis and provided critical review of the article; C.H.G., E.M.M., J.M.A.B., W.M.M.V., H.B.B.-M. and Y.T.S. provided critical review of the article. J.W.J.B., H.B.B.-M. and P.H.M.P. shared responsibility for data collection. All authors read and approved the final article.

Funding

Dutch research council (NWO-ZonMW; grant no 40-00812-98-10040). The EPIC-NL study was funded by the ‘Europe against Cancer’ Program of the European Commission (SANCO), the Dutch Ministry of Health, the Dutch Cancer Society, the Netherlands Organisation for Health research and Development (ZonMW) and World Cancer Research Fund (WCRF).

Conflict of interest: None declared.

Key points

  • Our results show a healthy volunteer effect in the EPIC-NL cohort, which is strongest during the first years of follow-up and declined over time but was still observable after 15 years of follow-up.

  • Cancer incidence was also lower than expected from the general population, but this effect remained stable over time.

  • Especially in the first years of follow-up, the power of the study is lower than expected from the general population with the consequence that when studying etiological associations, small effects might remain undetected.

References

1
Breslow
NE
Day
NE
Statistical Methods in Cancer Research: Volume II—The Design and Analysis of Cohort Studies
1987
Lyon
International Agency for Research on Cancer
2
Froom
P
Melamed
S
Kristal-Boneh
E
, et al. 
Healthy volunteer effect in industrial workers
J Clin Epidemiol
1999
, vol. 
52
 (pg. 
731
-
5
)
3
Ederer
F
Church
TR
Mandel
JS
Sample sizes for prevention trials have been too small
Am J Epi
1993
, vol. 
137
 (pg. 
787
-
96
)
4
Pinsky
PF
Miller
A
Kramer
BS
, et al. 
Evidence of a healthy volunteer effect in the prostate, lung, colorectal, and ovarian cancer screening trial
Am J Epidemiol
2007
, vol. 
165
 (pg. 
874
-
81
)
5
Burnell
M
Gentry-Maharaj
A
Ryan
A
, et al. 
Impact on mortality and cancer incidence rates of using random invitation from population registers for recruitment to trials
Trials
2011
, vol. 
12
 (pg. 
61
-
70
)
6
Otto
SJ
Schroder
FH
de Koning
HJ
Low all-cause mortality in the volunteer-based Rotterdam section of the European randomised study of screening for prostate cancer: self-selection bias?
J Med Screen
2004
, vol. 
11
 (pg. 
89
-
92
)
7
Key
TJ
Appleby
PN
Spencer
EA
, et al. 
Mortality in British vegetarians: results from the European Prospective Investigation into Cancer and Nutrition (EPIC-Oxford)
Am J Clin Nutr
2009
, vol. 
89
 (pg. 
1613S
-
9S
)
8
Key
TJ
Appleby
PN
Davey
GK
, et al. 
Mortality in British vegetarians: review and preliminary results from EPIC-Oxford
Am J Clin Nutr
2003
, vol. 
78
 (pg. 
533S
-
8S
)
9
Beulens
JW
Monninkhof
EM
Verschuren
WM
, et al. 
Cohort profile: the EPIC-NL study
Int J Epidemiol
2010
, vol. 
39
 (pg. 
1170
-
8
)
10
Boker
LK
van Noord
PAH
van der Schouw
YT
, et al. 
Prospect-EPIC Utrecht: study design and characteristics of the cohort population
Eur J Epidemiol
2001
, vol. 
17
 (pg. 
1047
-
53
)
11
van Loon
AJ
Tijhuis
M
Picavet
HS
, et al. 
Survey non-response in the Netherlands. Effects on prevalence estimates and associations
Ann Epidemiol
2003
, vol. 
13
 (pg. 
105
-
10
)
12
Statistics Netherlands, statline. Gezondheid, leefstijl, zorggebruik, t/m, Den Haag, 2009. Available at: http://statline.cbs.nl (December 2012, date last accessed)
13
Statistics Netherlands, statline. Beroepsbevolking; geslacht en leeftijd, Den Haag. Available at: http://statline.cbs.nl (December 2012, date last accessed)
14
Netherlands Official Statistics. Vol. Special Issue: Integration of Household Surveys: Design, Advantages and Methods. Voorburg/Heerlen: Statistics Netherlands, 1998
16
Statistics Netherlands, statline. Sterfte, geslacht, leeftijd en burgerlijke staat, Den Haag. Available at: http://statline.cbs.nl (December 2012, date last accessed)
17
Dutch cancer registration. Available at: http://cijfersoverkanker.nl (December 2012, date last accessed)
18
Lindsted
KD
Fraser
GE
Steinkohl
M
, et al. 
Healthy volunteer effect in a cohort study: temporal resolution in the Adventist Health Study
J Clin Epidemiol
1996
, vol. 
49
 (pg. 
783
-
90
)
19
Benfante
R
Reed
D
MacLean
C
Kagan
A
Response bias in the Honolulu Heart Program
Am J Epidemiol
1989
, vol. 
130
 (pg. 
1088
-
100
)
20
Criqui
MH
Barrett-Connor
E
Austin
M
Differences between respondents and nonrespondents in a population-based cardiovascular disease study
Am J Epidemiol
1978
, vol. 
108
 (pg. 
367
-
72
)
21
Aarts
MJ
Lemmens
VEPP
Louwman
MWJ
, et al. 
Socioeconomic status and changing inequalities in colorectal cancer? A review of the associations with risk, treatment and outcome
Eur J Cancer
2010
, vol. 
46
 (pg. 
2681
-
95
)
22
May
AM
Barnes
DR
Forouhi
NG
, et al. 
Prediction of measured weight from self-reported weight was not improved after stratification by body mass index
Obesity
2013
, vol. 
21
 (pg. 
137
-
42
)

Author notes

*These authors contributed equally to this work.

Comments

0 Comments
Submit a comment
You have entered an invalid code
Thank you for submitting a comment on this article. Your comment will be reviewed and published at the journal's discretion. Please check for further notifications by email.