Case ascertainment and selection
Colon cancers were defined as tumours in the cecum, appendix, ascending colon, hepatic flexure, transverse colon, splenic flexure, descending and sigmoid colon (C18.0-C18.7 as per the 10th Revision of the International Statistical Classification of Diseases, Injury and Causes of Death), and overlapping or unspecified origin tumours (C18.8 and C18.9). Rectal cancers were defined as tumours occurring at the rectosigmoid junction (C19) or rectum (C20). Anal canal cancers were excluded. Colorectal cancer is the combination of the colon and rectal cancer cases.
After exclusions (56 cases for missing matching information, 31 cases for missing laboratory 25-(OH)D data for the case-control set), a total of 1248 first incident colorectal cases (colon cancer=785; rectal cancer=463) were identified. Cases were not selected from Norway (blood samples only recently collected; few colorectal cancers diagnosed after blood donation) and the Malmö centre of Sweden. The numbers of cases for analyses of dietary vitamin D, calcium, and retinol were 772 colon and 448 rectal because of missing nutrient intake values from Greece.
Control selection
Controls were selected (1:1) by incidence density sampling from all cohort members alive and free of cancer at the time of diagnosis of the cases and were matched by age (plus or minus six months at recruitment), sex, study centre (to account for centre-specific differences such as questionnaire design and blood collection procedures), time of the day at blood collection, and fasting status at the time of blood collection (less than three hours, three to six hours, and more than six hours). Women were further matched by menopausal status (pre-menopausal, post-menopausal, peri-menopausal/unknown), phase of menstrual cycle at time of blood collection, and usage of hormone replacement therapy at time of blood collection (yes/no). The additional matching criteria for women were needed for other studies that were being done using the same matched case-control sets. The numbers of case-control matched sets from each country are shown in table 1⇓.
Table 1 Description of cases and matched controls, by anatomical site
Laboratory assays
The feasibility and reliability of measuring 25-(OH)D in EPIC samples has been previously established.25 Vitamin D status was quantitatively determined by measuring 25-(OH)D in 25 μl of serum (heparin plasma for Swedish samples) using a commercially available enzyme immunoassay kit (OCTEIA 25-(OH)D Kit, Immuno Diagnostic Systems, Boldon, UK) at the Laboratory for Health Protection Research, National Institute for Public Health and the Environment, the Netherlands. The kit is specific for 100% of vitamin D3 origin and 75% of vitamin D2 origin. For technical reasons, some case-control sets were not measured in the same analytical batch. However, batch-to-batch differences are considered to be minor: the coefficient of variation (inter-assay) as determined with two kit control samples was minimal (5.9% at the level of 20.3 nmol/L and 5.4% at the level of 77.4 nmol/l), no significant between-day drift, time shifts, or other trends were observed and the percentage of variance attributable to batch-to-batch differences was 4.5%. For all analyses, laboratory technicians were blinded to the case-control status of the samples.
Statistical analysis
Differences between cases and controls in mean dietary variables, circulating 25-(OH)D levels and baseline covariates were tested by paired t-tests of the values in each case-control set for colon and rectum anatomical sub-sites separately. For categorical variables (smoking status, physical activity, education level), case-control differences were assessed by conditional logistic regression.
Conditional logistic regression, stratified by the case-control set, was used to estimate the risk and 95% confidence intervals of colorectal cancers and cancers of the colon and rectum in relation to levels of intake of dietary variables and circulating 25-(OH)D concentrations (SAS statistical software, version 9, SAS Institute, Cary, NC). In a nested case-control study where controls are selected using incidence density sampling, this procedure estimates the incidence rate ratio which, given the rarity of the disease, is roughly equal to the odds ratio.26 For dietary vitamin D and calcium, quintile cut-points were based on the variable distributions in all the controls combined. Circulating 25-(OH)D concentration was divided into five categories with predefined cut-points on the basis of proposed levels of vitamin D deficiency/insufficiency:27 28 29 30 category 1: <25.0 nmol/l, category 2: ≥25.0 to <50.0 nmol/l, category 3 (referent): ≥50.0 to <75.0 nmol/l, category 4: ≥75.0 to 100.0 nmol/l, category 5: ≥100.0 nmol/l. A level of between ≥50.0 and <75.0 nmol/l was assumed as a central, mid-range reference category in order to provide stability in the statistical analyses and for a clearer ascertainment of the cancer risk consequences of both lower and higher 25-(OH)D concentrations. As an additional analysis, circulating 25-(OH)D concentration was also divided by quintiles based on the distribution in the control members, with the lowest category chosen as the referent. Quintile cut-points are described in web table 1.
For all variables of interest, risk estimates were computed as both univariate analyses based on the matching factors, and multivariate analyses, with additional adjustments for potential confounders including body mass index (kg/m2), physical activity (combined recreational and household activity; expressed as sex-specific categories of metabolic equivalents), duration/status/intensity of smoking (table 1), education level (an indicator variable for socioeconomic status), total energy intake (in quartiles), total intake of fruits (quartiles), total intake of vegetables (quartiles), total intake of red and processed meats (quartiles), and total alcohol intake (categorical cut-points for men: non-consumers, 1 to 10, 11 to 20, 21 to 40, >40 g/day; cut-points for women: non-consumers, 1 to 5, 6 to 15, 16 to 25, >25 g/day). Models similar to the above were also run with variables included in the model as log transformed continuous variables with the incidence rate ratio estimated for the risk related to a 10% increase in the value of the variable. Potential effects of dietary fibre intake, as well as consumption of dairy products and fish (rich dietary sources of vitamin D) were examined, but they did not provide appreciable changes in risk estimates and were not included in the final models. For all models, tests for linear trend were performed using a score variable with values from 1 to 5, consistent with the category/quintile grouping.
To assess any effects of the season or month of blood collection, two different approaches were used. As a first approach, incident rate ratios and 95% confidence intervals were calculated as described above but with an additional adjustment for season of blood collection (categorical variable: winter, spring, summer, autumn). In a second approach, circulating 25-(OH)D concentrations were standardised and the standardised values were then used in conditional regression models as described above. The results were then compared with those of the non-standardised 25-(OH)D. Circulating 25-(OH)D concentrations were standardised using two different methods: (a) by the month of blood collection calculated by adding the overall mean of the circulating 25-(OH)D for all subjects to the residuals derived from a simple regression model fitted to circulating 25-(OH)D concentration by month of blood collection and (b) by the method of Munger et al.31
All analysis models were run separately for colorectal cancer and by anatomical sub-site: colon, left colon, right colon, and rectum using the same categorical cut-points as for colorectal cancer.
Since a primary function of vitamin D is maintenance of calcium homoeostasis, a potential interaction of the effect of circulating 25-(OH)D concentration with the level of dietary calcium intake on colorectal cancer risk was explored by including a single degree of freedom interaction term formed by the product of the 25-(OH)D category value (cut-points: <50.0 nmol/l, ≥50.0 to <75.0 nmol/l, ≥75.0 nmol/l) and the dietary calcium tertile. The statistical significance of a linear interaction was assessed using the likelihood ratio test. Similar models were used to test for interaction of 25-(OH)D with amount of alcohol consumption and dietary retinol intake.
For all the main variables, heterogeneity of effects by sex and anatomical subsite were assessed by χ2 statistics. To assess the potential for reverse causality (lower 25-(OH)D level due to the presence of disease at enrolment) and to assess the effect of follow-up time in greater detail, analyses were also performed excluding cases diagnosed with less than two years of follow-up. All statistical tests were two tailed.
Rapid responses
Latest Responses
Re: Ventilator associated pneumonia
Published 30 May 2012
Re: Restless legs syndrome
Published 30 May 2012
Author's reply
Published 30 May 2012
Re: Full access to trial data holds many benefits and a few pitfalls, conference hears
Published 30 May 2012
Restless Legs Syndrome: Fact or Fiction
Published 30 May 2012
Most responses
Venous thrombosis in users of non-oral hormonal contraception: follow-up study, Denmark 2001-10 (12 responses)
Published 10 May 2012 - 23:32
The psychiatric oligarchs who medicalise normality (9 responses)
Published 2 May 2012 - 15:42
Are doctors justified in taking industrial action in defence of their pensions? No (8 responses)
Published 8 May 2012 - 12:21
Are doctors justified in taking industrial action in defence of their pensions? Yes (8 responses)
Published 8 May 2012 - 12:21
The hardest thing: admitting error (7 responses)
Published 2 May 2012 - 12:27