BMJ 1998;316:549 (14 February)

Education and debate

Statistics notes: Sample size in cluster randomisation

Sally M Kerry, statistician,a J Martin Bland, professor of medical statistics b

a Division of General Practice and Primary Care, St George's Hospital Medical School, London SW17 0RE, b Department of Public Health Sciences

Correspondence to: Mrs Kerry


right arrow   Abstract
up arrowTop
dotAbstract
down arrowReferences

Techniques for estimating sample size for randomised trials are well established,1 2 but most texts do not discuss sample size for trials which randomise groups (clusters) of people rather than individuals. For example, in a study of different preparations to control head lice all children in the same class were allocated to receive the same preparation. This was done to avoid contaminating the treatment groups through contact with control children in the same class.3 The children in the class cannot be considered independent of one another and the analysis should take this into account.4 5 There will be some loss of power due to randomising by cluster rather than individual and this should be reflected in the sample size calculations. Here we describe sample size calculations for a cluster randomised trial.

For a conventional randomised trial assessing the difference between two sample means the number of subjects required in each group, n, to detect a difference of d using a significance level of 5% and a power of 90% is given by n=21s2/ d2 where s is the standard deviation of the outcome measure. Other values of power and significance can be used.1

For a trial using cluster randomisation we need to take the design into account. For a continuous outcome measurement such as serum cholesterol values, a simple method of analysis is based on the mean of the observations for all subjects in the cluster and compares these means between the treatment groups. We will denote the variance of observations within one cluster by sw2 and assume that this variance is the same for all clusters. If there are m subjects in each cluster then the variance of a single sample mean is sw2/ m. The true cluster mean (unknown) will vary from cluster to cluster, with variance sc2. The observed variance of the cluster means will be the sum of the variance between clusters and the variance within clusters—that is, variance of outcome=sc2+sw2/m. Hence we can replace s2 by sc2+sw2/m in the formula for sample size above to obtain the number of clusters required in each intervention group. To do this we need estimates of sc2 and sw2.

For example, in a proposed study of a behavioural intervention in general practice to lower cholesterol concentrations practices were to be randomised into two groups, one to offer intensive dietary intervention by practice nurses using a behavioural approach and the other to offer usual general practice care. The outcome measure would be mean cholesterol values in patients attending each practice one year later. Estimates of between practice variance and within practice variance were obtained from the Medical Research Council thrombosis prevention trial6 and were sc2=0.0046 and sw2=1.28 respectively. The minimum difference considered to be clinically relevant was 0.1 mmol/l. If we recruit 50 patients per practice, we would have s2=sc2+sw2/m=0.0046+1.28/50=0.0302. The number of practices is given by n=21x0.0302/0.12=63 in each group. We would require 63 practices in each group to detect a difference of 0.1 mmol/l with a power of 90% using a 5% significance level—a total of 3150 patients in each group.

It can be seen from the formula for the variance of the outcome that when the number of patients within a practice, m, is very large, sw2/m will be very small and so the overall variance is roughly the same as the variance between practices. In this situation, increasing the number of patients per practice will not increase the power of the study. The 1 shows the number of practices required for different values of m, the number of subjects per practice. In all situations the total number of subjects required is greater than if simple random allocation had been used.


 
View this table:
[in this window]
[in a new window]
 
Total number of practices required to detect a difference of 0.1 mmol/l cholesterol with 90% power at 5% significance level

The ratio of the total number of subjects required using cluster randomisation to the number required using simple randomisation is called the design effect. Thus a cluster randomised trial which has a large design effect will require many more subjects than a trial of the same intervention which randomises individuals. As the number of patients per practice increases so does the design effect. In the 1, the design effect is very small when m is less than 10. This would involve recruiting a total of 558 practices, and the nature of the intervention and difficulties in recruiting practices made this impractical. Thus it was decided to recruit fewer practices. The design effect of using 126 practices with 50 patients from each practice was 1.17. This design requires the total sample size to be inflated by 17%. If the study involves training practice based staff it may be cost effective to reduce the number of practices even further. If we chose to use 32 practices then we would need 500 patients from each practice and the design effect would be 2.98. Thus the cluster design with 32 practices would require the total sample size to be trebled to maintain the same level of power.

We shall discuss the use of the intracluster correlation coefficient in these calculations in a future statistics note.


right arrow   References
up arrowTop
up arrowAbstract
dotReferences

  1. Florey C du V. Sample size for beginners. BMJ 1993;306:1181-4.
  2. Machin D, Campbell MJ. Statistical tables for the design of clinical trials. Oxford: Blackwell, 1987.
  3. Chosidow O, Chastang C, Brue C, Bouvet E, Izri M, Monteny N, et al. Controlled study of malathion and d-phenothrin lotions for Pediculus humanus var capitis-infested schoolchildren. Lancet 1994;344:1724-7.
  4. Bland JM, Kerry SM. Trials randomised in clusters. BMJ 1997;315:600. [Free Full Text]
  5. Kerry SM, Bland JM. Analysis of a trial randomised in clusters. BMJ 1998;316:54. [Free Full Text]
  6. Meade TW, Roderick PJ, Brennan PJ, Wilkes HC, Kelleher CC. Extracranial bleeding and other symptoms due to low dose aspirin and low intensity oral anticoagulation. Thromb Haemostasis 1992;68:1-6. [Medline]

Related Article

Internal and external validity of cluster randomised trials: systematic review of recent trials
Sandra Eldridge, Deborah Ashby, Catherine Bennett, Melanie Wakelin, and Gene Feder
BMJ 2008 336: 876-880. [Abstract] [Full Text] [PDF]

This article has been cited by other articles:

  • Lovell, D. P., Omori, T. (2008). Statistical issues in the use of the comet assay. Mutagenesis 23: 171-182 [Abstract] [Full text]  
  • Eldridge, S., Ashby, D., Bennett, C., Wakelin, M., Feder, G. (2008). Internal and external validity of cluster randomised trials: systematic review of recent trials. BMJ 336: 876-880 [Abstract] [Full text]  
  • Cox, H., Puffer, S., Morton, V., Cooper, C., Hodson, J., Masud, T., Oliver, D., Preedy, D., Selby, P., Stone, M., Sutcliffe, A., Torgerson, D. (2008). Educating nursing home staff on fracture prevention: a cluster randomised trial. Age Ageing 37: 167-172 [Abstract] [Full text]  
  • Banks, P., Macfarlane, T. V. (2007). Bonded versus banded first molar attachments: a randomized controlled clinical trial. J. Orthod. 34: 128-136 [Abstract] [Full text]  
  • Manning, N., Chadwick, S. M., Plunkett, D., Macfarlane, T. V. (2006). A randomized clinical trial comparing 'one-step' and 'two-step' orthodontic bonding systems. J. Orthod. 33: 276-283 [Abstract] [Full text]  
  • Eldridge, S. M, Ashby, D., Kerry, S. (2006). Sample size for cluster randomized trials: effect of coefficient of variation of cluster size and analysis method. Int J Epidemiol 35: 1292-1300 [Abstract] [Full text]  
  • Guarino, P., Elbourne, D., Carpenter, J., Peduzzi, P. (2006). Consumer involvement in consent document development: a multicenter cluster randomized trial to assess study participants' understanding. Clin Trials 3: 19-30 [Abstract]  
  • Localio, A. R., Berlin, J. A., Ten Have, T. R., Kimmel, S. E. (2001). Adjustments for Center in Multicenter Studies: An Overview. ANN INTERN MED 135: 112-123 [Abstract] [Full text]  
  • Peters, T. J, Graham, A., Salisbury, C., Moore, L., Underwood, M., Eldridge, S., Gibson, P. G, Shah, S., Sindhusake, D., Wang, H., Peat, J. K, Henry, R. L (2001). Peer led programme for asthma education in adolescents. BMJ 323: 110-110 [Full text]  
  • Wilson, S., Delaney, B. C, Roalfe, A., Roberts, L., Redman, V., Wearn, A. M, Hobbs, F D R. (2000). Randomised controlled trials in primary care: case study. BMJ 321: 24-27 [Full text]  
  • Tai, S. S., Iliffe, S. (2000). Considerations for the design and analysis of experimental studies in physical activity and exercise promotion: advantages of the randomised controlled trial. Br. J. Sports. Med. 34: 220-224 [Full text]  
  • Campbell, M J (2000). Cluster randomized trials in general (family) practice research. Stat Methods Med Res 9: 81-94 [Abstract]  
  • Hilton, S., Doherty, S., Kendrick, T., Kerry, S., Rink, E., Steptoe, A. (1999). Promotion of healthy behaviour among adults at increased risk of coronary heart disease in general practice: methodology and baseline data from the Change of Heart study. Health Education Journal 58: 3-16 [Abstract]  

Rapid Responses:

Read all Rapid Responses

Intracluster correlation underestimated?
Johannes C van der Wouden
bmj.com, 26 Apr 1998 [Full text]
Re: Intracluster correlation underestimated?
J Martin Bland, et al.
bmj.com, 8 Jun 1998 [Full text]



Student BMJ

Risk of surgery for inflammatory bowel disease: record linkage studies

What can you learn from this BMJ paper? Read Leanne Tite's Paper+

www.student.bmj.com

Listen to the latest BMJ Interview