BMJ 1995;310:1373-1374 (27 May)

General practice

Sociodemographic variables for general practices: use of census data

F Azeem Majeed, lecturer in public health medicine,a Derek G Cook, senior lecturer in epidemiology,a Jan Poloniecki, lecturer in medical statistics,a Jo Griffiths, statistician,a Caroline Stones, information services manager b

a Department of Public Health Sciences, St George's Hospital Medical School, London SW17 0RE, b Merton, Sutton and Wandsworth Family Health Services Authority, London SW15 2SW

Correspondence to: Dr Majeed.

Measures of the social, ethnic, and demographic characteristics of general practice populations are essential both for planning health services and for research. Such measures can be derived by combining census data for electoral wards with postcoded data from a family health services authority's age-sex register.1 Because electoral wards have large populations (typically 5000-15000 people), the patients registered with individual general practices will rarely be representative of a ward, and this will reduce the accuracy of census derived variables. Enumeration districts, however, have smaller populations (typically 200-600 people). We (a) derived estimates of age structure of practice populations based on enumeration districts and electoral wards and compared them with those obtained from the authority's age-sex registry and (b) compared the predictive power of variables derived from enumeration districts and electoral wards in explaining the variation in breast cancer screening rates among these practices.

Patients, methods, and results

The family health services authority provided a database containing the age, general practice, and postcode of all 601330 patients living in Merton, Sutton, and Wandsworth and registered with 131 general practices. An enumeration district was assigned to each postcode by means of a table.2 Of the 11572 postcodes on the age-sex register, 166 could not be assigned to an enumeration district because they were not listed in the table. The 1149 patients living at these postcodes were excluded from the calculation of the census derived variables, as were the 1543 people living in importing and special enumeration districts (importing enumeration districts contain census data on people living in other enumeration districts, and census data for most special enumeration districts are suppressed). When postcodes lay in only one enumeration district (81.7% (9460) of postcodes) the value of the relevant census variable for the enumeration district was taken as the value for the postcode. When postcodes lay in more than one enumeration district (16.8% (1946) of postcodes) a weighted average of census data for the enumeration districts was taken as the value for the postcode, with the weights being the number of households in each part postcode unit.2 For each census variable we assigned a value to each patient based on their postcode. For each general practice we then calculated the average of these assigned values for the registered patients. Apart from the need to use the table, the method is similar to that described previously for electoral wards.1

The correlations between the observed and expected age structure of the 131 practices were higher in each age group when enumeration district data were used to calculate the expected age variables (table); in all but one case the improvements were significant. The interquartile ranges of the differences between the observed and expected age structure of the practices were also smaller when enumeration district data were used to calculate the expected values (table).


Agreement between age distribution in authority's age-sex register and distributions derived from census data for electoral wards and enumeration
districts for 131 general practices in Merton, Sutton, and Wandsworth
------------------------------------------------------------------------------------------------------------------------------------
                                                                              Absolute agreement+
------------------------------------------------------------------------------------------------------------------------------------
                      Linear agreement*                        Median difference                    Size of interquartile range
Age group  -------------------------------------------------------------------------------------------------------------------------
 (years)   Enumeration districts   Electoral wards  Enumeration districts  Electoral wards   Enumeration districts  Electoral wards
------------------------------------------------------------------------------------------------------------------------------------
 0-4               0.49                0.34++               -0.9                -0.8                  1.9                 2.1
 5-14              0.74                0.66++                0.0                 0.1                  2.0                 2.3
15-24              0.42                0.39                 -1.9                -1.9                  2.4                 2.2
25-34              0.74                0.67++                1.1                 1.2                  4.6                 4.8
35-44              0.44                0.33++                0.7                 0.7                  2.0                 2.0
45-54              0.55                0.46++                1.2                 1.1                  1.9                 1.9
55-64              0.45                0.36++                0.0                 0.0                  1.9                 2.1
65-74              0.63                0.58++               -0.1                -0.1                  1.8                 2.2
75-84              0.59                0.50++               -0.3                -0.4                  1.9                 2.1
>/=85              0.55                0.44++                0.2                 0.1                  0.9                 0.9
------------------------------------------------------------------------------------------------------------------------------------
*Correlation coefficient.
+Observed percentage of practice patients minus expected percentage from census data.
++Difference significant at P=0.05 by method of Hotelling.3

To compare the predictive power of variables derived from enumeration districts and electoral wards we performed regression analysis of breast cancer screening rates on the eight Jarman variables and the census derived estimates of list inflation. Variables derived from enumeration districts gave a significantly better fit than those from electoral wards (P=0.002 by the method of Royston and Thompson4), explaining 58% compared with 53% of the variation in screening rates between practices.

Comment

Census derived variables for general practices have many potential uses--for example, they could be used to set targets for cervical cancer screening according to the populations that practices serve. We have shown that postcoded data in family health services authority age-sex registers can be used with census data to produce such variables for general practices. Despite the inaccuracies associated with small area census data,5 variables derived from enumeration districts were more accurate than those derived from electoral wards in measuring the age structure of practice populations and explaining the variation in breast cancer screening rates. Census derived variables could also be produced for schools and hospitals, allowing league tables comparing their performance to be adjusted for the social make up of their catchment populations.

We thank Dr David Martin, lecturer in geography at the University of Southampton, and Dr Patrick Royston, reader in medical statistics, Royal Postgraduate Medical School, for their help. We also thank the University of Manchester Computer Centre for providing access to census data and to the postcode enumeration district table, and the South West London breast screening service for permission to use its data.

  1. Majeed FA, Cook DG, Anderson HR, Hilton S, Bunn S, Stones C. Using patient and general practice characteristics to explain variation in cervical smear uptake rates. BMJ 1994;308:1272-6. [Abstract/Free Full Text]
  2. Martin D. Postcodes and the 1991 census of population: issues, problems and prospects. Transactions of the Institute of British Geographers 1992;17:350-7. [Medline]
  3. Hotelling H. The selection of variates for use in prediction with some comments on the general problems of nuisance parameters. Annals of Mathematical Statistics 1940;ii:271-83.
  4. Royston P, Thompson SG. Comparing non-nested regression models. Biometrics (in press).
  5. Morphet C. The interpretation of small area census data. Area 1992;24:63-72.
(Accepted 17 February 1995)


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to StumbleUpon StumbleUpon   Add to Technorati Technorati    What's this?

This article has been cited by other articles:

  • FONE, D. L., DUNSTAN, F., JOHN, A., LLOYD, K. (2007). Associations between common mental disorders and the Mental Illness Needs Index in community settings: Multilevel analysis. Br. J. Psychiatry 191: 158-163 [Abstract] [Full text]  
  • MacDonald, B. K., Cockerell, O. C., Sander, J. W. A. S., Shorvon, S. D. (2000). The incidence and lifetime prevalence of neurological disorders in a prospective community-based study in the UK. Brain 123: 665-676 [Abstract] [Full text]  
  • Connolly, V, Unwin, N, Sherriff, P, Bilous, R, Kelly, W (2000). Diabetes prevalence and socioeconomic status: a population based study showing increased prevalence of type 2 diabetes mellitus in deprived areas. J. Epidemiol. Community Health 54: 173-177 [Abstract] [Full text]  
  • Baines, D. L, Parry, D. J (2000). Analysis of the ability of the new needs adjustment formula to improve the setting of weighted capitation prescribing budgets in English general practice. BMJ 320: 288-290 [Full text]  
  • Naish, J., Moser, K., Sturdy, P., Carter, Y. H. (1999). Using administrative data for primary care research: the City and East London General Practice Database Project. Health Informatics Journal 5: 30-34 [Abstract]  
  • Gilley, J., Majeed, A. (1999). Meeting the information and budgetary requirements of primary care groups • Commentary: Accurate information may be difficult to produce. BMJ 318: 168-170 [Full text]  
  • Majeed, A., Head, S. (1998). Controversies in primary care Setting prescribing budgets in general practice • Capitation based prescribing budgets will not work. BMJ 316: 748-753 [Full text]  
  • Carlisle, R., Groom, L. M, Avery, A. J, Boot, D., Earwicker, S. (1998). Relation of out of hours activity by general practice and accident and emergency services with deprivation in Nottingham: longitudinal survey. BMJ 316: 520-523 [Abstract] [Full text]  
  • Williams, T., Jackson, A., Turbitt, D., Scrivener, G., Lloyd, D., Sturdy, P., Pereira, F., Hull, S., Carter, Y., Naish, J., Harvey, C., Hippisley-Cox, J., Hardy, C., Pringle, M., Carlisle, R., Fielding, K., Chilvers, C. (1997). Effect of deprivation on general practitioners' referral rates. BMJ 315: 882-883 [Full text]  
  • Majeed, A. (1996). Commentary: The basis of a more rational method of funding primary medical care?. BMJ 312: 1012-1013 [Full text]  
  • Scrivener, G., Lloyd, D. C (1996). Allocating census data to general practice populations. BMJ 312: 249b-249 [Full text]  
  • Majeed, F A., Cook, D. G, Poloniecki, J., Martin, D. (1995). Using data from the 1991 census. BMJ 310: 1511-1514 [Abstract] [Full text]  



Access jobs at BMJ Careers
Whats new online at Student 

BMJ