General Practice

Attempt at deriving a formula for setting general practitioner fundholding budgets

BMJ 1994; 309 doi: (Published 22 October 1994) Cite this as: BMJ 1994;309:1059
  1. T A Sheldon,
  2. P Smith,
  3. M Borowitz,
  4. S Martin,
  5. R C Hill
  1. NHS Centre for Reviews and Dissemination, University of York, York YO1 5DD
  2. Department of Economics and Related Studies, University of York, York YO1 5DD
  3. Battelle Medical Research and Policy Centre, London WC1
  4. Institute for Research in the Social Sciences, University of York, York YO1 5DD
  5. Centre for Health Economics, University of York, York YO1 5DD
  1. Correspondence to: Dr Sheldon
  • Accepted 19 August 1994


Objective: To explore the possibility of using routine Hospital Episode Statistics, census data, and vital statistics to derive weights for an equitable capitation formula for setting general practitioner fundholding budgets for buying acute hospital services.

Design: Analysis of a routine dataset of 9 million hospital episodes in 1991–2, extracting elective general practitioner fundholding procedures, combined with 1991 census variables, vital statistics, and data on supply of health care at ward level. Costs were attached to each procedure according to the average cost of the relevant “Mersey” band category. Main outcome measures - Variation in age and sex adjusted expenditure per head on fundholding procedures across wards modelled for the impact of health and social needs variables after adjusting for variations in supply.

Results: No sensible simple model including determinants of use other than age and sex could be derived. The most parsimonious but statistically acceptable model showed that though standardised mortality ratio and self reported illness and several social class variables were associated with utilisation, the signs and the size of the coefficients were20coutradictory. The most important explanation of variation was provided by age and sex differences between wards. Conclusions - An equitable system of setting general practitioner fundholders' budgets is needed. In the short term age and sex weighted capitation should form the principal basis of fundholder budgets. Utilisation data at ward level are inadequate for developing a formula which adequately20adjusts for the differences in the health care needs of populations. A capitation formula based on information derived from individual cohort data may be the only means of promoting equity and efficiency and of avoiding discriminating against patients with known high cost health problems.

Policy implications

  • Policy implications

  • General practitioner fundholder budgets are based on a mixture of historical activity and capitation

  • Concerns that allocations to fundholders may be inequitable have led to calls for the budget to be determined by a national formula

  • Age and sex are the most important determinants of variation in utilisation of fundholding procedures at ward level

  • A valid capitation formula which takes into account variations in need for fundholding procedures between practices cannot easily be derived from analysis of routine available data

  • Research following up individuals' health care needs and use is needed in order to derive a more accurate capitation formula based on the characteristics of registered patients


The introduction of general practitioner fundholding means that some of the funds which were allocated to health authorities for secondary care of residents are now distributed to fundholders for buying a specified set of elective acute inpatient and day case procedures (fundholding procedures) and community services.1

The growing importance of fundholding raises important questions concerning equitable and efficient distribution of resources.*RF 1–4* In particular, there is considerable interest in the way in which general practitioner fundholder budgets are determined. Ideally, allocations to fundholders for buying secondary care would be based on a formula which takes into account the number of patients in a practice weighted for the likelihood of those patients needing to use NHS acute care and the associated costs of that care. Such a weighted capitation formula would have two important properties: it would be consistent with the principles and manner in which hospital and community health resources are allocated to health authorities,5,6 and it would promote the equitable allocation of resources between fundholding practice and between populations registered with fundholders, and non-fundholders.4

The white paper Working for Patients clear that a capitation formula was to be the hospital and community health services element the general practitioner fundholder's budget: “each practice's share should be based on the number of patients on its list, weighted for the same population characteristics as…for allocations to districts. There are social and other local features which affect the use of hospital services, and these too will be reflected in the budget.”7 In the initial years of the general practitioner fundholding scheme, however, budgets were based on previous activity. This has perpetuated a wide variation in the amount per patient allocated for purchasing care, with little understanding about how much of this may be justified by a corresponding variation in health care needs.8 As a result of recent guidance from the Department of Health9 new allocations represent a compromise between historical activity levels, capitation “benchmarks” derived from regional or national age and sex averages, and some discretion to take account of local factors.

This paper reports a study which explores the possibility of using routine data to derive a capitation formula for informing the allocation of hospital and community health service resources to general practitioner fundholders. The work was commissioned by the NHS Management Executive in 1993 as part of a larger study at the University of York to develop a national weighted capitation formula for allocating all hospital and community health ervice funds to health authorities in England.6,10 The work was guided throughout by a technical group and steering group appointed by the Department of Health.

Materials and methods

The assumption underlying this study was that variation in the rate at which health services are used reflects variation in the need for health care and variation in the supply of (or access to) health care facilities. The intention was to attempt to isolate indicators of the need for elective fundholding procedures it small areas (after adjusting for supply variations). Age and sex adjusted expenditures per head on elective fundholding procedures were therefore calculated for each small area and were modelled as a function of indicators of supply and indicators of need, as follows: age and sex adjusted expenditure on fundholding procedures=function of (health variables, socioeconomic conditions, supply of health services).

Data on the use of fundholding procedures were obtained from the 1991–2 Hospital Episode Statistics, which record all inpatient and day case episodes in England. Each episode and the procedures within each episode were linked by means of the postcode to one of 4985 small areas. The small areas cover the whole of England and are know as synthetic wards. They comprise aggregations of contiguous local authority wards such that none had a population of less than 5000. The average population was 9643 and the maximum 33 073.

Though concern has been expressed about the quality of the Hospital Episode Statistics, it is generally accepted that the 1991–2 dataset is more accurate and complete than that for previous years. The extract provided to us contained details of the patient's age, sex, and address; the broad specialty grouping; the source and destination of the episode; and a list of up to four procedures. None of the procedures had an invalid procedure code, and of the 3.8 million elective episodes, only 1978 (containing 1061 fundholding procedures) were classified as invalid in the specialty coding. A total of 9 042 169 completed inpatient and day case episodes were recorded across England. These episodes were associated with 8 222 945 procedures, of which 2 748 878 (33.4%) were elective fundholding procedures. The breakdown of episodes and procedures by specialty group is shown in table I.


Proportion of hospital activity which was fundholding

View this table:

The Hospital Episode Statistics extract did not contain detailed procedure codes. Instead, the Office of Population Censuses and Surveys assigned each procedure to one of four categories of fundholding procedures or to the non-fundholding category. The fundholding classification was ranked in increasing order of resource use, taking account of average lengths of stay, the proportion of cases treated without an overnight stay, and the BUPA (British United Provident Association) classification of the complexity of procedures (the “Mersey” bands). Up to four procedures were so coded for each episode. A cost was assigned to each of these procedures based on the average cost per inpatient and per day case procedure for each of the four Mersey bands (see table II). This was derived from detailed costs estimated for each fundholding procedure by East Cheshire Statistical Consultancy. A weighted average cost for each of the four bands used in this study was calculated by creating an average of these detailed costs, weighted according to national activity rates as supplied by the Department of Health. We emphasise that in order to avoid any bias due to the particular characteristics of first wave fundholders and their patients all elective fundholding procedures were included in the analysis irrespective of whether or not patients were registered with a fundholding practice.


Average fundholding procedure costs by Mersey band

View this table:

Because of the possibility that the dataset overrecorded or underrecorded activity each episode was weighted by the ratio of the total number of episodes reported on form KP70 (a manual return) to the total number of episodes recorded in the Hospital Episode Statistics in the district of treatment. The KP70 was considered to yield the more reliable total number of episodes within a district. However, there was rarely substantial deviation from the Hospital Episode Statistics total. Wards with postcodes having more than 500 episodes were excluded because they may have been used as “dump” postcodes by providers in cases where no postcodes had been recorded.

The total observed expenditure per head of population on elective fundholding procedures for each ward could now be calculated. National rates of procedures and associated costs per head of population, disaggregated by age and sex, are shown in table III. This allowed us to calculate an expected expenditure per head for each ward, given the age and sex composition of the ward. The ratio of observed to expected expenditure per head represents the age and sex adjusted rate of expenditure in the ward. The aim of the modelling was to explain variations in this ratio by using health and socioeconomic variables, after taking into account any effect of variations in the access to health services within wards.


Average activity and expenditure per head on elective fundholding procedures

View this table:

Therefore, for each synthetic ward data were assembled relating to supply of health services, mortality experience, and self reported health status and socioeconomic factors derived from the 1991 census. Four measures of the supply of health care services at the ward level were derived. These were the physical proximity to NHS acute hospital beds (assessed by using a database of 1478 hospitals in England, excluding special health authorities, which gave the number of doctors and the address of the principal surgery of all practices in England); access to private hospital beds (assessed by using the 1991 census figure for the number of visitors present in private hospitals on census night by ward); and the provision of nursing home services (measured as the proportion of the population aged 75 or over in nursing or residential homes).6,11

The three access measures were calculated by methods of spatial interaction modelling, which seek to reconcile simultaneously the supply of facilities, their proximity to the small area of interest, and the impact of competing populations and competing supply (see appendix).12 Thus access to NHS inpatient facilities increases with the number of beds in the vicinity or as distance to the facility or population in nearby wards decreases. In effect the access measures yield estimates for each ward of (distance adjusted) acute NHS beds per head, general practitioners per head, and private acute beds per head.

Standardised mortality ratios for all causes of death were available for both sexes and for three age bands: 0–64, 65–74, and 75 and over and combinations of these. Self reported health status was measured by using a question which appeared for the first time in the 1991 census about long term limiting illness: “Does the person have any long-term illness, health problem or handicap which limits his/her daily activities or the work he/she can do?” Respondents were asked to include problems which were due to old age. The prevalence of limiting longstanding illness in a ward was standardised for age and sex in the same way as the standardised mortality ratio.

Many socioeconomic variables used to explore demand for health care may be considered as proxies for the underlying social causes of need, such as poverty. With this in mind, researchers and policy makers have developed several indices which seek to provide a deprivation score for geographical areas. Examples include the Jarman and Townsend indices of deprivation. These indices have been criticised for their lack of theoretical basis and the combination of highly correlated variables. It was though unnecessary to use them in this study, as all the components of a deprivation index were available separately, without constraining the way in which they were combined.

Many socioeconomic variables from the census were created for use in this study (some are shown in the box). In summary the variables covered the following aspects of social and economic circumstances: housing tenure, housing amenities, car ownership, overcrowding, ethnic origin, elderly living alone, lone parents, students, migrants, unemployment, educational qualifications, social class, and non-earning households.

Key to variables used in modelling utilisation

Supply variables

ACCNHSA Access to NHS acute beds

ACCGPS Access to general practitioners

HOMES Proportion of population aged >=75 not in nursing or residential homes

ACCPRI Access to private hospital beds Health variables

SMR074 Standardised mortality ratio, ages 0–74

SMR75 Standardised mortality ratio, ages >=75

SIRI074 Standardised illness ratio, ages 0–74

SIRI75 Standardised illness ratio, ages >=75

LONGILL Proportion of total population with limiting long term illness

PERMSICK Proportion of residents of working age permanently sick Socioeconomic variables

UNEMP Proportion of economically active unemployed subjects

SCARER Proportion of dependants in single carer households

CHNEARN Proportion of children in non-earning families

NCENHEAT Proportion of residents in households lacking central heating

NOCAR Proportion of residents in households with no car

OVERCROWD Proportion of residents in households with more than one person per room

HNEWCOMM Proportion of residents in households with head born in new Commonwealth

NEWCOMM Proportion of residents born in new Commonwealth

BLACK Proportion of residents not in black ethnic groups

ASIAN Proportion of residents not in Indian, Pakistani, and Bangladeshi groups

STUDENT Proportion of working age population who are students

QUAL Proportion of subjects aged >=18 with some qualification

CLASSI/II Proportion of subjects in households with head in class I or II

MANUAL Proportion of economically active residents in manual social classes

NONMAN Proportion of economically active residents in non-manual social classes

HMAN Proportion of subjects in households with head in manual class

PCTURBAN Proportion of population living in urban enumeration districts

DENSITY Subjects divided by hectares

OWNOCC Proportion of residents in owner occupied accommodation

PRIVRENT Proportion of residents in privately rented accommodation

AMENITY Proportion of residents not in households lacking or sharing the use of a bath or shower or inside lavatory, or both

OLDALONE Proportion of subjects aged >=75 living alone

MOVED Proportion of residents with a different address from that of a year before

PENSALONE Proportion of subjects of pensionable age living alone

For this study the range of issues covered was likely to be enough to capture the important social causes of the need for health care. Indeed, because there is high correlation between many socioeconomic variables, the exclusion of a variable from any particualr model did not necessarily mean that the phenomena it was measuring were not being captured. We emphasise that most of the variables were acting as proxies for unmeasured characteristics of the ward. Thus, for example, lack of car ownership or unemployment may be reflecting aspects of poverty rather than simply measuring narrow economic status.

We found it necessary to use two least squares regression methods to estimate the equation shown in the appendix because of the problem of the simultaneous relation (endogeneity) between use and supply.13 The method entails, firstly, regressing each of the supply variables on the set of social and health variables (instruments), then using the predicted values of the supply variables as explanatory variables in the regression in place of their actual values. This approach should yield consistent estimates of all coefficients in the regression of utilisation on needs and supply. Standard errors are adjusted to take account of the endogeneity. Using this method means that the usual measures of the goodness of fit of a regression, such as the R2, do not have the same meaning as in ordinary least squares regression.

Regional dummy variables (relating to the former 14 regional health authorities) were included in the model to take into account shifts between regions which might reflect historical differences in, for example, funding, policy, or data collection procedures that should not be incorporated into a national formula. For this reason they are not reported here. Throughout, the models performed better when natural logarithms of all variables were used. The relation assumed between variables is therefore assumed to be multiplicative rather than additive. More details on the data collected and methods used to estimate the equations have been reported.6,11


The simplest model of use of general practitioner fundholding procedures which remained statistically acceptable is shown in table IV. B is the size of the coefficient (elasticity) of the variable, β the standardised effect size (coefficient B divided by the standard deviation of the associated variable). The larger the β value the greater is likely to be the actual impact of the variable in any formula for resource allocation. A key to the variables listed is given in the box. The results suggest that access to general practitioners (ACCGPS) was positively related to utilisation of fundholding procedures and that the provision of nursing home places was negatively associated with utilisation. (Note that the variable HOMES was the proportion of the population aged >=75 not in nursing or residential homes.) Access to NHS acute beds (ACCNHSA) and private hospital provision (ACCPRI) were not statistically significant at the 1% level.

The results for the health status and socioeconomic variables were more problematical. Though the direct measures of health status - the standardised mortality ratio (SMR) and standardised illness ratio (SIR) based on the self reported longstanding limiting illness census question for the under and over 75s - were positively associated with expenditure, the relation between the socioeconomic variables and expenditure was hard to interpret (table IV). In general, wards with a higher proportion of students and people with qualifications had lower utilisation rates, whereas apparently more deprived wards, with a higher proportion of residents in overcrowded accommodation, had higher utilisation rates. However, indicators of poverty such as the proportion of residents in households with no car (NOCAR) and the proportion of children in non- earning families (CHNEARN) were negatively associated with utilisation.


Two stage least squares regression of per capita expenditure on fundholding procedures (best model). (Regional dummies not shown)

View this table:

In addition, there were contradictory signs on the coefficients of two other groups of social variables. For example, all three social class variables (CLASSI/II, MANUAL, NONMAN) were positively correlated with relative expenditure. This implied that utilisation increased as the proportion of a ward's population which was in the manual social classes increased as well as with an increase in the proportion of social classes I and II. Similarly, the three ethnic variables (HNEWCOMM, NEWCOMM, BLACK) also opposed each other. Reducing the number of social class or ethnic variables to one each significantly reduced the adequacy of the model, which then failed the statistical tests of model specification.14 The introduction of interaction terms in the equation did not resolve the problem.

These results were not the product of the statistical methods used. For example, when the analysis was repeated with the more usual (ordinary least squares) regression method being used on the subset of health and social needs variables which were significant in table IV (excluding the supply variables) the same problems occurred (table V). Though there was a positive expenditure gradient with health variables, some social variables (children in non-earning families and non-manual) had a negative sign. This contradicted the more intuitively sensible results for similar variables such as households with head in manual class, which were positively correlated with expenditure.


Ordinary least squares regression of per capita expenditure against health and social variables

View this table:

The practice of using simple regression of age and sex adjusted per capita expenditure on elective fundholding procedures against a small subset of variables (such as standardised mortality ratio and unemployment) may therefore result in the relation between needs variables and utilisation being wrongly estimated. For example, a regression on standardised mortality ratio, standardised illness ratio, and manual social class showed no contadictory signs (though standardised mortality ratio was no longer statistically significant, the confidence interval included zero; table VI). However, unemployment entered the model with a significant negative coefficient (table VII). This showed that though it is fairly easy to derive an apparently plausible equation by using simple regression techniques on a very limited set of variables, the coefficients may be misleading and the model possibly seriously misspecified.


Ordinary least squares regression of per capita expenditure against three key variables

View this table:

Ordinary least squares regression of per capita expenditure against four key variables

View this table:

Regressing observed expenditure per head on fundholding procedures unadjusted for age and sex against dummy variables for each age and sex band showed that 26% of the total variation across wards over England was statistically explained by age and sex differences between the wards. A further 20% was explained by our preferred statistical model in table V. However, the extent to which all these latter variables reflected legitimate underlying need and therefore should be incorporated in a formula was unclear.


Despite having developed a statistically robust, theoretically sound, and intuitively appealing model for the whole of acute sector activity,10,11 we have derived no sensible model for the impact of health and socioeconomic indicators on the use of elective fundholding procedures. There are several possible hypotheses which could be put forward to explain these results, though it is impossible to test them by using these data.

Firstly, though the average social gradient for all fundholding procedures is likely to have a positive slope (as indicated by table IV and table V-VII) it is probably “flatter” than for all acute health care procedures. This is because there is no obvious role of socioeconomic factors in the epidemiology of many of the conditions corresponding to the fundholding procedures.

Secondly, all patients receiving these elective procedures must first be referred by a general practitioner. Therefore, general practitioners' referral behaviour will have an influence on the extent to which variations in the need presented to general practitioners will be translated into corresponding variation in utilisation. There is much published work documenting the wide variations in general practitioner referral behaviour independent of patient need.15 Of particular interest is the possibility (supported by some empirical evidence) that there might be an inverse gradient between social class and referral rates by general practitioners or treatment rates by hospital consultants.16 In other words, the higher social classes may be better at getting what they want. If this is true for fundholding procedures the observed overall utilisation rate may represent the net effect of a slightly positive needs social gradient and a countervailing negative referral and treatment social gradient.

Lastly, the relation between needs for elective fundholding procedures and health and socioeconomic variables may not accurately be reflected in hospital utilisation because of social class differences in patient thresholds for consultation. In addition, some groups may make more use of private care, while others may be more likely to use emergency services for the same level of need. These, combined with a large amount of statistical “noise” due to random variations,17 may help to explain the inconclusive results.


Though it is likely that there is some social gradient in the need for elective fundholding procedures, using the data available we could not identify a set of health or socioeconomic needs variables which we feel confident captures variations in underlying need. Ideally, any formula will incorporate only the relevant social and health factors, excluding other, non-needs related variables. However, there seems to be no direct way of disentangling the various sources of variation using these routine cross sectional data.

We have shown that age and sex distribution is a very important determinant of the ward variation in expenditure on fundholding procedures. This has also been shown to be important in explaining variation at the individual level.18 Therefore, it is important that at least an age and sex adjusted capitation formula should be used to inform the budget setting process for general practitioner fundholders. Table II presents the average national age and sex weights in 1991-2 which could be used as the basis for such a formula, though we emphasise that the cost data refer to the relative use made by different demographic groups and should not be used as absolute benchmarks. Clearly many will wish to modify these data locally to take account of variations in social factors or morbidity between practices. However, as this study has shown, it is not clear how these factors may indicate relative need.

The principle of using an age and sex capitation formula can also be used to determine the size of the overall budget allocated for fundholding procedures, which ensures that the populations registered with both fundholders and other general practitioners receive a fair share of the resources.*RF 3–4* This can be done either by setting a national fixed capitation per head (adjusted for age and sex), independent of the location of the fundholding practice, or by identifying a proportion of the total local hospital and community health service budget as relevant for fundholding elective activity for the local population, and then allocating this between practices on the basis of age and sex relativities. If the second course is adopted and the national formula for district allocations takes account of mortality, morbidity, and social factors as recommended by the York team's work on a national formula for allocation to health authorities,6,10,11 then fundholding budgets will to an extent reflect district variations in need.

The use of a capitation formula introduces a theoretical incentive to discriminate against the few patients with known high cost health problems (“cream skimming”) and generate surpluses.*RF 18–22* In addition, fundholders have an incentive to attract patients more likely to use private health care for elective fundholding procedures. An age and sex based capitation formula would not eliminate these problems because it does not take into account other important factors which explain variations in expenditure between population groups such as disability, functional health status, and indicators of chronic medical conditions nor use of private acute services.*RF 23–26* A formula which could adjust for these factors would need to be derived from epidemiological cohort studies following up individuals' health care experiences.18,27 However, because doctors will always have more relevant information on the likely cost of individual patients than any budget setter, some incentive to discriminate against patients whose expected costs exceed their capitations may always be present.

This work was funded by the Department of Health. We acknowledge the helpful comments of Anita Bird, Professor Alan Maynard, and two anonymous referees. We also thank the members of the technical and steering groups appointed by the Department of Health for their invaluable advice and support.


Method of constructing supply variables

For NHS inpatient facilities the relative accessibility A1 of zone i to acute hospital beds is given by:

Embedded Image

where Pi is the population of zone i, Sd is the number of available beds at hospital d, cid is the straight line distance between zone i and hospital d (inferred from the zone's centroid and the postcode of the hospital), and f(.) is an inverse square distance decay function.

Though superficially opaque, the equation can be interpreted simply as the ratio of hospital size (weighted by distance) to population (also weighted by distance). It is directly analogous to the “beds per head” ratio but takes account of distance and competition from other wards.

Similar methods were used for private inpatient beds, except that details of individual hospitals Dd were replaced by numbers of filled inpatient beds within wards. The location of these beds was taken to be the ward centroid. For general practitioners Sd was the number of general practitioners employed within the practice and the location was the address of the main surgery.


  1. 1.
  2. 2.
  3. 3.
  4. 4.
  5. 5.
  6. 6.
  7. 7.
  8. 8.
  9. 9.
  10. 10.
  11. 11.
  12. 12.
  13. 13.
  14. 14.
  15. 15.
  16. 16.
  17. 17.
  18. 18.
  19. 19.
  20. 20.
  21. 21.
  22. 22.
  23. 23.
  24. 24.
  25. 25.
  26. 26.
  27. 27.
View Abstract