Jump to: Page Content, Site Navigation, Site Search,
You are seeing this message because your web browser does not support basic web standards. Find out more about why this message is appearing and what you can do to make your experience on this site better.
BMJ 2004;329:966-968 (23 October), doi:10.1136/bmj.329.7472.966
Stephen Senn, professor of statistics1
1 Department of Statistics, University of Glasgow, Glasgow G12 8QQ stephen{at}stats.gla.ac.uk
Most drug trials assume that patients respond consistently to treatment, but the assumption is rarely tested. If patients vary randomly in their response to a drug rather than some patients never responding, searches for a genetic basis for non-response are futile
One common interpretation is that the treatment works for 70% of patients 100% of the time and for 30% of the patients 0% of the time. However, nothing in the data forbids a radically different interpretationnamely, that the treatment works in 100% of the patients 70% of the time. In the first case, ability to succeed on treatment is a permanent feature of the patient. In the second case, individual response cannot be predicted: the patients are indistinguishable from each other regarding response to treatment. They sometimes respond and they sometimes do not. Intermediate cases between these two extremes are, of course, also possible.
My first concerns a statement of Allen Roses, which was reported by Richard Smith, the former editor of the BMJ, as follows:
| [The] worldwide vice president of genetics at GlaxoSmith Kline, is reported on the front page of the Independent (8 December, p 1) as saying: "Our drugs don't work on most patients"... He is an enthusiast for pharmacogenomics and hopes that greater understanding of genetics will mean that we will be able to identify with a "simple genetic test" people who will respond to drugs and design drugs for individuals rather than populations. We have, however, been hearing this tune for a long time, and it's hard to see the business model for individually tailored drugs.1
|
The last sentence here is the wisest. We have, indeed, been hearing this for a long time. Neither Smith nor Roses, nor indeed anybody else, would be in a position to tell whether the drugs concerned work moderately well for all patients or extremely well for some and not at all for others, for the simple reason that GlaxoSmithKline, like all other drug companies, runs almost no trials that would be capable of identifying one explanation from the other.
|
|
Is individual response to a drug consistent? Credit: BSIP/PIKO/SPL
|
My second example comes from an article in the BMJ in 1998 in which Guyatt et al claimed: "A method for estimating the proportion of patients who benefit from a treatment when the outcome is a continuous variable has been developed."2 The method is most simply illustrated for a crossover trial. It consists of calculating for a given patient the difference between treatment and control and comparing this with some agreed standard, say a clinically relevant difference, to judge whether the observed difference is important.
But the same problem arises as with our erectile dysfunction example. Consider a crossover trial in asthma comparing salmeterol with salbutamol in which it is judged that a difference in forced expiratory volume in one second of 200 ml is clinically relevant. The trial is run, and 24 out of 32 patients exhibit a difference at least as great as this, whereas eight do not. The 24 are labelled as responders and the other eight as non-responders, and we conclude that salmeterol produces a clinically relevant superior response to salbutamol for three out of every four patients, or at least this is what Guyatt et al would invite us to believe.
In fact, things are not so simple. The table shows two extreme possibilities when we repeat the whole experiment again, so that we now have two comparisons of salbutamol and salmeterol for every patient. In the first case, we have perfect correlation between the responses in the two crossover trials and in the second we have independence. The two cases are radically different and are identifiable by virtue of the fact that the effect of each treatment has been measured in more than one period. It is the pattern of joint responses that permits the identification of the case. The margins on the table are the same in both cases and do not permit identification.
|
In fact, there is a further difficulty. Suppose the average effect in such a trial is greater than the clinically relevant difference of 200 ml. If there is any within patient variability due to either pure measurement error or random temporal fluctuation in the state of the patient, then the observed difference between measurements on the different drugs for a given patient may be less than the clinically relevant difference, even though the true difference is greater than it. Suppose that this within-patient standard deviation is 100 ml. Then it can be calculated (see box 1) that even if the true effect were constant and equal to 250 ml for every single patient, and hence greater than the clinically relevant difference of 200 ml, on average 36% would randomly fail to show such a clinically relevant difference between treatments.
My third example comes from a re-analysis of the
blocker heart attack trial (BHAT) by Horwitz et al.3 Here the title of the article says it all: "Can treatment that is helpful on average be harmful to some patients?" As the authors put it:
| The 31 centers were divided into 21 dominant centers (mortality rates higher for placebo than propranolol) and 10 divergent centers (higher mortality rates for patients randomised to propranolol). Overall, compared to placebo, propranolol reduced the risk of dying for the "average" patient from 9.8 to 7.2%. Results for patients in dominant centers (RR = 0.50) were significantly different from those in divergent centers (RR = 1.33).
|
This use of a significance test on groups of centres that are identified only by result is, of course, quite illegitimate. The authors continued:
| We conclude that differences in results across centers of a multicenter RCT may reflect important distinctions in the clinical conditions of enrolled subjects. These distinctions help to identify subgroups of patients in which treatment that has an average overall benefit may be harmful for some patients.
|
This may, of course, be true in general, but unfortunately for Horwitz et al it is not true of the BHAT study. The study was re-analysed by Senn and Harrell using a random effects model, and this analysis produced a result that even they did not expect: there was no variation between centres above and beyond that ascribable to random variation (box 2).4 5
|
Lesson for pharmacogenomics
The lesson for those in pharmacogenomics is the following. To the extent that the purpose of such research is to identify genetic factors governing individual response to treatment, it is founded on a largely untested assumptionnamely, that such consistent individual responses exist. Patient by treatment interaction (that is to say individual response to treatment) provides an upper bound to gene by treatment interaction (differential response by genetic subgroups) because patients differ by more than their genes.6 Kalow et al therefore suggest that when the disease is chronic and crossover trials with repeat administration are possible (that is, a series n-of-1 trials) they should be carried out to identify disease and treatment combinations in which individual response is important as a preliminary step before looking for genetic factors.7 Unless patient by treatment interaction exists, it is pointless looking for gene by treatment interactions, and patient by treatment interaction can be examined only by repeated crossover trials.8 Such trials can then be analysed using random effect models in a way that will permit the resolution of variability into various sources: the overall effect of treatment, variability between patients, variability within patients, and patient by treatment interaction.9
|
|
The pharmaceutical industry has rarely, if ever, carried out the sort of trial that would permit identification of patient by treatment interaction. Thus statements that the drugs don't work on most people are based on mere supposition; the drugs may work moderately well for all people. To identify those drug and disease combinations for which individual response is important and hence for which genetic factors may be, it will be important to plan and analyse carefully.
Competing interest. SS is a consultant to the pharmaceutical industry.
![]()
CiteULike
Complore
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
Read all Rapid Responses
UK medical students have published unreleased government plans to restrict failed asylum seekers' access to medical care