Jump to: Page Content, Site Navigation, Site Search,
You are seeing this message because your web browser does not support basic web standards. Find out more about why this message is appearing and what you can do to make your experience on this site better.
Published 15 December 2008, doi:10.1136/bmj.a2664
Cite this as: BMJ 2008;337:a2664
Adrian M Grant, professor of health services research1, Samantha M Wileman, trial coordinator1, Craig R Ramsay, senior statistician1, N Ashley Mowat, physician2, Zygmunt H Krukowski, surgeon2, Robert C Heading, physician3, Mark R Thursz, physician4, Marion K Campbell, director1, and the REFLUX Trial Group
1 Health Services Research Unit, Health Sciences Building, University of Aberdeen, Foresterhill, Aberdeen AB25 2ZD, 2 Aberdeen Royal Infirmary, Foresterhill, Aberdeen AB25 1LD, 3 Department of Gastroenterology, Royal Infirmary, Glasgow G4 0SF, 4 Faculty of Medicine, Imperial College, St Marys Campus, London W2 1PG
Correspondence to: A M Grant a.grant{at}abdn.ac.uk
Design Multicentre, pragmatic randomised trial (with parallel preference groups).
Setting 21 hospitals in the United Kingdom.
Participants 357 randomised participants (178 surgical, 179 medical) and 453 preference participants (261, 192); mean age 46; 66% men. All participants had documented evidence of GORD and symptoms for >12 months.
Intervention The type of laparoscopic fundoplication used was left to the discretion of the surgeon. Those allocated to medical treatment had their treatment reviewed and adjusted as necessary by a local gastroenterologist, and subsequent clinical management was at the discretion of the clinician responsible for care.
Main outcome measures The disease specific REFLUX quality of life score (primary outcome), SF-36, EQ-5D, and medication use, measured at time points equivalent to three and 12 months after surgery, and surgical complications.
Main results Randomised participants had received drugs for GORD for median of 32 months before trial entry. Baseline REFLUX scores were 63.6 (SD 24.1) and 66.8 (SD 24.5) in the surgical and medical randomised groups, respectively. Of those randomised to surgery, 111 (62%) actually had total or partial fundoplication. Surgical complications were uncommon with a conversion rate of 0.6% and no mortality. By 12 months, 38% (59/154) randomised to surgery (14% (14/104) among those who had fundoplication) were taking reflux medication versus 90% (147/164) randomised medical management. The REFLUX score favoured the randomised surgical group (14.0, 95% confidence interval 9.6 to 18.4; P<0.001). Differences of a third to half of 1 SD in other health status measures also favoured the randomised surgical group. Baseline scores in the preference for surgery group were the worst; by 12 months these were better than in the preference for medical treatment group.
Conclusion At least up to 12 months after surgery, laparoscopic fundoplication significantly increased measures of health status in patients with GORD.
Trial registration ISRCTN15517081 [controlled-trials.com] .
Interest in surgery as an alternative to long term medical treatment has been considerable since the introduction of the minimal access laparoscopic approach in the early 1990s. The operation (fundoplication) involves partial or total wrapping of the fundus of the stomach around the lower oesophagus to recreate a high pressure zone. Although fundoplication produces resolution of reflux symptoms in up to 90% of patients,4 we do not know whether exchanging symptoms associated with best medical management for those of the side effects of surgery is advantageous for the patient and a good use of healthcare resources.
We carried out a multicentre pragmatic randomised trial (with parallel non-randomised preference groups to contextualise the results and augment them, particularly in respect of surgical complications),5 evaluating the clinical effectiveness, safety, and costs of a policy of relatively early laparoscopic surgery compared with optimised medical management of GORD for people judged suitable for both policies.
We invited any eligible patient who did not want to take part in the randomised trial because of a strong preference about treatment to join a non-randomised preference arm. An independent data monitoring committee recommended continued recruitment on each of the three occasions that it confidentially reviewed accumulating data.
Clinical management
Clinical management aimed to optimise current care in participating centres. Participating clinical centres had local partnerships between surgeons (who had performed at least 50 laparoscopic fundoplication operations) and gastroenterologists with whom they shared the secondary care of patients with GORD. They, supported by local research nurses, informed participants about the randomised trial and invited them to take part. It was only at this point that participants who declined to take part in the randomised trial, because of a strong preference either for remaining on medical treatment or for undergoing surgery, were informed about, and given the opportunity to take part in, the preference study.
For all participants in either the randomised or preference surgical group, surgery could be subsequently deferred or declined, by either the participant or surgeon (that is, even after trial entry). In the absence of erosive oesophagitis on endoscopy, and when necessary to exclude achalasia, manometry or pH studies were performed before surgery. A lead surgeon (or a less experienced surgeon working under supervision) undertook the surgery. Routine crural repair and non-absorbable synthetic sutures were recommended. The type of fundoplication was left to the discretion of the surgeon, who recorded intraoperative details on a specially designed study form. For the purposes of the main comparisons, we considered the different fundoplication techniques as a single policy. Those allocated to medical treatment had their treatment reviewed and adjusted as necessary by a local gastroenterologist to be "best medical management," based on the Genval workshop report.6 The medical protocol included the option of surgery if a clear indication developed after randomisation. In all other respects, clinical management was at the discretion of the clinician responsible for care.
Outcome measures
Outcome measures were those judged important to patients and health services. The primary outcome was the REFLUX questionnaire score, a validated "disease specific" measure incorporating assessment of reflux and other gastrointestinal symptoms and the side effects and complications of both treatments (score range 0 to 100, the higher the score the better the patients felt).7 The score was derived from the weighted average of six questions on quality of life (heartburn; acid reflux; eating and swallowing; bowel movements; sleep; and work, physical, and social activities). Five symptom scores were also developed as secondary measures (general discomfort; wind and frequency; nausea and vomiting; limitation in activity; constipation and swallowing). Participants were followed up by postal questionnaire three and 12 months after surgery or at an equivalent time among those who did not have surgery; the medical participants were linked to surgical participants who had been randomised at about the same time. Issues related to long, variable length, waiting lists prohibited timing of follow-up from randomisation. Other prestated outcome measures were health status (EQ-5D, based on UK public preference values, and SF-36); serious morbidity, such as operative complications; and mortality.
Sample size
We originally intended to recruit 300 participants to each trial group to identify a difference between the randomised groups of 0.25 of 1 SD in the disease specific REFLUX score (80% power;
=0.05) 12 months after surgery. In January 2003, however, in part prompted by a lower rate of recruitment than expected, we revised this target in consultation with the data monitoring committee and representatives of the funders. It was agreed that a slightly larger but still moderate sized benefit (0.3 of 1 SD, equivalent to seven points on the REFLUX quality of life score) was clinically plausible based on improvements reported in other studies8 9 after surgery among more severely affected people. We calculated that this required 196 in each group to give 80% power (
=0.05), assuming 10% attrition (that is, 176 per group with 12 month follow-up).
Randomisation
Random allocation was organised centrally by a secure system, using a computer generated sequence, stratified by clinical site, with balance in respect of age (18-49 or
50), sex (men or women), and BMI (
28 or >29) secured by minimisation. Staff in the central trial office entered details of participants on the secure database, then notified participants and respective clinical sites of their allocation. There was no subsequent blinding.
The figure
summarises the stages of the trial. The first 146 randomised participants (70 allocated to surgery and 76 allocated to medical management) were sent details of their allocation at the same time as the baseline questionnaires. This was changed at the request of the data monitoring committee such that subsequent allocations were generated only once completed baseline forms had been returned. The committee were concerned that if patients were to receive their allocation before completing the questionnaire this could potentially affect their responses.
|
Description of the groups at trial entry
The characteristics of the randomised participants (table 1
, baseline rates for medication in table 2
, and table A on bmj.com) were similar and lay between those in the preference groups; participants in the preference for surgery group were younger and had been prescribed medication for GORD for longer; participants in the preference for medical treatment group were older, more likely to be women, and had been prescribed medication for a shorter time.
|
|
|
Antireflux medication
Table 2 also shows actual use of antireflux medication during the previous two weeks at baseline, first follow-up, and second follow-up. By 12 months after surgery, 38% (59/154) of the randomised surgical participants were taking medication compared with 90% (147/164) of the randomised medical participants (nearly all of whom were taking proton pump inhibitors). Among those who had surgery, use of antireflux medication dropped to 9% at three months and 14% (14/104) at 12 months after surgery. Improvements in the scores in the medical group might reflect changes in management; lansoprazole was the predominant proton pump inhibitor used at study entry, whereas at follow-up omeprazole and lansoprazole were the most commonly reported.
Health status
Table 4 shows REFLUX scores at baseline and at three and 12 months
(see also tables B and C on bmj.com). There were substantial differences between the randomised intention to treat groups in the REFLUX score (a third to a half of 1 SD at follow up), with the surgery group having better scores than the medical group. This reflected improvements across all symptom domains within the measure (see tables B, C, and D on bmj.com). The differences between groups were larger when we considered only the per protocol participants.
|
|
Possible side effects of surgery
No differences were detected between the trial groups in their questionnaire responses at 12 months regarding "difficulty swallowing" and "bloatedness/trapped wind," but there was some evidence of more frequent "wind from the lower bowel" after surgery.
Preference groups
The participants in the preference for surgery group had lower mean REFLUX scores at baseline than those in the preference for medical treatment group (55.8 v 77.5). Despite this, at follow-up at 12 months, according to intention to treat analysis (difference 3.9, –0.2 to 8.0; P=0.064) and per protocol analysis (6.3, 2.4 to 10.2; P=0.002) the REFLUX score favoured the preference surgical group.
For participants in the preference group, other quality of life scores also tended to favour the surgical group. The differences between the preference groups, however, were less marked than the differences between the randomised groups, mainly because the preference for medical treatment group had better scores than the randomised medical group at baseline and follow-up.
Those who agreed to join the randomised trial had baseline characteristics between the two preference groups. The results in the preference groups were consistent with the randomised comparison: scores after surgery in those who preferred surgery were somewhat higher than in those who preferred medical treatment, despite starting from much lower baseline levels. None of the three participants who died had surgery and complications were uncommon; but confidence intervals around estimated frequencies were wide despite inclusion of all 319 surgical participants, leaving important uncertainty about the magnitude of surgical risk.
Strengths and weaknesses
We used a pragmatic trial design, with many UK patients, centres and experienced surgeons, thus allowing the results to be interpreted within a "real life" NHS context. The addition of the preference groups gives an indication of probable behaviour if surgery were to become more freely available.
We explored the impact of a third of those randomised to surgery not having fundoplication: firstly, through per protocol analyses limited to those randomised who received their allocated management, and, secondly, through an adjusted approach10 11 in an attempt to circumvent the probable selection bias of per protocol analyses. In the event, these two approaches gave similar results. The direction of effects was so clear that we observed significant differences even in the most conservative intention to treat analyses. While the intention to treat analyses estimate unbiased average effects on compliers and non-compliers, the adjusted analyses estimate bias adjusted average treatment effects on those who comply with treatment. In general, an adjusted analysis is preferable to a per protocol analysis as it is less likely to be biased.
The protocol allowed surgeons to use the type of fundoplication with which they were familiar, principally to avoid any learning effects. There is mixed evidence for the use of a total wrap or partial wrap.12 We explored possible differential effects of total versus partial wrap fundoplication in an observational analysis and found no evidence of a difference within either the randomised, preference, or per protocol groups (difference in 12 month REFLUX score –1.3 (–7.9 to 5.2), P=0.687).
Our study was limited to patients who were on long term acid suppression with proton pump inhibitors, who had symptoms that were reasonably controlled, and who were clinically suitable for either policy; it is to these sorts of patients with GORD that the results are generalisable. Potential participants were not easy to identify and recruit, and this led to slow recruitment. Most patients taking long term proton pump inhibitor treatment are managed in general practice, often through a repeat prescription system, whereas our trial was based in secondary care. We used three approaches to identify potentially eligible patients: retrospective review of hospital case notes; prospective identification, especially through endoscopy clinics; and (in selected centres) public advertisements. All potentially eligible patients then had to be assessed clinically, often through specially established monthly clinics, before being formally approached about the trial. Those approached often expressed clear preferences, reflecting the marked differences between the policies being compared.
The trial was conducted at a time when there was great pressure on surgical services in the NHS, with long delays for elective surgery for non-life threatening benign conditions. Indeed, the average time between trial entry and surgery was eight to nine months. Some participants experienced long delays before being formally offered surgery, and this was an important factor in their eventual decision to choose not to have surgery after all. While patients choice was the commonest reason for not having surgery, about a third of those who did not have fundoplication after allocation to surgery were refused surgery for clinical reasons: most commonly, the surgeon disagreed with the recruiting doctor that symptoms were sufficient to justify surgery or judged that the patient was not sufficiently fit (such as overweight) or the symptoms had improved since randomisation,
The standard rule in most trials is to time follow-up from randomisation. This was not appropriate in our trial because of the variable time between randomisation and surgery, exacerbated by the waiting list problem. The protocol specified follow-up at a time equivalent to three and 12 months after surgery. It was important to have follow-up in the medical groups at an equivalent time. We arranged this by setting the follow-up of medical participants at time points equivalent to three months and 12 months after surgery.
We focused directly on subjective measures of outcome (patient reported measures) rather than on objective measures of outcome. This was intentional and appropriate for the pragmatic evaluation of the interventions. In clinical practice, pH is not routinely measured after surgery. If we had included such objective outcome measures, this would have entailed our changing the regular management practices for these patients, thus jeopardising the generalisability of both our intervention and our findings.
Comparison with other studies
We identified two other randomised trials comparing laparoscopic Nissen fundoplication with continued medical management.13 14 They were less pragmatic in design with fewer participants (21713 and 10414), centres (two13 and one14), and surgeons (two13 and four14) and reported postoperative 24 hour pH measurement. The results of the two trials were consistent with ours. Across all trials, there were significantly greater improvements after surgery in measures of gastrointestinal and general wellbeing. While no major intraoperative complications were reported among the 52 patients who had surgery in one trial,14 in the other trial there were four major complications and four reoperations, including one case of gastric resection, among the 109 patients who had fundoplication,13 illustrating the potential risks of the procedure.
Conclusions
Laparoscopic fundoplication significantly increased health status at least to 12 months after surgery. Surgery does carry costs, however, and we will report investigation of cost effectiveness elsewhere. The narrowing of differences in health status between three and 12 months could reflect a postoperative placebo effect or could indicate decreasing effectiveness of surgery over time from surgery (as has been observed after open fundoplication9). We have therefore instituted annual follow-up using similar questionnaires and plan to report long term effectiveness after five years of follow-up.
|
Cite this as: BMJ 2008;337:a2664
Garry Barton (1999-2002), Laura Bojke, Marion Campbell, David Epstein, Adrian Grant, Sue Macran, Craig Ramsay, Mark Sculpher, Samantha Wileman.
Wendy Atkin (chair, independent of trial), John Bancewicz, Ara Darzi, Robert Heading, Janusz Jankowski (independent of trial), Zygmunt Krukowski, Richard Lilford (independent of trial), Iain Martin (1997-2000), Ashley Mowat, Ian Russell, Mark Thursz.
Jon Nicholl, Chris Hawkey, Iain MacIntyre (all independent of trial)
Members of the REFLUX trial group responsible for recruitment in the clinical centres
A Mowat, Z Krukowski, E El-Omar, P Phull, T Sinclair, L Swan (Aberdeen Royal Infirmary); B Clements, J Collins, A Kennedy, H Lawther (Royal Victoria Hospital, Belfast); D Bennett, N Davies, S Toop, P Winwood (Royal Bournemouth Hospital); D Alderson, P Barham, K Green, R Mittal (Bristol Royal Infirmary); M Asante, S El Hasani (Princess Royal University Hospital, Bromley); A De Beaux, R Heading, L Meekison, S Paterson-Brown, H Barkell (Royal Infirmary of Edinburgh); G Ferns, M Bailey, N Karanjia, TA Rockall, L Skelly (Royal Surrey County Hospital, Guildford); M Dakkak, C Royston, P Sedman (Hull Royal Infirmary); K Gordon, L F Potts, C Smith, PL Zentler-Munro, A Munro (Raigmore Hospital, Inverness); S Dexter, P Moayeddi (General Infirmary at Leeds); DM Lloyd (Leicester Royal Infirmary); V Loh, M Thursz, A Darzi (St Marys Hospital, London); A Ahmed, R Greaves, A Sawyerr, J Wellwood, T Taylor (Whipps Cross Hospital, London); S Hosking, S Lowrey, J Snook (Poole Hospital); P Goggin, T Johns, A Quine, S Somers, S Toh (Queen Alexandra Hospital, Portsmouth); J Bancewicz, M Greenhalgh, W Rees (Hope Hospital, Salford); CVN Cheruvu, M Deakin, S Evans, J Green, F Leslie (North Staffordshire Hospital, Stoke-on-Trent); J N Baxter, P Duane, M M Rahman, M Thomas, J Williams (Morriston Hospital, Swansea); D Maxton, A Sigurdsson, M S H Smith, G Townson (Princess Royal Hospital, Telford); C Buckley, S Gore, RH Kennedy, Z H Khan, J Knight (Yeovil District Hospital); D Alexander, G Miller, D Parker, A Turnbull, J Turvill (York District Hospital).
Contributors: AMG was the principal grant holder and contributed to development of the trial protocol and preparation of the paper and had overall responsibility for the conduct of the trial. SMW contributed to development of the trial design, was responsible for the day to day management of the trial, monitored data collection and assisted in the preparation of the paper. CRR contributed to the grant application and trial design and conducted the statistical analysis. ZHK, NAM, RCH, and MRT advised on clinical aspects of the trial design and the conduct of the trial and commented on the draft paper. MKC contributed to the development of the trial design, commented on all aspects of the conduct of the trial, and contributed to the preparation of the paper. AMG is guarantor.
Funding: This study was funded by the NIHR Health Technology Assessment Programme (as part of project No 97/10/99), and the full project report is published in Health Technology Assessment 2008;12:1-181. The Health Services Research Unit is funded by the Chief Scientist Office of the Scottish Government Health Directorates. The funders of this study (NCCHTA), other than the initial peer review process before funding and six monthly progress reviews, did not have any involvement in study design; in the collection, analysis, and interpretation of data; in the writing of the report; nor in the decision to submit the paper for publication. The views and opinions expressed in this paper are those of the authors and do not necessarily reflect those of the Department of Health or the funders that provide institutional support for the authors.
Competing interests: None declared.
Ethical approval: This study was approved by the Scottish multicentre research ethics committee and the appropriate local research ethics committees. All participants gave informed consent.
Provenance and peer review: Not commissioned; externally peer reviewed.
![]()
CiteULike
Complore
Connotea
Del.icio.us
Digg
Reddit
StumbleUpon
Technorati What's this?
Read all Rapid Responses