Published 13 August 2009, doi:10.1136/bmj.b3128
Cite this as: BMJ 2009;339:b3128

Research

Disagreements in meta-analyses using outcomes measured on continuous or rating scales: observer agreement study

Britta Tendal, PhD student1, Julian P T Higgins, senior statistician4, Peter Jüni, head of division2,3, Asbjørn Hróbjartsson, senior researcher1, Sven Trelle, associate director2,3, Eveline Nüesch, PhD student2,3, Simon Wandel, PhD student2,3, Anders W Jørgensen, PhD student1, Katarina Gesser, PhD student5, Søren Ilsøe-Kristensen, PhD student5, Peter C Gøtzsche, director1

1 Nordic Cochrane Centre, Rigshospitalet, Dept 3343, Blegdamsvej 9, DK-2100 Copenhagen, Denmark, 2 Institute of Social and Preventive Medicine, University of Bern, Switzerland, 3 CTU Bern, Bern University Hospital, Switzerland, 4 MRC Biostatistics Unit, Institute of Public Health, University of Cambridge, Cambridge, 5 Faculty of Pharmaceutical Sciences, University of Copenhagen, Denmark

Correspondence to: B Tendal bt{at}cochrane.dk

Objective To study the inter-observer variation related to extraction of continuous and numerical rating scale data from trial reports for use in meta-analyses.

Design Observer agreement study.

Data sources A random sample of 10 Cochrane reviews that presented a result as a standardised mean difference (SMD), the protocols for the reviews and the trial reports (n=45) were retrieved.

Data extraction Five experienced methodologists and five PhD students independently extracted data from the trial reports for calculation of the first SMD result in each review. The observers did not have access to the reviews but to the protocols, where the relevant outcome was highlighted. The agreement was analysed at both trial and meta-analysis level, pairing the observers in all possible ways (45 pairs, yielding 2025 pairs of trials and 450 pairs of meta-analyses). Agreement was defined as SMDs that differed less than 0.1 in their point estimates or confidence intervals.

Results The agreement was 53% at trial level and 31% at meta-analysis level. Including all pairs, the median disagreement was SMD=0.22 (interquartile range 0.07-0.61). The experts agreed somewhat more than the PhD students at trial level (61% v 46%), but not at meta-analysis level. Important reasons for disagreement were differences in selection of time points, scales, control groups, and type of calculations; whether to include a trial in the meta-analysis; and data extraction errors made by the observers. In 14 out of the 100 SMDs calculated at the meta-analysis level, individual observers reached different conclusions than the originally published review.

Conclusions Disagreements were common and often larger than the effect of commonly used treatments. Meta-analyses using SMDs are prone to observer variation and should be interpreted with caution. The reliability of meta-analyses might be improved by having more detailed review protocols, more than one observer, and statistical expertise.


This is an open-access article distributed under the terms of the Creative Commons Attribution Non-commercial License, which permits use, distribution, and reproduction in any medium, provided the original work is properly cited, the use is non commercial and is otherwise in compliance with the license. See: http://creativecommons.org/licenses/by-nc/2.0/ and http://creativecommons.org/licenses/by-nc/2.0/legalcode.

Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to StumbleUpon StumbleUpon   Add to Technorati Technorati    What's this?

Relevant Article

Compliance therapy in psychotic patients: randomised controlled trial
Roisin Kemp, Peter Hayward, Grantley Applewhaite, Brian Everitt, and Anthony David
BMJ 1996 312: 345-349. [Abstract] [Full Text]




Access jobs at BMJ Careers
Whats new online at Student 

BMJ