BMJ  2008;336:789 (12 April), doi:10.1136/bmj.39542.610000.3A

Letters

Predicting citations

Validating prediction models

The first 150 words of the full text of this article appear below.

Lokker et al presented an interesting model to predict citation counts for clinical articles.1 This topic is so important that the paper will probably attract many citations. We want to clarify some of the nomenclature of validation of prediction models, to avoid confusion in future reporting.

The authors randomly divided 1274 articles into a derivation data set of 757 articles for development of a prediction model and a validation dataset for testing of 504 articles, after exclusion of outliers with >150 citations. This procedure is an example of a split sample approach, but the authors refer to it as cross validation. Cross validation would mean that we develop a model in the first part of the data and test it in the second part, and then repeat the procedure with development in the second part and testing in the first.

The authors report that explained variation (R2) decreased from 0.60 . . . [Full text of this article]

Ewout W Steyerberg, professor of medical decision making, Hester F Lingsma, junior researcher

1 Department of Public Health, Erasmus MC, PO Box 2040, 3000 CA Rotterdam, Netherlands

E.Steyerberg@erasmusmc.nl


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to StumbleUpon StumbleUpon   Add to Technorati Technorati    What's this?

Relevant Article

Prediction of citation counts for clinical articles at two years using data available within three weeks of publication: retrospective cohort study
Cynthia Lokker, K Ann McKibbon, R James McKinlay, Nancy L Wilczynski, and R Brian Haynes
BMJ 2008 336: 655-657. [Abstract] [Full Text] [PDF]




Access jobs at BMJ Careers
Whats new online at Student 

BMJ