Rapid responses are electronic comments to the editor. They enable our users
to debate issues raised in articles published on bmj.com. A rapid response
is first posted online. If you need the URL (web address) of an individual
response, simply click on the response headline and copy the URL from the
browser window. A proportion of responses will, after editing, be published
online and in the print journal as letters, which are indexed in PubMed.
Rapid responses are not indexed in PubMed and they are not journal articles.
The BMJ reserves the right to remove responses which are being
wilfully misrepresented as published articles or when it is brought to our
attention that a response spreads misinformation.
From March 2022, the word limit for rapid responses will be 600 words not
including references and author details. We will no longer post responses
that exceed this limit.
The word limit for letters selected from posted responses remains 300 words.
In a big picture/ideal world, we'd have large sample sizes where mean and SD will be independent hence exhibiting a 'normal' distribution, as you have suggested.
Many of us, however, do not work with large samples and subsequently rely (desperately) on data transformation to be able to apply the many parametric tests we are familiar with. I feel that normality has become an artificial goal when conducting statistical analyses - to the extent that data transformation is used indiscriminately.... We seem so focused on normality and mean-based analyses that medians, ranks and non parametric alternatives, some of which are equally robust, become ignored. Further, transforming data fundamentally alters values to make them more equidistant and we can't really be sure what that does to its integrity and to its interpretability....
Research with heterogeneous groups faces the risk of losing information about its tail ends in this 'mad rush' towards achieving normality. We need to provide ample warnings about risks associated with data transformation and promote some non-parametric methods as safe alternatives.
Re: Statistics Notes: Transforming data
Sir
In a big picture/ideal world, we'd have large sample sizes where mean and SD will be independent hence exhibiting a 'normal' distribution, as you have suggested.
Many of us, however, do not work with large samples and subsequently rely (desperately) on data transformation to be able to apply the many parametric tests we are familiar with. I feel that normality has become an artificial goal when conducting statistical analyses - to the extent that data transformation is used indiscriminately.... We seem so focused on normality and mean-based analyses that medians, ranks and non parametric alternatives, some of which are equally robust, become ignored. Further, transforming data fundamentally alters values to make them more equidistant and we can't really be sure what that does to its integrity and to its interpretability....
Research with heterogeneous groups faces the risk of losing information about its tail ends in this 'mad rush' towards achieving normality. We need to provide ample warnings about risks associated with data transformation and promote some non-parametric methods as safe alternatives.
Competing interests: No competing interests