Jump to: Page Content, Site Navigation, Site Search,
You are seeing this message because your web browser does not support basic web standards. Find out more about why this message is appearing and what you can do to make your experience on this site better.
BMJ 2003;326:1127 (24 May), doi:10.1136/bmj.326.7399.1127
Philip J B Brown, honorary lecturer in healthcare informatics1, Victoria Warmington, research associate2, Michael Laurence, general practitioner3, A Toby Prevost, medical statistician4
1 School of Information Systems, University of East Anglia, Norwich NR4 7TJ, 2 Humbleyard Practice, Hethersett, Norfolk NR9 3AB, 3 Bacon Road Medical Centre, Norwich NR2 3QX, 4 Department of Public Health and Primary Care, University of Cambridge, Institute of Public Health, Cambridge CB2 2SR
Correspondence to: P J B Brown, Humbleyard Practice, Hethersett, Norfolk NR9 3AB Pjbb{at}hicomm.demon.co.uk
Design Randomised crossover trial. Clinicians coded patient records using both schemes after being randomised in pairs to use one scheme before the other.
Setting 10 general practices in urban, suburban, and rural environments in Norfolk.
Participants 10 general practitioners.
Source of data Concepts were collected from records of 100 patient encounters.
Main outcome measures Percentage of coded choices ranked as being exact representations of the original terms; percentage of cases where coding choice of paired general practitioners was identical; length of time taken to find a code.
Results A total of 995 unique concepts were collected. Exact matches were more common with Clinical Terms (70% (95% confidence interval 67% to 73%)) than with Read Codes (50% (47% to 53%)) (P < 0.001), and this difference was significant for each of the 10 participants individually. The pooled proportion with exact and identical matches by paired participants was greater for Clinical Terms (0.58 (0.55 to 0.61)) than Read Codes (0.36 (0.33 to 0.39)) (P < 0.001). The time taken to code with Clinical Terms (30 seconds per term) was not significantly longer than that for Read Codes.
Conclusions Clinical Terms Version 3 performed significantly better than Read Codes 5 byte set in capturing the meaning of concepts. These findings suggest that improved coding accuracy in primary care electronic patient records can be achieved with the use of such a clinical terminology.
Comparisons of different clinical coding schemes have mainly been conducted by coding experts looking at the schemes' coverage in relation to existing lists of terms. No study has examined whether a clinical terminology improves the performance of coding electronic patient records by practising doctors in primary care.
The main aim of this crossover study was to determine whether Clinical Terms Version 3 provides greater accuracy and consistency than Read Codes 5 byte set for coding electronic patient records by general practitioners.
Design
Each general practitioner manually recorded the consultation details of 10
consecutive patients in an arbitrarily chosen consultation session. A simple
framework of headings was provided (reason for encounter, diagnosis,
treatment, and medical history) to prompt entry of details, but there was no
restriction in the terms that could be recorded. The terms from these 100
records were then entered verbatim into an Access (Microsoft) database. We
used random number tables to group the general practitioners into five pairs
and to randomly select one of each pair to code terms with Read Codes 5 byte
set (termed Read Codes in this paper) first and then to code with the Clinical
Terms Version 3 (termed Clinical Terms in this paper), and the other doctor in
each pair to use the Clinical Terms first followed by Read Codes. We asked the
clinicians to code the terms collected from their own records and those from
the other doctor in their pair (see
figure).
|
We videotaped each doctor coding his or her allocated file of terms using both Clinical Terms and Read Codes using the same software and hardware. Participants were given standardised instructions and training to identify a code for each term that would be "an acceptable match if the coded record were the only documentation of the concept in a paperless practice."
A researcher reviewed each video and recorded the time taken to code each term. Two researchers then independently examined the coded choices made by each general practitioner and ranked each match as exact or non-exact in representing the meaning of the original term.
Statistical analysis
We used Cohen's
coefficient to assess agreement among participants
in the exactness of coding under each scheme, with a value of
0.6
indicating good
agreement.7 We
estimated the accuracy of each coding scheme by calculating the proportion of
coded choices ranked as being exact semantic representations of the original
terms. We estimated consistency by identifying the proportion of cases where
the coding choice of the paired general practitioners was the same. We pooled
results across general practitioner pairs and calculated confidence intervals
using McNemar's test and a bootstrap method (see
bmj.com for
details).
The length of time taken to find a code was used as a measure of usability of each scheme. We used a non-parametric bootstrap method to estimate 95% confidence intervals and P values for mean time differences.
|
Accuracy of coding schemes
The proportion of concepts ranked as exact semantic representations with
Clinical Terms ranged from 0.60 to 0.74 (pooled proportion 0.70) for the 10
participants, with seven of the doctors being in excess of 0.7. By contrast,
the proportion of concepts ranked exact with Read Codes ranged from 0.37 to
0.58 (pooled proportion 0.50). All 10 doctors coded significantly more
concepts as exact matches with Clinical Terms than with Read Codes (P <
0.001 for each doctor). The excess proportion of concepts ranked exact with
Clinical Terms ranged from 14% (95% confidence interval 7% to 21%) to 27% (19%
to 34%) for the 10 participants. The excess proportion of concepts exactly
matched with Clinical Terms was similar in the doctors who used this scheme
before Read Codes (22%) and in those who used the scheme after using Read
Codes (18%), although this relatively small difference represented a
significant period effect. We also found a significant carryover effect but
the absolute difference was small (see
bmj.com)
Consistency of coding schemes
The percentage of concepts ranked consistent (that is, exact matches and
coded identically by both members of a pair) ranged from 53% to 63% for
Clinical Terms and from 31% to 43% for Read Codes. The excess proportion
ranked consistent with Clinical Terms ranged from 21% to 23% and was
significant for each of the general practitioner pairs. The pooled proportion
of consistent matches by general practitioner pairs was 0.58 for Clinical
Terms and 0.36 for Read Codes, with a pooled difference in proportion of 0.22
(0.19 to 0.25) (P < 0.001).
Usability of coding schemes
The median coding time for each of the 10 participants ranged from 14 to 27
seconds for Clinical Terms and from 18 to 49 seconds for Read Codes. Compared
with Read Codes, the mean excess time taken to code with Clinical Terms ranged
from 29 to 12.3 seconds for the pairs of participants. The mean time
taken to code with Clinical Terms was shorter by a mean of 5.9 seconds (4.0 to
7.9). However, on the basis of the 850 terms with full data available, there
were significant period and carryover effects.
Strengths and limitations of our study
We compared the content and usability of the two coding schemes in a
practical setting, where clinicians had a variable degree of competency in
coding. While formulating the study, we considered videotaping the coding
process during live patient consultations. We rejected this in favour of a
randomised crossover trial as consistency between experimenters would have
been difficult to assess and confounding variables such as time constraints on
searching would have been difficult to control. Coding performance and times
are therefore only proxy estimates of use in real patient encounters. Further
improvement of data entry might be achieved with more sophisticated software
than was used in our studysuch as by using templates for data entry and
menus to access commonly used terms.
We compared Clinical Terms Version 3 with Read Codes 5 byte set rather than the earlier Read Code 4 byte set, which is still in use, because an earlier study had indicated that Read Codes 5 byte set was superior in coverage than the earlier scheme.8
The carryover effect in the proportion of terms exactly matched was small compared with the size of the difference between the two coding schemes in each period. The carryover effect for coding time reflected the change from the first period to the second period in the difference between the schemes. Four of the five doctors who coded first with Read Codes took more than 10 seconds longer to code each term than they did with Clinical Terms; review of the video comments of the remaining participant suggested that the longer coding time in the second period related to user fatigue (including remarks about the doctor's uncertainty of meaning of the original term and technical difficulties in using the notebook keypad). Only one of the participants who coded first with Clinical Terms took more than 10 seconds longer to code with this than with Read Codes, and this may be accounted for by the doctor's familiarity with the content of Read Codes. The small number of participants limits our ability to explain such differences with certainty, and we have cautiously interpreted them to say that the time taken to code with Clinical Terms was not significantly longer than that with Read Codes (see bmj.com).
|
We did not try to measure the potential clinical importance of the non-exact matches. Judging the importance of non-exact matching would have introduced a further subjective element, requiring further checks of inter-rater reliability that were outside the scope of the study, although our data provide valuable material for further study.
Conclusion
The coding of clinical records is an important aspect of medical audit,
research, epidemiology, management of resources, and the direct care of
patients. For information technology to be fully adopted, clinical notions
that are often complex must be accurately and easily represented as coded
concepts that are "user friendly" and easily retrievable. Our
study suggests that substantial advantages may be achieved by investing in the
implementation of Clinical Terms Version 3 or a similar terminology.
This is an abridged
version; the full version is on
bmj.com We thank Ian Harvey (Health Policy and Practice, University of East Anglia) for his advice in designing the study and Doreen Cochrane (SuNet facilitator, Health Policy and Practice, University of East Anglia) for invaluable input during the preparation of this proposal. The following clinicians participated in the study: Robert Bawden, Peter Burrows, Jamie Dalrymple, Stephen Daykin, Christopher Hand, Carly Hughes, Andrew Leaman, David Munson, Tony Press, and Rob Stone.
Contributors: see bmj.com
Funding: The study was funded by the NHS Eastern Region R&D grant No RCC33031.
Competing interests: PJBB advises the NHS Information Authority on coding and terminology.
Ethical approval: The study was granted ethical approval by the Norfolk and Norwich Ethical Committee.
![]()
CiteULike
Complore
Connotea
Del.icio.us
Digg
Reddit
StumbleUpon
Technorati What's this?
Read all Rapid Responses