Jump to: Page Content, Site Navigation, Site Search,
You are seeing this message because your web browser does not support basic web standards. Find out more about why this message is appearing and what you can do to make your experience on this site better.
BMJ 2003;326:1070 (17 May), doi:10.1136/bmj.326.7398.1070
Krish Thiru, research fellow1, Alan Hassey, general practitioner, Frank Sullivan, general practitioner2
1 Fisher Medical Centre Research Unit, Skipton, North Yorkshire BD23 1EU, 2 Tayside Centre for General Practice, University of Dundee, Dundee DD2 4AD
Correspondence to: K Thiru krish.thiru{at}st-marys.nhs.uk
Design Systematic review of English language publications, 1980-2001.
Data sources Bibliographic searches of medical databases, specialist medical informatics databases, conference proceedings, and institutional contacts.
Study selection Studies selected according to a predefined framework for categorising review papers.
Data extraction Reference standards and measurements used to judge quality.
Results Bibliographic searches identified 4589 publications. After primary exclusions 174 articles were classified, 52 of which met the inclusion criteria for review. Selected studies were primarily descriptive surveys. Variability in methods prevented meta-analysis of results. Forty eight publications were concerned with diagnostic data, 37 studies measured data quality, and 15 scoped EPR quality. Reliability of data was assessed with rate comparison. Measures of sensitivity were highly dependent on the element of EPR data being investigated, while the positive predictive value was consistently high, indicating good validity. Prescribing data were generally of better quality than diagnostic or lifestyle data.
Conclusion The lack of standardised methods for assessment of quality of data in electronic patient records makes it difficult to compare results between studies. Studies should present data quality measures with clear numerators, denominators, and confidence intervals. Ambiguous terms such as "accuracy" should be avoided unless precisely defined.
We identified one review of mainly secondary care studies that described system and organisational factors that affect quality of the data in EPR.3 We carried out a similar review but in primary care.
We established a framework for categorising and selecting review papers (box). Eligible papers had to satisfy at least one aspect (numbered) of each category (A-C) within the box.
|
|
Measuring data quality
Thirty seven studies measured data quality, and 31 were from the United
Kingdom. A similar proportion had been published since 1995. Table B on
bmj.com gives full
details of categorisation (according to that shown in the box) and
characteristics. Eight studies were prospective, although the data extraction
was primarily cross sectional. The remaining articles were cross sectional or
retrospective surveys. Two studies were interventional: one a case-control
study involving onsite training and the other a study carried out before and
after a software update. Both showed substantial improvements in recording
levels after the intervention. A retrospective cohort study also showed an
increase in completeness and accuracy of EPRs over five years.
Structured data (codes, classifications, and nomen-clatures) were most commonly investigated. Although textual data were mentioned, only one study considered it in any detail. Twelve documents did not present their data structure (that is, coding system name) while most did not present the precise codes being investigated. UK publications generally used Read and OXMIS (Oxford medical information systems) codes. In other countries the ICPC (international classification of primary care) codes were more widely used. ICD (international classification of diseases) codes act as a referencing standard for these primary care coding systems. Where necessary, subsidiary codes (for example, chapter headings from British National Formulary; Prescription Pricing Authority) were used.
Quality of data (reliability) was usually measured with rate comparisons. Data validity was expressed under a range of terms (completeness, correctness, accuracy, consistency, and appropriateness), which were rarely defined. Sensitivity (completeness) was the commonest such index (table, webextra table B on bmj.com).
Seven studies carried out questionnaire and telephone surveying for a reference standard, one study used video recording, and 24 used clinical information gathered during the consultation (table). Seventeen publications used triangulation within the EPR to test internal consistency of data. Medication data was the most common internal reference standard. EPR diagnostic status was appraised through electronic prescribing information and subsequently validated against the paper notes. Hospital discharge details have also been used to evaluate EPR diagnostic status. The presence of hospital diagnosis and procedural data have been found to improve the quality of data in primary care. Eighteen studies used national statistics or survey data as a reference standard for data reliability. A third of UK studies used the fourth national study of morbidity in general practice (MSGP4).
Scoping data
Fifteen studies used EPR data for research or practice management. Although
the intention of these studies was not to measure data quality, they gave
insight into issues of data validation. These studies relied more on positive
predictive value as a quality measure than sensitivity
(table). Fourteen studies
considered the diagnostic status of the patient, with 10 publications dealing
primarily with information on patient identification and case validation.
Three used survey techniques to establish diagnostic status. Of the 12
retrospective investigations, seven used centralised datasets. These
"scoping" studies were more than twice as likely to present
confidence intervals than studies that measured data quality (10/15 (67%)
v 11/37 (30%)).
Levels of data recording
Prescribing data are generally the most sensitive, and the ability to link
prescriptions with diagnosis was the favoured means of identifying patients
and establishing the predictive validity of diagnostic codes. The sensitivity
of other EPR elements was wide ranging, while positive predictive value was
consistently high. Diseases with clear diagnostic criteria were generally
better recorded, as were data on specific procedures. Lifestyle and
socioeconomic data were rarely studied and then only in terms of
sensitivity.
The dominance of UK publications is unsurprising given the scope of this review. This also suggests an understanding of the importance of the quality of EPR data in terms of health policy and validated research databases. within the United Kingdom. Publications from non-English speaking countries were disadvantaged under our selection criteria. Those that were identified used similar techniques to measure data quality.
Measuring quality
The element of the EPR being investigated (numerator) and the components of
the reference standard used to appraise its quality (denominator) were often
not clearly defined within the literature (for instance, diagnostic
code/diagnostic criteria). When they were defined there was inconsistency
between studies. This makes comparisons risky and meta-analytical
interpretation of results impossible.
Measurement theory requires that both the concepts of validity and reliability be addressed. Reliability (a precursor to validity) is a measure of stability and is appraised through the subjective comparison of rates and prevalence. Sensitivity and positive predictive value, the most widespread measures of data validity, presuppose that the selected denominator is an adequate representation of the true dimension being measured. To identify the real health status of the patient subjective (perceived), objective, and diagnostic dimensions need to be measured by different techniques and their appropriateness for EPR validation considered. To aid interpretation and make comparisons between populations, confidence intervals should be provided.
When the opportunity to record clinical data in different forms (paper and computer) exists, this decreases validity of any one to act as a true reference standard. The use of paper notes to assess EPR validity will become increasingly inaccurate as clinicians migrate to electronic systems. In the medium term it is best to consider several independent markers of quality, and those studies that used several explicit reference standards (triangulation) were more likely to reflect the true quality of electronic data (see table B on bmj.com).
To facilitate comparisons of data quality across sites and systems, it is essential to have a reference standard. In the longer term we recommend the establishment of internal reference standards based on those objective and diagnostic EPR elements recognised as having high positive predictive value (that is, diagnostic codes, prescriptions, test results, referral outcomes, procedural codes). Such reference standards can then be used to explore measures of sensitivity.
|
This is an abridged
version; the full version is on
bmj.com We thank E Mitchell (Tayside Centre for General Practice) and N Booth (Sowerby Centre for Health Informatics at Newcastle) for access to their specialist medical informatics databases.
Contributors: See bmj.com
Funding: Fisher Medical Centre Research Unit is funded by the NHS Executive Northern and Yorkshire Region. The guarantor accepts full responsibility for the conduct of the study, had access to the data, and controlled the decision to publish.
Competing interests: None declared.
![]()
CiteULike
Complore
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
Read all Rapid Responses