Jump to: Page Content, Site Navigation, Site Search,
You are seeing this message because your web browser does not support basic web standards. Find out more about why this message is appearing and what you can do to make your experience on this site better.
Kathleen M Griffiths Centre
for Mental Health Research, The Australian National University,
Canberra 0200, Australia
Correspondence to: K M Griffiths
kathy.griffiths{at}anu.edu.au
| |
Abstract |
|---|
|
|
|---|
Objectives:
To evaluate quality of web based
information on treatment of depression, to identify potential
indicators of content quality, and to establish if accountability
criteria are indicators of quality.
The web represents an unprecedented opportunity to provide high
quality, accessible healthcare information to consumers and health
providers. In the absence of editorial controls, however, the
information may be of low quality and potentially
harmful.1
In an influential paper Silberg et al proposed that
accountability standards (disclosure of authorship, ownership, and
currency of information) may be useful indicators of the quality of web based health information.2 These accountability criteria
have been widely assumed to reflect website quality,3-5
but their validity as indicators of quality of content have not been
investigated. Moreover, there have been few systematic studies of
the actual quality of the content of health information on the
web,6-9 and these studies have typically used textbook
summaries5 or author opinion8 as the gold
standard for assessing content quality rather than meta-analyses of the
available evidence.10 Finally, no published studies have
systematically evaluated the quality of mental health websites even
though mental disorders are a common cause of disability and the World
Health Organisation has predicted that depression will be the second
largest cause of disability within 20 years.11 Since only
a minority of people with depressive disorders receive
treatment,12 websites are potentially useful for
encouraging depressed people to seek help.
In this study we aimed to survey websites that a "typical" user
might access when searching for information on depression. We evaluated
the quality of the information on the treatment of depression
(including comparison with evidence based guidelines and meta-analyses)
and the relation between content quality and accountability indicators
and other site characteristics.
Selection of sites
Design:
Cross sectional survey.
Data sources:
21 frequently accessed websites about depression.
Main outcome measures:
(i) Site characteristics; (ii)
quality of content
concordance with evidence based depression
guidelines (guideline score), appropriateness of other relevant site
information (issues score), and subjective rating of site quality
(global score); and (iii) accountability
conformity with core
accountability standards (Silberg score) and quality of evidence cited
in support of conclusions (level of evidence score).
Results:
Although the sites contained useful
information, their overall quality was poor: the mean guideline,
issues, and global scores were only 4.7 (range 0-13) out of 43, 9.8 (6-14) out of 17, and 3 (0.5-7.5) out of 10 respectively. Sites
typically did not cite scientific evidence in support of their
conclusions. The guideline score correlated with the two other quality
of content measures, but none of the content measures correlated with
the Silberg accountability score. Content quality was superior for sites owned by organisations and sites with an editorial board.
Conclusions:
There is a need for better evidence based information about depression on the web, and a need to reconsider the
role of accountability criteria as indicators of site quality and to
develop simple valid indicators of quality. Ownership by an
organisation and the involvement of a professional editorial board may
be useful indicators. The study methodology may be useful for exploring
these issues in other health related subjects.
![]()
Introduction
Top
Abstract
Introduction
Methods
Results
Discussion
References
![]()
Methods
Top
Abstract
Introduction
Methods
Results
Discussion
References
To identify potential sites for our survey, we used two search
engines, DirectHit (www.directhit.com) and MetaCrawler
(www.go2net.com/search.html), to conduct searches in March 1999 using
the key word "depression." DirectHit returns 10 "popular" sites
based on analyses of previous user activity for a query (primarily
frequency of "clickthroughs" from a result list). MetaCrawler
integrates the results for a query from several well known search
engines including Alta Vista, Excite, Infoseek, Lycos, WebCrawler,
Yahoo, LookSmart, Thunderstone, and Mining Co. The usefulness of Direct
Hit and MetaCrawler in identifying popular sites has not been the
subject of formal independent evaluation. However, in the absence of
any other suitable search engine tools, the list of sites yielded by
our search methodology provided the best available approximation to a
list of depression sites that would be most commonly encountered by a
"typical" user.
Site assessment
We each independently evaluated the sites' characteristics,
content, and accountability. We resolved any disagreements soon after
rating by discussion and reference to site material.
Characteristics of the site
We rated each site as to its purpose, scope, ownership, country of
origin and for involvement of a drug company, professional editorial
board, or health professional. We also rated sites according to whether
they promoted products or services and whether they contained a
disclaimer or qualifier regarding information provided.
Quality of content
Guideline score
We assessed concordance between site information and best practice by using a 43 item rating scale developed from the evidence based guidelines on clinical practice for
treating depression published by the Agency for Health Care Policy and
Research (AHCPR).13 These guidelines are one of a set of
US federal guidelines developed according to the general principles
outlined in the US Institute of Medicine's guidelines for developing
evidence based guidelines on clinical practice.14 The
guidelines were developed by a multidisciplinary panel from systematic
reviews of the scientific evidence (meta-analyses of randomised
controlled trials using modified "intention to treat" analyses) and
underwent extensive review by all panel members, a methodologist, 28 scientific reviewers, and 73 organisations. Each item in our rating
scale corresponded to one of the statements in the guidelines. The
scale covered the use of drugs, psychotherapy, combined drugs and
psychotherapy, and electroconvulsive therapy. Topics included
effectiveness, indications, selection within a treatment type, failure
to respond, and frequency of visits. We calculated a guideline score
for each site by counting the number of items on the scale for which
site information agreed with the guidelines. We also calculated a core
guideline score (out of 5) from a subset of key items relating to
indications for and effectiveness of the four major treatment types
(see box 1).
|
We evaluated other treatment issues with
a 17 item scale designed to assess the appropriateness of site
information about important treatment and management issues not
adequately or not directly evaluated by the guideline scale (such as
the importance of seeking help, discussion of side effects, depression in young people, and the relation between depression and suicide risk).
Global score
We each provided a subjective judgment of the
overall quality of a site (score out of 10) and then calculated an
average score for each site. There was a moderately high correlation between our scores (r=0.69, P=0.001), and the mean scores
for each of us did not differ significantly (mean difference 0.38 (SD
1.5), t 20=1.16), suggesting acceptable
inter-judge agreement despite the unstructured and subjective nature of
the task.
Interventions recommended
We rated the sites according to
whether each of a range of interventions were mentioned; were said or
implied to be effective or useful or were recommended as a first line,
second line, or adjunct treatment for all or some groups; and were said
to be ineffective or were not recommended. Interventions denoted
effective but explicitly not recommended were coded as "not recommended."
Sources of help recommended
We rated potential sources of
help for depression as recommended, not recommended, or not mentioned.
Accountability
Silberg score
Sites were rated on a 9 point scale
according to Silberg et al's criteria of authorship (whether authors
and their affiliations and credentials were clearly identified), attribution (whether sources and references were mentioned), disclosure (whether ownership of the site and sponsorship was disclosed), and
currency (whether the site has been modified in the past month and year
and whether the date the site was created or modified was
specified).2
We recorded the stated level of
evidence associated with each intervention using a 5 point scale
adapted from a previously published scale of hierarchy of evidence (box 2).15 We based the evidence scores only on information
explicitly provided by the site and not on our knowledge of the cited
study or relevant literature.
|
Statistical analyses
We assessed site quality and accountability as a function of
site characteristic using Mann-Whitney tests, Kruskall-Wallis analyses
followed by Mann-Whitney tests, or Fisher's exact probability
tests. We calculated non-parametric confidence intervals for the main
findings using the procedure outlined by Campbell and
Gardner.16 We calculated intercorrelations between variables using Pearson's correlation tests and Phi tests.
| |
Results |
|---|
|
|
|---|
Site characteristics
Of the 21 sites included in our analysis, 19 were US based, one
was European, and the remaining site was of unknown origin. The
principal purpose of the sites was to provide information or
educational material (10 sites), links (4), a consumer forum (1), or
information in combination with either links or consumer forum or both
(6). Table 1 lists other characteristics of the
sites.
|
Quality of content
The mean guideline, issues, and global scores were 4.7 out of 43, 9.8 out of 17, and 3 out of 10 respectively (table 1), indicating
little concordance with guideline recommendations, inadequate
consideration of management or treatment issues, and generally low
overall ratings.
|
Accountability
The mean Silberg score was 5.4 out of 9 (table 1). Most of the
sites clearly specified the authors of the web content (13 sites) and
their credentials (11 sites) and affiliations (11 sites). Nine of the
sites mentioned at least some sources and references on the site
(although such information was typically not comprehensive). All but
one site disclosed an owner of the site, and three mentioned sponsors.
Most sites indicated when the site had been created or modified: most
had been modified in the past year, and nine had been modified in the
past month.
Association between quality of content, accountability, and site
characteristics
The guideline score was significantly correlated with the
other two measures of quality of content (with global score,
r=0.53, P<0.05; with issues score, r=0.74,
P<0.01). However, none of the measures of quality of content
correlated significantly with the Silberg accountability score
(r=-0.5 to 0.21). Of the sites offering recommendations
about psychotherapy, those citing scientific evidence were more likely
to achieve an above median guideline score (Phi=0.50, P=0.034) and
showed a tendency to achieve above median issues scores (Phi=0.45,
P=0.058). There was no comparable significant relationship for antidepressants.
| |
Discussion |
|---|
|
|
|---|
In our review of 21 popular websites containing information about treating depression we found that the quality of this information was poor. This finding reinforces concerns raised by other studies which have found inadequate quality6-9 or poor coverage17 of important health issues on the web. There is a need to improve the accuracy and coverage of information about depression on the web with regard to the relative effectiveness of different treatments, the main indications for particular treatments, important management issues such as duration of treatment, reviewing and changing treatments, and the relevance of professional expertise and patient preferences. Sites should also warn readers that tricyclic antidepressants are ineffective for adolescents and that drugs may not be the first line of treatment for this age group.18
Our findings raise questions about the usefulness of Silberg et al's accountability criteria as indicators of website quality2 and suggest that further investigation of indicators of quality is warranted. Particular site characteristics (such as ownership by an organisation or existence of a professional editorial board) may prove more useful indicators of content quality than disclosure of information per se. Our results also suggest that the number of different types of interventions mentioned may be a predictor of site quality, as may the citation of scientific evidence in support of recommended treatments.
The critical question is whether the attributes which were associated with the better quality sites about depression are valid indicators of the quality of other types of health related sites. Our methodology could be used to address this question and to identify those attributes that are common predictors of quality for different medical subjects. The methodology lends itself to replication in different subjects since any systematically produced set of guidelines can serve as a rating scale with which to evaluate websites.
It is possible that the inadequacies we documented are not restricted to websites but reflect the beliefs and level of knowledge of many health professionals. McClung et al have reported that even medical teaching centres disseminated inadequate reviews on the web.7 It is unlikely that the AHCPR guidelines are outdated or inadequate since a review of more recent evidence concluded that the major AHCPR conclusions are still applicable and that, when rigorously implemented, the guidelines result in improved outcomes compared with usual care.19 The guidelines have been criticised for their failure to recommend psychotherapy as a first line treatment for severe depression,20 but, although there is some evidence to support this criticism,21 few studies have directly compared the efficacy of different treatments for severe depression and the findings have been inconsistent. By contrast, a large number of randomised controlled trials have demonstrated the efficacy of antidepressants in treating severe depression.
Despite their generally low scores for content quality, many sites did contain important and potentially useful information. It is even possible that a formal evaluation might show that such sites improve the mental health outcomes of those who visit them. Silberg et al have referred to the importance of distinguishing the flowers from the weeds on the internet superhighway. However, a single site, whether owned by a consumer or a health professional, may grow both flowers and weeds. The real challenge is to devise strategies that selectively eliminate the weeds but leave the flowers to bloom.
|
What is already known on this topic
Depression is a major source of disability in the community Websites offer an opportunity to disseminate information to the public about effective treatments However, little is known about the quality of existing sites about depression or about indicators of a good health website What this study addsAn audit of 21 popular websites revealed that the general quality of information on the treatment of depression is poor Currently popular criteria for evaluating the quality of websites were not indicators of content quality, but sites with an editorial board and sites owned by organisations produced higher quality sites than others |
| |
Acknowledgments |
|---|
Contributors: KG conceived and designed the study, rated websites, analysed and interpreted the data, and wrote the paper. HC conceived and designed the study, rated websites, analysed data, and edited the paper. Jo Medway collected and organised data and commented on the manuscript. Ailsa Korten calculated confidence intervals for non-parametric data and commented on statistical procedures. Andrew MacKinnon contributed to discussions during the design phase. KG and HC are guarantors for the paper.
| |
Footnotes |
|---|
Funding: This work was supported by grant 973302 from the National Health and Medical Research Council.
Competing interests: None declared
This article is part of the BMJ's
randomised controlled trial of open peer review. Documentation relating
to the editorial decision making process is available on the BMJ's
website
| |
References |
|---|
|
|
|---|
| 1. |
Bower H.
Internet sees growth of unverified health claims.
BMJ
1996;
313:
381 |
| 2. |
Silberg WM, Lundberg GD, Musacchio RA.
Assessing, controlling and assuring the quality of medical information on the internet.
JAMA
1997;
277:
1244-1245 |
| 3. | O'Mahony B. Irish health care web sites: a review. Ir Med J 1999; 92: 334-337[Medline]. |
| 4. |
Jadad AR, Gagliardi A.
Rating health information on the internet: navigating to knowledge or to Babel?
JAMA
1998;
279:
611-614 |
| 5. | Shon J, Musen MA. The low availability of metadata elements for evaluating the quality of medical information on the world wide web. Proc AMIA Symp 1999; (1-2): 945-949. |
| 6. |
Impicciatore P, Pandolfini C, Casella N, Bonati M.
Reliability of health information for the public on the world wide web: systematic survey of advice on managing fever in children at home.
BMJ
1997;
314:
1875-1879 |
| 7. |
McClung HJ, Murray RD, Heitlinger LA.
The internet as a source for current patient information.
Pediatrics
1998;
101:
1-4 |
| 8. | Soot LC, Moneta GL, Edwards JM. Vascular surgery and the internet: a poor source of patient-oriented information. J Vasc Surg 1999; 30: 84-91[CrossRef][Medline]. |
| 9. | Davison K. The quality of dietary information on the world wide web. Clin Perform Qual Health Care 1997; 5: 64-66[Medline]. |
| 10. |
Wyatt JC.
Measuring quality and impact of the world wide web.
BMJ
1997;
314:
1879-1881 |
| 11. | Murray CJL, Lopez AD, eds. The global burden of disease: a comprehensive assessment of mortality and disability, injuries and risk factors in 1990 and projected to 2020. Cambridge, MA: Harvard University Press, 1996. |
| 12. | Kessler RC. Some considerations in making resource allocation decisions for the treatment of psychiatric disorders. In: Andrews G, Henderson S, eds. . Unmet need in psychiatry: problems, resources, responses. Cambridge: Cambridge University Press, 2000:59-84. |
| 13. | Depression Guideline Panel. Depression in primary care. , Vol 2. Treatment of major depression Maryland: US Department of Health and Human Services, 1993. (clinical practice guidelines No 5. AHCPR Publication No 93-0551.) |
| 14. | Institute of Medicine. Guidelines for clinical practice: from development to use. Washington, DC: National Academy Press, 1992. |
| 15. | National Health and Medical Research Council. Guidelines for the development and implementation of clinical practice guidelines. Canberra: NHMRC, 1995. |
| 16. | Campbell MJ, Gardner MJ. Calculating confidence intervals for some non-parametric analyses. In: Gardner MJ, Altman D, eds. . Statistics with confidence: confidence intervals and statistical guidelines. London: BMJ Publishing Group, 1989. |
| 17. |
Sandvik H.
Health information and interaction on the internet: a survey of female urinary incontinence.
BMJ
1999;
319:
29-32 |
| 18. | National Health and Medical Research Council. Clinical practice guidelines: depression in young people. Canberra: NHMRC, 1997. |
| 19. |
Schulberg HC, Katon W, Simon GE, Rush AJ.
Treating major depression in primary care practice: an update of the Agency for Health Care Policy and Research Practice guidelines.
Arch Gen Psychiatry
1998;
55:
1121-1127 |
| 20. |
Persons JB, Thase ME, Crits-Cristoph P.
The role of psychotherapy in the treatment of depression: review of two practice guidelines.
Arch Gen Psychiatry
1996;
53:
283-290 |
| 21. | Schulberg HC, Pilkonis PA, Houck P. The severity of major depression and choice of treatment in primary care practice. J Consult Clin Psychol 1998; 66: 932-938[CrossRef][Medline]. |
(Accepted 17 July 2000)
Read all Rapid Responses