Is voice therapy an effective treatment for dysphonia? A randomised controlled trial

BMJ 2001; 323 doi: (Published 22 September 2001) Cite this as: BMJ 2001;323:658
  1. Kenneth MacKenzie, consultant otolaryngologist (kmk2x{at},
  2. Audrey Millar, research scientista,
  3. Janet A Wilson, professor of otolaryngologyc,
  4. Cameron Sellars, senior speech and language therapistb,
  5. Ian J Deary, professor of differential psychologyd
  1. a Department of Otorhinolaryngology and Head and Neck Surgery, Glasgow Royal Infirmary, Glasgow G31 2ER
  2. b Department of Speech and Language Therapy, Glasgow Royal Infirmary
  3. c Department of Otorhinolaryngology and Head and Neck Surgery, University of Newcastle, Newcastle upon Tyne NE7 7DN
  4. d Department of Psychology, University of Edinburgh, Edinburgh EH8 9JZ
  1. Correspondence to: K MacKenzie
  • Accepted 8 July 2001


Objectives: To assess the overall efficacy of voice therapy for dysphonia.

Design: Single blind randomised controlled trial.

Setting: Outpatient clinic in a teaching hospital.

Participants: 204 outpatients aged 17-87 with a primary symptom of persistent hoarseness for at least two months.

Interventions: After baseline assessments, patients were randomised to six weeks of either voice therapy or no treatment. Assessments were repeated at six weeks on the 145 (71%) patients who continued to this stage and at 12-14 weeks on the 133 (65%) patients who completed the study. The assessments at the three time points for the 70 patients who completed treatment and the 63 patients in the group given no treatment were compared.

Main outcome measures: Ratings of laryngeal features, Buffalo voice profile, amplitude and pitch perturbation, voice profile questionnaire, hospital anxiety and depression scale, clinical interview schedule, SF-36.

Results: Voice therapy improved voice quality as assessed by rating by patients (P=0.001) and rating by observer (P<0.001). The treatment effects for these two outcomes were 4.1 (95% confidence interval 1.7 to 6.6) points and 0.82 (0.50 to 1.13) points. Amplitude perturbation showed improvement at six weeks (P=0.005) but not on completion of the study. Patients with dysphonia had appreciable psychological distress and lower quality of life than controls, but voice therapy had no significant impact on either of these variables.

Conclusion: Voice therapy is effective in improving voice quality as assessed by self rated and observer rated methods.

What is already known on this topic

What is already known on this topic Many patients with dysphonia are treated by voice therapy

The effectiveness of voice therapy in a diverse group of patients is unknown

What this study adds

What this study adds Voice therapy is an effective treatment for dysphonia in terms of report by patients and perceptual ratings by an expert

Psychological distress and reduction in general health status are common in patients with dysphonia but are not significantly affected by a course of voice therapy

Editorial by Carding


Many patients have transient, self limiting changes in voice, but those who have been hoarse for more than three weeks need specialist assessment to exclude underlying laryngopharyngeal pathology. Once conditions that need surgery have been excluded, patients are usually referred to a speech and language therapist for voice therapy. Up to 40 000 patients with dysphonia are referred for voice therapy annually in the United Kingdom.1 At the time of referral, many patients with vocal dysfunction have entered a vicious cycle in which psychological factors exacerbate voice pathology and poor voice quality adversely affects psychological wellbeing.2-9 The relation between these factors is complex, and the relative influence of each factor varies from individual to individual.

No study has yet examined the overall effectiveness of voice therapy for dysphonia in terms of either changes in voice quality or changes in psychological distress or laryngoscopic findings. We aimed to examine the efficacy of voice therapy in patients with dysphonia and to identify those patients for whom voice therapy might be most beneficial.

Participants and methods

We recruited consecutive outpatients attending the department of otorhinolaryngology and head and neck surgery of Glasgow Royal Infirmary with a primary complaint of dysphonia (hoarseness) present for a minimum of two months and without any relevant organic pathology (for example, polyp, papilloma, tumour, vocal cord palsy) or need for surgery.

The inclusion criteria were age greater than 16 years, motivation to resolve the voice problem, and willingness to enter into regular voice therapy sessions. The exclusion criteria were previously treated dysphonia, neurological disease, or upper aerodigestive tract malignancy; marked hearing impairment; acid reflux; multiple medical complaints; professional voice user requiring urgent intervention; puberphonia; and transsexual conflict.

The 204 patients (51 men, 153 women) who gave informed consent for inclusion were new referrals typical of patients referred for voice therapy. At entry to the study, 100 patients were randomised to voice therapy and 104 to no treatment. By completion of the study 12-14 weeks later, about a third of participants had dropped out or been excluded, leaving 70 patients in the treatment group and 63 patients in the observation group (figure). This attrition was mostly a result of patients defaulting. In addition to failure to reattend, some patients failed to complete all self report questionnaires, notably where these had to be returned by post, despite the issuing of prepaid envelopes. Another subgroup omitted a number of items within questionnaires. The voice therapy and no treatment groups were not significantly different in terms of either rate of attrition (30% in the therapy group, 39% in the no treatment group) or characteristics of patients who dropped out (sociodemographic variables or baseline voice or psychological variables). There is thus no evidence that the attrition will have introduced bias. Analysis of each outcome was conducted only on patients with complete data.


Pathophysiology—An otolaryngologist (KMacK) used a flexible nasolaryngoscope (Olympus ENF 3) to assess four features—nodule formation, laryngitis, glottic escape, and hyperfunction of the laryngeal musculature—on a four point (0-3) rating scale. Although stroboscopic findings may facilitate the diagnosis of subtle laryngeal dysfunction, we decided not to use our stroboscope because we wanted the study to reflect routine clinical practice throughout the United Kingdom.

Voice quality—A digital tape recording of the patient's reading of the phonetically balanced “rainbow” passage (a standard paragraph used in voice assessment) was analysed by a speech and language therapist blind to treatment group.10 Ten aspects of voice, including laryngeal tone, pitch, and resonance, were rated 0-5 on the Buffalo voice profile scale,11 of which the “overall rating” item was chosen as the key observer rated variable. The same therapist also extracted two key objective measures of voice quality—“jitter” (pitch perturbation) and “shimmer” (amplitude perturbation)—by using the Computerised Speech Laboratory (model 4300B; Kay Elemetrics Corp, NJ). The higher the score on these two variables the more dysphonic the voice. Patients rated their own voice quality using the validated vocal performance questionnaire,12 with five point scoring on 12 items (1-5, 5=worst). The total voice profile questionnaire score was the key self reported outcome measure.

Psychological measures—The hospital anxiety and depression scale is a self completed questionnaire which assesses recent anxiety and depression and provides an overall score for each.13 The revised clinical interview schedule was conducted by a trained psychologist blind to the treatment group.14 The interviewer rated 14 aspects of non-psychotic psychiatric disturbance. The clinical interview schedule's overall distress score and the hospital anxiety and depression scale's anxiety score were the key outcome measures of psychological distress.

Quality of life was assessed by the SF-36, an eight dimension, extensively validated assessment of general health status.15

Treatment protocol

The voice therapy group initially comprised 100 patients; eight patients were excluded from the study before therapy. These exclusions occurred for a range of reasons, most of which (for example, dementia) only emerged at the follow up appointment. Two patients withdrew from the study after the first visit. In addition to the eight patients who were excluded after allocation to the therapy group (two of whom withdrew), eight patients failed to attend the first therapy appointment. A further 10 patients defaulted during therapy. This meant that 74 patients completed a course of voice therapy, 70 of whom had complete data sets.

Participants entering the treatment group of the study were given an appointment at which a routine voice history was taken. A recording of the “rainbow” passage was taken for possible reference in therapy. In line with current practice,16 and depending on clinical circumstances, a number of patients (11 out of the 74 who completed treatment) received advice on good vocal hygiene and optimal voice production and did not proceed to a fuller programme of voice therapy, although this option remained open to them until attendance for the second visit. The remainder of the patients (63) underwent voice therapy for up to six sessions (eight sessions in one case). The treatment sessions lasted 45-60 minutes. The number of sessions needed and the type of treatment carried out were determined heuristically, depending partly on the nature of the symptoms and partly on patients' priorities.16 Treatment could be indirect—for example, involving discussion of issues of vocal hygiene or of lifestyle impinging on voice production. Equally, patients could be invited to practise techniques related to their vocal symptoms—for example, improving breath support for vocal production or altering vocal onset in favour of “softer” vocal attack. Because of the limited possibilities for contact with patients, therapy favouring attention to vocal symptoms predominated, but issues were incorporated relating to possible underlying psychological distress. Indeed, where these were clearly the focus of concern to the patient, a counselling approach was adopted.

The programme of voice therapy ended by mutual agreement between the treating therapist and the individual patient. Completion of therapy did not imply complete recovery at that time but rather a recognition that good vocal practice needed ongoing attention over a sustained period beyond the contact limits set for this project.


Baseline data were recorded after eligibility had been assessed and consent had been obtained by the laryngologist. The participants were then seen by one speech and language therapist (CS), who obtained a number for random allocation of the participant to either a course of voice therapy or a period of observation. Computer generated random numbers produced in a restricted randomisation form were supplied by an independent worker in a separate department. All voice therapy was delivered by CS according to a protocol (see box) derived from a review of the type, form, and frequency of voice therapy used by a substantial sample of speech and language therapists in the United Kingdom.16 CS was not involved in the collection, storage, or analysis of outcome data in any way. Equally, the details of the treatment of this group were at no time communicated to any member of the research assessment team involved in collecting outcome data. After six weeks of therapy or observation, data on pathophysiology, voice quality, psychological status, and quality of life (with the exception of the clinical interview schedule) were recorded. After a further 6-8 weeks, all measurements were repeated, and the clinical interview schedule was conducted.

Before the study, the intended number of patients to be recruited in the treatment and non-treatment groups was determined by assuming a medium effect size of treatment (0.5 SD units). This is a conservative effect size compared with that indicated in the available literature. Also, greater than 90% power was sought with α set at 0.05 (two tailed). The target chosen was 100 patients in each group, which offered 94% power. At the end of the study we had data from 70 and 63 people in the two groups; with the same assumptions, this offered 81% power.

Statistical analysis

Statistical analyses compared the mean difference in the outcome variables between the groups with and without treatment. We conducted separate analyses at the end of treatment and at the later follow up. Each analysis took into account the baseline (entry to study) score for each of the outcome variables. We thus used an analysis of covariance procedure with group (treatment versus no treatment) as a between patients variable; we used people's baseline scores on the particular variable being compared as covariates. We used baseline values as the covariates for both the end of treatment and follow up analyses.

Assessment of the effect of voice therapy on pathophysiological outcomes needed a categorical approach. We subtracted ratings for each pathophysiological feature for each patient at the end of treatment (visit 2) from those at baseline (visit 1); we also subtracted ratings for each feature at the end of follow up (visit 3) from those at visit 1. We then assigned patients to a category (0, 1, or 2) according to whether they had improved, deteriorated, or stayed the same. We performed a series of χ2 intergroup comparisons.

We calculated treatment effects as mean differences at the relevant outcome (visits 2 and 3) controlled for baseline scores in the respective measure. We used general linear modelling (analysis of covariance) in SPSS 9/10 to perform the analysis.


As expected, most patients in both groups were women; the groups were closely matched for age (table 1). Laryngeal features at study entry were similar in the intervention and control patients (either in the 204 patients originally recruited or in the 145 with repeat laryngoscopy at six weeks). Grade 2-3 (moderate to severe) scores were uncommon for all of the four features, and only minimal resolution of the abnormalities occurred between the two time points (table 1).


Laryngoscopic data for treatment and no treatment groups at baseline and first follow up (only those who attended at first follow up are included at baseline). Values are numbers (percentage) of patients unless otherwise stated

View this table:

The groups were well matched at entry to the study for subjective and objective voice variables (table 2). The treatment and no treatment groups differed at baseline only on the hospital anxiety and depression scale anxiety scores, which were significantly higher in the control group (t=2.67, P=0.008).


Mean (SD) scores for key voice and psychological variables at each visit for treatment and no treatment groups (patients with complete data)

View this table:

This difference between the treatment and no treatment groups was evident in the original 204 randomised recruits and in the 133 patients who completed all three phases of the study. Both the treatment and no treatment groups had high baseline scores for anxiety on the hospital anxiety and depression scale compared with healthy controls from the scale's normative reference data, though not necessarily with other otorhinolaryngology patients. 8 9 The baseline SF-36 quality of life scores reflect a severely impaired quality of life in both the treatment and no treatment groups compared with other groups of patients.15

Effectiveness of voice therapy

By the end of treatment voice therapy significantly improved self rated quality of voice as measured by the voice profile questionnaire and the measurement of amplitude perturbation or “shimmer” by the Computerised Speech Laboratory. At follow up the patients in the treatment group had significantly lower scores than those in the no treatment group on the Buffalo overall rating and the voice profile questionnaire total score. Treatment effects (points) and 95% confidence intervals were calculated for each of the outcome variables at both completion of treatment and completion of follow up (tables 2 and 3). All participants with data at baseline and follow up were included. For the voice profile questionnaire the effect was 4.1 points (effect size 0.54 SD). For the Buffalo scale the effect was 0.82 points (effect size 0.76 SD). In conventional statistical terminology these are medium to large effects. Voice therapy had an effect on only one quality of life outcome variable—mental health. This was significantly better in the treatment group at completion of treatment but not at completion of follow up.


Mean (SD) scores for quality of life (SF-36) variables at each visit for treatment and no treatment groups (patients with complete data). Higher scores indicate better health

View this table:

To address the issue of dropout we re-ran the analyses including all patients with data at baseline. For patients with missing data at visit 2 or visit 3 we entered the baseline values. This makes the conservative assumption that there was no difference in outcome between patients who dropped out from the treatment and no treatment groups. On reanalyses of the sensitivity scores, treatment effects (points), and confidence intervals for each of the outcome measures, the results retained their significant P values and treatment effects (tables 4 and 5).


Mean (SD) scores for key voice and psychological variables at each visit for treatment and no treatment groups (all patients with baseline data)

View this table:

Mean (SD) scores for quality of life (SF-36) variables at each visit for treatment and no treatment groups (all patients with baseline data). Higher scores indicate better health

View this table:


This first randomised controlled trial of the efficacy of voice therapy for dysphonia has shown voice therapy to be effective in improving self rated and expert rated quality of voice. The magnitude of the observed mean improvements reflects clinically meaningful improvements in voice quality. The minimal change in laryngoscopic appearances during the study reflects the fact that many of the patients referred for non-surgical voice therapy have, by definition, relatively normal laryngeal appearances.

Psychological distress was not significantly reduced as a result of treatment. Voice therapy had a significant effect on one quality of life variable—mental health—at the end of treatment, but this was not maintained at follow up. A subgroup of patients remain psychologically distressed despite receiving treatment. Speech and language therapists often use psychological strategies but often acquire psychological training after qualification and in what has been described as an ad hoc manner.17 If patients with high psychological distress could be identified by screening they could be referred for psychological intervention, perhaps from a clinical psychologist.18

The disconcertingly abnormal SF-36 results highlight the importance of effective vocal communication for an individual's psychosocial wellbeing. Indeed, the level of psychological morbidity may also mainly reflect the greatly reduced quality of life in patients with dysphonia. Such interrelations underline the importance of a holistic treatment for reduction in symptoms and improvement in overall functioning.

In conclusion, this study shows that voice therapy is effective in improving self rated and observer rated measures of voice quality. However, voice therapy does not significantly affect laryngeal pathophysiology or reduce the high levels of psychological distress that characterise patients with dysphonia.


We acknowledge the contributions of Nicola Bradshaw and Shonagh Scott, who were both involved as research assistants in the collection of data for this paper; Dr Martha Whiteman for running the sensitivity analyses; Catherine Dunnet, chief speech and language therapist at Glasgow Royal Infirmary, for contributions to the project and comments on a draft of the paper; and Professor Stuart Gatehouse for his comments on an earlier draft of the paper.

Contributors: KMacK initiated and coordinated the formulation of the primary study hypothesis, discussed core ideas, designed the protocol, and participated in data collection and the writing of the paper. AM discussed core ideas, coordinated the study, carried out the psychological analysis, ran the statistical analyses of treatments, and was involved in writing the paper. JAW initiated and coordinated the formulation of the primary study hypothesis, discussed core ideas, designed the protocol, and contributed to writing the paper. CS discussed core ideas, designed the protocol, and carried out all the voice therapy. IJD initiated and coordinated the formulation of the primary study hypothesis, discussed core ideas, designed the protocol, supervised the statistical analyses, and was principally involved in writing the paper. The guarantor for the paper is KMacK.


  • Funding Scottish Office Home and Health Department (grant reference: K/RED/4/C249).

  • Competing interests None declared.


  1. 1.
  2. 2.
  3. 3.
  4. 4.
  5. 5.
  6. 6.
  7. 7.
  8. 8.
  9. 9.
  10. 10.
  11. 11.
  12. 12.
  13. 13.
  14. 14.
  15. 15.
  16. 16.
  17. 17.
  18. 18.
View Abstract