Adverse obstetric outcomes after local treatment for cervical preinvasive and early invasive disease according to cone depth: systematic review and meta-analysisBMJ 2016; 354 doi: https://doi.org/10.1136/bmj.i3633 (Published 28 July 2016) Cite this as: BMJ 2016;354:i3633
- Maria Kyrgiou, senior lecturer1 2,
- Antonios Athanasiou, medical student3,
- Maria Paraskevaidi, research fellow1,
- Anita Mitra, clinical research fellow1 2,
- Ilkka Kalliala, clinical postdoctoral fellow1,
- Pierre Martin-Hirsch, professor45,
- Marc Arbyn, coordinator of unit6,
- Phillip Bennett, professor1 2,
- Evangelos Paraskevaidis, professor3
- 1Institute of Reproductive and Developmental Biology, Department of Surgery and Cancer, Faculty of Medicine, Imperial College, London, UK
- 2Queen Charlotte’s and Chelsea-Hammersmith Hospital, Imperial Healthcare NHS Trust, London, UK
- 3University Hospital of Ioannina, Ioannina, Greece
- 4Department of Gynaecological Oncology, Lancashire Teaching Hospitals, Preston, UK
- 5Department of Biophotonics, Lancaster University, Lancaster, UK
- 6Unit of Cancer Epidemiology, Scientific Institute of Public Health, Brussels, Belgium
- Correspondence to: M Kyrgiou
- Accepted 26 June 2016
Objective To assess the effect of treatment for cervical intraepithelial neoplasia (CIN) on obstetric outcomes and to correlate this with cone depth and comparison group used.
Design Systematic review and meta-analysis.
Data sources CENTRAL, Medline, Embase from 1948 to April 2016 were searched for studies assessing obstetric outcomes in women with or without previous local cervical treatment.
Data extraction and synthesis Independent reviewers extracted the data and performed quality assessment using the Newcastle-Ottawa criteria. Studies were classified according to method and obstetric endpoint. Pooled risk ratios were calculated with a random effect model and inverse variance. Heterogeneity between studies was assessed with I2 statistics.
Main outcome measures Obstetric outcomes comprised preterm birth (including spontaneous and threatened), premature rupture of the membranes, chorioamnionitis, mode of delivery, length of labour, induction of delivery, oxytocin use, haemorrhage, analgesia, cervical cerclage, and cervical stenosis. Neonatal outcomes comprised low birth weight, admission to neonatal intensive care, stillbirth, APGAR scores, and perinatal mortality.
Results 71 studies were included (6 338 982 participants: 65 082 treated/6 292 563 untreated). Treatment significantly increased the risk of overall (<37 weeks; 10.7% v 5.4%; relative risk 1.78, 95% confidence interval 1.60 to 1.98), severe (<32-34 weeks; 3.5% v 1.4%; 2.40, 1.92 to 2.99), and extreme (<28-30 weeks; 1.0% v 0.3%; 2.54, 1.77 to 3.63) preterm birth. Techniques removing or ablating more tissue were associated with worse outcomes. Relative risks for delivery at <37 weeks were 2.70 (2.14 to 3.40) for cold knife conisation, 2.11 (1.26 to 3.54) for laser conisation, 2.02 (1.60 to 2.55) for excision not otherwise specified, 1.56 (1.36 to 1.79) for large loop excision of the transformation zone, and 1.46 (1.27 to 1.66) for ablation not otherwise specified. Compared with no treatment, the risk of preterm birth was higher in women who had undergone more than one treatment (13.2% v 4.1%; 3.78, 2.65 to 5.39) and with increasing cone depth (≤10-12 mm; 7.1% v 3.4%; 1.54, 1.09 to 2.18; ≥10-12 mm: 9.8% v 3.4%, 1.93, 1.62 to 2.31; ≥15-17 mm: 10.1% v 3.4%; 2.77, 1.95 to 3.93; ≥20 mm: 10.2% v 3.4%; 4.91, 2.06 to 11.68). The choice of comparison group affected the magnitude of effect. This was higher for external comparators, followed by internal comparators, and ultimately women with disease who did not undergo treatment. In women with untreated CIN and in pregnancies before treatment, the risk of preterm birth was higher than the risk in the general population (5.9% v 5.6%; 1.24, 1.14 to 1.35). Spontaneous preterm birth, premature rupture of the membranes, chorioamnionitis, low birth weight, admission to neonatal intensive care, and perinatal mortality were also significantly increased after treatment.
Conclusions Women with CIN have a higher baseline risk for prematurity. Excisional and ablative treatment further increases that risk. The frequency and severity of adverse sequelae increases with increasing cone depth and is higher for excision than for ablation.
The mean age of women undergoing local treatment for cervical preinvasive cervical disease (cervical intraepithelial neoplasia or CIN) is similar to the age of women having their first child. Local cervical treatment has been correlated to an increased risk of preterm birth, perinatal morbidity, and mortality in a subsequent pregnancy.1 2 3 4 5 6 The underlying mechanism is unclear; hypotheses include immunomodulation relating to infection with human papillomavirus (HPV) affecting parturition pathways and acquired “mechanical weakness” secondary to loss of cervical tissue.7 8
In England alone in 2013-14, about 3.6 million women aged 25-64 attended for cervical screening, and over 23 800 cervical procedures were carried out,9 nearly all in an outpatient setting. In contrast, in the United States there are about 400 000 cases of preinvasive disease a year.10 The regulations in colposcopy are more liberal, leading to wide variation in clinical practice. In Germany, treatment for CIN is still commonly performed with the cold knife under general analgesia.11 The long term sequelae of treatment therefore remain an important international issue to healthcare professionals and women, whatever the clinical setting.
Since the first systematic review almost a decade ago on reproductive risk associated with treatment1 more than 50 observational studies have been published confirming12 13 or disputing these associations14 15; some of these reported data from large population based datasets. Individual attempts to synthesise parts of this rapidly evolving evidence base in small systematic reviews and meta-analyses reached contradictory conclusions1 2 3 4 16 17 18 19 and initiated debates and confusion within the scientific community.2 16 17 18 19 Whether these discrepancies were due to questionable quality of some of these primary and secondary studies or to differences in the explored comparisons,4 16 17 18 the subject is open to a definitive comprehensive high quality synthesis of the existing evidence that will be highly informative to women, clinicians, and policy makers.
Media publicity has heightened public awareness that treatment for cervical precancer is associated with increased reproductive morbidity. There has been a substantial increase in inquiries from patients and clinicians on the risks associated with different treatment techniques and cone depths20 21 and as to how this risk can be managed and prevented. With a rapidly evolving evidence base and lack of a robust synthesis of the published literature, these questions are becoming increasingly difficult to answer.
We carried out a systematic review and meta-analysis to explore the impact of treatment for cervical preinvasive and early invasive disease on obstetric outcomes and how this risk could be modified by the cone depth and comparison group.
Inclusion criteria and outcomes
We included all studies that reported on obstetric outcomes (over 24 weeks’ gestation) in women who had previously received local cervical treatment for CIN or early invasive cervical cancer compared with outcomes in women with no history of treatment. Studies reporting on the outcomes after two or more treatments were also included. The interventions included any type of treatment: excisional (cold knife conisation; laser conisation; needle excision of the transformation zone, also known as straight wire excision; large loop excision of the transformation zone, also known as loop electrosurgical excisional procedure) or ablative (laser ablation; radical diathermy; cold coagulation; and cryotherapy). In studies that reported on the impact of several techniques, when possible we extracted data for each specific method. If the outcomes were not reported separately for each technique, we analysed the intervention under broader terms—that is, excisional treatment not otherwise specified, ablative treatment not otherwise specified, and treatment not otherwise specified.
Women were included irrespective of the grade of the lesion for both squamous and glandular intraepithelial neoplasia. We excluded studies that did not include an untreated reference population, compared different treatment techniques without an untreated control, or compared outcomes for treatments performed during pregnancy.
Studies were included irrespective of the type of untreated reference population that could have been drawn from one of the following sources: external group from general population that was mostly matched or adjusted for confounders; internal group with self matching of the pregnancies for the same women before and after treatment; internal group of women who had also delivered before treatment; women undergoing colposcopy with or without CIN/biopsy but no treatment; women with high grade disease but no treatment (high grade squamous intraepithelial lesion).
We assessed obstetric outcomes of pregnancies progressing beyond 24 weeks’ gestation. We examined both maternal and neonatal outcomes. The maternal outcomes included overall (<37 weeks’ gestation), severe (<32-34 weeks), and extreme (<28-30 weeks) prematurity (all preterm birth, iatrogenic and spontaneous). We also assessed preterm birth in singleton and multiple pregnancies, in nulliparous and parous women, for single and repeat cones, for different cone depths and volumes, and for different comparison groups. We further assessed other maternal outcomes that included: overall (<37 weeks’ gestation), severe (<32-34), and extreme (<28-30)spontaneous prematurity—that is, non-iatrogenic); threatened preterm birth; premature rupture of the membranes; chorioamnionitis; mode of delivery (caesarean section, instrumental deliveries); length of labour (precipitous, prolonged); induction of labour or use of oxytocin; haemorrhage (antepartum, postpartum); analgesia (epidural, pethidine, not otherwise specified); cervical stenosis; and cervical cerclage. The neonatal outcomes included: low birth weight (<2500 g, <2000 g, <1500 g, and <1000 g), admission to neonatal intensive unit, perinatal mortality, stillbirth, and Apgar score.
When there was heterogeneity in the cut offs used in different studies for cone depth and classification of prematurity, we grouped these together when possible (that is, 32-34 weeks included both cut offs; 10-12 mm cone depth included studies grouping at both these cut offs with and without the values equal to these numbers).
Literature search, data extraction, and risk of bias
We searched three electronic databases (CENTRAL, Medline, and Embase) and targeted reports published between 1948 and April 2016. We used keywords including “cervical intraepithelial neoplasia (CIN)”, “cervical cancer”, “LLETZ or LEEP”, “conisation”, “excision”, “pregnancy”, “obstetric”, “preterm birth,” and “prematurity”. The full strategy is included in appendix 1. In an attempt to identify any articles missed by the initial search or any unpublished data, we hand searched the references of the retrieved articles and meta-analyses and the proceedings of relevant conferences. There was no language restriction.
From each study, we extracted data on the study design and setting, the study population, the interventions examined, the comparison group, the quality of the data and risk of bias, and the outcomes assessed. From each study and for each outcome we retrieved the number of events in treated and untreated women. If necessary, we contacted authors to obtain additional data if the numbers provided in the published report did not allow sufficient precision in the data extraction.
We used the Newcastle-Ottawa score to formally assess the quality of non-randomised cohort studies,22 according to the MOOSE checklist.23 This scoring system assesses the cohort selection, comparability, and assessment of outcomes to give a maximum score of 9 (highest quality).
Two investigators (MK, AA) independently performed the literature search, assessed the eligibility and quality of the retrieved papers, and performed the data extraction. The two authors then compared the results and disagreements were resolved by discussion. If required, consensus was reached with the involvement of a third investigator (MA).
Data synthesis and assessment of heterogeneity
We calculated the risk ratios and 95% confidence intervals for each reported outcome in the treated versus untreated women for dichotomous outcomes using Cochrane Revman 5 software. We used a random effect model and inverse variance weighting for all meta-analyses.24 In studies with multiple treatment groups, we proportionally divided the “shared” comparison group into the number of treatment groups; we treated comparisons between each treatment group and the split comparison group as independent comparisons. If a study presented data for more than one comparison group, we used the external comparison group of women with or without disease in preference to internal controls. If data were not of suitable quality for meta-analysis, we reported the results as a narrative in the text of the review.
We assessed heterogeneity between studies with the Cochran Q test, visual inspection of forest plots,25 estimation of the percentage of heterogeneity between studies that cannot be ascribed to sampling variation (I2 statistic),26 and a formal test of the significance for heterogeneity.27 If there was evidence of substantial heterogeneity, the possible reasons for this were investigated and reported.
We performed a series of subgroup analyses. We analysed the data separately for each treatment, in groups of ablative and excisional techniques, and as a whole irrespective of the type of method used. We further analysed the data according to the cone depth. Given the non-randomised nature of the included studies, we assessed whether the choice of comparison group affected the risk estimate for each outcome and overinflated the effect of treatment that could be partly attributed to other confounders. We therefore distinguished the different untreated comparison groups used across studies and performed subgroup analyses for the risk of preterm birth for each individual comparator (external; internal (self matching); internal (pregnancies before treatment); colposcopy but no treatment; high grade squamous intraepithelial lesion but no treatment). Furthermore, we performed sensitivity analysis to assess the impact of the quality of the studies on some selected outcomes. We calculated the median score from the Newcastle-Ottawa scale and performed sensitivity analysis for studies that scored more than the median. We performed subgroup analyses based on the cohort selection in the Newcastle-Ottawa score (truly or somewhat representative) and the comparability of the groups (those with scores of 1 or 2). Finally, we performed meta-regression analysis to assess the impact of some factors on the risk of preterm birth (<37 weeks). These included the quality of the studies (based on the Newcastle-Ottawa score); year of study (1979-89, 1990-99, 2000-09, 2010-15); type of treatment (excision or ablation); type of comparator (external, internal-pregnancies before treatment, internal-self matching, CIN but no treatment, high grade squamous intraepithelial lesion but no treatment).
Patients and the wider public were involved from the outset through informal interviews in the clinic and through patient advocate representative bodies. The research questions and outcomes were developed based on the patients’ concerns and priorities. Patients were not involved in the interpretation of results or writing of the article. The results will be disseminated to the lay audience through the authors’ involvement with charities and through public presentations.
We identified 406 potentially eligible studies that fulfilled the inclusion criteria of this review.5 12 13 14 15 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 No unpublished studies were identified. We excluded studies without an untreated reference population,94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 studies that included women treated during pregnancy,120 121 studies assessing fertility and early pregnancy outcomes below 24 weeks’ gestation,122 123 124 125 126 127 studies assessing outcomes after treatment in high risk populations,128 129 and studies assessing the impact of CIN on outcomes without information as to whether treatment was performed.130 131 132Figure 1⇓ shows more details of the literature search and the reasons for exclusion.133
Table A in appendix 2 shows detailed characteristics of the included studies and the outcomes examined. Most studies were retrospective, with only five prospective reports.71 77 78 80 82 All were cohort studies, apart from one case-control study by Castanon and colleagues.85 There were no randomised controlled studies. Fourteen studies examined the impact of cold knife conisation,13 28 29 30 32 33 34 37 60 61 62 82 87 89 10 studied laser conisation,42 46 47 48 49 51 52 56 76 78 one studied needle excision of the transformation zone.13 34 studied large loop excision of the transformation zone,13 39 40 41 44 45 50 55 56 57 58 59 60 62 63 65 66 67 68 69 73 74 76 77 78 80 81 82 83 86 87 88 90 91 eight studied laser ablation,35 38 39 47 49 54 56 62 one studied radical diathermy,62 two studied cryotherapy,31 60 16 studied excision not otherwise specified,5 12 14 15 53 64 70 71 72 75 78 79 84 85 90 93 five studied ablation not otherwise specified,12 14 53 70 87 and three studied treatment not otherwise specified.36 43 92 There were five types of untreated comparison groups. Some used an external comparator,5 12 13 14 15 28 29 33 35 36 37 38 39 40 41 42 43 44 45 48 49 51 52 53 54 55 57 58 59 60 61 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 83 86 87 89 92 93 others compared with the pregnancies before treatment in the treated population (internal)5 15 30 31 32 34 45 46 47 58 73 74 84 91 or used self matching for women who delivered both before and after treatment (internal),13 15 43 48 51 64 66 some compared with women who underwent colposcopy with or without CIN and/or biopsy but had no treatment,15 56 62 63 67 68 77 81 82 84 88 90 91 93 and some with women with high grade disease but no treatment.13 53 70 All studies that used an external comparison group either matched for known risk factors or performed regression analysis to control for known confounders; four studies did not control for any confounders.43 61 65 76
Table B in appendix 2 provides more details on the quality assessment for observational studies with the Newcastle-Ottawa score. Most studies scored 8 or 9 points, 10 scored 7,30 35 43 45 46 47 50 61 72 76 and two scored 6.38 65
The risk of preterm birth was significantly increased after cervical treatment (table 1⇓). For all treatment types, this was the case for overall prematurity at less than 37 weeks’ gestation (relative risk 1.78, 95% confidence interval 1.60 to 1.98), for severe prematurity less than 32-34 weeks’ gestation (2.40, 1.92 to 2.99), and extreme prematurity less than 28-30 weeks’ gestation (2.54, 1.77 to 3.63) (table 1⇓). Figure 2⇓ shows the risk associated with LLETZ versus no treatment. The forest plot in appendix 3 shows the risks for all treatment techniques versus no treatment. The magnitude of the effect of treatment was higher for more radical treatment techniques and for excision rather than ablation. More specifically, the risk of preterm birth at less than 37 weeks’ gestation was higher for cold knife conisation (2.70, 2.14 to 3.40), laser conisation (2.11, 1.26 to 3.54), excision not otherwise specified (2.02, 1.60 to 2.55), large loop excision of the transformation zone (1.56, 1.36 to 1.79), and ablation not otherwise specified (1.46, 1.27 to 1.66). Similar trends were noted for severe and extreme prematurity.
Treatment was also associated with an increased risk of preterm birth for women with multiple pregnancies for some but not all treatments (table C, appendix 2). Have asked author, but the results were inconsistent because of the small number of studies. The impact of treatment was no different for nulliparous and multiparous women (data not shown). The effect of multiple as opposed to single treatments on the risk of prematurity was substantially higher in comparisons with untreated women (relative risk 3.78 (95% confidence interval 2.65 to 5.39) for repeated treatment and 1.75 (1.49 to 2.06) for single treatment, table 2⇓). Compared with no treatment, the relative risk of preterm birth for two excisional treatments not otherwise specified was as high as 5.48 (2.68 to 11.24) and that of two loop excisions as high as 2.81 (2.33 to 3.39).
The analysis of the risk according to the cone dimensions showed that the risk increases progressively with increasing cone depth (table 3⇓; fig 3⇓) or cone volume (table 3⇓). The risk for treated versus untreated women was significantly higher for women with cone depth ≤10-12 mm (relative risk 1.54, 95% confidence interval 1.09 to 2.18). The magnitude of effect increased with increasing cone depth (1.93 (1.62 to 2.31) for ≥10-12 mm, 2.77 (1.95 to 3.93) for ≥15-17 mm, and 4.91 (2.06 to 11.68) for ≥20 mm; table 3⇓). The trend was similar with increasing cone volume (2.25 (1.09 to 4.66) for ≤6 cc and 13.9 (5.09 to 37.98) for ≥6 cc; table 3⇓). Further analyses of the individual cone depth cut offs not grouped together showed similar results (data not shown).
The comparison of women treated with different cone depths showed that deeper excisions significantly increased the risk of preterm birth compared with less deep excisions, and the magnitude of the effect increased in deeper cones. The relative risk were 1.54 (95% confidence interval 1.31 to 1.80) for ≥10-12 mm v ≤10-12 mm, 1.82 (1.47 to 2.26) for ≥15-17 mm v ≤15-17 mm, and 1.82 (1.47 to 2.26) for ≥20 mm v ≤20 mm (fig 4⇓). Full data are also provided in table D, appendix 2. The findings were similar for the comparison of cone volumes (2.04 (0.95 to 4.39) for ≥3-4 cc v ≤3-4 cc (15.0% v 7.3%, one study, 278 women); 6.18 (2.53 to 15.13) for ≥6 cc v ≤6 cc (50.0% v 8.1%, one study, 278 women).
We assessed the impact that the choice of comparison group can have on the magnitude of effect in a subgroup analysis that classified different studies according to the comparator used (table 4⇓). The results suggested that treatment significantly increased the risk of preterm birth at less than 37 weeks’ gestation irrespective of the comparison group used. The magnitude of effect was higher when an external comparison group was used (relative risk 1.93, 95% confidence interval 1.71 to 2.17), followed by internal comparators (1.52 (1.17 to 1.97) for self matching and 1.42 (1.01 to 1.99) for pregnancies before treatment), and ultimately women with disease but no treatment (1.27, 1.14 to 1.41). In women with untreated CIN, and in pregnancies before treatment, the risk of preterm birth compared with general population was significantly higher (1.24, 1.14 to 1.35). The subgroup analysis of the risk of preterm birth according to cone depth and comparison group showed a similar direction of effect, although for cone depth ≤10-12 mm the difference became insignificant. The number of studies was small for many comparisons. For treated versus untreated women with CIN there were four studies for cone depth ≤10-12 mm (43 145 women, 7.0% v 5.0%, relative risk 1.11, 95% confidence interval 0.85 to 1.43), four studies for cone depth ≥10-12 mm (45 275 women, 9.6% v 5.0%, 1.52, 1.37 to 1.68), three studies for cone depth ≥15-17 mm (33 934 women, 9.6% v 4.3%, 2.30, 1.57 to 3.35), and two studies for cone depth ≥20 mm (32 717 women, 9.3% v 4.2%, 4.32, 0.93 to 20.03) (table E, appendix 2). Furthermore, the sensitivity analysis that excluded studies that scored below the median Newcastle-Ottawa score (8.3) did not change the results of the analysis; similarly the results did not change when we excluded studies that scored ≤7 and ≤6 (data not shown). The subgroup analyses of studies based on the cohort selection or the comparability of the comparison groups showed similar direction and magnitude of effect (data not shown). The univariate meta-regression analysis suggested that the type of treatment and comparator significantly affected the risk of preterm birth, although the type of treatment and Newcastle-Ottawa score did not. These factors remained significant in a multivariate regression analysis. When we performed further meta-regression restricting only to excisional treatments and using as a comparator women with colposcopy/biopsy, we found that all treatments were associated with an increased risk of preterm birth (1.34, 1.10 to 1.64, for large loop excision of the transformation zone; 2.3, 1.39, 3.85, for cold knife conisation; 1.6, 0.91 to 2.87, for laser conisation; and 4.26, 1.96 to 9.33, for needle excision of the transformation zone).
Several studies assessed other adverse maternal outcomes (table F, appendix 2), and many of these were found to be increased after cervical treatment.This increase was more commonly associated with excisional than ablative techniques and with more radical treatment, although the number of studies assessing each individual treatment method was often small. Cervical treatment was associated with an increased risk of spontaneous overall, severe, and extreme preterm birth (<37 weeks: 14 studies, 1 024 731 women, 7.0% v 3.7%; relative risk 1.76, 95% confidence interval 1.47 to 2.11; <32-34 weeks: seven studies, 655 675 women, 1.8% v 0.6%; 2.63, 1.91 to 3.62; <28 weeks: two studies, 626 670 women, 0.6% v 0.2%, 3.18, 1.64 to 6.16) and admissions for threatened preterm birth (five studies, 903 women, 9.1% v 3.2%, 2.44, 1.37 to 4.33). The risk (<37 weeks) was higher for cold knife conisation (3.53, 2.05 to 6.05) followed by excision not otherwise specified (1.70, 1.17 to 2.46), large loop excision of the transformation zone (1.60, 1.22 to 2.08), and ablation not otherwise specified (1.42, 1.20 to 1.70). Needle excision of the transformation zone and laser ablation were each assessed in only one study. There was substantial heterogeneity for the comparisons assessing all gestational categories (P<0.05).
The risk of premature rupture of membranes (<37 weeks: 21 studies, 477 011 women; 6.1% v 3.4%, relative risk 2.36, 95% confidence interval 1.76 to 3.17) and chorioamnionitis (four studies, 29 198 women, 3.5 v 1.1%; 3.43, 1.36 to 8.64) was also increased after treatment. Risk was higher after cold knife conisation (4.11, 2.05 to 8.25) followed by large loop excision of the transformation zone (2.15, 1.48 to 3.12). Needle excision of the transformation zone was assessed in only one study, and laser ablation did not significantly affect the risk but was assessed in only two studies.
The mode of delivery (caesarean section or instrumental delivery), the length of labour (precipitous or prolonged), the use of analgesia (epidural, pethidine, or other), the rate of induction of labour (with or without oxytocin), cervical stenosis, and haemorrhage (antenatal or postpartum) were not affected by treatment. As expected, the rate of cervical cerclage insertion was higher for treated than non-treated women (eight studies, 141 300 women, 4.0% v 0.7%, relative risk 14.29, 95% confidence interval 2.85 to 71.65) and more so for cold knife conisation (31.42, 2.32 to 426.2), large loop excision of the transformation zone (11.0, 0.64 to 190), or excisional treatment not otherwise specified (42.45, 28.99 to 62.16).
More than 30 studies assessed one or more neonatal outcomes (table G, appendix 2). Cervical treatment (excisional or ablative) was associated with a significant increase in adverse neonatal outcomes compared with outcomes in women who did not undergo treatment (comparison group not specified). The association with adverse neonatal events was stronger and more common for excisional rather than ablative techniques and with increasing treatment radicality, although the number of studies for each individual treatment technique was often limited.
More specifically, cervical treatment overall was associated with an increased risk of low birth weight (<2500 g: 30 studies, 1 348 206 women, 7.9% v 3.7%, relative risk 1.81, 95% confidence interval 1.58 to 2.07; <1500 g: five studies, 76 836 women, 2.0% v 0.5%, 3.00, 1.54 to 5.85), admission to a neonatal intensive unit (eight studies, 2557 women, 12.6% v 8.9%, 1.45, 1.16 to 1.81), and perinatal mortality (23 studies, 1 659 433 women, 0.9% v 0.7%, 1.51, 1.13 to 2.03). There was significant heterogeneity between studies for perinatal mortality (P=0.04, I2=36%).
The rate of neonates with birth weight <2500 g was significantly higher for women treated with cold knife conisation (five studies, 30 304, relative risk 2.51, 95% confidence interval 1.78 to 3.53), large loop excision of the transformation zone (12 studies, 3357, 2.11, 1.51 to 2.94), excisional (10 studies, 823 648, 2.01, 1.62 to 2.49) or ablative (four studies, 483 402, 1.36, 1.19 to 1.55) treatment not otherwise specified but not so for laser ablation (1.07, 0.59 to 1.92), although for that comparison there were only four studies with a total of 1104 participants. The rate of admission to neonatal intensive care was assessed only for excisional techniques and was significantly increased after large loop excision of the transformation zone (five studies, 1994 women, 1.42, 1.01 to 1.99). Perinatal mortality was significantly increased overall and for excisional technique not otherwise specified (five studies, 820 028, 1.85, 1.02 to 3.36) but not for the individual techniques, possibly because of the limited number of studies and the low prevalence of the outcome. Subgroup analysis according to the different comparison groups or cone depths was not possible because of the limited number of studies assessing each outcome.
The knowledge that local treatment for cervical precancer, particularly excisional, increases the risk of preterm birth has led to major changes in clinical practice. With a rapidly evolving evidence base and inconsistencies in the published literature,14 15 17 18 66 113 a high quality synthesis of the evidence should be available for effective counselling of patients at colposcopy and antenatal clinics.
This meta-analysis shows that any local cervical treatment for preinvasive or early invasive disease increases the risk of preterm birth and adverse sequelae in a subsequent pregnancy, although the impact of small excisions, as opposed to just having the disease, remains uncertain and is likely to be small. Cervical treatment was found to be associated with an increased risk of overall, severe, and extreme prematurity, spontaneous preterm birth, threatened preterm labour, premature rupture of the membranes, chorioamnionitis, low birth weight, neonatal admission, and perinatal death. The rate of cervical cerclage was unsurprisingly substantially increased in treated women compared with untreated controls. Treatment equally affected outcomes for nulliparous as well as parous women and singleton and multiple pregnancies. The mode of delivery, length of labour, induction rate, use of analgesia, rate of stenosis, and haemorrhage were not significantly affected.
The magnitude of the effect of treatment was higher for more radical techniques (such as cold knife conisation followed by large loop excision of the transformation zone and laser ablation) and for excision rather than ablation. Multiple conisations increased the risk of preterm birth fourfold compared with untreated controls overall. Subgroup analyses clearly showed that the risk of preterm birth is directly correlated with the cone dimensions (depth/volume) and progressively increases with increasing cone depth (“dose effect”). Although the risk was increased even for excisions less than 10 mm in depth, this was almost twofold for excisions of more than 10 mm, threefold for more than 15-17 mm, and almost fivefold for excisions exceeding 20 mm in depth.
It has previously been suggested that the impact of treatment on the risk of preterm birth might not be a consequence of treatment but rather a product of other confounders present in women with cervical disease.7 14 15 Our subgroup analyses that stratified the risk by the comparator used, clearly documents that although the risk of preterm birth is significantly increased after treatment irrespective of the comparison group used, the choice of comparator might overinflate or underestimate the effect from treatment. The magnitude of effect was higher when we used external controls, followed by internal controls, followed by women who had disease but were not treated. The analyses in women with high grade squamous intraepithelial lesion but no treatment only included three studies and 3764 participants; we were unable to draw any firm conclusions from this comparison. When we assessed the risk of preterm birth according to both the cone depth and comparator, we noted overall the same direction of effect. Although the difference in the risk of preterm birth for small excisions (≤10-12 mm) compared with just having CIN but no treatment, became insignificant, the number of studies assessing that comparison was small, and we cannot draw firm conclusions.
Our results also confirm that although women with CIN have a higher baseline risk of prematurity than the general population, cervical treatment, and particularly deep cones, further increase that risk.
Strengths and limitations
This is the first systematic review to show that any local cervical treatment technique (excisional or destructive) is associated with an increased risk of preterm birth and adverse obstetric sequelae and to document that the risk directly correlates to the cone depth (and volume), the treatment technique (excision more than ablation), and radicality. This meta-analysis included a large number of studies (71 cohorts) with sufficient sample size and power to explore several comparisons of treatment techniques and cone depths. Furthermore, we were able to perform subgroup analyses according to the comparator used and quantify the risk in different clinical groups.
The results, however, should be interpreted with caution. Because of the premalignant nature of the disease, no randomised studies could be identified. All the included studies were cohorts, nearly all retrospective. Such reports are at known risk of recall bias and inadequate adjustment for known and unknown confounders, while some of the outcomes of interest were difficult to measure objectively. Many of the studies relied on data collected from structured interviews and mailed questionnaires, and in some of these the response rate was small, also increasing the risk of incomplete outcome data (attrition) and misclassification bias. The studies often had different designs and used comparisons between and among women and mixed matching. Although the overall number of studies was large, for some outcomes and comparisons there were few studies, and the analyses did not have sufficient sample sizes to support definite conclusions.
Although heterogeneity between studies was not significant for most of the analyses, some subgroup analyses did show variation in the outcomes across studies. This was often in analyses that included small number of studies and participants. Meta-regression was possible for some but not all possible confounders. For many moderators, data were reported only in a proportion of the included studies. When these studies were not deemed representative of the whole population of studies, we did not perform meta-regression as this would introduce bias. Sensitivity and subgroups analyses based on study quality did not change the effect of the meta-analysis.
There were further limitations in the interpretation of the data. The gestational age cut offs used for the definitions of severe and extreme prematurity and for different cone depths varied slightly across studies; we merged these in broader groups for the analysis. Individual patient meta-analysis data are required to more accurately describe the stratified risk of preterm birth for individual cone depths. The data on cone dimensions relied on retrospective data recorded in histopathology reports of formalin fixed samples, with obvious limitations. The formulas used for the calculation of volume also varied across studies. Future research should aim to correlate outcomes with precise prospective cone depth and cervical measurements.
Both the included and excluded studies showed a wide range of inclusion/exclusion criteria and outcome measures limiting statistical pooling of all the primary studies. There should be agreement among colposcopists and obstetricians on core research clinical outcome measures in line with the CROWN initiative of the premier reproductive health journals.134 This would improve the applicability of findings of primary and secondary research internationally.
Interpretation in light of other evidence
With an increasing evidence base suggesting that this risk is higher for more radical techniques, there has been a tendency to use less aggressive treatments.5 Although it was previously thought that the various techniques had comparable efficacy,135 evidence from a population based study raised concerns that less radical treatment could increase the risk of invasion after treatment.136 137 Although the decreased number of hysterectomies could explain this increase, the move to less radical local conservative treatments is another plausible explanation. Additionally, since the first documentation of the reproductive risk associated with treatment almost a decade ago,1 subsequent observational studies and even meta-analyses reached contradictory conclusions2 3 4 16 17 18 19 and initiated debates within the scientific community. With some authors raising concerns that the progressive reduction in the radicality of treatment has led to increased risk of future of invasion,136 137 and others advocating the move to less radical techniques like laser ablation for the prevention of future perinatal morbidity and mortality associated with treatment,138 high quality synthesis of the evidence had become an urgent unmet need. Some of the previous small meta-analyses had methodological flaws and attempted analysis of individual treatment techniques or subgroups, thereby minimising the validity of their findings in context with the rest of the literature.16 17 18 All the published meta-analyses failed to analyse the data according to major confounders and stratifiers of risk, the comparison group, and the depth of the excision. Although Bruinsma and Quinn first approached the comparison group as a possible confounder, data on the depth and dimensions of the treatment were not available.4
Preterm birth is a major cause of neonatal death and disability and represents an enormous cost to health services and society. While pregnant, these women with a history of cervical treatment make up a large proportion of referrals to specialist preterm labour prevention clinics. These referrals have increased from almost none in 1999, to more than 40% in 2012.139 Ultrasound directed surveillance is labour intensive, costly, and can be associated with maternal anxiety, more so because 85% of women after excision are effectively low risk and will deliver at term.1 4
With rapidly accumulating evidence correlating cervical treatment to adverse reproductive morbidity, quantification of the comparative obstetric morbidity for different treatment techniques and cone depths is required to assist clinicians’ decision making and counselling. The results of our meta-analysis will allow clinicians, patients, and policy makers to balance the absolute increase in reproductive morbidity with increasing treatment radicality. Patients should be informed that treatment increases the risk of preterm birth compared with having CIN only, but the absolute increase in risk in small type 1 excisions is likely to be low, if any.
Furthermore, the quantified individual risk stratified by treatment and cone depth could allow obstetricians to select women considered to be at high risk of preterm birth who would benefit from intensive surveillance antenatally and minimise the unnecessary interventions for those at low risk. The antenatal management of women after treatment has been inconsistent and largely unit or clinician dependent.30 The risks and benefits associated with various interventions in pregnant women with a history of cervical treatment have not been fully assessed in properly designed studies.140 Future research should assess their value in this distinct clinical group and devise a logical prevention strategy.
Women with CIN have a higher baseline risk of preterm birth than women from the general population. Local cervical treatment for preinvasive or early invasive disease further increases the risk, more so for excisional but also for ablative techniques. The risk of preterm birth increases with increasing cone depth (and volume) and techniques that remove or destroy larger parts of the cervix. The increase in risk for small excisions compared with having CIN is likely to be small, if any; more data are required.
In the decision to treat women of reproductive age, every effort should be made to perform a local treatment that will optimise the chances of a healthy pregnancy without compromising the completeness of the local treatment. Quality assurance in treatment of disease should include audit of dimensions of excisional specimens and persistent disease rates to ensure that treatment depth is kept to acceptable parameters (that is, at least 8 mm to include the crypts) and that oncological outcomes are not compromised.
Future research should investigate whether women who have preinvasive cervical disease are susceptible to both the disease and preterm birth, or whether HPV induced disease alone is the principal factor in increasing premature delivery. It is likely that a combination of immunological and other factors play a role. The uptake of prophylactic vaccination has been mixed in the developed world and minimal in low income countries. The impact of cervical treatment will continue to be relevant for many decades, and therefore robust clinical research in this field should remain a priority.
What is already known on this topic
Local cervical treatment has been associated with an increased risk of preterm birth, perinatal morbidity, and mortality in a subsequent pregnancy, which could be associated with depth of excision.
Discrepancies exist regarding the impact of treatment on the risk of subsequent preterm birth and whether CIN acts as a confounder, which might be caused by heterogeneity in comparison groups used in previous studies or different excision depths and/or treatment techniques that have been analysed
What this study adds
Increased risk of adverse obstetric outcomes is associated with the treatment technique (excision more than ablation) and radicality, determined by the depth and dimensions of the cone
Although the risk of preterm birth is higher after local treatment for CIN irrespective of the cone depth, the risk increases with increasing cone depth. The increase in risk in small excisions compared with just having CIN remains uncertain and is likely to be small, if any; more data are required
Choice of comparison group might overinflate or underestimate the effect from treatment because of the background increased risk of preterm birth in women with CIN. The increased risk of preterm birth, however, remains significantly increased after treatment, despite the chosen comparator and even in comparisons with women with CIN but no treatment
Contributors: The study was conceived and designed by MK, MA, PB, and EP. MK, AA, and MP acquired the data, which were collated and analysed by MK, AA, IK, and MA. All authors drafted and critically revised the manuscript for important intellectual content. MA, PB, and EP are joint senior authors. All authors gave final approval of the version to be published and have contributed to the manuscript. MK is guarantor.
Funding: This work was supported by the British Society of Colposcopy Cervical Pathology Jordan/Singer Award (P47773) (MK); the Imperial College Healthcare Charity (P47907) (AM and MK); Genesis Research Trust (P55549) (MK); Sigrid Jusélius Fellowship (P52483 to IK and MK), and the Imperial Healthcare NHS Trust NIHR Biomedical Research Centre (P45272) (MK, PRB). MA received support from the COHEAHR Network (grant No 603019), funded by Seventh Framework Programme of DG Research of the European Commission (Brussels); Institut National du Cancer (Paris) through the COSPCC study (Conséquences obstétricales du (sur)traitement des précurseurs du cancer du col); the European Federation of Colposcopy (Birmingham); and the Joint Action CANCON. None of the funders has any influence on the study design; in the collection, analysis, and interpretation of data; in the writing of the report; and in the decision to submit the article for publication.
Competing interests: All authors have completed the ICMJE uniform disclosure form at www.icmje.org/coi_disclosure.pdf and declare: no support from any organisation for the submitted work; no financial relationships with any organisations that might have an interest in the submitted work in the previous three years; no other relationships or activities that could appear to have influenced the submitted work.
Ethical approval: Not required.
Data sharing: No additional data available.
Transparency: The lead author affirms that the manuscript is an honest, accurate, and transparent account of the study being reported; that no important aspects of the study have been omitted; and that any discrepancies from the study as planned have been explained.
This is an Open Access article distributed in accordance with the terms of the Creative Commons Attribution (CC BY 3.0) license, which permits others to distribute, remix, adapt and build upon this work, for commercial use, provided the original work is properly cited. See: http://creativecommons.org/licenses/by/3.0/.