CC BY-NC Open access
Research

Evaluating the risk of ovarian cancer before surgery using the ADNEX model to differentiate between benign, borderline, early and advanced stage invasive, and secondary metastatic tumours: prospective multicentre diagnostic study

BMJ 2014; 349 doi: http://dx.doi.org/10.1136/bmj.g5920 (Published 15 October 2014) Cite this as: BMJ 2014;349:g5920
  1. Ben Van Calster, professor (1),
  2. Kirsten Van Hoorde, doctoral researcher (2, 3),
  3. Lil Valentin, professor (4),
  4. Antonia C Testa, professor (5),
  5. Daniela Fischerova, consultant gynaecologist (6),
  6. Caroline Van Holsbeke, consultant gynaecologist (7),
  7. Luca Savelli, consultant gynaecologist (8),
  8. Dorella Franchi, consultant gynaecologist (9),
  9. Elisabeth Epstein, professor (10),
  10. Jeroen Kaijser, research fellow (1, 11),
  11. Vanya Van Belle, postdoctoral researcher (2, 3),
  12. Artur Czekierdowski, professor (12),
  13. Stefano Guerriero, professor (13),
  14. Robert Fruscio, consultant gynaecologist (14),
  15. Chiara Lanzani, consultant gynaecologist (15),
  16. Felice Scala, consultant gynaecologist (16),
  17. Tom Bourne, professor (1, 11, 17),
  18. Dirk Timmerman, professor (1, 11)
  19. International Ovarian Tumour Analysis (IOTA) group
  1. Department of Development and Regeneration, KU Leuven, Herestraat 49 box 7003, 3000 Leuven, Belgium
  2. Department of Electrical Engineering, KU Leuven, Leuven, Belgium
  3. iMinds Medical Information Technologies, KU Leuven, Leuven, Belgium
  4. Department of Obstetrics and Gynaecology, Skåne University Hospital Malmö, Lund University, Malmö, Sweden
  5. Department of Oncology, Catholic University of the Sacred Heart, Rome, Italy
  6. Gynaecological Oncology Center, Department of Obstetrics and Gynaecology, Charles University, Prague, Czech Republic
  7. Department of Obstetrics and Gynaecology, Ziekenhuis Oost Limburg, Genk, Belgium
  8. Gynaecology and Reproductive Medicine Unit, S Orsola-Malpighi Hospital, University of Bologna, Bologna, Italy
  9. Preventive Gynaecology Unit, Division of Gynaecology, European Institute of Oncology, Milan, Italy
  10. Department of Obstetrics and Gynaecology, Karolinska University Hospital, Stockholm, Sweden
  11. Department of Obstetrics and Gynaecology, University Hospitals Leuven, Leuven, Belgium
  12. 1st Department of Gynaecological Oncology and Gynaecology, Medical University in Lublin, Lublin, Poland
  13. Department of Obstetrics and Gynaecology, Azienda Ospedaliero Universitaria di Cagliari, Cagliari, Italy
  14. Clinic of Obstetrics and Gynaecology, University of Milan-Bicocca, San Gerardo Hospital, Monza, Italy
  15. Department of Woman, Mother and Neonate, Buzzi Children’s Hospital, Biological and Clinical School of Medicine, University of Milan, Milan, Italy
  16. Department of Gynaecologic Oncology, Istituto Nazionale Tumori, Naples, Italy
  17. Queen Charlotte’s and Chelsea Hospital, Imperial College, London, UK
  Correspondence to: B Van Calster ben.vancalster{at}med.kuleuven.be
  • Accepted 5 September 2014

Abstract

Objectives To develop a risk prediction model to preoperatively discriminate between benign, borderline, stage I invasive, stage II-IV invasive, and secondary metastatic ovarian tumours.

Design Observational diagnostic study using prospectively collected clinical and ultrasound data.

Setting 24 ultrasound centres in 10 countries.

Participants Women with an ovarian (including para-ovarian and tubal) mass and who underwent a standardised ultrasound examination before surgery. The model was developed on 3506 patients recruited between 1999 and 2007, temporally validated on 2403 patients recruited between 2009 and 2012, and then updated on all 5909 patients.

Main outcome measures Histological classification and surgical staging of the mass.

Results The Assessment of Different NEoplasias in the adneXa (ADNEX) model contains three clinical and six ultrasound predictors: age, serum CA-125 level, type of centre (oncology centres v other hospitals), maximum diameter of lesion, proportion of solid tissue, more than 10 cyst locules, number of papillary projections, acoustic shadows, and ascites. The area under the receiver operating characteristic curve (AUC) for the classic discrimination between benign and malignant tumours was 0.94 (0.93 to 0.95) on temporal validation. The AUC was 0.85 for benign versus borderline, 0.92 for benign versus stage I cancer, 0.99 for benign versus stage II-IV cancer, and 0.95 for benign versus secondary metastatic. AUCs between malignant subtypes varied between 0.71 and 0.95, with an AUC of 0.75 for borderline versus stage I cancer and 0.82 for stage II-IV versus secondary metastatic. Calibration curves showed that the estimated risks were accurate.

Conclusions The ADNEX model discriminates well between benign and malignant tumours and offers fair to excellent discrimination between four types of ovarian malignancy. The use of ADNEX has the potential to improve triage and management decisions and so reduce morbidity and mortality associated with adnexal pathology.

Introduction

Ovarian cancer is the most aggressive gynaecological malignancy. The five year survival rate of patients is around 40% and the disease accounts for approximately half of all deaths related to gynaecological cancer.1 2 The most important factor for survival is stage at diagnosis.3 Therefore attempts have been made to develop a screening method, which by detecting ovarian cancer at an early stage has the potential to decrease deaths from ovarian cancer. No such screening method is currently available.4 5 However, we are still awaiting the results of the United Kingdom Collaborative Trial on Ovarian Cancer Screening.6

An important factor that influences prognosis other than stage at diagnosis is referral to a gynaecology oncology centre for further diagnosis or staging, debulking surgery, and evaluation by an interdisciplinary tumour board.7 8 9 10 Although such centralised care is recommended because it results in improved prognosis, a large proportion of women with ovarian cancer remain treated by general surgeons,11 12 13 possibly because the true nature of the disease is unknown before surgery. Optimal treatment of ovarian malignancies depends on the type of tumour. Treatment of borderline tumours can be less aggressive than treatment of invasive tumours, especially if the preservation of fertility is important.14 In selected cases, stage I ovarian cancer may be managed more conservatively than late stage disease, whereas for cancers metastasised to the ovary management depends on the origin of the primary tumour.15 An accurate specific diagnosis of adnexal tumours before surgery will almost certainly improve the triage of patients and so increase the likelihood that patients will receive appropriate treatment.

Recently, the International Ovarian Tumour Analysis (IOTA) group showed that polytomous risk prediction for the diagnosis of ovarian cancer is feasible.16 Mathematical models were developed to predict four tumour categories: benign, borderline, primary ovarian cancer, and secondary metastatic cancer. This work focused on comparing mathematical algorithms. From a clinical point of view it was preliminary for several reasons. Firstly, the model was built using information from only 754 patients, including 40 borderline tumours, 121 primary invasive cancers, and 30 secondary metastatic cancers. Secondly, although more than 30 clinical and ultrasound candidate predictors were statistically evaluated, the tumour marker serum CA-125 was not considered. We have shown that serum CA-125 may not be needed in models with a binary outcome (benign v malignant),17 but CA-125 is likely to be important for distinguishing between different types of malignant tumour.18 Thirdly, the models did not distinguish between stage I and stage II-IV primary cancer, which is clinically important.19

We developed a polytomous risk prediction model that can reliably distinguish between benign, borderline, stage I invasive, stage II-IV invasive, and secondary metastatic adnexal tumours.

Methods

Design and setting

We carried out an international multicentre prospective cohort study of women with at least one adnexal mass that required surgery, as judged by a clinician. The IOTA study group collected data between 1999 and 2012. IOTA was established to develop and validate diagnostic models for adnexal masses based on large multicentre datasets using a standardised ultrasound examination protocol, terms, and definitions.20 21 22 23 24 25 26 Patients were recruited from 24 centres in 10 countries. Twelve centres were labelled oncology centres, that is, tertiary referral centres with a specific gynaecology oncology unit. The remaining centres included general hospitals and gynaecology ultrasound units not linked to an oncology centre. Data collection was carried out in phases: phase 1 between 1999 and 2002, phase 1b between 2002 and 2005, phase 2 between 2005 and 2007, and phase 3 between 2009 and 2012.21 22 23 24

Patients

Patients referred to one of the participating centres for an ultrasound examination because of a known or suspected adnexal mass were eligible for inclusion. We included consecutive patients with at least one adnexal mass judged not to be a physiological cyst, who were examined with transvaginal ultrasound by a principal investigator and later selected for surgical intervention. The decision to operate was made by the managing clinician on the basis of the full clinical picture, including the ultrasound report, the latter being based on the ultrasound examiner’s subjective assessment of the ultrasound image. Following the requirements of the local ethics committees, we obtained oral or written informed consent from the women before their ultrasound scan and surgery. Exclusion criteria were refusal of transvaginal ultrasonography, pregnancy at the time of presentation, and surgical removal of the mass more than 120 days after the ultrasound examination. If more than one mass was detected, we used the mass with the most complex morphology on the ultrasound scan. When we observed masses with similar morphology, we used the largest or the one most easily accessible by ultrasound.21 22 23

Data collection and reference standard

To collect clinical information we took a standardised history from each patient. All patients underwent a standardised transvaginal ultrasound examination.20 Transabdominal sonography was added for women with large masses that could not be visualised in full by a transvaginal probe. We collected gray scale and Doppler ultrasound information in line with the research protocols. More information can be found in previous reports.21 22 23 Participating centres were encouraged to measure serum CA-125. We used second generation immunoradiometric assay kits for CA-125 II from Roche Diagnostics, Centocor, Cis-Bio, Abbott Laboratories, Bayer Diagnostics, bioMérieux, DiaSorin, Siemens, and Beckman Coulter. All kits used the OC125 antibody.

The reference standard was the histopathological diagnosis of the mass after surgical removal by laparotomy or laparoscopy as considered appropriate by the surgeon, and the stage of malignant tumours using the classification of the International Federation of Gynecology and Obstetrics (FIGO).27 The excised tissues underwent histological examination at the local centre. Histological classification was performed without knowledge of the ultrasound results. The final diagnosis was divided into five tumour types: benign, borderline, stage I invasive, stage II-IV invasive, and secondary metastatic cancer.

Data were entered through dedicated and secure data collection systems, web based for phase 1, and through a local study screen (Astraia software, Munich, Germany) for later phases.21 22 23 To ensure data integrity, several clinicians and statisticians used built-in automatic checks and manual review and cleaning of data.

Statistical analysis

We developed a prediction model using data from the women included in IOTA phases 1, 1b, and 2 (n=3506) and validated the model on data from the women included in phase 3 (n=2403).

The serum CA-125 tumour marker was not a mandatory variable, and measurements were missing in 31% of the patients. As described in detail in supplementary appendix A, we used multiple imputation to deal with missing values for CA-125.28 We created 100 imputations, resulting in 100 completed datasets.

We selected variables in two stages (see supplementary appendix B for details). Firstly, to avoid over-fitting we reduced the number of potential predictors to 10 based on subject matter knowledge29 30 and the stability of the predictors over centres.31 We selected four clinical variables—age (years), serum CA-125 level (U/mL), family history of ovarian cancer (yes/no), and type of centre (oncology centre v other hospitals), and six ultrasound variables—the maximum diameter of the lesion (mm), proportion of solid tissue (that is, the maximum diameter of the largest solid component divided by the maximum diameter of the lesion), presence of more than 10 cyst locules (yes/no), number of papillary projections (0, 1, 2, 3, >3), presence of acoustic shadows (yes/no), and presence of ascites (yes/no). Oncology centres were defined as tertiary referral centres with a specific gynaecology oncology unit. We included the variable “type of centre” because the risk of a malignant tumour is likely to be higher in oncology centres than in other centres, even after adjustment for the characteristics of patients and tumours. Secondly, we carried out further data driven selection using a method based on multivariable fractional polynomials.32 This method simultaneously selects variables and determines the optimal transformation of numerical variables using fractional polynomials. We forced age and type of centre into the model by default.

To acknowledge variability between centres we used multinomial logistic regression with random centre intercepts to construct the polytomous model.33 We multiplied the predictor coefficients by uniform “shrinkage factors” to avoid exaggerated model coefficients (see supplementary appendix C for details).30 34 We trained the model on each of the 100 completed datasets following multiple imputation. Probabilities were derived by averaging linear predictors (without the random effects) and odds ratios by averaging model coefficients.
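The step from fitted coefficients to risks can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the SAS implementation used in the study: the function names are hypothetical, treating benign as the reference category is an assumption, and so is applying the shrinkage factor to the predictor coefficients but not to the intercept.

```python
import math
from typing import Dict, List, Sequence

# Treating "benign" as the reference category is an assumption of this sketch.
NON_REFERENCE = ["borderline", "stage I", "stage II-IV", "metastatic"]


def shrunken_linear_predictor(intercept: float, coefs: Sequence[float],
                              x: Sequence[float], shrinkage: float) -> float:
    """Linear predictor with one uniform shrinkage factor multiplying the
    predictor coefficients (leaving the intercept unshrunk is an assumption)."""
    return intercept + shrinkage * sum(b * v for b, v in zip(coefs, x))


def risks_from_imputations(lps: List[Dict[str, float]]) -> Dict[str, float]:
    """Average each category's linear predictor over the fits on the imputed
    datasets (random centre effects left out), then convert to probabilities."""
    mean_lp = {c: sum(lp[c] for lp in lps) / len(lps) for c in NON_REFERENCE}
    denom = 1.0 + sum(math.exp(v) for v in mean_lp.values())
    risks = {"benign": 1.0 / denom}
    risks.update({c: math.exp(v) / denom for c, v in mean_lp.items()})
    return risks
```

Averaging on the linear predictor scale before transforming, as the text describes, gives a single set of five risks that sum to 1 for each patient.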

We evaluated the model for discrimination and calibration performance.35 To assess discrimination we first obtained the area under the receiver operating characteristic curve (AUC) for the basic discrimination between benign and malignant tumours. We calculated sensitivity and specificity for the cut-offs 3%, 5%, 10%, and 15% total risk of malignancy (that is, the sum of the estimated risks of the four malignant subtypes). We then also computed AUCs for each pair of tumour types using the conditional risk method.36 For the five tumour types, there are 10 pairwise AUCs. Finally, we calculated the polytomous discrimination index, a polytomous version of the AUC.37 This index estimates the average proportion of patients who are correctly identified by the model when presented with five patients, one with each tumour type. For five groups, the polytomous discrimination index ranges between 0.20 (worthless) and 1 (perfect). A discrimination plot was used to visualise discrimination performance.36
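The discrimination measures described above can be sketched in Python. This is an illustrative reading, not the study code: the names are hypothetical, classifying a patient as test positive when the total risk is at or above the cut-off is an assumption, and the pairwise AUC follows one common reading of the conditional risk method, in which patients of two tumour types are ranked by the risk of one type divided by the sum of the risks of both types.

```python
from itertools import combinations
from typing import Dict, Sequence, Tuple

TYPES = ["benign", "borderline", "stage I", "stage II-IV", "metastatic"]
Patient = Tuple[str, Dict[str, float]]  # (confirmed tumour type, predicted risks)


def total_malignancy_risk(risks: Dict[str, float]) -> float:
    """Sum of the estimated risks of the four malignant subtypes."""
    return sum(p for t, p in risks.items() if t != "benign")


def sensitivity_specificity(patients: Sequence[Patient],
                            cutoff: float) -> Tuple[float, float]:
    """Test positive means total risk >= cutoff (the >= is an assumption)."""
    flags = [(t != "benign", total_malignancy_risk(r) >= cutoff)
             for t, r in patients]
    tp = sum(1 for mal, pos in flags if mal and pos)
    fn = sum(1 for mal, pos in flags if mal and not pos)
    tn = sum(1 for mal, pos in flags if not mal and not pos)
    fp = sum(1 for mal, pos in flags if not mal and pos)
    return tp / (tp + fn), tn / (tn + fp)


def auc(pos: Sequence[float], neg: Sequence[float]) -> float:
    """Proportion of (pos, neg) pairs ranked correctly; ties count half."""
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def pairwise_auc(patients: Sequence[Patient], type_a: str, type_b: str) -> float:
    """AUC for type_a versus type_b using the conditional risk of type_a."""
    def cond(r: Dict[str, float]) -> float:
        return r[type_a] / (r[type_a] + r[type_b])
    pos = [cond(r) for t, r in patients if t == type_a]
    neg = [cond(r) for t, r in patients if t == type_b]
    return auc(pos, neg)


PAIRS = list(combinations(TYPES, 2))  # the 10 pairs of tumour types
```

With five tumour types, `PAIRS` has 10 entries, matching the 10 pairwise AUCs mentioned in the text.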

To assess calibration of the predicted probabilities we produced calibration plots showing the relation between predicted and observed probabilities for each type of tumour. The plots were based on a parametric multinomial logistic recalibration analysis,38 using random centre intercepts. We used the probabilistic results of this analysis, including the random effects, as observed probabilities, which were plotted against the predicted probabilities.

Because model validation was successful, we updated the model on the pooled data (n=5909) to make full use of all available information. Predicted probabilities based on this model can then be compared with baseline probabilities for each type of tumour. The baseline probabilities were estimated through a random intercepts multinomial logistic regression model containing only intercept terms. All analyses were performed with SAS 9.3 (SAS Institute, Cary, USA).

Results

In total, data on 6169 patients were recorded in the databases for phases 1, 1b, 2, and 3. We excluded 255 patients (4.1%): 163 (2.6%) based on exclusion criteria (51 pregnant women and 112 women who underwent surgery >120 days after the ultrasound examination), 91 (1.5%) because of data errors or uncertain or missing final histology, and one due to protocol violation. Based on logistic regression influence diagnostics39 and further data review of the archived datasets, we omitted five additional cases. Thus data on 5909 women were used. Table 1 gives an overview of participating centres, included patients, and the reference standard; supplementary table S1 gives the histological diagnoses and FIGO stages; and supplementary table S2 gives the personal and reproductive characteristics of the patients. The observed rate of malignancy varied between 22% and 66% in oncology centres and between 0% and 30% in other hospitals.
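The patient flow reported above can be checked with a few lines of Python. The variable names are illustrative; all counts are taken from the text.

```python
recorded = 6169

excluded_by_criteria = 51 + 112   # pregnant; surgery >120 days after the scan
excluded_data_problems = 91       # data errors, uncertain or missing histology
excluded_protocol = 1             # protocol violation
excluded = excluded_by_criteria + excluded_data_problems + excluded_protocol

omitted_after_diagnostics = 5     # influence diagnostics and data review
analysed = recorded - excluded - omitted_after_diagnostics


def pct(n: int) -> float:
    """Percentage of all recorded patients, rounded to one decimal place."""
    return round(100 * n / recorded, 1)


print(excluded, pct(excluded))    # 255 4.1
print(analysed)                   # 5909
print(3506 + 2403 == analysed)    # development + validation sets: True
```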

Table 1

 Number of patients in each centre, and type of centre

Model development, temporal validation, and updating

We included nine variables in the Assessment of Different NEoplasias in the adneXa (ADNEX) model: age, serum CA-125 level (log transformed), type of centre, maximum diameter of the lesion (log transformed), proportion of solid tissue (with quadratic term), number of papillary projections, more than 10 cyst locules, acoustic shadows, and ascites. Family history of ovarian cancer was dropped by the variable selection analysis. Table 2 shows descriptive statistics for the 10 variables selected a priori. The AUC of the ADNEX model for the basic discrimination between benign and malignant tumours was 0.954 (95% confidence interval 0.947 to 0.961) on the development data and 0.943 (0.934 to 0.952) on the validation data (table 3). The discrimination between benign and malignant was consistent over centres (see supplementary figure S1). Using a cut-off of 10% to predict malignancy, the sensitivity was 96.5% and specificity 71.3% on the validation data (table 3). The validation AUC was 0.85 for benign tumours compared with borderline tumours, 0.92 for benign tumours compared with stage I cancer, 0.99 for benign tumours compared with stage II-IV cancer, and 0.95 for benign tumours compared with secondary metastatic cancer (table 4). Validation AUCs between malignant subtypes varied between 0.71 and 0.95. The model showed fair discrimination between stage I cancer and borderline tumours (validation AUC 0.75) and between stage I cancer and secondary metastatic cancer (validation AUC 0.71). It distinguished stage II-IV cancer well from other malignancies (the AUC for stage II-IV cancer versus borderline tumours was 0.95, versus stage I cancer 0.87, and versus secondary metastatic cancer 0.82). The polytomous discrimination index was 0.56 (0.54 to 0.59) on the validation data. Supplementary table S3 presents separate results for oncology centres and other hospitals.
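The nine predictors and their transformations can be pictured as a small data structure. The sketch below is hypothetical: the class, field, and function names are invented for illustration, the base of the logarithm is not stated in the text (base 2 is assumed here and can be swapped), and coding the number of papillary projections as 0 to 4, with 4 standing for "more than 3", is also an assumption.

```python
import math
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class AdnexInputs:
    age_years: float
    ca125_u_per_ml: float
    oncology_centre: bool
    max_lesion_diameter_mm: float
    max_solid_diameter_mm: float
    more_than_10_locules: bool
    papillary_projections: int   # 0, 1, 2, 3, or 4 for "more than 3" (assumed coding)
    acoustic_shadows: bool
    ascites: bool


def design_row(p: AdnexInputs,
               log: Callable[[float], float] = math.log2) -> Dict[str, float]:
    """Transformed predictors: log CA-125, log diameter, and the proportion
    of solid tissue with its quadratic term. The log base is an assumption."""
    solid = p.max_solid_diameter_mm / p.max_lesion_diameter_mm
    return {
        "age": p.age_years,
        "log_ca125": log(p.ca125_u_per_ml),
        "oncology_centre": float(p.oncology_centre),
        "log_max_diameter": log(p.max_lesion_diameter_mm),
        "solid_proportion": solid,
        "solid_proportion_sq": solid ** 2,
        "more_than_10_locules": float(p.more_than_10_locules),
        "papillary_projections": float(p.papillary_projections),
        "acoustic_shadows": float(p.acoustic_shadows),
        "ascites": float(p.ascites),
    }
```

The proportion of solid tissue is computed as defined in the methods: the maximum diameter of the largest solid component divided by the maximum diameter of the lesion.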

Table 2

 Descriptive statistics of the a priori considered predictors by tumour type in pooled dataset (n=5909). Values are numbers (percentages) unless stated otherwise

Table 3

 Diagnostic performance of ADNEX model when using different thresholds for total probability of malignancy (sum of probabilities of four subtypes of ovarian malignancy)

Table 4

 Polytomous discrimination performance of ADNEX model on development data, validation data, and after updating on pooled data

The calibration plots for all five tumour types showed acceptable calibration of the estimated risks (fig 1). High risks for secondary metastatic cancer were overestimated, but such high risks were uncommon. Calibration plots for oncology centres and other hospitals were similar (see supplementary figures S2 and S3).

Fig 1 Calibration plots of predicted probabilities for each type of tumour. Data have been calculated using validation data (n=2403). Plots show how well the predicted probabilities (x axis) agree with observed probabilities (y axis). For perfect agreement, the calibration curve falls on the ideal diagonal line. Histograms below plots show distribution of predicted probabilities

Tables 3 and 4 and supplementary table S3 show the discrimination performance of the ADNEX model after it was updated on the pooled data. The discrimination plot shows that the predicted probability of a specific tumour type is highest for patients with a matching reference standard (fig 2)—for example, patients with histologically confirmed borderline tumours had the highest probabilities of a borderline malignancy. The ADNEX model formula is given in supplementary appendix D. The effects of the predictors are presented as odds ratios in table 5. Proportion of solid tissue and serum CA-125 level had the strongest independent relations with the outcome, as judged by the test statistic for the model coefficients (not shown). Type of centre was the weakest predictor, indicating that most of the differences in malignancy rates were captured by the other predictors.

Fig 2 Discrimination plot of ADNEX model after it was updated on pooled dataset (n=5909). For each predicted tumour type, box plots of probabilities are presented for each confirmed tumour type (reference standard). Red vertical lines show baseline probabilities for each type of tumour. For example, the baseline probability of a benign tumour is 0.681; for most women with a benign tumour the predicted probability of a benign tumour was higher than 0.9, whereas most women with an ovarian malignancy (most notably stage II-IV cancer) had clearly lower predicted probabilities of a benign tumour

Table 5

 Odds ratios for predictors in ADNEX model after it was updated on pooled dataset (n=5909)

Deriving a similar model without CA-125 level as a predictor mainly affected discrimination between stage II-IV cancer and other malignancies (see supplementary table S4): validation AUCs decreased from 0.82 to 0.59 (stage II-IV cancer v metastatic cancer), from 0.87 to 0.76 (stage II-IV cancer v stage I cancer), and from 0.95 to 0.91 (stage II-IV cancer v borderline tumours).

Implementation of ADNEX and illustrative example

The final ADNEX model is available online and in mobile applications (www.iotagroup.org/adnexmodel/). The applications allow risk calculation even without information on serum CA-125 level, despite the decrease in performance. As an example, we assess a 55 year old woman at a centre for gynaecological oncology. Her serum CA-125 level is 42 U/mL. Ultrasound examination reveals an adnexal mass with more than 10 cyst locules, no papillary projections, no acoustic shadows, ascites, a maximum lesion diameter of 120 mm, and a maximum diameter of the largest solid component of 20 mm (that is, proportion of solid tissue is 20/120). The ADNEX model gives the following probabilities: 37.4% for borderline tumour, 10.8% for stage I cancer, 8.4% for stage II-IV cancer, and 11.0% for secondary metastatic cancer. The total risk of malignancy is 37.4+10.8+8.4+11.0=67.6%. The tumour is most likely to be a borderline tumour as opposed to any other type of malignancy. If the CA-125 level was unavailable, predicted probabilities would be 25.2% (borderline), 8.3% (stage I), 35.8% (stage II-IV), and 11.5% (metastatic). Baseline probabilities for each type of tumour are 6.3% for borderline tumour, 7.5% for stage I, 14.1% for stage II-IV, and 4.0% for metastatic cancer.
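The arithmetic of this example can be reproduced in a few lines of Python. The probabilities are those quoted above; the variable names are illustrative, and dividing each predicted risk by its baseline is only one possible way to make the comparison with baseline probabilities, assumed here for illustration.

```python
# Predicted and baseline probabilities (%) quoted in the example above.
predicted = {"borderline": 37.4, "stage I": 10.8,
             "stage II-IV": 8.4, "metastatic": 11.0}
baseline = {"borderline": 6.3, "stage I": 7.5,
            "stage II-IV": 14.1, "metastatic": 4.0}

total_malignancy = round(sum(predicted.values()), 1)   # 67.6
benign = round(100 - total_malignancy, 1)              # 32.4
most_likely_malignancy = max(predicted, key=predicted.get)

# Ratio of predicted to baseline risk (an assumed way to compare them).
ratio_to_baseline = {t: predicted[t] / baseline[t] for t in predicted}

print(total_malignancy, benign, most_likely_malignancy)
# 67.6 32.4 borderline
```

In this example the predicted risk is above baseline for borderline, stage I, and metastatic tumours and below baseline for stage II-IV cancer (8.4% v 14.1%).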

Discussion

We developed and temporally validated a prediction model that is able to discriminate between five types of adnexal tumour (benign, borderline, stage I cancer, stage II-IV cancer, and secondary metastatic cancer), while still showing excellent overall discriminative capacity between benign and all malignant tumours. On the validation data, the previously proposed 10% risk cut-off for the total risk of malignancy21 resulted in 96.5% sensitivity and 71.3% specificity. The ADNEX model discriminated well between benign tumours and each of four types of malignancy (validation area under the receiver operating characteristic curves (AUCs) between 0.85 and 0.99). Moreover, the model was able to distinguish stage II-IV cancer from other malignancies (validation AUCs between 0.82 and 0.95) and showed fair discrimination between stage I cancer and borderline tumours (AUC 0.75) and stage I cancer and secondary metastatic cancer (AUC 0.71). The model uses three clinical predictors (age, serum CA-125 level, type of centre) and six ultrasound predictors (maximal diameter of lesion, proportion of solid tissue, more than 10 cyst locules, number of papillary projections, acoustic shadows, and ascites). Serum CA-125 level and proportion of solid tissue were the strongest predictors.

Results in relation to other studies

The polytomous approach to adnexal tumour diagnosis is novel. We do not know of multivariable polytomous models in this area outside the work of the International Ovarian Tumour Analysis (IOTA) group.16 In a recent meta-analysis evaluating the performance of prediction models and rules to characterise adnexal pathology, approaches by IOTA such as the LR2 logistic regression model21 and the simple rules25 26 (a set of 10 ultrasound features) performed best for the overall discrimination between benign and all malignant masses.40 The Royal College of Obstetricians and Gynaecologists has included the simple rules in their guidelines on management of adnexal tumours in premenopausal patients.41 The ADNEX model’s performance is similar to, or even slightly better than, that of LR2 and simple rules. For example, the AUC of LR2 on the validation data (IOTA phase 3) was 0.92.42 In contrast with LR2 and simple rules, the ADNEX model also enables specific subtyping of malignancy using risk estimates.

Strengths and weaknesses of this study

Our study has several strengths and limitations. Firstly, we used a large number of patients who were prospectively examined at 24 centres in 10 countries using a standardised protocol, avoided strong data driven variable selection, and conducted a large temporal validation of the model. After validation, we used the pooled data from almost 6000 patients to update the model coefficients. We would therefore expect our results to be generalisable. Secondly, it may be seen as an advantage that a histological diagnosis was obtained for every included tumour. This could also be regarded as a limitation, because the model is based on patients who were selected for surgery. Hence we cannot be certain that the test performance of the ADNEX model would be maintained if applied to a population that includes tumours selected for expectant management. However, this argument holds for all prediction models for the diagnosis of ovarian tumours. Thirdly, the centres used different assay kits for CA-125 assessment. This can also be interpreted as both a strength and a limitation: using different kits introduces variability in CA-125 levels (although this variability is minor43), but it reflects clinical reality and yields results that are less dependent on assay. Fourthly, a potential limitation is that experienced operators examined all tumours in the study. However, other studies have shown that dichotomous models developed by the IOTA group using ultrasound variables similar to those in the current study work well in the hands of non-expert (level 2)44 ultrasound examiners.45 46 Fifthly, there was no central review of pathology. In phase 1 of the IOTA study, 10% of the patients were selected at random for central review of pathology.21 Because we found no clinically important differences in reported outcomes between local and central reports, such centralised review was not performed in later phases of the IOTA study. This may nevertheless have introduced bias. For example, distinguishing borderline tumours from benign tumours or stage I cancer may be difficult for pathologists, and confusion of these tumour types might have affected the ability of the ADNEX model to distinguish between them correctly.

Implications for clinical practice

The ADNEX model has clear potential to optimise management of women with an adnexal tumour. Currently the risk of malignancy index (RMI)47 is often used to characterise adnexal masses as benign or malignant. However, the index had much poorer performance for discrimination between benign and malignant tumours (AUC 0.88, 67.1% sensitivity, and 90.6% specificity at the typical risk of malignancy index cut-off of 200) than the ADNEX model when tested on our validation data.42 In addition to offering excellent discrimination between benign and malignant tumours, the ADNEX model predicts type of malignancy. Knowledge of the specific type of adnexal pathology before surgery is highly likely to improve patient triage, and it also makes it possible to optimise treatment. This in turn may reduce morbidity and lead to enhanced survival from different types of ovarian malignancy. The correct identification of stage I cancer is particularly important.19 The ADNEX model can discriminate well between stage I cancers and benign tumours and between stage I cancers and advanced stage cancer. In addition, the ADNEX model can discriminate well between advanced primary cancer and secondary metastatic cancer. The latter result is largely achieved through the use of serum CA-125 level as a predictor. Although CA-125 level has little added value over ultrasound information when distinguishing benign from malignant tumours,17 the present study shows that serum CA-125 level is important for good discrimination between stage II-IV cancer and stage I and secondary metastatic cancer. An inconvenience that ADNEX shares with well known models to predict ovarian malignancy, such as the risk of malignancy index47 and the risk of ovarian malignancy algorithm (ROMA),48 is that predictions can only be made once the results of blood sample analyses are available. ADNEX implementations also allow risk calculation without a CA-125 level, but this will result in poorer discrimination between stage II-IV cancers and other types of malignancy.

We expect that the performance of the ADNEX model will be maintained in the hands of non-expert ultrasound examiners on condition that the examiners are familiar with the IOTA terms and definitions and use the IOTA examination and measurement techniques (see the IOTA consensus statement20). How the predicted risks from ADNEX should be used clinically must be decided on an individual basis, because patient management depends on many factors. When deciding on treatment of an adnexal mass, the likelihood of a specific type of malignancy is pivotal, but age, symptoms, wish to preserve fertility, comorbidity, and operative risks are also important factors. However, the ADNEX predictions may form a solid and objective base for optimal management of patients and could be incorporated in national and international clinical guidelines.

Key future research

Future work entails regular updating of ADNEX model coefficients using newly collected data, and monitoring of model performance. In addition, studies including patients who are managed conservatively are critically needed. This is the subject of phase 5 of the IOTA study, for which data collection started early in 2013. Finally, the ADNEX model could be optimised for use as a second stage test if screening for ovarian cancer is introduced into clinical practice.6

Conclusion

The ADNEX model has the potential to change management decisions for women with an adnexal tumour. This could impact considerably on the morbidity and mortality associated with adnexal pathology.

What is already known on this topic

  • Referring patients with ovarian cancer to specialised gynaecology oncology centres impacts positively on survival

  • Currently in Europe and the United States only a minority of women are triaged to receive specialist care in a gynaecology oncology centre

  • Personalised management, including fertility sparing surgery, requires knowledge of the nature of an ovarian mass

  • Prediction models exist that can discriminate between benign and malignant ovarian tumours but they do not subclassify malignant tumours

What this study adds

  • The ADNEX model discriminated well between benign and malignant ovarian tumours

  • The model was also able to discriminate between benign, borderline, stage I invasive, stage II-IV invasive, and secondary metastatic tumours

  • The ADNEX model may improve patient triage and decisions about management, and so positively impact on the morbidity and mortality associated with adnexal pathology

Notes

Cite this as: BMJ 2014;349:g5920

Footnotes

  • Contributors: BVC conceived and designed the study, with additional support from KVH, LV, TB, and DT. LV, ACT, DFi, CVH, LS, DFr, EE, JK, AC, SG, RF, CL, FS, and DT enrolled patients and acquired data. BVC, KVH, CVH, JK, and DT were involved in data cleaning. BVC analysed the data, with support from KVH and VVB. BVC, KVH, LV, ACT, JK, VVB, TB, and DT were involved in data interpretation. BVC, JK, TB, and DT wrote the first draft of the manuscript, which was then critically reviewed and revised by the other coauthors. All authors approved the final version of the manuscript for submission. All authors had full access to all of the data (including statistical reports and tables) in the study and can take responsibility for the integrity of the data and the accuracy of the data analysis. BVC, LV, TB, and DT are the guarantors.

  • Funding: This study was supported by the Flemish government: Research Foundation–Flanders (FWO) project G049312N, Flanders’ Agency for Innovation by Science and Technology (IWT) project IWT-TBM 070706-IOTA3, and iMinds 2013. BVC and VVB are postdoctoral fellows of FWO. KVH is a doctoral fellow of IWT. TB is supported by the National Institute for Health Research (NIHR) Biomedical Research Centre based at Imperial College Healthcare NHS Trust and Imperial College London. The views expressed are those of the authors and not necessarily those of the NHS, NIHR or Department of Health. LV is supported by the Swedish Medical Research Council (grants K2001-72X-11605-06A, K2002-72X-11605-07B, K2004-73X-11605-09A, and K2006-73X-11605-11-3), funds administered by Malmö University Hospital and Skåne University Hospital, Allmänna Sjukhusets i Malmö Stiftelse för bekämpande av cancer (the Malmö General Hospital Foundation for fighting against cancer), and two Swedish governmental grants (ALF-medel and Landstingsfinansierad Regional Forskning). The sponsors had no role in study design; in the collection, analysis, and interpretation of data; in the writing of the report; and in the decision to submit the work for publication. The researchers performed this work independently of the funding sources.

  • Competing interests: All authors have completed the ICMJE uniform disclosure form at www.icmje.org/coi_disclosure.pdf and declare: no support from any organisation for the submitted work; no financial relationships with any organisations that might have an interest in the submitted work in the previous three years; no other relationships or activities that could appear to have influenced the submitted work.

  • Ethical approval: The research protocols were approved by the ethics committee of the University Hospitals KU Leuven and by each centre’s local ethics committee.

  • Data sharing: No additional data available.

  • Transparency: The manuscript’s guarantors (BVC, LV, TB, and DT) affirm that the manuscript is an honest, accurate, and transparent account of the study being reported; that no important aspects of the study have been omitted; and that any discrepancies from the study as planned have been explained.

This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 3.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/3.0/.

References
