ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Systematic Review

Diagnosis of pelvic endometriosis: A systematic review and accuracy meta-analysis of non-invasive tests available in primary care

[version 1; peer review: 1 approved with reservations, 1 not approved]
PUBLISHED 28 Apr 2023
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

This article is included in the Health Services gateway.

This article is included in the Endometriosis collection.

Abstract

Background: Endometriosis is a chronic, often debilitating condition with a current significant delay from symptom onset to diagnosis with much of this in primary care.
Methods: A systematic review and meta-analysis of the primary literature was conducted to investigate the accuracy of symptoms, clinical history and first-line non-invasive tests to predict pelvic endometriosis (PROSPERO: CRD42020187543). We searched Medline, Embase, Web of Science and Scopus from conception (1966; 1972; 1997; 2004 respectively) to September 2022 for primary test accuracy studies assessing non-invasive tests against reference standard diagnosis for endometriosis. Two authors independently conducted data extraction and quality assessment. Grading of evidence was performed using a novel visual pentagon model. Meta-analyses of test accuracy was estimated using bivariate random effects models.
Results: The 125 included studies (250,574 participants) showed mixed quality. Studies applying non-surgical (database/self-reporting) reference standard had a greater risk of bias. In 98 studies applying surgical reference standard, summary diagnostic odds ratios for endometriosis were: dysmenorrhoea 2.56 (95% confidence interval 1.99-3.29); pelvic pain 2.56 (1.73-3.74); dyschezia 2.05 (1.36-3.10); dyspareunia 2.45 (1.71-3.52); family history of endometriosis 6.79 (4.08-11.3); nulligravidity of 2.01 (1.62-2.50); body mass index (BMI) ≥30kg/m2 0.37 (0.19-0.68); trans-vaginal ultrasound scan (TVUSS) endometrioma 91.2 (44.0-189); TVUSS invasive endometriosis 26.1 (9.28-73.5); and cancer antigen-125 (CA-125) >35U/mL 16.0 (8.09-31.7). Sensitivity analysis excluding all high-risk studies found concordant results.
Conclusions: This meta-analysis collated the performance of non-invasive tests for endometriosis across a comprehensive and geographically varied population. Study quality was mixed, however results were consistent with high-risk studies excluded. These findings will inform future prediction models for triage in primary care.

Keywords

Endometriosis, diagnosis, prediction, diagnostic laparoscopy, primary care, pelvic pain, dysmenorrhoea

Introduction

Endometriosis is a chronic condition affecting women of reproductive age characterised by extra-uterine deposits of endometrial tissue with a prevalence of up to 10%.13 It can be found in peritoneal deposits, ovarian endometriomata, and invasive disease, with sufferers experiencing pain and sub-fertility.2,46 There is currently a 7-12 years delay between symptom onset and definitive diagnosis, by laparoscopy and biopsy.79 Reducing this has long a research priority.10,11 Up to 58% of sufferers present to primary care 10 or more times before diagnosis, demonstrated across a variety of healthcare systems and geographies.4,7,1214 A better understanding of the accuracy with which symptoms, clinical history and non-invasive tests diagnose pelvic endometriosis would aid triage and referral.

Many studies have attempted to find replacement tests for diagnostic laparoscopy.1517 These are often invasive or otherwise not applicable to primary care.16,18 Previous meta-analyses have been restricted by a narrow inclusion criteria yielding a small number of eligible studies, limiting findings.19 Meta-analyses assessing imaging and biomarkers showed that no individual test met their criteria as a replacement or triage test alone, but findings on trans-vaginal ultrasound and serum CA-125 showed a high specificity for disease.20,21 These studies did not include assessment of other clinical factors. A 2019 narrative review demonstrated the importance of clinical factors in prediction of disease, but primary studies were not assessed for quality and absence of meta-analysis limited quantitative assessment of test performance.22

A 2021 case-control study reporting on the accuracy of a simple patient-completed questionnaire identified those at high or low risk of disease with good accuracy reflecting the utility of assessment of patient reported symptoms at a primary care level.23

Research related to endometriosis is increasing in volume, with 75% of primary studies published in the last decade.24,25 Given this, and due to the limitations of previous reviews a new comprehensive systematic review and meta-analysis is required. We performed such an evidence synthesis and determined the accuracy of symptoms, clinical history, a simple low-cost biomarker and first line ultrasound for the diagnosis of pelvic endometriosis by means of a comprehensive systematic review and accuracy meta-analysis.

Methods

The protocol was designed and registered with PROSPERO (Registration number: CRD42020187543).26 Reporting followed the PRISMA guidelines.27,59

Patient and public involvement

A virtual patient and public involvement meeting involving endometriosis sufferers at Chelsea and Westminster Hospital, London was held following an open invitation on social media to inform them of plans for this study. The aim was to better understand non-invasive predictors of endometriosis and reduce the diagnostic delay, which was well supported and resonated with their personal experience. No specific ethical approval was required for the patient and public involvement, in accordance with Health Research Authority (HRA) guidelines. No specific ethical approval was required as this study retrieved and synthesised data from already published studies. No individual participant data was collected and no participant consent was required.

Literature search and study selection

Medline, Embase, Web of Science and Scopus were searched from conception (1966; 1972; 1997; 2004 respectively) to September 2022 Search strategies are shown in Table 1 (see Extended data).53 Review article references were also screened.

Titles and abstracts were screened independently by two authors (TB; SJ) using EndNote-X9 and duplicate or irrelevant studies removed. Full texts were screened and justification for inclusion or exclusion recorded, differences were resolved by discussion with the senior author (AR).28

We included published peer reviewed studies reporting accuracy estimates to predict pelvic endometriosis (peritoneal; ovarian; disease >5mm retroperitoneal invasion) for one or more index tests in participants with reported presence/absence of endometriosis. Included tests were dysmenorrhoea; pelvic pain; dyschezia; dyspareunia; nulligravidity; BMI ≥30kg/m2; family history of endometriosis; transvaginal ultrasound finding (TVUSS) of endometrioma; TVUSS finding of invasive disease; serum CA-125 >35U/mL.

Target population was reproductive age women excluding pregnancy or systemic co-morbidities. Studies with reporting non-reproductive age participants were included only where their data could be excluded from meta-analysis. Studies were included where a 2x2 contingency table for index test(s) could be constructed. We imposed no limits to language of publication, setting, or number of participants. All non-English studies were translated by a medically trained native speaker.

Studies reporting laboratory and ultrasound tests were included only when performed prior to reference standard and using only standard 2D protocols. Definitions for each test for the purposes of study selection are shown in Table 2 (see Extended data).54

We excluded reviews; case reports; studies where information on recruitment or study population was unavailable; letters; and abstracts. Studies reporting non-pelvic endometriosis were included only where pelvic endometriosis was reported separately. Authors were contacted to obtain full texts only. Failing this, they were obtained through the British Library. Studies with incomplete data preventing determination of inclusion, exclusion or test accuracy were excluded.

Data extraction and quality assessment

For each included study, two authors (TB; SJ) independently recorded information on study characteristics and data was extracted to form 2x2 tables. Where there was unreliability in data extraction from some non-English language studies, these were excluded.

Risk of bias was assessed independently by two authors (TB; SJ) using the Quality Assessment of Diagnostic Test Accuracy Studies (QUADAS 2) tool.29 For studies regarding serum CA-125 or TVUSS we included the additional questions: ‘was the index test performed by a single operator?’ to assess inter-observer bias; and ‘was timing in the participants’ menstrual cycle controlled for?’. We adjusted the original question ‘if a threshold was used, was it pre-specified?’ to ‘was there a clear definition of what was considered a positive test?’.

Data synthesis

Due to differences in design, studies were divided into groups according to application of the reference standard: ‘Complete verification’, all participants received visual inspection of the pelvis at surgery; ‘Partial verification’, all cases received surgical confirmation but controls did not; and ‘Database/self-reporting’, cases confirmed by healthcare coding or self-reporting and controls from healthy populations not known to have endometriosis.

Statistical analysis was conducted using Stata software (version 15)30 to allow exploration of heterogeneity and statistical pooling using a bivariate random effects model and produced summary accuracy measures and summary receiver operative characteristic curves for each index test. A bivariate random effects model was applied for index tests with ≥5 contributing studies, and a univariate fixed effects model for index test with ≤4.

Index tests were assessed for performance as a ‘rule-in’ or ‘rule-out’ tool with pre-specified threshold summary accuracy of 95% sensitivity/50% specificity or 95% specificity/50% sensitivity respectively.

Results

Study selection and characteristics

Of 22,016 studies identified 125 met the inclusion criteria involving 250,574 participants (Figure 1). Details of included studies by number of participants and index test(s) are shown Table 2 (see Extended data54). Characteristics of included and excluded studies are shown in Tables 3 and 4 (see Extended data).55,56

45a5e3bc-0d55-4641-9e24-2b5b8fa63e2d_figure1.gif

Figure 1. Flowchart of selection of studies included in the systematic review and meta-analysis of non-invasive tests for the diagnosis of endometriosis and division into groups by means of application of the reference standard.

Mean number of index tests per study was 2 (range 1-6). A total of 241 were assessed across all studies. Included studies were geographically varied: 45 Europe, 34 North America, 19 Asia/Oceania, 13 the Middle East/Africa, 12 South America, and 2 transcontinental. Publication date ranged from 1986 and 2022 with 57% since 2010: 4 before 1989; 22, 1990-1999; 18, 2000-2009; and 71, 2010-2022.

Most studies (75) were ‘single-gate’ design with 50 of ‘two-gate’ design, including all studies in the partial verification and database/self-reporting groups. The mean prevalence of endometriosis in studies of a ‘single-gate’ design was 52% (range 9-93%), due to the selection of matched controls, prevalence in ‘two-gate’ studies was not relevant. There was heterogeneity in population selection, with participants having surgery for a broad range of indications such uterine fibroids or adnexal cysts as well as pelvic pain or sub-fertility.

In the 61 studies assessing symptom-based tests 20 did so by self-administered questionnaire; 14 by structured interview; 12 by clinical history taking, and 15 were undefined (Table 5 - see Extended data).57

Quality of included studies

Risk of bias

The assessment of study quality by QUADAS-2 is presented in Figures 2-6. Overall methodological quality was mixed, with 5 studies presenting a low risk of bias across all domains3135 and 64 presenting a high risk of bias or applicability in at least one domain.

45a5e3bc-0d55-4641-9e24-2b5b8fa63e2d_figure2.gif

Figure 2. Risk of bias and applicability concerns among studies included in the systematic review and meta-analysis of non-invasive tests for the diagnosis of endometriosis.

Graph: review authors’ judgements on each domain presented as percentages across included studies in: T all studies; A Complete verification; B Partial verification; and C Database/self-reporting groups.

45a5e3bc-0d55-4641-9e24-2b5b8fa63e2d_figure3a.gif45a5e3bc-0d55-4641-9e24-2b5b8fa63e2d_figure3b.gif45a5e3bc-0d55-4641-9e24-2b5b8fa63e2d_figure3c.gif45a5e3bc-0d55-4641-9e24-2b5b8fa63e2d_figure3d.gif45a5e3bc-0d55-4641-9e24-2b5b8fa63e2d_figure3e.gif

Figure 3. Reviewers assessment of risk of bias and applicability by application of the Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2) tool for included studies in the diagnosis of pelvic endometriosis: a systematic review and accuracy meta-analysis of non-invasive tests complete verification group.

BMI, body mass index. CA-125, cancer antigen-125. US, ultrasound. DIE, deep invasive endometriosis.

45a5e3bc-0d55-4641-9e24-2b5b8fa63e2d_figure4.gif

Figure 4. Reviewers assessment of risk of bias and applicability by application of the Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2) tool for included studies in the diagnosis of pelvic endometriosis: a systematic review and accuracy meta-analysis of non-invasive tests partial verification group.

BMI, body mass index.

45a5e3bc-0d55-4641-9e24-2b5b8fa63e2d_figure5.gif

Figure 5. Reviewers assessment of risk of bias and applicability by application of the Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2) tool for included studies in the diagnosis of pelvic endometriosis: a systematic review and accuracy meta-analysis of non-invasive tests database/self-reporting group.

BMI, body mass index.

45a5e3bc-0d55-4641-9e24-2b5b8fa63e2d_figure6.gif

Figure 6. Risk of bias and applicability concerns graph: reviewers authors’ judgements on each index test presented as percentages across included studies in diagnosis of pelvic endometriosis: a systematic review and accuracy meta-analysis of non-invasive tests by application of the Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2) tool.

In patient selection, 22 studies presented a low risk of bias, with 73 and 30 presenting an unclear or high risk respectively. Non-consecutive or non-random selection, two-gate selection for cases and controls, and having a highly selected group of participants (infertility cohort, surgery for a narrow indication etc.) were the main reasons for a high risk of bias.

Symptom based index tests presented an unclear or high risk of bias due to a lack of definition of a positive test and of blinding. Just 9 studies presented a low risk in symptom-based tests across all groups. Index tests applicable to clinical history or investigations performed better, with 66 studies presenting a low risk. Reasons for an unclear or high risk of bias were a lack of pre-specified criteria for a positive test; no blinding to results of the reference standard; and inter-observer variability regarding imaging. Studies in the partial verification group assessed a proportionally higher number of index tests nulligravidity and BMI ≥30kg/m2, which, less subjective to interpretation presented a lower risk of bias.

The risk of bias regarding the reference standard performed best in the complete and partial verification groups where 74 studies were at low risk of bias. Those with an unclear or high risk lacked information on how likely the surgery was to correctly classify the target condition or operators not blinded to the result of index test(s). In the database/self-reporting group, 5 studies assessed for probable surgical confirmation by means of additional codes at the time of recording and therefore presented a lower risk, all other studies were high risk.

In flow and timing, the complete verification group presented the lowest risk. An unclear or high risk of bias was attributable to a long (>12 months) or unclear time interval, and a high or unclear withdrawal of participants from analysis. All other studies presented a high risk as not all participants received the same reference standard.

Applicability

In patient selection, 40 studies gave low concern, with 63 and 22 giving unclear or high risk respectively. An unclear or high risk was attributable to the two-gate selection of controls, or the study likely to only classify a limited spectrum of disease (tertiary centres or infertility clinics).

In regard the reference standard, 97 studies showed a low concern. Studies in the database/self-reporting group were deemed high/unclear depending on whether additional coding input for surgery was recorded.

Test accuracy

Due to heterogeneity in methodology and study quality, meta-analysis was performed on studies from each group separately.

The accuracy of index tests to predicting endometriosis was variable, although results across groups were consistent. Each index test gave a positive likelihood for the presence of pelvic endometriosis, apart from a BMI ≥30kg/m2, which decreased the likelihood of disease. The positive likelihood ratio (LR+) for disease was highest in investigation tests and there was a trend towards a greater specificity than sensitivity. The summary results of bi/univariate meta-analysis are shown in Figure 7. An assessment of confidence in individual sensitivity and specificity of each test is displayed by a visual pentagon model, the methodology for this assessment is described in the discussion and legend shown in Figure 8.

45a5e3bc-0d55-4641-9e24-2b5b8fa63e2d_figure7.gif

Figure 7. Summary diagnostic results of bivariate meta-analysis of index tests for included studies in A: complete verification group; B: partial verification group; C: database/self-reporting group.

BMI, body mass index. TVUSS, transvaginal ultrasound scan. DIE, deep invasive endometriosis. CA-125, cancer antigen-125. DOR, diagnostic odds ratio. LR, likelihood ratio. *univariate analysis where fewer than 4 studies included.

45a5e3bc-0d55-4641-9e24-2b5b8fa63e2d_figure8.gif

Figure 8. Graphic display of grading of accuracy measures by visual pentagon model applied to index test sensitivity and specificity from included studies in the systematic review and meta-analysis of non-invasive tests for the diagnosis of endometriosis as displayed in Figure 7.

Score of 0 to -2 in each of 5 domains: design (study design type); risk of bias (QUADAS 2 risk of bias); indirectness (QUADAS 2 applicability); inconsistency (visual assessment of inter-study variance in confidence intervals); and imprecision (width of confidence intervals)

Investigation category tests were the best performing overall and TVUSS finding of endometrioma gave the highest summary LR+ at 21.6, at sensitivity and specificity of 77.2% and 96.4% respectively. Serum CA-125 >35U/mL showed sensitivity and specificity of 55.8% and 92.7% respectively, with LR+ of 7.63. TVUSS finding of DIE had showed sensitivity and specificity of 86.5% and 80.2% with LR+ of 4.39.

Symptom based tests showed LR+ within a similar range: 1.47 (dysmenorrhoea) to 1.93 (dyspareunia). Symptoms showed a generally higher specificity than sensitivity. Dyspareunia showed the highest LR+ at 1.93 with a sensitivity and specificity of 36.3% and 81.1% respectively.

Family history of endometriosis showed a LR+ of 6.25 with a high specificity (98.5%) but low sensitivity (9.25%). The finding of BMI ≥30kg/m2 showed a decreased likelihood of diagnosis of endometriosis (LR+ 0.44).

Hierarchical Summary Receiver Operating Characteristics (HSROC) curves for index tests in each group are shown in Figures 9-11. The HSROC curves show the greatest area under the curve (AUC) for investigation category tests.

45a5e3bc-0d55-4641-9e24-2b5b8fa63e2d_figure9.gif

Figure 9. Bivariate analysis Hierarchical Summary Receiver Operating Characteristics (HSROC) curve for index tests from studies included in the systematic review and meta-analysis of non-invasive tests for the diagnosis of endometriosis in the complete verification group for the diagnosis of endometriosis.

BMI, body mass index. TV, transvaginal, DIE, deep invasive endometriosis. CA-125, cancer antigen-125.

45a5e3bc-0d55-4641-9e24-2b5b8fa63e2d_figure10.gif

Figure 10. Bivariate analysis Hierarchical Summary Receiver Operating Characteristics (HSROC) curve for index tests from studies included in the systematic review and meta-analysis of non-invasive tests for the diagnosis of endometriosis in the partial verification group for the diagnosis of endometriosis.

(Index tests with fewer than 5 studies omitted). BMI, body mass index.

45a5e3bc-0d55-4641-9e24-2b5b8fa63e2d_figure11.gif

Figure 11. Bivariate analysis Hierarchical Summary Receiver Operating Characteristics (HSROC) curve for index tests from studies included in the systematic review and meta-analysis of non-invasive tests for the diagnosis of endometriosis in the database/self-reporting group for the diagnosis of endometriosis.

(Index tests with fewer than 5 studies omitted).

In the partial verification group, symptom index tests showed a greater LR+ than the complete verification group, range 2.47 (dysmenorrhoea) to 7.13 (dyschezia). Specificity was also higher, range 69% (dysmenorrhoea) to 92% (dyschezia).

In the database/self-reporting group symptom-based index tests performed similarly to other groups. In partial verification and database/self-reporting groups BMI ≥30 kg/m2 showed no correlation with disease and had 95% CI crossing 1.0. In all other index tests across all groups the 95% CI was >1.0.

The greatest inter-study variability in confidence intervals was shown in Forest plots for the symptom-based tests, notably pelvic pain. The inter study variance for specificity was generally lower than that for sensitivity, as was the overall width of confidence intervals. Forest plots for each index test in each group are shown in Figures 12-18.

45a5e3bc-0d55-4641-9e24-2b5b8fa63e2d_figure12.gif

Figure 12. Forest Plots for sensitivity and specificity of symptom-based index tests for included studies in diagnosis of pelvic endometriosis: a systematic review and accuracy meta-analysis of non-invasive tests complete verification group.

45a5e3bc-0d55-4641-9e24-2b5b8fa63e2d_figure13.gif

Figure 13. Forest Plots for sensitivity and specificity of clinical feature index tests for included studies in diagnosis of pelvic endometriosis: a systematic review and accuracy meta-analysis of non-invasive tests complete verification group.

45a5e3bc-0d55-4641-9e24-2b5b8fa63e2d_figure14.gif

Figure 14. Forest Plots for sensitivity and specificity of investigation index tests for included studies in diagnosis of pelvic endometriosis: a systematic review and accuracy meta-analysis of non-invasive tests complete verification group.

45a5e3bc-0d55-4641-9e24-2b5b8fa63e2d_figure15.gif

Figure 15. Forest Plots for sensitivity and specificity of symptom-based index tests for included studies in diagnosis of pelvic endometriosis: a systematic review and accuracy meta-analysis of non-invasive tests partial verification group.

45a5e3bc-0d55-4641-9e24-2b5b8fa63e2d_figure16.gif

Figure 16. Forest Plots for sensitivity and specificity of clinical feature index tests for included studies in diagnosis of pelvic endometriosis: a systematic review and accuracy meta-analysis of non-invasive tests partial verification group.

45a5e3bc-0d55-4641-9e24-2b5b8fa63e2d_figure17.gif

Figure 17. Forest Plots for sensitivity and specificity of symptom-based index tests for included studies in diagnosis of pelvic endometriosis: a systematic review and accuracy meta-analysis of non-invasive tests database/self-reporting group.

45a5e3bc-0d55-4641-9e24-2b5b8fa63e2d_figure18.gif

Figure 18. Forest Plots for sensitivity and specificity of clinical feature index tests for included studies in diagnosis of pelvic endometriosis: a systematic review and accuracy meta-analysis of non-invasive tests database/self-reporting group.

Sensitivity analysis performed for studies without any high-risk features is shown in Table 6 (see Extended data58). All studies included are from the complete verification group. Summary accuracy measures are consistent with those in this group for the majority of index tests, although sensitivity for TVUSS finding of endometrioma and DIE reduced to 69.8% and 73.4% respectively.

Discussion

Main findings

This meta-analysis presents an up-to-date, large, and geographically varied data set identifying predictive factors for diagnosis of pelvic endometriosis with a high degree of confidence. Index tests showed a positive association with endometriosis and trended towards a greater specificity than sensitivity, excluding elevated BMI, which demonstrated an inverse correlation. TVUSS finding of endometrioma reached a desired threshold for use as a ‘rule-in’ test and none achieved a summary sensitivity of >95%. A family history of endometriosis, dyschezia and serum CA-125 >35U/mL showed summary specificity of >90% although low sensitivity. Sensitivity was poor for symptom and clinical history tests, where the best performing was dysmenorrhoea.

Strengths and limitations

We undertook a thorough search of the current literature, undergoing analysis by two independent reviewers with strict quality assessment. Attempts were made to mitigate inter-study heterogeneity by division of studies into groups. All index tests are relevant to primary care, are immediately available without novel techniques or additional training and can assist in the triage of those with symptoms that could be attributable to endometriosis.

There were, however, limitations. Due to difficulties in data extraction from some non-English journals, 15 studies were excluded from the analysis. Some studies, such as Chapron et al 2005, which was seminal in providing a clinical prediction model for moderate/severe endometriosis, were not able to be included due to the inability for construct 2x2 tables.36 We did not contact authors to obtain individual data that was not available in the published text.

Overall, there was significant methodological variance and population heterogeneity in age; presentation; and stage of disease. Variation in selection of cases and controls may not reflect a clinically representative population. Prevalence of disease was higher than seen in the general female population, which may reflect a high degree of surgical accuracy, but also indicates the selective nature of study populations.

There is the possibility of inappropriate assignment of cases and controls, occurring in both directions due to uneven application of the reference standard, although we attempted to account for this by assigning groups. We included studies that diagnosed endometriosis by visual inspection and there is debate regarding this in the absence of histological confirmation.3739

There was variation in the definition of positive symptom index tests. This is common across many reviews and although there is guidance on symptom reporting, it was not clearly followed in all studies.4042 Assessment of symptoms varied, with most studies using a self-administered questionnaire. Although the use of standardised validated tools would better allow for comparison across studies, the nuance and detail acquired through clinical history taking is likely to better grasp the nature and significance of a symptom and its implications.

It is likely that imaging and surgical techniques have developed over time. A trend towards recent studies may mitigate this. Imaging modalities such as MRI and advanced sonographic techniques were not assessed due to their lack of universal availability in a primary care setting.

Considering the balances of strengths and weaknesses, however, we believe that our data synthesis presents an objective summary of the current evidence.

Interpretation

An understanding of the degree of likelihood associated with various symptoms and features in the clinical history can help assessment of patients with possible endometriosis in primary care.

The negative association between elevated BMI and endometriosis shown in the complete verification group is consistent with that demonstrated previously.43 This was not replicated across other groups. This may reflect a greater negative correlation between elevated BMI in higher risk populations in all surgical cohorts who may have more severe disease. This possibility is consistent with previous studies, demonstrating a significantly lower BMI in those with severe compared to mild disease and a 12-14% decrease in the likelihood of endometriosis being diagnosed for each unit increase in BMI (kg/m2).31,44 The interplay between BMI and endometriosis pathogenesis, however, remains poorly understood.

The trend of data from the partial verification and database/self-reporting groups to demonstrate better performing accuracy measures was likely a reflection of the selection of controls. This effect seems to outweigh the possibility of an undiagnosed disease burden in controls not exposed to a surgical reference standard. The accuracy of self-reported diagnosis of endometriosis has been assessed and performs well,45 false attribution of disease in the self-reporting group may therefore only present a small source of bias.

A greater specificity than sensitivity of tests may be associated with their correlation to disease severity. Dyschezia and dyspareunia have been linked to severe disease due to the involvement of a precise anatomical location in cases of invasive disease.46,47 Tests showing a greater sensitivity such as dysmenorrhea were also less specific.

Previous systematic reviews have similarly highlighted the heterogeneity and poor methodological quality of primary studies, limiting interpretation of findings.17,48 As our methodology allowed a wide inclusion criteria, we applied a novel grading protocol in order to more quantitively assess limitations. Grading of evidence for index tests was performed for sensitivity and specificity by application of a visual pentagon model for grading of test accuracy studies described by Rogozinska and Khan.49 This methodology is described in detail elsewhere but briefly, studies were given a score of 0 to -2 in each of 5 domains: design (study design type); risk of bias (QUADAS 2 risk of bias); indirectness (QUADAS 2 applicability); inconsistency (visual assessment of inter-study variance in confidence intervals); and imprecision (width of confidence intervals). The complete verification group shows the fewest limitations, whist the database/self-reporting studies showed very serious limitations. There was greater limitation in the investigation category tests due to more highly selective populations and a generally higher inter-study inconsistency and imprecision.

Conclusions

Research recommendations

The need for high-quality studies of predictive factors for endometriosis remains, particularly assessing populations attending primary care. Further multivariate analysis in powerful primary observational studies assessing factors that can be immediately and readily assessed in primary care would be of great value, as we anticipate the index tests assessed in this study to provide a greater degree of accuracy when applied in combination.4,36,50

We examined serum CA-125 at a cut off >35 U/mL, considered the upper limit of normal range, meta-analysis from 2016 found a cut off of 30 U/mL gave a sensitivity and specificity of 52% and 93% respectively, but sensitivity dropped to just 24% for detection of minimal disease.51 Further research assessing the accuracy of CA-125 at different thresholds and in combination with other tests could help improve accuracy.

Two recent studies (Fauconnier et al 2021 and Chapron et al 2022) assessed the accuracy of a patient-completed questionnaire and epidemiological data for the early identification of endometriosis and found it could do so with high diagnostic accuracy.23,52 Although these studies were conducted in a high-risk population undergoing surgery, the model maintained accuracy in population with a lower endometriosis prevalence of 10%. We do not anticipate a clinical score replacing laparoscopy due to its added therapeutic advantages and requirement to exclude other pathologies. If, however, disease can be predicted with a high degree of accuracy early on, medical therapy may be instigated and referral made for definitive diagnosis and counselling regarding treatment, prognosis and fertility in a timely manner with the aim of reducing the current extraordinary delay.

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 28 Apr 2023
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Bainton T, Jeyapala S, Zamora J et al. Diagnosis of pelvic endometriosis: A systematic review and accuracy meta-analysis of non-invasive tests available in primary care [version 1; peer review: 1 approved with reservations, 1 not approved]. F1000Research 2023, 12:453 (https://doi.org/10.12688/f1000research.131729.1)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 1
VERSION 1
PUBLISHED 28 Apr 2023
Views
7
Cite
Reviewer Report 30 Nov 2023
Hendy Hendarto, Department of Obstetrics and Gynecology, Faculty of Medicine, Airlangga University, Surabaya, Indonesia 
Approved with Reservations
VIEWS 7
  • The title of the study should be changed to better match the content of the manuscript and be useful for clinicians for application in the field. Proposed title: “Clinical diagnosis of endometriosis: A systematic review and accuracy
... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Hendarto H. Reviewer Report For: Diagnosis of pelvic endometriosis: A systematic review and accuracy meta-analysis of non-invasive tests available in primary care [version 1; peer review: 1 approved with reservations, 1 not approved]. F1000Research 2023, 12:453 (https://doi.org/10.5256/f1000research.144598.r216511)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Views
16
Cite
Reviewer Report 27 Oct 2023
Dan C Martin, Health Science Center, The University of Tennessee at Martin, Memphis, Tennessee, USA;  Endometriosis Foundation of America, New York, New York, USA;  IRB, Virginia Commonwealth University, Richmond, Virginia, USA 
Not Approved
VIEWS 16
Update 31/10/2023: On the request of the reviewer, a minor language edit has been made to point 5 for clarity. No other edits have been made. 
  1. The authors perform high-level statistical analysis of many poor-quality papers in
... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Martin DC. Reviewer Report For: Diagnosis of pelvic endometriosis: A systematic review and accuracy meta-analysis of non-invasive tests available in primary care [version 1; peer review: 1 approved with reservations, 1 not approved]. F1000Research 2023, 12:453 (https://doi.org/10.5256/f1000research.144598.r216534)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 28 Apr 2023
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.