Background

Approach to acute pain in the right lower quadrant in pregnancy is a challenge; It has various causes including digestive, gynecological and obstetrical, and renal causes. The possibility of acute appendicitis must be specifically ruled out, since it is the most common cause of surgical intervention in pregnancy requires immediate management [1,2,3,4,5,6,7]. The incidence rate of acute appendicitis in pregnancy has been reported to be 1:1250 and 1:1500 [8,9,10,11,12,13,14,15]. The possibility of acute appendicitis is higher in the second and third decades of life which are the fertility years [16,17,18]. Accurate diagnosis is difficult because the typical diagnostic clinical images are not present in all of the cases [19,20,21,22,23,24,25]. As the gestational age increases, the accuracy of the diagnosis decreases and the likelihood of appendical perforation and other complications will increase [26,27,28,29]. The negative laparotomy rate of suspected appendicitis is 25–50% in obstetric cases and 15–35% in general surgical cases [30,31,32,33,34,35,36]. Imaging in combination with physical examination will reduce the negative results of appendectomies [37,38,39,40]. Previous studies have indicated that CT (computed tomography) scan has better sensitivity and efficiency in comparison to US (ultrasound) [41,42,43]. Moreover, these studies have shown the high failure rate of US in diagnosing the disease even in patients whose appendicitis had been already confirmed by other imaging techniques [44,45,46,47]. The common usage of CT has been confirmed for evaluating patients suspected of appendicitis with reports on reduced healthcare costs for each patient and reduced rate of unnecessary appendectomy [48,49,50,51,52]. However, the fact that CT is applying ionizing radiation is worrying for both fetuses and pregnant women during the imaging process. Moreover, intravenous contrast is applied for increasing the diagnosis accuracy of CT, and it is associated with increased allergic reactions and contrast-inducted nephropathy. The changes made into the CT protocol can reduce exposure to fetal radiation less than 3 mGy, which is lower than the doses inducing adverse fetal effects [15, 53] mGy for the risk of carcinogenesis, 50 mGy for deterministic effects) [16, 54, 55]. However, CT is recommended in cases when inclusive clinical findings and ultrasound results are obtained or in situations in which MRI is not accessible. Thus, following the initial negative result of US, the American College of Radiology has introduced MRI (magnetic resonance imaging) as the recommended treatment [15, 16]. Therefore, the present systematic review and meta-analysis has been conducted to investigate the sensitivity,specificity, and diagnostic accuracy of MRI in diagnosing acute appendicitis in pregnant women.

Methods

Presenting a systematic review and meta-analysis based on PRISMA [26] principles.

Search methods for eligible studies

Searching for the eligible studies was conducted from 1/1/2009 to the end of 30/12/2018 by using the following searching strategy:

The searches were conducted by two independent researchers (MK, MS) to find the relevant studies published from 1/1/2009 to the end of 30/12/2018. We searched for published literature in the English language in MEDLINE via PubMed and EMBASETM via Ovid, The Cochrane Library, and Trip database. For literature published in other languages, we searched national databases (Magiran and SID), KoreaMed, and LILACS. To ensure literature saturation, the list of the included research references or the relevant reviews found by searching was studied (FP). The special search strategies were created using the Health Sciences Librarian website with specialization in systematic review searches using the MESH phrases and open phrases in accordance with the PRESS standards. After finalizing the MEDLINE strategy, the results were compared with searches from other databases (MS, FP). Similarly, PROSPERO was searched to find recent or ongoing systematic reviews. The keywords used in the search strategy were Pregnancy [MeSH], Pregnant [MeSH] OR—Magnetic resonance imaging [MeSH] OR—Appendicitis [MeSH] OR—Ultrasound, [MeSH] OR, imaging, MRI [MeSH] OR"،" , and Right lower quadrant pain [MeSH]. The list of previous study resources and systematic reviews were also searched for identifying the published studies (MK, MS). In addition, it was attempted to contact the authors of all studies that met the inclusion criteria and request unpublished data and abstracts (FP).

Eligibility criteria

The inclusion criteria we used to select articles are as follows: (a) original retrospective and prospective blinded studies investigating the performance of MRI for appendicitis diagnosis in suspected pregnant women presenting with right lower quadrant pain; (b) using laparaoscopy open surgery or histopathologic examination as the standard reference, (c) containing a 2 × 2 table or included data that allowed the construction of a 2 × 2 table, (d) Described the diagnostic criteria for appendicitis on MRI in clear details, and (e) met quality standards, as assessed by the 14-item Quality Assessment of Diagnostic Accuracy Studies (QUADAS2) tool.

Data extraction and risk of bias evaluation

The data were extracted for evaluating the characteristics of the participants. The index test included characteristics including special equipment, reference standard (executor of the tests and the interval between tests). The information related to diagnosis accuracy was also extracted. The first reader extracted the data (MS). The second reader confirmed the data (MK), and he would have completed them if they were incomplete.

The risk of bias of every article was evaluated using QUADAS-2 (a revised tool for quality assessment of diagnostic accuracy studies); four possible domains of bias results are evaluated. The first domain is patient selection (selecting the participants based on sequence or random). The participants of the present study are required to have the test conditions. Thus, the risk of bias is high in the studies; only participants suspected of appendicitis were selected. The second domain is the index test (wrong interpretation of the index test, accurate explanation of detection threshold). The third domain is the reference standard or “golden standard” (99% accuracy, the interpretation without considering the results of the index test). The last domain is the flow and timing (describing the patients receiving the index test, the time interval between index tests, and reference standard). Two reviewers evaluated the article independently with QUADAS-2 criteria (MS, FP). After independent evaluations, the reviewers discussed the article. Each domain was discussed to achieve a single view. The reliability of the reviewers for each domain was measured by using κ-statistic.

Statistical analysis

On the basis of the results from the 2 × 2 tables, pooled measures for sensitivity, specificity, diagnostic odds ratio (DOR), and area under the curves (AUC) along with their 95% confidence intervals (CIs) were calculated using the DerSimonian Lair methodology [56]. Based on the pooled DOR of each index, test summary receiver–operator curves (sROC) were reconstructed using Moses–Shapiro–Littenberg methodology [57]. The DOR reflects the ability of a test to detect, in this case, appendicitis. A DOR of 1 indicates that the test has no discriminative power. The higher the DOR, the better the diagnostic ability of the imaging modality. To evaluate heterogeneity between studies, a Cochran Q statistic and the I2 index was used. A substantial I2 index indicates heterogeneity beyond sampling variation. A meta-regression analysis was performed to identify pre-defined sources of heterogeneity. We constructed the forest plots with the freeware Meta-DiSc, version 1.4, software (http://www.hrc.es/investigacion/ metadisc-en.htm; Ramon y Cajal Hospital; Madrid, Spain) [58]. The data related to the diagnostic accuracy of ultrasound were collected for providing a complete analysis. Then, for each of the categories, some studies were meta-analyzed; these studies had high and low risk of bias of participant selection (based on QUADAS-2 criteria). Sensitivity, specificity, and positive and negative likelihood ratios (LRs) were computed based on the true-positive, true-negative, false-positive, and false-negative rates for each study. Both LRs are independent from prevalence rates, and there is a consensus that a positive LR > 10 and a negative LR < 0.1 provide reliable evidence of satisfactory diagnostic performance [59]. The ratio of positive LR to negative LR was combined in a single global accuracy measure, the diagnostic odds ratio [60]. Summary sensitivity and specificity, positive and negative LRs, and diagnostic odds ratios were estimated by using a bivariate random effects model. This approach assumes bivariate normal distributions for the logit transformations of sensitivity and specificity from individual studies [61, 62]. In addition, the hierarchical summary receiver–operating characteristic (ROC) curve presenting the point estimates for each study, the joint ROC curve, and the pooled characteristics, including the 95% confidence region and the 95% prediction region, was constructed [63].

Results

Study selection

Based on the searching strategy, as many as 1164 studies were selected. After analyzing the correspondence of the studies with the required criteria, 19 studies were selected for the final review (Fig. 1).

Fig. 1
figure 1

PRISMA flow diagram

Characteristics of the studies

The required characteristics of each selected study are indicated in Table 1. In total, 2400 patients (2400 pregnant women suspected of appendicitis presenting with right lower quadrant pain) were investigated in 19 studies. From these 19 studies, as many as 17 studies (94.7%) were retrospectives studies, and 2 studies (5.2%) were prospective studies. The investigated population were pregnant women suspected of appendicitis. The patients’ age ranged from 16 to 47 years. Seventeen studies applied 1.5-T MRI. Other studies have applied lower MRI field strength, and some have not reported it. From 2400 patients, 996 patients were evaluated with a magnetic strength of larger than 1 T. The MRI images obtained from all of the studies have been analyzed by an experienced radiologist which in most of the cases was a fellowship-trained attending radiologist.

Table 1 Summary of included studies

Risk of bias

The findings of QUADAS-2 assessment have been indicated in Figs. 2 and 3; they indicate that only one parameter has a low risk of bias in the assessment. The studies have two or some specific limitations (Figs. 2 and 3).

Fig. 2
figure 2

The risk of bias in the studies conducted was measured by using the QUADAS-2 tool. The risk of bias shown in Eq. 2 of the above image model (MRI) of each diagram indicate the number and percentage of studies with high (red), medium (yellow), and low (green) risk of bias in the four groups of the QUADAS-2 tool

Fig. 3
figure 3

Sensitivity of MRI for diagnosing appendicitis in studies that included pregnant patients only. Forest plot of sensitivity reported in each study. Each study is identified by name of first author and year of publication, with circles representing individual study point estimates, size of each circle indicating relative contribution to data pooling (inverse variance weighting), horizontal lines indicating 95% CIs, and dashed vertical lines representing 95% CIs for pooled sensitivity and specificity

Overall meta-analysis

For appendicitis in pregnancy, the MRI sensitivity was 91.8% at the confidence interval of 95 percent (95% CI 87.7–94.9%). At the confidence interval of 95%, the specificity was 97.9% (95% CI 97.2–100%). The diagnostic odds ratio was 177.60 (95% CI 35.012–900.91) showing a high accuracy of MRI in diagnosing appendicitis in pregnant women (Table 2). The SROC plot showed a summary of estimated sensitivity and specificity and the area under the SROC curve of MRI in diagnosing appendicitis in pregnant women (Figs. 3, 4, and 5). The positive and negative LRs for MRI in diagnosing appendicitis in pregnant women were 30.98(21.33 to 44.99) and 0.10(0.03 to 0.32) respectively (Figs. 6 and 7).

Table 2 Accuracy of individual studies MRI in characterization of acute appendicitis during pregnancy
Fig. 4
figure 4

Specificity of MRI (≥ 1.5 T) for diagnosing appendicitis in studies that included pregnant patients only. Forest plots of specificity reported in each study. Each study is identified by name of first author and year of publication, with circles representing individual study point estimates, size of each circle indicating relative contribution to data pooling (inverse variance weighting), horizontal lines indicating 95% CIs, and dashed vertical lines representing 95% CIs for pooled specificity

Fig. 5
figure 5

Summary-ROC (SROC) curve for diagnostic accuracy of MRI in diagnosing appendicitis. Size of each circle on graph represents sample size of included study. SE = standard error; Q* index = point at which sensitivity and specificity are equal or point closest to ideal top-left corner of SROC space

Fig. 6
figure 6

Positive LR of MRI for diagnosing appendicitis in studies that included pregnant patients only. Forest plot of positive LR reported in each study. Each study is identified by name of first author and year of publication, with circles representing individual study point estimates, size of each circle indicating relative contribution to data pooling (inverse variance weighting), horizontal lines indicating 95% CIs, and dashed vertical lines representing 95% CIs for pooled positive LR

Fig. 7
figure 7

Negative LR of MRI for diagnosing appendicitis in studies that included pregnant patients only. Forest plot of negative LR reported in each study. Each study is identified by name of first author and year of publication, with circles representing individual study point estimates, size of each circle indicating relative contribution to data pooling (inverse variance weighting), horizontal lines indicating 95% CIs, and dashed vertical lines representing 95% CIs for pooled Negative LR

Meta-regression

Meta-regression indicated that the mean field strength of MRI was intervening (p = 0.0017) (Table 3).

Table 3 The quality of the articles that is calculated using a checklist which includes 5 criteria

Subgroup analysis of MRI field of strength

Analyzing studies with the field strength of ≥ 1.5 T indicated that sensitivity was 0.94% (95% CI 88–0.98%) and specificity was 0.92% (at the confidence interval of 95% CI 0.64–100%). DOR was 325.74 (at the confidence interval of 95%) showing a very high accuracy of MRI with field strength of ≥ 1.5 T in diagnosing appendicitis in pregnant women. and inconsistency was 56%. The SROC plot. The SROC plot showed a summary of estimated sensitivity and specificity and the area under the SROC curve of MRI with a field strength of ≥ 1.5 T in diagnosing appendicitis in pregnant women (Figs. 6, 7, and 8).

Fig. 8
figure 8

Specifity of MRI (≥ 1.5 T) for diagnosing appendicitis in studies that included pregnant patients only. Forest plot of sensitivity reported in each study. Each study is identified by name of first author and year of publication, with circles representing individual study point estimates, size of each circle indicating relative contribution to data pooling (inverse variance weighting), horizontal lines indicating 95% CIs, and dashed vertical lines representing 95% CIs for pooled sensitivity and specificity

Country

Analyzing 13 studies conducted in USA [64,65,66,67,68,69, 73, 74, 76, 77, 79,80,81] indicated a pooled sensitivity of 91.5% (95% CI 86.8–95%) and a pooled specificity of 98.1%(95% CI 97.4–98.7%) and a DOR of 559.41 (95% CI 262.40–1192.6) (Figs. 9 and 10). the sensitivity, specificity, and DOR of MRI in diagnosis of appendicitis in pregnant women in South Korea based on 2 included articles [70, 78] were 100% (95% CI 88.1–100%), 95.6% (95% CI 90.1–98.6%), 596.36 (95% CI 55.640–6391.9) respectively. The sensitivity, specificity, and DOR of MRI in diagnosis of appendicitis in pregnant women in Canada based on 2 included articles [72, 82] were 57.1% (95% CI 18.4–90.1%), 94.4% (95% CI 84.6–98.8%), 20.523 (95% CI 3.250–129.61) respectively (Fig. 11).

Fig. 9
figure 9

Sensivity of MRI (≥ 1.5 T) for diagnosing appendicitis in studies that included pregnant patients only. Forest plots of specificity reported in each study. Each study is identified by name of first author and year of publication, with circles representing individual study point estimates, size of each circle indicating relative contribution to data pooling (inverse variance weighting), horizontal lines indicating 95% CIs, and dashed vertical lines representing 95% CIs for pooled specificity

Fig. 10
figure 10

Summary-ROC (SROC) curve for diagnostic accuracy of MRI (≥ 1.5 T) in diagnosing appendicitis. Size of each circle on graph represents sample size of included study. SE = standard error; Q* index = point at which sensitivity and specificity are equal or point closest to ideal top-left corner of SROC space

Fig. 11
figure 11

Pooled sensitivity and specificity of MRI in diagnosis of appendicitis in pregnancy by countries

Discussion

Many investigators have previously shown that MRI for the workup of acute appendicitis in pregnancy is highly reliable and useful. In this systematic review and meta-analysis, we calculated a 91.8% sensitivity and a 97.9% specificity of MRI for the diagnosis of appendicitis in pregnant women. We were able to include 7 more studies than the most recent published meta-analysis by Eugene Duke et al [83] included, which reviewed 12 studies between 2004 and 2015 comprising 933 pregnant women and calculated a pooled sensitivity of 94% (95% CI 87–98%), and specificity of 97% (95% CI 96–98%) and a DOR of 309.8 (95% CI 140.5–711). Also, analyzing studies with the field strength of ≥ 1.5 T (which was the field strength used in most of the studies included) indicated that sensitivity was 0.94% (95% CI 88–0.98%) and specificity was 0.92% (at the confidence interval of 95% CI 0.64–100%). DOR was 325.74 (at the confidence interval of 95%) which is indicative of the better accuracy of MRI with a field strength of ≥ 1.5 T in diagnosing appendicitis in pregnant women. Our calculated sensitivity and specificity was lower than what Blumenfeld YJ et al. [84] reported in their meta-analysis (specificity of 99.9% and sensitivity of 95.0%). Due to the high number of included studies which were conducted in the USA [64,65,66,67,68,69, 73, 74, 76, 77, 79,80,81], we decided to perform a subgroup meta-analysis based on the country which was indicative of a pooled sensitivity of 91.5% (95% CI 86.8–95%) and a pooled specificity of 98.1% (95% CI 97.4–98.7%) and a DOR of 559.41 (95% CI 262.40–1192.6) for MRI in diagnosis of appendicitis in pregnant women. The sensitivity, specificity, and DOR of MRI in diagnosis of appendicitis in pregnant women in South Korea based on 2 included articles [70, 78] were 100% (95% CI 88.1–100%), 95.6% (95% CI 90.1–98.6%), and 596.36 (95% CI 55.640–6391.9) respectively. Mahesh K et al. [85] in a 5-year study on 39 pregnant women reported that the sensitivity of CT in the diagnosis of appendicitis in pregnant women was 100%; they also reported a sensitivity of 46.1% for ultrasound, but on the other hand, Kevin A et al. [86] in their meta-analysis presented a sensitivity of 89.9% and a specificity of 93.6% for CT scan in diagnosing the acute appendicitis in adults including pregnant women which is quite close and even in some countries like South Korea, The Netherlands, and Italy, less than what we calculated as the sensitivity and specificity of MRI in diagnosing acute appendicitis in pregnant women. The SAGES guidelines do not recommend the employment of CT scan as the initial imaging technique for pregnant patients, except in cases where urgent information is needed for trauma or acute abdominal pain [87,88,89,90]. Given the effectiveness of ultrasound and MRI, CT should be used only in emergency cases or in situations where MRI is inaccessible or cannot be used [91, 92]. When ionizing radiation imaging is required, specific techniques can be employed in accordance with as low as reasonably achievable (ALARA) principle [93]. MR imaging can be performed without using intravenous gadolinium for pregnant women. MRI is favored compared with CT scan to diagnose non-obstetric abdominal pain in gravid patients, as the former makes it possible to take excellent soft tissue images without using ionizing radiation and is safer when applied to pregnant patients [67, 94, 95].

The maternal and fetal outcomes can be improved if abdominal conditions during pregnancy are diagnosed accurately and timely. Diagnostic laparoscopy is a preferred choice for cases where available resources prevent prompt imaging for diagnosis or when imaging is inconclusive. The risks of delayed diagnosis should be compared with possible risks associated with possible negative laparoscopy. The conditions diagnosed at laparoscopy should be treated by the surgeon as soon as possible [96].

The results of this study indicate that although there is a small difference between CT scan and MRI sensitivity in diagnosing appendicitis in pregnancy due to multiple complications of CT scan in pregnancy including exposure of patients to ionizing radiation, which is of special concern in pediatric and obstetric populations [87], MRI seems to be a more reasonable imaging modality than CT scan in cases of suspected appendicitis in pregnancy, especially in tertiary care centers that have access to specialized radiologists.

Limitations of this meta-analysis mostly relate to the available data and the heterogeneity of design, interpretation of results, and reporting of data in primary studies. Our study is also limited by the fact that most of the studies were retrospective case series. The number of patients enrolled ranged from 12 to 709, which in some cases led to inconsistencies in the results, especially in the calculated accuracies of MRI. and most studies did not include the overall appendicitis rates in their respective institutions, and thus, we were unable to calculate prevalence-adjusted positive and negative predictive values. Also, the studies varied by their inclusion criteria.

Conclusion

MRI has high sensitivity and specificity (91.8% and 97.9% respectively) for the diagnosis of acute appendicitis in pregnant patients with clinically suspected appendicitis. It is an excellent imaging technique in many instances, which does not expose a fetus, or the mother, to ionizing radiation, making it an excellent option for pregnant patients with suspected acute appendicitis. It can be performed at any stage of pregnancy, with no evidence of adverse effects on fetal outcomes, as it is currently being used. As radiologists become increasingly comfortable with interpreting abdominal and pelvic MRI, and as it becomes more widely available as an emergent procedure, its utility will continue to increase in the future.