Background

Intrahepatic cholestasis of pregnancy (ICP) is a pregnancy complication with prevalence ranging between 0.01 and 15.6% [1,2,3,4]. ICP presents as persistent pruritus, typically on the skin of palms and soles, with elevated bile acid levels, and resolves with delivery [5]. Generally, the onset of ICP occurs in the second and third trimesters and disappears spontaneously after delivery, usually within weeks [5,6,7]. The clinical importance of ICP lies in the potential fetal risks, including spontaneous preterm birth, iatrogenic preterm birth and fetal death [8]. While the pathophysiology of ICP is still poorly understood, gene, hormone, and environment play roles. Bile acids need to enter hepatocytes or bile ducts through the action of some transporters which genetic mutations can cause cholestasis [9, 10]. Estrogen can decrease the expression of ABCB11 / BSEP gene to inhibit the function of bile salt delivery pump, or decrease the activity of Na+ / K+ ATPase to inhibit the uptake of bile acid by hepatocytes that eventually leads to intrahepatic cholestasis. Sulfated progesterone metabolites can also have an adverse effect on bile acid excretion by inhibiting the function of BSEP [11]. Although the mechanism is unknown, studies have shown that the plasma and serum selenium concentrations and glutathione peroxidase activities in ICP patients are lower than those in healthy pregnant women, and the incidence rate is higher in winter. Some environmental factors such as pesticide pollutants, erucic acid in rape and selenium deficiency in food may lead to ICP [12]. Most often the disease affects women over the age of 35 years, [13] with personal history of cholestasis associated with the use of oral contraceptives, personal or family history of cholestasis of pregnancy, [14] biliary disease, [1] or liver disease, in multiple gestation pregnancy, [15] or in vitro fertilisation pregnancies [16]. Likewise, seasonal variations, [17] low selenium intake, erucic acid, increased gut absorption of bacterial endotoxins, pollutants, infections, and medicine are factors suspected as causing the disease [15,16,17,18,19,20].

When treating the ICP, doctors always focused on reducing maternal symptoms, improving results of liver tests, and reducing total bile acid (TBA) concentration. They commonly used Ursodeoxycholic acid (UDCA), S-adenosylmethionine (SAMe), dexamethasone, or cholestyramine as well as vitamin K (preventing postpartum bleeding) as therapies [20]. Whlie there was insufficient evidence to recommend early-term delivery or to support therapies above according to one Cochrane review [21]. However, the review found that UDCA seemed to improve the maternal symptom of pruritus, which agreed with the result of a meta-analysis by Bacq [22]. Bacq strongly suggested that UDCA was also beneficial for the fetal outcome, including total prematurity, fetal distress and neonatal respiratory distress syndrome; however, the Cochrane Review did not agree [21].

Some herbal medicine, such as Emodin, [23,24,25] Bushen Granule and Bushen Rougan Recipe, which includes Biejia (shell of Trionyx sinensis Wiegmann), Ejiao (made from skins of Equus asinus L.), Dihuang (tuberous root of Rehmannia glutinosa Libosch.), Gouqi (fruit of Lycium barbarum L.), Beishashen (root of Glehnia littoralis Fr. Schmidtex Miq.), Maidong (tuberous root of Ophio pogon japonicas (L. f) KerGawl.), Danggui (root of Angelica sinensis (Oliv.) Diels), Taoren (seed of Prunus persica (L.) Batsch), Qiancao (root of Rubia cordifolia L.), Baishao (root of Paeonia lactiflora Pall.), Huangjing (tuberous root of Polygonatum kingianum Coll. et Hemsl.), Jineijin (gizzard endothelium of Gallus Gallus domesticus), Dilong (Pheretima asperigillum), Haipiaoxiao (inner shell of Sepiella maindroni de Rochebrune), [26, 27] and in vivo cultured Calculus Bovis (made from bile of Bos taurus domesticus Gmelin) [28] were proved to be protective on cholestatic hepatitis by decreasing the levels of alanine aminotransferase (ALT) and total bile acid (TBA), prevent toxic compounds overaccumulation in hepatocytes. Decoctions like Zhi-Zi-Da-Huang decoction (including Zhizi (fruit of Gardenia jasminoides Ellis), Dahuang (Rheum palmatum L.), Zhishi (fruit of Citrus aurantium L.), Dandouchi (made from seeds of Glycine max (L.) Merr.) [29] and Shuangcao Tuihuang Granule-1 (including Yinchen (Artemisia scoparia Waldst. et Kit. Or Artemisia capillaries Thunb.), Huhuanglian (tuberous root of Picrorhiza scrophulariiflora Pennell), Tuxiangru (Origanum vulgare L.), Cheqianzi (Plantago asiatica L. or Plantago depressa Willd.), Shengdihuang (tuberous root of Rehmannia glutinosa Libosch.), Houpo (bark of Magnolia officinalis Rehd. et Wils.), etc.) [30] may also significantly dose-dependently reduce the indices of liver injuries by raising Superoxide Dismutase activity, scavenging oxygen free radicals and increasing anti-oxidation [28,29,30].

With the possible underlying mechanism of herbal medicinal for this condition, we conducted this systematic review to explore the potential effectiveness and safety of oral administration of herbal medicine in treating ICP.

Methods

Protocol registration

Protocol of this review was registered in PROSPERO as Ruiting Wang, Rongyan Peng, Nuo Cheng, Zeyu Yu, Mengdie Nan, Huijuan Cao. Oral herbal medicine for women with intrahepatic cholestasis in pregnancy: a systematic review of randomized controlled trial. PROSPERO 2018 CRD42018096013. Available from: http://www.crd.york.ac.uk/PROSPERO/display_record.php?ID=CRD42018096013.

Eligibility criteria

Published and unpublished studies, inany language, were included where thefollowing PICOS (patient, intervention, comparator, outcome, study type) criteria were met:

i) Patient: Patients, of any age, with intrahepatic cholestasis in pregnancy. Intrahepatic cholestasis in pregnancy (ICP or OC), diagnosed in accordance with recognized criteria (e.g. Guidelines for diagnosis and treatment of intrahepatic cholestasis in pregnancy (2015) by Chinese Medical Association). ii) Intervention: Herbal medicine through oral administration. iii) Comparators: Controls include placebo and conventional medicine (such as UDCA, SAMe, dexamethasone, vitamin K etc.). Chinese medicine combined with conventional medicine compared to the conventional medicine alone are also included. The controls in protocol were selected as Guidelines for diagnosis and treatment of intrahepatic cholestasis in pregnancy (2015) by Chinese Medical Association. After analyzing the data, most of the controls were not reported in the study, so there are some differences. iv) Outcomes: Primary outcomes included the changes in Ribalta score, maternal serum TBA values, and incidence of adverse birth events (e.g. newborn deaths, fetal distress and suffocation). Secondary outcomes included changes in bile acid content, incidence of cesarean sections, postpartum hemorrhage, adverse effects of medicine, changes in ALT values, and changes in AST values for the maternal, as well as the averages gestational age at birth and the incidence of premature births. v) Study type: Randomized controlled trials.

Literatures that unable to obtain the analysable data, as well as the piratical documents would be excluded.

Searching strategy

We searched seven databases and two clinical trial registration systems, including PubMed, SpringerLink, ProQuest, the Cochrane Central Register of Controlled Trials (CENTRAL), the Chinese National Knowledge Infrastructure Databases (CNKI), the Chongqing VIP Chinese Science and Technology Periodical Database (VIP), Wanfang Data Knowledge Service Platform, Chinese Clinical Trial Registry (ChiCTR), and Clinical Trials (ClinicalTrials.gov). The subject of the retrieval is: “Drugs, Chinese Herbal” [Mesh] OR “Herbal Medicine” [Mesh] OR “Herbal” [Mesh] with “Pregnancy” [Mesh] OR “Pregnancy Complications” [Mesh] OR “Infant” [Mesh] OR “Infant, Newborn” [Mesh] OR “Fetus” [Mesh] OR “Fetal Development” [Mesh] OR “Prenatal Diagnosis” [Mesh] OR “Fetal Monitoring” [Mesh] OR “Fetal Therapies” [Mesh] OR “Extraembryonic Membranes” [Mesh] OR “Placenta” [Mesh] OR “Placental Function Tests” [Mesh] OR “Uterine Monitoring” [Mesh] OR “Pelvimetry” [Mesh] OR “Oxytocics” [Mesh] OR “Tocolytic Agents” [Mesh] OR “Tocolysis” [Mesh] OR “Maternal Health Services” [Mesh] OR “Peripartum Period” [Mesh] OR “Parity” [Mesh] OR “Perinatal Care” [Mesh] OR “Postpartum Period” [Mesh] OR “Labor Pain” [Mesh] OR “Obstetrical” [Mesh] OR “Maternal-Child Nursing” [Mesh] OR “Midwifery” [Mesh] OR “Apgar Score” combined with “Cholestasis, Intrahepatic” [Mesh] AND “Random*” to be adjusted for use in the different databases.

Meantime, the Chinese National Knowledge Infrastructure Databases (CNKI), Wanfang Data Knowledge Service Platform and ProQuest Dissertations were used to search for grey literature.

Data extraction (selection and coding)

Trials retrieved using the search strategy and those from additional sources were screened independently by two review authors to identify trials that potentially meet the inclusion criteria outlined above. The full texts of these trials were retrieved and independently assessed for eligibility by other two reviewers. Any disagreement was resolved through discussion with a third reviewer (Huijuan Cao). A predesigned form was used to extract data from the included trials for assessment of trial quality.

Extracted information included: i) General information: Including document number, title, first author, year(s) conducted, location (city, country), source, etc.; ii) Methodological related information: type of design, grouping number, random allocation method, Random concealment method, method of blinding, participants blinded, loss of follow up, report of selective outcome, calculation of sample size, baseline comparability; iii) Participants information: diagnostic criteria, inclusion criteria, exclusion criteria, source, sample size, age, gender, disease course. iv) Intervention information: Types of interventions, intervention performer, treatment duration; v) Outcome measures: Treatment outcomes: changes in Ribalta score, changes in TBA values, number of neonatal deaths, number of fetal distress or asphyxiation, changes in ALT values, changes in AST values, number of caesarean sections, postpartum hemorrhage, adverse medicine effects, average gestational age at birth, number of premature deliveries.

Risk of bias assessment

We assessed the methodological quality of the included trials using the risk of bias tool recommended by the Cochrane Collaboration (Higgins and Green, 2009). Seven elements were assessed: random sequence generation, allocation concealment, blinding of participants and personnel, blinding of outcome assessment, incomplete outcome data (according to record the missing data and the method to deal with it), selective reporting (determined by the consistency of the predefined and reported outcomes) and other bias (assessed according to sample size calculation, inclusion/exclusion criteria for participant recruitment, comparability of baseline data, funding sources).

Strategy for data analysis

All statistical analyses were performed using RevMan 5.3 (The Cochrane Collaboration) software. The results of each single trial were described. We presented results as risk ratio (RR) with its 95% confidence intervals (CI) for dichotomous outcomes, and mean difference (MD) with 95% CI for continuous outcomes. Statistical heterogeneity between the included trials were assessed using the I2 value, and a meta-analysis were conducted if it had proved to be no significant clinical (relating to the participants, interventions, controls, and outcomes) and statistical heterogeneity (I2 values are less than 75%) among the included trials. If the I2value was less than 25%, a fixed-effect model (FEM) was used to synthesize the data, and if the I2 value was between 25 and 75%, we explored the sources of the heterogeneity. If the results of sensitive analysis or subgroup analysis explained the statistical heterogeneity successfully (i.e., I2 value is less than 25%), we used the FEM to synthesize the data as well, otherwise, random effect model (REM) was used to combine the data. Data would not be synthesized if there was a significant level of statistical heterogeneity (i.e., I2 value is greater than 75%) which was could not be explain or to handle (by subgroup analysis).

Additional analysis

If one meta-analysis included more than 10 trials, we explored the possibility of publication bias by using a funnel plot.

If one meta-analysis contained more than 8 included studies, we performed a Trial Sequential Analysis (TSA) was performed if there were more than 8 included studies in the meta-analysis. We calculated the required sample size of each meta-analysis and tested the robustness of the result with the TSA versioned 0.9.5.10 (Copenhagen: The Copenhagen Trial Unit, Center for Clinical Intervention Research, 2017). The information size required by the diversity-adjustment that we used was estimated from a control event proportion of the included trials and a priori intervention effect of 5%, and the diversity we used was estimated in the included trials.

Evidence quality assessment

The Grading of Recommendations Assessment, Development and Evaluation criteria (GRADE) was conducted to assess the certainty of the evidence for each primary outcomes with meta-analysis. Considering the following aspects, such as study design, risk of bias, outcome consistency of trials, directness and accuracy of evidence and possibility of publication bias, we judged whether to downgrade the evidence. Finally, the evidence was rated at four levels: high, moderate, low or very low.

Results

Selection results

A total of 1604 trials were retrieved according to the search strategy, and the literature was screened according to the predefined criteria. After reading the title and abstract, 1262 trials were screened out, and further another 187 trials were excluded after reading the full text. Forty-three randomized controlled trials were finally included. All the included trials were published in Chinese. The specific literature screening process is shown in Fig. 1.

Fig. 1
figure 1

Study Flow Chart

Basic characteristics of trials (see Table 1)

A total of 3556 women with ICP were included in the 43 trials [31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73]. The sample size ranged from 30 to 188 (an average of 41 patients in each group). The age distribution ranged from 18 to 42 years old, and the range of gestational age was 18–40 weeks. All cases were recruited from the outpatient/inpatient Department of Obstetrics and Gynaecology. Thirty-eight trials reported baseline comparisons in terms of demographic characteristics, and the other five trials only mentioned randomization, which did not explain whether the baseline characteristic of the participants was comparable.

Table 1 Characteristics of the 43 included studies

All patients were diagnosed according to the Chinese recognized criteria, including those cited in the textbook of “Obstetrics” [31, 32, 36,37,38,39,40, 59,60,61,62,63,64,65], the “Chinese obstetrics and gynaecology” [31, 34, 35, 41,42,43,44,45,46,47,48,49,50, 66,67,68,69], the “Obstetrics and Gynaecology Section of the Chinese Medical Association Obstetrics and Gynaecology Branch: Guide to diagnosis and treatment of intrahepatic cholestasis of pregnancy” [33, 51,52,53, 70], the “Practical Obstetrics and Gynaecology Handbook” [54], the “TCM Gynaecology” [45, 60, 65], the “Guidelines for diagnosis and treatment of intrahepatic cholestasis of pregnancy (2015)” [40, 71], the “Clinical obstetrics and gynaecology” [72], and the Reyes standard [73].

Among the included trials, four trials compared Chinese herbal medicine with conventional medicine [31, 32, 34, 35], 38 trials [34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73] compared the combination of the herbal medicine and conventional medicine with conventional medicine alone, the remaining one three-arms trial [33] covered both of these two comparisons. As the intervention, 30 trials used herbal decoction with fixed prescriptions [33,34,35, 37,38,39,40,41,42,43, 45,46,47,48,49,50,51,52,53, 58, 60, 61, 65,66,67,68,69, 71,72,73], seven trials used self-made prescriptions according to syndrome differentiation principle [31, 44, 54,55,56,57, 63], six trials used herbal patent [32, 36, 59, 62, 64, 70]. Main herbal prescriptions (reported in two studies or more) was Yinchenhao Decoction (n = 7). Main herbs (reported in twenty studies or more) included: Yichen (Artemisiacapillaris thunb) (n = 38), Huangqin (Scutellaria baicalensis Georgi) (n = 29), Zhizi (Gardenia jasminoides Ellis) (n = 28), Dahuang (Rheum officinale Baill) (n = 21), Fuling (Poria cocos (Schw.) Wolf.) (n = 20). Herbal patents included Yinzhihuang oral liquid [32, 36, 59, 64], Yinzhihuang granule [62], Wuling pill [70]. The ingredients of them were showed in Supplementary material 1. Treatment durations varied from 7 to 21 days among the included trials.

Forty-three trials [31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73] used conventional medicine as control treatment, including Ursodesoxycholic acid (UDCA) [39, 42, 45, 53, 56, 58, 59, 61, 63, 64, 66, 68, 69] (P.O., 8 ~ 15 mg/kg/d, 1 ~ 3 times/d), S-adenosylmethionine (SAMe) [32, 34, 43, 44, 46, 48, 52, 70, 71, 73] (P.O., 1 ~ 2 g/d; or ivgtt., 0.8 ~ 1 g/d, 5% glucose injection 250 mL), Dexamethasone [35] (P.O., 9 mg/d, 3 times/d), Dexamethasone plus vitamin K3 [55], SAMe plus reduced glutathione (GSH) [31], UDCA plus SAMe [33, 36, 37, 51, 67], SAMe plus phenobarbital and dexamethasone [41, 49], Dexamethasone plus Phenobarbital [54], UDCA plus SAMe and dexamethasone [50, 57, 62], SAMe plus dexamethasone [38] (SAMe: ivgtt., 1.0 g/d; dexamethasone: i.m., 6 ~ 10 mg, 1 ~ 2 times/d), GSH plus UDCA and dexamethasone [47], UDCA plus SAMe plus magnesium isoglycyrrhizinate [60], UDCA plus phenobarbital plus SAMe [65], UDCA plus dexamethasone (for patients with tendency of premature birth) [40, 72].

Twenty-two trials reported the changes in the pruritus score [34,35,36,37,38, 41,42,43,44, 46,47,48, 50,51,52,53, 59, 66, 69,70,71, 73]. The pruritus score of 18 trials was based on the standard Ribalta pruritus score [34, 35, 37, 38, 41,42,43,44, 46, 48, 51, 53, 59, 66, 69,70,71, 73]. Twenty-seven trials reported the post-treatment TBA values [31,32,33, 36,37,38,39,40,41, 45,46,47,48, 50,51,52,53,54,55, 57, 58, 60, 63, 66, 67, 71, 73]. Among them, 24 trials reported the serum TBA values [31,32,33, 36,37,38,39,40,41, 45, 46, 50,51,52,53,54,55, 57, 60, 63, 66, 67, 71, 73]. The TBA detection method of the remaining three studies is unknown [47, 48, 58]. One trial reported the changes in the serum TBA values after treatment [49]. Nine trials reported the neonatal death toll [34, 35, 53, 55, 60, 61, 63, 69, 71]. Twenty-seven trials reported the number of fetal distress [31, 34,35,36,37,38, 40, 41, 43, 45, 47, 49,50,51, 53, 60, 62,63,64,65, 67,68,69,70,71,72,73]. Nineteen trials reported the number of asphyxia neonatorum [32, 37, 39,40,41, 43, 45, 47,48,49,50,51,52,53, 56, 61, 65, 67, 71]. Besides the above primary outcomes we defined in this review, 25 trials reported the number of cesarean section [31, 33,34,35,36,37,38,39,40,41, 43, 45, 47, 48, 53, 55, 56, 61, 63, 64, 67, 69, 70, 72, 73], 15 trials reported the number of postpartum hemorrhage [35, 39,40,41, 43, 47, 50, 53, 55, 60, 61, 65, 68, 70, 72], five trials reported the number of adverse events [48, 53, 55, 68, 70]. Twenty-nine trials reported the post-treatment serum ALT values [32,33,34,35,36, 38, 39, 41,42,43, 45, 46, 49,50,51,52,53,54,55, 58,59,60, 63, 65,66,67, 69,70,71, 73], and 19 trials reported the post-treatment serum AST values [33, 35, 36, 38, 41, 46, 49, 50, 52, 53, 55, 58, 60, 63, 65,66,67, 70, 71]. Five trials reported the average gestational age of the newborn [31, 33, 34, 66, 69]. Twenty-seven trials reported the number of premature birth [32, 35,36,37, 39,40,41, 43,44,45, 47, 48, 50,51,52,53,54,55, 60, 62,63,64,65, 67, 68, 70,71,72], and one trial reported the number of premature birth before gestational age of 37 weeks [45].

Quality evaluation of inclusion studies

None studies used the appropriate random number table method for sequence generation [32, 34, 38, 49, 57, 60, 68, 69, 71], one study used coinage method to generate random numbers [58]. It is unclear whether the remaining studies had used adequate randomization allocation methods. None of the 43 studies reported the allocation concealment or blinding of participants or investigators. One study reported and explained the dropouts and losses to follow-up [40], and none of the others reported dropouts. Two study has selective reporting outcomes [34, 51], because one of them intended to use liver function data as outcome measures but was not reported in result and another one reported the number of adverse birth events but was not reported in protocol. All studies did not report sample size estimates and financial support, also the baseline comparability was unclear. Thus, all the included trials were considered to be at high risk of other bias. So all the other bias risks of the 43 studies are unclear. In summary, all of the included trials were assessed as having high risk of bias due to the undefined methods of randomization, absence of blinding and the potential inappropriate method on dealing with missing data (See Fig. 2).

Fig. 2
figure 2

Risk of Bias assessment for the included trials

Estimate effects (see Table 2)

Herbal medicine vs. conventional medicine

A total of five studies compared herbal medicine with liver protection medicine [31,32,33,34,35].

Table 2 Effect of estimates of oral herbal medicine for intrahepatic cholestasis in pregnancy in 43 included trials

Two studies reported changes in pruritus score of the pregnancy which was based on the standard Ribalta pruritus score [34, 35]. The analysis showed potential good adjunctive effect of herbal medicine on decreasing the pruritus scores on the basis with conventional medicine (MD -0.68, 95%CI − 0.88 to − 0.47, I2 = 0%, P < 0.00001, 2 trials, 120 patients). Two studies reported changes in maternal serum TBA [32, 33], since the obvious statistical heterogeneity among studies (I2 = 95%), it was impossible to combine the data (MD and 95% CI were − 4.74 μmol/L, − 6.57 to − 2.91; and 1.40 μmol/L, − 0.45 to 3.25; respectively). Two trials reported stillbirths or neonatal deaths post-treatment, and both two groups reported none [34, 35]. Three studies reported the number of fetal distress [31, 34, 35], the results showed that there was no significant difference between the herbal medicine group and conventional medicine group (RR 0.77, 95% CI 0.43 to 1.39, I2 = 0%, P = 0.36, 3 trials, 218 patients). Another study [32] reported the numbers of neonatal asphyxia, which also showed no difference between the herbal medicine and UDCA (RR 1.00, 95% CI 0.06 to 16.76, P = 1.00, 1 trial, 60 patients).

Four studies reported the number of deliveries by cesarean section [31, 33,34,35], the results showed that there was no difference between herbal medicine and conventional medicine (RR 0.97, 95% CI 0.80 to 1.18, I2 = 0%, P = 0.78, 4 trials, 338 patients). One trial reported the number of postpartum hemorrhage [35] and the results showed that there was no significant difference in this outcome between herbal medicine and conventional medicine (RR 0.47, 95% CI 0.09 to 2.36, P = 0.36, 1 trial, 60 patients). One trial reported changes in maternal ALT (unknown as whole blood or serum) and the results showed that there was no significant difference in this outcome between herbal medicine and conventional medicine (MD -12.80 U/L, 95% CI − 25.81 to 0.21, P = 0.05, 1 trial, 98 patients) [31]. Four studies reported changes in maternal serum ALT and the results showed that there was no significant difference in this outcome between herbal medicine and conventional medicine (MD 0.90, 95% CI − 5.10 to 6.90, I2 = 67%, P = 0.77, 4 trials, 300 patients) [32,33,34,35]. Two studies reported changes in maternal serum AST and the results showed that there was no significant difference in this outcome between herbal medicine and conventional medicine (MD 11.20 U/L, 95% CI 7.86 to 14.54, P < 0.00001, 1 trial, 120 patients) [33, 35]. Three studies reported the gestational age of the newborn (MD and 95% CI were 1.80 weeks, 1.04 to 2.56; − 0.50 weeks, − 1.31 to 0.31; 0.70 weeks, − 0.35 to 1.75; respectively) [31, 33, 34], meta-analysis of all the above studies could not be conducted due to the obvious statistical heterogeneity. Another trials reported the numbers of the preterm births, which showed no differences between herbal medicine and conventional medicine (RR 0.28, 95% CI 0.06 to 1.27, I2 = 0%, P = 0.10, 12 trials, 120 patients) [32, 35].

Combination of herbal medicine and conventional medicine vs. conventional medicine alone

Thrity-nine trials contributed data to this comparison [33, 36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73].

Pruritus score

Fifteen trials reported pruritus score of the pregnancy which was based on the standard Ribalta pruritus score [37, 38, 41,42,43,44, 46, 48, 51, 53, 59, 66, 70, 71, 73]. The post-treatment score could not be synthesized due to the obvious statistical heterogeneity. Subgroup analysis showed potential good adjunctive effect of herbal medicine on decreasing the pruritus scores on the basis with SAMe (MD -0.91, 95% CI − 1.10 to − 0.72, I2 = 80%, 7 trials, 638 women), UDCA (MD -0.58, 95% CI − 0.79 to − 0.36, I2 = 56%, 4 trials, 273 women), combination of SAMe and dexamethasone (MD -0.69, 95% CI − 0.80 to − 0.57, I2 = 0%, 2 trials, 276 women), and the combination of SAMe and UDCA (MD -1.68, 95% CI − 1.99 to − 1.37, 1 trial, 80 women), (MD -0.77, 95% CI − 1.01 to − 0.53, 1 trial, 60 women).

The serum TBA value

Twenty-three trials reported the serum TBA before and post-treatment [33, 36,37,38,39,40,41, 45, 46, 50,51,52,53,54,55, 57, 60, 63, 65,66,67, 71, 73]. The score could not be synthesised due to the obvious statistical heterogeneity. Subgroup analysis showed potential good adjunctive effect of herbal medicine on decreasing the serum TBA scores on the basis with UDCA (MD − 3.99 μmol/L, 95% CI − 4.24 to − 3.74, I2 = 0%, 7 trials, 522 women), SAMe (MD − 7.33 μmol/L, 95% CI − 8.50 to − 6.15, I2 = 0%, 4 trials, 230 women), dexamethasone (MD − 23.30 μmol/L, 95% CI − 24.61 to − 21.99, 1 trial, 86 women), combination of UDCA and SAMe (MD − 7.62 μmol/L, 95% CI − 12.97 to − 2.27, I2 = 98%, 7 trials, 652 women), and the combination of UDCA and SAMe and dexamethasone (MD − 3.54 μmol/L, 95% CI − 4.03 to − 3.05, I2 = 8%, 4 trials, 417 women).

The number of adverse birth events

Six trials reported no stillbirths or neonatal deaths post-treatment [53, 55, 61, 63, 69, 71]. One trial reported one neonatal deaths post-treatment in conventional medicine group [60].

Twenty-four trials reported the number of fetal distress [36,37,38, 40, 41, 43, 45, 47, 49,50,51, 53, 60, 62,63,64,65, 67,68,69,70,71,72,73], 18 trials reported the number of asphyxia neonatorum [37, 39,40,41, 43, 45, 47,48,49,50,51,52,53, 56, 61, 65, 67, 71], 22 trials reported the number of cesarean section [33, 36,37,38,39,40,41, 43, 45, 47, 48, 53, 55, 56, 61, 63, 64, 67, 69, 70, 72, 73], and another 14 trials reported the number of postpartum hemorrhage [39,40,41, 43, 47, 50, 53, 55, 60, 61, 65, 68, 70, 72]. Significantly lower incidence of fetal distress (RR 0.41, 95% CI 0.32 to 0.51, I2 = 0%, 24 trials, 1915 women), asphyxia neonatorum (RR 0.35, 95%CI 0.25 to 0.49, I2 = 0%, 18 trials, 1492 women), cesarean section (RR 0.73, 95% CI 0.63 to 0.85, I2 = 65%, 22 trials, 1974 women), postpartum hemorrhage (RR 0.45, 95% CI 0.28 to 0.72, I2 = 0%, 14 trials, 1130 women) were observed in the combination group compared to the conventional medicine group.

Liver function blood test results

Twenty-five trials reported the serum ALT values [33, 36, 38, 39, 41,42,43, 45, 46, 50, 52,53,54,55, 58,59,60, 63, 65,66,67, 69,70,71, 73]. The data could not be synthesized due to the obvious statistical heterogeneity. Results from single study or subgroup meta-analysis showed potential good adjunctive effect of herbal medicine on decreasing the serum ALT values on the basis of UDCA (MD -19.91 U/L, 95% CI − 35.42 to − 4.39, I2 = 99%, 9 trials, 625 women), dexamethasone (MD -15.05 U/L, 95% CI − 19.59 to − 10.51, I2 = 72%, 4 trials, 432 women), and the combination of UDCA and SAMe (MD -18.94 U/L, 95% CI − 20.91 to − 16.97, I2 = 34%, 5 trials, 506 women), the combination of UDCA, SAMe and dexamethasone (MD -8.69 U/L, 95% CI − 13.16 to − 4.22, 1 trial, 96 women). Six trials reported there is no statistical difference between the combination treatment and SAMe alone in post-treatment serum ALT value (MD -15.14 U/L, 95% CI − 30.79 to 0.50, I2 = 98%, 6 trials, 410 women). Detail of the results were shown in Table 2.

Eighteen trials reported the serum AST values [33, 36, 38, 41, 46, 50, 52, 53, 55, 56, 58, 60, 63, 65,66,67, 70, 71]. Results from single study or subgroup meta-analysis showed potential good adjunctive effect of herbal medicine on decreasing the serum AST values on the basis with the combination of UDCA and SAMe (MD -16.38 U/L, 95% CI − 20.49 to − 12.27, I2 = 79%, 5 trials, 476 women), dexamethasone (MD -9.23 U/L, 95% CI − 16.71 to − 1.76, I2 = 81%, 3 trials, 346 women). There is no statistical difference of herbal medicine on decreasing the serum AST values on the basis with UDCA, SAMe, and the combination of UDCA and SAMe and dexamethasone. Detail of the results were also shown in Table 2.

The average gestational age of the newborn

One trial reported the average gestational age of the newborn which indicated that there was no difference between thecombination group and conventional medicine group (MD 0.49 weeks, 95% CI 0.01 to 0.97, 3 trials, 263 women) [33, 66, 69].

The number of premature birth

Twenty-six trials reported the number of premature birth [32, 36, 37, 39,40,41, 43,44,45, 47, 48, 50,51,52,53, 60, 62,63,64,65, 67, 68, 70,71,72]. Significantly lower incidence of Premature birth was observed in the combination group than thecontrol group (RR 0.41, 95% CI 0.34 to 0.51, I2 = 7%, 26 trials, 2199 women).

Adverse events

Five trials reported the number of adverse events [48, 53, 55, 68, 70]. One trial reported six cases of nausea, headache and flushin in treatment group [48]. One trial reported one case of loose stool, disappeared in the following course, two cases of nausea [53]. One trial reported no adverse events during the treatment [55]. One trial reported one case of headache and one case of tachycardia in treatment group, and one case of tachycardia and one case of constipation in control group [68]. Another trial reported two cases of nausea, two cases of acid reflux, one case of bloating in treatment group, and one case of acid reflux, two cases of bloating, one case of nausea, one case of rash in control group [70].

Funnel plot

A funnel plot analysis of 24 trials was performed to examine outcome for the serum TBA value of patients. The result showed potential asymmetry (Fig. 3). The potential asymmetry may be caused by small study effects, different methodological quality, or even Heterogeneity in intervention effects.

Fig. 3
figure 3

Funnel plot assessing outcomes of the number of fetal distress reported in 24 randomized controlled trials

Trial sequential analysis (TSA)

We conducted TSA with the data from the two meta-analyses above. For the outcome of numbers of fetal distress in comparing herbal medicine combined conventional medicine to conventional medicine alone, TSA illustrated that the cumulative Z-curve across the traditional boundary of 5% significance (horizontal line) as well as the monitoring boundaries (inward sloping curves) (see Fig. 4). After the sixth study, the significance testing had been performed each time a new trial was added to the meta-analysis, which means the sample size achieved the required 349 participants and we had enough power to confirm the evidence (that with adjunction of herbal medicine, the therapy may decrease 12% more fetal distress) controlling for the risk of random error.

Fig. 4
figure 4

Trial Sequential Analysis results on decreasing numbers of fetal distress

For the outcome of numbers of asphyxia neonatorum in the same comparison, the result was similar with the two crossings. TSA also illustrated that the cumulative Z-curve across the horizontal line and the inward sloping curves (see Fig. 5), which means the sample size achieved the required 376 participants and we had enough power to confirm the evidence (that the combination therapy may decrease 10% more cases of asphyxia neonatorum).

Fig. 5
figure 5

Trial Sequential Analysis results on decreasing numbers of asphyxia neonatorum

Discussion

Summary of main findings

Totally 43 trials with 3556 women were included, all of them were assessed as having high risk of bias. Results from these trials showed potential better effect of herbal medicine when combined with conventional medicine on decreasing pruritus scores, reducing adverse birth events (which was supported by the TSA results) and improving the liver functions according to the serum TBA, ALT, AST values compared to conventional medicine alone. However, due to the poor methodology of the included studies and the obvious statistical heterogeneity among trials, quality of the evidence for all these outcomes were “low” and “very low” according to the GRADE assessment (see Tables 3 and 4). When compared to conventional medicine, the single application of herbal medicine showed no better/worse effect for the condition, we could not draw any firm conclusion on this type of comparison due to the small sample size of involved trials. Furthermore, few data were reported to indicate the adverse events in herbal medicine group, which also made the safety of the herbal medicine was unclear.

Table 3 Summary of finding table of herbal medicine compared to conventional medicine for women with intrahepatic cholestasis in pregnancy
Table 4 Summary of finding table of combination of herbal medicine and conventional medicine compared to conventional medicine for women with intrahepatic cholestasis in pregnancy

Compare to the previous studies

The results of a meta-analysis preliminarily showed that traditional Chinese medicine combined with conventional medicine (or physical therapy) had a certain therapeutic effect on neonatal jaundice in treatment of recurrent spontaneous abortion [74]. Another meta-analysis showed that Yinzhihuang Oral Liquid was more effective in improving itching symptoms of pregnant women, reducing serum total bilirubin, total bilirubin, glycocholic acid and increasing neonatal weight of newborns than the control group in treatment of intrahepatic cholestasis of pregnancy [75]. The third meta-analysis showed that Yinchenhao Decoction had better effect on treating neonatal jaundice, shortening recovery time of serum total bilirubin and jaundice subsidence time than that of conventional medicine alone [76]. All these findings indicated that herbal medicine may have effect on improving the liver function of pregnancy women, which was consistent with our findings.

Since we did not find any other review focused on this disease through the literature searching, this is the probably the first study assessed the herbal medicine for ICP and the current evidence may support the effectiveness of combination of herbal medicine and conventional medicine for decreasing the maternal pruritus scores, the serum TBA, and the number of fetal distress, asphyxial events or asphyxia neonatorum events related to this condition.

Implications for practice

Though we only got “very low” quality evidence to support the adjunctive effect of herbal medicine in treating ICP. The results showed a statistically meaningful advantage of herbal medicine on improving the symptoms or reducing adverse birth events when combined with conventional medicine. Considering the uncertain safety of the herbal product, we suggest the application of herbal medicine would be recommended in consideration of the expertise and experience of the clinician. Treatment duration would be 10 to 14 days according to this review. The most frequently used prescription in our review is Yinchenhao Decoction, in which Yinchen and Zhizi are the core component. Bian found that Yinchenhao decotion can induce liver fibrosis by dimethylnitrosamine, reduce hydroxyproline and improve liver function and hepatic histology after 2 weeks of treatment in rats [77]. Mentimes, a systematic review indicated that Yinchenhao decotion can significantly improve cholestasis by reducing elevated serum markers [78]. Practitioners may choose to use modified Yinchenhao Decoction in treating this condition.

Implications for future studies

There are some deficiencies in this study that should be improved in future studies. In terms of scheme design and method application, most studies did not give a clear introduction to the method of randomization and blinding. Although the blind method may not be easy to operate for patients due to the difficulty in the operation of herbal medicine placebo, a blind method should be implemented for the outcome assessors or statisticians to avoid bias. Secondly, in terms of the data analysis, missing data should be handled with appropriate statistical methods. Reporting of the trial should follow the standard of CONSORT. Besides, five databases have been searched, but no studies that are published in English can be included, which suggests that relevant studies are possibly published mainly in Chinese. Since the access to databases such as Allied and Complementary Medicine (AMED) and the Cumulative Index to Nursing and Allied Health Literature (CINAHL) is not available for searching within the region of our researchers, there might be the possibility to miss studies. More English databases should be considered to be searched in the future.

High quality studies are needed to verify the adjuvant efficacy and safety of Chinese herbal medicine. Meanwhile, cost effect analysis may be considered to be conducted in the future.

Conclusions

This review found “very low” quality evidence which support the effectiveness of combination of herbal medicine and conventional medicine for decreasing the maternal pruritus scores, the serum TBA, and the number of adverse birth events. TSA analysis showed the results of benefit of combination of herbal medicine and conventional medicine for decreasing the number of fetal distress and asphyxia neonatorum had enough statistical power. More high-quality trials are still needed to prove the superior effect and safety of herbal medicine as adjunctive treatment for this disease.