Background

Low back pain (LBP) is a prevalent disease in adults, especially those ranging from 40 to 80 years of age [1]. LBP can be classified into two types: acute and chronic. Chronic LBP is characterized by persistent pain and a high risk of disability, which contributes to soaring medical costs and loss of labor, possibly leading to a critical impact on the social economy [2]. The major cause of LBP is lumbar disc degeneration (LDD) [3]. Although the pathogenesis of LDD is multivariate, genetic factors have been reported to play an important role in LDD, including collagen I alpha 1 (COL1A1) gene, collagen IX alpha 1 chain (COL9A1) gene, collagen IX alpha 2 chain (COL9A2) gene, collagen IX alpha 3 chain (COL9A3) gene, vitamin D receptor gene, and cartilage intermediate-layer protein gene [4,5,6,7].

Collagen IX gene is one of the most recent trending research targets among numerous genes [8]. Collagen IX has been demonstrated to serve as a bridge between collagenous and non-collagenous proteins in normal tissues [9]. COL9A2 and COL9A3 genes encode α2 and α3 chains on collagen IX, respectively [10, 11]. To date, numerous groups have reported the associations of COL9A2 and COL9A3 gene polymorphisms with the risk of LDD. The first study, conducted by Annunen et al., suggested that COL9A2 gene polymorphism (rs12077871) was associated with LDD in the Finnish population [4]. Another study by Paassilta et al. in Finland suggested that rs12077871 was not relevant to LDD, but rs61734651 in COL9A3 gene was relevant to LDD [12]. A number of studies have been conducted on this topic. However, the results were generally inconsistent and inconclusive. Therefore, we collected all the relevant studies, including 2102 cases and 2507 controls, to perform a meta-analysis in order to investigate the association between COL9A2 and COL9A3 gene polymorphisms and LDD predisposition.

Methods

Strategy for literature search

The study was conducted by searching literature databases, including PubMed (http://www.ncbi.nlm.nih.gov/pubmed), Web of Science (http://www.isiknowledge.com/), and CNKI (China National Knowledge Infrastructure). To identify all possible studies, we used the following terms: “LDD” or “Lumbar disc disease” or “Lumbar disc degeneration” and “COL9A2” or “COL9A3” or “Collagen IX” or “Polymorphisms” or “COL9A*.” No language or publication date restrictions were applied. The last electronic search was performed on September 1, 2017.

Inclusion and exclusion criteria

The following inclusion criteria were used to search eligible studies: (1) investigated the relationship of COL9A2 or COL9A3 gene polymorphisms with LDD, (2) case-control or cohort design, and (3) provided available data for the estimation of an odds ratio (OR) and 95% confidence interval (CI). Studies were excluded according to the following criteria: (1) repeated publications, (2) reported in the form of comment and review, (3) irrelevant to LDD, and (4) unavailable allele and genotype frequencies. Two investigators (W.H.H and W.S.T) independently evaluated the articles for compliance with the inclusion and exclusion criteria. For disagreements, a consensus was reached by a third investigator (C.W.Y).

Data extraction

The following data were extracted from all the eligible articles: first author’s name, publication year, country of enrollment, ethnicity, numbers of cases and controls, disease, diagnostic criteria, source of controls, genotyping methods, genotypes, and allele frequency of cases and controls. Data were extracted independently by two reviewers (W.H.H and W.S.T), and a third reviewer (C.W.Y) was needed for any disagreements.

Methodological quality

The methodological quality of the included studies was evaluated according to a quality evaluation form base in the Critical Appraisal Skills Programme (CASP) for case-control study [13]. The assessment of CASP contains 10 questions, which are associated with information given by single studies. There are three degrees for each question: “yes” (scored 2), “can’t tell” (scored 1), or “no” (scored 0). The maximum score is 20, and the minimum score is 0. Studies could be divided into three grades: grade A (high quality, scored 15–20), grade B (medium quality, scored 8–14), and grade C (low quality, scored 0–7).

Statistical analysis

Meta-analysis was conducted using Revman 5.31 software (Nordic Cochrane Centre, Cochrane Collaboration, Copenhagen, Denmark). The pooled OR and 95% CI were used to estimate the strength of correlations between COLA9A2 (rs12077871, rs12722877, rs7533552) and COL9A3 (rs61734651) variants and LDD. Heterogeneity was tested using the chi-square-based Q test and I2 test. To calculate the pooled OR, a fix effect model was performed if no heterogeneity existed (P > 0.05, I2 < 50%). Otherwise, a random effect model was used. Five comparison genetic models were conducted to evaluate the association between the four single nucleotide polymorphisms (SNPs) (rs12077871, rs12722877, rs7533552, rs61734651) and LDD risk. Hardy-Weinberg equilibrium among controls was estimated using the HWE version 1.20 program (Columbia University, New York, NY). If there was heterogeneity in some models (P < 0.05, I2 > 50%), we performed the sensitivity test to assess the possible influence of one study on the pooled OR. Studies were removed, in turn, from the overall analysis. In addition, we performed subgroup analysis stratified by ethnicity. Funnel plots and Egger’s tests were used to assess the potential publication bias.

It will be reasonable to estimate the combined effect from a group study if the effects found individually in studies are sufficiently similar. Some variation between the studies is expected because the estimates of the treatment effect are influenced by chance. What we need to know is whether there is more than just a chance-related variation. The heterogeneity test was performed to determine this extreme variation. Therefore, in this study, chi-square statistic was performed together with the degree of freedom. During the meta-analysis, the results were evaluated by incorporating suspicious relevance studies into statistical analyses. Then, these studies, the appropriateness of which were questionable, were excluded from the study and the same analyses were repeated. After comparing the two results, the data for the appropriate ones were interpreted.

Results

Characteristics of studies

As shown in Fig. 1, 182 potentially relevant studies were searched from the electronic database. Ten studies were identified by screening the full article, which included 2102 cases and 2507 controls (rs12077871 6 studies, 1086 cases, and 1210 controls; rs12722877 4 studies, 1030 cases, and 1235 controls; rs7533552 5 studies, 1172 cases, and 1287 controls; rs61734651 4 studies, 365 cases, and 631 controls). Of those, seven articles reported that gene polymorphism in two or more loci were associated with LDD risk. Table 1 and Table 2 show the main characteristics of included studies. The results of quality assessment are also shown in Table 1. All included studies were categorized as grade A, with scores ranging from 18 to 20.

Fig. 1
figure 1

Literature search flow diagram

Table 1 Characteristics of the case-control studies included in systematic review
Table 2 Genotype and allele frequency of COL9A2 and COL9A3 gene polymorphisms in LDD patients and controls

Quantitative data analysis

Association of rs12077871 and LDD susceptibility

The association between rs12077871 polymorphism and LDD predisposition was determined in six case-control studies [4, 14,15,16,17,18], including 1086 cases and 1210 controls. As shown in Table 3, we evaluated the association between rs12077871 polymorphism and LDD predisposition under five genetic models (T vs. C: OR = 1.85, 95% CI = 0.87–3.91, P = 0.11). Furthermore, we performed the subgroup analysis stratified by ethnicity. The result showed rs12077871 was not associated with LDD risk in the Asian population. The subgroup analysis of the Caucasian population was unavailable as there were insufficient studies. The forest plot of the allele contrast genetic model demonstrated the association between rs12077871 polymorphism and LDD susceptibility (Fig. 2).

Table 3 Association test and heterogeneity test of COL9A2 and COL9A3 gene polymorphisms (rs12077871, rs12727871, rs7533553, and rs61734657)
Fig. 2
figure 2

Forest plot of the pooled ORs with 95% CIs for associations between COL9A2 rs12077871 and LDD in overall populations under the allelic contrast model (C vs. T allele); events: the number of C allele

Association of rs12722877 and LDD susceptibility

The association between rs12722877 polymorphism and LDD predisposition was determined in four case-control studies [12, 17,18,19], including 1030 cases and 1215 controls. As shown in Table 3, we used five genetic models to access the relationship between rs12722877 polymorphism and LDD predisposition (G vs. C: OR = 0.83, 95% CI = 0.69–1.01, P = 0.06). Moreover, we performed the subgroup analysis stratified by ethnicity. For the Asian population, rs12722877 polymorphism was associated with LDD predisposition under the allele contrast genetic model (OR = 0.81, 95% CI = 0.67–0.99, P = 0.04). However, a significant association was not found in other genetic models. The subgroup analysis of the Caucasian population was unavailable due to insufficient studies. The forest plot of the allele contrast genetic model indicated the association between rs12722877 polymorphism and LDD predisposition (Fig. 3).

Fig. 3
figure 3

Forest plot of the pooled ORs with 95% CIs for associations between COL9A2 rs12722877 and LDD in overall populations under the allelic contrast model (C vs. G allele); events: the number of C allele

Association of rs7533552 and LDD susceptibility

The association between rs7533552 polymorphism and LDD predisposition was determined in five case-control studies [4, 14, 17, 18, 20], including 957 cases and 1054 controls. As shown in Table 3, we implemented five genetic models to access the association between rs7533552 polymorphism and LDD predisposition (G vs. A: OR = 1.11, 95% CI = 0.98–1.25, P = 0.09). In addition, we performed the subgroup analysis stratified by ethnicity. The data showed rs7533552 polymorphism was not associated with LDD risk in the Asian population. The forest plot of the allele contrast genetic model demonstrated the association between rs7533552 polymorphism and LDD predisposition (Fig. 4).

Fig. 4
figure 4

Forest plot of the pooled ORs with 95% CIs for associations between COL9A2 rs7533552 and LDD in overall populations under the allelic contrast model (A vs. G allele); events: the number of A allele

Association of rs61734651 and LDD susceptibility

The association between rs61734651 polymorphism and LDD predisposition was determined in four case-control studies [12, 15, 16, 21], including 365 cases and 631 controls. As shown in Table 3, we evaluated the association between rs61734651 polymorphism and LDD predisposition under five genetic models (T vs. C: OR = 1.57, 95% CI = 0.51–4.84, P = 0.43). Furthermore, we performed the subgroup analysis stratified by ethnicity. The result showed rs61734651 polymorphism was not associated with LDD susceptibility in the Caucasian population. The subgroup analysis of the Asian population was unavailable due to insufficient studies. The forest plot of the allele contrast genetic model indicated the association between rs61734651 polymorphism and LDD predisposition (Fig. 5).

Fig. 5
figure 5

Forest plot of the pooled ORs with 95% CIs for associations between COL9A3 rs61734651 and LDD in overall populations under the allelic contrast model (T vs. C allele); events: the number of C allele

Sensitivity analysis and publication bias

Sensitivity analysis was performed by excluding one study at a time. For the four SNPs, the results did not alter under all genetic models after sensitivity analysis (Table 4). For rs12077871, when we omitted the study reported by Rathod et al. [16], the heterogeneity was obviously reduced under allele contrast genetic models. For rs61734651, when we excluded the study reported by Paassilta et al. [12], the heterogeneity was significantly reduced under the allele contrast genetic model (Table 5). Sensitivity analysis indicated that our results were robust and consistent.

Table 4 Heterogeneity analysis and Egger regression analysis of COL9A2 and COL9A3 gene polymorphisms (rs12077871, rs12727871, rs7533553, and rs61734657)
Table 5 The result of sensitivity analysis with each study omitted for rs12077871 in COL9A2 and rs61734651 in COL9A3

Publication bias was appraised by applying Begg’s funnel plots and Egger’s regression test (Fig. 6). The result indicated no significant publication bias under all genetic models (all P > 0.05 for all models tested).

Fig. 6
figure 6

Funnel plot analysis for publication bias; COL9A2 gene polymorphism (rs7533552) under the allelic contrast model

Discussion

Various risk factors were widely considered to be related to symptomatic LDD, including environmental, ergonomic, and biometric factors. Nowadays, increasing evidence indicates that genetic factors play critical roles in LDD [22]. Plenty of recent studies show associations of COL9A2 gene (rs12077871, rs12722877, rs7533552) and COL9A3 gene (rs61734651) polymorphisms with the incidence of LDD. The study reported by Annunen et al. initially suggested that COL9A2 gene polymorphism was associated with LDD in the Finnish population [4]. Some studies were undertaken to replicate this finding [9, 14, 16]. However, the other studies did not come to the same conclusion [23]. Under the circumstances, a meta-analysis conducted by Zhang et al. investigated the association between COL9A2 gene polymorphism and LDD risk in 2014 [24]. The results indicated no significant association between COL9A2 gene polymorphisms and LDD predisposition. A number of studies have also examined the association between COL9A2 gene polymorphism and LDD in recent years [19, 20]. However, the previous meta-analysis [24] did not include data from the recent studies, which may lead to inaccuracy in the conclusion. The Cochrane Back Review Group has advocated that a comprehensive meta-analysis needs to be updated with data from the latest studies to be timely [25]. Furthermore, COL9A2 and COL9A3 genes respectively encode α2 and α3 chains of collagen IX, indicating a close relationship between COL9A2 and COL9A3. Several studies have reported that COL9A3 gene polymorphism was associated with susceptibility to LDD [10, 26,27,28]. However, these published studies have yielded contradictory results rather than conclusive evidence [15, 17, 23]. Thus, we performed a meta-analysis on the associations between COL9A2 and COL9A3 gene polymorphisms and LDD susceptibility. To the best of our knowledge, the current study is the largest sample size of meta-analysis to investigate the association between COL9A2 and COL9A3 gene polymorphisms and LDD predisposition.

Several authors reported that collagen IX provided optimal stability to the lumbar disc cross-linked with collagen II [27] and indicated that collagen IX is crucial for the functional lifespan of intervertebral discs [29]. A role for collagen IX in disc degeneration is supported by human and animal studies [2, 12]. Furthermore, the COL9A2 and COL9A3 genes are highly expressed in intervertebral discs and encode the α2 and α3 chains of type IX collagen, which suggests that they are critical for intervertebral disc metabolism [30]. Mutations of COL9A2 and COL9A3 genes could interfere with the bond between collagen IX and collagen II, leading to decreased stability of the lumbar disc [31]. Thus, it remains plausible that COL9A2 and COL9A3 may be involved in the etiology of LDD through the intervertebral disc metabolism. COL9A2 and COL9A3 gene polymorphisms are supposed to have an impact on gene regulation. However, the precise role of these SNPs is still unknown. Functional analysis of the COL9A2 and COL9A3 genes might help elucidate the real genetic effect on the etiopathogenesis of LDD.

Our meta-analysis of 10 studies, involving 2102 LDD cases and 2507 controls, found no statistically significant association between COL9A2 gene (rs12077871, rs12722877, rs7533552) and COL9A3 gene (rs61734651) polymorphisms and LDD risk. The previous studies had reported that COL9A2 and COL9A3 gene polymorphisms were related to LDD predisposition in different ethnic population groups. Therefore, we performed a subgroup analysis stratified by ethnicity (Caucasian and Asian). Our results showed rs12722877 was associated with LDD risk in the Asian population under the allele contrast genetic model (C vs. G), but not under other genetic models. This finding was in partial accordance with a previous meta-analysis study [23]. We should note that heterogeneity existed in our study in interpreting the results of our meta-analysis. For rs12077871 and rs61734651 polymorphisms, significant heterogeneity was found in all genetic models except the dominant model; for rs12722877 polymorphism, heterogeneity was detected in dominant model models, while for rs7533552 polymorphism, heterogeneity was detected in codominant and recessive models. For rs12077871 polymorphism, the heterogeneity detected in the four genetic models was effectively decreased in sensitivity analysis after excluding the study by Rathod et al. For rs61734651 polymorphism, sensitivity analysis suggested that the study of Paassilta et al. was the major source of the heterogeneity. The removal of these datasets did not change the overall results of any genetic models. There are some potential explanations for the presence of heterogeneity, including genetic background, study design, and environment factors. Furthermore, heterogeneity may result from the different phenotype selection and diagnostic criteria of LDD [32]. In view of the heterogeneity, the results of the meta-analysis should be interpreted with caution. A more powerful conclusion needs to be supported by future studies with larger sample sizes.

In our systematic review, increasing the number of studies, which were examined, was possible by extending our search criteria. However, we think that this situation may give rise to further confusion among the results and it may prevent making binding inferences. Several limitations of this study should be acknowledged. First, only English and Chinese documents were searched, while reports in other languages were excluded. This may lead to publication bias. Second, the limited sample size of the pooled studies may exert an influence on their statistical power. Third, we did not perform stratification analysis by age, gender, and environmental factors as a data limitation.

Conclusions

COL9A2 gene (rs12077871, rs12722877, rs7533552) and COL9A3 gene (rs61734651) polymorphisms were not associated with susceptibility to LDD. The associations of COL9A2 and COL9A3 gene polymorphisms and the risk of LDD could not be fully excluded. Large-scale and well-designed studies are needed to further analyze this field.