Introduction

Musculoskeletal soft tissues, including tendon, ligament, and muscle, are commonly injured during competitive and recreational activities. It has been estimated that over 100 million musculoskeletal soft tissue injuries (MSTIs) occur annually worldwide [1]. Taking Achilles tendinopathy as an example, the lifetime incidence of this disorder is nearly 10% in the general population and 50% among elite athletes [2, 3]. MSTIs have a negative impact on quality of life. Affected individuals often suffer from discomfort, pain, or incapacity. For athletes, MSTIs may lead to a significant loss of sporting performance and a premature end to their careers. The management of MSTIs is difficult, which imposes a substantial burden on society.

A full understanding of the etiology of MSTIs is of great importance for preventing these injuries. Nevertheless, the etiology of MSTIs is multifactorial and their pathogenesis remains largely undefined. Both genetic and non-genetic risk factors have been reported to predispose an individual to MSTIs [4, 5]. Non-genetic factors, such as physical activity and chronic overuse, may be extrinsic contributors to MSTIs, whereas genetic background may render individuals more susceptible. In recent years, considerable attention has been focused on the genetic basis of MSTIs [6]. Investigators have observed a familial predisposition to MSTIs [7,8,9,10]. In addition, genetics has been reported to be associated with athletic performance and rehabilitation [11, 12]. Evidence supports an association between genetic polymorphisms and susceptibility to MSTIs. These polymorphisms are mainly located within the collagen-encoding genes, tenascin-C gene, thrombospondin-2 gene, fibrillin-2 gene, matrix metalloproteinase (MMP) gene, and growth differentiation factor 5 gene [13, 14]. Of these genes, COL5A1 is the most extensively studied.

The COL5A1 gene encodes the α1 chain of type V collagen. Although type V collagen is present in smaller amounts than other fibrillar collagens, it plays a crucial role in fibril assembly and in the inhibition of lateral fibril growth, leading to fewer collagen I fibrils with increased diameters in tendons and ligaments [15]. Variants within the 3′-untranslated region (3′-UTR) of the COL5A1 gene have been reported to modify the secondary structure of the mRNA and affect transcript stability [16].

Mokone et al. [17] first reported the rs12722 and rs13946 polymorphisms in the COL5A1 gene and their association with Achilles tendon pathology. Thereafter, multiple replication studies were conducted, with conflicting outcomes. A meta-analysis of nine studies encompassing 1140 cases and 1410 controls indicated that the rs12722 polymorphism contributed to tendon-ligament injuries in Caucasians. Since then, more than ten studies have investigated the association between COL5A1 gene polymorphisms and MSTIs. Enlarging the sample sizes of genetic studies and determining their association with MSTIs will allow investigators to estimate which variants predispose to damage of the musculoskeletal system. Therefore, this meta-analysis aimed to collect and summarize the existing evidence to elucidate whether COL5A1 gene polymorphisms are associated with MSTIs.

Methods

Literature search

An exhaustive literature search of the PubMed, Web of Science, EMBASE, Cochrane Library, CNKI, and Wanfang databases was performed to identify studies that reported the association between COL5A1 gene polymorphisms and MSTIs. The search terms included “COL5A1”, “tendon rupture”, “tendon injury”, “ligament injury”, “muscle injury”, “soft tissue injury”, “tennis elbow”, “polymorphism”, “variant”, and “mutant”. The references of eligible articles were also screened for potentially relevant studies. No restriction was set on language or publication date. Literature published in languages other than English or Chinese was translated into English by a native speaker. The final systematic search was conducted in August 2021. If necessary, the corresponding authors of the original articles were contacted for additional information.

Inclusion and exclusion criteria

Eligible studies had to satisfy the following criteria: (1) studies on the association between COL5A1 polymorphisms and MSTIs; (2) cases were confirmed by clinical evaluation and/or other complementary examination; (3) controls were healthy individuals without MSTIs; (4) data were sufficient to estimate the odds ratios (ORs) and 95% confidence intervals (95%CI).

Correspondingly, the exclusion criteria were as follows: (1) studies containing duplicate data; (2) conference abstracts, reviews, editorials, or case reports. If multiple studies reported overlapping data, the one with the largest sample size was selected.

Evaluation of methodological quality

Study quality was assessed independently by two authors (RG and ZJ) according to the Newcastle-Ottawa Scale (NOS) [18], which covers selection (four points), comparability (two points), and exposure (three points). The included studies were graded as poor, fair, or excellent quality based on the following criteria: (1) poor if a study received 0–3 points; (2) fair if it received 4–6 points; (3) excellent if it received 7–9 points. Studies of poor quality were to be excluded from the final analysis. Any discrepancy was settled by consulting a third reviewer.
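The grading rule above can be written down compactly. The following Python sketch is illustrative only: the function name `nos_grade` and its argument names are hypothetical, and the code simply encodes the point ranges stated in the text.

```python
def nos_grade(selection: int, comparability: int, exposure: int) -> str:
    """Grade a study from its three Newcastle-Ottawa Scale sub-scores."""
    # Maximum points per domain as stated in the text: 4, 2, and 3.
    limits = {"selection": 4, "comparability": 2, "exposure": 3}
    scores = {
        "selection": selection,
        "comparability": comparability,
        "exposure": exposure,
    }
    for name, value in scores.items():
        if not 0 <= value <= limits[name]:
            raise ValueError(f"{name} score must be between 0 and {limits[name]}")

    total = sum(scores.values())
    if total <= 3:
        return "poor"       # 0-3 points: excluded from the final analysis
    if total <= 6:
        return "fair"       # 4-6 points
    return "excellent"      # 7-9 points
```

For instance, a study scoring 3, 1, and 2 in the three domains totals 6 points and would be graded as fair.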

Data extraction

Relevant data were extracted from the qualified studies independently by two investigators (RG and ZJ). The extracted data were first author, publication year, country, ethnicity, gender, study design, diagnosis, genotype distribution of each polymorphism in both groups, and the result of the Hardy–Weinberg equilibrium (HWE) test [19]. Any divergence was addressed by consulting a third reviewer.
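The text does not state how the HWE results were obtained from each study. As a hedged illustration, a common approach is a chi-square goodness-of-fit test with one degree of freedom applied to the genotype counts of the control group. The function name `hwe_test` and the counts in the usage note are hypothetical.

```python
import math


def hwe_test(hom_variant: int, het: int, hom_wild: int):
    """Chi-square goodness-of-fit test for Hardy-Weinberg equilibrium.

    Returns (chi2, p_value) for a biallelic locus, using 1 degree of freedom.
    """
    n = hom_variant + het + hom_wild
    if n == 0:
        raise ValueError("at least one genotyped individual is required")

    # Variant allele frequency estimated from the observed genotypes.
    p = (2 * hom_variant + het) / (2 * n)
    q = 1.0 - p

    observed = (hom_variant, het, hom_wild)
    expected = (n * p * p, 2 * n * p * q, n * q * q)

    # Classes with zero expected count (monomorphic locus) are skipped.
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)

    # Survival function of the chi-square distribution with 1 df.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value
```

With hypothetical control counts of 25, 50, and 25, the observed and expected genotype counts coincide, so the statistic is 0 and there is no evidence of departure from HWE.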

Statistical analysis

ORs and 95%CIs were estimated to evaluate the strength of the association. “V” and “v” were taken to represent the mutant allele and the wild-type allele, respectively, so the genotypes could be represented by “VV”, “Vv”, and “vv”. The pooled effect size was calculated for the allele contrast (V vs. v), homozygote contrast (VV vs. vv), heterozygote contrast (Vv vs. vv), dominant contrast (VV + Vv vs. vv), and recessive contrast (VV vs. Vv + vv). Between-study heterogeneity was evaluated using the Q test and the I² statistic. When significant heterogeneity was detected (P < 0.10, I² > 50%), the data were pooled with the random-effects model; otherwise, the fixed-effects model was used. Subgroup analyses were performed based on ethnicity (Caucasian, Asian, mixed) and diagnosis (tendon injury, ligament injury, muscle injury).
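As a minimal sketch of how the five contrasts could be derived from genotype counts, the following Python code builds the 2 × 2 table for each genetic model and computes an OR with a Wald-type 95%CI. The function names, the counts in the usage note, and the 0.5 continuity correction for zero cells are illustrative assumptions and are not taken from the original analysis.

```python
import math


def contrast_tables(case: dict, control: dict) -> dict:
    """Build 2 x 2 tables for the five genetic models.

    `case` and `control` map the genotypes "VV", "Vv", "vv" to counts.
    Each table is (a, b, c, d): a/b are the two sides of the contrast in
    cases, c/d are the same two sides in controls.
    """
    def split(g: dict) -> dict:
        return {
            "allele": (2 * g["VV"] + g["Vv"], 2 * g["vv"] + g["Vv"]),  # V vs. v
            "homozygote": (g["VV"], g["vv"]),                          # VV vs. vv
            "heterozygote": (g["Vv"], g["vv"]),                        # Vv vs. vv
            "dominant": (g["VV"] + g["Vv"], g["vv"]),                  # VV + Vv vs. vv
            "recessive": (g["VV"], g["Vv"] + g["vv"]),                 # VV vs. Vv + vv
        }

    in_cases, in_controls = split(case), split(control)
    return {model: in_cases[model] + in_controls[model] for model in in_cases}


def odds_ratio(a, b, c, d, z: float = 1.96):
    """Return (OR, lower, upper) using a Wald interval on the log scale."""
    if min(a, b, c, d) == 0:
        # Assumption: add 0.5 to every cell when any cell is empty.
        a, b, c, d = (x + 0.5 for x in (a, b, c, d))
    log_or = math.log((a * d) / (b * c))
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    return math.exp(log_or), math.exp(log_or - z * se), math.exp(log_or + z * se)
```

For hypothetical counts of 30/50/20 (VV/Vv/vv) in cases and 20/50/30 in controls, the homozygote contrast gives OR = (30 × 30)/(20 × 20) = 2.25.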

Sensitivity analysis and publication bias

Sensitivity analysis was performed by sequentially omitting one study at a time to judge the influence of each individual dataset on the pooled outcomes. Potential publication bias was assessed with funnel plots. The data were analyzed using RevMan 5.3 software.
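The heterogeneity-driven model choice and the leave-one-out procedure can be sketched as follows. This is an illustration under stated assumptions, not a reproduction of the RevMan calculations: it uses inverse-variance weights on the log OR scale, the DerSimonian-Laird estimator for the random-effects model, and only the I² > 50% criterion to switch models (the Q-test P value is omitted because the Python standard library has no chi-square distribution). The function names are hypothetical.

```python
import math


def pool(log_ors: list, ses: list, i2_cutoff: float = 50.0) -> dict:
    """Pool study-level log ORs; use random effects when I2 exceeds the cutoff."""
    weights = [1.0 / s ** 2 for s in ses]
    fixed = sum(w * y for w, y in zip(weights, log_ors)) / sum(weights)

    # Cochran's Q and the I2 statistic (in percent).
    q = sum(w * (y - fixed) ** 2 for w, y in zip(weights, log_ors))
    df = len(log_ors) - 1
    i2 = max(0.0, (q - df) / q) * 100.0 if q > 0 else 0.0

    model = "fixed"
    if i2 > i2_cutoff:
        # DerSimonian-Laird between-study variance (an assumed estimator).
        c = sum(weights) - sum(w ** 2 for w in weights) / sum(weights)
        tau2 = max(0.0, (q - df) / c)
        weights = [1.0 / (s ** 2 + tau2) for s in ses]
        model = "random"

    est = sum(w * y for w, y in zip(weights, log_ors)) / sum(weights)
    se = math.sqrt(1.0 / sum(weights))
    return {
        "model": model,
        "or": math.exp(est),
        "ci": (math.exp(est - 1.96 * se), math.exp(est + 1.96 * se)),
        "q": q,
        "i2": i2,
    }


def leave_one_out(log_ors: list, ses: list) -> list:
    """Re-pool the data after omitting each study in turn."""
    results = []
    for i in range(len(log_ors)):
        rest_y = log_ors[:i] + log_ors[i + 1:]
        rest_s = ses[:i] + ses[i + 1:]
        results.append(pool(rest_y, rest_s))
    return results
```

The pooled result would be regarded as stable when none of the leave-one-out estimates changes the direction or significance of the overall OR.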

Functional predictions

The HaploReg 4.1 bioinformatics database (https://pubs.broadinstitute.org/mammals/haploreg/Haploreg.php) was used to predict the function and interplay of the COL5A1 polymorphic sites. The STRING online server (https://string-db.org/) was used to examine the gene–gene interaction network of the COL5A1 gene.

Results

Literature identification

The detailed process of literature identification is displayed in Fig. 1. A total of 267 records were obtained from the six databases, and two records were identified via other sources. After the first screen, 119 duplicates were removed. A review of titles and abstracts excluded 109 irrelevant articles. Full-text review of the remaining 41 articles was then completed, and another 20 citations were excluded with reasons. Eventually, 21 articles [17, 20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39] were included in the meta-analysis.

Fig. 1 Flow chart of literature identification

Main characteristics

The basic characteristics of the eligible studies are shown in Table 1. In total, 21 articles [17, 20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39] were included, of which four articles [27, 28, 31, 36] were cohort studies, one article [22] was a cross-sectional study, and the remaining 16 articles were case–control studies. Of the included studies, three [22, 28, 35] were on Asian populations (Chinese, South Korean, Japanese), one [24] was on a mixed population (Brazilian), and the remaining 17 were on European populations. Twenty studies were published in English and one in Korean [35]. The publication years ranged from 2009 to 2021. MSTIs in the original studies were divided into three subgroups: tendon injury (rotator cuff tendinopathy, Achilles tendon pathology, Achilles tendinopathy, Achilles tendon rupture, patellar tendinopathy, elbow tendinopathy), ligament injury (anterior cruciate ligament injury, tennis elbow, rotator cuff tear), and muscle injury. Three studies [26, 37, 38] contained two independent cohorts. Of note, a departure from HWE was observed for rs12722, rs13946, and rs3196378 in some of the studies.

Table 1 Main characteristics of included studies

Quality assessment

The quality of the eligible studies was evaluated using the NOS. Ten studies scored 7–9 points and were considered to be of excellent quality. The remaining eleven studies scored 4–6 points and were of fair quality (Table 2).

Table 2 Quality assessment of included studies

Meta-analyses and subgroup analyses

Table 3 summarizes the outcomes of the overall analyses and of the analyses stratified by ethnicity and injury type.

Table 3 Associations of COL5A1 gene polymorphisms and musculoskeletal soft tissue injuries

Association of rs12722 polymorphism and MSTIs

Eighteen studies [20,21,22, 24,25,26,27,28,29,30,31,32,33,34,35,36, 38, 39] with 21 cohorts reported on the rs12722 polymorphism and vulnerability to MSTIs, encompassing 2164 cases and 5079 controls. Because the heterogeneity was significant, the random-effects model was employed. The combined data suggested that the rs12722 polymorphism was associated with an increased risk of MSTIs under the allelic model (T vs. C, OR = 1.14, 95%CI 1.03–1.28, P = 0.01, Fig. 2), homozygote model (TT vs. CC, OR = 1.33, 95%CI 1.08–1.65, P = 0.008), heterozygote model (TC vs. CC, OR = 1.24, 95%CI 1.03–1.49, P = 0.02), and dominant model (TT + TC vs. CC, OR = 1.28, 95%CI 1.08–1.52, P = 0.005).

Fig. 2 Forest plot of rs12722 polymorphism and musculoskeletal soft tissue injuries (T vs. C)

Stratified analyses by injury type suggested that the rs12722 polymorphism was associated with an increased susceptibility to ligament injury under all five genetic models, whereas no association was found for tendon injury or muscle injury. When stratified by ethnicity, the combined outcomes indicated that the rs12722 polymorphism was significantly associated with MSTIs in Caucasians but not in Asians.

Association of rs13946 polymorphism and MSTIs

Seven studies [17, 20, 22, 27, 29, 33, 35, 38] reported on the rs13946 polymorphism and susceptibility to MSTIs, including 740 cases and 1678 controls. Significant heterogeneity was observed under the heterozygote and recessive models, for which the random-effects model was employed. The merged data did not support any association between rs13946 and MSTIs under any of the five genetic models.

Subgroup analyses by injury type suggested that rs13946 was significantly associated with an elevated susceptibility to tendon injury (TC vs. CC, OR = 3.68, 95%CI 1.94–6.98, P < 0.01; TT + TC vs. CC, OR = 2.28, 95%CI 1.23–4.23, P = 0.009) and ligament injury (T vs. C, OR = 1.19, 95%CI 1.00–1.42, P = 0.05; TT vs. TC + CC, OR = 1.32, 95%CI 1.05–1.65, P = 0.02). Because only one study was conducted in an Asian population, stratified analysis by ethnicity was carried out only in Caucasians. The merged data indicated a null association between the rs13946 polymorphism and MSTIs in Caucasians.

Association of rs11103544 polymorphism and MSTIs

Two studies [24, 38] with 424 cases and 573 controls investigated the association of rs11103544 polymorphism and MSTIs. Substantial heterogeneity was detected under allele model, homozygote model and recessive model, where the random-effects model was used. The pooled data did not support any association between rs11103544 polymorphism and MSTIs.

Association of rs71746744 polymorphism and MSTIs

Two studies [32, 37] with 199 cases and 328 controls investigated the association between the rs71746744 polymorphism and MSTIs. No heterogeneity was found in any of the five genetic models. The pooled data indicated that the rs71746744 polymorphism was associated with an increased risk of MSTIs (I vs. D, OR = 1.50, 95%CI 1.13–1.99, P = 0.005; II vs. DD, OR = 2.04, 95%CI 1.01–4.12, P = 0.05; II vs. ID + DD, OR = 1.72, 95%CI 1.20–2.46, P = 0.003).

Association of rs3196378 polymorphism and MSTIs

Three studies [24, 32, 37] with 456 cases and 962 controls investigated the association between the rs3196378 polymorphism and MSTIs. The pooled data indicated a significant association between the rs3196378 polymorphism and MSTIs (A vs. C, OR = 1.21, 95%CI 1.03–1.42, P = 0.02; AA vs. CC, OR = 1.46, 95%CI 1.05–2.03, P = 0.03; AA + AC vs. CC, OR = 1.45, 95%CI 1.11–1.88, P = 0.006).

Sensitivity analysis and publication bias

After excluding the studies that departed from HWE, the direction of the ORs and 95%CIs did not reverse. With sequential removal of each study, the pooled ORs and 95%CIs of the remaining studies did not change significantly, indicating that the results were stable. The funnel plots did not show obvious asymmetry, suggesting no significant publication bias (Fig. 3).

Fig. 3 Funnel plot of rs12722 polymorphism and musculoskeletal soft tissue injuries (T vs. C)

Functional predictions

The results from HaploReg indicated that rs12722 was in linkage disequilibrium with rs3196378, and rs13946 was in linkage disequilibrium with several other polymorphic sites (Fig. 4). The interactive network of COL5A1 and its partners was presented in Fig. 5. It suggested that COL5A1 might interplay with COL1A1, COL5A2, ADAMTS2, and ADAMTS14.

Fig. 4 HaploReg view of COL5A1 gene polymorphisms: a rs12722; b rs13946; c rs11103544; d rs71746744; e rs3196378

Fig. 5 Network of COL5A1 with its potentially functional partners obtained from the STRING server

Discussion

Knowledge of the pathogenesis of MSTIs may help at-risk individuals reduce their risk of injury. Genetics is considered a non-modifiable contributor to MSTIs. Evidence from candidate gene studies has advanced the understanding of the genetic predisposition to MSTIs. Nogara et al. reported that the rs4986938 polymorphism in the ER-β gene contributed to posterior tibial tendinopathy in the Brazilian population [40]. Diniz-Fernandes et al. found that MMP-1 and MMP-8 gene polymorphisms promoted an increase in and remodeling of collagen III and V in posterior tibial tendinopathy [41]. Artells et al. reported that the rs2289360 variant in the elastin gene is a potential biomarker for ligament injuries in elite soccer [42]. Insulin-like growth factor 2 and elastin gene polymorphisms were reported to be associated with the degree of and recovery time from tendon injuries [43]. In addition, predictive DNA profiling might help athletes make the most of their potential and improve their performance in sports [44]. The COL5A1 gene is of particular interest among the candidate susceptibility genes for MSTIs. However, the role of COL5A1 gene polymorphisms in MSTI susceptibility has remained a subject of debate.

Factors such as diverse recruitment criteria, varied participant characteristics, different sample sizes, and heterogeneous ancestries and genders may account for the inconsistency. Given that meta-analysis is a powerful approach to combine data from independent studies and explain heterogeneity, this study was conducted to obtain a more precise estimate of the association between COL5A1 gene polymorphisms and MSTIs. The overall analyses supported that the rs12722, rs71746744, and rs3196378 polymorphisms were associated with an increased risk of MSTIs, whereas no association was identified for the rs13946 or rs11103544 polymorphism. Of note, for the rs12722 polymorphism, the positive association was significant in Caucasians but not in Asians. A detailed analysis by injury type showed that the rs12722 polymorphism was associated with ligament injury, but not with tendon injury or muscle injury. The rs13946 polymorphism appeared to be associated with tendon injury and ligament injury. It is worth mentioning that the variant T allele of rs12722 is more frequent in Europeans (MAF: 0.60) than in Asians (MAF: 0.20, Fig. 6). Therefore, the inconsistent outcomes between Asians and Caucasians may be attributable to differences in genetic background.

Based on the current findings, future work should focus on the rs12722 and rs13946 polymorphisms. As each individual has a unique genetic profile, genetic screening tools might be designed to identify individuals predisposed to MSTIs, thus enabling the implementation of preventive strategies for them. In turn, taking preventive measures might reduce the incidence of MSTIs and their cost [45]. It should be pointed out, however, that no single genetic polymorphism can determine injury risk on its own. Therefore, multifactorial models should be developed to predict the risk of MSTIs [46].

Collagen is best known as the principal tensile element of connective tissues such as tendons, ligaments, and cartilage [47]. Type V collagen has several isoforms, but the key isoform consists of two α1 chains and one α2 chain, which are encoded by the COL5A1 and COL5A2 genes, respectively [36]. Mutations of the COL5A1 gene have been reported to be associated with Ehlers-Danlos syndrome (EDS), a genetic disorder mainly characterized by irregular collagen fibrils. Individuals with EDS exhibit hyperelasticity and laxity in a variety of tendon-ligament tissues, indicating that the COL5A1 gene is responsible for the adequate function of soft connective tissues [48]. Wenstrup et al. [49] reported that mice heterozygous for a COL5A1 mutation showed severely defective collagen fibril formation and increased fibril diameter, leading to connective tissue dysfunction. Goncalves-Neto et al. [50] observed increased type V collagen and reduced type I collagen in injured tendons. Based on this evidence, it is reasonable that variants in the COL5A1 gene may contribute to MSTIs.

The five studied loci are located in the 3′-UTR of the COL5A1 gene. Although the 3′-UTR is noncoding, mutations within this region may modify the secondary structure of the mRNA and protein features [51]. Indeed, Laguette et al. [16] reported that, for the rs12722 polymorphism, the luciferase activity of the C allele was significantly lower than that of the T allele, and that COL5A1 mRNA stability was increased in individuals with tendinopathic disorder. Collins et al. [15] reported that the rs12722 variant might alter the amount of type V collagen produced, which in turn altered fibril architecture and mechanical properties. Rs3196378 and rs11103544 are located downstream of rs12722 and span miRNA binding sites. Therefore, these two variants potentially have functional significance in MSTIs [38].

To investigate the interaction effects between polymorphic sites, functional predictive analysis was performed. The results from HaploReg indicated that rs12722 was in linkage disequilibrium with rs3196378, and rs13946 was in linkage disequilibrium with several other polymorphic sites (Fig. 4). In addition, interactions of COL5A1 with other genes might play a role in the effects of the present genetic polymorphisms. Functional prediction also suggested that COL5A1 might be involved in gene–gene interactions with COL1A1, COL5A2, ADAMTS2, and ADAMTS14, which have been reported to be associated with MSTIs [20, 52, 53]. Further studies are encouraged to confirm these interactions in more detail.

Of note, Lv et al. [54] published a similar meta-analysis on this topic. Compared with that study, the current meta-analysis has notable improvements. First, the previous study employed a model-free approach to analyze the association between the rs12722 polymorphism and MSTIs. Considering that the inheritance models in MSTIs are complex, this study examined five genetic models to explore the underlying association. Second, recently published evidence was added to this study, which greatly enlarged the number of studies and the sample size and thereby increased the statistical power of the pooled results. Third, subgroup analysis by ethnicity and injury type was conducted, and an ethnicity-specific effect of the rs12722 polymorphism on MSTIs was found. Fourth, no previous pooled analysis had examined the association of the rs13946, rs11103544, rs71746744, and rs3196378 polymorphisms with MSTIs.

However, several potential drawbacks could not be overcome in this study. First, although subgroup analysis was performed, the heterogeneity in some contrasts could not be fully resolved. The heterogeneity might be explained by the diversity of injury types, differences in sequencing methods, variation in ethnic origins, and differences in the selection of participants. Heterogeneity should be considered when interpreting the findings, and future studies should focus on more homogeneous groups of patients. Second, the number of studies on the rs13946, rs11103544, rs71746744, and rs3196378 polymorphisms was small, so the statistical power might not be sufficient to explore the relationship between these four polymorphisms and MSTIs. Third, sources of clinical heterogeneity, such as age, sex, lifestyle, mechanism of injury, physical or occupational activity, and other potential confounding factors, could not be controlled for, which might distort the outcomes. Fourth, because the ethnicity subgroup analyses were restricted to European, Asian, and Brazilian populations, the results are only applicable to these ancestry groups. Fifth, several of the included studies departed from HWE, which could be caused by population stratification, genotyping errors, or selection bias in the recruitment of controls [55]. Last, because the included studies were observational, the level of evidence presented in this meta-analysis is relatively low.

Conclusions

Taken together, the current meta-analysis supports that rs12722 is associated with an elevated susceptibility to ligament injury, especially in the Caucasian population. The rs13946 polymorphism appears to increase the risk of tendon and ligament injuries. The rs71746744 and rs3196378 polymorphisms tend to confer an elevated risk of MSTIs. However, no association was found between the rs11103544 polymorphism and MSTIs. Given the limitations of this meta-analysis, these findings should be verified in larger, well-designed prospective studies.