Genotype–phenotype relationship and comparison between eastern and western patients with osteogenesis imperfecta

Purpose To evaluate the genotypic and phenotypic relationship in a large cohort of OI patients and to compare the differences between eastern and western OI cohorts. Methods A total of 671 OI patients were included. Pathogenic mutations were identified, phenotypic information was collected, and relationships between genotypes and phenotypes were analyzed. Literature about western OI cohorts was searched, and differences were compared between eastern and western OI cohorts. Results A total of 560 OI patients were identified as carrying OI pathogenic mutations, and the positive detection rate of disease-causing gene mutations was 83.5%. Mutations in 15 OI candidate genes were identified, with COL1A1 (n = 308, 55%) and COL1A2 (n = 164, 29%) being the most common mutations, and SERPINF1 and WNT1 being the most common biallelic variants. Of the 414 probands, 48.8, 16.9, 29.2 and 5.1% had OI types I, III, IV and V, respectively. Peripheral fracture was the most common phenotype (96.6%), and femurs (34.7%) were most commonly affected. Vertebral compression fracture was observed in 43.5% of OI patients. Biallelic or COL1A2 mutation led to more bone deformities and poorer mobility than COL1A1 mutation (all P < 0.05). Glycine substitution of COL1A1 or COL1A2 or biallelic variants led to more severe phenotypes than haploinsufficiency of collagen type I α chains, which induced the mildest phenotypes. Although the gene mutation spectrum varied among countries, the fracture incidence was similar between eastern and western OI cohorts. Conclusion The findings are valuable for accurate diagnosis and treatment of OI, mechanism exploration and prognosis judgment. Genetic profiles of OI may vary among races, but the mechanism needs to be explored. Supplementary Information The online version contains supplementary material available at 10.1007/s40618-023-02123-2.


Introduction
Osteogenesis imperfecta (OI) is a phenotypically and genotypically heterogeneous connective tissue disorder, with an incidence of one in 15 000-20 000 births [1].OI is a fairly common genetic bone disorder characterized by low bone mineral density (BMD), recurrent fractures, progressive bone deformity and extraskeletal manifestations, such as blue sclerae, dentinogenesis imperfecta, hearing loss and joint laxity [2].The manifestations of OI are diverse, and the several different forms of OI are associated with considerable morbidity and mortality [3].
OI is not rare in China because of the large population.Annually, approximately 500-700 Chinese children are born with OI. [7].Although previous studies have described the relationship of phenotypes and genotypes in OI patients, an in-depth study regarding genotypic and skeletal phenotypic relationships in a large cohort of Asian OI patients has not yet been reported [8][9][10][11].Additionally, it remains unclear whether differences in phenotypes and genotypes exist between Eastern and Western OI patients.Therefore, we aimed to investigate the genotypes and phenotypes of a large cohort of OI, to explore their relationship, and to compare differences in genotypes and phenotypes between Eastern and Western OI cohorts.

Subjects
Patients were enrolled at the Endocrinology Department of Peking Union Medical College Hospital (PUMCH) from January 2007 to May 2022.Eligible patients were clinically suspected of having OI on the basis of a history of fracture under minor force, low BMD, with or without blue sclera, dentinogenesis imperfecta or a familial history of OI or fracture.
This study was approved by the Scientific Ethics Committee of PUMCH (JS-3545D).Informed consent was obtained from the patients or their legal guardians before participation in the study.

Phenotypic evaluation
Clinical data for the patients were collected from medical records, including birth situation, age at diagnosis, age of initial fracture, fracture frequency and site, bone deformities (limb bending, thoracic deformity, and pelvic deformity), and extraskeletal manifestations.Body height and weight were measured using a Harpenden stadiometer (Seritex Inc., East Rutherford, NJ, USA).For patients unable to stand, body length was measured in the supine position.The height of all OI patients was converted to standard deviation score (SDS) using standardized growth charts for Chinese children and adolescents [12].OI was classified into five subtypes according to the clinical severity: mild OI (type I), perinatally lethal OI (type II), progressive deforming OI (type III), intermediate OI (type IV) and OI with hypertrophic callus (type V) [5].
BMDs at the lumbar spine (LS) and proximal hip were measured by dual-energy X-ray absorptiometry (DXA, Lunar Prodigy Advance, GE Healthcare, USA), and appropriate pediatric software was used for measurement of BMD in children.A quality control program was conducted throughout the study, and phantom testing was completed daily using the DXA device for calibration and quality checks.Obviously compressed or deformed vertebrae were excluded from BMD analysis.LS and FN BMD results of children and adolescents were converted to age-and sexspecific Z scores according to normal reference of BMD in Asian children [13,14].

Bone fracture and scoliosis evaluation
Clinical fractures were reported by the patients or their legal guardians and confirmed by X-ray films, including nonvertebral fractures and symptomatic vertebral fractures.Vertebral compression fracture (VCF) was semiquantitatively assessed as normal (grade 0), mildly deformed (grade 1), moderately deformed (grade 2), and severely deformed (grade 3) according to Genant's classification [15].Scoliosis was determined by anteroposterior radiography and defined as a Cobb angle > 10 degree [16].X-ray film results were interpreted by radiologists at PUMCH.

Genotypic analysis
Total genomic DNA was isolated from peripheral blood using a DNA Extraction Mini Kit (QIAamp DNA, Qiagen, Frankfurt, Germany).Clinically diagnosed OI patients underwent panel sequencing (Illumina HiSeq2000 platform, Illumina Inc., San Diego, CA, USA) using a previously described protocol [17].The next-generation sequencing (NGS) panel covers more than 700 candidate genes of disorders related to bone, including 20 known candidate genes for OI (COL1A1, COL1A2, IFITM5, SERPINF1, CRTAP, P3H1, PPIB, SERPINH1, FKBP10, PLOD2, BMP1, SP7, TMEM38B, WNT1, CREB3L1, SPARC , MBTPS2, P4HB, SEC24D and PLS3).The overall sequencing coverage of the target regions was more than 95% at a minimum sequencing depth of 20 × Bioinformatics processing and data analysis were performed.The "clean reads" derived from targeted sequencing and filtering were aligned to the human genome reference (hg19) using the BWA (Burrows Wheeler Aligner) Multi-Vision software package.All SNVs and indels were filtered and estimated with multiple databases (NCBI dbSNP, HapMap, 1000 human genome dataset and database of 100 Chinese healthy adults).The deleterious effects of missense variants on the corresponding proteins were predicted by silico tools (MutationTaster, PolyPhen-2, SIFT and PhyloP).Variants were classified according to the 2015 American College of Medical Genetics and Genomics/ Association for Molecular Pathology (ACMG/AMP) guidelines [18].
Pathogenic variants identified by NGS were confirmed by Sanger sequencing.Targeted primers were designed, PCR was performed, and the amplicons generated using the primers designed were sequenced with a 3730 DNA analyzer (Applied Biosystems, Foster City, CA, USA).Segregation analysis was performed if DNA was available from the family members.COL1A1 and COL1A2 variants were categorized based on the effects of gene mutation on type I collagen synthesis as glycine substitution, nonglycine substitution, haploinsufficiency or splicing mutation [19].Compound heterozygous or homozygous mutation patterns were classified as biallelic variants [20].

Comparison between eastern and western OI cohorts
We searched for previously published large cohorts of OI from Western and Eastern countries in PubMed, Embase, and Medline databases using "osteogenesis imperfecta", "cohort", "children with bone fracture", and "children with osteoporosis".We collected data on age at OI diagnosis, family history, height, fracture frequency, BMDs, classification of OI and gene mutations in these cohorts of OI, and differences between Eastern and Western OI cohorts were compared.

Statistical analysis
Continuous variables were tested for normal distribution using the Kolmogorov-Smirnov test.Normally distributed data (BMD Z-scores) are presented as the mean ± standard deviation (SD).Differences in BMD Z-scores between groups were compared with one-way ANOVA.Nonnormally distributed data are expressed as medians (range), including age, age at first fracture, height Z-score, frequency of peripheral fracture, and serum PTH and 25OHD levels, and the Mann-Whitney U test was used to compare these parameters between groups.Categorical data are presented as frequencies and percentages (%), Fisher's exact test was utilized to compare these categorical variables (positive family history of fracture, peripheral fracture, VCF, long bone deformity, scoliosis, etc.) between groups.The association between the position of glycine substitution and the frequency of peripheral fracture, LS and FN BMD Z-score, or height Z-score was evaluated using Spearman rank correlation coefficient analysis.
Statistical analyses were performed using SPSS software (version 26.0; SPSS Inc., Chicago, IL, USA).A two-tailed value of P < 0.05 was considered statistically significant.

Phenotypic spectrum
Detailed clinical data were available for 414 OI probands, including 328 children and 86 adults (Table 1).A total of 197 patients (48.3%) had a positive family history of OI.The median age of initial visit and age at first fracture among the OI patients were 10.0 years (range: 0.2-64.0years) and 1.5 years (range: 0.0-59.0years), respectively.The most common clinical manifestation was peripheral fracture (96.6%), and total frequency varied greatly, ranging from 0 to 70 times, with a median time of 4.0.(Table 1).A total of 2887 peripheral fractures were noted in this cohort, with femurs being the most commonly affected sites (34.8%).(Fig. 2a).VCF was also common in this study, as noted in 180 (43.5%)OI patients.A total of 815 VCFs were recorded from C7 to L5, with the most common sites at L1 (10.8%) and T12 (9.8%) (Fig. 2b).Overall, 251 OI patients (60.6%) developed bone deformities, including long bone and ribcage deformities and scoliosis (Table 1).Based on clinical phenotype, 202 (48.8%) patients were classified as OI type I, 70 (16.9%)as OI type III, 121 (29.2%) as OI type IV, and 21 (5.1%) as OI type V (Table 1).Additionally, among 111 patients with no detectable OI mutations, 81 (73.0%) were classified as OI type I, 9 (8.1%) as OI type III and 21 (18.9%) as OI type IV.
For glycine substitution of COL1A1/COL1A2, there were no obvious correlations between the position of the glycine substitution and phenotype, including total frequency of fracture, LS or FN BMD Z-score, and height Z-score.(Supplementary Fig. 2).
Moreover, the methods of genetic testing differed among countries.OI cohorts from Italy and Sweden were identified solely by Sanger sequencing or Sanger sequencing combined multiplex ligation-dependent probe amplification (MLPA) covering COL1A1/COL1A2 genes.OI cohorts from Canada, Turkey and China were identified by NGS panels or WES, which covered currently known candidate genes of OI.The detection rate of disease-causing gene mutation was similar between Turkish (83.6%),Italian (84.9%),Swedish (87.0%), and Chinese (83.5%) cohorts, though it seemed to be higher in Canadian (97.8%) and American (92.7%) cohorts (Table 4).Although the distribution of gene mutations  differed among countries, COL1A1 and COL1A2 were still the dominant mutation genes in all cohorts.Furthermore, we found that SERPINF1 and WNT1 were the most common biallelic pathogenic genes in our cohort (21.2%; 19.2%), which was distinct from studies of Canadian, Turkish and American OI patients.In Canadian and American cohorts, SERPINF1 and CRTAP were the most frequently affected biallelic pathogenic genes; in Turkish cohorts, FKBP10 and P3H1 were the most frequently affected genes.Interestingly, we found a notable increase in the identification of biallelic variants from 2015 to 2022, which is attributed to the widespread use of NGS technology.

Discussion
In this study, genetic mutation and phenotypic profiles were explored in the largest sample of Asian OI patients.The phenotypic spectrum indicated peripheral fracture to be the most common phenotype, especially femoral fracture.VCF was also frequent in OI patients, with the most common sites at the L1 and T12 vertebra.The genotypic spectrum revealed COL1A1 and COL1A2 to be the dominant pathogenic mutations, with c.769G > A (p.G257R) of COL1A1 and c.1009G > A (p.G337S) of COL1A2 as hotspot mutations.We observed a close correlation between the genotype and phenotype of OI patients, with skeletal phenotypes being mildest in patients with haploinsufficiency of collagen type I α chains.These phenotypes were more severe in patients carrying glycine substitution of COL1A1/COL1A2 or biallelic mutation, including lower height Z-score, more long bone and ribcage deformities, poorer mobility, and lower BMD.We compared differences in genotype and phenotype for the first time between Asian and Western OI cohorts and found that the annual incidence of fractures was similar, though the gene mutation spectrum among countries differed.The underlying mechanism of OI is quite complicated, and the fundamental mechanism involves abnormal collagen metabolism induced by multiple gene mutations.There are two general classes of mutations in type I collagen that   result in OI: failure of type I collagen synthesis (haploinsufficiency) and structural abnormalities of collagen molecules (substitution of glycine by another amino acid) [25].The α1 and α2 chains of collagen type I both contain a central triplehelical domain, which is composed of uninterrupted repeats of the Gly-X-Y tripeptide [19].As glycine is the only small residue to be accommodated inside the helix, triple-helix formation can proceed normally only if a glycine residue is present [26].In this large cohort of OI, we not only detected many kinds of pathogenic mutations leading to haploinsufficiency and substitution of glycine but also identified 10 biallelic mutational genes that impair multiple aspects of type I collagen, including translation, posttranslational folding, modification, and assembly.OI is usually overlooked because of misdiagnosis, mild forms in some cases, or remission after puberty [27].In the current study, the diagnosis of OI (mean age: 10.0 years) was much later than the initial occurrence of bone fracture (mean age: 1.5 years), which was consistent with a previous study [28].A delayed diagnosis, untimely intervention and management of disease, would lead to a series of adverse consequences, including increased risk of fractures, decreased quality of life, and potentially respiratory and cardiovascular complications [5].Therefore, it is crucial for doctors to early identify the clinical signs of OI, especially in children with unexplained fractures or a family history of bone fragility disorders [29].
Vitamin D deficiency or insufficiency was common in OI patients [30,31].In this study, we found 77.1% (269/349) of OI patients with deficiency or insufficiency of vitamin D. Serum levels of 25OHD decreased with age, which was consistent with previous studies [30][31][32].The age-related decline in 25OHD levels may be attributable to OI severity, disease progression, and age-related changes in vitamin D metabolism and neglecting treatment [31,33].
Recent studies have indicated that genotypes of OI are closely related to phenotypes, but sample sizes were relatively small, and conclusions were not completely consistent [8,20,23,34].In the present study, we found that haploinsufficiency of collagen type I α chains led to milder skeletal phenotypes than glycine substitution of collagen type I α chains.This finding may explain why patients in this cohort with COL1A2 mutation had more severe skeletal phenotypes than those with the COL1A1 mutation.COL1A1 mutations included 24.2% glycine substitutions and 44.9% haploinsufficiency, whereas the COL1A2 mutations included  73.0% glycine substitutions.We observed that patients with haploinsufficiency of collagen type I α chains had a lower proportion of wheelchair dependence than those with glycine substitution of collagen type I α chains.Haploinsufficiency produces a half amount of type I collagen with normal structure.Accordingly, patients with haploinsufficiency were more likely to have less fracture and milder skeletal deformities, indicating relatively better mobility than those with glycine substitution in type I collagen.There are no previous studies comparing OI patients with different ethnicities.We for the first time found that the distribution of biallelic variants differs among countries.SER-PINF1 and WNT1 were the most common biallelic pathogenic variants in our cohort, which was similar to previous studies of Chinese OI patients [8,11,35].However, this finding was distinct from studies of Western OI patients [20,21,23,36,37].Regarding phenotype of OI, we found that the phenotypes were roughly similar between Eastern and Western OI patients and the annual incidence of fractures did not differ statistically significantly among countries.However, the different age, gender, nutritional status, lifestyle of OI patients, and the sample size of different studies made it difficult to complete accurate phenotypic comparisons between different patients' groups.Moreover, it was difficult to exclude the effects of previous treatments, such as bisphosphonates, on phenotypes.It was worth noting that the comparison between Eastern and Western OI patients was based on limited studies and general observations, more in-depth research is worth conducting.
This study obtained an accurate genetic diagnosis of a large sample of OI patients, which is valuable for revealing pathogenic mechanisms and predicting disease prognosis and drug therapy response.Our previous study reported that OI patients with nonautosomal dominant inheritance or with pathogenic mutations leading to collagen structural defects would have relatively poor responses to zoledronic acid treatment [38], indicating that molecular diagnosis is valuable to carry out precision treatment for OI patients.Moreover, gene and cell therapy are currently the most promising treatment prospects for OI [1], and definitive genetic diagnosis may lay the foundation for future gene or cell therapies for these OI patients [39].Of note, no causative gene mutation was detected in 16.5% of our patients with a clinical diagnosis of OI.Four new OI pathogenic genes (KDELR2, FAM46A, MESD and CCDC134) were not included in the NGS panel used, which would reduce the mutation detection rate.In general, whole-genome sequencing (WGS) may be superior for detection of OI pathogenic mutations because it can not only capture noncoding regions of genes but also cover copy number variants, chromosomal rearrangements, and repeat-rich regions [40,41].In addition, single-cell RNA sequencing (scRNA-seq) is a powerful tool allowing classification, characterization and distinction of each cell at the transcriptome level [42].These technologies should be utilized in our cohort.
In this study, we delved into the pathogenic mutations and phenotypic correlation in the largest Asian cohort of OI patients.Data on the phenotype and genotype of OI patients were detailed.All parameters of this study were detected in a single center, which could minimize measurement bias.However, this study had several limitations.This was a retrospective study, and some data were unavailable, especially for patients who visited PUMCH 20 years ago.Additionally, effects of puberty on height and BMD could not be ruled out.Extraskeletal phenotypes have not been fully evaluated.An incomplete panel not covering all known disease-causing genes would affect the accuracy of molecular diagnosis.
In conclusion, detailed genotypes and phenotypes were obtained from the largest cohort of Asian OI patients, enriching the spectrum of OI.A close correlation between genotype and phenotype of OI patients is demonstrated by our results, which is valuable for prenatal diagnosis, differential diagnosis, elucidating pathogenesis mechanism, and predicting prognosis of OI.Differences in the mutation profile of causative genes were found between Eastern and Western OI patients, but the mechanism deserves in-depth study.

Fig. 1
Fig. 1 Mutation spectrum of this largest Chinese OI cohort.a Mutation spectrum of all OI patients.b Mutation types of COL1A1 and COL1A2 in this cohort.OI osteogenesis imperfecta

Table 1
Demographic and clinical characteristics of patients with OI Normally distributed results are given as the mean ± SD Nonnormally distributed results are given as the median (range) Categorical data are presented as frequencies (%) ALP alkaline phosphatase, ALT glutamic-pyruvic transaminase, Ca calcium, Cr creatinine, FN femoral neck, LS lumbar spine, OI osteogenesis imperfecta, P phosphorus, P1NP procollagen type 1 N-peptide, PTH parathyroid hormone, β-CTX β-C-terminal telopeptide of type 1 collagen, 25OHD 25-hydroxyvitamin D, VCF vertebral compression fracture Bold numbers represent P < 0

Fig. 2 2
Fig. 2 Association of gene mutation with clinical severity of OI. a Association between different mutated genes and the clinical severity of OI. b Association between different collagen changes induced

Fig. 3
Fig. 3 Distribution of fracture sites in OI patients.a Distribution of fracture sites in OI patients with peripheral bone fractures, b Distribution of fracture sites in OI patients with VCFs.OI osteogenesis imperfecta, VCF vertebral compression fracture

Table 3
Comparison of phenotypes of OI patients with different collagen changes in type I α chains and those with biallelic variants Normally distributed results are given as the mean ± SD.Nonnormally distributed results are given as the median (range).Categorical data are presented as frequencies (%) FN femoral neck, LS lumbar spine, OI osteogenesis imperfecta, VCF vertebral compression fracture a P < 0.05 compared to glycine substitution in COL1A1; b P < 0.05 compared to glycine substitution in COL1A2; c P < 0.05 compared to haploinsufficiency Bold numbers represent P < 0.05 Glycine substitution in COL1A1 n = 55 Glycine substitution in COL1A2 n = 81 Haploinsufficiency n = 107 Biallelic variants n = 48 P value Age of initial visit (years) 11

Table 4
Comparison of genotypes and phenotypes between Eastern and Western OI cohorts # The mean/median number of fractures per year FN femoral neck, LS lumbar spine, MLPA multiplex ligation-dependent probe amplification, NA not available, NGS next genome sequencing, OI osteogenesis imperfecta, WES whole exome sequencing