Introduction

Cystinuria (OMIM: 220,100) is a genetic disorder caused by defects in the dibasic amino acid transporters, which results in elevated urinary cystine along with dibasic amino acids such as arginine, ornithine, and lysine [1]. Cystinuria has traditionally been classified into three categories based on the level of urinary amino acid excretion. Following identification of the underlying genetic cause, the disorder is now labeled type A and B for variants in SLC3A1 (OMIM: 104,614) and SLC7A9 (OMIM: 604,144), respectively [2].

The b0,+ system is composed of rBAT and b0,+AT coded by SLC3A1 and SLC7A9, respectively, which form a heterodimer with a disulfide bond [3, 4]. While the b0,+AT subunit transports the amino acids, the rBAT subunit plays a role in trafficking and maturation of the complex [4,5,6]. The inheritance pattern of cystinuria-related variants also shows a distinct pattern. Variants in SLC3A1 are inherited in an autosomal recessive order, whereas those in SLC7A9 follow an autosomal dominant with incomplete penetrance pattern. This paradigm seems obvious considering the function of proteins encoded by each gene per se.

While a number of variants have been discovered as a cause of cystinuria, interpretation of variants in SLC3A1 and SLC7A9 is still challenging. First, the inheritance pattern complicates the interpretation. In the autosomal dominant with incomplete penetrance pattern, it is difficult to determine the pathogenicity when there are two variants of uncertain significance in one gene as either or both variants could contribute to the disease. Second, genetic heterogeneity of cystinuria is another concern. When there are variants of question in both SLC3A1 and SLC7A9, it is not straightforward to ascertain which variant is causative. Moreover, there are cystinuria patients without any pathogenic variants in SLC3A1 and SLC7A9. As it is currently suspected there could be other genes associated with cystinuria [7], relating a novel variant in SLC3A1 and/or SLC7A9 with cystinuria is challenging.

In this study, we report ten cystinuria cases, which include two novel variants. We believe the current study will serve as evidence in interpreting SLC3A1 and SLC7A9 variants. Furthermore, while protein stability is an attribute not employed in the current guidelines for variant interpretation, we have evaluated protein stability change caused by variants to unveil its potential utility in determining the pathogenicity of a given variant.

Methods

Study population

As the aim of this study was to investigate genetic variants in cystinuria patients, those with SLC3A1 and SLC7A9 sequencing orders were included. Additional clinical information of the enrolled cystinuria patients were obtained, which included age of onset, familial history of urolithiasis, clinical manifestation, intervention, chemical composition of the urinary stone, urine amino acid levels, and identified genetic variants of SLC3A1 and SLC7A9 genes. This study was approved by the Institutional Review Board (IRB) of Samsung Medical Center (IRB No. 2022–05-064), and the need for written informed consent was wived due to the anonymous and retrospective nature of this study.

Laboratory analyses

Urine amino acid levels were quantitatively measured with liquid chromatography tandem mass spectrometry (LC–MS/MS) to reveal changes in urinary dibasic amino acid levels including cystine. After urologic intervention of each patient, urinary stone analysis was performed to examine the chemical composition through Fourier transform infrared spectroscopy (FT-IR) using the FT-IR system 2000 (PerkinElmer, Wallac Oy, Turku, Finland) and Spectrum software (PerkinElmer) described in a previous publication [8]. For genetic analysis, DNA was extracted from whole blood using a Roche MagNA Pure 96 DNA isolation kit (Roche Applied Science, Manheim, Germany). The SLC3A1 and SLC7A9 gene sequences were obtained with polymerase chain reaction (PCR) and full sequencing using an ABI Prism 3730XL DNA sequencer (Applied Biosystems, Foster City, CA, USA). The in-house designed primers are available upon reasonable request. Nucleotides were numbered according to the transcript sequences of SLC3A1 (NM_000341.3) and SLC7A9 (NM_014270.4).

Evaluation of SLC7A9 variants with known functional changes

As there was one previous study demonstrating the functional change in SLC7A9 variants experimentally [9], we evaluated the feasibility of using the pathogenicity score of REVEL [10] and predicted stability change (ΔΔGStability) calculated with DynaMut2 [11].

Variant interpretation

The identified variants were interpreted applying the ACGS Best Practice Guidelines for Variant Classification 2019 [12], which is based on the 2015 ACMG/AMP guidelines [13]. In addition, variants with one very strong criterion along with one supporting criterion were regarded as likely pathogenic according to the ClinGen Sequence Variant Interpretation (SVI) Recommendation for PM2 [14]. The PM3 criterion was applied following the guidance of ClinGen SVI recommendation, which allows for applying different strength levels based on the phasing of two variants and classification of the variant other than the variant of interest [15]. The PM1_Supporting criterion was assigned if a variant was predicted to affect protein function according to Martell et al. (2017) [16]. For investigation of previous literature of a certain variant, HGMD Professional (2022.1) [17] and Mastermind [18] were utilized. To analyze the population frequency, gnomAD v2.1.1 [19] and KRGDB_1722 [20] were used. For in silico prediction, REVEL [10] and SpliceAI [21] were utilized for single nucleotide variants and splice site variants, respectively. Predicted stability change (ΔΔGStability) was calculated with DynaMut2 [11]. For novel missense variants, the structure of the resulting protein was illustrated using Missense3D [22].

Using DynaMut2 to predict protein instability

DynaMut2 is an in silico tool designed to predict a change in protein stability upon missense variants [11]. The authors of the DynaMut2 claim its usefulness in predicting the role of variants in disease [11]. The predicted stability change (ΔΔGStability) values were obtained with DynaMut2 where negative values indicate a destabilizing effect and positive values indicate a stabilizing effect. The greater the absolute value of ΔΔGStability, the greater the effect. The ΔΔGStability values were classified into seven categories: highly destabilizing (ΔΔGStability < −1.84 kcal/mol), destabilizing (−1.84 kcal/mol < ΔΔGStability < −0.92 kcal/mol), slightly destabilizing (−0.92 kcal/mol < ΔΔGStability < −0.46 kcal/mol), neutral (−0.46 kcal/mol < ΔΔGStability < 0.46 kcal/mol), slightly stabilizing (0.46 kcal/mol < ΔΔGStability < 0.92 kcal/mol), stabilizing (0.92 kcal/mol < ΔΔGStability < 1.84 kcal/mol), and highly stabilizing (ΔΔGStability > 1.84 kcal/mol). The PDB accession number 6LID [23] was used as the reference structure of the b0,+AT-rBAT complex. Since the b0,+AT-rBAT complex is a dimer of heterodimers consisting of b0,+AT and rBAT, the ΔΔGStability both when one dimer was affected and when both dimers were affected were calculated.

Results

Clinical characteristics

The clinical characteristics of the cystinuria patients are summarized in Table 1. While all patients had multiple events of urolithiasis, only two patients had a familial history of urolithiasis. The identified variants, stone component, and urine amino acid levels are listed in Table 2. Stones from all patients consisted of 100% cystine. Increased urinary cystine and dibasic amino acid levels were noted in all patients that were tested.

Table 1 Clinical characteristics of cystinuria patients
Table 2 Identified variants, urinary stone composition, and urine amino acid levels of each patient

Evaluation of REVEL and DynaMut2 in predicting the degree of functional change

The REVEL score was able to distinguish variants with mild effect and severe effect in transport activity as all variants with a mild decrease in protein function had a REVEL score less than 0.6, while other variants with severe loss of function all had a REVEL score greater than 0.8. Among the evaluated variants, the p.(Gly105Arg) variant exhibited a significantly decreased protein expression by 10% of wild-type in transfected cells [9], which was also the only variant predicted to be highly destabilizing according to the DynaMut2 results with a ΔΔGStability of −2.93 kcal/mol when both SLC7A9 light chains of the dimer were composed of proteins with the p.(Gly105Arg) variant (Table 3). Variants other than p.(Gly105Arg) did not have a drastic effect on ΔΔGStability, with the maximum effect being −1.65 kcal/mol by p.(Ala70Val) indicating a destabilizing effect.

Table 3 Predicted effects of protein change by multiple algorithms for variants with known functional change

Interpretation and classification of the identified variants

A total of 13 different variants were identified with seven different SLC3A1 variants and six different SLC7A9 variants including one novel SLC3A1 variant and one novel SLC7A9 variant not previously reported. Among the SLC3A1 variants were three pathogenic variants, three likely pathogenic variants, and one variant of uncertain significance. Among the SLC7A9 variants were one pathogenic variant, two likely pathogenic variants, and three variants of uncertain significance. While all variants identified in our study had an extremely low minor allele frequency in the gnomAD v2.1.1, NM_014270.4(SLC7A9):c.829G > A had a MAF of 0.6407% in KRGDB_1722. This variant also had the highest MAF in the gnomAD v2.1.1. among the variants identified in this study. In silico prediction of missense variants with REVEL resulted in (1) a score ≥ 0.8 for three SLC3A1 variants and two SLC7A9 variants, (2) 0.6 ≤ score < 0.8 for three SLC7A9 variants, and (3) a score < 0.6 for one SLC3A1 variant. SpliceAI predicted donor loss (score 1.00 at −1 bp) and gain (score 0.67 at 44 bp) in the c.1500 + 1G > A variant. All other variants identified in our patients were predicted to have no splicing effect according to SpliceAI. DynaMut2 predicted that all SLC3A1 missense variants will cause a highly destabilizing effect when both chains of the dimer are affected and the p.(Tyr135Cys) variant of SLC7A9 will cause a highly destabilizing effect even with one of the dimers affected. There were two novel variants identified: (1) NM_000341.3(SLC3A1):c.223C > T and (2) NM_014270.4(SLC7A9):c.404A > G. A novel pathogenic variant was identified in SLC3A1, whereas a novel variant of uncertain significance was identified from SLC7A9. Using Missense3D, it was predicted that the novel p.(Tyr135Cys) variant of SLC7A9, located at the end of the transmembrane helix, does not change the secondary structure of the protein (Fig. 1). In addition, the p.(Trp440Arg) variant of SLC3A1 was interpreted as a likely pathogenic variant with the support of a case included in our study, which elevates the evidence level of the PM3 criterion. Refer to Table 4 for the list of variants, their interpretation, and related information; refer to Table 5 for details regarding the PM3 criterion. For further evaluation of the effects of compound heterozygote variants, the protein stability change affected by two variants was also predicted, which demonstrated that c.418G > A(;)1976A > C and c.1976A > C(;)2017 T > C would have highly destabilizing effects (Table 6).

Fig. 1
figure 1

Structures of the A wild-type and B p.(Tyr135Cys) mutant SLC7A9 protein (the cyan color indicates the wild-type tyrosine residue and the red color indicates the mutant cysteine residue)

Table 4 Interpretation of the variants identified in SLC3A1 and SLC7A9
Table 5 Application of the PM3 criterion for variants identified in SLC3A1
Table 6 Predicted protein stability change in cases with compound heterozygous variants assumed to be in trans

Discussion

Since the discovery of the underlying genetic defects in cystinuria, a number of variants in SLC3A1 and SLC7A9 have been reported to cause cystinuria. However, interpretation of cystinuria-related variants is still challenging for several reasons. Molecular variants in the b0,+ transport system are known to cause cystinuria by various functional defects including trafficking, protein folding, protein expression, and amino acid transport [24]. While the current guidelines for variant interpretation embrace the decrease in protein expression by assigning evidence for null variants with consideration of the nonsense-mediated decay mechanism, the degradation of a translated protein due to instability is still underappreciated.

In this study, we reviewed the SLC3A1 and SLC7A9 variants identified from cystinuria patients of our institute, which revealed novel variants not reported to date as well as reclassified a previously reported variant of uncertain significance as a likely pathogenic variant. In addition, the results of our evaluation suggest the utility of REVEL in predicting the functional change of the transporter system. However, while REVEL demonstrated different scores for SLC7A9 variants with mild functional defect (score < 0.6) and severe functional loss (score ≥ 0.8), we were unable to establish a definitive cutoff for REVEL due to limited functional studies with no variants with the degree of functional change elucidated falling into the grey zone (0.6 ≤ score < 0.8). In addition, there were no available functional studies for the SLC3A1 variants. The protein stability change estimated with DynaMut2 correlated well with the decrease in protein expression level resulting from the p.(Gly105Arg) variant of SLC7A9. Among the variants of uncertain significance identified in our study were two variants (SLC3A1 p.(Gln659Pro) and SLC7A9 p.(Tyr135Cys)) predicted by DynaMut2 to result in a highly destabilizing protein. We suspect that these variants favor pathogenicity despite insufficient evidence according to the current guidelines. Although DynaMut2 correlated well with the significantly reduced protein expression of SLC7A9 p.(Gly105Arg), interpretation of the predicted protein instability should be done cautiously since most protein stability prediction tools including DynaMut2 tend to have a bias toward destabilization [25]. As none of the patients with two different variants had a familial history of urolithiasis, the most likely scenario is that both variants in a patient predispose to cystinuria and each of the variants was inherited from a different parent.

Although in silico tools serve as a powerful resource in predicting the pathogenicity of a variant, there are times when the evidence is not sufficient to classify a variant as pathogenic or likely pathogenic despite the variant being highly suspicious as the cause of cystinuria considering other information and the criteria assigned. While some variants are still classified as a variant of uncertain significance despite being highly suspected for pathogenicity, this report adds to the literature and will hopefully be utilized as further evidence for assigning any pathogenic criteria and aid further reports of these variants. Criteria such as PS4 and PM3 could be applied with different weight depending on the number of reported cases.

The diversity of cystinuria-related variants has been reported to be different depending on the ethnic group. While the variant spectrum is unknown regarding the Korean population, it is of note that the variants identified in this study showed a different pattern from a previous Korean publication that included seven cystinuria patients with genetic studies conducted [1].

There are several limitations to our study. First, three patients in our study lacked SLC7A9 sequencing results as the previous workflow of our institute was to order SLC7A9 sequencing if no variants were identified from SLC3A1. However, two patients carried homozygous likely pathogenic variants in SLC3A1, and one patient carried a pathogenic variant and a variant of uncertain significance in SLC3A1. As the variant of uncertain significance was deficient of one supporting evidence from being classified as a likely pathogenic variant, we suggest that cystinuria in all three cases could be explained with the variants identified in SLC3A1. Second, genetic tests of family members were not carried out. As the designation of phase in patients harboring two different mutations would allow applying stronger evidence in the PM3 criterion, the lack of familial data limits the variant interpretation. Moreover, family members carrying one of the two variants identified in the proband would provide a more sophisticated genotype–phenotype correlation. Third, urinary amino acid levels were measured during the course of treatment. Since the measured concentrations are from different stages of the disease, the data could not be used for genotype–phenotype correlation of the disease.

In summary, we report ten cystinuria cases and the interpretation of variants identified in SLC3A1 and SLC7A9, which included a novel variant in each gene. Our study implies the significance of reporting variants and literature review in determining the pathogenicity of a variant. Moreover, we suggest the potential role of protein stability in predicting loss of function caused by a decrease in protein expression. Hence, this study would benefit future variant interpretation by serving as a list of clinical cases as well as suggesting the approach of utilizing protein instability.