Introduction

In recent decades, the number of described mutations of the SERPINA1 gene associated with alpha1 antitrypsin deficiency (AATD) has increased considerably. Beyond the two most frequent mutations, the S mutation (c.863A > T) and the Z mutation (c.1096G > A) [1], the number of described variants has risen to more than 500 [2]. Usually, the identification of these rare mutations is initiated by a discrepancy between the serum AAT level and the mutation found by direct genotyping [3]. For this reason, rare mutations are usually considered to be pathogenetic. However, most have been described in individual cases without thorough examination of their pathogenetic capacity [4]. Moreover, the frequency of these rare mutations in a large population of AATD patients has not been consistently described. It is clear that the mutation should be identified in AATD cases with significant pulmonary or hepatic involvement [5]. Knowing the frequency of these rare mutations could help in the management of the disease and in prioritizing allele identification in routine practice. This could also highlight the gaps in our understanding of the pathophysiologic behavior of these mutations.

Recently, our group published the results of a new system for AATD diagnosis based on buccal swabs and dried blood spots samples. After analyzing more than 30,000 samples from six countries, the study showed this diagnostic procedure was feasible and suitable for the genetic diagnosis of AATD [1, 3]. The implementation of this AATD diagnostic network has revealed that there are 14 mutations that can explain the majority of the pathological cases of this disease. Using the data from this study, the present analysis describes the frequencies of rare alleles and relate them to the available data on their respiratory and hepatic pathogenicity. These results will help understand the epidemiological importance of the mutations in each geographic area and will highlight the research needed for a more complete understanding of the pathogenetic potential of these mutations.

Methods

This is a secondary analysis of the data from a study evaluating the Progenika diagnostic system (Progenika Biopharma, Derio, Vizcaya, Spain) in 30,827 samples from patients with suspected AATD from six different countries. This diagnostic network found mutations in 9,528 (30.9%) of the samples. The methodology has been previously described [1]. Briefly, this was an observational, cross-sectional analysis analyzing the anonymized data included on the Progenika web platform (https://grifolsalpha1test.com/) from March 12, 2018, to January 10, 2022. The collection kits for sampling with the dried blood spots or buccal swabs were provided to participating centers free of charge by Grifols (Barcelona, Spain) upon request from the treating physicians. For the current analysis, samples from Argentina, Brazil, Chile, Colombia, Spain, and Turkey were analyzed. The samples were registered on the web platform through a unique code associated with each sample collection kit and sent by post to the reference laboratory at the Progenika headquarters.

When registering the sample on the website, clinicians were asked to include some clinical data about the patient including age, smoking status (smoker, former smoker or never smoker), serum AAT level, and forced expiratory volume in one second (FEV1, expressed as a percentage of its predicted value), and the reasons for requesting the test. Although inclusion of these data was not mandatory, the AAT level was considered for concordance with the genotype and, if not concordant, the SERPINA1 gene was sequenced. Per the Spanish guidelines [6], AAT levels ≤ 50 mg/dl were considered a severe deficiency.

Allele-specific genotyping was carried out with the Progenika A1AT Genotyping Test. The test uses polymerase chain reaction amplification to obtain large amounts of target sequences from the SERPINA1 gene. The Luminex® 200 system to detect previously labeled amplified fragments, as previously described [3]. The test and OCR100 buccal swabs used to collect the samples are CE marked (European Conformity) and United States Food and Drug Administration approved. The test is intended for use with genomic DNA extracted from human whole blood samples collected as dried blood spots or from human buccal swab samples using ORAcollect Dx OCD-100.

The test can identify the 14 most frequent deficiency variants of the SERPINA1 gene, namely PI*S, PI*Z, PI*I, PI*Mprocida, PI*Mmalton, PI*Siiyama, PI*Q0granite falls, PI*Q0west, PI*Q0bellingham, PI*F, PI*Plowell, PI*Q0mattawa, PI*Q0clayton, and PI*Mheerlen. When none of the 14 alleles was found, the result was noted as negative and interpreted as an M allele. The absence of any of these 14 alleles suggests with over 99% probability that the genotype corresponds to PI*M, unless there was a discrepancy with AAT levels. In those cases, gene sequencing was conducted.

For the current analysis, only cases with rare mutations identified by the Progenika diagnostic system were included. Accordingly, those cases with genotypes exclusively resulting from a combination of S or Z alleles (MS, MZ, SS, SZ and ZZ) were excluded from this analysis. Newly identified mutations not previously described were also excluded. After the identification of all rare alleles, we performed a non-systematic review of the literature looking for information on these rare alleles by searching for the name of the allele in PubMed.

Results

The number of patients with rare variants was 818 (2.7% out of 30,827 samples; 8.6% out of 9,528 carrying any mutation). The flowchart of the distribution of the samples is available from a previous analysis [1]. The number of patients carrying rare alleles is listed by country in Table 1. Severe AAT deficiency was seen in 572 patients (9.8% of those with serum AAT values). All cases were heterozygous except for the following (n = 20): 1 homozygous PI*Mprocida (n = 1), homozygous PI*Mmalton (n = 13), homozygous PI*Mheerlen (n = 1), homozygous PI*Plowell (n = 4), and homozygous PI*Q0mattawa (n = 1). Of the 14 mutations included in the Progenika panel, no cases of PI*Siiyama, PI*Q0granite falls and PI*Q0west were found. Other alleles not included in the initial 14-mutation panel were identified by gene sequencing. They included PI*Mwürzburg, PI*Zbristol, and PI*Zwrexham, and the null alleles PI*Q0porto, PI*Q0madrid, PI*Q0brescia, and PI*Q0kayseri.The frequency of rare and null alleles in the different countries are summarized in the Table 2.

Table 1 Rare and null alleles with their clinical characteristics by order of frequency
Table 2 Frequency of rare and null alleles in the different countries

The frequency of the different M-like rare alleles is shown in Fig. 1. The most frequent M-like allele was PI*Mmalton followed by PI*Mheerlen. Although these alleles were identified predominantly in the samples from Spain, some combinations (PI*Mmalton, PI*Mheerlen or PI*Mprocida) were found in samples from other countries. After Spain, Brazil had the most of these rare mutations.

Fig. 1
figure 1

M-like alleles distribution

The frequency of other rare alleles is shown in Fig. 2. PI*I was the most common and was predominantly found in samples from Spain. The allele PI*F was also frequently identified. Other alleles were less frequent, but some were identified in Turkey, e.g., combinations with Plowell.

Fig. 2
figure 2

Other rare alleles distribution

The null alleles are summarized in Fig. 3. The most frequent null allele was Q0mattawa. These alleles were less frequent, and homozygous combinations were extremely rare. The Q0kayseri mutation is native to Turkey, but the only homozygous case for Q0brescia was also found in a sample from that country. Information on these mutations from a non-systematic literature review is summarized in Table 3.

Fig. 3
figure 3

Null alleles distribution

Table 3 Summary of the main characteristics of the mutations found

Discussion

The present study assessed the frequency of rare mutations in a large sample of cases with suspected AATD in six countries. Our results show the low frequency of these alleles and their distribution in different countries and help identify which variants are more frequent in different geographical areas. Our data indicate that these so-called rare variants may not be as rare when a thorough diagnostic system is used.

AATD is an inherited disorder that increases the risk of lung and liver disease. Numerous point mutations of the SERPINA1 gene have been identified so far, although many of them are not associated with an increased risk for developing respiratory or liver disorders [2]. Consequently, the identification of less frequent, but consequential mutations and their characterization are relevant objectives for the management of AATD. Greater understanding of the underlying biologic pathways leading to cell damage in AATD will also be of benefit for the treatment of AATD [7]. This is of special importance in the current pandemic situation with potential associations between AATD and COVID19 [8, 9]. The Progenika diagnostic network is formed by those countries using the Progenika system as the diagnostic standard for AATD. Other countries have started to use a similar system including Italy [10] and Germany [11].

The main strengths of our study are the large number of samples analyzed, the simultaneous determination of several genotypes and the sequencing of samples from different countries, allowing the assessment of the geographic distribution of these mutations. However, there are some limitations that must be taken into account when interpreting our results. This is not a population-based study, but a highly selected population of patients with suspected AATD. Accordingly, the prevalence figures may overestimate the prevalence of AATD in the general population. Another note of caution should be considered in the cases with hepatopathy of unknown cause. The clinicians participating in this circuit were mostly pulmonologists or general practitioners. Therefore, cases with hepatopathy of unknown cause may be under-represented. The addition of liver disease specialists to the evaluation of these patients might contribute to the detection of cases of AATD in this clinical context. Additionally, not all samples were sequenced, only those with a discrepancy between the serum level of AAT and the mutation found. There was a considerable number of cases with no AAT level available. Therefore, there may be an underestimation of some alleles. Finally, serum AAT and FEV1 reported in Table 1 are influenced by the other accompanying allele in heterozygosis. Consequently, these data may lead to a false picture of the impact of these alleles on AAT levels or the resulting functional impairment. Interestingly, the majority of cases with AAT values available presented as non-severe AATD, suggesting that these alleles cannot be ruled out by the level of serum AAT alone.

Despite these limitations, this is the largest study to date that includes analysis of the frequency of rare variants in a sample of patients with suspected AATD. The frequency of these rare alleles has been previously reported in several individual countries including Germany [12], Italy [13, 14], Tunisia [15], Switzerland [16], Spain [17], Poland [18], Turkey [19] and the USA [20]. In these studies, the frequency of rare alleles ranged from 0.5% of all screened patients in Germany [12] to 4.1% in Tunisia [15] corresponding to 1.7% of cases with any mutation in Germany [12] and 20% in Tunisia [15]. In Turkey, our data showed a higher frequency of rare alleles, in line with recently published data from this country [19] within the Progenika network.

The information obtained from our literature review should be interpreted with caution since some mutations have low case numbers, and their effects may be influenced by an accompanying mutation. Additionally, some mutations have been assigned more than one name. There were two major allele complexes that are worth noting. The Mmalton complex includes the Mmalton (c.227_229delTCT on M2 variant), Mpalermo (same mutation on M1V variant) and Mnichinan (same mutation with an additional mutation c.514G > A that does not seem to have a deleterious effect on its own). The Plowell complex includes Plowell (c.839A > T on M3 variant) and Pduarte, (same mutation on M1 variant; also known as Q0cardiff). The Plowell mutation is also seen in Ybarcelona which results from the combination of Plowell and Yorzinuovi in the same gene [21].

PI*I and PI*F were first alleles described in 1967 [22]. PI*I allele has been associated with moderate AATD with serum concentrations similar to those observed with the S allele [23]. PI*II homozygotes usually have AAT levels around 50 mgr/dL [24, 25]. Liver involvement is not usually seen with PI*I unless it is accompanied by an allele associated with liver involvement [26]. The serum concentration and function associated with the PI*F allele are at least 80% of that of the M allele [27, 28]. However, the PI*F allele shows a decreased ability to bind and less time-dependent inhibition of human neutrophil elastase compared to the M phenotype and similar inhibition to that of the Z phenotype [29]. The PI*F allele has a reduced functional ability to inhibit neutrophil elastase but not proteinase 3 [30], suggesting that inheritance of the F variant may increase a person's susceptibility to elastase-induced lung damage, but not necessarily to emphysema. Due to normal hepatic secretion, it does not produce intrahepatic accumulation and therefore, does not increase the risk of liver injury.

According to our results, M-like alleles are the most frequent in patients with suspected AATD. PI*Mmalton complex (PI*Mmalton, PI*Mpalermo and PI*Mnichinan) have a similar behavior. PI*Mmalton was first described in 1975 in a 2-year child with a minor infection [31]. PI*Mnichinan was first described in 1990 in a Japanese individual with severe AATD (18 mg/dl), associated with aggregated AAT molecules in the hepatocytes [32]. Finally, PI*Mpalermo was first described in 1994 [33]. Their presence is associated with serum AAT levels below 15%. These mutations are characterized by conformational abnormalities that result in polymerized/aggregated insoluble forms of AAT that accumulate in the endoplasmic reticulum of hepatocytes. Therefore, all three variants meet the requirements for endoplasmic reticulum storage diseases and conformational diseases [34, 35]. Interestingly, the c.514G > A additional mutation of the PI*Mnichinan does not contribute to AATD [32].

PI*Mwürzburg was first described in 1999 on a M1Val basis [36], and the same mutation was identified one year after as PI*Mvall d’hebron but on a M1Ala basis [37]. These defective alleles produce a change in the amino acid sequence at position 369 which is associated with a complete intracellular transport block in cell. Interestingly, the allele PI*Mheerlen has a different amino acid substitution in the same position which is also shown to cause complete retention of the mutant protein in the hepatocytes.

PI*Mheerlen was first described in 1981 [38]. Homozygous cases have serum AAT levels 2% of normal and very low antitrypsin activity. The tertiary structure of the Mheerlen protein is significantly altered resulting in intracellular proteolysis. Therefore, there is no accumulation of Mheerlen protein in hepatocytes [39].

PI*Mprocida was first described in 1988 [40]. This rare allele encoding AAT synthesis is associated with reduced serum AAT levels (below 10 mg/dl). The Mprocida molecule behaves normally in vivo with a half-life similar to normal M1 AAT. Neutrophil elastase inhibitory activity of Mprocida protein is slightly reduced. Evaluation of the crystallographic structure suggests that the mutation may alter alpha-helix A, suggesting that the molecule is unstable and degrades intracellularly prior to secretion. The tertiary structure of the protein is significantly altered resulting in intracellular proteolysis and, therefore, not associated with risk of liver injury. The risk of lung disease is high, but the risk of liver disease is low [40].

Although P-type mutations have been known since 1968 [41], it was not until 1990 that the PI*Plowell genotype began to be characterized [42]. In 1993, a new P-allele was identified as Pduarte which carried the same mutation as Plowell but on a M4 basis [43]. These alleles have similar behavior. Homozygous Plowell exhibits decreased AAT serum concentration—around 40% of normality [44]. However, Plowell has near normal function as an inhibitor of human neutrophil elastase [45]. Therefore, increased risk for lung involvement depends on the accompanying alleles [41]. The Plowell substitution has a profound effect on intracellular processing of the AAT molecule resulting in deficiency. This variant has been associated with increased intracellular degradation of newly synthetized protein and to serum levels 24% of normal [42]. Therefore, the risk for liver disease is low. PI*Pduarte is similar to Plowell but on M4. AAT levels in Pduarte are 41% of normal, similar to Plowell [43]. Thus, the Pduarte allele differs from the Plowell allele only by the normal allelic background in which the mutation occurs.

Ybarcelona was first described in 1998 as the combination of PI*Plowell + another mutation (c.1244C > A) [21]. In 2012, the mutation c.1244C > A was reported to have a pathogenetic effect by itself, i.e., a case with mild hyper-transaminasemia reported in Orzinuovi (Brescia, Italy). The allele was named as PI*Yorzinuovi [46]. Consequently, Ybarcelona results from a combination of PI*Plowell plus PI*Yorzinuovi. In heterozygous cases, the risk of lung disease is likely to be similar to that of MZ heterozygotes [47].

Zbristol was first reported in 1997 in a woman with an obstetric history of three perinatal deaths from fulminant liver disease and no living offspring [48]. Only a few cases have been reported in children with low levels of AAT if accompanied by a Z allele and near to normal if accompanied by an M allele, with frequent liver involvement in children [49, 50]. The Zwrexhan allele has only been described in a family with severe AATD which also carried the common mutation causing Z deficiency [51]. Individuals with such a deficiency are, therefore, compound heterozygotes. The behavior of these particular mutations in the absence of the Z mutation is not known.

Null (Q0) alleles encode a truncated protein with large conformational changes that is degraded intracellularly without having the opportunity to aggregate. These patients have undetectable serum concentrations of AAT. The protein is retained in the rough endoplasmic reticulum or pre-Golgi compartment and is degraded. This means that homozygotes are at very high risk for emphysema, but not liver disease.

In conclusion, the present report informs on the frequency of rare and null alleles updating their distribution in a large sample population from six countries. The Progenika diagnostic network has allowed the identification of several rare alleles providing a new view of the distribution of these alleles in different countries. Due to the efficacy in both the detection of AATD cases and the identification of new variants, in the future we believe that Progenika's system could continue to expand to other countries. Consequently, future studies should focus on the characterization of these and other new mutations as they emerge in the context of patients with suspected AATD. These findings may help prioritize allele selection for routine testing and highlights the need for continuing research into their pathogenetic roles.