Frequency of a natural truncated allele of MdMLO19 in the germplasm of Malus domestica

Podosphaera leucotricha is the causal agent of powdery mildew (PM) in apple. To reduce the amount of fungicides required to control this pathogen, the development of resistant apple cultivars should become a priority. Resistance to PM was achieved in various crops by knocking out specific members of the MLO gene family that are responsible for PM susceptibility (S-genes). In apple, the knockdown of MdMLO19 resulted in PM resistance. However, since gene silencing technologies such as RNAi are perceived unfavorably in Europe, a different approach that exploits this type of resistance is needed. This work evaluates the presence of non-functional naturally occurring alleles of MdMLO19 in apple germplasm. The screening of the re-sequencing data of 63 apple individuals led to the identification of 627 single nucleotide polymorphisms (SNPs) in five MLO genes (MdMLO5, MdMLO7, MdMLO11, MdMLO18, and MdMLO19), 127 of which were located in exons. The T-1201 insertion of a single nucleotide in MdMLO19 caused the formation of an early stop codon, resulting in a truncated protein lacking 185 amino acids, including the calmodulin-binding domain. The presence of the insertion was evaluated in 115 individuals. It was heterozygous in 64 and homozygous in 25. Twelve of the 25 individuals carrying the insertion in homozygosity were susceptible to PM. After barley, pea, cucumber, and tomato, apple would be the fifth species for which a natural non-functional mlo allele has been found.


Introduction
Powdery mildew (PM) is a relevant disease of apple that, in the absence of chemical control, can reduce yield up to 50% (Yoder 2000). The disease is caused by the obligate biotroph fungus Podosphaera leucotricha, and it occurs in all major apple-growing regions of the world (Turechek et al. 2004). Leaves are the most susceptible organ, particularly during the first days after opening, but blossom infections, although less common, are extremely severe because they result in small and stunted fruits, or in no fruit at all (Turechek et al. 2004). resistance exists, which can be achieved by the knockout or knockdown of specific member(s) of the MLO gene family, as previously shown by Pavan et al. (2010), Wang et al. (2014), and Pessina et al. (2016aPessina et al. ( , 2016b. The MLO gene family comprises a variable number of members, grouped in seven clades (Acevedo-Garcia et al. 2014;Pessina et al. 2014). MLO genes for PM susceptibility (MLO S-genes) belong to clade IV, which contains monocot S-genes (Panstruga 2005;Reinstädler et al. 2010;Wang et al. 2014), and clade V, which contains dicot S-genes (Consonni et al. 2006;Bai et al. 2008;Feechan et al. 2008;Winterhagen et al. 2008). Loss of function in MLO S-genes leads to PM resistance as demonstrated in barley (Jørgensen 1992), Arabidopsis thaliana (Consonni et al. 2006), tomato (Bai et al. 2008), pea (Pavan et al. 2011), wheat (Wang et al. 2014), and cucumber (Berg et al. 2015). It is possible to identify MLO S-genes through gene expression analysis: at early stages of PM infection, specific MLO S-genes have their expression increased. This was documented in barley (Piffanelli et al. 2002), tomato (Bai et al. 2008), grape (Feechan et al. 2008;Winterhagen et al. 2008), pepper (Zheng et al. 2013), and apple (Pessina et al. 2014). Of the four MLO apple genes of clade V, MdMLO11 and MdMLO19 are upregulated during PM infection, whereas MdMLO5 and MdMLO7 are not (Pessina et al. 2014). MdMLO18, a gene of clade VII, is also responsive to PM infection. Among these PM-inducible apple genes, only MdMLO19 can be considered an S-gene because its knockdown reduced PM infection up to 75%, whereas the knockdown of MdMLO11 did not support any reduction of PM infection. A role of MdMLO18 does not seem likely on the basis of the result of the complementation of resistance test carried out in A. thaliana (Pessina et al. 2016a).
Gene silencing technologies, such as RNAi, are currently not accepted by the large majority of the European public (Einsele 2007); accordingly, the EU has the strictest regulation in the world on GMOs (Davison 2010). Therefore, we searched for non-functional alleles of the apple MLO S-gene MdMLO19, using the natural genetic diversity of apple to develop PM-resistant varieties. The diversity of other four apple MLO genes (MdMLO5, 7, 11, and 18) was also studied because they are either members of clade V (MdMLO5 and 7), are upregulated upon PM infection (MdMLO18), or both (MdMLO11). In apple, the FruitBreedomics project (http://www.fruitbreedomics.com) opened interesting possibilities making available 63 re-sequenced Malus domestica individuals representing the genetic diversity present in the apple germplasm (Bianco et al. 2016). Here, we report on the screening of the 63 re-sequenced genomes, searching for non-functional alleles of five MLO genes, i.e., the four members of clade V and MdMLO18. Among them, MdMLO19 is the main gene of interest, but since recent evidences suggested that also MLO genes that do not show higher transcription levels after PM inoculation may have a role in PM pathogenesis (Pessina et al. 2016b), MdMLO5 and 7 were also considered. Furthermore, the evidences of the lack of a role for MdMLO18 in PM pathogenesis are not final, so it was included in the present study as well. We focused on the mutations located in the exons for simplicity, as their effects can be predicted more easily compared to mutations locating in introns, promoter, and terminator. A non-functional natural allele of MdMLO19 was found and the link to PM resistance investigated by the genotyping and phenotyping of cultivars, breeding selections and wild species. The possibility of using this allele to introgress durable resistance in apple varieties is discussed as well.

Materials and methods
FruitBreedomics re-sequencing data analysis The genomic regions hosting genes MdMLO5, MdMLO7, MdMLO11, MdMLO18, and MdMLO19 (Pessina et al. 2014) were screened in 63 individuals for which re-sequencing data were available from the design of a 20K and a 480K single nucleotide polymorphism (SNP) array (Bianco et al. 2014(Bianco et al. , 2016. Only the open reading frames (ORF) of the five genes were considered, whereas the sequences of the promoters and terminators were not screened. For these 63 individuals, SNPs were retrieved from the variant calling format (.vcf) file used for the development of the 480K array (Bianco et al. 2016). A custom bioinformatic script was then written to retrieve all polymorphic sites of just the five genes. Data were stored in a tab-separated value file (.tsv) for further processing. The retrieved SNPs were divided in two groups, depending on if they were located in the exons or in the introns, and only those located in the exons were considered for further analyses. SNP-based nucleotide sequences were deduced, as well as gene-encoded amino acids (aa) sequences, using E M B O S S t r a n s e q ( h t t p : / / w w w . e b i . a c . uk/Tools/st/emboss_transeq/). Mutations were grouped in seven categories: silent substitutions (no aa changes), conservative substitutions (aa substituted with one of similar chemical and sterical properties), semiconservative substitutions (substitution with an aa with similar sterical properties), non-conservative substitutions (substitution with an aa with different properties), insertions (insertion of one or more aa), deletions (removal of one or more aa), and nonsense mutations (formation of an early stop codon).

Selection of individuals
In order to study the frequency of the mutations found in the FruitBreedomics dataset and their possible association with PM resistance/susceptibility, 115 individuals from three locations were selected. Phenotypic data from different sources were available for 100 of the individuals considered. Since phenotypic data were the result of different assessment methods, they were analyzed independently.

Fondazione Edmund Mach
Two groups of individuals were collected from the orchard of Fondazione Edmund Mach (Italy). The 60 individuals of the first group were collected because their level of resistance was known from the data provided by Mr. Ted L. Swensen (Table S1): very resistant, resistant, susceptible, and very susceptible (Table S2-FEM). The second group of 35 individuals included 10 accessions of wild Malus species and 25 cultivars that are commonly used in breeding, commercially relevant or selected because their level of PM resistance/ susceptibility was known from direct observation carried out during the years by the breeders of FEM (Table S2-FEM2).

Wädenswil
An orchard including 628 apple accessions each represented by 2 tree individuals, located at Agroscope in Wädenswil (Switzerland), was evaluated yearly for 4 years after being left completely untreated with fungicides. PM symptoms were scored every spring using a scale from 1 to 9 (1, complete absence of symptoms; 9, tree completely affected). Eleven individuals were selected among those with the lowest standard deviation between replicates and years (Table S2-Wädenswil).

FruitBreedomics
The FruitBreedomics project provided the DNA and the phenotypic information of 10 individuals. Five of them were susceptible to PM, whereas the phenotype of the other five was unknown (Table S2-FruitBreedomics). These latter five were included with the purpose of validating the FruitBreedomics re-sequencing dataset, as they were among the 63 cultivars constituting the said dataset.

DNA extraction
Leaf samples were ground in liquid nitrogen and DNA was extracted with illustra Nucleon PhytoPure Kit (GE Healthcare, Buckinghamshire, UK). Resulting DNA was quantified with NanoDrop (Thermo Fisher Scientific, Waltham, USA).

Genotyping by Sanger sequencing
To validate the presence of the insertion of a T at position 1201 in MdMLO19, and to genotype a larger set of individuals, a 186-bp region was amplified (Fw, 5′-GCATCTTGTCCTCGTATGTAGAATG-3′; Rv, 5′-CGACATCTTCCAACTTCTCATGG-3′) with GoTaq Green (Promega, Fitchburg, USA) and sequenced twice from both ends (Table S2). Sequences were aligned using the Staden package software (Staden 1996).
Sanger sequencing can be easily used to detect homozygous mutations. Conversely, heterozygous mutations are not as obvious. The sequencing electropherogram was expected to show two overlapping peaks in the site of the mutation, one consisting in the wild-type sequence and one in the mutated sequence. However, overlapping peaks might also be the result of sequencing artifact/errors. To rule out this possibility, the 186-bp fragment from the heterozygous cultivar Durello di Forlì was cloned into the gateway vector pENTR/SD-TOPO (Thermo Fisher Scientific, Waltham, USA) and inserted into Escherichia coli, which was plated on a selective media. Eight colonies were picked, the plasmids extracted with QIAprep Spin Miniprep kit (Qiagen, Venlo, the Netherland) and sequenced using Sanger technology.

Canonical correspondence analysis
Canonical correspondence analysis (CCA) as embedded in the PAST software v. 2.17c (Hammer et al. 2001) was performed to determine the relative importance of resistance levels in the spatial organization of genetic diversity among individuals. This analysis, designed to relate species composition to different predictive variables (Ter Braak 1986), has been successfully used to describe relationships between environmental or phenotypical variables and genetic composition (Angers et al. 1999;Dell'Acqua et al. 2014;Zoratti et al. 2015). The analysis was based on a disease levels/genotype matrix. Sanger sequencing was used to assess the genotype of the individuals regarding insertion T-1201.

Presence of SNPs in the target MLO genes
The screening of the re-sequencing data returned 678 SNPs in the ORF of five MLO genes (Table S3), i.e., the four members of clade V and MdMLO18. One hundred twenty-seven of the SNPs were located in exons (Table S4). The MLO gene with the highest number of SNPs located in exons was MdMLO19 with 48 SNPs; the gene with the lowest number was MdMLO5 with 6 ( Table S4).
Sixty-one out of the 127 exon-located SNPs caused silent mutations, and another 30 and 9 caused conservative and semi-conservative substitutions, respectively (conservative: substitution of an aa with one of similar chemical properties; semi-conservative: substitution of an aa with one of similar steric conformation). Twentytwo mutations were non-conservative (Table 1) plus two insertions, two deletions, and a nonsense mutation. One insertion was located at the very end of MdMLO7, in position 1676-1680, causing a frameshift that changed the last three amino acids of the protein. The other insertion, T-1201, was located in MdMLO19 and caused a frameshift of one nucleotide with the formation of an early stop codon (Table 1). The resulting protein would be 405 amino acids long, instead of 590, and would lack both the trans-membrane (TMD-7) and calmodulinbinding domains at the C-terminal (Fig. 1). According to re-sequencing FruitBreedomics data, insertion T-1201 was present in 12 of the 63 genotyped individuals. In six of them, it was homozygous (BBusiard,^BPatte de Loup,^BMcIntosh,^BPepino Jaune,^BYoung America,^and BKronprins^), in the other six heterozygous (BMela Rozza,^BPriscilla,^BAbbondanza,B Jonathan,^BAlfred Jolibois,^and BFilippa^). One of the two deletions, G-1181, was remarkable: it was found in MdMLO19, where it would cause the formation of an early stop codon. However, this G-1181 deletion was present only in BPepino Jaune^, where insertion T-1201 was also present in homozygosity. The combination of deletion G-1181 and insertion T-1201 would cause the substitution of five amino acids, but no early stop codon. Since BPepino Jaune^is homozygous for insertion T-1201 and heterozygous for deletion G-1181, only one of its alleles actually carries insertion T-1201 alone. For this reason, BPepino Jaune^was included in the genotypes heterozygous for the insertion. The other nonsense mutation found in MdMLO19 was substitution G-1176-A, which caused the substitution of a tryptophan with an early stop codon. This SNP was found in BAjmi^.
Insertion T-1201 and nonsense mutation G-1176-A, both located in MdMLO19, were selected for further analysis.
Validation of the presence of insertion T-1201 in the MdMLO19 gene Sanger sequencing of a fragment of MdMLO19 in 16 individuals included in the FruitBreedomics resequencing dataset showed that 13 of them had insertion T-1201 (Fig. 2). For eight individuals, the electropherograms showed an overlapping of the peaks for A and T in position 1201, suggesting that the insertion was heterozygous ( Fig. 2). To rule out the possibility that these overlapping peaks were the result of sequencing artifact/ errors, an additional validation was carried out: a fragment of MdMLO19 from the heterozygous individual BDurello di Forlì^was cloned in a plasmid and sequenced. Of the eight E. coli colonies sequenced, four carried insertion T-1201, whereas the other four did not carry it (Fig. S2), indicating that Sanger sequencing is adequate to distinguish heterozygous mutations from homozygous ones.
Sanger sequencing also confirmed the presence of the G-1181 deletion in BPepino Jaune^, supporting the FruitBreedomics data (Fig. 2), whereas the sequencing of BAjmi^did not confirm the presence of the nonsense mutation in this individual. No further analysis were carried out on substitution G-1176-A.
The results obtained by Sanger sequencing for insertion T-1201 were compared to those of the FruitBreedomics re-sequencing dataset and found to be conflicting in seven cases (Table S5). The results of Sanger sequencing have been used for further steps of the work.

Pedigree of apple individuals
Parentages were known for 79 of the 115 individuals considered (Table S2). Two inconsistencies were noted for the Sanger data: BTelamon^did not show the insertion, but one of its parents (BMcIntosh^) had it in homozygosity, whereas the other parent (BGolden Delicious^) lacked the insertion. Therefore, BTelamonŝ hould be heterozygous. The same is true for BJames Grieve^and its parent BCox's Orange Pippin^, as BJames Grieve^is homozygous for the absence of the insertion and BCox's Orange Pippin^for the presence. The Sanger sequencing confirmed the genetic state of each of these four individuals; therefore, the  The insertion was present in 89 of them, heterozygous in 64, and homozygous in 25 ( Fig. 3 and Table S2). The sequencing also showed that, contrary to expectations, 12 of the 25 individuals homozygous for insertion T-1201 were susceptible or very susceptible to PM. Among the individuals considered, there were also three mutants, namely BRoyal Gala^(mutant of BGala^), BRed Delicious^(mutants of BDelicious^), and BTurley Winesap^(mutant of BWinesap^). All of them were identical to their individual of origin ( Fig. 3 and Table S2).
To analyze the association between the presence/ absence of insertion T-1201 and resistance or susceptibility to PM, a subset of the 115 individuals was chosen. Fifteen individuals with no phenotypic data available were excluded, as well as the mutants previously mentioned. Furthermore, the individuals from Wädenswil and from the FruitBreedomics project Table (S2) were not considered for the CCA because their small number did not allow to perform the analysis. Two independent CCAs were carried out for two groups of individuals, the phenotypic data of which were obtained from different sources: data from direct observation (23 individuals) and data provided by Mr. Swensen (60 individuals). To read correctly the CCA biplots showed in Fig. 4, it is important to note that the two axes x and y have different importance in explaining the significance of the association: for both Fig. 4a, b, the majority of the significance is explained by the x-axis (73.25 and 84.84%, respectively). This means that the distance on the x-axis between the points indicating the genotype and the arrows indicating the phenotype is more relevant than the distance on the y-axis. Thus, the CCA carried out on the data coming from direct observation (Fig. 4a) showed two associations, one between the very susceptible phenotype and no insertion and the other between resistance and heterozygous insertion. Two partial associations were also noted between high resistance and homozygous insertion and between susceptibility and absence of the insertion. Conversely, the CCA performed on Swensen data (Fig. 4b) did not show any clear association.

Discussion
The screening of the FruitBreedomics re-sequencing dataset returned 678 SNPs in five MLO genes. Not surprisingly, SNP distribution was not balanced between introns and exons: the fewer SNPs in the exons can be explained by positive selection against detrimental mutations, whereas introns mutations are to a large extent neutral and subjected to random fixation (Kimura 1977). The same holds for the predominance of silent and conservative mutations in exons. None of the 127 SNPs found in exons affected any of the 30 amino acids identified by Elliott et al. (2005) as fundamental for the S-genes activity of MLO proteins.
The case of MdMLO5 deserves a comment: only six SNPs were detected in exons, suggesting that the gene is under intense stabilizing selection. Since this gene is unrelated to the infection caused by P. leucotricha (Pessina et al. 2014), PM epidemics should not favor the fixation of new mutations. The opposite situation was observed for MdMLO19, the gene with the highest number of SNPs and the only one where nonsense mutations were present, a situation indicating that selection should have favored the fixation of mutations. Two factors may have contributed: first, MdMLO19 is the primary target of P. leucotricha, suggesting a co-evolution of host and pathogen. This means that MdMLO19 causes susceptibility to PM in apple, and this is a well-known case where gene silencing results in recessive resistance to the pathogen (Pessina et al. 2016a). The second factor is that MdMLO11, due to the possible redundancy of its metabolic activity (Pessina et al. 2014), may support a loss of function of MdMLO19 without drastically reducing plant fitness.
The insertion of a thymine in position 1201 of MdMLO19 caused a frameshift mutation resulting in an early stop codon located 15-17 bp downstream of the insertion. As a result, the T-1201 insertion causes the translation of a 405 aa protein instead of the 590 aa of the regular protein (Fig. 2). The loss of 185 aa alone would probably compromise the function of  Canonical correspondence analysis (CCA) ordination biplot representing individuals' aggregation and phenotypical variables. The arrows emerging from the origins of the two axes represent the phenotypes, and their position indicates the association with the genotype: the closer the arrow is to the dot indicating the genetic composition, the stronger is the association. The three genetic compositions in exams are no insertion, heterozygous insertion, and homozygous insertion (colored boxes). The four phenotypes considered are very resistant, resistant, susceptible, and very susceptible (solid arrows). a CCA performed on 23 individuals, the phenotype of which was directly observed by apple breeders in FEM orchard. b CCA performed on 60 individuals, the phenotype of which was retrieved from the data provided by Mr. Ted L. Swensen MdMLO19; moreover, the C-terminal MLO region carries a calmodulin-binding domain which absence reduces by 50% the capacity of MLO to support infection (Kim et al. 2002). It is reasonable to assume that the truncated MdMLO19 is a non-functional or partially functional protein. Considering that the knockdown of MdMLO19 resulted in PM resistance (Pessina et al. 2016a), the homozygosity of insertion T-1201 was expected to support PM resistance.
The main purpose of our study was the analysis of the frequency of mutations in MLO genes when a representative sample of apple germplasm is considered. In this respect, however, FruitBreedomics resequencing data needed first to be validated. Thus, the presence of insertion T-1201 had to be confirmed by Sanger sequencing. The comparison between Sanger sequencing and the FruitBreedomics re-sequencing showed conflicting results. However, the in silico prediction of INDEL is complicated and less reliable than substitutions (Minoche et al. 2011;Robison 2012), therefore, the detection of some inconsistencies was not surprising. Three SNPs (G-1181-A, T-1188-C, and C-1205-T) were found to be always associated to insertion T-1201, suggesting that the insertion is carried only by a specific haplotype. Considering that the FruitBreedomics dataset includes the genome sequences of the 14 individuals from which the large majority of European apple varieties originated (Evans et al. 2011;Bianco et al. 2014), it is interesting that four of them contained insertion T-1201, namely, BMcIntosh^, BJonathan^, BDelicious^and BPriscilla^. It is reasonable to think that the allele present especially in the first three cultivars subsequently spread through their extensive use in breeding worldwide. BPriscilla^has a more limited use in breeding, as it is younger, has been distributed under an incorrect name (Evans et al. 2011), and has probably been used only in the breeding program of Wageningen UR.
Insertion T-1201 was present in 89 individuals, heterozygous in 64, and homozygous in 25. Five of these 89 individuals were mutants. Included as further control of the quality of sequencing, they were all found identical to their individual of origin with regard to the fragment of MdMLO19 analyzed in this study. However, some differences in the level of resistance were noted, particularly between BGala^(susceptible) and BRoyal Gala^(resistant), as well as between BDelicious^(resistant) and BRed Delicious^(susceptible). These differences are ascribed to the different sources of phenotypic information included in this study.
The CCA showed conflicting results in the two cases considered. This difference can be partially explained by the different origin of the data considered, but not by the fact that the observations were carried out in different geographical areas populated by different P. leucotricha strains because mlo resistance is known to be broadspectrum and unaffected by the different strains of the pathogens (Pavan et al. 2010). The contrasting results between the CCAs and the observation that 11 individuals homozygous for insertion T-1201 were susceptible or very susceptible to PM is in contrast with our previous findings in transgenic BGala^, where the knockdown of MdMLO19 resulted in a significant reduction of PM susceptibility (Pessina et al. 2016a). The specificity of MdMLO19 knockdown was tested and confirmed (Pessina et al. 2016a), therefore, the contrast between the two studies cannot be explained by offtarget knockdown of other MLO genes. However, the present study considered a high number of individuals, whereas the previous one regarded a single cultivar, so it is possible that the specific genetic background of BGala^is the key to explain the observed discrepancy. The knockdown of MdMLO19 in other apple cultivars would be necessary to clear this point.
To explain why individuals carrying a homozygous loss-of-function mutation in what is considered a PM Sgene were susceptible to the disease, we here discuss three hypotheses: (1) presence of other mutations in MdMLO19 that null the effect of insertion T-1201, (2) presence of other S-genes for PM that may substitute the role of MdMLO19, and (3) presence of mutations in genes required for defense. The first hypothesis was that the susceptible genotypes could carry other mutations that prevented the formation of the early stop codon. The only mutation found in the FruitBreedomics data that could null the effect of the insertion and cause the regain of the correct reading frame was deletion G-1181 in BPepino Jaune^, but Sanger sequencing showed that it was not present in any of the considered susceptible individuals. Other mutations could have a similar effect, but they were not found in proximity of insertion T-1201. Although their presence in other parts of MdMLO19 cannot be excluded, this does not seem likely on the basis of FruitBreedomics data. The second hypothesis contemplates the presence of other S-genes that might interfere with the PM phenotype elicited by Mdmlo19 recessive mutation. In a previous work, we showed that MdMLO19 is a susceptibility gene for PM in apple (Pessina et al. 2016a). However, other MLO genes might be in play: MdMLO18 was not considered an S-gene on the basis of the results of a complementation test in A. thaliana (Pessina et al. 2016a), but these kinds of test are not as reliable as in planta studies. Therefore, the role of MdMLO18 requires more clarifications. The two other members of apple Clade V, MdMLO5 and 7, were not considered because they were not responsive to PM inoculation (Pessina et al. 2014). The choice of excluding MdMLO5 and 7 from the study was justified by the understanding of the role of MLO genes in pathogenesis of that time, but recent results in grapevine revealed that non-responsive genes may have a secondary role (Pessina et al. 2016b). Thus, a role for MdMLO5 and 7 cannot be excluded. An interesting fact to consider is that MdMLO19 is the clade V MLO genes of apple with the highest basal expression. In cucumber and Arabidopsis, it was also observed that the major MLO S-genes is the one expressed the most. However, in both species, a minor role in susceptibility for other clade V MLO genes was observed, detectable only when the major S-gene was knocked out (Dr. Henk J. Schouten, personal communication; Consonni et al. 2006). In apple, only MdMLO11 was knocked down together with MdMLO19, with no effect on PM resistance (Pessina et al. 2016a), but no information is available for MdMLO5 and 7. Therefore, it is possible that these two genes have a redundant effect and partially complement the role of MdMLO19 in susceptibility. A further option to consider is the presence of other Sgenes outside the MLO family. The third hypothesis considered the possibility of mutations in genes that are required for an effective response to the infection. The PEN genes are a perfect example in this sense, as their knockout in A. thaliana restored PM susceptibility in Atmlo2-resistant mutant (Consonni et al. 2006). PEN genes are well known, but clearly not the only genes involved in pathogenesis, therefore a wider approach will be necessary.
Natural loss-of-function mutations in MLO S-genes were found in four species: barley mlo-11 (Piffanelli et al. 2004), tomato ol-2 (Bai et al. 2008), pea (Pavan et al. 2011), and cucumber (Berg et al. 2015). If its role will be confirmed, insertion T-1201 in MdMLO19 would be the fifth. To date, only the germplasms of barley and cucumber were screened for natural MLO loss-of-function mutations. Among the around 4100 barley accessions tested, the frequency of spontaneous mlo mutations varied between 0.2 and 0.6% (Jørgensen 1992). By contrast, a much higher frequency was observed in cucumber, where a transposon disrupting CsaMLO8 was detected in 27% of the individuals considered (Berg et al. 2015). The estimate of the frequency of insertion T-1201 in apple MdMLO19, based on FruitBreedomics data and Sanger sequencing results, was 9.5% if only homozygous individuals are considered (6 out of 63), 27% if also heterozygous ones are considered (17 out of 63), a result identical to what was observed in cucumber. The presence of insertion T-1201 in apple breeding cultivars can contribute to explain its high frequency.
Alleles of MdMLO19 carrying insertion T-1201 do not seem to be an immediate source of durable PM resistance in apple, and further studies are required to identify the other genes causing PM susceptibility in apple. The screening of the germplasms of other species might provide more information on the important and yet poorly studied aspect of the frequency of spontaneous mlo mutants.
Our results have shown how whole-genome re-sequencing of different individuals of a species, like that on the SNP discovery panel of the FruitBreedomics Axiom 487K array (Bianco et al. 2016), can provide valuable preliminary information for the study of the natural diversity of the germplasm of a species. Furthermore, the screening of re-sequencing databases can lead to the identification of candidates MLO S-genes: the presence of homozygous nonsense mutations in specific MLO genes of PM-resistant individuals would be an important indication that the gene might act as an Sgene. Finally, this approach could be extended to other diseases and other S-genes. consistency, contributed to the selection of the individuals, and revised the manuscript. RGFV contributed to the experimental design and revised the manuscript. PM provided the scoring data for apple individuals collected in FEM and the DNA for Sanger sequencing. HJS contributed to the experimental design and revised the manuscript. YB contributed to the experimental design and revised the manuscript. RV contributed to the experimental design and revised the manuscript. MM contributed to the experimental design and was the main reviser of the manuscript.
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http:// creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.