Background

Fruit quality is defined by four main principal factors; appearance, flavour, texture and nutritional properties [1]. Of these factors, texture is the major component and the most important, especially for fruit with a crispy flesh [2], and due to its influence on general fruit quality. Texture decay causes substantial fruit loss during shipping and storage which is caused by the degradation of the internal cellular compartment of the fruit, which consequently promotes the development of diseases typical of the postharvest storage and shelf-life [3]. Texture is recognised as a complex set of different sub-phenotypes, which can be divided into two main categories [2, 4]. The first encompasses mechanical features, and is fundamentally related to the strength exerted by the chemical bonds of the cell wall/middle lamella upon application of external pressure. The second category is defined by acoustic signatures, and is related to the cell wall breaking phenomenon with the consequent release of internal pressure [4, 5]. Texture change is a physiological event which occurs naturally throughout fruit development and ripening [6], and the magnitude of texture decay is extremely variable between different apple varieties [7]. The variability observed is the result of physiological mechanisms activated during the fruit maturation and ripening, in which a large number of enzymes are co-ordinately expressed to remodel the cell wall/middle lamella polysaccharide structure, and regulated, amongst other factors, by the effect of ethylene and transcription factors [8, 9]. The remodelling process is associated with a decrease in cell-to-cell adhesion, resulting in the separation of cells along the middle lamella (mealy texture) rather than a primary cell wall breaking (crispy texture; [1, 7]) when the fruit is consumed. Fruit softening and textural changes thus involve a coordinated modification of the primary cell wall and middle lamella polysaccharide structure, a process which initially takes place with a dissolution of the pectin polysaccharides of the middle lamella, followed by a disruption of the ordered structure of the primary cell wall [10, 11]. In several fruits the most active enzymes responsible for the pectin modification are polygalacturonase (PG) and pectin methylesterase (PME), while those acting on the primary cell wall are xyloglucan endotransglycosylase (XET) and expansin (Exp). Among this inventory, polygalacturonase is the major enzyme involved in the solubilization of the pectin polysaccharides [1214]. The degradation of the cell wall/middle lamella architecture is in practice considered the final result of the concerted activity of these enzymes, which are usually encoded by multigene families, confirming the complex genetic control of the fruit texture metabolism [1519].

Because of the impact that such physiological changes have on the marketability of edible fruit, researchers have for many years attempted to unravel the genetic basis of this mechanism, with the final goal of elucidating the genes underlying this dynamic process and the development of molecular markers suitable for phenotype prediction [6, 20, 21]. Quantitatively-inherited traits can be studied using a QTL mapping approach, which is generally carried out on bi-parental crosses. In apple, several reports have already identified major genomic loci putatively involved in fruit firmness and softening control [2226], with the largest texture QTL mapping survey described by Longhi et al. [27]. However, QTL mapping carried out using full-sib progenies presents important limitations due to the number of alleles that can be simultaneously analyzed as the approach samples only a small portion of the total allelic diversity within the cultivated apple germplasm pool. Moreover, linkage analysis requires the development of a segregating population, making this procedure laborious and time consuming. In addition, in this type of material the number of recombination events per chromosome is generally low, limiting genetic mapping resolution [16, 28, 29]. To overcome these limitations, the analysis of a wider genetic background is rapidly becoming the main strategy for the dissection of complex genetic architecture in plants, establishing genotype-phenotype association complementary to bi-parental linkage mapping [3036].

The main purpose of this study was to validate a QTL identifying a new set of valuable alleles associated to apple fruit texture sub-phenotypes in 77 cultivars. The phenotype was measured using an extremely precise technique to improve association resolution [37], and the impact of this gene on fruit texture was investigated further by cortex cell wall electron microscope scanning of two apple cultivars displaying contrasting texture phenotypes. Finally, a novel set of haplotypes and microsatellite marker alleles specifically related to important texture components are presented as valuable markers suitable for marker assisted parent selection (MAPS) as well to assist traditional breeding towards the selection of novel apple accessions characterized by superior fruit quality properties.

Methods

Plant material

A panel of 77 apple varieties, including both modern and old apple cultivars (Table 1), was chosen from two germplasm collections available at the Research and Innovation Centre of the Edmund Mach Foundation and the Laimburg Research Centre for Agriculture and Forestry, both located in the North of Italy (Trentino Alto Adige region). All the apple cultivars were planted in triplicate on M9 rootstocks and maintained following standard technical management procedures. Apple fruits were collected at the commercial harvest stage defined by monitoring the change of standard pomological parameters, such as skin and seed colour, brix value (total sugar content), cortex firmness assessed on site, and starch conversion index. Fruit were picked at a starch index of 7, based on a 1 to 10 scale.

Table 1 List of apple cultivars

Total genomic DNA was isolated from young leaf tissue, using the Qiagen DNeasy Plant mini kit (Qiagen) following the manufacturer’s protocol. DNA quantity and quality was measured spectrophotometrically with a Nanodrop ND-8000® (Thermo Scientific, USA).

Apple fruit texture assessment

Fruit samples were stored in a controlled temperature cellar at 2°C for two months after harvest to maximize the trait phenotypic variance, as reported in Costa et al. [7], and high resolution phenotyping was carried out for two years. In order to avoid any effect of low temperature, samples were maintained at 20°C prior to analysis. Fruit texture was phenotypically dissected assessing simultaneously both mechanical and acoustic fruit profiles using a TA-XTplus texture analyser coupled with an AED – acoustic envelop device (Stable Micro System Ltd., Godalming, UK). Sample preparation, instrument settings and parameter characterization are described in detail in Costa et al. [7]. The fruit texture assessment was performed in an isolated room, avoiding any possible external noise interference. Texture profiles were analyzed with an ad hoc compiled macro (Exponent v.4, Stable Microsystems), which allowed the distinction of the texture sub-phenotypes using three main categories; mechanical, acoustic and force direction. General texture variability, expressed by the 14 parameters (Yield Force, Max Force, Final Force, Mean Force, Young’s Module, Area, N. of Force Peaks, Force Linear Distance, Force Ratio, Force Difference, N. of Acoustic Peaks, Acoustic Linear Distance, Max Acoustic Pressure, Mean Acoustic Pressure [7]), was illustrated by multivariate techniques, including principal component analysis (Statistica software v7).

Md-PG1gene cloning and molecular marker genotyping

In this study a region of approximately 6 kb containing the Md-PG1 gene was investigated. In particular, 2395 bp (representing the full length gene) and two regions of 800 bp each, located approximately 1 kb up and downstream from the start and stop codon respectively, were cloned and sequenced. The cloning was performed in order to design specific primers able to distinguish Md-PG1 (MDP0000326734) from its homoeologue Md-PG5 (MDP0000159240). Md-PG1 fragment amplification was performed for the 1 kb up and downstream regions in a final volume of 25 μl with 10 ng DNA, 10× buffer, 0.25 mM dNTPs, 0.1 μM reverse and forward primers and 0.625 U of Eppendorf ® Taq polymerase. Amplification was performed using the following conditions: 2 min at 94°C, 32 cycles of 30 sec at 94°C, 30 sec at 56°C, 1 min at 72°C and a final extension of 7 min at 72°C. The full length Md-PG1 gene was amplified in a final volume of 50 μl, with 10 ng DNA, 10× Advantage 2 PCR Buffer, 0.2 mM dNTPs, 0.4 μM reverse and forward primers and 50× Clontech® Advantage 2 Polymerase Mix. Amplification was performed following the manufacturer’s recommendations.

Candidate gene SNP genotyping was performed by re-sequencing (Sanger technology) the regions described above from the 77 apple cultivars, using specific forward and reverse primers listed in Additional file 1. Sequences were assembled and analysed with Pregap4 software version 1.3 (Staden Package). For fine mapping the Md-PG1 region, in addition to the SNPs genotyped by re-sequencing, two microsatellites located in the assembled gene meta-contig were also used. The first was located 3 kb upstream of the Md-PG1 start codon and retrieved from Longhi et al. [27]. The second SSR marker, here named Md-PG1SSR10kd (kd; kilobases downstream) was positioned at 10 kb downstream the stop codon, and was identified de novo using the software Sputnik (http://espressosoftware.com/sputnik/index.html). PCR for SSR marker genotyping was performed as reported in Longhi et al. [27] (Additional file 1). Fragment sizes were called by GeneMapper® (Applied Biosystems, by Life Technologies).

Md-PG1SSR10kd mapping and QTL co-localization

The novel microsatellite motif found in the Md-PG1 meta-contig was mapped to the framework map of the ‘Fuji × Delearly’ population [27] by specific primer sequences designed with the software Primer3 (http://primer3.sourceforge.net/). The marker was integrated employing the software JoinMap 4 [38], using a LOD of 5.0 and a recombination frequency of 0.45. To investigate the co-location of this marker with QTL regions already associated with texture dissected sub-traits, a MQM computation was performed de novo using MapQTL 6 [39], selecting Md-PG1SSR10kd as a co-factor in order to reduce the residual variance. A LOD threshold value of 3.0, established after running 1000 permutations, was chosen to consider a QTL significant. The linkage group was visualized using MapChart 2.1 [40].

Population structure

To correct the analysis for population structure, the molecular profiles of 17 SSR markers (Additional file 2) and 368 SNPs [27] (16 out of the initial number of 384 failed to hybridize) were combined and used. Each microsatellite marker was selected according to map position, amplification efficiency and allelic size information available at the HiDRAS website (http://www.hidras.unimi.it). The population structure of the 77 apple cultivars was computed using a principal component analysis (PCA computed by Statistica software v7), which is a faster alternative to the MCMC model-based strategy, especially with large marker sets [34, 41, 42]. To account for genetic relatedness among individuals, the same marker data set used for population structure (Q matrix, fix effect) was also employed to generate a kinship matrix (K matrix), considered as random factor in the Mixed Linear Model performed using TASSEL [43, 44].

Linkage disequilibrium and marker-trait association

The linkage disequilibrium level among markers (SNPs and SSRs) identified within the apple cultivar collection, was calculated and visualised using Haploview 4.2, a software package designed for linkage disequilibrium statistics and haplotype block inference from genotype data [45]. This software was used to illustrate the pair-wise r2 among the 40 markers identified for the Md-PG1 gene. To illustrate the LD decay within the Md-PG1 region here investigated (from 3 kb upstream to 10 kb downstream the gene start and stop codon, respectively) the marker pair-wise r2 values were plotted against their physical distance on the chromosome 10. To fit the data, a smoothed line, represented by the logarithmic trend, was also added. The distribution of 63190 pairs of unlinked markers (368 SNPs) was employed to compute the r2, and the 95th percentile was used as critical point to consider true the linkage between syntenic marker loci.

Marker-phenotype association analysis was performed using markers with a MAF ≥ 0.05 (minor allele frequency higher than 5%), and employing both a fixed general linear model (GLM) and a mixed linear model (MLM) with random factors. Initially, the GLM algorithm, implemented in the software package PLINK release 1.07 ([46]; http://pngu.mgh.harvard.edu/~purcell/plink/), was used to find associations between the marker set and the first two principal components (PC1 and PC2), derived by the PCA computation performed on the texture parameters. Genome-wide adjusted empirical P-values were then computed and corrected running 1000 permutations. In a second step, the same phenotypic and genotypic data sets were used to find association by implementing the MLM model of TASSEL, where P-values were corrected for false positives using the False Discovery Rate approach (FDR ≤ 0.05), performed by the QVALUE package implemented in R [47]. A P-value ≤ 0.05 was considered as the criterion for trait-phenotype association. MLM corrected by FDR was further used to exploit specific association between markers and each single texture dissected sub-phenotypes.

Considering that the phenotype variability is more likely associated with SNPs assembled in haplotype configuration rather than singularly, an additional analysis was performed with haplotypes, inferred by FastPhase [48] using only the significant SNPs. Haplotype-phenotype association was computed by the GLM algorithm, and P-values were adjusted by running 1000 permutations.

Scanning electron microscopy of the apple cortex structure

To depict the different anatomic structure between a mealy (‘Golden Delicious’) and a crispy (‘Granny Smith’) apple fruit, a cortex portion from both cultivars was isolated and observed using a scanning electron microscope (SEM). Apple flesh slices were prepared by pulling apart the cortex portions which were then fixed for 2 hours at 4°C with 5% formaldehyde in a 0.1 M phosphate buffer (Na2HPO4 and NaH2PO4 pH 7). Samples were successively washed over night with 0.1 M phosphate buffer at pH 7 at 4°C. Dehydration was performed by incubating the slices for 15-20 min in solution with an increasing concentration of ethanol, and an Emscope 750 (Emitech, Ashford, Kent) was used to identify the critical drying point. Samples were finally coated with a SC 500 gold sputter coater (Bio-Rad Micro-science division) and examined using a Cambridge Instruments Stereoscan 260 scanning electron microscope.

Results and discussion

Apple fruit texture phenotype dissection

The 77 apple cultivars were phenotypically assessed for fruit texture by using a TA-XTplus-AED instrument. The trait dissection was performed identifying fourteen parameters over the combined mechanical-acoustic profile, ten of which were derived from the mechanical profile and four from the acoustic signature. The fruit texture variability evaluated within the apple collection over two years of observation is illustrated by the PCA plot (Additional file 3). The first principal component (PC1) describing 74.14% and 70.95% of the entire phenotypic variability for the two years respectively, together with the second principal component (PC2), accounting for an additional 12.44% and 12.95%, discriminated the orientation of the mechanical parameters from the acoustic group, suggesting a possible different genetic control for these two components [7]. The variable projection on the PCA space distinguished the separation between the two general texture components (Additional file 3), with all the mechanical parameters plotted in the negative PC1 and positive PC2 graph area, and the acoustic more oriented towards the area characterized by negative value for both PCs. The consistent variable orientation and cultivar distribution between the two years confirms this novel strategy as an efficient and reliable method to dissect the fruit texture complexity. In both years, the data distribution clearly distinguished mealy varieties (such as ‘Delearly’, ‘Golden Delicious’, ‘Gelber Edelapfel’, plotted on the positive PC1 values) from known firm and crispy varieties (such as ‘COOP39’, ‘Granny Smith’, ‘Fuji’ and ‘Cripps Pink’) placed in the area characterized by negative PC1 values.

Candidate gene based marker genotyping

The apple genome underwent a recent duplication resulting in a pair-wise colinearity of large chromosome segments [49] and because of this Md-PG1, the candidate gene investigated in this work and located on chromosome 10, shows a similarity of 86% with its homoeologue Md-PG5 on chromosome 5 [27]. To enable the characterization of the specific sequence for Md-PG1, the sequences of the two genes were retrieved from the ‘Golden Delicious’ genome.

Out of the 38 SNPs genotyped over the Md-PG1 genomic region, 22 were identified by re-sequencing the full length (2395 bp) within the apple collection, with an average frequency of 1 SNP/108.9 bases. Among them, ten were located in exons (total length of 1380 bp) with a frequency of 1 SNP/138 bp, and 12 SNPs in introns (total length of 1015 bp) with a frequency of 1 SNP/84.5 bp. These frequencies are consistent with previous observation made for apple of 1 SNP/149 bp [50], as well as in other outcrossing plant species such as pine with 1 SNP/102 [51], but lower than white clover (1/59, [52]) and grapevine (1/64, [53]). From the gene structure analysis resulted that SNPs found in non-coding regions were two fold more frequent than in coding ones [54]. Within the Md-PG1 predicted gene (with an intron/exon structure consistent with Bird et al. and Atkinson and Gardner [55, 56]) the SNP’s location along with their functional annotation is presented in Additional file 4 and Additional file 5. Among the remaining sixteen SNPs, three were located in the 3 UTR region, two were found 1 kb upstream the start codon and eleven 1 kb downstream the stop codon of the gene. In addition to the 38 SNPs, two microsatellite markers were also included. The first (Md-PG1SSR) was retrieved from the data of Longhi et al. [27], while the second, named Md-PG1SSR10kd, was de novo identified screening for microsatellite repetition over the Md-PG1 genomic contig (MDC000024.376 and MDC004966.443, available at http://genomics.research.iasma.it).

Md-PG1SSR10kd co-localizes with a texture hot-spot QTL

The newly developed Md-PG1SSR10kd microsatellite marker was further amplified and integrated into the ‘Fuji × Delearly’ genetic map, where several QTLs for apple fruit texture were previously mapped [27], and among which the major hot-spot cluster coincided with Md-PG1 gene. The allele segregation allowed the mapping of this marker in the same position of the gene, at 22.5 cM from the top of the linkage group. This second version of the ‘Fuji x Delearly’ map was used to calculate an improved version of the QTL profile for fruit texture, implementing this new marker as co-factor for the multiple QTL detection (MQM algorithm). The QTL cluster was confirmed on chromosome 10 (Figure 1) and associated with ten sub-traits representative of the fruit texture, such as yield force, maximum, final and mean force area, Young’s module, number of force and acoustic peaks, mean and maximum acoustic pressure. It is interesting to note that the highest LOD value corresponded with this novel marker, with a LOD value ranging from 3.85 to 8.80 and expressing a phenotypic variance between 19% and 41.8%, thus confirming its impact in the fruit texture association. Among the texture parameters employed in the QTL mapping, the Young’s module, related to flesh elasticity, showed the lowest level of association. This is consistent with the observations of Costa et al. [7] about this feature, which reported this index as more related to the cell layer compression behavior rather than the cell wall fracturing, the causal event of the mealy/crispy fruit texture, thus under the control of other genes encoding cell wall degrading enzymes.

Figure 1
figure 1

QTL-LOD profile for fruit texture dissected traits on LG 10. The different texture sub-traits are visualized with different colours, while the red arrow points to the position coincident with the Md-PG1SSR10kd locus. In the linkage group chart, the microsatellite marker used to map the Md-PG1 gene and presented in Longhi et al. [27] is shown in bold black text, whereas the novel microsatellite marker named Md-PG1SSR10kd is presented in bold red text.

QTL validation and allelic survey within an apple collection

A collection of 77 apple cultivars was analyzed using 38 SNPs genotyped by re-sequencing and two SSR markers by PCR amplification, contained in a region of approximately 16 kb. Among them, 22 SNPs were associated with the Md-PG1 full length, while the other 16 were identified in two genomic portions of approximately 1 kb each, respectively located 1 kb up and downstream of the coding portion of the gene. Finally, the two SSR markers additionally delimited two outer regions, located 3 and 10 kb up and downstream of the gene. LD analysis, performed in order to estimate the non-random association among the 990 pairs of polymorphic sites over the Md-PG1 region, was computed by HaploView (Figure 2a). Four haploblocks were identified, defined by markers showing a r2 between 0.19 and 0.85. The first block included two SNPs positioned in two adjacent exons, the second comprised SNPs in both exons and introns, while the other two blocks contained markers identified after the Md-PG1 stop codon. The third haploblock contained two SNPs, one located in the 3 UTR region and 1 kb downstream, respectively. The fourth block was represented by the Md-PG1SSR10kd microsatellite, located at 10 kb downstream the Md-PG1 stop codon. The LD decay over the Md-PG1 region (Figure 2b) was plotted in reference to the LD base-line set at r2 = 0.106, represented by the 95th percentile of the r2 distribution of unlinked markers. The baseline was determined following the methods of Breseghello and Sorells [57], which proposed that LD extent should be defined comparing the target LD with the one observed among unlinked loci, being the LD dependent on the sampling scheme. In this computation the intersection between the data fitting curve and the LD baseline defined an LD extent of ~2 kb, pointing to a rapid LD decay within this gene, confirming the suitability of the candidate gene approach [58] to find association between fruit texture and markers based on Md-PG1. Among the set of 40 markers only those having a MAF value higher than 5% were further used to find association with the texture phenotype, and to avoid spurious associations (due to false positive effects), the structure was taken into consideration as covariant. When phenotypic traits are correlated with population structure, loci that are not related to the trait under investigation may nonetheless be statistically associated [33]. Statistical correction for multiple test and MAF ≥ 0.05 were also employed in order to improve the QTL detection confidence with small sample set. For a better estimation of the size effect of this QTL, a wider collection will be further assembled and implemented in the analysis.

Figure 2
figure 2

LD plot with haploblock structure and LD decay within the Md-PG1 region. The LD image is based on r2 values (a), the darker the colour the higher the value. In the upper part, the number of polymorphic sites and the markers considered for the computation are indicated. The four haploblocks are highlighted in grey. On the top the physical position of the markers on the Md-PG1 contig are also illustrated, on which are distinguished the three main portions: 3 kb upstream, gene full length and the 10 kb downstream. Markers in bold are those included in the four haploblocks. Panel “b” shows the scatterplot of the LD decay of the linked marker pairs on chromosome 10 as a function of the physical distance (bp). The red line indicates the data fitting line. LD threshold, based on the 95th percentile of the unlinked marker pair distribution is superimposed on the scatterplot and represented by a grey dashed line.

The genetic relationship among the 77 apple cultivars was investigated by Principal Component Analysis. From the total number of PCs, ten were finally selected as covariates to represent the population structure, accounting for 32% of the total genetic variance. The traits employed in the association were represented by a phenotypic dataset containing 14 texture parameters, clustered in two main categories, mechanical and acoustic. These two groups, distinguished by the two principal components (computed on the phenotypic data set), captured 85% of the total phenotypic variance. Initially, the analysis considered the first two PCs as traits, and the association with markers (MAF ≥ 0.05) was computed by running both GLM and MLM modules. Six markers were commonly identified by both algorithms as statistically associated with PC1, consistent with the higher textural variation explained by this component compared to PC2 (Table 2). Among this set, five SNPs are specifically located within the full length Md-PG1 gene, and three are included in haploblocks. In particular, PG-full 10 is located in the 1st haploblock, and PG-full 19 and 20 in the 2nd haploblock. The remaining two SNPs, PG-full 1 and 12, were not present in any of the haploblocks defined here. This association additionally confirmed the role of one SNP in particular, here named PG-full 1. This SNP was in fact originally used to map Md-PG1 to linkage group 10 [26, 27]. The effect of this marker is here validated in a wider germplasm collection, supporting the previously formulated hypothesis about the effect of the amino acid changed due to this SNP on the fruit firmness control [26]. It is worth noting that the last marker of this set is the microsatellite Md-PG1SSR10kd, located in the 4th haploblock. In the MLM computation (corrected for false discovery) this microsatellite was also associated with PC2 (q-value: 0.035739, not shown in the table), principal factor expressing a lower quotient of phenotypic variability, but oriented towards the phenotypic dissection of the mechanical/acoustic components.

Table 2 Markers statistically associated with the mean PC1 computed for the two years of phenotypic observations

To better exploit the association between the markers and fruit texture sub-traits, each SNP was further analyzed with each single texture parameter. Eight out of the fourteen texture components resulted statistically associated with the marker set employed in the analysis (Additional file 6), including acoustic linear distance, number of force and acoustic peaks, area, final force, yield, maximum and mean force. The remaining six parameters showed a limited number of associated markers. Maximum and mean acoustic pressure were associated only with PG-full 9 and 1 kb down 5. Force linear distance was associated with PG-full 9 and 1 kb down 5, while Young’s module showed a significant P-value only with the third allele of Md-PG1SSR10kd. Parameters related to the force direction (∆ force and force ratio) were associated with PG-full 1, 12, 13 and the allele 2 and 3 of the Md-PG1SSR10kd marker.

To estimate more accurately the SNP frequency markers assembled into haplotypes were tested for association with the texture sub-traits. From the total number of SNPs significantly associated to the texture components, eleven, with a MAF ≥ 0.05 and located in the Md-PG1 full length, were selected and used to infer three significant haplotypes (H1, H2 and H3; Table 3 and Figure 3). H1 (the most frequent) showed a relevant association with nine texture sub-traits, and it was shared by cultivars distributed in the PCA plot over the PC1 axis, thus characterized by medium/low texture behaviour (mealiness, like ‘Golden Delicious’). H2, associated to six texture sub-traits of both a mechanical and acoustic nature, characterized cultivars known for a favourable texture properties (crispness), such as ‘Cripps Pink’, ‘Granny Smith’ and ‘Nicogreen’. The last haplotype H3, was associated with only two texture sub-traits (10.7% of the explained phenotypic variance), but it is worth noting that these are specifically related to the acoustic components (acoustic linear distance and number of acoustic peaks). As with H2, H3 was present in high texture performing apple cultivars, such as ‘CIVG198’, ‘Coop39’, ‘Ligol’ and ‘Minnewashta’. H2 and H3 also share four SNPs which leads to changes in the Md-PG1 primary sequence. These changes have been analyzed in order to see whether they might have an impact on the polygalacturonase enzyme activity, explaining, at least partially, the high flesh firmness typical of the varieties harbouring these two haplotypes. SNP1 (V/F) is located in an un-conserved region and F is one of the most frequent residues, thus it is not expected to negatively influence PG activity (Additional file 7). On the contrary, both the Q/R (SNP6) and the C/R (SNP10) conversions might slightly change the Md-PG1 activity. Indeed R residues are very rare among plant PGs in both positions. The last considered substitution (A/V, SNP18) is closed to a highly conserved region, with the A as the predominant amino acid, while V being slightly bigger and more hydrophobic might decrease the PG activity. As the alleles leading to the three changes are homozygous in all the tested crispy varieties (exception made for ‘Minnewastha’, which is heterozygous for the only SNP6), we hypothesize that a less active polygalacturonase isoenzyme could be less effective in middle lamella depolymerization. This finding was moreover supported by the fact that apple cultivars having a homozygous presence of the haplotype H1 are characterized by an extremely low texture property (mealiness), such as ‘Dalla Rosa’, ‘Early Gold’, ‘Limoncini’, ‘Napoleone’, ‘Permain Dorato’, ‘Rosmarina Bianca’ and ‘Tavola Bianca’ (a set represented for the most by old apple varieties).

Figure 3
figure 3

Structure of the three Md-PG1 haplotypes. For each haplotype the respective association with the presence or absence of the allele “3” of the microsatellite marker Md-PG1SSR10kd is reported. At the bottom, the four significant amino acid changes, differentiating haplotypes 2 and 3 from haplotype 1, are highlighted.

Table 3 Association between the three significant haplotypes and texture sub-traits

Parental selection

These SNPs and haplotypes can be considered as a novel toolbox to improve the phenotype prediction efficiency of breeding programs towards the programmed identification of the most suitable parents (MAPS-marker assisted parent selection) and the subsequent selection of novel accessions (MASS-marker assisted seedling selection) with improved fruit texture quality. It is also worth emphasizing, as markers useful for breeding, the microsatellite Md-PG1SSR10kd which was highly associated to the set of texture sub-traits. This microsatellite was targeted in the Md-PG1 meta-contig, in strong LD with SNPs located within the gene. The allelic state configuration of this marker within the apple cultivars (Table 1) showed a clear dosage effect when compared to the texture distribution over the PCA plot for both years (Figure 4 and Additional file 8). Apple cultivars characterized by the homozygosity for the allele Md-PG1SSR10kd_3 were located in the positive PC1 area of the PCA, thus showing a general low texture behaviour. When this allele was absent the cultivars, distinguished by PC1 values from -8 to 0, showed a superior textural properties. In contrast, apple cultivars characterized by a heterozygous state for allele “3” showed an intermediate texture distribution. As additional proof of the utility of this microsatellite marker for texture selection programs in apple, a correlation with the three significant haplotypes was also observed. Apple cultivars characterized by H2 and H3 (the two favourable haplotypes associated with a valuable texture performance) lack the allele “3” of this microsatellite marker, which showed a dosage effect associated to fruit texture decay. The haplotype survey carried out on the 77 apple cultivars, and the validation of their association with the texture components, highlighted that cultivars showing the two haplotypes H2 and H3, or lacking the allele “3” of the microsatellite marker, are distinguished by a favourable fruit texture behaviour. These varieties (Table 1) are already employed as valuable potential parents in breeding programs addressing the improvement of fruit quality in apple, while the SSR alleles/haplotypes can be further exploited to investigate the breeding potential of other apple accessions not yet characterized.

Figure 4
figure 4

PCA distribution in correlation with Md-PG1 SSR 10kd. Numbers correspond to the varieties as reported in Table 1. Colours indicate the allelic dosage for Md-PG1SSR10kd-3, with blue used for cultivars characterized by the absence of the “3” allele, green for cultivars having this allele in heterozygous state and red for cultivars carrying this allele in a homozygous state (thus present two times).

Apple fruit cortex structural characterization

The impact of the Md-PG1 gene on fruit development and ripening was also investigated by SEM (scanning electron microscopy). The mealy/crispy texture behaviour of the two cultivars was assessed using the texture analyser (Figure 5a and b). The analysis of the combined texture profiles (mechanical and acoustic), performed at a ripe stage, showed that ‘Golden Delicious’ displayed a lower texture performance (mealiness) with respect to ‘Granny Smith’, in which a better texture behaviour was observed (crispness). The digital extraction of the parameters underlined the different textures of these two cultivars showing a maximum force of 11.12 and 14.11 N, and acoustic peaks of 13 and 104 for ‘Golden Delicious’ and ‘Granny Smith’, respectively.

Figure 5
figure 5

Texture profiling and cortex scanning in ‘Golden Delicious’ and ‘Granny Smith’. Panel a and b illustrate the combined mechanical (blue line) and acoustic (red line) profile for ‘Golden Delicious’ and ‘Granny Smith’, respectively. Scanning electron microscopy of the fruit cortex cells, sampled after one week of postharvest shelf-life ripening, is shown for ‘Golden Delicious’ (c) and ‘Granny Smith’ (d). The scale bar is reported on the top of both panels.

The polysaccharide depolymerization of the middle lamella is one of the major events distinguishing mealy from crispy cultivars, and excessive degradation controlled by the polygalacturonase enzyme determines a significant weakening of the chemical binding between adjacent cells, facilitating cell-to-cell slippage along the middle lamella upon mechanical compression. This hypothesis is consistent with the fruit cortex structural observation made by SEM. Fruit cortex cells of ‘Golden Delicious’ had generally collapsed, due to a loss of internal turgor pressure, but were structurally intact, meaning mechanical rupture followed the cell boundaries at the level of the middle lamella, which in this cultivar were highly degraded. In ‘Granny Smith’, the cells were completely broken, showing an increase in laceration of the cell walls rather than in the middle lamella, most likely due to a reduced degradation activity that prevented cell separation occurring (Figure 5c and d).

The distinct anatomical structure between the two cultivars are also correlated with the different haplotype structure found within the Md-PG1 gene. It is worth noting that both cultivars present the Md-PG1 haplotypes in heterozygous state. One is a common haplotype shared between the two, which is not statistically associated to any fruit texture parameters. The other haplotype is represented by H1 and H2 for ‘Golden Delicious’ and ‘Granny Smith’, respectively, both distinguishing particular fruit texture behaviours.

Conclusion

The results of this work validated the impact of a QTL associated to fruit texture in apple presenting a new set of Md-PG1 alleles valuable for a marker assisted parent selection. Fruit texture is one of the principal quality factors in apple, and is a priority world-wide in modern apple breeding programs. Many works have been already presented to the scientific community, generally limited to QTL surveys focused on bi-parental maps. In this study, we identified a new set of markers and haplotypes related to Md-PG1 gene and associated to texture dissected sub-traits. In particular three haplotypes and a novel microsatellite marker, with a clear allelic dosage effect, were specifically associated to several texture components. The fruit texture dissection in mechanical and acoustic sub-phenotypes allowed more detailed and precise phenotyping to be performed in this investigation, which to date has been recognised as the limiting step in association studies. Among the genes devoted to cell wall metabolism, Md-PG1 is one of the most relevant genes encoding a cell wall degrading enzyme, and the results discussed here represent a first step towards the genetic dissection of the mechanical-acoustic component of fruit texture, a fundamental aspect for apple breeders, since the acoustic response (crispness) is generally recognised as the main priority in the creation of new elite accessions.