A practical genome-enabled legitimacy assay for oil palm breeding and seed production
- 185 Downloads
Legitimacy in breeding and commercial crop production depends on optimised protocols to ensure purity of crosses and correct field planting of material. In oil palm, the presence of three fruit forms permits these assumptions to be tested, although only after field planting. The presence of incorrect fruit forms in a cross is a clear sign of illegitimacy. Given that tenera forms produce 30% more oil for the same weight of fruit as dura, the presence of low levels of dura contamination can have major effect during the economic lifespan of an oil palm, which is around 25 years. We evaluated two methods for legitimacy test 1) The use of SHELL markers to the gene that determines the shell-thickness trait 2) The use of SNP markers, to determine the legitimacy of the cross.
Our results indicate that the SHELL markers can theoretically reduce the major losses due to dura contamination of tenera planting material. However, these markers cannot distinguish illegitimate tenera, which reduces the value of having bred elite tenera for commercial planting and in the breeding programme, where fruit form is of limited utility, and incorrect identity could lead to significant problems. We propose an optimised approach using SNPs for routine quality control.
Both dura and tenera contamination can be identified and removed at or before the nursery stage. An optimised legitimacy assay using SNP markers coupled with a suitable sampling scheme is now ready to be deployed as a standard control for seed production and breeding in oil palm. The same approach will also be an effective solution for other perennial crops, such as coconut and date palm.
KeywordsContamination SHELL DNA fingerprinting Genetic purity Seed quality control
Crude palm oil
Kompetitive Allele Specific PCR™
Restriction fragment length polymorphism
Standards and Industrial Research Institute of Malaysia
Single nucleotide polymorphism
Oil palm (Elaeis guineensis Jacq.) is an out-pollinating, but monoecious crop; however, maternal and paternal lineages for the purpose of seed production are developed independently based on presence/absence of kernel shell in the fruit. Thick-shelled dura palms produce lower oil yield than tenera heterozygotes, whereas shell-less pisifera palms are usually female sterile. The crosses between dura and pisifera result in thin-shelled tenera progenies that exhibit 30% higher oil yield than dura parents with full fertility restored. Hence, tenera palms are preferred for commercial planting.
Like other plant and animal breeding programmes, understanding the parentage of families and individuals is crucial. In oil palm, the goal is to measure combining ability between dura and pisifera based on performance of their tenera progenies, also known as progeny testing. Controlled pollination is required to ensure the test reflects the true families or backgrounds. However, pollination errors, such as a contaminated pollen source, inflorescence bag damage and late bagging are still evident in oil palm breeding and seed production, leading to illegitimacy . Aside from instances of uncontrolled pollination, hermaphrodite flowers occasionally form whereby self-pollination can occur , although the bunches with high number of male inflorescences can usually be identified and removed. Hence, seed illegitimacy in oil palm can happen at any stage from pollen collection, pollination and seed processing through to field planting. Quality assurance and problem identification are usually difficult, especially when combinatorial errors occur.
To date, the oil palm industry still relies on the shell fruit form and the presence of a fiber ring around the kernel to determine genetic purity of seed lots in breeding and commercial seed production. Seeds derived from dura (Sh + Sh+) x pisifera (sh- sh-) are theoretically 100% tenera (Sh + sh-) due to co-dominant monogenic inheritance . Unexpected dura and pisifera progeny palms are defined as non-tenera contamination. However, the fruit census can only be carried out 2–3 years after field planting. Census methods based on a cross-section of mature fruit is laborious and inefficient. Most importantly, replacement of contaminants at this mature stage is not economically viable, leading to long-term yield loss from these contaminant palms throughout the 20–25 years of production lifespan. To solve this issue, novel mutations in the SHELL gene that are responsible for the fruit form have been identified  and can be used for detecting non-tenera contamination as early as the seed stage. Early elimination of the contaminants before field planting would significantly improve cost effectiveness and reduce yield losses due to poor seed production quality, hence improving oil palm productivity per unit area of land [5, 6]. However, conventional fruit census and SHELL testing both share two key limitations, whereby the methods are unable to identify illegitimate tenera that are derived from the wrong crossing or parentage and do not provide informative data as to the production causes of the illegitimacy. The wrong parentage illegitimacy is believed to have further impact on oil yield due to the different genetic combining ability of every resulting progeny seed produced from a pollinated flower. This is especially impactful in breeding programmes where illegitimate parental stock can effect multiple future generations and confuse interpretations from breeding trials.
Established seed producers and plantation companies have dedicated breeding programmes to sustain their planting requirements. Accurate pedigree records enable these producers to utilize legitimacy tests on every cross or seed lot before releasing the seeds to stakeholders, which often include smallholder plantations and farmers. The objective of this study was to develop an improved practical assay for legitimacy test using an optimised number of genomic single nucleotide polymorphism (SNP) markers, followed by implementation as a commercial quality control tool. A commercial population of dura, half- and full-sib mother palms was blind tested to determine the accuracies of legitimacy test using 1000 SNPs and marker subsets optimised.
Genetic clustering and fruit form validation
Legitimacy reference of Family A
Sensitivity of legitimacy test across marker densities
For perennial species such as oil palm, selective breeding of elite dura and pisifera individuals is based on phenotypic performance of their progenies (reflected in combining ability). Legitimacy of a commercial progeny is declared if non-tenera contamination through fruit census is lower than 5%. Nevertheless, excessive dura and tenera can be commonly found in breeding crossing programmes used to expand and improve parental lines, such as dura x tenera and tenera x tenera, causing significant deviation from theoretical Mendelian segregation ratios (Chi-square test) . This suggests that ‘hidden’ illegitimacy can exist at unknown levels, where shell-type is expected to segregate or even where shell-type is consistent with legitimacy, as in commercial tenera material.
In this study, a total of 1000 palms were successfully assayed using the GS1000™ SNP panel and assigned to three clusters according to their pedigree i.e. Family A, Family B and Family C (Fig. 1). The results further validated the utility of the OP200K genotyping array used in previous studies [9, 10]. Family A and Family B which overlapped, were indeed half-sibs with a common paternal parent, AVROS Pisifera 1. This explains the higher genetic relatedness between these two families. Family C originated from multiple lineages of dura mother palms and could be clearly distinguished in the principle component plot, indicating the distinct genetic base. A legitimacy reference based on the parentage of Family A was then established (Fig. 2). At a 3% illegitimacy-indicative SNP threshold, for dura contamination (Family C) was distinguished through legitimacy test. Although the shAVROS mutation alone can explain 100% of the fruit form variation in the AVROS-based tenera palms in this study, this may not be the case in other origins due to additional reported novel mutations in the SHELL gene [4, 6]. Further characterisation on the haploinsufficiency of the SHELL gene also unveiled possible novel mutations and the potential for cis-compound mutations yet to be characterised. This may confound shell-type prediction, particularly in introgression hybrids and more exotic germplasm . Legitimacy tests based on identity by descent can effectively address this issue. The legitimacy test developed in this study was also able to distinguish the tenera fruit of Family A and Family B, while SHELL markers did not distinguish them and identified them correctly as both 100% tenera. With 0% dura occurrence, the current Standards and Industrial Research Institute of Malaysia (SIRIM) certification for commercial seed production would declare the mix of Family A and B as fully legitimate, but part of the sample set was actually from incorrect parentage (Family B). The yield loss due to tenera contamination can now be prevented or at least accounted for with the help of this practical legitimacy assay protocol. The same scenario occurs in other crops whereby illegitimate individuals cannot always be distinguished by phenotype. For example, in coconut (Cocos nucifera L.) breeding, the petiole colour is often used as phenotypic marker to determine genetic purity of Yellow Dwarf x Tall hybrids, but no colour variation is observed within Tall cultivars .
The DNA fingerprinting technique was developed in the 1980’s after discovering restriction fragment length polymorphism (RFLP) markers in human DNA . More marker systems were then discovered and applied in mammals , birds  and eventually plants. To increase genotyping throughput and polymorphism, oil palm researchers shifted their preference to microsatellite markers and the results were promising [16, 17, 18]. The technique has been strongly recommended as standard practice in all crosses and for tissue culture clones for the past decade. However, full adoption of legitimacy fingerprinting for commercial seed production has not been previously realized, mainly due to high assay cost and unclear sampling method. To address this, a practical method using SNP markers was developed for commercial-scale legitimacy test and the marker set size was optimised down from 1000 to 200 loci, sufficient to identify both dura and tenera contamination with consistent accuracy based on the test Family (A + B + C) (Fig. 3). Interestingly, only 80 loci were required to distinguish dura contaminants with consistent accuracy based on Family (A + C). An oil palm seed producer usually produces more than a million seeds annually. Testing every seed per cross is impractical and economically unjustifiable. However, many established sampling schemes for quality control, such as ISO2859-3 series are widely adopted in the manufacturing sector. These schemes provide a comprehensive reference on effective sample size to achieve the quality acceptance level based on the available seed lot and can also be adopted effectively to provide a quality underpinning to the legitimacy assay developed here for seed production quality control. Re-evaluation of the assay, however, is still necessary when dealing with alternative populations, or if the genetic base of existing parent populations becomes narrow due to selective breeding.
Legitimacy tests using SNP markers can identify both dura and tenera contamination at or before the nursery stage. An optimised marker set of 200 SNPs coupled with a suitable sampling scheme will enable implementation of legitimacy test with accuracy more than 97% as a standard quality control procedure for seed purity in breeding and commercial production. Without access to pedigree records and parent palms, the SHELL test is still a powerful tool for the smallholders to discard dura contaminations before field planting. For perennial crops such as oil palm, illegitimacy can easily result in failure of a 12-year breeding selection cycle, and for commercial palms, a 25-year reduction in yield potential. Hence, genetic purity and good agricultural practices are equally essential to ensure the highest oil productivity of the oil palm industry into the future.
Plant materials and DNA preparation
A total of 1000 palms comprising of Family A (700 palms from Deli Dura 1 x AVROS Pisifera 1), Family B (150 palms from Deli Dura 2 x AVROS Pisifera 1) and Family C (150 Deli dura palms) was selected. Family A and Family B were produced as commercial tenera seeds and Family C was used as mother palms for breeding and commercial seed production. Also, the three parent palms, known as Deli Dura 1, Deli Dura 2 and AVROS Pisifera 1 were included. All palms were derived from in-house breeding materials and maintained at Sime Darby Plantation R&D Centre, Malaysia. Total genomic DNA was isolated from 0.1 g of young leaf tissue (at frond 0) using the DNAeasy Plant Mini Kit (Qiagen, Germany).
The 1003 palms were genotyped for the legitimacy test and fruit form validation. A total of 1000 SNPs were selected from the OP200K Infinium array (Illumina)  to form a smaller genotyping panel, namely GS1000™. The probe design for the SNPs was done based on the requirement of the Kompetitive Allele Specific PCR™ (KASP™) genotyping platform. About 0.3 ng of the genomic DNA was used as template. Two fluorophores FAM and HEX were used to distinguish the KASP™ genotyping data. In the clustering plot, the samples marked in red and blue were homozygous for alleles with HEX and FAM, respectively. The heterozygous samples appeared in green. To confirm the fruit forms of assayed palms, the same genotyping method was done using KASP™ probes for shmpob and shAVROS mutations  at their reported genomic positions, i.e. 3,078,161 bp and 3,078,154 bp on Chromosome 2 . Only the SNPs with lower than 5% missing data were selected for subsequent analysis.
Progeny and parental genotypes that follow Mendelian inheritance
A/A, A/a, a/a
For the blind test, Family B and Family C were assumed to be contaminants for Family A. The legitimacy tests were conducted in different combinations of families, reflecting possible contamination types in production/field: (i) Family (A + C) (850 palms) for dura contamination, (ii) Family (A + B) (850 palms) for half-sib contamination, and (iii) Family (A + B + C) (1000 palms) for multiple contamination sources. A legitimacy reference was constructed from Family (A + B + C) using the GS1000™ panel. This was followed by sensitivity analysis of the legitimacy test using different SNP subsets ranging from 20 to 900 markers with stepwise increments of 20 up to 100 SNPs and then 100 markers up to 900 for each combination of (i) and (iii). Each marker subset was analysed with 1000 iterations of random marker selection from the GS1000™ panel. The accuracy of each test was then determined based on the observed number of illegitimate palms that could not be distinguish from the reference. The descriptive statistics and graph plotting for accuracy were generated in R.
We would like to acknowledge the sample contribution of our breeders from Oil Palm Breeding Section. We also want to thank Molecular Breeding Laboratory for genotyping the samples and analytical supports.
CT and HL conceived and designed the experiments; HL and CT analysed the data; CT, HA, AO, FC, DA and SM wrote the paper. All the authors have read and approved the manuscript.
This study was fully supported by Sime Darby Plantation Sdn. Bhd. for funding, Sime Darby Plantation R&D Centre for collecting materials, conducting experiment, analyzing data, paper writing and revising the manuscript.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
- 1.Corley RHV. Illegitimacy in oil palm breeding - a review. J Oil Palm Res. 2005;17:64–9.Google Scholar
- 2.Beirnaert A. Introduction à la biologie florale du palmier à huile (Elasie guineensis Jacquin), vol. 5. Congo Belge, Ser. Sci.: Inst. Nat. Etude agron; 1935. p. 3–42.Google Scholar
- 3.Beirnaert A, Vanderweyen R. Contribution à l’étude génétique et biométrique des variétiés d’Elaeis guineensis Jacq. In: Publ Inst Nat Etude Agron Congo Belge Ser Sci; 1941. p. 1–101.Google Scholar
- 5.Lim CC, Rao V. DNA marker technology and private sector oil palm breeding. Planter. 2004;80:611–28.Google Scholar
- 6.Ooi LC-L, Low E-TL, Abdullah MO, Nookiah R, Ting NC, Nagappan J, Manaf MAA, Chan K-L, Halim MA, Azizi N, et al. Non-tenera Contamination and the Economic Impact of SHELL Genetic Testing in the Malaysian Independent Oil Palm Industry. Front Plant Sci. 2016;7(771):1-13.Google Scholar
- 7.Dumortier F, Van Amstel H, Corley RHV. Oil palm breeding at Binga, Zaire, 1970-1990. London: Unilever Plantations; 1992.Google Scholar
- 8.Production of crude palm oil in Malaysia for 2016 and 2017. http://bepi.mpob.gov.my/index.php/en/statistics/production/177-production-2017/792-production-of-crude-oil-palm-2017.html. Accessed 15 Sept 2019.
- 12.Rajesh MK, Thomas RJ, Rijith J, Shareefe M, Jacob PM. Genetic purity assessment of D x T hybrids in coconut with SSR markers. Indian J Genet. 2012;72(4):472–4.Google Scholar
- 18.Singh R, Jayanthi N, Tan SG, Jothi Malar P, Cheah SC. Development of simple sequence repeat (SSR) markers for oil palm and their application in genetic mapping and fingerprinting of tissue culture clones. Asia Pac J Mol Biol Biotechnol. 2007;15(3):121–31.Google Scholar
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.