Introduction

Reflecting its long history of cultivation, various usage, and adaptation in various parts of the world, melon (Cucumis melo) is known as the most diversified among Cucurbitaceae crops. Indeed, great diversity is reported in its morphological traits, such as fruit weight and seed length ranging from 50 to 15 kg and 4.5 to 15.0 mm, respectively (Nuñez-Palenius et al. 2008; Akashi et al. 2002). C. melo is divided into 19 horticultural groups based primarily on flower and fruit traits: Agrestis, Kachri, Chito, Tibish, Acidulus, Momordica, Conomon, Makuwa, Chinensis, Flexuosus, Chate, Dudaim, Chandalak, Indicus, Ameri, Cassaba, Ibericus, Inodorus, and Cantalupensis (Pitrat 2016).

Fujishita and Nakagawa (1973), focusing on seed size variation, indicated that melon is classified into large-seed (≥ 9.0 mm) and small-seed (< 9.0 mm) types. Melon accessions of groups Cantalupensis and Inodorous are mostly classified as large-seed type and groups Agrestis, Conomon, and Makuwa as small-seed type according to Fujishita (1983) and Akashi et al. (2002). In Asia, groups Dudaim, Flexuosus, Chandalak, Ameri and Inodorus of large-seed type are mainly distributed in West Asia, Central Asia and India, whereas groups Chinensis, Conomon, Makuwa and Acidulus of small-seed type are distributed in Southeast and Far-East Asia and India (Stepansky et al. 1999; Akashi et al. 2002; Yashiro et al. 2005; Tanaka et al. 2007; Pitrat 2008; Tanaka et al. 2013).

Among Asian melons, small- and large-seed types have proven to be diversified in independent maternal lineages, Ia and Ib, respectively, according to the analysis of chloroplast genome (Tanaka et al. 2013). India and Africa are now considered as domestication centers of melon (John et al. 2013; Endl et al. 2018; Gonzalo et al. 2019), where small- and large-seed types and Ia and Ib types are frequently distributed. However, the landraces of Conomon and Makuwa are not reported in India. Conomon and Makuwa are mostly distributed from China to Japan and are considered to be diversified in small-seed melon with Ia type cytoplasm in areas somewhere lying between India and China (Akashi et al. 2002; Tanaka et al. 2007; Serres-Giardi and Dogimont 2012). In contrast to India and Far-East Asia, little is known about melon landraces in Southeast Asia.

It was proven that Myanmar melon landraces first investigated by Yi et al. (2009) possess large genetic variation as in Indian melon. Myanmar melons consist of both small- and large-seed types, and Yi et al. (2009) classified small-seed type accessions as Conomon, Momordica, or Agrestis. However, most Myanmar Conomon accessions were clustered separately from the Conomon and Makuwa of Far-East Asia. Although one accession was regarded as Makuwa, they could not rule out the possibility of its recent introduction from other countries such as China and Japan.

More recently, Nhi et al. (2010) and Duong et al. (2021) analyzing Vietnamese melon landraces showed that all except one accession were small-seed type. Vietnamese melon consists of seven cultivar groups, including “Dua le” and “Dua vang” regarded as Makuwa and “Dua bo” and “Dua gang-andromonoecious” as Conomon. In contrast, “Dua thom” and “Montok” showed genetic similarities with Indian and Myanmar landraces. Based on these results, the presence of groups Conomon and Makuwa was first confirmed in Vietnam. Therefore, to identify the origin of groups Conomon and Makuwa, melon landraces of Cambodia, Laos, and Thailand should be investigated.

We have conducted germplasm collection expeditions in Cambodia since 2014 and successfully introduced germplasms of melon landraces (Matsunaga et al. 2015; Tanaka et al. 2016, 2017, 2019). Genetic resources of Cambodian melon are not available in major Genebanks other than NARO Genebank, National Agriculture and Food Research Organization, Japan, so they are expected to contribute to breeding for disease resistance and to the analysis of diversification of melon in Asia. More specifically, genetic analysis of Cambodian landraces is essential for understanding the distribution of groups Conomon and Makuwa and their genetic relationships with Vietnamese landraces. Therefore, in this study, we aimed to identify the genetic diversity and genetic structure of Cambodian melons using RAPD and SSR markers, and to discuss genetic diversification in Asian melons.

Materials and methods

Plant materials

Among the Cambodian melon landraces collected during 2014–2017, 62 accessions were selected to cover most of Cambodia and divided into five geographical groups: east, center, west, north, and south. Details of the materials used are summarized in Table 1 and Fig. 1. These accessions have been introduced to NARO Genebank, Japan, with SMTA (Standard Material Transfer Agreement). A total of 229 accessions were used as references, including Conomon (9), Makuwa (11), Agrestis (6), Cantalupensis (10), Inodorus (8), and accessions from different areas (Vietnam: 58, Myanmar: 36, Thailand: 5, Yunnan (China): 5, Xinjiang (China): 24, Central Asia: 14, Pakistan: 11, Afghanistan: 9, Iran: 10, Spain: 8, and the USA: 5). Two accessions of wild cucumber C. sativus var. hardwickii were also used as the outgroup. Seed length and width were measured for 10 seeds of each accession, and accessions were classified as large-seed (≥ 9.0 mm in length) or small-seed (< 9.0 mm in length) types, according to Akashi et al. (2002).

Table 1 List of Cambodian melon landraces analyzed in this study
Fig. 1
figure 1

Map of Cambodia showing 19 provinces where melon landraces were collected

DNA extraction

Seeds of each accession were germinated on wet filter paper in a Petri dish and later transferred into pots, and grown in an incubator maintained at 30 °C with a 16 h light- 8 h dark cycle at a light intensity of 46.5 µMs−1-m−2. After 2 weeks, cotyledons from one seedling of each accession were ground individually in the liquid nitrogen. Total DNA was extracted using the cetyl-trimethyl-ammonium bromide (CTAB) method (Murray and Thompson 1980) with minor modifications. The quality and quantity of each DNA sample were determined with a spectrophotometer.

RAPD analysis

Ten random primers which were selected for their ability to detect polymorphism and for their stability in PCR amplification were used for RAPD analysis according to Nhi et al. (2010) (Table 2). PCR amplification was carried out in a 10 µl mixture containing 50 ng genomic DNA, 1 μl PCR buffer (Sigma®, St. Louis, MO, USA: 10 mM Tris–HCl; pH 8.3, 50 mM KCl), 2.5 mM MgCl2, 0.1 mM dNTP, 0.5 μM primer, and 0.25 U Taq polymerase (Sigma) by using an iCycler (Bio-Rad Laboratories, Hercules, CA, USA). The PCR cycle was started with an initial denaturing step at 95 °C for 3 min, 40 cycles at 93 °C for 1 min, 40 °C for 2 min, and 72 °C for 2 min. The duration of annealing temperature at 40 °C and the duration of extension at 72 °C were modified to 45 s and 1.5 min, respectively, for three primers A31, A57 and B86. The final extension step was at 72 °C for 5 min for all primers. After amplification, samples were electrophoresed on a 1.5% agarose gel (GenePure LE, BM Bio, Tokyo, Japan) at a constant voltage of 100 V using a horizontal gel electrophoresis system (Mupid-2, Cosmo Bio, Tokyo, Japan). Then, the PCR products were visualized with ethidium bromide staining, and their polymorphisms were evaluated.

Table 2 Primer sequence of 10 random amplified polymorphic DNA (RAPD) and seven simple sequence repeat (SSR) markers used in this study and the size of polymorphic fragments

SSR analysis

Seven SSR markers, developed by Ritschel et al. (2004) and Fukino et al. (2007) and showing high polymorphism in Vietnamese landraces as selected by Nhi et al. (2010), were used for the analysis of Cambodian melon landraces (Table 2). The PCR mixture was the same as used in the RAPD analysis, but the primer concentration was changed to 0.25 μM each. The PCR cycle was started with an initial denaturing step at 95 °C for 3 min, 35 cycles at 95 °C for 1 min, 60 °C for 1 min, and 72 °C for 2 min. The final extension step was at 72 °C for 5 min. PCR products were electrophoresed on 10% nondenatured polyacrylamide gel at a constant voltage of 260 V. Gels were stained in the same way as for RAPD analysis, and the size of the smallest band of each landrace was compared among the landraces.

Genotyping of CmACS7

The CmACS7 genotype was determined for sex expression analysis using the Cleaved Amplified Polymorphic Sequences (CAPS) marker developed by Boualem et al. (2008). The details of the PCR condition were given in Duong et al. (2021).

Data analysis

DNA fragments were scored as present (1) or absent (0) for RAPD markers. For SSR, DNA fragments were scored based on their size from the smallest (1) to the largest. From these data, the polymorphic information content (PIC) was calculated following Botstein et al. (1980). Genetic similarity (GS) was calculated after Apostol et al. (1993), using the formula GS = (N11 + N00)/T, where N11 and N00 were the number of positive and null bands, respectively, shared between two accessions and T was the total number of bands scored. The genetic distance (GD) between them was calculated using the formula GD = 1 − GS. Gene diversity (D) within each group and genetic distance (GD) between each group were calculated according to Weir (1996) and Nei (1972), respectively. A dendrogram was constructed by PHYLIP version 3.5.c (Felsenstein 1993), based on the GD matrix, with the unweighted pair group with arithmetic mean (UPGMA) method. Principal coordinate analysis (PCO; Gower 1966) based on the GS matrix was performed to show multiple dimensions of each population in a scatter-plot.

The population structure and the degree of admixture were estimated using the Bayesian clustering procedure of the STRUCTURE 2.3.4 (Pritchard et al. 2000). First the program was run to assume the number of distinct populations defined as K using the admixture model with correlated allele frequencies. The parameter sets were configured as follows: the length of the burn-in period and the number of Markov chain Monte Carlo (MCMC) repetitions after burn-in were 10,000, and 50,000, respectively, for each K ranging from 1 to 10 and running 20 iterations. The most plausible number of clusters was determined in the Structure Harvester program (Earl and von Holdt 2012) by calculating the distribution of the ∆K statistics as described by Evanno et al. (2005) and the performance of cluster matching across different K values was implemented through Cluster Markov Packager Across K (CLUMPAK) (Kopelman et al. 2015).

Results

Characterization of Cambodian melon landraces

Most Cambodian melons (56/62) were small-seed type as in those from Yunnan (China), Myanmar, Vietnam, Conomon, Makuwa and Agrestis (Table 3). In contrast, large-seed type was common in areas from Xinjiang (China) to the USA and also in Cantalupensis and Inodorus, while large- and small-seed types were common in areas from Central Asia to Iran.

Table 3 Number of accessions examined in each population and their characterization based on seed length, sex expression and gene diversity

Sex expression type, determined by CAPS analysis of CmACS7, also differed among melon populations (Table 3). All of the Cambodian melon accessions (62/62) proved to be monoecious. The monoecious type is also common in neighboring countries, such as Yunnan (China), Myanmar, and Vietnam (Dua thom). In contrast, andromonoecious type was common in Conomon, Makuwa, Vietnam (except Dua thom), in areas from Xinjiang (China) to the USA, and in Cantalupensis and Inodorus. Both monoecious and andromonoecious types were frequent in areas from Central Asia to Afghanistan.

RAPD and SSR analysis

Ten RAPD markers produced 12 polymorphic bands whose size ranged from 700 to 1520 bp, and the average number of marker bands per primer was 1.2 (Table 2). The average of PIC index was 0.297 and ranged from 0.047 (B32_700) to 0.452 (B99_1400). SSR analysis with 7 primer pairs generated 70 bands polymorphic in 291 accessions of melon and the average number of marker bands per primer was 10.0. The PIC index ranged from 0.664 (CMN04-07) to 0.891 (CMBR2) and was almost doubled that of the RAPD markers (Table 2).

Gene diversity (D) in Cambodian melon ranged from 0.105 (center) to 0.246 (south), and was 0.228 in 62 accessions (Table 3). This value was smaller compared with those in neighboring countries such as Vietnam (0.333 in 58 accessions), Myanmar (0.428), and Thailand (0.328), and nearly equivalent to those of Conomon and Makuwa (0.227 and 0.168).

Genetic relationship between Cambodian melon and reference accessions

Genetic distance (GD) among 291 accessions ranged from 0.000 (CAn33 vs CA n44 etc.) to 0.947 (P142 vs X8), while the maximum GD was 0.474 (CAs46 vs CAe9 etc.) in Cambodian accessions (data not shown). A dendrogram was constructed based on these GD values, and 291 accessions were grouped into 3 major clusters that were further separated into 24 subclusters (Fig. 2). Cluster I comprised 115 accessions, which included landraces from Cambodia (59/62) and neighboring countries such as Vietnam (Dua thom), Yunnan (China), Thailand, and Myanmar (Table 4). Accessions of Conomon, Makuwa, and Agrestis formed cluster II (68 accessions), together with landraces from Vietnam except Dua thom. Cluster III (108 accessions) consisted of Cantalupensis, Inodorus, and accessions from Xinjiang (China) to the USA. Landraces of Myanmar were classified into three clusters in good accordance with the highest D value (0.428).

Fig. 2
figure 2

Genetic relationship between 293 accessions revealed by unweighted pair group method with arithmetic mean (UPGMA) cluster analysis based on genetic distance, and population structure inferred by STRUCTURE using the admixture model

Table 4 Number of melon accessions classified into 24 subclusters and three populations in Fig. 2

Structure analysis indicated the presence of 3 main populations in 291 accessions studied (Fig. 2), based on the ∆K value, and 61, 101, and 93 accessions were assigned to populations P1, P2, and P3, respectively, using a Q-value threshold of 70% (Table 4). The remaining 36 accessions were regarded as admixture type. The classifications of 3 clusters and 3 populations were highly associated, as 86.1, 88.2, and 85.2% of accessions in clusters I, II, and III were assigned to P1, P2, and P3, respectively. Accessions of admixture type were mostly found in marginal subclusters such as If, IIg, and IIIa. Accordingly, Cambodian accessions belonged to P2 together with those from Vietnam (Dua thom), Yunnan (China), Thailand, and Myanmar, and the genetic relationship detected by cluster analysis was reproduced by structure analysis. A genetic relationship was also visualized on the scatter plot of PCO1 (21.3%) and PCO2 (16.5%) (Fig. 3). Accessions of three populations were distinctly separated and admixture accessions were located among three groups.

Fig. 3
figure 3

Distribution of 293 accessions on the first two principal coordinates. Accessions are indicated with symbols unique to each population; P1 (◆), P2 (●), P3 (▲), and Admixture (○)

Three clusters were also characterized by seed length and sex expression type (Tables 3 and 4). Most of accessions of clusters I (111/115) and II (65/68) commonly had seeds shorter than 9 mm in length and were classified as small-seed type. In contrast, the sex expression type differed between clusters I and II, and monoecious type and andromonoecious type were common in clusters I (108/115) and II (56/68), respectively. Cambodian melon proved to be small-seed type and monoecious type (56/62), similar to those of Vietnam (Dua thom), Yunnan (China), and Myanmar, thus showing close similarity among these populations. Accessions of cluster III were mostly large-seed type (80/108) and andromonoecious type (87/108).

Genetic relationship between 24 melon populations

The genetic distance (GD) between 24 populations calculated from the RAPD and SSR data ranged from 0.018 (Cambodia-North vs Cambodia-South) to 0.984 (Makuwa vs Xinjiang (China) (Table 5). GD among 5 populations of Cambodian melon ranged from 0.018 to 0.097, and was 0.062 on average. These values were nearly equivalent to those among 3 Vietnamese populations of Dua bo, Dua gang and Dua le (0.035–0.082), among Conomon and Makuwa (0.047) and among 5 populations from Xinjiang (China) to Iran (0.044–0.112), indicating that Cambodian melons were not genetically differentiated among geographical populations.

Table 5 Genetic distance between 24 populations based on the frequency of 10 random amplified polymorphic DNA (RAPD) and seven simple sequence repeat (SSR) markers

The genetic relationship between 24 populations was visualized by cluster analysis (Fig. 4) and by PCO analysis (Fig. 5). In accordance with the results of accession-based analysis, 24 populations were separated into 3 groups and Cambodian populations related closely to those in neighboring countries.

Fig. 4
figure 4

Genetic relationship between 24 populations of melon, revealed by unweighted pair group method with arithmetic mean (UPGMA) cluster analysis based on genetic distance

Fig. 5
figure 5

Distribution of 24 populations of melon on the first two principal coordinates

Discussion

The distribution of Conomon and Makuwa is limited to the eastern part of Asia, including China, Vietnam, Korea and Japan, and thus Conomon and Makuwa were considered as being introduced from the West. Previous research using isozyme and molecular markers have suggested that Conomon and Makuwa originated from small-seed melons with Ia type cytoplasm in areas lying somewhere between India and China (Akashi et al. 2002; Tanaka et al. 2007; Serres-Giardi and Dogimont 2012), and Tanaka et al. (2007) confirmed the presence of landraces, which were clustered together with Conomon and Makuwa, in Northeast India. These results highlight the importance of genetic research of melon landraces in Southeast Asia. However, few accessions are available in the public genebank worldwide. Therefore, we collected melon germplasm from Southeast Asian countries and analyzed genetic structure in Myanmar (Yi et al. 2009) and Vietnam (Nhi et al. 2010; Duong et al. 2021). This research indicated genetic differentiation in melon landraces in Southeast Asian countries, and highlighted the importance of genetic research using landraces from Cambodia and Laos.

The joint expedition team of NARO (Japan) and CARDI (Cambodian Agricultural Research and Development Institute, Cambodia) conducted melon germplasm collection in the western (2014), eastern (2015), central and northern (2016) and southern (2017) regions (2017) of Cambodia during 2014–2017 (Fig. 1). The geographical distribution, horticultural characteristics, and usage of melon landraces in Cambodia were first fully described by their reports (Matsunaga et al. 2015; Tanaka et al. 2016, 2017, 2019). According to their reports, non-netted melon with low sugar content was commonly grown in every part of Cambodia and utilized mature fruit as a desert, and immature fruit as vegetable or pickles. Immature and mature fruits of Conomon are mainly used for making pickles in Japan, suggesting the similarity between Cambodian melons and Conomon. To discuss the origin of Conomon and Makuwa, using part of this collection, we uncovered the genetic diversity of Cambodian melons in the present study, and compared it with those in neighboring countries.

Genetic diversity of cultivated melons is rich in India (Akashi et al. 2002; McCreight 2004) and this country was once considered the secondary center of diversity. However, based mainly on the discovery of wild perennial melon (C. trigonus), India is now considered as one of the domestication centers of melons along with Africa (John et al. 2013; Endl et al. 2018). Moreover, large genetic variations in Indian melons have been reported in various traits (Dhillon et al. 2007, 2009; Fergany et al. 2011; Roy et al. 2012). Seed size variation was also studied, and both large- and small-seed types are commonly distributed in India, with the latter being rather frequent in Northeastern India (Akashi et al. 2002; Tanaka et al. 2007). In contrast, small-seed type is frequent in Myanmar (36/41, Yi et al. 2009), Cambodia (58/58, Table 3), and Vietnam (61/61, Table 3), indicating an increasing trend of small-seed type from India to Vietnam. Genetic diversity also showed a similar geographical cline. D was larger in India compared to Myanmar (Yi et al. 2009), while in the present study (without Indian accessions), it was highest in Myanmar (0.428, Table 3) followed by Vietnam (0.333) and Cambodia (0.228). Although D was rather high in Thailand (0.328), the proportion of small-seed melons was exceptionally low (40%), indicating a necessity of diversity study using more landraces.

In areas from South to Southeast Asia, Myanmar landraces are closely related to Indian landraces (Tanaka et al. 2007; Yi et al. 2009), and as well as those of Thailand, Cambodia, Yunnan (China), and Vietnam (Dua thom) (Table 4, Fig. 4). These results indicated the introduction of non-netted melons from India to Southeast Asia and the decrease of genetic variation by natural and artificial selection. In this area, various ethnic groups of Tibeto-Burman or Austroasiatic language families live intricately across political borders (Clarke 2001). For example, the “Karen” tribe lives in Northeastern India, Myanmar, and Thailand, and the “Hmong” tribe lives in Laos, Thailand, and Vietnam. Local people definitely played a crucial role in the trade of goods (Misra 2005; Shibayama 2019) and the spread of melon from India to the East.

Southeast Asian melon landraces comprised two genetic groups, as clearly shown in Table 4 and Figs. 3, 4 and 5. The first group consisted of landraces of cluster I and population P2, which are commonly grown in Myanmar, Thailand, Cambodia, Yunnan (China), and mountain area of Vietnam (Dua thom). The second group consisted of landraces of cluster II and population P1, including Conomon and Makuwa, Agrestis (weedy melon) collected in Japan and Korea, and landraces grown in the plains of Vietnam. Landraces of both groups show similarities in several traits, such as smooth skin without net and small seeds shorter than 9 mm, and so should be classified as ssp. agrestis, according to the former systematics by Pitrat (2013). In addition, both groups belong to the same maternal lineage (Ia), as revealed by the analysis of chloroplast genome markers (Tanaka et al. 2013: Tanaka et al. in preparation). However, they also differed in several traits, reflecting their genetic differentiation. The key trait is sex expression type, and the first and second groups showed monoecy and andromonoecy, respectively (Table 3). Flesh sweetness also differs among groups, but the difference is not simple and only Makuwa of the second group has sweet flesh.

It was therefore concluded that melon landraces predominantly grown in most parts of Southeast Asia, including Cambodia, were of the first group and those of the second group were limited to the plains of Vietnam. In Vietnam, the second group is grown in the plains by “Kinh” tribe originally migrated from southern part of China, whereas the first group is grown in mountain areas by ethnic groups “Hmong” and “Thai” in northern part and by ethnic groups “Jarai” and “Hmong” in southern part (Duong et al. 2021). Therefore, the allopatric distribution of ethnic groups was considered as the primary reason for the presence of two genetic groups in Vietnam, though the possibility of severe selection due to pests, insects and so on could not be ruled out. In case of Cambodia, it shares the border with southern part of Vietnam where “Khmer” tribe lives as ethnic group, and thus the introduction of melon landraces across the border was reasonably hypothesized. However, landraces of different genetic groups were grown on both sides of the border, and Conomon and Makuwa, or their ancestral type were not found in Cambodia. Therefore, Cambodian melon is not directly related with the establishment of Conomon and Makuwa, supporting the introduction of Conomon and Makuwa from southern part of China to Vietnam as indicated by Duong et al. (2021). To further confirm this conclusion, landraces from Thailand, Laos, and Yunnan (China) should be studied in detail.