Introduction

Mycale cecilia is a widespread shallow-water sponge distributed along the tropical Eastern Pacific from Mexico to the Galapagos Islands, with some records in Hawaii [1, 2]. The species typically inhabits a broad variety of habitats, including rocky substrates from bays and estuaries and coral reefs ecosystems, where it may directly interact with corals by covering their branches [3]. This species is also well known for synthesizing secondary metabolites such as mycalazals, mycalenitriles, and related pyrrole-2-carbaldehyde derivatives, which have shown activity as growth inhibitors of various tumor cell lines, in particular against the human prostate cancer cell line [4].

In the last decades, a variety of molecular markers (e.g. sequences from COI mtDNA and 18S, 28S ITS’s nrDNA genes) have been used to establish phylogenetic relationships in sponges [5], and in some cases, to elucidate the phylogeography and genetic connectivity of their populations [6]. However, these markers have shown a low level of intraspecific variability [6, 7], which hindered its application for population studies, highlighting the need to include more variable markers. Novel methodological developments in massive parallel DNA next-generation sequencing technologies allow targeting large fractions of the genome to identify multiple Single Nucleotide Polymorphisms (SNPs) [8]. However, the microbial genetic pool is difficult to separate from the sponge DNA after DNA extraction; hence, identifying sponges’ SNPs from DNA mixed (holobionts and sponges) may be challenging using next-generation sequence data [8].

Microsatellite loci have been shown to be a powerful tool and a good option for populations genetic studies in sponges. Nevertheless, their implementation has been limited to a few studies in the Caribbean, the Atlantic-Mediterranean, Antarctica and the Indo-Pacific regions [e.g., 9, 10, 11, 12]. No microsatellites have yet been developed for poriferans from the Eastern Pacific. Here, we report the isolation of 12 microsatellite markers in M. cecilia and preliminary data on their allelic variation in two coral reef localities of the Mexican Pacific coast. In addition, we tested the cross-amplification of these microsatellites in a closely related congeneric species Mycale microsigmatosa (from the Gulf of Mexico).

Materials and methods

Sample collection, DNA extraction and next-generation sequencing

A specimen of M. cecilia was collected from Mazatlán, Sinaloa, México (23°11’09.18” N, 106°25’23.47” W) in October 2015. Genomic DNA was obtained from fresh tissue using Wizard® SV Genomic DNA Purification Kit (Promega, Madison, WI). For next-generation sequencing a genomic DNA library was assembled with the Kapa gDNA library kit (Kapa Biosystems, Wilmington, MA), applying a multiplex index. This library was then sequenced using 1/7 of a single lane (2 × 125 base pairs) in a MiSeq platform (Ilumina, San Diego, CA).

For the evaluation of the novel microsatellite loci, 30 individuals of M. cecilia were collected by SCUBA diving from two coral reef areas, 15 from Isabel Island, Nayarit (21°50’31” N, 105°52’51” W) in October of 2018, and 15 from La Paz, Baja California Sur (24°09’31” N, 110°20’26” W) in March 2019. Additionally, eight specimens of the congeneric species Mycale microsigmatosa were collected in Laguna de Términos, Campeche in March 2019 (18°38’18” N, 91°47’55” W) and included to tests the cross-amplification of the new microsatellites. For these samples, DNA was extracted using the cetyltrimethyl ammonium bromide (CTAB) protocol [13].

Design of microsatellite primers and genotyping

All DNA reads were analyzed for quality control using FastQC v.0.10.1 (Babraham Institute, Cambridge, UK) [14], then they were trimmed and subject to de novo assembly into contigs with CLC Genomics Workbench v.7.0.3 (CLC bio, Boston, MA) [15]. The search for repetitive motifs for microsatellites (di-, tri-, and tetra nucleotide repeats) and the PCR- primer design were made from the resulting contigs (under the parameters: minimum of 5x coverage, a product size of 110–250 bp and primer length range 19–39 bp) using Msatcommander [16]. The resulting oligos were then synthesized in Macrogen, South Korea. Initially, all primer sets were tested in 10 specimens at annealing temperatures ranging from 54 to 60 °C and through MgCl2 concentration gradients. A fluorochrome was incorporated into the complementary universal tail M13(-21) of the forward primer [17], however, amplifications with fluorochrome were only achieved for 16 microsatellites, which were used in the rest of samples. Loci were amplified using the following reactions conditions: 6.1 µl dH2O sterile Milli-Q (Merck Millipore), 3 µl 10x PCR buffer (20mM MgCl2), 1 µl (20mM MgCl2), 0.7 dNTPs (10mM), 0.5 µl of the unlabeled M13(-21)-tailed forward primer, 0.5 of reverse primer (10mM), 1 µl BSA (Bovine Serum Albumin), 0.1 µl Go Taq® Flexi DNA Polymerase PROMEGA (5µ/µl) and 1 µl of template DNA. Additionally, 1 µl of the fluorescently-labeled M13(-21) tail (FAM, NED, PET, VIC) (Applied Biosystems) for a final volume of 14.9 µl. PCR-amplification were performed on a T100 Thermal Cycler (Bio-Rad) using the following profiles: 95°C / 5 min, (94°C / 30 s; 57–58°C / 35 s; 72°C / 1 min) x 39 cycles; 72°C / 20 min.

PCR products were then visualized on 2.0% agarose gel and then genotyped with an ABI3730 DNA Analyzers at The University of Arizona.

Data analyses

Alleles were visualized and sized in GeneMarker v.2.6.3 with GeneScan™ 500 LIZ (Applied Biosystems). Finally, the presence and frequency of null alleles as well as genotyping errors were assessed with Micro-Checker v.2.2.3 with a size standard [18].

Basic genetic diversity indices, number of alleles, observed (HO) and expected (HE) heterozygosities, polymorphic index content (PIC), and linkage disequilibrium (LD) tests were calculated with MStools [19]. Hardy–Weinberg equilibrium (HWE) by locus was assessed using a probability test with a level of significance determined by the following Markov chain parameters: 1000 dememorization steps, 100 batches and 1000 iterations per batch were analyzed using GENEPOP web v 4.2 [20], using the Weir and Cockerham estimate [21]. The Benjamini and Hochberg false discovery rate (FDR) procedure was applied to correct for multiple testing [22].

Results and discussion

We obtained a total of 79,223,420 good quality reads (Q = 32) from the sequencing experiment, which were assembled into 441,241 contigs with an average length of 140 bp. Thirty-six microsatellite loci met the criteria and were tested (25 dinucleotide, 3 trinucleotide and 8 tetranucleotide). Of the 36 candidate loci, 16 were successfully amplified, but 4 were monomorphic and 12 consistently polymorphic. (Table 1).

Table 1 Microsatellite markers isolated from Mycale cecilia

Micro-Checker did not detect evidence of scoring errors. High (> 10%) null allele frequencies were found in La Paz for loci MYC1-146780, MYC1-304443 and MYC1-3453. Also, no significant values were found in the linkage disequilibrium test for any pair of loci after FDR correction (α = 0.05). All microsatellite markers were polymorphic with PIC values higher than 0.554 (range 0.566–0.871 for La Paz and 0.554–0.859 for Isabel Island), making them a helpful tool for future studies of genetic structure for M. cecilia. A total of 51 alleles were found in La Paz and 47 in Isabel Island, which ranged from 2 to 9 alleles per locus with a mean number of 4.25 for La Paz and 3.91 for Isabel Island.

Expected heterozygosity (HE) was overall high, with values ranging from 0.616 to 0.901. Results from GENEPOP suggested that a total of eight loci deviated from Hardy-Weinberg equilibrium (HWE) from La Paz and one from Isabel Island (Table 1), this could be due to factors unrelated to the presence of null alleles [23]. On the other hand, the heterozygotes deficiency found in nine loci in La Paz seems to be linked to HWE deviations, as seen in other sponges [24, 25]. Also, it has been suggested that it may be due to the mixing of genetic populations because of the introduction of different sources, known as the Wahlund effect [13].

M. cecilia is a suitable model sponge species for population genetic studies due to its abundance, widespread distribution (along the Eastern Tropical Pacific coast), and the variety of ecosystems it inhabits. Further studies using these molecular markers will help investigate the structure and gene flow among populations of this species. This is relevant in ecosystems in need of a better ecological understanding such as coral reefs, where genetic connectivity studies could significantly contribute to determining natural areas that require protection.

Cross-amplification with eight specimens of the congeneric species Mycale microsigmatosa was successful for 11 loci (except MYC1-19973). Therefore, they are also now being tested in additional specimens from the Gulf of Mexico and the Mexican Caribbean for population studies. Should the loci prove as polymorphic as in M. cecilia, these results are encouraging for the extended application of these novel microsatellite loci in other congeneric species such as Mycale phyllophila from the Indo-Pacific.