Introduction

Nucella lapillus (Linnaeus 1758) is a marine gastropod species belonging to the family Muricidae within the order Neogastropoda. It is widely distributed along the rocky shores of the North Atlantic, from Southern Portugal to Northwestern Iceland, and from Terra Nova (Canada) to Long Island (New York, USA) (Collins et al. 1996; Colson and Hughes 2007; Marko et al. 2014). This species exhibits remarkable intraspecific morphological and chromosomal polymorphisms that are linked to its habitat, primarily wave exposure, and to a lesser extent, predation (Staiger 1957; Bantock and Cockayne 1975; Pascoe 2006).

The two chromosomal races of N. lapillus are typically characterized by thick-shelled specimens with 2n > 26 chromosomes found in sheltered sides and thin-shelled specimens with 2n = 26 chromosomes occupying wave-exposed localities (Crothers 1985; Vazquez 2015). This variation arises from the fragmentation of five specific metacentric pairs in the 2n = 26 karyotypes into ten acrocentric pairs, likely due to Robertsonian fusions/fissions (Pascoe 2006), and the different balanced combinations of intact and fragmented pairs in homo and heterozygosis result in karyotype intermediate between 26 and 36 chromosomes, which vary between locations and specimens (Pascoe 2006). As such, N. lapillus provides an excellent model for understanding the evolutionary process of species differentiation under divergent environmental conditions, making it ideal for studying chromosomal evolution, speciation, and biodiversity (Staiger 1957; Pascoe 2006). Several previous studies have shown that this phenotypic and chromosomal differentiation observed in N. lapillus is linked to a substantial genetic divergence between chromosomal races, as demonstrated by allozyme markers and microsatellites (Kirby et al. 1994; Guerra-Varela et al. 2009). However, the extent of this genetic differentiation remains a matter of debate, and it is yet to be determined whether these are isolated species in various degrees of hybridization or a single species in divergent evolution under different selective pressures (Kirby et al. 1994).

In this regard, the mitochondrial genome is a reliable molecular marker to infer the phylogenetic evolution between species and populations. Previous studies on N. lapillus have been limited to a fragment of the malate dehydrogenase I gene (Kirby 2000), and a comprehensive analysis of the mitochondrial genome is essential to determine distinct mitochondrial lineages between ecotypes or localities. Therefore, the present study aims to assemble and annotate the complete mitogenome of N. lapillus using two next-generation sequencing approaches, namely Illumina short-read and long-read nanopore multiplex PCR sequencing. The gene arrangement and sequence similarity will be compared to other Muricidae species, and differences between chromosomal races of this species will be assessed.

Materials and methods

Biological samples

A total of 40 adult specimens of N. lapillus were collected at each of the localities indicated in Table 1 and transported to the laboratory of molecular cytogenetics of the University of Vigo (NW Spain). There, mitotic metaphases were obtained from gills and gonads according to García-Souto et al. (2018). In brief, after an in vivo overnight colchicine (0.005%) treatment in seawater, animals were euthanized and their soft tissues dissected and processed to obtain chromosome spreads and genomic DNA following previous protocols. For cytogenetics, gills and gonads were hypotonized in 50% and 25% seawater and fixed with methanol/acetic (3:1). Small pieces of fixed tissue were disaggregated with 60% acetic acid and dropped onto heated glass slides. These were stained with a mixture of DAPI (0.14 μg/mL) and PI (0.07 μg/mL), examined using a Nikon Eclipse-800 microscope equipped with an epifluorescence detection module and photographed with a DS-Qi1Mc CCD camera (Nikon) controlled by the NIS-Elements software (Nikon). Separated images for each metaphase and fluorochrome were obtained and merged with Adobe Photoshop CS6. In parallel, muscular tissue was fixed in ethanol and preserved at -20 °C, and total DNA was isolated using the EZNA Mollusc DNA Kit (Omega Bio-Tek). DNA integrity was evaluated using a Genomic DNA ScreenTape (Agilent) on a Tapestation 4200 (Agilent), and the DNA concentration was determined through the Qubit 4 fluorometer Broad Range Assay (Invitrogen).

Table 1 Sampling locations of N. lapillus, indicating the chromosomal races

Whole-genome short-read sequencing

We outsourced the sequencing of four N. lapillus DNA samples—two from Brest, France (2n = 26), and two from Udra, Spain (2n > 26)—to Macrogen in South Korea for whole-genome DNA Illumina paired-end sequencing. Whole-genome DNA libraries were prepared using an Illumina TruSeq DNA PCR-free kit with 350 bp insert size. Subsequently, the libraries were subjected to paired-end sequencing on an Illumina NovaSeq 6000 System, employing a 300-cycle NovaSeq 6000 SP Reagent Kit v 1.5 and resulting in a final yield of 8 (Brest samples) and 30 (Udra samples) Gb per sample. These accounted for circa 2000× of the complete mtDNA.

Whole-genome long-read sequencing

Whole-genome long-read libraries were obtained using a sequencing kit by Oxford Nanopore Technologies Ltd (ONT, SQK-LSK109) on gDNA from a single specimen from Udra (Spain, 2n > 26) subjected to size selection using the Short Read Eliminator kit (Circulomics), end-prep repair (NEBNext FFPE DNA Repair Mix, New England BioLabs) and adapter ligation (NEBNext Ultra II Ligation Module, New England BioLabs). These libraries were loaded in MinION R9.4 flowcells (Oxford Nanopore Technologies Ltd; FLO-MIN106 rev-D), controlled by MinKNOW v19.12.5. Sequencing raw data was converted to fastq with Guppy v6.5.7 in super accuracy mode. The final yield accounted for 1 Gb, representing approximately 70× of the mtDNA.

Mitochondrial DNA assembly

Primary assembly of the mitochondrial genome was conducted using MITObim v1.9.1 with default parameters (Hahn et al. 2013), leveraging short-read sequencing data and employing a fragment of the Cytochrome C Oxidase I gene from N. lapillus (EU391582.1) to prime the assembly. Subsequently, Illumina paired-end reads were realigned to the resulting draft mtDNAs through BWA-mem v0.7.17-r1188 (Li and Durbin 2010). The alignments were then sorted, quality filtered (applying a threshold of − q60) and indexed using samtools v1.10 (Li et al. 2009). The resulting alignment files were employed for two rounds of polishing the draft mtDNAs using Pilon v1.23 (Walker et al. 2014), after which we assessed the accuracy of SNPs by visual inspection of the mapped paired-end reads and structural artifacts by mapping nanopore reads onto the draft with minimap2 v2/2.24 (Li 2018) and variant calling using sniffles v2.2 (Sedlazeck et al. 2018). The unassembled portion of the control region was deduced from ONT long reads, as generated by flye v2.9 (Kolmogorov et al. 2019) and self-corrected with medaka 1.7.2 (https://github.com/nanoporetech/medaka). We employed the MITOS2 web server (Donath et al. 2019) to annotate all coding and RNA genes on the resulting polished mitochondrial genomes. The mitochondrial genetic code for invertebrates was applied during the annotation process and the RefSeq63 dataset for metazoans served as reference data. We manually curated the annotations to fit the ORF predictions by ORF-FINDER (Rombel et al. 2002) and other available Muricidae mitogenomes available in Genbank.

Mitochondrial multiplexed PCR

To generate a larger cohort for the mitochondrial DNA of N. lapillus, a set of partially overlapping PCRs were designed (Supplementary Table 1) using Primal Scheme following previously established protocols (Quick et al. 2017). Genomic DNA was extracted from specimens of N. lapillus from different Spanish, French and British populations whose chromosomal number was previously established (Table 1). Their mtDNAs were then amplified in 26 ~ 650 bp fragments from two pooled multiplexed PCRs, using 1× Q5® High-Fidelity Master Mix (New England Biolabs). PCR products were purified, end-prep repaired, and ligated to ONT-specific barcoding adapters (Native Barcoding Expansion kits 1–12 EXP-NBD104 and 13–24, EXP-NBD114, Oxford Nanopore Technologies Ltd) using the NEB Blunt/TA Ligase Master Mix (New England Biolabs). After equimolar mixing and purification of the adapted fragments, library adapters (Adapter Mix II, Oxford Nanopore Technologies Ltd) were ligated to the libraries in a single reaction using Quick T4 DNA Ligase following the manufacturer’s recommendations. The resulting libraries were then sequenced in MinION R9.4 flowcells (FLO-MIN106 rev-D), controlled by MinKNOW v19.12.5, and the data was base called in “high-accuracy” mode 8 (dna_r9.4.1_450bps_hac.cfg), de-multiplexed assuming barcodes in both ends of the inserts and filtered to ensure a Phred quality score > 7 with Guppy v6.5.7. All downstream processing, including trimming, alignment, variant identification and consensus recovery of mitochondrial DNA sequences, was performed using the ARTIC bioinformatics minion pipeline (Pater et al. 2021), prefiltering chimeric reads below 400 bp and above 700 bp and normalizing coverage to 400 reads per position.

Phylogenetic analyses

The mitochondrial sequences corresponding to the 13 protein-coding genes (PCGs) and the two ribosomal RNA (rRNA) genes from the 40 sequenced N. lapillus and the eight available species of the family Muricidae at Genbank (Thais luteostoma (TLU, NC_039165), Chicoreus torrefactus (CTO, NC_039165), Boreotrophon candelabrum (BCA, NC_046505, Tian et al. 2019), Menathais tuberosa (MTU, NC_031405, Sung et al. 2016), Ocinebrellus falcatus (OFA, NC_046052, Hao et al. 2019c), Ocinebrellus inornatus (OIN, NC_046577, Hao et al. 2019a), Ceratostoma burnetti (CBU, NC_046569, Hao et al. 2019b) and Ceratostoma rorifluum (CRO, NC_046526, Tian et al. 2019) were extracted, concatenated and aligned with MUSCLE v3.8.425 (Edgar 2004). We determined the best-fit nucleotide substitution model using JModelTest2 (Darriba et al. 2012). Subsequently, we constructed a maximum-likelihood phylogeny using PhyML (Guindon et al. 2010) with 100 bootstraps. To infer the evolutionary relationships among populations of N. lapillus, we employed the starBEAST multispecies coalescent model within BEAST v2.6.2 (Bouckaert et al. 2019). Our analysis utilized a Yule speciation prior and a strict molecular clock. The best-fit nucleotide substitution model, determined by jModelTest2 (Darriba et al. 2012), was applied to the concatenated mitochondrial haplotypes, which included 13 PCGs and two rRNA genes. For this analysis, we defined the seven sampling points (Cawsand, Whitsand Bay, Brest, Granville, Roscoff, Udra and Senín) along with nine outgroup species as tips for the species tree. We ran a single MCMC simulation for ten million iterations, with samples collected every 1000 steps. A burn-in period of 10% was used to ensure that effective sample size values exceeded 200, as assessed with Tracer v1.7.1. The resulting posterior distributions of trees were visualized using DENSITREE v2.1 (Bouckaert 2010). Finally, a maximum clade credibility tree was generated with TreeAnnotator (Bouckaert et al. 2019) to summarize the topology information, with a 10% burn-in and common ancestors for node height calculations. We assessed the genetic distance within and between the sampling points based on the Kimura-2 parameter (Kimura 1980) in MEGA (Kumar et al. 2018). We conducted a PCA analysis based on the nucleotide diversity of the obtained mitogenomes with the R package adegenet (Jombart 2008) and an analysis of molecular variance (AMOVA) by means of the package pegas in R (Paradis 2010) to quantify how the genetic variability is partitioned among and within geographical populations and chromosomal races.

Results

Karyological analysis

Our cytogenetic analysis revealed specimens of exclusively 2n = 26 chromosomes in the French populations from Brest and Granville and the British locality of Whitsand Bay, while all specimens examined from Roscoff (France) and Cawsand (UK) belonged to the 2n > 26 chromosomal race. On the contrary, both chromosomal races were found to coexist in the Spanish sampling sites of Udra and Senín (Table 1). Examples of metaphase plates with different chromosome numbers are shown in Fig. 1.

Fig. 1
figure 1

Metaphase plates from Nucella lapillus specimens analyzed in this study holding 26 (a, Whitsand Bay—UK), 29 (b, Cawsand—UK) and 31 (c, Roscoff—France) chromosomes. Scale bars, 5 μm

General features of N. lapillus mitochondrial genome

Preliminary drafts of the mitochondrial genome draft of four N. lapillus specimens based on short reads yielded single circular DNA molecules, 15,337 bp in length. Our analysis revealed that the gene arrangement of these specimens matched the content and ordination of the Muricidae family, which comprised 37 genes, including 13 PCGs, 22 transfer RNA (tRNAs) genes and 2 ribosomal RNA genes. Our analysis also revealed that the mitochondrial genome of N. lapillus specimens contained two copies of tRNA genes for serine (trnS1 and trnS2) and leucine (trnL1 and trnL2), as found in the vast majority of metazoans (Tomita et al. 1998). Notably, all of the features observed in our study were encoded on the D-strand, with the exception of eight tRNA genes, the seven tRNA gene array comprised between atp6 and rrnS, and the trnT(tgt), which were found in antisense.

Upon closer inspection of the preliminary draft based on short reads only, it became apparent that the control region and its regulatory elements were mostly absent. Indeed, remapping of Illumina reads onto the preliminary draft assemblies revealed a small region devoid of coverage despite high coverage in the immediately adjacent regions (Fig. 2a). This gap suggested the presence of an artifact in the assembly, possibly due to a repeat in the control region that was longer than the Illumina library insert size and thus not fully resolved by neither Mitobim nor Pilon (Fig. 2a). Upon variant calling using sniffles on nanopore reads mapped on this first preliminary draft, a 1037 bp insertion became apparent. To address this issue, we utilized long-read sequencing technology to reconstruct this conflictive region, which included a 450 bp perfect inverted repeat flanking an AT microsatellite 299 bp in length (Fig. 2a). Incorporating this inverted repeat, the complete draft for the mitochondrial DNA of N. lapillus was found to be 16,474 bp in length (Fig. 2b).

Fig. 2
figure 2

Complete assembly of the mitochondrial genome of N. lapillus. (a) Schematic comparison of the control region of the mitochondrial genome drafts of N. lapillus, as recovered by nanopore long reads (Flye and Medaka) and Illumina paired-end short reads (Mitobin and Pilon), highlighting the inverted repeats (IR) and AT microsatellite (MS) at the control region absent from the short-read only based assembly. In gray, mean read coverage (40 × ONT, 500 × Illumina). (b) Circular map of the complete mitochondrial genome of N. lapillus

As predicted by mitos2, some of the genes required manual curation. Such is the case for the ORF of the nad4 gene that extends beyond its expected length and partially overlaps with the adjacent tRNAH sequence toward the 3′ end, until reaching the nearest theoretical stop codon. This was due to an incomplete UAA codon, terminated upon polyadenylation of the resulting mRNA.

Multiplexed PCR genome sequencing

We used a targeted mitochondrial multiplexed PCR approach to sequence 40 N. lapillus specimens, including 19 with 2n = 26 and 21 with 2n > 26 chromosomes from different locations. The reads were aligned to the first N. lapillus mitochondrial draft genome, with a range of 2422 to 9714 reads recovered per genome (mean = 9364; SD = 3012), and the depth of coverage varied from 105.233X to 407.967X (mean = 393.34X; SD = 123.20X), according to analysis after normalization with the ARTIC bioinformatics pipeline (Supplementary Table 2). However, coverage decreased significantly in all samples due to the lack of primer efficiency at amplicon 10 and an almost complete dropout for amplicon 20, which spans the tandem inversion on the control region, and to the loss of both the 5′ (32 nucleotides) and 3′ (276 nucleotides) termini regions beyond the most outer primer binding sites (Fig. 3). We verified the accuracy of this multiplexed PCR nanopore sequencing approach for SNP calling by comparing it with whole-genome Illumina sequencing for a single specimen from Udra, Spain (NLA_SP_U01). Following established strategies (Mariner-Llicer et al. 2021), we achieved nearly concordant consensus sequences between the two platforms, with 53 out of 54 true positive called SNPs and 3 out of 3 filtered true negatives, resulting in an F1-score of 0.99, high agreement (98.47%), accuracy (98.25%) and a perfect true positive rate (100%), as well an observed error rate of 1.85%, thus demonstrating the reliability of this pipeline for sequence comparison and analysis. Despite incomplete genome coverage resulting from these coverage dropouts, this protocol allowed for sufficient recovery of almost the complete mtDNA, producing reliable consensus sequences for all tested samples to undergo phylogenetic analysis.

Fig. 3
figure 3

Whole N. lapillus mitochondrial genome coverage plot. We performed nanopore amplicon sequencing of the entire N. lapillus mitochondrial genome, as PCR multiplexed with two sets of 700 bp partially overlapping amplicons (Set-1 and Set-2)

Phylogenetic analysis of mitochondrial sequences

Phylogenetic analysis was conducted on a sequence alignment encompassing 12.8 kb of most of the mitochondrial genomes (13 PCGs and 2 rRNA gene sequences) of 40 N. lapillus specimens and the other available Muricidae mitochondrial genomes in Genbank. The resulting phylogenetic tree shows the corresponding genetic relationships, with N. lapillus samples clustering predominantly by population rather than by their chromosomal races (Fig. 4a, Supplementary Fig. 1). O. falcatus and O. inornatus are sister taxa to N. lapillus and form a clade that is sister to a clade composed of C. burnetti and C. rorifluum. We also conducted a multilocus species tree analysis using a dataset comprising the 13 PCGs and the two rRNA genes from the mitochondrial genomes (Fig. 4b), treating the collecting locations as separate entities for species classification. These results highlight that these populations exhibit a clearer clustering pattern based on geographical distinctions rather than by their chromosomal race. Most northern sampling points in the UK and France (Whitsand Bay, Cawsand, and Roscoff) form a distinct cluster separate from the Spanish locations, with the notable exception of Granville, which shows closer genetic affinity to the Spanish samples. Additionally, a PCA analysis based on pairwise genetic distances among all sequenced specimens further supports this population-based clustering (Fig. 4c).

Fig. 4
figure 4

Phylogeny, species tree and principal component analysis (PCA) of seven N. lapillus populations. (a) A maximum-likelihood phylogenetic tree based on the mitochondrial DNA sequences of seven populations of N. lapillus and the available Muricidae mitogenomes. ML bootstrap percentages > 50 are indicated at the nodes. Scale bar shows nucleotide substitutions per site. Chromosomal races are indicated as green (2n = 26 chromosomes) or red (2n > 26 chromosomes) squares at the tips. A proportional branch transformed version of the same phylogenetic tree is provided in Supplementary Fig. 1. (b) Multispecies coalescent tree of the Muricidae family based on their mitochondrial DNAs. (c) Principal component analysis of N. lapillus populations

Uncorrected K2P pairwise distances averaged 0.0018 across the population clades identified in Fig. 4b, underscoring a low genetic variation among the studied populations (Table 2). To further dissect genetic variation within and between populations of N. lapillus and assess the impact of chromosomal race on mitochondrial DNA variation, separate AMOVAs were conducted. Our results revealed a moderate genetic differentiation between populations, with 22.0% of the total variation attributable to differences between sampling points (Fst = 0.318, p < 0.05), while the remaining 78.0% was due to differences within populations, suggesting a moderate genetic differentiation between populations. However, chromosomal race exerted no significant effect, explaining only 0.3% of the total variation (Fst = 0, p > 0.05). These findings suggest that although there is moderate genetic differentiation between geographical populations, chromosomal race does not significantly influence the observed genetic variance in the mitochondrial DNA.

Table 2 Average evolutionary divergence estimates (K2P) based on the mitochondrial DNAs of N. lapillus within and between population sampling points

Discussion

This study presents the first assembly of mitochondrial DNA sequences for N. lapillus, laying the groundwork for future genomic comparative analyses in this species. The obtained sequences were used to investigate the mitochondrial genetic diversity and differentiation among seven populations spanning a broad geographic range and including both the 2n = 26 and 2n > 26 chromosomal races.

Assembling mitochondrial genomes from short-read data remains a formidable task, even with the use of high-throughput sequencing technology and rigorous data filtering (Formenti et al. 2021). The accuracy of the obtained sequences can be impacted by various factors, such as the coverage depth, read quality and the presence of repetitive elements that are longer than the average insert length and thus caution should be taken when interpreting the results, especially regarding the identification of rare or divergent haplotypes (Keraite et al. 2022). Our study also highlights the limitations of assemblies based uniquely on short-read technologies, as this approach was unable to generate a complete mitochondrial genome due to the presence of repetitive regions on it. In this regard, repeats in the control region of mitochondrial DNA, whether microsatellites or longer motifs, are in fact quite common in many taxa and are likely underrepresented in many of the available reference mitogenomes due to the intrinsic limitations of short-fragment assemblies (Formenti et al. 2021). This limitation may also apply to the other nine available Muricidae mitogenomes stored at Genbank, which were all obtained using short Illumina reads and present much-reduced control regions. The resolution of these conflicting regions often requires long-read sequencing for hybrid assembly to achieve complete resolution (De Coster et al. 2021). In fact, recent revisions of multiple vertebrate mitogenomes using long-read sequencing revealed widespread repeats and gene duplications that were previously undetected by short-read assemblies, particularly at the control region (Formenti et al. 2021).

Mitochondrial genes have been reported to exhibit incomplete stop codons ending in U or UA in some species, including humans. These are known to become complete termination codons (UAA) upon subsequent polyadenylation of the mRNA (Hou et al. 2006; Oh et al. 2007; Ki et al. 2010; Temperley et al. 2010). This feature has also been reported in the mitochondrial genomes of other Muricidae gastropods, including Ocinebrellus falcatus (NC_046052), Ocinebrellus inornatus (NC_046577), Ceratostoma burnetti (NC_046569) and Ceratostoma rorifluum (NC_046526), all with the notation “TAA stop codon is completed by the addition of 3′ A residues to the mRNA” and “transl_except = (pos:10,711, aa:TERM).” The nad4 gene in N. lapillus spans 1372 bp, equivalent to 457 triplets and a single terminal T, which would be completed to a TAA stop codon with the addition of two 3′ A residues to the mRNA. This translational anomaly is worth noting, as it contributes to the diversity of mitochondrial genomes in the Muricidae family and may have implications for protein synthesis and function (Temperley et al. 2010).

We found no rearrangements of the mitochondrial genomes of N. lapillus with respect to the other available muricid mitogenomes, which is in contrast to the high level of variability that is typically observed in mitochondrial gene ordination in molluscs, even within the same genus (Varney et al. 2021). We speculate that this could be attributed to the recent radiation of the muricids, which may not have allowed enough time for their mitochondrial DNAs to undergo such structural variations. Nevertheless, it is important to note that this is but a minuscule representation of the species within this family, comprising more than 1600 recognized taxa (Barco et al. 2010). Therefore, a comprehensive analysis of the evolution of their mitogenomes would require a much larger effort encompassing a wider range of species.

The presence of inverted repeats in the control region of the mitochondrial genome of N. lapillus might be a common feature in the Muricidae family, since the available mitogenomes for this family at NCBI, based solely on short reads, apparently lack the necessary control region, likely due to these repeats collapsing during incomplete draft genome assembly. Otherwise, the control region provides valuable genetic information, accumulating more mutations as it is less subjected to evolutionary constraints. In this regard, length polymorphisms at the DNA repeats on it can be used to discriminate between species and populations, and thus may have taxonomical applicability in discerning mitochondrial lineages and establishing intra- and interspecific relationships. Therefore, understanding and properly sequencing these control regions is crucial for comprehensive analysis of the mitochondrial genome and its evolutionary history in different taxa. For instance, DNA repeats have been found to be useful in distinguishing populations of other species, such as the green sea turtle Chelonia mydas (Encalada et al. 1996) and the Japanese eel Anguilla japonica (Sang et al. 1994), where they have been used to assess population structure and connectivity. The inverted repeats and an internal microsatellite at the control region of N. lapillus could vary in length between geographically or ecologically distant animals, providing a valid marker for population analysis. This possibility will be explored in future research.

Various PCR approaches have been successfully employed for the retrieval of complete mtDNAs across diverse species. These include, among others, PCR-tiling techniques, as demonstrated by Xin et al. (2017), or primer walking strategies, exemplified by Serb and Lydeard (2003). These targeted approaches facilitate the retrieval of complete mitochondrial genomes with a high level of accuracy, offering a cost-effective alternative to whole-genome sequencing (Quick et al. 2017). Beyond cost-efficiency and accuracy, these PCR-based methods boast scalability, enabling simultaneous analysis of multiple samples within a single sequencing run. This scalability not only further reduces overall costs, but also significantly enhances analysis throughput. In addition, the multiplex PCR approach is highly efficient and only requires small amounts of starting DNA, which is especially useful for the study of rare or endangered species, where sample availability can be limited. The application of multiplex PCR coupled with ONT sequencing for the recovery of mitochondrial DNAs presents a valid and cost-effective alternative to whole-genome NGS sequencing for phylogenetic analysis (Walsh et al. 2022; Karin et al. 2023). Overall, this approach represents a valuable and cost-effective approach to the recovery of mitochondrial DNA for phylogenetic analysis.

In this study, we observed a low genetic diversity and moderate geographical differentiation in mitochondrial DNA among N. lapillus populations, with no apparent differentiation between chromosomal races. This contrasts with previous findings from nuclear markers such as allozymes (Day et al. 1993; Kirby et al. 1994), microsatellites (Guerra-Varela et al. 2009) and mitochondrial genes (Kirby 2000), which implied partial genetic isolation between both races, leading us to anticipate at least some genetic differentiation on their mitochondrial DNAs. One possible explanation for this lack of mtDNA differentiation could be an ancient introgression and replacement of either mitochondrial haplotype. Both chromosomal races likely underwent differentiation during a period of allopatric isolation. Subsequently, a secondary overlap and introgression would have mitigated the genetic differentiation on neutral markers, such as the mitochondrial DNA (Panova et al. 2011). After this, isolation by distance would have played a role in the genetic differentiation between populations at present, leading to the distinct evolution of the current mitochondrial haplotypes. Similar results have been reported in other marine gastropods, such as Littorina spp. A study by Johannesson et al. (1993) found evidence of mitochondrial introgression between subspecies of L. saxatilis, while other authors reported a similar process between L. fabalis, L. obtusata and L. arcana (Panova et al. 2011; Sotelo et al. 2020). Mitochondrial DNA introgression has also been documented in other sessile marine animals, such as corals (Fukami and Knowlton 2005) or sponges (Turon and López-Legentil 2004). These studies highlight the potential role of mitochondrial DNA introgression in driving genetic differentiation and speciation in marine organisms (Sotelo et al. 2020).

Nonetheless, it is also conceivable that, driven by specific selective pressures, the karyological evolution outpaced that of the mitochondrial DNAs, otherwise subjected to neutral selection. Indeed, a correlation between the chromosomal polymorphism in N. lapillus and Darwinian fitness has been suggested before (Pascoe 2006), although the extent to which this structural variation exerts, or has ever exerted, such an effect remains uncertain. The frequent occurrence of spontaneous somatic Robertsonian translocations in 2n > 26 specimens at present (Dixon et al. 1994; Pascoe 2006) implies a genetic predisposition to such rearrangements, also potentially triggered or intensified by environmental factors, providing additional support to this hypothesis. Interestingly, this scenario would also open up the possibility of parallel karyological evolution, for which either chromosomal race might have evolved independently in multiple locations on similar ecological niches. Thus, the absence of significant genetic differences between the mitochondrial DNAs of both chromosomal races may be interpreted as a consequence of disparate evolutionary rates between the karyotypes and mitochondrial DNAs. Further research is deemed to explore the possible drivers of genetic differentiation and the potential fitness implications of the karyological differences between both chromosomal races of N. lapillus.