Introduction

Many regional reviews and assessments have appeared around the world in recent decades seeking to document the diversity of non-native marine species. A significant challenge in comparing and contrasting these studies—with a goal, for example, of identifying global bioinvasion patterns—is that the methods employed to evaluate non-native (and at times cryptogenic) species diversity vary widely, resulting in inconsistent coverage of taxonomic diversity, habitats and historical depth, even in ostensibly well-studied regions. It may thus be difficult to interpret whether strikingly different numbers of non-native marine species in different areas of the world, or within the same taxonomic groups across regions, reflect (1) differential invasion patterns or (2) different assessment criteria and capabilities.

Spatial invasion patterns of interest may center on invasion susceptibility and resistance, including whether temperate marine regions are more invaded compared to tropical regions (Hutchings et al. 2002; Freestone et al. 2013; Giachetti et al. 2020) or to high-latitude regions (de Rivera et al. 2011), whether at the same latitudes there are asymmetric invasion patterns across oceans (Torchin et al. 2021), whether invasions differ between regions influenced by maritime versus continental climates (Ruiz et al. 2000, 2015), or whether some habitats are more susceptible to invasions than others (Wasson et al. 2005; Ruiz et al. 2011, 2015). Similarly, temporal patterns of invasions may be of interest, to assess invasion rates or the effectiveness of management strategies (Byers et al. 2015; Seebens et al. 2017; Bailey et al. 2020). However, the accuracy of species’ first collection or detection dates provided in such assessments is often highly variable as well, varying from coarse- to fine-grained, creating potential challenges for resolving clear patterns and making global comparisons.

Differences in the breadth, depth, and accuracy of the assessment of invasion diversity may result from a number of different factors and approaches. A number of authors have discussed the potential limitations, challenges, and errors inherent in assembling inventories of non-native species (Wasson et al. 2000; Ruiz et al. 2000; McGeoch et al. 2012; Rocha et al. 2013; Ojaveer et al. 2014a, b; Marchini et al. 2015; Marchini and Cardeccia 2017; Katsanevakis and Moustakas 2018; Albano et al. 2021). Many studies do not (1) acknowledge the diversity of understudied taxonomic groups (such as protists, flatworms, nemerteans, nematodes, sponges, hydroids, polychaetes, bryozoans, and many others) which may have not been assessed thoroughly (if at all) for the presence of non-native species. Many studies also do not (2) appear to capture the full extent of the regional historical biodiversity literature, resulting in overlooking a potentially large number of earlier records if not earlier invasions. Further, independent of historical work, by not considering the potential (3) phyletic (even if well studied) and (4) habitat breadth that might include or support non-native species, many invasions documented in more modern literature in a region may also be simply overlooked. Critically, most studies do not appear to (5) employ the reassessment techniques available to expose the diversity of non-native species that may have long been incorrectly regarded as native.

Thus, simply reviewing and summarizing databases and selected literature on previously reported invasions for a given region in order to produce a working list of non-native species may result in overlooking a substantial fraction of the non-native biota. Concomitantly, accepting at face value species reported without sufficient justification as being non-native may be problematic. More challenging may be a widespread but rarely stated assumption that online databases that provide lists of non-native species are both comprehensive and have been expertly vetted. Finally, the advent of molecular techniques as a tool by which to assess invasion diversity, in the absence of expert morphological taxonomic confirmation of the results obtained from sequencing, may lead to significantly inaccurate reports of non-native species, as discussed below.

By employing an extensive suite of research techniques that we detail here, work in the Hawaiian Islands resolved the previous recognized number of marine bioinvasions from 90 to 333 (Carlton and Eldredge 2009, 2015), in the Galápagos Islands from 5 to 53 (Carlton et al. 2019) and in Argentina/Uruguay from 29 to 129 (Schwindt et al. 2020). Nevertheless, and despite the significant expansion in recognition of the actual scale of non-native species present in these locations, these assessments remain incomplete relative to historical and cryptic invasions. The biogeographic status of many of these newly-recognized non-native species was often not discussed in the pertinent regional literature, because they were long assumed, in the absence of systematic investigation, to be native species.

Our objective is to present a comprehensive guide to the methods and techniques that we have developed and applied over the past 50 years to assess the diversity of non-native and cryptogenic marine species and to either resolve or estimate the timing of first detection of such species. Many of these approaches are also applicable to investigations in freshwater and terrestrial systems. We discuss the following topics, summarized in Fig. 1, that permit a deeper unveiling of the often remarkable, but hidden, diversity of non-native species and their potential detection dates.

Fig. 1
figure 1

Schematic workflow by which to assess the diversity and history of marine bioinvasions and cryptogens. The first step is to consider candidate species across the fullest possible taxonomic and habitat breadth to search for species from four different literature and field sources. The second step is to filter the resulting species inventory with the criteria for recognition of non-native species, which will produce the first draft of an invasions and cryptogen assessment. The third step is to perform a deeper exploration of invasion timing, resulting in a vetted inventory of the diversity, history, and timing of bioinvasions and cryptogens for the region under consideration

  • Preamble I: Definitions and Terminological Considerations

  • Preamble II: The Sine Qua Non of a Robust and Reliable Taxonomic Foundation

Establishing Assessment Breadth

  1. I.

    Establishing Target Taxonomic and Habitat Breadth

    1. IA.

      Phyletic Diversity

    2. IB.

      Habitat Diversity

Searching for Invasions

  1. II.

    Literature Surveys

    1. IIA.

      Regional Invasion and Taxonomic Literature, and the Need for Search Term Diversity

    2. IIB.

      Global Taxonomic and Invasion Literature with “Hidden” Invasion Records

  1. III.

    Reassessment of the Status of “Native” Species

    1. IIIA.

      Targeting Candidate “Native” Species for Re-Evaluation of Biogeographic Status

  1. IV.

    Field Surveys and Sampling

    1. IVA.

      Colonization Substrates (Passive Sampling)

    2. IVB.

      Rapid Assessment Survey (RAS)

    3. IVC.

      Extended Site Surveys

    4. IVD.

      eDNA Metabarcoding and qPCR Surveys

    5. IVE.

      Invited Experts: Focus on Specific Taxonomic Groups

Filtering the Record

  1. V.

    Criteria for Recognition of Non-Native Species

    1. VA.

      Species to be excluded from calculations of regional marine invasion diversity

    2. VB.

      Review of criteria for recognition of non-native species

    3. VC.

      Retention in invasion assessments of species not recently reported

    4. VD.

      Type localities are not default native regions

Establishing Invasion Timing

  1. VI.

    Methods for Establishing the Timing of Invasion Records, and the Importance of Not Using Publication Dates

    1. VIA.

      Methods for Determining Earlier Dates of Collection or Detection

Not treated in depth in the present essay are methods by which to determine (1) the known or probable geographic origin of a recognized invasion—that is, the presumptive native (indigenous or endemic) region, and (2) the known or probable vectors involved in transporting a species of concern to a new location.

Preamble I: definitions and terminological considerations

Community species diversity generally changes in three ways. Species populations increase and decrease, new species arrive, and species disappear. Our consideration here focuses on species arrivals—the additions—to communities over space and time. A core question in community ecology and evolution is how the addition of novel species may alter community structure and function, in terms of the abundance and distribution of resident species, predator–prey and parasite-host dynamics, competitive networks, and energy flow, among other phenomena.

Additions to marine communities may occur in two ways: species may be transported by, for example, ocean currents or birds, or be transported by human activity. In ecological and evolutionary literature, both are considered biological invasions. For example, Braun (1921), Lindroth (1957), and Simberloff (1976) described the natural processes of historical plant movements, the arrival in Iceland of certain beetle and moth species, and the experimental colonization of mangrove islands, respectively, as invasions. Chapter 2 of Charles Elton’s 1957 “The Ecology of Invasions by Animals and Plants” is devoted to examples of natural species range expansions (Elton 1957). MacArthur and Wilson (1967) described the “fundamental process” of biogeography as “dispersal, invasions, competition, adaptation, and extinction.” Carlton (1987) noted that “Biological invasions in marine communities occur through two processes, range expansions and introductions.” Williamson (1996) wrote that “Biological invasion happens when an organism, any sort of organism, arrives somewhere beyond its previous range,” and, “Invasions have been an important component of the evolutionary process throughout geological history.” Vermeij (1996) emphasized that, “By invasion I mean the geographical expansion of a species into an area not previously occupied by that species. Invasions may occur as the result of climatic and tectonic changes as well as through introduction by humans.”

Historic and geological movements of species have also long been described as invasion, invading, and invader phenomenon (Linderg 1991; Vermeij 1991). The Great American Biotic Interchange (GABI)—the late Cenozoic range expansion of diverse species between North America and South America following the creation of the Isthmus of Panama—has been framed for decades as a textbook model of biological invasions (di Castri 1989; Leite et al. 2014). Knapp et al. (2021) delve into paleogenetic tools relative to “ancient biological invasions.”

However, commencing in the early 2000s, the concept of “invasion” became restricted, relative to “alien flora,” to “where the distribution and abundance of plants changes as a result of human activity” (Pyšek et al. 2004). In essence, the definition of biological invasions changed to a focus on the vector. This re-definition gained subscription in the years to follow (for example, Wilson et al. 2016; Blackburn et al. 2020; Robinson et al. 2020; Hulme 2021; Convention on Biological Diversity (https://www.cbd.int/invasive/WhatareIAS.shtml, accessed August 2023)).

Anchored by fundamental interests in community dynamics, and the ecological, evolutionary, economic, and other consequences of species additions, we continue here the long-standing definition that biological invasions include both human-mediated introductions and range expansions.

Biological invasions, or in shortened form, bioinvasions, thus refer to the process of species arriving by any vector and establishing reproducing populations in a geographic region where they did not exist in historical time. Bioinvasions may also be defined as those species found outside of their native range, usually thus having transgressed one or more physical or climatic barriers to dispersal. However, for a great many marine species it may be challenging if not impossible to unequivocally assess accurately the true boundaries of their native, natural ranges. Anthropogenic movement of many marine species began centuries if not millennia prior to the onset of biological studies (Carlton 1999b; Ojaveer et al. 2018) and thus the assumed native range of a potentially large number of species is in reality not known. Through archeological, historical, genetic or other evidence it may be possible to delineate prehistoric ranges before human-mediated transport commenced, but in the absence of such data, the purportedly extraordinarily broad latitudinal and longitudinal (especially “pantropical”) ranges of many marine species should not be assumed to be natural (Carlton 1987; Darling and Carlton 2018).

For historical biogeographers, paleoecologists (Webb; 2006; Bacon et al. 2015) and ecologists (Mooney and Drake 1986), a fundamental interest in bioinvasions is how a community may respond to the arrival of novel species, in terms of, for example, potential changes in resident species’ abundance, diversity, distribution and interactions. For the public and government, including environmental managers, interest may further focus on how invasions do or will impact the quality of ecosystem services, the viability of rare and endangered native species, cultural values, the economy, and public health, as well as how to institute potential pre-invasion or post-invasion management strategies (Ruiz and Carlton 2003; Veitch et al. 2011; Wan et al. 2017; Giakoumi et al. 2019; Zengeya and Wilson 2020; Simberloff 2021).

We note that invasion (defined above), invasive (a non-native species that “spreads” or has some measure of “impact”) and invasiveness (“The features of an alien organism, such as their life history, traits and modes of reproduction that define their capacity to invade, i.e., to overcome various barriers to invasion” (Richardson et al. 2011)) are three distinct concepts and are not synonyms.

Introduced species (synonyms and related terms are treated below in Section “IIA. Regional invasion and taxonomic literature, and the need for search term diversity”) are those which have been transported by human activities into a region where they were previously absent and which have become established as evidenced by the presence of self-sustaining reproducing populations. Introductions have occurred for many reasons over millennia and result in a vast array of impacts ranging, along imperceptible gradations, from difficult to detect (in the absence of experimental or pre-invasion quantitative evidence) to significant changes in the recipient community. We note that the lack of reports of impact is not equivalent to the lack of impacts. Introductions are facilitated by a long list of anthropogenic vectors (Ruiz and Carlton 2003; Williams et al. 2013; Grosholz et al. 2015; Fowler et al. 2016; Ojaveer et al. 2018), including, now, rafting marine debris (Carlton et al. 2017; Rech et al. 2018a, b; Haram et al. 2021).

While we suggest no fixed before-after dates by which to recognize a species as transported by human activity, we note the following examples that would all be considered introduced species from both an ecological and evolutionary point of view: the introduction of dingoes into Australia by humans more than 3500 years ago (Fillios and Tacon 2016; Cairns 2021), the plants and animals introduced to Britain by Romans (AD 43–AD 410; Webb 1985; Witcher 2013) as well as the introduction by Polynesians (commencing circa AD 1000–1100; Athens et al. 2014) of plants and animals to the Hawaiian Islands. Webb (1985) succinctly captured this concept: “The frequent practice of treating long-established aliens as equivalent to natives gives rise to confusion in discussions of phytogeography or quaternary history: the former may be hard to distinguish from natives, but they are aliens none the less.” Crees and Turvey (2015) provide examples of even older introductions.

In concert with this, we do not find compelling ecological or evolutionary support for distinguishing, as archaeophytes and neophytes, human-mediated introductions that occurred in Europe before and after, respectively, Columbus’ first voyage in 1492 (Richardson et al. 2000; Essl et al. 2018), long after transatlantic anthropogenic introductions had commenced (Dugmore et al. 2005; Essink and Oost 2019). We note that certain regional cultural, social, economic, political, or legal—but not scientific—perspectives may regard some earlier introductions as “native”.

Native (indigenous, endemic, autochthonous) species are those that have been historically present in a region, as determined by paleontological, archeological, biogeographic, molecular, and other evidence (see Crees and Turvey 2015) for a nuanced and extended discussion).

Cryptogenic species are taxa of a known identity (to varying taxonomic levels, as discussed below) whose evolutionary and biogeographic origins are poorly described or not yet known, and thus cannot yet be resolved as either non-native or native (Carlton 1996). As noted by Carlton (2009), cryptogenic species are not (1) non-native species of uncertain geographic origin, (2) cryptic invaders, or (3) introductions whose mechanism of transport is uncertain. We underscore the latter: Evans et al. (2020) inaccurately extended the definition of cryptogenic to “include species for which uncertainty exists as to whether their introduction was human-mediated or not,” and thus calculated the number of “true newcomer” fish in the Mediterranean that were “cryptogenic.” The term “polyvectic” (below) correctly covers this concept. Cryptogenic species are also not (4) non-native species represented by both native and introduced genes (Yund et al. 2015), (5) species whose introduction occurred long ago and were not witnessed (Zenetos et al. 2005), nor (6) species whose type locality (the location where a species was originally described) is outside of the region under study. Critically, assigning a species cryptogenic status should (7) not mean that they should be evaluated as if they were not-native (Blackburn et al. 2014) or imply that “it is guilty of being introduced until it is proven otherwise” (Campbell et al. 2018). By definition, a cryptogenic species is one that has not yet been determined to be non-native or native.

It is important to emphasize that in some cases species may be considered, based upon biogeographic or other evidence, as non-native or cryptogenic even if only resolved to family or genus level. That is, taxa that cannot be resolved to species level do not automatically default to an unassigned status. Biogeographers regularly recognize taxa, unresolved to lower taxonomic levels, as members of clearly allochthonous clades that are demonstrably not, or unlikely to be, native to a given continent or ocean (Carlton and Eldredge 2009). Cryptogenic (and introduced) species may also include undescribed taxa, as discussed below (Sections “III. Reassessment of the status of “native” species” and “VD. Type localities are not default native regions”).

Unassigned taxa are insufficiently resolved taxonomically to be assigned a status of non-native, cryptogenic, or native. These taxa are also referred to as unresolved, undetermined, indeterminate, uncategorized, unidentified (sensu Carlton 2009) or data deficient (sensu Essl et al. 2018).

Range expansions typically refer (in a marine context) to the movement of species along shore, shelf, or island corridors, as well as to poleward movements in the open ocean, into regions where they were previously absent. Rare possible non-corridor exceptions exist, such as the ostensibly natural transoceanic dispersal of the cattle egret (Bubulcus ibis) from Africa to South America (Cele and Downs 2020). Range expansions include both the movements of recent introduced species whose post-introduction ranges expand to their natural physiological limits, and native species or older introduced species that are responding to environmental changes (such as habitat alterations or climate shifts). The phrase range extension is used in the literature, often without distinction, for two distinct phenomena: (1) range expansions as defined here, and (2) the discovery that a species’ presumed native range is more extensive than previously known. Such discoveries may be due to increased sampling effort, to the development of new sampling techniques, or to exploration of new areas (Lonhart 2009), rather than the establishment of new populations.

Once a species has arrived and established, further range expansions may be facilitated by both natural vectors (such as currents, winds, and birds) and by anthropogenic vectors (Richardson et al. 2011, the latter thus constituting introductions as well as range expansions.). Thus, after a species is introduced to a region, both (for example) coastal currents and ships (or other human vectors) may move it to a new region within a biogeographic province or between adjacent provinces.

Similarly, many native species responding to climate change and expanding poleward may potentially do so both by currents and by human vectors that are transporting them to regions which were historically not amenable to colonization. Indeed, such species may have been transported for centuries to higher latitudes, only now to find warmer temperatures now suitable for reproduction. Rather than assuming that species moving poleward are doing so naturally, the relative roles of natural dispersal versus anthropogenic transport must be determined and documented if possible. If evidence suggests that both natural and human-mediated vectors may be in play relative to the movement of a species, these would be considered polyvectic species (Carlton and Ruiz 2005, who note that “polyvectism is a significant management challenge”). We emphasize that if it is not known if a range-expanding species has been moved by, for example, ocean currents or vessel hull fouling, the assignable vector is not “unknown”: possible vectors are often in fact well known, but which vector(s) is or are responsible for a given introduction event may not be resolved (Carlton and Ruiz 2005). The term unknown vector (cryptovectic) is reserved for those very rare instances where no known dispersal mechanism explains the arrival of a new colonist (Carlton and Ruiz 2005). In the absence of data, however, the default is not natural dispersal.

Regardless, a vast number of species have been, are, and will be expanding their ranges poleward in both hemispheres as a result of warming waters due to human-induced climate change. These species are thus not in the category of fundamentally “natural” expansions uninfluenced by human actions, even if transported by ocean currents or birds. While some species may be dispersing naturally, those responding to anthropogenic habitat and climate change are not expanding naturally. Species responding to climate change and moving poleward, whether by natural or anthropogenic means, are not native in the regions which they have newly colonized and where they were historically absent.

The fact that range expansions of species may not be accommodated in legal, policy, or other definitions of introduced (alien, non-native) species in regional or international protocols does not mean they should be disregarded. To omit range expansions in treatments of regional invasions discounts their potentially significant impacts (see, for example, Ling et al. 2009; Strain and Johnson 2009, 2013; Henry and Sorte 2021). Our concerns relative to potential economic, societal, or ecological repercussions relative to the arrival and establishment of historically absent species remain fundamentally the same, whether a species is from overseas, or moving naturally poleward from lower latitudes, or is a human-transported native species from an adjacent region. The fact that species invading from adjacent provinces may (but not necessarily) have deeper ecological and evolutionary relationships with resident species in the invaded region (Sorte et al. 2010) versus with species from distant provinces, while of no small interest, does not remove the former from the category of biological invasions.

Preamble II: the sine qua non of a robust and reliable taxonomic foundation

Nearly all of the analytical methods that we describe below operate at the mercy of taxonomy—that is, the presumption of a robust and reliable taxonomic foundation, whether analyzing phyletic or habitat diversity of invasions, re-assessing a species’ biogeographic history, or in biological survey data. A working assumption is typically that the identifications of species being considered and lists of species being analyzing have been expertly vetted to the extent that our current understanding permits (acknowledging that molecular genetics and finer-grained morphological work may reveal that many taxa now under the umbrella of one scientific name are in fact species complexes; Darling and Carlton 2018). Thus, establishing at the outset the quality of both past and present taxonomic work, relative specifically to identifications of non-native species, is extremely critical. This said, given that the ability of non-specialists to assess taxonomic quality assurance and quality control may be limited (or non-existent), a path of least resistance is to simply take on faith the reported species names. The fundamental assumption of presumptive taxonomic expertise may be reflected in part by the fact that some genetic databases (discussed below) do not require reporting the name of the person who identified the species in question.

The detailed means by which ecologists, geneticists, biogeographers, modelers, and others could judge the taxonomic quality of a body of work is beyond the scope of the present work. However, we note several basic steps that can be taken. In the process of attempting to determine the identity and number of non-native species in a given area, regional and local species lists (see Section “II. Searching for invasions: literature surveys”) based on biodiversity or ecological surveys may be useful. Researchers should seek evidence that taxonomic experts were involved in such lists and surveys, either in the authorship or acknowledgments. If none of the authors are specialists in any of the taxonomic groups considered, and if the methods refer solely to the use of local keys or guides, if voucher specimens have not been archived in a recognized depository, and if no experts are acknowledged for the taxa considered, then acceptance of the identifications of the species, and certainly key species of interest, must proceed carefully (Bortolus 2012a, b). Researchers should pay special attention to reports of species otherwise previously known only from another ocean or another hemisphere and newly reported by non-specialists for the first time. Such reports would require verification by taxonomic experts through direct examination of specimens. Indeed, for key species of interest, re-examination and verification by experts may be of value. We discuss “verification” of species identifications by genetic analyses below.

Matches with DNA (nucleic acid)-Based Species Databases may not yield the correct identification

The availability of public databases with molecular sequences of “identified” species has often proven to be of exceptional value in potentially yielding the scientific names, based on genetic data, of unidentified specimens, especially if morphological taxonomists are not available, if the material in hand alludes morphological identification, if species complexes are involved, or in eDNA metabarcoding of whole community samples or species-specific qPCR assays (Section “IVD. eDNA metabarcoding surveys and qPCR assays”). Newly generated molecular sequences (from whatever sources) are compared with online sequences deposited by others to produce a purported identification. Non-specialists may thus assume that a 99–100% (or sometimes lower) “match” produces a “correct” identification of the material in hand, resulting in common statements in the literature such as “identification was confirmed by molecular analysis.” This said, it is critical to remember that all first-time species entries in genetic databases are based on morphological identifications, identifications which may or may not have been made by expert taxonomists.

The reliability of GenBank, as an example, as a tool for verifying species identifications has generated a number of discussions (see Leray et al. 2019, 2020; Locatelli et al. 2020; Dupérré 2020; Ricciardi et al. 2021; Hayes 2021; Sigwart et al. 2021). To add sequences to GenBank does not require sequence providers to document the name of the person who identified the species, a photograph of specimens, or the institutional deposition of specimens, nor details about the collection site (GenBank Submissions Handbook https://www.ncbi.nlm.nih.gov, accessed August 2023). The iBOL alliance however requires saving the specimen from which a DNA sample was taken, a photo, and the name and contact of the person who identified the specimen. However, most sequences are uploaded to GenBank; critical baseline taxonomic and curatorial information may or may not be in a supporting publication. At least in the marine systematics community it is widely discussed, often privately, that many species in genetic databases are either known or probable misidentifications. Better of course is published evidence in the peer-reviewed literature that mistakes have been made, although only those who deposited the original sequences can then correct the identification. Fehlauer-Ale et al. (2015) stipulated the GenBank numbers for three species of marine bryozoans that were incorrectly identified in that database, and provided the correct identifications. As of August 2023, none of these identifications had been changed in GenBank. Sigwart et al. (2021) noted that more than 6%, or 62 of 942 sequences deposited in GenBank for species of the oyster genus Magallana “represent identification errors,” while Fort et al. (2021) found that up to one-third of all deposited sequences in GenBank of foliose species of the green alga Ulva were misidentified. Gauff et al. (2023) note that for species in the bryozoan genus Watersipora, there are “pervasive erroneous identifications in GenBank, which in turn perpetuate further errors in recent studies.”

The extent to which the use of GenBank can influence identifications is reflected in an increasing number of papers that report improbable or impossible records of species, based solely on “confirmation” to matched sequences. Thus, molecular genetic “identification” alone, absent expert morphological confirmation, of remarkably disjunct species occurrences, may result in the mis-identification and thus mis-reporting of species. As an example, the Asian barnacle Megabalanus rosa was reported, based on a molecular match in GenBank, as established in the rocky intertidal of southern California, but in fact was the native California barnacle Megabalanus californicus (Hagerty et al. 2018, and Corrigendum; Hagerty et al. 2019). Ardura et al. (2021) have listed many cold-water species as invasions from French Polynesia, based solely on genetically-based identifications, which we comment on further below.

We are aware of the challenges of verifying genetic matches with expert identifications based on morphology. The dearth—or complete unavailability—of taxonomists for many marine taxa in many regions is well-known. For those taxonomists who are available, the queue to obtain an expert opinion may be very long. In many areas of the world there are no scholarly taxonomic guides to the marine fauna or flora in general, nor to many specific taxonomic groups. However, none of these situations mean that genetic-only identifications should automatically be considered valid, especially when they produce eyebrow-raising names of species previously known only from another ocean or another continent, or with serious climatic mis-matches, unaccompanied by a museum-deposited specimens or even a photograph. Erroneous identifications may have further consequences relative to the understanding of ecological interactions as well as in management decisions (Bortolus 2008).

With these preambles considered, we outline here suggested “best practices” approaches that would permit more thorough and accurate assessments of bioinvasion diversity.

I. Establishing target taxonomic and habitat breadth to be assessed

IA. Phyletic diversity

An attempt should be made to capture the broadest possible range of marine and maritime taxa in reviews of bioinvasion diversity. We review below the historical and taxonomic impediments that may make it challenging to do so. Examples of inventories that have attempted to include a broad range of taxa include Carlton and Eldredge (2009, 2015, Hawaiian Islands), Carlton et al. (2019, Galápagos Islands), Schwindt et al. (2020, Argentina and Uruguay), Mead et al. (2011a, b, South Africa), Katsanevakis et al. (2020, Mediterranean Sea), Galil (2009, Mediterranean Sea), and Minchin et al. (2013, British Isles).

Examples of the range of taxa that could be captured in invasion inventories are shown in Table 1. Documented marine invasions include viruses, bacteria, fungi, a wide range of additional microorganisms (including foraminiferans, ciliates, dinoflagellates), algae, seagrasses and salt marsh halophytes, many invertebrate phyla, and fish, birds, and mammals (we discuss below the inclusion of introduced mammals in marine invasion inventories). Invasion inventories omit many of these groups, not necessarily because of the lack of reports of such invasions (which we discuss below), but because the authors may not have investigated whether invasions were in fact already reported in many taxa, and, as we further discuss below, because authors may lack taxonomic resources and expertise.

The desire to assess invasions across a wide range of marine taxa also quickly encounters long-term and long-discussed critical challenges in the adequate censusing of many groups of marine organisms, challenges which are often linked to the “taxonomic impediment” (Carlton and Fowler 2018; Dupérré 2020; Ricciardi et al. 2021). Carlton and Fowler (2018) listed examples of 29 phyla and classes of marine invertebrates and fish that are currently subject to global and coastal transportation by a wide range of anthropogenic vectors. Of these, they noted that a review of the past 20 years of surveys suggest that 27 of these 29 groups appear to be “globally under-reported as invasions.” These under-represented groups, all marked by a serious and growing lack of taxonomic expertise world-wide, range from sponges and hydroids to flatworms, nemerteans and polychaetes, as well as to many groups of small crustaceans and bryozoans.

Thus, the breadth and depth of capturing invasion diversity varies strikingly by location, and in many cases will mirror local interest and available taxonomic expertise. Non-native marine copepods and mysids have been well explored in San Francisco Bay, California (Orsi and Walter 1991; Orsi and Ohtsuka 1999; Slaughter et al. 2016) and in the American Pacific Northwest (Cordell et al. 2007a, b, 2008) but in many regions of the world no introduced copepods or mysids are reported at all, reflecting the lack of investigation, rather than a lack of invasions.

Similarly, despite the fact that a reasonable assumption would be that many species of diatoms and dinoflagellates (among many other microbial groups) have been transported and successfully introduced by ballast water for more than 150 years, their recognition as non-native species in marine coastal systems around the world is highly variable and often absent. Wyatt and Carlton (2002) remarked that “most modern phytoplankton invasions have simply been overlooked,” detailing evidence for this suggestion. Long-standing complexities of diatom and dinoflagellate taxonomy, the lack of historical data bases, the existence of dinoflagellate resting cysts that may remain undetected for long periods of time, and the boom-and-bust cycles of many phytoplankton species, among other challenges, have all served to obfuscate the detection and verification of phytoplankton invasions. Perhaps no better example is found in San Francisco Bay, California, USA, which hosts nearly 300 non-native species of algae, invertebrates, and fish (Carlton 2009). The Bay also supports 500 distinct phytoplankton taxa, with 396 identified to species level—none of which are recognized as non-native species by phytoplankton researchers, although many of the same species bear cosmopolitan names, have been found in ballast water, and are not known from ocean currents (Carlton 2009). We further note below (Section “VB. Review of criteria for recognition of non-native species”) the “smalls” rule of invasion biology—the inverse correlation of body size with the ability to be recognized as non-native (Carlton 2009). Nevertheless, we encourage researchers to consider the potential for, at least, cryptogenic phytoplankton taxa in the communities in which they work.

The way forward: how to embrace phyletic diversity of invasions

To more fully clarify or investigate the phyletic diversity of invasions under consideration, researchers should,

  • Taxonomic Lacunae Explicitly discuss the taxonomic groups that are missing from the regional treatment, in order to weigh the potential scale of invasion underestimation. Thus, if a regional marine invasion inventory considers no polychaetes or amphipods, these lacunae should be clearly flagged as a lack of knowledge of the biogeographic history of taxa in the study area (if such is the case), rather than the implication that such taxa have been investigated and no non-native species are present.

  • Earlier Non-Invasion Literature Search regional taxonomic and natural history literature which did not appear in searches using invasion terms, particularly searches that use more “modern” words (such as invasion, invasive, or non-indigenous) for non-native species. Earlier literature may refer to non-native species as “introduced,” or simply suggest that a species was historically transported by ships to the present location.

  • Tap Specialist Knowledge Consult with both local and global taxonomic specialists to determine if they are aware either of literature that does or may contain records or suggestions of non-native species in the region in question, or of species that they themselves consider certain or likely non-native candidates, regardless of whether the biogeographic status of such species in a given region has been previously published.

  • Mining Non-Regional Literature Search in global taxonomic, and particularly monographic, literature. As we review below (Section “II. Searching for invasions: literature surveys”), non-native species present in a given region may be reported as such in non-regional literature.

IB. Habitat diversity

Few invasion inventories explore the full range of potential habitat diversity occupied by marine and estuarine non-native and cryptogenic species (Table 2). Non-native species have invaded virtually every marine habitat (Table 2), including the deep sea (Carlton 2002, 2003), in many habitats prominently so. We consider 22 habitats which support non-native species, grouped under Water Column, Intertidal (including Dunes and Supralittoral), Sublittoral, and Deep Sea.

  • Water Column, Marine and Estuarine

    • Coastal (neritic) waters

    • Oceanic neuston-pleuston (neopelagic)

  • Dunes, Supralittoral, and Intertidal, Marine and Estuarine

    • Maritime sand dunes

    • Supralittoral (strand zone)

    • Maritime wharves

    • Rocky intertidal (exposed coast)

    • Rocky intertidal (protected coasts and estuaries)

    • Other hard-bottom intertidal (e.g. oyster reefs, serpulid tubeworm reefs)

    • Soft-bottom intertidal (e.g. mudflats, sandflats, clay-peat banks, sandy beaches, mixed sediments),

    • Sandy beaches and surf zones (exposed coast)

    • Salt marshes

    • Rocky salt marshes

    • Mangroves

    • Coral reefs (intertidal)

  • Subtidal (Sublittoral)

    • Fouling (intertidal and subtidal)

    • Wood-boring (largely in harbors and ports)

    • Hard-bottom subtidal to shelf (including rocky, to 200 m)

    • Soft-bottom subtidal to shelf (mixed sediments, to 200 m)

    • Seagrasses

    • Kelp beds

    • Coral reefs (subtidal)

  • Deep Sea

    • Deep Sea mixed bottoms (> 200 m)

Only 3 of these 22 habitats involve human-created environments (acknowledging that all of the other habitats have sustained human modification)—maritime wharf communities, intertidal and subtidal fouling communities, and wood-boring communities, largely in harbors and ports. Bays and estuaries are typically highly invaded (Table 2), and include water column, natural hard-bottom and soft-bottom intertidal and subtidal substrates, mangroves, intertidal and subtidal fouling and wood-boring communities and seagrass (including eelgrass) communities. As many of these are studied as distinct habitats within bays and estuaries, and given that there has been a concentration of work on artificial substrates, some of these habitats may remain largely unexplored for invasions (or, if invasions are reported in such environments, as we note below, may be overlooked in invasion reviews).

Importantly, non-native species found in three of the above habitats—maritime (ocean) sand dunes, supralittoral zones, and salt marshes—may often be either overlooked or intentionally omitted in assessments of marine invasion diversity. We argue for their inclusion in assessments of marine bioinvasion diversity based upon ecological and evolutionary physiological grounds, given that species in these habitats typically require an ocean—that is, a saline—environment in order to reproduce, feed, grow and survive. For example, the European beachgrass (marram grass) Ammophila arenaria is a member of the non-native maritime beach dune community on the Pacific coast of North America and the North American cordgrass Spartina alterniflora is a prominent non-native species in salt marshes in countries around the world. Both plants require a salt-based environment to exist. Salt marshes are inundated by tidal ocean waters daily, and non-native species occurring in this habitat thus qualify as members of the marine bioinvasion community, rather than terrestrial ecosystems.

Similarly, often under-reported globally are invasions in the marine supralittoral zone, also known as the strand-line, wrack-line, drift-line, and maritime community. Terrestrial entomologists and botanists recognized as early as the 1700s that rock, shingle, sand, and shore debris ballast transported in ocean-going ships had begun the movement of coastal insects and plants around the world (Lindroth 1957; Mack 2003). Despite Roux’s early (1828) mention of the probable transport of a non-native shore isopod (Ligia exotica) in a ship’s hold (Carlton 2011), recognition lagged behind in marine literature that a great many other littoral invertebrates were also likely dispersed for centuries in “solid” ballast. Examples of the latter include talitrid amphipods (“beach-hoppers,” “marsh-hoppers” and “sand-hoppers”), halophilic oniscoid isopods, insects, and gastropods. Importantly, even well-recognized non-native maritime plants and insects are often omitted in treatments of marine bioinvasions—and may also be omitted in treatments of terrestrial invasions! Supralittoral animal and plant species are included in bioinvasion reviews for South Africa (Mead et al. 2011a, b), Argentina (Schwindt et al. 2020), the Galápagos Islands (Carlton et al. 2019), and the Hawaiian Islands (Carlton and Eldredge 2009) (Table 3), but are generally absent in marine invasion reviews.

We further argue that certain non-native species of birds and mammals, albeit not requiring the ocean for reproduction or survival, should be critically considered for inclusion in coastal invasion inventories based upon their ecological roles in marine communities. The roles of non-native birds and mammals as consumers and predators in marine environments may often be overlooked by marine invasion researchers. In fact, non-native terrestrial mammals (Carlton and Hodder 2003) and non-native birds (below) utilize and may impact coastal marine and estuarine communities. The marine invasion inventories for Chesapeake Bay (Ruiz et al. 1999) include introduced terrestrial mammals utilizing salt marshes. In contrast, the same introduced mammals invasions are omitted from the treatment of introduced aquatic and marine species in the Great Lakes and the North and Baltic Seas (De Lafontaine and Costan 2002; Reise et al. 1999; Gollasch et al. 2009). The non-native duck Anas platyrhynchos is included in the inventory of marine bionvasions of the Hawaiian Islands (Carlton and Eldredge 2015) because these birds feed in brackish water habitats on Oahu. Similarly, the Eurasian mute swan Cygnus olor is included among marine bioinvasions of New England (Williams 2007) and British Columbia (Levings et al. 2002), where it is a common consumer in coastal waters. We regard these and other species as members of marine intertidal communities, and include them in lists of marine bioinvasions, as discussed earlier.

However, resident non-native species that neither rely on a marine environment for reproduction, nor regularly act as competitors or predators in marine communities, should, in general, not be treated as members of marine or brackish ecological communities. A great many species of terrestrial plants, for example, live in a broadly-defined “sea spray” or “salt spray” zone (but not the wave splash zone), and many of these same plants are found well inland in cities, gardens, roadsides, grasslands, and farms, far from the ocean. Anton et al. (2019) included well-known terrestrial plant invaders, such as Australian pine (Casuarina equisetifolia), velvet grass (Holcus lanatus), bitou bush (Chrysanthemoides monilifera) and turf grass (buffalo grass, St. Augustine grass) (Stenotaphrum secundatum), the latter a common lawn grass, in their review of the “global ecological impacts of marine exotic species,” because these plants appeared in studies that included “environments getting sea spray,” and in searches with the word “coastal.” While we do not doubt the abundance and potential ecological engineering roles of these and many other terrestrial species that tolerate but do not require a saline habitat, marine bioinvasion ecology would be fundamentally redefined by extending an umbrella over these species.

A broad suite of open ocean habitats have sustained invasions, include near-shore planktonic ecosystems (by diatoms and dinoflagellates, jellyfish, and fish), soft-bottom subtidal communities, kelp beds, open coast sandy beaches, coral reefs and deep sea. For the latter, Voight et al. (2012) have offered a sobering example of how scientific equipment used to study deep-sea hydrothermal vents has the ability to transport species between vent systems. Likewise there have been newly-detected invasions of the high seas by coastal species colonizing plastic marine debris (e.g., Haram et al. 2021). In particular, exposed rocky intertidal shores support (and in some areas have long supported) highly abundant invaders in regions as widespread as Europe, the Western North Atlantic, Argentina, Chile, and the Hawaiian Islands (Table 2). In contrast is an older view, as expressed by Zevina and Kuznetsova (1965), that “an open coast is not suited to the introduction of new organisms. Only closed inlets and bays, from which the larvae will not be carried into the open sea, and in which they will be able to set in close proximity to each other, are suitable.”

Certain habitats that appear to support few invasions may have simply not benefitted from focused study on their invasion history, a hypothesis that, as far as we can determine, has not been tested. The absence of reports of large number of invasions from any given habitat (such as open coast sandy beaches or deeper ocean waters) should not be interpreted as a general dearth of invasions (see Preisler et al. 2009; Bumbeer and Rocha 2016), even if some habitats are less invaded than others. As we comment earlier in this essay, phytoplankton invasions may be extensively underreported globally.

Perhaps again because of the general focus on invasions in port and harbor habitats, there appear to be few global reviews of the presence and impact of non-native species in most of the habitats and communities shown in Table 2. The exceptions include Williams’ (2007) review of invasions in seagrass ecosystems, a brief review of selected zooplankton invasions (Bollens et al. 2002), a brief review of animal invasions in salt marshes (Byers 2009), and a review of certain invasions in mudflats (Ruesink 2018).

The way forward: how to embrace habitat diversity of invasions

To more fully clarify or investigate the habitat diversity of invasions, researchers should,

  • Habitat Lacunae Explicitly identify those habitats that are missing from the regional treatment, in order to weigh the potential scale of invasion underestimation. Thus, if a regional marine invasion inventory fails to consider supralittoral, salt marsh, or mud-sand habitats, these omissions should be clearly flagged as being uninvestigated, rather than the implication that such habitats have been investigated and no non-native species were found to be present.

  • Habitat-Specific Literature Search specific regional habitat literature—such as the literature on salt marshes or soft-sediment infauna—if these habitats do not surface using (in parallel to the above phyletic considerations) search terms for non-native species.

  • Tap Specialist Knowledge Consult with both local and global habitat specialists to determine if they are aware of either literature that does or may contain records or suggestions of non-native species in the region in question, or of species that they themselves consider certain or likely non-native candidates, regardless of whether the biogeographic status of such species in a given habitat has been previously published.

II. Searching for invasions: literature surveys

It is critical to recognize at the outset of reviewing regional invasion literature that many researchers, including systematists, biogeographers, and ecologists, still do not recognize the trichotomy of native, non-native, and cryptogenic species, despite the introduction of the cryptogenic concept more than 25 years ago (Carlton 1996). Thus, a researcher is often faced with lists of “native” or “non-native” species, a result of the general long-term default in biogeography and ecology to categorize species as native even if the evolutionary and distributional history of a given species is not known (Carlton 2009). We address this situation in Section “III. Reassessment of the status of “native” species”. Linked to this is that the nineteenth century concept of natural “cosmopolitanism” of shallow-water coastal species remains alive and well in the hearts of many scientists working with many small-bodied and poorly known taxa (Darling and Carlton 2018; this, too, remains one of the greater challenges in resolving invasion diversity. We address both of these topics in further detail below as well (Section “III. Reassessment of the status of “native” species”).

Here we present the multiple ways in which literature, often from many different sources, can be critically evaluated.

IIA. Regional invasion and taxonomic literature, and the need for search term diversity

Taxa reported in regional (in-country) invasion literature and databases typically yield the first working lists of non-native and cryptogenic species. If the desire is to publish an authoritative, scholarly review of regional marine invasions, all records derived from both regional and global databases should be verified by examination of primary original sources. While this can be both time-consuming and challenging (if not annoying), doing so will avoid errors passed down from one secondary source to another.

A standard approach is to use web-based search systems, such as Web of Science, SciELO, BIOSIS Citation Index, CABI, Scopus, and ASFA. We note however, that users should be aware that these databases do not cover all languages, and thus sites such as Google Scholar may be important to find more regional non-English literature. Researchers may often unnecessarily restrict their use of search terms and search combinations. We emphasize that it is important to use a broad range of search terms, given that standardized terminology in invasion science is far from stabilized (Occhipinti-Ambrogi and Galil 2004; Falk-Petersen et al. 2006; Richardson et al. 2011). We further emphasize that ceasing one’s search, after using multiple search terms and standard databases, may result in overlooking many invasions, as we detail in the sections below.

If the resources to do so are available, the following terms (not a selection of them) should be used in searches, in pertinent combinations, in order to reveal previously reported invasions in a given region. Naturally, these words should be translated into appropriate languages and any additional common regional terms should also be included.

  • Invasion terms adventitious, adventive, alien, allochthonous, biopollution, climate migrant, colonist, colonization, colonizing, cosmopolitan, cryptogenic, exotic, extralimital, foreign, hitchhiker, immigrant, imported, introduced, introduction, invaded, invader, invasive, migrant, naturalization, naturalized, neobiota, neocosmopolitan, neophyte, neozoa(n), non-indigenous, nonindigenous, non-native, nonnative, pseudoindigenous, stowaway, tramp, transfer, translocated, transplant, transported, waif, weed, xenodiversity

  • Biogeography terms cosmopolitan, cryptogenic, expand, expansion, extension, “first discovery”, “first record”, new, “new record”, northward, novel, poleward, range expansion, range extension, southward, spread, spreading

  • Vector terms aquaculture, aquarium industry, ballast, biofouling, biological supply, canals, fisheries,

  • Fouling, habitat restoration, mariculture, marine debris, marine litter, oil and gas drilling platforms, rafting, seafood, ship, vessel

  • Habitat terms aufwuchs, biofouling, brackish, coastal, estuarine, estuary, fouling, marine, maritime, ocean, sea; see also Table 2

IIB. Global taxonomic and invasion literature with “hidden” invasion records

McGeoch et al. (2012) noted that the significance of incomplete information searches in alien species listing is “assumed to be small where significant effort is invested in thorough searches,” which presumes that the searcher knows how and where to search. In contrast, non-native species may be reported in global taxonomic and invasion literature, unbeknownst to regional researchers who may not be aware that the species in question even exists in their country. For example, Herbert (2012) noted that the European salt marsh snail Myosotella myosotis was a non-native species in South Africa, but that “this information, published in a taxonomic revision of western Atlantic Ellobiidae … escaped the attention of the South African marine science community”—although the record was published in 1996 and this snail had been present there since the 1880s. Myosotella myosotis is a global invader that was inadvertently re-named as a new species many times after its introductions around the world (Martins 1996; Carlton 2009). It has similarly been overlooked as an invasion in Peru, where it has been present since the 1830s, and in Bermuda (Martins 1996).

We provide additional examples of such hidden records in Table 4, in three categories:

  1. (1)

    Taxonomic literature with hidden invasion records in which the authors indicated that the species in question is introduced elsewhere, but these records largely appear to have gone unnoticed in the regions indicated,

  2. (2)

    Invasion literature with hidden invasion records for other countries or regions,

  3. (3)

    Taxonomic literature with invasion records not identified as such in the cited references (but are so suggested in Table 4, as examples)

The way forward: how to address hidden invasion records

To investigate potentially “hidden” records of invasions, researchers should,

  • Search Other Regional Invasion Literature Using the local region or country name, search invasion inventories in other regions for species that have been flagged as non-native in parts of their known range.

  • Search Regional or Global Systematics Literature Using the local region or country name, search papers and monographs that review the taxonomy and distribution of selected marine taxa on an ocean-wide basis if not global basis, for species that have been flagged as non-native in parts of their known range. The depths to which such records may exist but can be overlooked are frustrating: for regions that we know well, such as Argentina and the Hawaiian Islands, we have no doubt that buried in the other invasion literature or taxonomic literature are records of non-native species that we have not yet discovered.

  • Tap Specialist Knowledge As suggested above, relative to embracing taxonomic and habitat diversity, consult with experts in specific taxonomic groups that often encompass large numbers of non-native taxa. Examples include corophiid and caprellid gammarid amphipods, sphaeromatid isopods, balanoid barnacles, campanulariid hydroids, diadumenid anthozoans, spionid and serpulid polychaetes, bugulid bryozoans, and ascidians.

III. Reassessment of the status of “native” species

Many hundreds, and perhaps thousands, of species of marine protists, invertebrates, fish, algae, and maritime and marsh higher plants were transported around the world long before scientific investigations commenced (Carlton 1987, 2003, 2009; Ojaveer et al. 2018). Thus, in any given coastal marine community there may be scores or hundreds of species whose biogeographic status as native has never been questioned, a situation underlain by the “shifting baseline” assumption that species first recorded at a location, especially in the 1700s, 1800s or early 1900s, were native (Carlton 2009; Ojaveer et al. 2018). Heavily layered upon this assumption was the common (and in a surprising number of cases still enduring) usage of the same scientific name for species occurring in far-reaching locations around the world, leading to a persistent view that literally thousands of coastal species of marine animals and plants were (or are) “naturally cosmopolitan” (Darling and Carlton 2018). Carlton (2009) has reviewed in detail the consequences of these assumptions and views relative to the multi-century obfuscation of the scale of invasions. In reviewing an earlier suggestion that “nearly 1000 coastal species” may have been overlooked as invasions (Carlton 2003), Carlton (2009) suggested that, “in retrospect, the number 1000 now seems too low.”

While global invasions commenced prior to the 1500s, a great many invasion inventories report the oldest known invasions as beginning only in the mid-1800s, with rare earlier records. Despite this temporal disconnect of several centuries between the apparent onset of invasions and their first detection, there has been an overall reluctance, as noted above, in most invasion inventories to re-assess the biogeographic status of ostensibly “native” species (Table 3). As examples, of 22 study regions in four oceans (North and South Atlantic and North and South Pacific) representing 11 continental margins or island systems, only 8 studies re-assessed selected “native” species and re-assigned some of these to a non-native status (9 did the same for cryptogenic species) (Table 3). We know of no studies (including our own) that attempt a “deep dive”—a daunting task—into even a significant number of candidate taxa deserving biogeographic re-assessment relative to how many may represent earlier invasions. For the 8 study areas noted here that have benefitted from some re-examination of the native biota, re-assessment is typically limited to relatively few taxa or taxonomic groups, leaving the bulk of the “native” biota unquestioned. In turn, identification of cryptogenic species, foraged from “native” lists, are, without exception, only examples of cryptogens, with most of the marine biota similarly remaining biogeographically unexcavated.

Beyond the shifting baseline, several additional reasons may account for this reluctance to re-consider species’ indigenous (or even endemic) status:

  1. (1)

    A general hesitancy to challenge “conventional wisdom,” as held by senior in-country zoologists or botanists, including reluctance to question the biogeographic status of “iconic” native species (see for example Bortolus et al. 2015 relative to the history of the understanding of the biogeography of the marsh grass Spartina alterniflora in South America).

  2. (2)

    An assumption that re-constructing the origin and dispersal history of a species that may have been subject to centuries of human-mediated dispersal may be nearly impossible (a resistant belief now potentially set aside, in part, by the advent of molecular genetics that could tease apart a species’ history over time and space, such as the work resolving the origin of the widespread supralittoral isopod Ligia exotica as being in East and Southeast Asia Hurtado et al. 2018), and work resolving the octocoral Carijoa riisei, formerly thought to be native to the Caribbean (from where it was first described in 1860), as native to the Indo-West Pacific (Concepcion et al. 2010)).

  3. (3)

    The assumption that a species’ type locality (which may be in the study area in question) is the native region (see Section “VD. Type localities are not default native regions”, below).

and,

  1. (4)

    What may simply appear to be the daunting knowledge required of the obscure and complex systematic and biogeographic literature of a taxon, often under older species names and in multiple languages, accompanied by a similar complex and unsynthesized literature on historical anthropogenic vectors applicable to the potential historic movement of any one species.

Yet, abundant evidence suggests that a great many non-native species may be hidden under an indigenous-endemic umbrella. As Carlton (2009) outlined, many non-native species are pseudoindigenous, having been, (a) mistakenly re-described as new in the introduced region (i.e., already described in their native regions), (b) first described as new (with the presumption they were native) where they were actually introduced, and then later discovered in their native regions, and (c) described as new after being introduced (as determined post-description by other researchers, by the application of criteria discussed below), but nevertheless remain unknown elsewhere.

Pseudoendemic is an unrelated term, having been introduced independently multiple times, including meaning a population largely dependent on constant immigration (Redfield and Beale 1940), a species found only in one sampled location (although known to occur in other locations), as opposed to the number of species common to two or more sampled locations (Balinsky 1967), endemic species whose range has been inadequately or unevenly sampled (Daniels et al. 1995), species “already known from other countries or not being genuinely distinct species” (Fraser-Jenkins 2008), and species “now confined to one or a few islands, but that had much larger ranges (prior to) human contact” (Steadman 2006), among other meanings.

Relative to the three pseudoindigenous categories noted above, Carlton (2009) provided 94 examples of species in category (a); in many cases, species were redescribed as new in different regions (up to 11 times for one species), for a total of 159 cases of mistaken re-description amongst these 90-some species alone. Carlton (2009) also provided examples of 21 species in pseudoindigenous category (b), and 7 species in category (c), the latter, as defined above, thus being sui generis designations as introductions by later work. Soledade et al. (2013) have described cases in category (a) as a “precautionary tale when describing species in a world of invaders.”

Descriptions of non-native species as new species in the above categories were not recognized by authors as introduced at the time of description (Carlton 2009), a situation that continues to the present day (Soledale et al. 2013; Aguilar et al. 2022). However in a small number of cases, authors recognized that a species, albeit apparently undescribed, was not likely native (Carlton 2009, Table 2.4): as examples of species recognized at the time of their description (or mistaken redescription) as non-native, Carlton (2009) offered examples of 7 species in category (a), 5 species in category (b) and 7 species in category (c). Recognition of marine species as undescribed but introduced dates back to at least the 1870s. Cooper (1872), in describing the brackish-water snail Alexia setifer as a new species from San Francisco Bay (now known as Myosotella myosotis, the same species long-overlooked in South Africa as discussed above) noted that the Bay had been searched for mollusks “for more than twenty years,” and concluded that it might not be native, speculating that it may have been brought from China on ship hulls, or “as ova in damp nets or otherwise.” Authors recognizing non-native species at the time of their description have used trivial names such as aliena, alienense, enigmatica, and exotica (Carlton 2009) nomadica (Galil et al. 1990), invadens (Reise et al. 2011), perambulata (Louis and Menon 2009), and ingressus (Engin et al. 2018), among others.

An important derivative from these examples is that a number of demonstrably non-native species remain known only from where they have been introduced (Carlton 2009 Tables 2.3 and 2.4-C). Of course, some of these species may yet be found to be described elsewhere, with earlier descriptions (and thus names) not yet being matched to the satellite populations of these species. However, in many if not most cases, it appears likely that the biota in the native or presumptive native regions of these taxa (as judged by criterion 6 in Section “V. Criteria for recognition of non-native species”) is simply not sufficiently known, and that these species have come to light only by being introduced to regions under greater investigation, or by the vagaries of fortuitous encounters (such as the discovery of a new species of bryozoan, Bugula tsunamiensis, known only (at this time) from rafting debris in the North Pacific, but for reasons detailed in McCuller et al. (2018) appears to find its origin in Honshu, Japan). The scale of pseudoindigenous diversity is unknown, but as a phenomenon it further contributes to the strong likelihood that the number of “endemic” or “indigenous” species mis-categorized as such is under-estimated—and thus that the number of introductions in any one region is likely higher than estimated.

In contrast to the earlier lack of recognition of new species as potential candidates for having been introduced, an increasing number of undescribed or potentially undescribed marine invertebrate species from well-studied shallow-water areas are now recognized as being non-native at the time of discovery and description, a sign of greater awakening of an understanding of the scale of invasions. We provide examples of these in Section “VD. Type localities are not default native regions”, below.

In addition, non-native species have been initially misidentified as already-described native species, with well-known cases including mistaking the Japanese seastar Asterias amurensis for a native seastar in Tasmania (Buttermore et al. 1994), the Western Atlantic comb jelly Mnemiopsis leidyi for a native ctenophore in The Netherlands (Faasse and Bayha 2006), and the Mediterranean mussel Mytilus galloprovincialis, mis-identified as the native mussel Mytilus edulis (now Mytilus trossulus) for many decades in southern California (Geller 1999). It is unlikely that all such cases have been recognized.

In sum, while the most common approach in creating invasion inventories is to rely on species already reported as non-native, as discussed above, in reality the historical biogeographic status of many species in a given region may not have been properly assessed, if assessed at all. Jaric et al. (2019) have categorized the underestimation of non-native species diversity as “crypticity in biological invasions,” encompassing the multiple challenges of undescribed species, taxonomically difficult species complexes, pseudoindigenous species, cryptogenic species, and undetected species.

IIIA. Targeting candidate “native” species for re-evaluation of biogeographic status

Of hundreds or thousands of species in a regional biota, which taxa or taxonomic groups potentially representing overlooked invasions would bear investigation? Put another way, in which taxa should considerable time and effort be invested to tease out those species which may have been introduced long before formal scientific work began?

While we have argued (above, and Table 2) that non-native species are found in a broad range of marine habitats, the following specific groups and habitats bear early consideration. We recognize that our considerations here overlap with our admonitions to more fully explore both phyletic and habitat diversity of non-native species:

  1. (1)

    “Native” species that occur largely or primarily in habitats known to support well-recognized non-native species. Such species fall under one of the criteria for recognizing non-native species (Section “V. Criteria for recognition of non-native species”), that is, close association with known invaders. Thus—again noting our emphasis of the wide habitat diversity of invasions—low-hanging fruit includes re-consideration of species in local biofouling communities that have not been previously considered as possible non-native taxa.

  2. (2)

    Habitats and thus taxonomic groups not previously considered in a given region as supporting possible or probable invasions, such as supralittoral maritime amphipods and isopods (often omitted in considerations of marine bioinvasions) and teredinid shipworms and limnoriid isopods.

  3. (3)

    Taxonomic groups such as campanulariid hydroids and shipworms that have been historically dismissed as “naturally cosmopolitan” because of unfounded assumptions of “natural rafting” (see Carlton 1999a; Carlton and Eldredge 2009).

  4. (4)

    Symbionts, commensals, and parasites of recognized invaders. These may include host invader taxa supporting (a) species-specific symbiotic and commensal ciliates, (b) folliculinid ciliates found with limnoriid isopods (“gribbles”) and other hosts, (c) commensal isopods such as Iais spp. found with their sphaeromatid isopod hosts, or (d) host-specific marine fungi associated with non-native maritime plants, such as mangroves.

Species that thus come under consideration as potential invaders are then analyzed through the lenses of the criteria outlined in Section “V. Criteria for recognition of non-native species”.

IV. Field surveys and sampling

If no field surveys for non-native species have been conducted for more than 5 years, efforts should be made to deploy standard assessment methods (below) to determine if any new invasions have occurred. This requires access to taxonomic expertise across as many phyla as possible, which may be challenging within country. Despite the abundance and diversity of, for example, sponges, flatworms, hydroids, polychaetes, bryozoans, ascidians and other taxa in marine communities, there may be no in-country experts to identify specimens, or, as discussed earlier, available experts may not have the time to examine material. In this case, effort should be made to contact taxonomic experts in other countries who may be available.

Whether in-country or not, taxonomists are invited to be co-authors of papers resulting from the survey work (that is, rather than being simply acknowledged). Unfortunately, funds are rarely available to engage professional for-hire taxonomists or taxonomic consulting companies, and we acknowledge that those involved in those trades are rarely inclined to exchange their services for the honor of co-authoring papers. Regardless, we have commented earlier that in the absence of the availability of morphological taxonomists, genetic analyses must not be substituted in toto to generate identifications.

We outline here some standard survey methods, which may variously be quantitative, qualitative, or a mixture of the two. Applying more than one method will increase the detection of non-native and cryptogenic species (Schwindt et al. 2014). Campbell et al. (2007) provide an overview of selected methods.

We note the rapidly expanding and highly beneficial role of the public (“citizen scientists”), social network sites such as iNaturalist (https://www.inaturalist.org/), and the increasing availability of reporting applications in documenting the occurrence of novel invasions, the spread of older invasions, and invasion diversity in general (Pearson et al. 2019; Martínez-Laiz et al. 2020; Hermoso et al. 2021; Kaminas et al. 2022; Howard et al. 2022).

IVA. Colonization substrates (passive sampling)

Standardized plates (panels), made of a wide variety of materials, are deployed to capture the recruitment of both sessile and mobile species in different sites, in different locations within sites, different habitats and depths, and over varying lengths of time (Bumbeer and Rocha 2012; Schwindt et al. 2014; Maraffini et al. 2017; Ramalhosa et al. 2021; Loureiro et al. 2021; Leclerc et al. 2021; Chebaane et al. 2022). Substrate deployments have a rich history (Jarvis 1853; Parker 1924; Visscher 1928; Coe and Allen 1937; Miyazaki 1938; Edmondson and Ingram 1939; McDougall 1943; Edmondson 1944; Millard 1952), with these earlier studies often providing an important baseline for fouling community diversity. Holmes and Callaway (2020) experimented with a “mixed material survey” (MMS) (substrates of different materials deployed simultaneously) to assess non-native species colonization; the MMS attracted a greater proportion of non-native species compared to the deployment of standard acrylic settlement plates. Plates may be deployed in different configurations, including vertically- and horizontally-oriented panels, the latter leading to algae and/or sediment accumulation on upper surfaces but providing shading for negatively phototactic species on lower surfaces. Some deployments employ both methods (Collin et al. 2015), given that larval settlement may be influenced by (among many phenomena) physical factors such as light, gravity, and water flow. Plates may also be caged to exclude predators accompanied by uncaged (open) treatments (Freestone et al. 2013; Oricchio et al. 2016; Giachetti et al. 2020), noting that unattended biofouling on cages can alter the treatment, and that predators may recruit into (and be trapped in) cages. Once retrieved, plates may be sampled at various intensities (all taxa or selected taxa, and for morphological and/or genetic analyses). Expert taxonomists may be involved at the time of plate retrieval (for live analysis) or later.

IVB. Rapid assessment survey (RAS)

An RAS consists of typically assembling a team of taxonomic experts and support personnel to sample a series of sites and habitats along a length of coastline over a defined period of time (Pederson et al. 2021). In a marina-based RAS, for example, about 15 small boat harbors may be sampled along a distance of 400 km over a period of 7 days by a team of 15 to 20 personnel. As with all survey methods, RASs have limitations, including restricted temporal and spatial coverage, that may result in underestimating the number of non-native species present (Rohde et al. 2017). RASs have been conducted since the 1990s in a number of countries, including, for example, the following sites, with the referenced works providing extensive details of sampling and analytical methods: the United States Pacific coast (Cohen et al. 2005, and earlier surveys reviewed therein), the U.S. New England coast (Mathieson et al. 2008; Pederson et al. 2021, as well as earlier surveys reviewed therein)), England (Arenas et al. 2006), Scotland (Ashton et al. 2006; Nall et al. 2015; Collin et al. 2015), Ireland (Minchin 2007, based on a one-person survey extending over 18 months), Brazil (Marques et al. 2013) and Korea (Park et al. 2017). The first RAS known to us was conducted in June 1970 in San Francisco Bay (California) by James T. Carlton and the late Neil A. Powell.

This method is similar to bioblitzes, which are also intensive time and space limited surveys, but differ in assembling teams of observers, collectors and taxonomists at one site, who may then fan out across a relatively small region and bring specimens back to a central processing area (Ashton et al. 2020).

We note that taxon-specific (target taxa) surveys can also be conducted (Minchin 2012; Minchin et al. 2016). These conceivably could further focus on detecting species in the wild that have been documented to date only on arriving or regional vessels (for example, Meloni et al. 2020) or known to be released into local waters, such as the same species repeatedly released in a port or bay (Carlton and Geller 1993). “Watch lists” for potential invaders are common throughout much of the world, but often highlight species not actually known to be arriving on specific vectors into a given region. Concentrating on species known to have arrived (and potentially (via vessel biofouling) or likely (via ballast water) released), or known to be present on in-country vessels may be lower-hanging fruit.

IVC. Extended site surveys

Extended surveys may be conducted over time involving multiple sites (and potentially multiple habitats), either within a region (Rohde et al. 2017), or throughout country (Campbell et al. 2007). The resulting greater sampling and necessarily increased taxonomic effort would typically require greater funding resources. Examples include marine bioinvasion surveys conducted around the Australian continent (Campbell et al. 2007, with multiple sampling strategies), along the coast of Patagonia, Argentina (Schwindt et al. 2014), and around the United States (Bastida-Zavala et al. 2017, employing fouling plate deployments).

IVD. eDNA metabarcoding surveys and qPCR assays

Increasingly advanced molecular genetic techniques are permitting the early detection, and monitoring the spread of, non-native species using water, sediment and marine debris samples (Zaiko et al. 2015; Borrell et al. 2017; Holman et al. 2019; Pearman et al. 2020; Suarez-Menendez et al. 2020; Ibabe et al. 2019, 2021). eDNA or qPCR sampling may also permit species-specific detection and monitoring (for example Jerde et al. 2011; Willis et al. 2011; Gargan et al. 2022). Agersnap et al. (2022) and Miya et al. (2022) have presented models for combining citizen science or community-based collection of water samples with eDNA or qPCR monitoring. As discussed above, limitations center on correct species identifications, even with 99–100% sequence matches, given that identifications mounted on genetic bank websites may not be correct or that the nearest matches may still not represent the species sequenced. As noted above, Ardura et al. (2021) reported apparently in error a large number of North Atlantic Ocean cold-water species in tropical Pacific Islands based only on genetically-based identifications. Other limitations include only partial sampling of non-native species present using metabarcoding (Couton et al. 2022), a limitation of course with all sampling methods. Metabarcoding surveys do not completely replace visual surveys nor collections of actual specimens that would permit morphological confirmation of identifications made through DNA assays.

IVE. Invited experts: focus on specific taxonomic groups

It may prove fruitful to invite expert taxonomists to a country or a region to work on a specific taxonomic group to determine regional richness, with a focus on assessing or resolving the presence of non-native species. Rationales for funding such work include resolving the scale of native vs. non-native diversity in severely under-reported groups (Carlton and Fowler 2018), establishing a biodiversity baseline by which to detect future invasions in understudied regions (for example, the Arctic and Antarctic), as well as, critically, determining which species, if any, that have been the subject of key ecological research (such as studies on ecological engineers) may in fact not be native, despite presumptions to that effect.

V. Filtering the Records: Criteria for recognition of non-native species

The majority of non-native species yielded by standard literature searches (IIA) do not, in general, require re-assessment as to their invasion status, having been (presumably) vetted repeatedly. Exceptions are those species which are listed as introductions but fail to meet minimum criteria of residency in marine waters, are not established in the wild, and other eliminating considerations, as outlined below (Section “VA. Species to be excluded from calculations of regional marine invasion diversity”). For the re-evaluation of species thought to be native but which may be introduced, robust distinguishing criteria are required, which we review below (Section “VB. Review of criteria for recognition of non-native species”). These same criteria may be applied to newly described species whose native status is uncertain but which may fall within the realm of new species that have been recognized as non-native at the time of their description (Section “VD. Type localities are not default native regions”).

VA. Species to be excluded from calculations of regional marine invasion diversity

The following categories of species should not be included in the calculations of the total number of non-native marine species in a given region. In particular, we strongly discourage (as noted below) listing species held only in mariculture (aquaculture) facilities as being resident non-natives, no more than species found solely in zoos or botanical gardens should be included. In summary, the categories discussed below are,

  • Species indicated as non-native without supporting evidence

  • Native species mistaken as introductions

  • Species intercepted on incoming vectors, found only in aquaculture, mariculture or aquarium facilities, or demonstrably not established in the wild

  • Species not found alive

  • Temporary range expansions

  • Failed invasions

Species indicated as non-native without supporting evidence

Not all species categorized as non-native (or cryptogenic) in the literature should be accepted at face value. Without supporting evidence, such taxa should be excluded from non-native diversity calculations pending evaluation. For example, Moro et al. (2003) assigned hundreds of species of marine plants and animals in the Canary Islands to the following categories, but none of these assignments are accompanied by supporting literature reference or other data: Nativo Seguro, Nativo Probable, Nativo Posible, Introducido Seguro No Invasor, Introducido Seguro Invasor, Introducido Probable (the “probable” and “possible” assignments would be considered cryptogenic in invasion literature). Similarly, Subba Rao (2005) reported more than 200 species as introduced post-1960 to Indian Seas, the majority without specific evidence as to their non-native status. While such lists are of potential value to highlight possible non-native species, the history and status of each species would require investigation.

Native species mistaken as introductions

As Carlton (2009, Sect. 2.5, “The Overestimation of Invader Diversity”) noted, mis-categorizing native species as introduced is relatively rare, in part because, as discussed earlier (Section “II. Searching for invasions: literature surveys”) a default in biogeography is to consider a species native. Application of the criteria detailed here (Section “V. Criteria for recognition of non-native species”) may highlight such cases, but the scale of misconstruing natives as introductions is unknown.

Examples include three species previously treated as non-native in the Hawaiian literature re-assigned to native status (Carlton and Eldredge 2009): the marine snails Bulla vernicosa (a sporadically-blooming native species first thought to be a World War II-era introduction from Guam) and Vitularia miliaris (also thought to be a WWII-era introduction, but in fact a previously overlooked native species long resident in the Islands and in the Hawaiian fossil record), and the mantis shrimp Gonodactylellus hendersoni (now G. demanii) a relatively small and easily overlooked species. The copepod Centropages maigo, initially thought to be a ballast-water introduction in Japan, is more likely native (Ohtsuka et al. 2007).

Yund et al. (2015) resolved that the ascidian Botryllus schlosseri, previously often considered as introduced in the Northwest Atlantic, appears to be primarily composed of native populations. Villalobos-Guerrero and Carrera-Parra (2015) demonstrated that what was thought to be the introduced North Atlantic polychaete Alitta succinea in the Eastern Tropical Pacific is a regional endemic species. The bryozoan Membranipora rustica (Florence et al. 2007), a native species in South Africa, was formerly thought to be the non-native North Atlantic species Membranipora membranacea, Carlton (2009) further reviewed the curious case of the misinterpretation of the history of the native xanthid crab Pilumnoides rubus in South Africa.

In the above cases, a combination of biology, ecology, natural history, biogeography, genetics or higher resolution taxonomy served to resolve these cases, all at the heart of the criteria laid out in Section “VB. Review of criteria for recognition of non-native species”.

Species intercepted on incoming vectors, found only in aquaculture, mariculture or aquarium facilities, or demonstrably not established in the wild

A large number of regional records of non-native species may be in hand that are not, or are not likely, based upon established wild populations. These include, (i) species arriving on, in, or with vectors, even if released into the environment (Marchini et al. 2015), although, as noted above, such species provide fodder for taxa-specific searches; (ii) species released into the environment with no evidence of establishment; (iii) species held solely in captivity; and (iv) species raised in mariculture operations but showing no evidence of wild, established reproducing populations (for example, Sherwood and Carlile 2012, relative to a red alga in Hawaii), even if occasional volunteers are found outside of aquaculture sites. Examples of the latter are the common occurrences of non-cultured non-native oysters attached to natural substates within a several kilometer halo of oyster farming operations in regions around the world.

Species not found alive

These include dead specimens that may have arrived with vectors, such as species that may have fallen off ships, or arrived with marine debris. These are of interest and should be recorded, but not listed as non-native species (Marchini et al. 2015). For example, the marsh snail Myosotella myosotis was incorrectly reported as non-native in Uruguay, but these records were based only on empty shells (Orensanz et al. 2002), and only years later it was found alive (Schwindt et al. 2020).

Temporary range expansions

Ephemeral expansions of species beyond their normal documented ranges and that do not result in permanently established reproducing populations should be excluded. Such expansions may be related to aperiodic phenomena such as Pacific Decadal Oscillations (PDOs), El Niño–Southern Oscillations (ENSOs) and other marine heat waves, and La Niña events (Victor et al. 2001; Montagne and Cadien 2001; Lluch-Belda et al. 2005; Lonhart 2009; Goddard et al. 2018; Sanford et al. 2019; Lonhart et al. 2019), or to cyclonic weather systems, such as hurricanes.

Failed invasions

Non-native species that failed to become established (for many reasons; see Simberloff and Gibbons 2004; Miller et al. 2007; Miller and Ruiz 2009; Zenni and Nuñez 2013; Bosso et al. 2022) should not be included in invasion diversity calculations. Failed invasions are defined as non-native species populations (F1 generation or more) either known to have (1) been reproducing, or (2) transiently settled in the wild from spawning adults on an arriving vector, but in either case are then documented by survey work to no longer be present. Examples of (1) include the establishment of a European sea anemone (Sagartia sp.) in Massachusetts from 2000 to 2010 (Wells and Harris 2014) and the presence of the Atlantic quahog (clam) Mercenaria mercenaria in a Southern California lagoon from the 1960s to the 1980s (Burnaford et al. 2011). Examples of (2) include the transient settlement of the mussel Mytilus galloprovincialis in Pearl Harbor, Hawaii, spawned from adult mussels arriving on a vessel from the State of Washington (Apte et al. 2000; Carlton and Eldredge 2009), and the transient settlement of the southern serpulid worm Hydroides elegans in Eel Pond, in Woods Hole, Massachusetts (Bastida-Zavala et al. 2017), spawned from adult populations arriving on a visiting vessel.

It is difficult to interpret many occasional discoveries, often bundled under a “failed invasion” category, including whether these may have represented transient reproducing populations, were collected off a visiting vessel (without mention of such), or represented recruits spawned from the biofouling on an arriving vessel. While these occurrences may be of interest in terms of prospective invasions, such records do not constitute new records for, nor additions to, the local biota, nor should they be treated as additions to the list of marine bioinvasions. Examples include the discovery of a single specimen of the Australasian barnacle Austrominius modestus on an experimental panel in Cape Town in 1949 (Sandison 1950, published in Nature in more halcyon days), admitted to the list of South African invasions in Robinson et al. (2020) and the Western Pacific ascidian Styela plicata, long admitted to the lists of Uruguayan biota, although based on a sole 1880s record (Traustedt 1883; Scarabino et al. 2018).

Failed invasions do not include the discovery of single or a few adult individuals of a species that likely are the result of direct release, whether accidental (such as the one-off occurrences of green crabs Carcinus maenas around the world, particularly in the 1800s Carlton and Cohen 2003) or intentional (such as the discovery of the Atlantic blue crabs Callinectes sapidus in Hawaii, Carlton and Eldredge 2009), in the absence of evidence of successful reproduction. Nevertheless, scattered one-time records of Carcinus around the world, including in tropical waters where they cannot live, were long included in inventories of where the green crab had invaded.

VB. Review of criteria for recognition of non-native species

Criteria for the often difficult task of distinguishing native from non-native species have been proposed and discussed by Lindroth (1957), Carlton (1979), Webb (1985), Chapman and Carlton (1994), Essl et al. (2018), Campbell et al. (2018) and others. Quell et al. (2021) have proposed a series of biological traits which may characterize some non-native marine invertebrates, but such traits do not function as criteria to distinguish native from non-native species. In Table 5 we define 14 criteria, provide caveats and critiques for each, and identify examples of non-native species fitting each criterion.

In outline these criteria are,

  1. (1)

    Prior absence in region of concern/interest: Absence from the historical, archeological, or recent fossil record

  2. (2)

    Global Biogeography: Globally disjunct distributions

  3. (3)

    Global Biogeography and Temporal History

  4. (4)

    Regional Biogeography: Highly restricted distribution

  5. (5)

    Regional Biogeography: History of geographic expansion

  6. (6)

    Allochthonous (exotic) evolutionary origin

  7. (7)

    Limited Natural dispersal potential for transoceanic and interoceanic colonization

  8. (8)

    Anthropogenic Dispersal Potential

  9. (9)

    Invasion Founder Effect: Reduced genetic variation

  10. (10)

    Invasion Founder Effect: Reduced morphological or physiological variation

  11. (11)

    Invasion Founder Effect: Reduced variation in reproductive strategies

  12. (12)

    Ecology: Predatory, symbiotic, commensal, or parasitic association with non-native species

  13. (13)

    Ecology: Prevalence or restriction to disturbed, anthropogenic habitats (including artificial substrates) often with concentrations of non-native species

  14. (14)

    Ecology: Rapid increase in population size

The application of these criteria is highly dependent on the taxon and location in question, the history and depth of regional biodiversity investigation, and the quality and quantity of biological, genetic, physiological, and ecological data available. Thus which, and how many, criteria are applied and applicable will vary considerably. As an example, if the well-known Asian crab Hemigrapsus sanguineus, a recognized invader of the North Atlantic Ocean, should be introduced and become established in Argentina, if correctly identified, then criterion (1) would be sufficient to recognize it as non-native to South America.

Robust evidence of prior absence, in particular, is one of the strongest criteria for recognizing a non-native species, especially for large, conspicuous species unlikely to have been overlooked for decades or centuries in well-studied areas. The Indo-Pacific octocoral Carijoa riisei’s appearance in the 1960s in the Hawaiian Islands is a classic example of a prominent, easily recognized species not detected in the Archipelago since scientific collections began in the 1800s. Although species can be missed in ecological and biodiversity surveys, no evidence suggests that C. riisei would fall into such a “missed” category. First thought to be found in Hawaii in 1972, it was later determined to have been photographed in Hawaii in 1966, leading Concepcion et al. (2010) to conclude that this 6-year gap “demonstrates that lack of precedence is a weak foundation for classifying” non-native species. Lack of precedence, in fact, remains one of the strongest foundations; it is not uncommon to discover that species were found by members of the public years before occurring in scientific samples (Carlton 2008). There is, further, thus no evidence that C. riisei is cryptogenic in the Hawaiian Islands, as suggested by Salimi et al. (2021).

An instructive lesson in the synergism between the ability to establish prior absence in a region, and enduring assumptions about the probability of natural colonization (dispersal) of species from a source area, is the history of the introduction of two marine species from the South American mainland to the Galápagos Islands (Carlton et al. 2019). A baseline barnacle survey was conducted in the Islands in 1964, and baseline crab surveys in the 1930s. In 1966 a large barnacle from the South American mainland, Megabalanus coccopoma, was discovered in fouling communities in the Galápagos, introduced by vessel traffic (Carlton et al. 2019). The edible mangrove crab Cardisoma crassum, common in Ecuador, was found in the wild on Santa Cruz Island in 1993, with local knowledge establishing that it was intentionally introduced that year (Carlton et al. 2019). Both species remain established today in the Islands. A critical lesson here is that had the barnacle been first found in the Galápagos in 1964 or earlier, or had the crab been introduced (without anyone knowing) intentionally half-a-century earlier, Galápagos biogeographers would assume that both species had naturally dispersed from the mainland. Other than these 2 species all remaining marine species that are found both on the mainland and in the Islands are considered natural colonists in the Galápagos (Carlton et al. 2019).

As Carlton et al. (2019) concluded, anthropogenic movements of marine species from South America to the Galápagos may be underestimated, as it would appear highly unlikely that the only two marine species to be transported accidentally or intentionally from the mainland to the Archipelago happen to be species that arrived since the 1950s. In fact, the distance between South America and the Galápagos, although less than 1000 km, may exceed the dispersal capabilities of many species, although such dispersal is consistently invoked.

Other species (Table 5) are recognized as probable introductions based solely on criterion (12). For some species, the application of various criteria may change over time and result in finer-scale resolution. For example, as noted earlier, Roux (1828) invoked prior absence (our criterion 1) to recognize that the littoral isopod Ligia exotica was not native to the Mediterranean, but it was work 90 years later (Hidalgo et al. 2018) that resolved its origin as the Indo-Pacific through criterion (9).

For a great many species, data for criteria 5, 9, 10, 11, and 14 may simply not be available without extensive historical study or laboratory and field research. For many soft-bodied species with no preservable hard parts, application of the archeological or recent fossil record is inapplicable. For very small, including microbial, inconspicuous and historically undersampled (or never sampled) species, criterion (1) may especially not be applicable, although the inability to establish prior absence has led in some circumstances to the application of the “smalls rule”—an assumption that small species, in general, are naturally very widely distributed (see Carlton 2009).

The above aside, whenever possible, as many applicable criteria should be mustered to bolster the case that a given species is or may be non-native; conversely, not all criteria must be met to do so (Bortolus et al. 2015; Miranda et al. 2018). For many species, the data may at this time be insufficient to resolve a species status; such species are thus cryptogenic.

VC. Retention in invasion assessments of species not recently reported

How long to continue to retain species not recently re-reported on inventories of non-native species is an unresolved question in invasion science. Ruiz et al. (2000) noted that the “population status was considered unknown for introductions with no records within the past 20–30 years.” Robinson et al. (2016) set aside records for South Africa for species not recorded “in the past 25 years.” For larger, well-known macroscopic species—for example, large gastropods and bivalves, or large crabs—no records for several decades may indeed suggest that the species is no longer present. Intensive expert searching in the same habitat in the same sites for species not reported for decades, regardless of species size, would also suggest that the species be removed from the register of currently recognized established invasions.

While the above scenarios provide evidence of absence, a fixed time period for confirming the population status for a great many cryptic, small, and especially taxonomically challenging species may inadvertently imply that there is a probability that the species is no longer present, when in fact there may have been no attempts to confirm such—nor, for many species, would it be expected that there would have been such searches. For example, a number of commensal and symbiotic ciliophorans were reported between the 1920 and 1940s from only introduced molluscan and crustacean hosts in San Francisco Bay, California (Carlton 1979), but there are no subsequent records. In this case, however, there is no evidence of absence, and as all the hosts in question remain in San Francisco Bay, there are no reasons to suspect that these potentially host-specific symbionts are absent.

We suggest that species that have not been specifically searched for again, and which are not likely to be encountered in sampling programs without a targeted search (accompanied by taxonomic expertise) be retained in invasion inventories (while noting that recent details of their distribution or population size are not known). This approach would both serve to restore species to lists and to potentially promote interest in focused searches. Environmental DNA (eDNA) or qPCR samples might serve to reveal the continued presence of certain non-native species not collected in recent years, although many cryptic and small taxa of concern may not have been sequenced.

VD. Type localities are not default native regions

A great many species of marine, freshwater, and terrestrial invaders were first described from a non-native region, with a number of these still remaining unknown elsewhere, but are nevertheless unquestionable introductions (Carlton 2009, and above in Section “III. Reassessment of the status of “native” species”).

Table 6 presents 15 recent (since 2005) examples of new species recognized at the time of their description as probable non-native species. That awareness is now increasing that non-native species can be both undescribed and unknown in their native regions is perhaps indicative of the generally greater recognition of bioinvasions but also perhaps the unsurprising scale of increasing human-mediated dispersal. These examples capture Western Europe and the Mediterranean, regions with deep and robust baselines that permit detection of novel macroscopic species, as well as North and South America, the Hawaiian Islands, and the Indian Ocean, and taxa as diverse as flatworms, sponges, cnidarians, worms, bryozoans, fish, and seaweeds. Of these few examples, only 2 have yet to be discovered in their homeland. We suggest that the spectacular occurrence of the previously undescribed red seaweed Chondria tumulosa in the Hawaiian Archipelago, although initially categorized as cryptogenic (Sherwood et al. 2020), is certainly an invasion and is not native to the Islands. Similarly, although not shown in Table 6, the amphipod Corophium urdaibaiense (Marquiegui and Perez 2006) found in the Bay of Biscay, France in 2004, and described from (and still only known in) France, was very unlikely to be overlooked in a country with a very long history of extensive study on these small crustaceans.

VI. Methods for establishing the timing of invasion records, and the importance of not using publication dates

The resolution of the date of first collection or detection may appear to be a matter of generally minor concern. However, a good deal of attention in invasion science, as well as invasion management, is focused on the pattern and pulse of invasions over time, and whether invasions can be related to, for example, the history of a particular vector (its beginning, its peak, and, perhaps, its demise), to environmental or other changes in the donating or receiving environment (Carlton 1989), or to other phenomena and processes. For example, Byers et al. (2015) employed data on “time since introduction” to examine whether there was a positive relationship between time since arrival and current (as of 2012) range size of marine invaders, and whether distributional equilibrium had been reached.

Here we identify methods by which to establish when a species was first discovered in a given region. Faced with apparently no or few recourses, many studies simply use the publication date of a paper to mark the first record of a species in a given region. Such a demarcation date is to be avoided if at all possible. The publication date of a record is often a poor substitute to establish invasion chronology, if the goal is to correlate invasion patterns with vectors, environmental changes, and post-invasion range expansion. Instead, time and effort should be invested in discovering if data may be available by which to establish when the species was in fact first detected or collected.

We distinguish here resolving the date of first collection from when a species may have first become established as reproducing populations. Many species may have arrived and become established in a given region years or decades (and in some cases centuries) before being first collected. This said, the known timing of relevant vector activity may help establish potential invasion timing baselines. For example, Hartman (1936) made many of the first collections of Atlantic polychaete worms in San Francisco Bay, California in the 1930s, many of which species were likely introduced beginning in the 1870s with the importation of Atlantic oysters (Carlton 1979). In this case, the first detection of these species is an artifact of the arrival of a taxonomic specialist (see also discussion in Macan 1974), and thus Cohen and Carlton (1998) elected to not include Hartman’s 1930s records of Atlantic worms in San Francisco Bay in calculating invasion rates of the Bay, since their introduction may have occurred as much as 50 or more years earlier. In some regions, the scale of collection biases may be difficult to detect: Robinson et al. (2020) note that only 4 non-native species were collected in South Africa prior to 1900 (none recognized at the time as introductions), whereas between 1900 and 2000 another 65 non-native species were recorded, although many of the latter (such as hydroids, polychaetes, isopods, amphipods, bryozoans, and others) may have arrived in the 1800s if not earlier. Thus the commensal folliculinid protist Mirofolliculina limnoriae, an established non-native species in South Africa whose host is the non-native wood-boring isopod Limnoria tripunctata, while first noticed in Cape Town Harbor in 2008, may well have arrived centuries ago. A disconnect between a probable era of introduction and the first date of detection appears not to be recognized or acknowledged in many studies.

VIA. Methods for determining earlier dates of collection or detection

We suggest several avenues, often low-hanging fruit, by which to avoid using publication dates as dates of first detection.

Accessing station data

Many species reports in taxonomic literature provide only a station number (of an expedition or a collector) with no collection date. Station data should then be sought in other sources, although these may require considerable mining to discover. For example, Osburn (1950, 1953) reported the Indo-West Pacific bryozoan Synnotum aegyptiacum and the North Atlantic bryozoan Anguinella palmata, respectively, from Peru based upon collections made at Velero station 847–38. Osburn did not provide a reference for obtaining station data details. Velero station data can be recovered from Fraser (1943), producing a 1938 collection date for this station.

Museum specimens and museum records

Museum specimens may provide unpublished collection data. We offer several compelling examples:

  1. (1)

    The ascidian Polyandrocarpa zorritensis was described from Peru by Willard Van Name in 1931 (Van Name 1931), without a collection date; it is now regarded as introduced to South America (Carlton et al. 2019). The type material was deposited at the Yale University Peabody Museum of Natural History, whose on-line catalogue (accessed October 2021) provides a collection date of 1866, setting the record back 65 years.

  2. (2)

    The Australasian ascidian Asterocarpa humilis was reported as a new introduction in Chile in 2000, based upon 1997 collections (Clarke and Castilla 2000). However, a synonym of A. humilis is Cnemidocarpa robinsoni Hartmeyer, 1916 (Turon et al. 2016) described from Robinson Crusoe Island, Chile; the type material was said by Hartmeyer to be deposited in the Natural History Museum in Berlin. Inquiry of the Museum revealed that the specimens were still extant, but bore only a collector’s name (“Plate”) and no collection date or further data (Carsten Lüter, personal communication, 2021). Plate is the German zoologist Ludwig Hermann Plate who spent two months on Robinson Crusoe Island in 1894 (Plate 1896), setting the record for this ascidian back 103 years in Chile.

  3. (3)

    The Western Pacific amphipod Incisocalliope derzhavini was first identified in San Francisco Bay, California in 1971 (Chapman 1988, as Parapleustes derzhavini). However, study of preserved hydroids collected in San Francisco Bay and held at the Smithsonian Institution and the Yale Peabody Museum of Natural History revealed that I. derzhavini had been in San Francisco Bay since at least 1904 (Chapman 1988), setting the record back 67 years.

  4. (4)

    The North Atlantic amphipod Monocorophium acherusicum was first identified from Yaquina Bay, Oregon, in 1987 (Chapman 1988). Examination of epibiota on shells of the Atlantic oyster Crassostrea virginica from Yaquina Bay in the Smithsonian Institution (received from F. L. Washburn in 1905, revealed the presence of (dried) specimens of M. acherusicum (Carlton 1979), setting the record back 66 years. Atlantic oysters were imported from the Atlantic coast and planted in Yaquina Bay between 1897 and 1899 (Washburn 1900).

Discussion

The study of marine bioinvasions is in its infancy: the field did not exist prior to the 1970s. This young field, much of which work began in earnest only in the 1990s, would thus not be expected to catch up quickly with more than 500 years of invasion history. We identify in this essay, and suggest means by which to address, critical gaps in the global resolution of the diversity and history of marine bioinvasions. These gaps have arisen in part by profound differences in the range of taxonomic groups covered in regional invasion assessments, and in part limited rare attempts to question the presumed nativeness of many species that may have been introduced over prior centuries. Coupled with this are largely coarse-grained attempts to establish invasion timing, such that frequently the date of publication of a paper is used as the first record, or the date of collection is taken as the date of introduction, even when evidence may be at hand that the latter is an artifact of specialized investigation of a particular taxonomic group.

As we posed at the start of this essay, without taking into account differential levels of exploration and study, it is difficult to address fundamental questions in invasion science such as whether the strikingly different numbers of marine invasions reported around the world reflect varying propagule pressures (as mediated by many pulsating vectors over time), the differential susceptibility or resistance of communities and habitats to the colonization of new species (“invasibility”), to the characteristics of the potentially invading species (“invasiveness”), or to some combinations of these.

We close this essay by exploring current approaches to understanding the scale of marine bioinvasions in tropical waters.

Wells and Bieler (2020) argue that the consistently low numbers of non-native marine species reported from Guam, Pilbara (Western Australia), Singapore, and southern Florida, as well as work comparing tropical and temperate Australian ports, “strongly suggests that the relative paucity of non-indigenous marine species in the studied environments is not due to a lack of study or inability to detect NIMS [non-indigenous marine species] caused by poor taxonomic knowledge, but rather by increased biological interactions in a biodiverse environment.” The potentially lower number of marine invasions in tropical waters have been discussed for some years (Hutchings et al. 2002). Relative to the goals of this essay, we address this conclusion briefly, and specifically probe Singapore as a model system.

None of the studies noted by Wells and Bieler address the large number of marine taxa that have never been studied relative to their biogeographic history, nor do these studies attempt a re-assessment of the history of ostensible native species in well-known taxa, nor do they plumb the depths of the potential scale of cryptogenic taxa. For Singapore, Wells et al. (2019), in listing 22 non-native marine species, noted that there had been “no comprehensive non-indigenous marine survey in Singapore,” but cited Jaafar et al. (2012) as an “extensive” study. In turn, Jaafar et al. (2012) state that they conducted “an exhaustive review to determine the status of marine and estuarine non-indigenous species in Singapore,” which review produced the names of three non-native species (two bivalves and one worm); they did not question the native designations of thousands of marine species recorded for Singapore. Wells et al. (2019), while acknowledging that the suggestion of Yeo et al. (2011) that “part of the explanation (for the low numbers of non-native species) may be that species were introduced in centuries before marine studies began” was “soundly based,” nevertheless reached the conclusion that there was “strong evidence” for the relatively lower numbers of non-native species in the Indo-West Pacific tropics, and that these numbers are “neither due to a lack of knowledge nor an absence of sampling.”

The phylum Arthropoda is represented in the current non-native species list for Singapore (Wells et al. 2019) by one species of amphipod and two species of barnacles; the phylum Annelida is represented by one species of serpulid tubeworm. No Singapore hydroids, sponges, or ascidians, among many other taxonomic groups, are considered relative to their non-native status. The situation is reminiscent of Angola, where marine Arthropoda invasions are represented solely by two species of barnacles (Pestana et al. 2017), although certain taxonomic groups omitted in the Singapore treatment, including sponges, hydroids, and ascidians, are recognized as non-native in Angola. Critically, Yeo et al. (2011) listed 127 (sic) species of crabs in Singapore that were cryptogenic—i.e., that could not be confirmed as native. Wells et al. (2019) acknowledge that analyzing the histories of 127 crab species would require a good deal of work. Wells et al. (2019) further argued that “taxonomists would have been looking for non-indigenous marine species in their marine studies but have found very few.”

In fact, taxonomists involved in survey work rarely if ever look for non-indigenous species: they seldom address the critical historical biogeographic consideration that now widespread species may in fact not have been so, prior to human activity in the oceans. Further, in our experience, taxonomists identifying species not previously recorded in a location often do not consider whether such taxa might be non-native, but default to concluding that a previously undetected species may have been overlooked, or rare, or mis-identified earlier by non-specialists. Thus, taxonomists working in Singapore (and elsewhere in tropical waters) may be aware of non-native species that are newly reported but rarely retrospectively question the distributions of the species they identify, which distributions, for hundreds if not thousands of species, are often said to range from the Red Sea and East Africa to the South China Sea and southern Japan and to the Hawaiian Islands. This lack of questioning is influenced in part by the fundamental assumption that such distributions are natural, despite a thousand and more years of potential mixing by coastal vessels transporting hull fouling organisms and ballast through the South Seas, the Central Pacific, and the Indian Ocean.

Given the lack of historical perspective, given the large number of species-rich taxonomic groups remaining largely or entirely unstudied, and given the number of regions around the world remaining largely uninvestigated, producing global “heat maps” documenting the number of marine invasions by country and region may overestimate the quality of the available data. Seebens and Kaplan (2022), using filtered (“cleaned”) data from GBIF (Global Biodiversity Information Facility) and OBIS (Ocean Biodiversity Information System), produced a color-coded global map of the number of “recorded alien taxa” for marine and brackish taxa, scaled from 0 to > 140 taxa in intervals of 20. Broad swaths of the globe are plotted as having 0–50 invasions, with Seebens and Kaplan (2022) acknowledging that “the distribution of records is highly biased toward a few well-sampled regions such as Europe, North America, Australia, and New Zealand.” However, simply reporting more invasions than elsewhere does not mean that these regions are well-sampled or well-understood, suggesting that, at this time, global maps of marine invasions may be better presented as the relative scale of selective study of some non-native species, rather than as the “number of recorded” non-native species. Global maps can be interpreted or used for many purposes outside academia, and may have a strong influence in setting priorities in research, with funding agenda, or in management decisions. Global maps, if used at all, should be accompanied by strong cautions about weaknesses and the scale of missing data in order to avoid misinterpretations about the status of bioinvasions.

Thus, the reporting of more non-native species in some regions of the world than others should not be interpreted to mean that better-studied areas have addressed the lacunae identified here. While Seebens and Kaplan (2022) correctly commented that “distinctly higher numbers of marine alien taxa than reported can be expected for most marine ecoregions,” we cannot yet state, as they do, “except for European and North American coastal waters”—waters that we judge to be critically undersampled for invasions in a vast array of taxa and over deeper historical time.

Without thorough and vetted assessments of the scale of invasions in marine ecosystems over time, our ability to look deep into marine community ecology and evolution and into both ecological and evolutionary biogeography is limited, as is our ability to frame robust invasion management plans. At the beginning of the twenty-first century, science and management may be better served by recognizing the scale of what we may be overlooking, rather than conclude that the data are sufficient to render strong conclusions about where invasions have or have not been successful. This approach may more powerfully set the stage for anticipating, if not predicting, the scale of future invasions in the world’s oceans.

Table 1 Phyletic dversity to be considered for marine bioinvasions assessments
Table 2 Habitat diversity to be considered for marine bioinvasions assessments
Table 3 Examples of non-native marine species inventories conducted largely since 2000, and whether the biogeographic status of “native” species was re-assessed
Table 4 Examples of records of known or probable non-native species “hidden” in taxonomic or invasions literature
Table 5 Criteria for recognizing non-native (introduced and range-expanding) species
Table 6 Recent examples of invertebrates recognized at the time of description as non-native, and therefore not native in their type localities