Introduction

The genus Phytophthora has had profound impacts on human history by causing agriculturally and ecologically important plant diseases (Erwin & Ribeiro 1996). Among the most notorious Phytophthora species is P. infestans, cause of the late blight disease, which was the primary cause of the Irish potato famine from 1845 to 1852 in which approximately one million people died and 1.5 million emigrated from Ireland (Turner 2005). Another example is the sudden oak death pathogen, P. ramorum, that has killed millions of coast live oak, tanoak and Japanese larch trees, and has permanently altered the forest ecosystems in California and Oregon, USA (Goheen et al. 2002, Rizzo et al. 2002, Rizzo et al. 2005). Other species, such as P. cinnamomi, P. nicotianae, and P. sojae, can also cause highly destructive plant diseases (Erwin & Ribeiro 1996). The impact caused by Phytophthora species has continued to increase with the emergence of new pathogens and diseases. The number of species known in the genus has doubled during the past decade due to extensive surveys in previously unexplored ecosystems such as natural forests (Jung et al. 2011, 2017, Rea et al. 2010, Reeser et al. 2013, Vettraino et al. 2011), streams (Bezuidenhout et al. 2010, Brazee et al. 2017, Reeser et al. 2007, Yang et al. 2016), riparian ecosystems (Brasier et al. 2003a, 2004, Hansen et al. 2012), and irrigation systems (Hong et al. 2010, 2012, Yang et al. 2014a, b). The total number of formally named species in the genus was about 58 in 1996 (Erwin & Ribeiro 1996), but now is more than 150. In addition, some provisionally or informally named species are also expected to be formally described in the near future.

A sound taxonomic system is foundational for correctly identifying Phytophthora species and safeguarding agriculture, forestry, and natural ecosystems. Traditionally, taxonomy of the genus was based on morphological characters. A fundamental morphology-based classification of Phytophthora species was established by Waterhouse (1963) who classified the species into six groups based on the morphology of sporangia, homothallism, and configuration of antheridia. However, plasticity in morphological characters amongst isolates of individual species is significant, so is homology or homoplasy among different species. For example, isolates of P. constricta (Rea et al. 2011), P. gibbosa (Jung et al. 2011), P. lateralis (Kroon et al. 2012), P. mississippiae (Yang et al. 2013), and P. multivesiculata (Ilieva et al. 1998) all produce a mixture of semi-papillate and non-papillate sporangia. Many non-papillate species recovered from irrigation water such as Phytophthora hydropathica (Hong et al. 2010) and P. irrigata (Hong et al. 2008) were morphologically inseparable from P. drechsleri, while sequence analyses demonstrated that they are distinct species. Also, production of many morphological structures and physiological features needs specific environmental conditions, while observation of these features requires substantial training and expertise. Difficulty in obtaining important morphological data can impair accurate species identification.

With the advent of DNA sequencing, the taxonomic concept for the genus has evolved from morphology to molecular phylogeny-based (Blair et al. 2008, Cooke et al. 2000, Kroon et al. 2004, Lara & Belbahri 2011, Martin et al. 2014, Martin & Tooley 2003, Robideau et al. 2011, Villa et al. 2006). In particular, the availability of whole genome sequences from P. sojae, P. ramorum (Tyler et al. 2006) and P. infestans (Haas et al. 2009) enabled the identification of genetic markers useful for multi-locus phytogenies (Blair et al. 2008).

Cooke et al. (2000) developed the first molecular phylogeny for the genus by analyzing sequences of the internal transcribed spacer region (ITS) of 51 species. Kroon et al. (2004) constructed a phylogeny based on sequences of four nuclear and mitochondrial genes of 48 species, and Blair et al. (2008) produced a sophisticated phylogeny based on sequences of seven nuclear genetic markers. That multi-locus phylogeny divided 82 Phytophthora species into 10 phylogenetically well-supported clades. Martin et al. (2014) analyzed sequences of seven nuclear and four mitochondrial genes of 90 formally named and 17 provisional species and provided phylogenies including 10 clades, almost identical to that of Blair et al. (2008), except that P. quercina and P. sp. ohioensis were excluded from clade 4 and grouped into a potentially new clade.

A comprehensive molecular phylogeny is required to understanding the evolution of Phytophthora species. Although discordance has been found between the molecular phylogeny and the morphology-based taxonomy (Cooke et al. 2000, Ersek & Ribeiro 2010), correlations have been observed between molecular phylogenies and individual morphological and physiological traits. Recent studies indicated that species in individual clades or subclades are mostly identical in sporangial papillation, and optimum and maximum growth temperatures (Cooke et al. 2000, Kroon et al. 2012, Martin et al. 2012, Yang 2014). However, there was limited to no correlation between phylogeny and the morphology of sexual organs, such as antheridial configuration (Cooke et al. 2000, Kroon et al. 2012, Martin et al. 2012, Yang 2014). These studies have implied that divergence in sporangial morphology and variation in environmental specialization may be the keys in the evolutionary history of Phytophthora species. Nevertheless, these hypotheses need to be further tested and the exact evolutionary history of the genus Phytophthora warranted more investigation.

In this study, an expanded phylogeny, including more than 180 Phytophthora taxa, many not included in any previous phylogeny, was constructed. Sequences of seven nuclear genetic markers were used for construction of the phylogeny. In light of this phylogeny, ancestral state reconstructions were conducted on the sporangial papillation of Phytophthora species. Important evolutionary divergence events and associated changes in the sporangial morphology of Phytophthora species are discussed.

Materials and Methods

Isolate selection

A total of 376 Phytophthora isolates representing 142 described and 43 provisionally named species, plus one isolate of each Elongisporangium undulatum (basionym: Pythium undulatum), Halophytophthora fluviatilis, and Phytopythium vexans (basionym: Pythium vexans) as outgroup taxa were included (Table 1). These included 114 ex-types (Table 2). Also included were 164 authentic isolates that were designated as representative isolates by the originators of the respective species names (Table 1). The majority of these isolates were provided by the originators of the respective species, while the rest were purchased from the Westerdijk Fungal Biodiversity Institute (CBS), Utrecht, The Netherlands.

Table 1 Information regarding isolates used in this study. GenBank accession numbers are listed in Table S1.
Fig. 1
figure 1

A phylogeny for the genus Phytophthora based on concatenated sequences of seven nuclear genetic markers. Topology and branch lengths of maximum likelihood analysis are shown. Bootstrap values for maximum likelihood and maximum parsimony, and Bayesian posterior probabilities (percentages) are indicated on individual nodes and separated by a forward slash. An asterisk is used in place of nodes with unambiguous (100 %) support in all three analyses. A dash is used in place of a topology from an analysis ambiguous to the other two analyses and these sets of numbers with ambiguity in one analysis are also highlighted in red. Detailed structures of clades 2, 6, 7, and 9 are shown in Figs 25, respectively. Species represented by ex-types and authentic isolates are written in brown and blue, respectively. Branches indicating three hypothesized evolutionary paths with all species producing papillate or semi-papillate sporangia are drawn in red or orange, respectively. Scale bar indicates number of substitutions per site.

Fig. 2
figure 2

Structure of Phytophthora clade 2 in a genus-wide phylogeny for the genus Phytophthora based on concatenated sequences of seven nuclear genetic markers. Topology and branch lengths of maximum likelihood analysis are shown. Bootstrap values for maximum likelihood and maximum parsimony, and Bayesian posterior probabilities (percentages) are indicated on individual nodes and separated by a forward slash. An asterisk is used in place of nodes with unambiguous (100 %) support in all three analyses. A dash is used in place of a topology from an analysis ambiguous to the other two analyses and these sets of numbers with ambiguity in one analysis are also highlighted in red. Species represented by ex-types and authentic isolates are written in brown and blue, respectively. Scale bar indicates number of substitutions per site.

Fig. 3
figure 3

Structure of Phytophthora clade 6 in a genus-wide phylogeny for the genus Phytophthora based on concatenated sequences of seven nuclear genetic markers. Topology and branch lengths of maximum likelihood analysis are shown. Bootstrap values for maximum likelihood and maximum parsimony, and Bayesian posterior probabilities (percentages) are indicated on individual nodes and separated by a forward slash. An asterisk is used in place of nodes with unambiguous (100 %) support in all three analyses. A dash is used in place of a topology from an analysis ambiguous to the other two analyses and these sets of numbers with ambiguity in one analysis are also highlighted in red. Species represented by ex-types and authentic isolates are written in brown and blue, respectively. Scale bar indicates number of substitutions per site.

Fig. 4
figure 4

Structure of Phytophthora clade 7 in a genus-wide phylogeny for the genus Phytophthora based on concatenated sequences of seven nuclear genetic markers. Topology and branch lengths of maximum likelihood analysis are shown. Bootstrap values for maximum likelihood and maximum parsimony, and Bayesian posterior probabilities (percentages) are indicated on individual nodes and separated by a forward slash. An asterisk is used in place of nodes with unambiguous (100 %) support in all three analyses. A dash is used in place of a topology from an analysis ambiguous to the other two analyses and these sets of numbers with ambiguity in one analysis are also highlighted in red. Species represented by ex-types and authentic isolates are written in brown and blue, respectively. Scale bar indicates number of substitutions per site.

Fig. 5
figure 5

Structure of Phytophthora clade 9 in a genus-wide phylogeny for the genus Phytophthora based on concatenated sequences of seven nuclear genetic markers. Topology and branch lengths of maximum likelihood analysis are shown. Bootstrap values for maximum likelihood and maximum parsimony, and Bayesian posterior probabilities (percentages) are indicated on individual nodes and separated by a forward slash. An asterisk is used in place of nodes with unambiguous (100 %) support in all three analyses. A dash is used in place of a topology from an analysis ambiguous to the other two analyses and these sets of numbers with ambiguity in one analysis are also highlighted in red. Species represented by ex-types and authentic isolates are written in brown and blue, respectively. Scale bar indicates number of substitutions per site.

Table 2 Numbers of species and ex-types included in phylogenies for the genus Phytophthora in previous studies and this study.

DNA extraction

To extract genomic DNA (gDNA), an approximately 5 × 5 mm culture plug of each isolate was taken from the actively growing area of a fresh culture. This was then grown in 20 % clarified V8 broth (lima bean broth for growing a P. infestans isolate 27A8) at room temperature (ca. 23 °C) for 7–14 d to produce a mycelial mass. The mass was then blot-dried using sterile tissue paper and then lysed in liquid nitrogen or using a FastPrep®-24 system (MP Biomedicals, Santa Ana, CA). gDNA was extracted using the DNeasy® Plant Mini kit (Qiagen, Valencia, CA) or the Maxwell® Plant DNA kit in combination with a Maxwell® Rapid Sample Concentrator (Promega, Madison, WI).

DNA amplification and sequencing

A set of primers for seven genetic markers were used for DNA amplification including 60S Ribosomal protein L10 (60S), beta-tubulin (Btub), elongation factor 1 alpha (EF1α), enolase (Enl), heat shock protein 90 (HSP90), 28S ribosomal DNA (28S), and tigA gene fusion protein (TigA) as indicated in Blair et al. (2008). PCR reaction mixtures were prepared with the Takara Taq DNA polymerase (Takara Shuzo, Shiga, Japan) according to the manufacturer’s instructions. The PCR cycling protocol was the same as indicated by Blair et al. (2008), except that the Eppendorf® Mastercycler® Pro thermal cycler (Eppendorf, Hamburg) was used in this study. All PCR products were evaluated for successful amplification using agarose gel electrophoresis. Unsuccessful PCR amplifications were repeated using a modified protocol to attempt successful amplifications by optimizing annealing temperature using gradient PCR (typically with lower annealing temperatures) or using the GoTaq® Flexi DNA Polymerase (Promega, Madison, WI) PCR mixture system.

Prior to sequencing, excess primer and dNTPs were removed from successful PCR products with shrimp alkaline phosphatase and exonuclease I (USB Catalog # 70092Y and 70073Z). One unit of each enzyme was added to 15 µL PCR product, incubated at 37 °C for 30 min, followed by heat inactivation at 65 °C for 15 min. Sequencing was performed with both amplifying primers as well as internal primers, if any, for individual genetic markers at the University of Kentucky Advanced Genetic Technologies Center (Lexington, KY). Derived sequencing files were visualized with FinchTV version 1.4.0 (Geospiza, Seattle, WA). Sequences of each isolate with all primers for individual genetic markers were aligned with Clustal W (Larkin et al. 2007) and edited manually to correct obvious sequencing errors and code ambiguous sites according to the International Union of Pure and Applied Chemistry (IUPAC) nucleotide ambiguity codes to produce a consensus sequence. All sequences produced in this study have been deposited in GenBank (Supplementary Table 1).

Among 379 isolates (including three isolates of the outgroup taxa) in the following phylogenetic analyses, all seven phylogenetic markers from 321 isolates were sequenced in this study. Sequences of all markers from 49 isolates by Blair et al. (2008) were also included in the analyses. Additionally, for seven isolates, sequences of one or two genes were newly produced in this study while the remaining gene sequences were from Blair et al. (2008). Sequences from P. lilii (CBS 135746) and P. sp. ohioensis (ST18-37) were obtained from Rahman et al. (2015) and from the Phytophthora Database (Park et al. 2013), respectively.

Phylogenetic analyses

Concatenated sequences of all isolates were aligned using Clustal X version 2.1 (Larkin et al. 2007). The alignment was edited in BioEdit version 7.2.5 (Hall 1999) to trim aligned concatenated sequences to an equal size and set missing data to question marks. The edited alignment was then analyzed in jModelTest version 2.1.7 (Posada 2008) to select the most appropriate model for the following phylogenetic analyses. Maximum likelihood (ML) analysis was performed using RAxML version 8.2.0 (Stamatakis 2014) with the selected model and 1000 bootstrap replicates. Maximum parsimony (MP) analysis was conducted using PAUP version 4.0a147 (Swofford 2002) with 1000 bootstrap replicates. Bayesian analysis (BA) was performed using MrBayes version 3.2.6 (Ronquist et al. 2012) for two million generations with the selected model. Phylogenetic trees were viewed and edited in FigTree version 1.4.2. Alignment and phylogenetic trees from all methods have been deposited in TreeBASE (S19303).

Ancestral character state reconstructions of sporangial papillation

Information on the sporangial papillation of individual species was compiled from the literature (Erwin & Ribeiro 1996, Gallegly & Hong 2008, Kroon et al. 2012, Martin et al. 2012) with emphasis given to their respective original descriptions (Table 1). Both likelihood and parsimony ancestral state reconstructions were performed on the ML tree from the phylogenetic analyses using Mesquite version 3.03 (Maddison & Maddison 2017).

Results

Sequences, alignment, and phylogenetic model

PCR amplification and sequencing was successful for almost all isolates and seven genetic markers. Failure to obtain sequences only occurred occasionally for a few isolates, such as the EF1α gene of Phytophthora bilorbang (61G8), the Enl gene of P. macrochlamydospora (33E1, 31E9, and 33D5), and P. quininea (45F2), and TigA of P. megasperma (62C7) (Supplementary Table 1). These failures were set as missing data in the alignment. After trimming, each isolate was represented by an 8435-bp concatenated sequence in the alignment including gaps and missing data. This included 496 bp for 60S, 1136 bp for Btub, 965 bp for EF1α, 1169 bp for Enl, 1758 bp for HSP90, 1270 bp for 28S, and 1641 bp for TigA (TreeBASE S19303). The general time reversible nucleotide substitution model with gamma-distributed rate variation and a proportion of invariable sites (GTR+I+G) was identified by jModelTest as the most appropriate model for the phylogenetic analyses.

An expanded phylogeny including 10 clades and basal taxa

The three phylogenetic analysis methods, including ML, MP, and BA analyses (TreeBASE S19303), resulted in similar tree topologies. The topology and branch lengths of the ML inference are shown in Fig. 1. The monophyly of each of the previously recognized 10 clades was generally well supported with a few exceptions. Specifically, all clades except for clade 4 were highly supported by > 95 % bootstrap values in ML analysis and 100 % posterior probability (PP) in BA analysis (Fig. 1). Clades 1–3, 5, 7, and 10 were also highly supported by > 95 % bootstrap values in the MP analysis (Fig. 1). However, clades 6, 8, and 9, were only moderately supported with bootstrap numbers of 68, 61, and 52 in the MP analysis, respectively (Fig. 1).

As nearly half of all taxa included in this phylogeny were recently described, all clades in this phylogeny are expanded here to various extents compared to previously published phylogenies. The general structure of clades 1, 3, 5, 8 and 10 remained as previously assigned by Blair et al. (2008) and Martin et al. (2014) with additions of new species. For example, clade 1 was divided into three well-supported subclades and P. nicotianae was placed basal to subclades 1b and 1c (Fig. 1). Clade 8 was divided into four generally well-supported subclades, except P. stricta, which was placed basal to all clade 8 species (Fig. 1). New subclades were assigned to clade 2 (Fig. 2), clade 6 (Fig. 3), clade 7 (Fig. 4) and clade 9 (Fig. 5).

Several species were placed basal to other species in their respective clades. First, the cluster of P. quercina and P. sp. ohioensis was placed basal to other species of clade 4 in all three analyses. The bootstrap supports of the ML and MP analyses, and PP (percentage) for the separation of this cluster from that of P. alticola, P. arenaria, P. megakarya, P. palmivora, and P. quercetorum in clade 4 were only 48, 78, and 84, respectively (Fig. 1). Second, P. lilii was excluded from all known clades; it was placed basal to clades 1–5 and 7 (Fig. 1). Third, in clade 6, bootstrap support for the ML and MP analyses, and PP for all species except P. asparagi and P. sp. sulawesiensis were 100/100/100 (Fig. 3). This set of support numbers decreased to 99/92/100 when P. sp. sulawesiensis was included, and to 100/68/100 when further including P. asparagi (Fig. 3). Fourth, the support numbers for clade 8 species excluding P. stricta was 100/100/100, but 96/61/100 when P. stricta was included (Fig. 1). Fifth, all papillate species in clade 10 (Table 1) formed a well-supported main cluster, while two more recently described non-papillate species, P. gallica and P. intercalaris, were placed basal to the main cluster (Fig. 1).

New subclades in clades 2, 6, 7, and 9

(a) Clade 2

In addition to the previously recognized subclades 2a and 2b, many species, such as P. acerina, P. capensis, P. citricola, P. multivora, P. pachypleura, P. plurivora, and P. pini in the commonly referred to “Phytophthora citricola-complex” defined a new subclade 2c (Fig. 2). Furthermore, P. bisheria, P. frigida, and P. elongata formed new subclade 2d and the cluster of P. multivesiculata and P. taxon aquatilis formed new subclade 2e, with maximum support values in each case (Fig. 2).

(b) Clade 6

Subclade 6a included P. gemini, P. humicola, P. inundata, P. rosacearum, P. sp. personii, P. sp. 48H2, P. sp. 62C9 and P. taxon walnut. The cluster of P. rosacearum and P. taxon walnut could not be separated from that represented by P. gemini with only moderate support values for separation (82/61/100) (Fig. 3). Isolates 62C9 and 48H2, belonging to two new species, had ambiguous placements within subclade 6a among the three analyses (Fig. 3). With approximately 20 species newly included in the present phylogeny, the previously recognized “P. megasperma-P. gonapodyides complex” (Brasier et al. 2003a), subclade II of clade 6 (Jung et al. 2011), or subclade 6b (Kroon et al. 2012) expanded and its separation from subclade 6a was well-supported by 100/100/100 values (Fig. 3). Within subclade 6b, separation of the cluster of P. bilorbang, P. lacustris, and P. riparia from the other subclade 6b species was highly supported by 97/94/100 (Fig. 3), indicating that these three species may define a new subclade, although this is not done in this study. Phytophthora sp. sulawesiensis was placed basal to other clade 6 species except for P. asparagi, while P. asparagi was basal to all other species in clade 6 (Fig. 3). Phytophthora asparagi was previously assigned as subclade 6c (Kroon et al. 2012) and subclade III of clade 6 (Jung et al. 2011); considering that the support value of MP analysis was only moderate (68 %) when this single taxon was included (Fig. 3), this previous assignation as a subclade was not adopted here. In addition, in order to be consistent with subclade names in other clades, subclades 6a and 6b were used here instead of subclades I and II by Jung et al. (2011).

(c) Clade 7

Four subclades were distinguished in clade 7. Separation of the previously assigned subclades 7a and 7b was only moderately supported by values 71/56/100 (Fig. 4). The general structure of subclade 7a remained the same even with the addition of seven new taxa. Six of these new species, including P. attenuata, P. flexuosa, P. formosa, P. intricata, P. ×heterohybrida, and P. ×incrassata were recently recovered from forest soils and streamwater in Taiwan (Jung et al. 2017). On the other hand, P. cinnamomi and P. parvispora were separated from subclade 7b. They, along with a provisional species, P. sp. ax from Virginia, USA (Table 1), formed a distinct new subclade 7c (Fig. 4). The new subclade 7d, including two recently described species from Japan (Rahman et al. 2014b), P. fragariaefolia and P. nagaii, was placed basal to other subclades in clade 7 (Fig. 4).

(d) Clade 9

The split of clade 9 into two subclades 9a and 9b was highly supported in ML (98 %) and BA (100 %) analyses and moderately supported in the MP (52 %) analysis (Fig. 5). However, monophyly was highly supported for subclade 9b (100/100/100) but not for subclade 9a (44/-/95) (Fig. 5). Within subclade 9a, three monophyletic clusters were formed: 9a1, 9a2, and 9a3. However, support for the separation of these three clusters was moderate or ambiguous. In particular, the MP results did not produce any consistent separation of the three clusters (Fig. 5). Cluster 9a1 included many recently described high-temperature tolerant species, such as P. aquimorbida, P. chrysanthemi, P. hydropathica, P. macilentosa, P. parsiana, and P. virginiana). The cluster of P. macrochlamydospora (two lineages with two isolates in each lineage, Table 1) and P. quininea constituted 9a2 (Fig. 5). The cluster of two other high-temperature tolerant species P. insolita and P. polonica constituted 9a3 (Fig. 5). The well-supported cluster of P. captiosa, P. constricta, and P. fallax was assigned as subclade 9b (Fig. 5).

Evolutionary history of sporangial papillation inferred from ancestral character state reconstructions

Sporangial papillation of individual species is indicated in Table 1 and Fig. 6. Due to the size of the cladograms, clusters including species with the same sporangial papillation within each (sub)clade were compressed in Mesquite. Both likelihood and parsimony methods suggested that non-papillate is the progenitor state of Phytophthora species, and that semi-papillate and papillate types were derived from the non-papillate. The analyses indicated three major clusters of semi-papillate and (or) papillate species diverged from the non-papillate ancestors. First, species in clades 1 to 5 (semi-papillate or papillate) diverged from non-papillate species in clade 7 and P. lilii (Fig. 6). Second, species in subclades 8b to 8d (semi-papillate) diverged from non-papillate subclade 8a species (Fig. 6). Third, papillate clade 10 species including P. boehmeriae, P. gondwanensis, P. kernoviae, and P. morindae diverged from the non-papillate P. gallica and P. intercalaris (Fig. 6). Several species such as P. macrochlamydospora, P. mississippiae, P. gibbosa, and P. constricta also evolved to produce partially semi-papillate sporangia (Fig. 6).

Fig. 6
figure 6

Ancestral state reconstructions of sporangial papillation for the genus Phytophthora based on likelihood (left cladogram) and parsimony (right cladogram). Trace character history analyses were performed on the maximum likelihood phylogeny in Mesquite. Clusters including species of uniform sporangial papillation within individual (sub)clades were compressed in Mesquite.

Discussion

Here we presented an expanded phylogeny for the genus Phytophthora, encompassing 142 formally named and 43 provisionally recognized species (Table 2). In addition to this comprehensive coverage, this expanded phylogeny features over 1500 signature sequences generated from 278 ex-type and authentic isolates of 162 Phytophthora taxa (Supplementary Table 1). Furthermore, this study provided new insights into the evolutionary history of sporangial papillation in Phytophthora.

The expanded phylogeny provides a sound taxonomic framework for this agriculturally and ecologically important genus. One hundred and fourteen ex-types were included, representing 80 % of the 142 formally named species in this phylogeny. The majority of the 29 species not represented by ex-types, such as P. gonapodyides, P. infestans, P. meadii, P. mexicana, and P. nicotianae, were described long ago without designation of an ex-type culture. Likewise, almost all the 43 provisional species in this phylogeny were represented by authentic isolates from the originators of the respective species (Table 1 and Supplementary Table 1). This new framework will facilitate identification of new taxa in the future. As the genus continues to rapidly expand, some recently described species were not included in this study: P. mekongensis in subclade 2a (Puglisi et al. 2017), P. amaranthi in subclade 2b (Ann et al. 2016), P. boodjera in clade 4 (Simamora et al. 2015), P. chlamydospora in subclade 6b (Hansen et al. 2015), P. uniformis (basionym: P. alni subsp. uniformis) and P. ×multiformis (basionym: P. alni subsp. multiformis) in subclade 7a (Brasier et al. 2004, Husson et al. 2015), P. pseudolactucae in subclade 8b (Rahman et al. 2015), and P. prodigiosa (Puglisi et al. 2017) and P. pseudopolonica (Li et al. 2017) in subclade 9a. Likewise, some informally designated species also were not included: such as P. taxon humicola-like, P. taxon kwongan, and P. taxon rosacearum-like in subclade 6a (Jung et al. 2011). These and other emerging species are yet to be incorporated in the overall phylogeny of the genus.

The generation of over 1500 signature sequences from ex-types and authentic isolates in this study will aid researchers and first responders in correctly identifying Phytophthora cultures to the species level. DNA sequencing of selected genetic markers has become common practice in the identification of Phytophthora cultures (Kang et al. 2010). However, it is recognized that the accuracy of culture identity determined by this approach depends on the quality of the reference sequences used — and currently many sequence deposits are erroneously identified in public repositories, including GenBank (Kang et al. 2010). These errors originated in sequence deposits of cultures that were identified by morphological characters alone, and compounded by those identified through sequence matches to erroneous reference sequences or by single DNA markers (Kang et al. 2010). In this study, 29 isolates were found associated with an erroneous or modified identity (Supplementary Table 2). For instance, isolate 29B3 in clade 1 was identified as P. pseudotsugae and used as a key isolate for this species by Gallegly & Hong (2008). However, its sequences were distinct from those of the P. pseudotsugae ex-type (ATCC 52938). In the phylogenetic tree, it was basal to the cluster of P. cactorum and P. hedraiandra, thus its species identity was changed to P. aff. pseudotsugae (Fig. 1). In clade 2, isolate 26H4 was identified as P. citrophthora (Gallegly & Hong 2008) but sequences and phylogeny showed that it was close to but distinct from P. citrophthora isolates 03E5 and 26H3. It formed a cluster with isolate IMI 342898 (P10341), which was coded as P. sp. aff. colocasiae-1 by Martin et al. (2014). The identity of both isolates was then changed to P. aff. citrophthora (Fig. 2). Similarly, in clade 8, isolate 22G2 had been identified as P. cryptogea, although it was distinct from the P. cryptogea ex-type 61H9 (CBS 113.19). In the phylogenetic tree, it was basal to the cluster of P. cryptogea and P. erythroseptica, and the species identity was consequently changed to P. aff. cryptogea (Fig. 1). Changes in the identifications of these isolates, including the new and original names used, are indicated in Supplementary Table 2. The changes in the naming of these isolates highlights the importance of using signature sequences from ex-type or authentic isolates as references in future culture identification. In order to facilitate this practice, the signature sequences generated from ex-types or authentic isolates in the present study are marked as ‘(ex-type)’ or ‘(authentic)’, respectively, under the ‘isolate’ section in the ‘feature’ table of GenBank deposits. The research, diagnostic and regulatory communities are encouraged to use these sequences as references in future culture identification.

This study provided new insights into the evolutionary history of sporangial morphology in the genus Phytophthora, a subject that has fascinated generations of mycologists and plant pathologists. There have been three major hypotheses regarding the development of papillation, as illustrated in Fig. 7a, b, and c, respectively. First, papillate species were considered as descendants of Pythium-like, non-papillate ancestors and semi-papillation has been considered as intermediate between non-papillation and papillation (Blackwell 1949, Cooke et al. 2000, Erwin & Ribeiro 1996). Second, some semi-papillate species, exemplified by P. primulae in the group III of Waterhouse (1963) are primitive; they were suggested to have evolved to papillate and non-papillate species through two distinct evolutionary lines (Brasier 1983). Third, semi-papillate sporangia are morphological variants of papillate and non-papillate types (Cooke et al. 2000). Here we suggest that the non-papillate type is ancestral, and that non-papillate species could have evolved directly into either semi-papillate or papillate species (Fig. 7d). The evolution to semi-papillate species is exemplified by those in subclades 8b–d (Fig. 1), while evolution to papillate species is illustrated by P. boehmeriae and other papillate species in clade 10 (Fig. 1). The relationship between semi-papillate and papillate species appears to be more complicated (Fig. 7d). We also hypothesize that some semi-papillate species, such as those in subclade 1c, may have diverged from papillate ancestors, while some papillate species such as P. frigida may have evolved from semi-papillate ancestors of subclade 2d (Fig. 6).

Fig. 7
figure 7

Illustration of hypotheses on evolution of Phytophthora and associated changes in sporangial papillation: (a) species producing papillate sporangia evolved from non-papillate ancestors. Semi-papillation is considered as intermediate between non-papillation and papillation (Blackwell 1949, Cooke et al. 2000, Erwin & Ribeiro 1996); (b) some semi-papillate species, exemplified by P. primulae in the group III of Waterhouse (1963), are primitive and evolved to be non-papillate and papillate through two evolutionary paths, by Brasier (1983); (c) papillate species evolved from non-papillate ancestors. Semi-papillate species have been considered as morphological variants of papillate or non-papillate species, by Cooke et al. (2000); (d) a new hypothesis developed in this study that non-papillate ancestors evolved directly to either papillate or semi-papillate species. Some semi-papillate species further evolved to be papillate, or vice versa.

These new hypotheses are supported by the results from phylogeny and ancestral state reconstructions that suggest three major evolutionary paths in sporangial papillation of Phytophthora species (Fig. 1). First, the ancestor of modern species in clades 1–5 evolved to be papillate or semi-papillate (Figs 1, 6) while diverging from the common non-papillate ancestor of clade 7 species (Figs 1, 6). Second, the common ancestor of species in subclades 8b–d diverged from that of subclade 8a species while acquiring semi-papillation (Figs 1, 6). Third, the common ancestor of five clade 10 species in the main cluster including P. boehmeriae, P. gondwanensis, P. kernoviae, P. morindae, and P. sp. boehmeriae-like, acquired papillate sporangia while diverging from two non-papillate clade 10 species, P. gallica and P. intercalaris (Figs 1, 6). Besides these three major groups of papillate or semi-papillate species, a few species may have evolved to acquire semi-papillation independently, such as P. macrochlamydospora in clade 9 (Fig. 6). This evolutionary process may be underway for some other species including P. constricta, P. gibbosa, and P. mississippiae, which all produce both semi-papillate and non-papillate sporangia (Fig. 6). Furthermore, evolutionary reversion to partial production of non-papillate sporangia may have occurred in P. multivesiculata and P. lateralis in two semi-papillate subclades 2e and 8c, respectively (Fig. 6). However, that conclusion is uncertain due to limited and ambiguous data from species in these two subclades. Specifically, P. lateralis was ambiguously reported as non-papillate (Erwin & Ribeiro 1996, Gallegly & Hong 2008, Martin et al. 2012, Tucker & Milbrath 1942) or non- to semi-papillate (Kroon et al. 2012) in different studies. In subclade 2e, the only sister taxon of P. multivesiculata, P. taxon aquatilis, was provisionally described as semi-papillate, but only based on a single isolate (Hong et al. 2012). Evolutionary reversion in the sporangial papillation of these two species requires validation in the future. Also, more studies are warranted to analyze additional characters based on phylogenies with better clade-to-clade resolutions and provide a more comprehensive picture on the evolutionary history of Phytophthora species.

That a number of species were placed basal to other species in their respective clades in this expanded phylogeny presents a significant challenge to the monophyly of their respective clades and the current 10-clade system. First, P. stricta was initially placed close to other species in subclade 8a based on sequences of the cytochrome c oxidase 1 (cox1) gene, but was not grouped in any ITS clade (Yang et al. 2014a). This species was grouped in clade 8 in our expanded phylogeny by ML and BA analyses (Fig. 1); the monophyly of this clade was only moderately supported (61 %) in the MP analysis (Fig. 1). Second, the monophyly of clade 6 including P. asparagi was only moderately supported (68 %) in the MP analysis (Fig. 3). Third, although the inclusion of P. intercalaris in clade 10 was supported with maximum values, the exact positions of this species and P. gallica were still unresolved since the next node was only moderately supported (53 %) in the ML analysis and ambiguous in the MP analysis (Fig. 1). Fourth, similar to the finding of Blair et al. (2008), support for the monophyly of clade 4 including P. quercina and P. sp. ohioensis was only moderate (48/78/84). Also, similar ambiguity in the placement of the ‘P. quercinaP. sp. ohioensis’ cluster was observed among different phylogenetic approaches, and using different datasets including nuclear, mitochondrial, and combined nuclear and mitochondrial sequences (Martin et al. 2014). Fifth, this phylogeny confirmed the finding by Rahman et al. (2015) that P. lilii was not grouped in any clade of the current 10-clade system (Fig. 1). This species was not assigned as a distinct clade in our study, due to the relatively low clade-to-clade resolutions (Fig. 1). Further analyses are warranted to determine whether this unique species should be assigned as a new clade.

Although many branches in the expanded phylogeny have consistent maximum support in all three methods, some have only moderate to low or inconsistent support. These results highlight the challenges of correctly inferring the evolutionary separation of many closely related Phytophthora species, even when concatenated sequences from seven phylogenetic markers were used. It can be expected that as the cost of gene sequencing drops further, it will become possible to increase phylogenetic resolution among Phytophthora species by using concatenations of much larger numbers of genes. For example, Ye et al. (2016) used 293 concatenated housekeeping proteins to infer a robust phylogeny of seven fully sequenced Phytophthora species and confirmed that downy mildews (represented by three genome sequences) are nested within the genus Phytophthora, close to Phytophthora clade 4 (Ye et al. 2016). However, even with full genome sequences, ambiguity may not be completely resolved in cases where speciation has involved large populations of sexually reproducing individuals, for example, as a result of geographic separation. In these cases, there may be many sequence polymorphisms shared among separated species and these may confound the inference of a reliable phylogeny. Resolution of this level of ambiguity may require sequencing the whole genome of many isolates from the species of interest as well as using improved phylogenetic and coalescent methods.

With the number of described Phytophthora species increasing, recent studies have raised an important concern in the accurate detection of species boundaries using phylogenetic data (Jung & Burgess 2009, Pánek et al. 2016, Safaiefarahani et al. 2015). One example is the status of P. hedraiandra as a distinct species in subclade 1a (Pánek et al. 2016). As evidenced by the amplified fragment length polymorphism (AFLP) and phylogenetic analysis based on sequences of ITS, phenolic acid decarboxylase, and cox1 genes, a recent study concluded that P. hedraiandra was just one lineage of P. cactorum, while morphological data provided only limited information to delimitate these two species (Pánek et al. 2016). Also, phylogenetic analyses in this study indicated that P. cactorum and P. hedraiandra cluster with strong support (98/100/100), and P. aff. hedraiandra isolate 33F4 (previously identified as P. hedraiandra Supplementary Table 2), was clustered with P. cactorum (Fig. 1). Phylogenies based on nuclear sequences prior to this study also supported P. hedraiandra as closely related to P. cactorum (Blair et al. 2008, Martin et al. 2014). However, in the phylogenies based on concatenated sequences of four mitochondrial loci, and combined seven nuclear and four mitochondrial loci, P. hedraiandra was basal to the cluster of P. cactorum and P. pseudotsugae, and clustered with P. idaei, respectively (Martin et al. 2014). Phytophthora cactorum and P. hedraiandra also have very distinctive single-strand-conformation polymorphism patterns (Gallegly & Hong 2008). Apparently, more investigations are warranted to resolve the P. cactorum complex. Likewise, indistinct boundaries are present among species in other subclades, such as the ‘P. citricola complex’ or subclade 2c (Brazee et al. 2017, Jung & Burgess 2009), the ‘P. cryptogea complex’ in subclade 8a (Safaiefarahani et al. 2015, 2016) and cluster 9a1 in subclade 9a including P. hydropathica (Hong et al. 2010), P. parsiana (Mostowfizadeh-Ghalamfarsa et al. 2008), P. virginiana (Yang & Hong 2013) and other provisionally designated species. Accurately delimiting these closely related species within the genus remains an important task.

This expanded phylogeny has highlighted the importance and difficulty of accurately interpreting the position of hybrid Phytophthora species. As exemplified by P. xalni (Brasier et al. 2004, Husson et al. 2015), many hybrid species have been identified among emerging plant pathogens (Jung et al. 2017, Man in’t Veld et al. 2012, Nirenberg et al. 2009). Due to the presence of multiple alleles originated from parent species in their nuclear genes, phylogenetic analysis of these hybrids based on nuclear sequences alone may not produce a robust placement. As illustrated in this phylogeny, the placement of hybrid species may be ambiguous. Specifically, in subclade 6b, support values for the placement of P. ×stagnum and its closely related species, P. mississippiae, P. borealis, and P. sp. delaware were moderate in the ML and BA analyses and ambiguous in the MP analysis (Fig. 3). Similarly, in subclade 7a, the placement of P. xalni, P. xcambivora, P. xheterohybrida, and P. ×jncrassata’ cluster was not well resolved due to ambiguous placement in the MP analysis and moderate support values in the other two analyses (Fig. 4). Adding mitochondrial sequences into the phylogenetic analyses may be a solution to this problem. However, due to the uniparental inheritance of mitochondria, the hybrids and their maternal parents are inseparable by mitochondrial sequences and their placements could conflict with nuclear analyses (Martin et al. 2014).