Introduction

Two genera, Impatiens and Hydrocera, are the sole members of the family Balsaminaceae. Hydrocera is monotypic, whereas Impatiens is a large genus of more than 1000 species distributed mainly in the mountainous areas of the Old World tropics and subtropics (Janssens et al. 2006). Five biodiversity hotspots have been identified for this highly endemic genus: Southeast Asia, Southern India and Sri Lanka, tropical Africa, Madagascar, and the Eastern Himalayas (Yuan et al. 2004; Janssens et al. 2006). Several new species of this explosively speciating genus are described from these regions every year (Kuang et al. 2014; Gogoi and Borah 2014; Luo et al. 2015). India harbors more than 210 species of Impatiens, concentrated in two biodiversity hotspots: the Himalayas in the north and the Western Ghats in the south. Over half of these occur in the Western Ghats, and at least 103 species of Impatiens are endemic to the Western Ghats alone (Bhaskar 2012).

The molecular phylogeny of balsams based on ITS sequences (Yuan et al. 2004) proposed that extant Impatiens species are of Southeast Asian origin, from where the genus spread to other parts of the globe in several dispersal events. In contrast, the phylogeny of Janssens et al. (2006), based on chloroplast atpB-rbcL intergenic spacer sequences, suggested that Impatiens originated in South China, from which it colonized the nearby regions and afterwards dispersed to North America, India, Africa, the Southeast Asian peninsula, and the Himalayan area. Both of these published studies of the molecular phylogeny and biogeography of Balsaminaceae, inferred from ITS sequences (Yuan et al. 2004) and chloroplast atpB-rbcL spacer sequences (Janssens et al. 2006), contained only a few samples of Impatiens species from South India, leaving a gap in the existing phylogeny of balsams. Hence, this work is a novel attempt at a molecular phylogeny of Impatiens with representatives from six sections of balsams from South India.

Materials and methods

Representative samples from the different sections of Impatiens were collected from the Southern Western Ghats of India. The plants were authenticated, and voucher specimens were deposited in the Herbarium of St. Thomas College (Palai, Kerala, India). The details of the sample collection are summarized in Table 1.

Table 1 Species used in this study with location, voucher no., and GenBank accession no. of atpB-rbcL and trnL-F sequences

Total genomic DNA was extracted using the GenElute Plant Genomic DNA Miniprep Kit (Sigma Aldrich, St. Louis, USA). For PCR amplification, OrionX h-Taq PCR Smart Mix (Origin, India) was used. The primers used to amplify the chloroplast atpB-rbcL intergenic spacer were IMP-atpB (5′-ACATCTAGTACCGGACCAATGA-3′) and IMP-rbcL (5′-AACACCAGCTTTGAATCCAA-3′) (10 pM each) (Janssens et al. 2006), and those used for the trnL-F region were trnL-F c (5′-CGAAATCGGTAGACGCTACG-3′) and trnL-F f (5′-ATTTGAACTGGTGACACGAG-3′) (10 pM each) (Taberlet et al. 1991).
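As a quick sanity check on the four primers listed above, their length, GC content, and a rule-of-thumb melting temperature can be tabulated with a few lines of Python. This is only an illustrative sketch: the function names are ours, and the Wallace rule (2 °C per A/T, 4 °C per G/C) is a rough approximation that is not the basis of the cycling conditions used in this study, which followed the cited protocols.

```python
# Primer sequences copied from the text (5' -> 3').
PRIMERS = {
    "IMP-atpB": "ACATCTAGTACCGGACCAATGA",
    "IMP-rbcL": "AACACCAGCTTTGAATCCAA",
    "trnL-F c": "CGAAATCGGTAGACGCTACG",
    "trnL-F f": "ATTTGAACTGGTGACACGAG",
}


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = seq.upper()
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq):
    """Rule-of-thumb melting temperature: 2 degC per A/T plus 4 degC per G/C."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def primer_summary(primers=PRIMERS):
    """Map each primer name to (length, GC % rounded to 0.1, Wallace Tm)."""
    return {
        name: (len(seq), round(gc_percent(seq), 1), wallace_tm(seq))
        for name, seq in primers.items()
    }


if __name__ == "__main__":
    for name, (length, gc, tm) in primer_summary().items():
        print(f"{name}: {length} nt, GC {gc}%, approx. Tm {tm} degC")
```

The estimates are useful mainly for comparing the two primers of a pair; more accurate nearest-neighbor methods would give different absolute values.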

The temperature profile for amplification of the atpB-rbcL intergenic spacer region followed Janssens et al. (2006), and that for the trnL-F region followed Taberlet et al. (1991). Amplification reactions were carried out in an Agilent SureCycler 8800 (Agilent Technologies, USA) (ESM Figs. 1S, 2S). Amplicons (the atpB-rbcL amplicon of 900 bp and the trnL-F amplicon of 600–650 bp) were sequenced on an ABI cycle sequencer (Scigenome Labs Pvt. Ltd., Cochin, Kerala, India).

All sequences generated in this study were subjected to a BLAST search (NCBI) against the GenBank nucleotide database and submitted to GenBank (Table 1). I. omeiana was selected as the outgroup for phylogenetic analyses of Impatiens (Janssens et al. 2009). Sequences of Impatiens species from three diversity hotspots were obtained from GenBank (Table 2). The sequences were aligned and edited using the CLUSTALW program (Thompson et al. 1994) incorporated in BioEdit 7.0.5.2 (Hall 1999).

Table 2 Details of atpB-rbcL and trnL-F sequences of Impatiens spp. obtained from GenBank

The Akaike information criterion (AIC) implemented in jModelTest version 2.1.5 (Darriba et al. 2012) was used to choose the substitution model that best fit the data set. Bayesian inference analysis was carried out in MrBayes v.3.2.2 (Ronquist et al. 2012) in two independent runs, each with one heated chain and one cold chain, for 100,000 generations. Convergence was considered to have been reached when the standard deviation (SD) of split frequencies fell below 0.05; the first 25% of MCMC generations were discarded as burn-in, and a consensus phylogram was created. Posterior probability values were used to estimate branch support. Trees were visualized with FigTree (Tree Figure Drawing Tool) version 1.4.2 (Rambaut 2014).
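The run settings described above can be written down as a MrBayes command block. The following Python helper is a minimal sketch that assembles such a block; it is not the script used in this study. The outgroup label, the sampling frequency, and the function name are assumptions made for illustration, while the model (GTR + I + G, i.e., `nst=6 rates=invgamma`), the number of runs and chains, the number of generations, and the burn-in fraction follow the text.

```python
def mrbayes_block(outgroup, ngen=100_000, nruns=2, nchains=2,
                  burninfrac=0.25, samplefreq=100):
    """Return a MrBayes block (as text) for appending to a NEXUS file.

    outgroup   : taxon label exactly as it appears in the matrix (assumed)
    samplefreq : sampling interval; not stated in the text, assumed here
    """
    lines = [
        "begin mrbayes;",
        "  set autoclose=yes nowarn=yes;",
        f"  outgroup {outgroup};",
        # GTR substitution model with invariant sites and gamma rates
        "  lset nst=6 rates=invgamma;",
        # two independent runs, each with one cold and one heated chain
        f"  mcmc ngen={ngen} nruns={nruns} nchains={nchains} "
        f"samplefreq={samplefreq};",
        # discard the first 25% of samples as burn-in when summarizing
        f"  sump relburnin=yes burninfrac={burninfrac};",
        f"  sumt relburnin=yes burninfrac={burninfrac};",
        "end;",
    ]
    return "\n".join(lines)


if __name__ == "__main__":
    # "Impatiens_omeiana" is a hypothetical label for the outgroup taxon.
    print(mrbayes_block("Impatiens_omeiana"))
```

After the run, the average standard deviation of split frequencies reported by MrBayes would be compared against the 0.05 threshold before the consensus tree is interpreted.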

Results and discussion

The phylogenetic analysis in this study included two chloroplast regions (atpB-rbcL, trnL-F) from 30 sequences of South Indian Impatiens species. In addition, 13 sequences of each of these regions were obtained from the NCBI database. To assess the level of congruence between the two data sets, each was analyzed independently to see whether they produced similar topologies. The separate analyses produced topologies similar to each other. Compared with the separate analyses, the combined phylogeny had a better-resolved topology.
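Combining two separately aligned regions into one matrix amounts to joining, for each taxon, its aligned sequence from each region end to end. The sketch below illustrates this step in Python under stated assumptions: the function name is ours, the toy taxa and sequences are hypothetical, and taxa missing from one region are padded with `?`, a common convention that the text does not confirm was needed here.

```python
def concatenate(alignments, missing="?"):
    """Concatenate aligned regions taxon by taxon.

    alignments : list of dicts mapping taxon label -> aligned sequence;
                 all sequences within one dict must have equal length.
    Returns a dict mapping taxon label -> concatenated sequence.
    """
    taxa = sorted(set().union(*alignments))
    combined = {taxon: "" for taxon in taxa}
    for aln in alignments:
        lengths = {len(seq) for seq in aln.values()}
        if len(lengths) != 1:
            raise ValueError("sequences in an alignment must have equal length")
        width = lengths.pop()
        for taxon in taxa:
            # pad with the missing-data symbol if the taxon lacks this region
            combined[taxon] += aln.get(taxon, missing * width)
    return combined


if __name__ == "__main__":
    # Toy data only; labels and sequences are hypothetical.
    region_a = {"sp1": "ACG-T", "sp2": "ACGTT"}
    region_b = {"sp1": "GGA", "sp3": "GCA"}
    for taxon, seq in concatenate([region_a, region_b]).items():
        print(taxon, seq)
```

Every row of the resulting matrix has the same length, equal to the sum of the widths of the individual alignments.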

The combined atpB-rbcL and trnL-F data matrix contained 1664 characters. A general time reversible model of evolution with invariant sites and a gamma distribution (GTR + I + G) was selected using jModelTest version 2.1.5. This model was used for the Bayesian inference (BI) analysis. The tree resulting from the BI analysis had a well-resolved topology (Fig. 1). The resolved lineages of Impatiens species were grouped into four clades with strong Bayesian posterior probability (BPP) values. Two Southeast Asian species and the Himalayan species formed clade 1. Clade 2 included four Southeast Asian species. Two Southeast Asian species formed clade 3. Clade 4 was divided into two subclades, A and B (BPP of 1.00). Subclade A contained species of sections Racemosae, Sub-Umbellatae, Tomentosae, and Scapigerae (BPP of 1.00). Subclade B was divided into three subclades, B1, B2, and B3. An African species (I. hians) formed Subclade B1. Subclade B2 included African species (I. keilii and I. mannii) and South Indian species of section Microsepalae with a BPP of 0.89. Species of section Annuae formed Subclade B3 with a BPP of 0.94.

Fig. 1 Bayesian consensus cladogram based on combined sequences of chloroplast atpB-rbcL intergenic spacer and trnL-F genes. The numbers by the nodes indicate Bayesian posterior probabilities greater than 0.5

Implications on infrageneric classification and biogeography of Impatiens species of Western Ghats

Impatiens is considered one of the taxonomically most difficult genera of angiosperms, mainly because the hypervariable structure and fragile nature of its flowers make the examination of dried specimens extremely difficult (Grey-Wilson 1980). The important revision of the African taxa by Grey-Wilson (1980) distinguished six informal infrageneric groups among the African species for practical diagnosis. Based on morphological and molecular data sets, Yu et al. (2015) presented a new classification of Impatiens in which the genus is divided into two subgenera, subgenus Clavicarpa and subgenus Impatiens. The subgenus Impatiens was further subdivided into seven sections.

In the taxonomic treatment of South Indian Impatiens by Bhaskar (2012), the balsams of South India were classified under seven sections, i.e., Scapigerae, Epiphyticae, Annuae, Microsepalae, Tomentosae, Sub-Umbellatae, and Racemosae. In the present molecular phylogenetic study, the species of each sampled section formed a monophyletic group with strong BPP support. This study therefore supports the morphological classification of Bhaskar (2012).

Based on several morphological similarities among species endemic to Africa and South India, a close affinity between African and South Indian taxa and a possible migration route connecting these two areas were suggested (Grey-Wilson 1980). In this study, species of sections Microsepalae and Annuae showed African affinities with sister–clade relationships. This supports Grey-Wilson’s (1980) suggestion of an affinity between African and South Indian species. Sections Scapigerae, Sub-Umbellatae, Tomentosae, and Racemosae formed a separate clade (Subclade A) with sister–clade relationships with the extant Southeast Asian species.

There are several hypotheses on the origin of Impatiens (Jones and Smith 1966; Grey-Wilson 1980). Bhaskar (1981) proposed that the Western Ghats are the place of origin of the genus Impatiens. His hypothesis was based on the observation that the Western Ghats of India contain phylogenetically old species with primitive radial pollen grains, diploid chromosome number, and shrubby habit.

The ITS phylogeny of Yuan et al. (2004) and the atpB-rbcL phylogeny of Janssens et al. (2006) revealed that Impatiens spp. colonized the African continent from Southwest China in three independent dispersal events. The Madagascan species were derived from a single colonization event (Janssens et al. 2009). The present combined chloroplast analysis contained only three African and no Madagascan species. Of these African species, I. keilii and I. mannii were placed with species of section Microsepalae. Section Annuae formed a sister–clade to this section. The Himalayan species (I. scabrida) showed affinity to Southeast Asian species (I. aquatilis and I. uliginosa).

The biogeographical interpretation based on the present study is largely in accordance with the conclusions of Yuan et al. (2004). The present analysis suggests that South India was colonized in two independent dispersal events: once by a Southeast Asian ancestor, as shown by the sister–clade relationships of extant Southeast Asian species and the sections Scapigerae, Tomentosae, Sub-Umbellatae, and Racemosae, and more recently by an ancestor with African affinities (sections Microsepalae and Annuae).