Diversity and recombination analysis of Cotton leaf curl Multan virus: a highly emerging begomovirus in northern India
- 353 Downloads
Cotton leaf curl disease (CLCuD), caused by begomoviruses in association with satellite molecules, is a major threat to cotton production causing enormous losses to cotton crop in most of the cotton growing countries including Indian subcontinent. In this study, isolates of begomovirus and satellite molecules associated with CLCuD were collected from North India (Haryana, New Delhi). They were amplified employing rolling circle replication mechanism, cloned, sequenced and, their phylogenetic and recombination analysis was performed.
The five Cotton leaf curl Multan virus (CLCuMuV) isolates investigated in this study showed monopartite organization of the genome typical of Old World begomoviruses. Nucleotide sequence analyses assigned them as the strains of CLCuMuV and were designated as CLCuMuV-SR13, CLCuMuV-SR14, CLCuMuV-ND14, CLCuMuV-ND15 and CLCuMuV-SR15. The genome of CLCuMuV-SR13 shared a highest level of nucleotide sequence identity (98%) with CLCuMuV (JN678804), CLCuMuV-SR14 and CLCuMuV-SR15 exhibited 96% with CLCuMuV (KM096471), while isolates CLCuMuV-ND15 and CLCuMuV-SR15 revealed 96% sequence identity with CLCuMuV (AY765253). The four betasatellite molecules investigated in this study shared 95–99% nucleotide sequence identity with Cotton leaf curl Multan betasatellite (CLCuMB) from India. The betasatellite molecules were designated as CLCuMB-SR13, CLCuMB-SR14, CLCuMB-ND14 and CLCuMB-ND15. Alphasatellite molecules in this study, designated as GLCuA-SR14, GLCuA-ND14 and GLCuA-SR15, revealed 98% identity with Guar leaf curl alphasatellite (GLCuA) reported from Pakistan.
The phylogenetic and recombination studies concluded that the isolates of CLCuMuV genomes undertaken in this study have a potential recombinant origin. Remarkably, significant recombination was detected in almost all the genes with contribution of Cotton leaf curl Kokhran Virus (CLCuKoV) in IR, V1, V2, C1, C4 and C5 regions and of CLCuMuV in C2 region of CLCuMuV-SR14. CLCuKoV also donated in C2, C3 regions of CLCuMuV-ND14; V1, V2, C2 and C3 regions of CLCuMuV-ND15 and C1 of CLCuMuV-SR15. Altogether, these observations signify the uniqueness in Indian CLCuMuV isolates showing contribution of CLCuKoV in all the genes. An interesting observation was frequent identification of GLCuA in CLCuD leaf samples.
KeywordsCotton leaf curl Multan virus Cotton leaf curl disease Betasatellite Alphasatellite Recombination
Begomoviruses associated with cotton leaf curl disease
Cotton leaf curl Alabad virus
Cotton leaf curl Bangalore virus
Cotton leaf curl disease
Cotton leaf curl Gezira virus
Cotton leaf curl Kokhran Virus
Cotton leaf curl Multan betasatellite
Cotton leaf curl Multan Virus
Cotton leaf curl Rajasthan virus.
Guar leaf curl alphasatellite
Open Reading Frame
India is the largest cotton producing country in the world . Cotton (Gossypium hirsutum, family Malvaceae) is a shrub cultivated in tropical and subtropical countries for its soft, fluffy and staple fiber. In Mexico, wild cotton species show greatest diversity followed by Australia and Africa. G. hirsutum, an important cash crop, is prone to infection by several pathogens and insects. Among them begomoviruses associated with cotton leaf curl disease (CLCuD) cause severe losses to cotton crop [2, 3, 4].
CLCuD is induced by complexes of cotton infecting begomoviruses (family Geminiviridae) intimately associated with satellite molecules viz. betasatellite and alphasatellite. The family Geminiviridae comprises of viruses having single-stranded (ss), circular DNA genomes which are encapsidated in geminate particles [5, 6]. On the basis of genome organization, nucleotide sequence similarities, insect vector and host range, geminiviruses are presently grouped into nine genera viz. Becurtovirus, Begomovirus, Capulavirus, Curtovirus, Eragovirus, Grablovirus, Mastrevirus, Topocovirus and Turncurtovirus . Among them, Begomovirus, the largest and most important genus of the family Geminivirideae, presently comprises of > 200 species. They are transmitted by Bemisia tabaci (whitefly) insect vector and infect dicotyledonous plants [8, 9]. Five species of begomovirus complexes i.e., Cotton leaf curl Multan virus (CLCuMuV), Cotton leaf curl Bangalore virus (CLCuBaV), Cotton leaf curl Kokhran virus (CLCuKoV), Cotton leaf curl Gezira virus (CLCuGeV) and Cotton leaf curl Alabad virus (CLCuAlV) are presently associated with CLCuD . The CLCuD-affected cotton plants show characteristic symptoms of thickening and darkening of veins, curling of leaves (mostly upward), development of outgrowths (enations) on the lower side of leaves and stunted growth of plants. This disease spreads through whitefly in a persistent, circulative manner [9, 11].
In 1967, CLCuD was reported for the first time in Multan, Pakistan . Later in 1980, it unexpectedly decreased cotton productivity causing great concern to cotton growers and agricultural scientists. In 1992–1997, it again reduced the yield by 29% . In India CLCuD was first reported from Sri Ganganagar in 1993 . Recently, several variants of CLCuKoV-Bu strain have been reported from India and Pakistan, thus posing an alarming situation which may lead to another CLCuD outbreak in the subcontinent . The recombinant origin of begomoviruses significantly contributes to their increasing pathogenicity [16, 17, 18, 19, 20].The genome of begomoviruses associated with CLCuD (henceforth referred to as BAC) is monopartite consisting of a circular, single stranded (ss) DNA molecule (~ 2.7 kb). It is organized into 7 ORFs viz. C1, C2, C3, C4 and C5 in complementary sense and V1 and V2 in virion sense. The ORFs participate in replication (C1 and C3), transcription activation (C2) and packaging (V1 and V2). Though the exact function of C4 and C5 genes is still not known, C4 protein participates in suppression of RNA silencing mechanisms and C5 protein, which is not common, may be involved in replication of DNA but remains insignificant in viral infection . The opposing complementary sense and virion sense genes are separated by a conserved non-coding intergenic region (IR) which is called as the common region (CR), consisting of about 200 nts having a highly conserved nonanucleotide (TAATATTAC) sequence containing an origin of replication (ori) .
Monopartite begomoviruses have also been found to be associated with circular, ss satellite molecules-termed as betasatellites are ~ 1.35 kb (approximately half the size of the monopartite begomovirus DNA genome). They are required for induction of disease symptoms in their host plants [23, 24]. The replication, movement and encapsidation of betasatellites relies on helper virus. The function of betasatellites is governed by the βC1 protein, which functions as a pathogenicity determinant [25, 26].
Some begomovirus-betasatellites are also associated with alphasatellites . The size of alphasatellite is nearly half of the begomoviral genome (1.3–1.4 kb). They are able to replicate autonomously. The alphasatellite encodes a replication associated protein (Rep), which depends on helper begomovirus for its movement and whitefly-mediated transmission in plants . Alphasatellite regulates the virulence of begomovirus-betasatellite complex [28, 29]. Another distinct class of non-coding DNA satellite has been found to be associated with some begomoviruses termed as deltasatellite. The size of deltasatellite is about one quarter of the begomovirus genome (~ 700 nt) and its function is still not known .
The genome of begomovirus replicates via rolling circle replication mechanism. Geminiviruses can also replicate their dsDNA via recombination-dependent replication mechanism [31, 32, 33]. It is well established that recombination plays an important role in the establishment, emergence and evolution of new species of geminiviruses. The exchange of genetic material via recombination causes plant virus evolution . The network of relationships among various begomoviruses in the Indian subcontinent could also be explained through recombination .
In this study, complete genome of several isolates of CLCuMuV, associated betasatellites and alphasatellites were cloned and sequenced. Since the biodiversity of begomoviruses in North India is very rich, the recombination events in the viruses were investigated and analysed for their possible role in the evolution of virus. Infectious clones of CLCuMuV as well as CLCuMB were prepared and agroinoculated in Nicotiana benthamiana plants. Furthermore, phylogenetic analysis and potential recombination of isolates of CLCuMuV with reference to other begomoviruses were studied to understand the CLCuD-begomovirus complexity and evolution in the Indian subcontinent.
Confirmation of BAC complex
The samples taken in this study from Sirsa during the years 2013, 2014 and 2015 were designated as CLCuMuV-SR13, CLCuMuV-SR14, and CLCuMuV-SR15, respectively and from New Delhi during 2014 and 2015 were abbreviated as CLCuMuV-ND14 and CLCuMuV-ND15. PCR- based amplification of the CP gene confirmed the presence of beomoviral genome in CLCuD cotton samples (Additional file 1: Figure S1). Additionally, 1 μg of Rolling Circle Amplification (RCA) product obtained from CLCuD-infected leaves following digestion with restriction enzyme KpnI yielded a DNA fragment (~ 2.7 kb), thus confirming the presence of the begomoviral DNA in CLCuD-symptomatic leaf samples.
Nucleotide sequence analysis of genome of BAC and its relationship with other begomoviruses
PCR amplification using RCA product as the template yielded the amplicons of expected sizes (~ 1.2 kb employing primers F1For/F1Rev, and ~ 1.7 kb with primers F2For/F2Rev). The complete nucleotide sequences of five different BAC shared 96–98% nucleotide sequence identity with CLCuMuV, thus were regarded as the isolates of CLCuMuV based on recent species demarcation threshold of begomovirus . Hence, these isolates having genomes of 2757, 2748, 2751, 2751 and 2750 nts were designated as CLCuMuV-SR13, CLCuMuV-SR14, CLCuMuV-ND14, CLCuMuV-ND15 and CLCuMuV-SR15, respectively. Their DNA sequences were submitted to GenBank database with accession numbers KJ868820, KX951460, KX951461, KY561820 and KY888163, respectively. The genome of CLCuMuV-SR13 (KJ868820) isolate shared the highest identity (98%) with CLCuMuV (JN678804).while isolates CLCuMuV-SR14 (KX951460) and CLCuMuV-SR15 (KY888163) exhibited highest identity (96%) with CLCuMuV (KM096471) while CLCuMuV-ND14 (KX951461) and CLCuMuV-ND15 (KY561820) isolates shared highest identity (96%) with CLCuMuV (AY765253). Nucleotide sequences of all the isolates displayed typical begomoviral features with respect to size, organization and ORFs arrangement of the genome. Isolates from New Delhi (ND14 and ND15) possess genome of the same size (2751 nts), while in Sirsa isolates (SR14 and SR15) it was 2748 and 2750 nts, respectively. The difference of three nts was observed within the Rep region. The Sirsa isolate (SR13) showed a difference of 9–12 nts in the V2 region, while the lengths of V1, C2, C3 and C4 were similar in all the isolates of this study, a difference of 3–12 nts was observed in V2 and Rep regions (Additional file 2: Table S1).
The heat map representing the pairwise sequence alignment generated by Sequence Demarcation Tool version 1.2 (SDT, 58) showed that the genome of CLCuMuV isolates shared more than 91% of sequence identity with other relevant BAC sequences used in this analysis (Additional file 3: Figure S2 a-e) which is more than the cutoff value (91%) for demarcation of begomovirus species .
The amplification and sequencing of full-length (~ 2.7 kb) genome obtained using oligo primers (viz. Cot For./Cot Rev.) further confirmed the presence of CLCuMuV in all the RCA derived samples used in the present study.
Sequence comparison and phylogenetic analysis of satellite molecules
SDT analysis showed that in the present study CLCuMB shared > 90% identity with other relevant betasatellites used in SDT analysis (Additional file 6: Figure S3 a-d), while alphasatellites displayed > 92% identity with other alphasatellite sequences used in the analysis (Additional file 7: Figure S4 a-c).
Since recombination is the leading aspect in the evolution and emergence of geminiviruses [14, 38], we investigated the evidence of recombination in the nucleotide sequences of newly identified CLCuMuV isolates and associated satellites using RDP4 (Recombination detection program). The analysis was performed by comparing the present sequences of BAC with various sequences of begomoviruses retrieved from GenBank database. It displayed potential recombination events with distinct patterns in several full-length genomes of newly- identified CLCuMuV and associated betasatelllite and alphasatellite sequences as listed below:
Cotton leaf curl Multan virus
In isolates CLCuMuV-ND14(KX951461) and CLCuMuV-ND15 (KY561820), recombinant fragments signified CLCuKoV (HF549182) as a common parental sequence and differed only in length of their spanning regions. Interestingly, CLCuKoV (HF549182) contributed in the V1 and V2 regions of CLCuMuV-ND14 (KX951461) at nt position 474–860, and in V1, V2, C2 and C3 regions of CLCuMuV-SR14 (KX951460) at nt position 442–1207 (Fig. 4b, c).
In CLCuMuV-SR15 (KY888163), two different recombination events were detected depicting Cotton leaf curl Rajasthan Virus (CLCuRaV, KM065514) and CLCuKoV (HF549182) as minor parents (Fig. 4d). CLCuKoV (HF549182) contributed in C1 region (nt position 2450–2560), and a recombinant fragment of 13–76 nts involving IR and C5 was donated by CLCuRaV (KM05514) in the genome of CLCuMuV- SR15 (KY888163). Recombination events detected in CLCuMuV-ND14, CLCuMuV-ND15 and CLCuMuV-SR15 were supported by more than 3 methods in RDP.
Nucleotide sequences analysed in this study, including putative parental sequences of these isolates have shown significant recombination breakpoints in IR, V1, C1 and C2 regions which further supports the previous studies [34, 37, 39]. The RDP analysis has depicted that the putative parents of the present virus isolates, and also other viruses included during analysis, have evolved via recombination at some point of time. Thus, suggesting that the mechanism of recombination greatly affects the begomoviral evolution.
Cotton leaf curl Multan betasatellite
CLCuMB-ND14 (KX966003) displayed recombination events at two sites. One at nt position 955–1279 involving SCR and A-rich regions with CLCuMB (KM065438) as minor parent and CLCuMB (GQ369730) as major parent. The other recombination site spanned nt position 1280–954 involving SCR, βC1 and A-rich regions with CLCuMB (GQ369730) as minor parent and CLCuMB (JF502376) as major parent (Fig. 5c).
Infectivity of cloned DNA
The infectivity analysis of Cotton leaf curl Multan virus and Cotton leaf curl Multan betasatellite in Nicotiana benthamiana plants
Plant infected/Plant inoculated (Nicotiana benthamiana)
Days to infection
Downwards leaf curling, and vein thickening
pGR5.4 + pCAMβ
Downwards leaf curling, leaf swelling, vein thickening and stunting
Mock (pGR5.4 in E. coli)
The present study was taken up as it deemed desirable to understand the complexity of this disease. Cotton leaves showing CLCuD symptoms were taken from Haryana and New Delhi, India. RCA was performed for characterization and phylogenetic analysis of begomoviruses,, betasatellite and alphasatellite molecules associated with CLCuD from North India. Nucleotide sequences of all the BAC showed the typical size and organization of typical monopartite begomovirus [40, 41, 42, 43]. The isolates of the BAC in the present study exhibited a maximum nucleotide sequence identity with CLCuMuV sequences (JN678804, AY765253 and DQ191160) reported earlier from different parts of India . The sequence similarity was further supported by phylogenetic analysis. The clustering of the begomoviral isolates with CLCuMuV is consistent with nucleotide sequence alignments. Further, phylogenetic studies demonstrated that isolates from the same region (Sirsa, Haryana) collected in different years fell under two separate groups. The formation of separate clusters might be due to difference in the length of V2 region of CLCuMuV-SR13 as compared to other isolates identified in this study. Also, the nucleotide sequence of CLCuMuV-SR13 (KJ868820) showed highest similarity with CLCuMuV sequences from Pakistan (EU384573, AJ496461, AJ496287 and AJ002447), thus formed a group with Pakistani isolates. Other four newly identified sequenes of CLCuMuV-SR14 (KX951460), CLCuMuV-ND14 (KX951461), CLCuMuV-ND15 (KY561820) and CLCuMuV-SR15 (KY888163) shared maximum identity with CLCuMuV sequences from India (AY765253, AY765256 and DQ191160) and forming a group with Indian isolates. The CLCuMuV isolates in this study also exhibited highest sequence identity amongst themselves indicating the presence of same strain according to recent begomoviral demarcation threshold . This further demonstrates that CLCuD in northern Indian regions is predominantly induced by CLCuMuV . Interestingly, despite high sequence similarity, there exists diversity among genome organization of these isolates. For instance, the genomes of CLCuMuV-SR14 and CLCuMuV-SR15 possess 7 ORFs, whereas those of CLCuMuV-SR13, CLCuMuV-ND14 and CLCuMuV-ND15 contain 6 ORFs. Remarkably, C5 gene rarely reported in CLCuMuV genome, was identified in CLCuMuV-SR14 and CLCuMuV-SR15.
Betasatellite sequences analysed in this study showed typical genome organization having a single ORF on complementary-sense strand, SCR (conserved amongst all the betasatellites) and a sequence rich in adenine (A-rich region) [40, 45]. These sequences shared a maximum identity with CLCuMB isolates from India and were supported by their clustering pattern in phylogenetic analysis. However, CLCuMB sequences, obtained from two different regions (Sirsa and New Delhi) appeared to form separate clusters. The clustering pattern of CLCuMB sequences from Sirsa seemed to have same ancestral origin but have probably evolved separately. These results depicted the monophyletic nature of CLCuMB which is consistent with the previously reported (pre-2013) CLCuMB sequences from India and Pakistan, contrasting the recently reported (post-2013) CLCuMB sequences from India and Pakistan that form groups regardless of their geographical origin [37, 44, 46].
Alphasatellite sequences obtained in this study also showed a typical pattern consisting of a single virion sense Rep gene and an A-rich region . Notably, none of the alphasatellite reported in the current study shared significant sequence similarity with cotton leaf curl alphasatellite, rather they exhibited a maximum similarity to GLCuA (HE599397, HE599396, HG934789,…etc) from Pakistan. This encouraged us to conclude that the alphasatellite associated with CLCuD isolates of the present study is actually GLCuA. The frequent identification of GLCuA in the present study from CLCuD-infected cotton plants, however, contrasts with the recent detection of CLCuD associated alphasatellites (ToLCA and OkLCuA) by Datta et al., 2017 . Thus, our results revealed a mixed infection and different natural combinations of CLCuMuV and its associated satellites CLCuMB and GLCuA in cotton plants from India. Phylogenetic analysis supported these results by placing the GLCuA sequences on a separate branch closely related to GLCuA from Pakistan (HE599397, HE599396, HG934789,…etc). The occurrence of GLCuA might be due to migration of whitefly vector from the neighboring country Pakistan as suggested by Sahu et al., 2018 .
The identification of CLCuMuV in CLCuD cotton plants in North India shows a correlation with the recent literature which documents the rebound of CLCuMuV in Punjab (India) and Vehari in Pakistan [44, 46]. Thus, our observations from North India together with the recent studies from the Punjab regions of India and Pakistan, suggest the recurrence of CLCuMuV in the Indian subcontinent and shows its predominance and frequent emergence in North Indian regions which might cause shift in previous epedemics of the recombination-prone CLCuKoV to this subcontinent .
Various factors such as mixed infections, high level of viral replication and increased vector host range could contribute significantly to recombination . The evolution and biodiversity of begomoviruses and also other viruses is mainly attributed to recombination . Our analysis revealed a potential recombination origin of CLCuMuV isolates studied in this article. In previous studies IR, V1, V2 and C1 regions were shown to be the recombination hotspots in begomovirus genome [24, 34, 36, 39, 46, 49, 50, 51]. In the present study, recombination events were however identified in all the CLCuMuV genes and correlate well with the recent analysis . In spite of similarities, earlier findings differ from this study in several aspects . For instance, recombinant fragments of CLCuMuV were contributed by CLCuKoV and CLCuAlV (as major parents) and CLCuMuV (as minor parent). Inaddition, almost all of the recombinant fragments of CLCuMuV were contributed by CLCuMuV (as major parent) and CLCuKoV (as minor parent) and no evidence of recombination from CLCuAlV was observed in the present study. Further, in previous study CLCuKoV was shown to contribute only the Rep, CR and ori to CLCuMuV . However, in the present study CLCuKoV showed contribution in genes of both complementary sense and virion sense of the CLCuMuV genome. For instance, in CLCuMuV-SR14, CLCuKoV devoted the IR, C1, C4, C5, V1 and V2 regions. Inspite of that, C1, C4, V1, V2 and C5 regions of this isolate (SR14) were also shared by CLCuMuV. Further, C3 and C2 regions were contributed by isolates CLCuMuV (JN678804) and CLCuMuV (EU384574), respectively. In CLCuMuV-ND14, CLCuKoV donated only in the V1 and V2 regions and not the C1Rep. In CLCuMuV-ND15, CLCuKoV contributed the V1, V2, C2 and C3 genes. Though CLCuMuV isolates from Pakistan having short stretches of C1, C2 and C3 were reported to be derived from CLCuKoV, in the Indian isolates contribution of CLCuKoV was previously detected only in the IR and Rep regions . These observations suggest that the emergence of several new BAC in north Indian regions occurred because of recombination between two distinct species viz. CLCuMuV and CLCuKoV.
In our study nucleotide sequences of CLCuMB showed recombination in the βC1, SCR and A-rich regions, thus supporting the previous reports from Pakistan and India [24, 37, 44, 48]. And, in GLCuA, recombination was detected in Rep and A-rich regions which is compatible with earlier studies suggesting these regions as the hotspots of recombination .
In conclusion, this study revealed that CLCuMuV has been evolving continuously and the evolution of recombinant begomovirus isolates reported here might be derived via exchange of genetic material from related begomoviruses viz., CLCuMuV, CLCuKoV and CLCuRaV. Our results clearly demonstrated the complex pattern of inter-species and intra-species recombination leading to significant structural changes in DNA components of CLCuMuV. It thus resulted in the genetic variability and emergence of new variants. This genetic diversification of CLCuMuV may lead to diverse host range (enhancing threat to other crops), transmission ability and capability of breaking resistance, hence posing a big challenge for disease management. This study encourages further analysis about molecular epidemiology and genetic evolution of geminivirus. Timely diagnosis and application of control strategies need to be done before this disease complex spreads over a large scale and causes severe threat to economy. This molecular information may provide insights about the genome complexities, sequence identity and genetic variability parameters viz., recombination analysis of CLCuMuV genome, its associated satellite DNAs (CLCuMB and GLCuA) which are indispensable for developing virus-based disease management strategies. This may accelerate the development of CLCuD-resistant plants. Pathogen-derived resistance manipulating CLCuMuV genes has already shown a great potential to generate virus-resistant plants.
Sample collection and detection of begomoviruses associated with CLCuD (BAC)
Leaves were collected from cotton plants exhibiting CLCuD symptoms at the experimental fields of Central Institute for Cotton Research (CICR Regional Station, 29.54°N 75.04°E), Sirsa, Haryana (20 samples) and from JMI (28.56°N 77.28°E), New Delhi (5 samples) between 2013 and 2015 every year during the months of October. The total genomic DNA from the diseased leaves was isolated employing DNAeasy plant mini kit (QIAGEN, Germany). It was subjected to RCA to amplify full-length genomes of BAC with the help of templiphi DNA amplification kit (GE Healthcare, USA).The concentration of DNA was estimated by Biophotometer plus (Eppendorf, Germany). Presence of the BAC was confirmed following PCR-based amplification of the CP gene using RCA product as template employing oligo primers viz. P1 (5′GGGATTTGATTTCAGTAATAAGG 3′) and P2 (5´ GAGCATGTTGTATATGTAGACCA 3´) specific for the CP gene of BAC .
Amplification, cloning and nucleotide sequencing of the genomes of BAC
PCR-based amplification of entire genomes of BAC was performed using RCA product as template employing overlapping begomovirus-specific overlapping primer pairs viz. F1For/F1Rev and F2For/F2Rev . The desired amplicons (1.2 kb and 1.7 kb) from the gels were eluted and purified employing gel extraction Kit (QIAGEN, Germany) and subsequently cloned in PCR cloning vector pGEMT (pGEM-T easy vector systems, Promega, USA). The authenticity of the clones was confirmed following restriction digestion, PCR amplification and sequencing in both directions by automated sequencer (ABI 3730 XL) at Xcelris Genomics (Ahmedabad, India). Sequence data derived from all the clones was assembled and combined to acquire the full- length genome sequence of BAC. Based on sequences obtained from the PCR amplicons of BAC, a primer pair (viz. Cot For. – 5’GGTACCTGAGTACAGGGGTCTATC 3’/Cot Rev. – 5’AAGCTTAATCAAAGTACAGCACAGGG 3′) was designed and employed to amplify and clone full-length genome of CLCuMuV, which were subjected to nucleotide sequence determination. The sequences obtained from the full-length CLCuMuV genome were compared with those of of BAC clones obtained through the overlapping primer pairs in PCR. Furthermore, the primer pair (viz. Cot For/ Cot Rev) was also used for specific detection of full-length BAC in all the samples investigated.
Cloning and nucleotide sequencing of satellite molecules (betasatellites/alphasatellites) associated with CLCuD
Amplification of satellite molecules was done using RCA product as the template employing oligo primers specific for betasatellite  and alphasatellite . The PCR products were resolved on agarose gel, purified with the help of Gel extraction Kit (QIAGEN, Germany) and cloned into pDrive (QIAGEN, Germany) and pGEM-T easy vector (Promega, USA). Nucleotide sequences of the clones representing betasatellite/alphasatellite were determined using automated sequencer (Xcelris Genomics, Ahmedabad, India).
Sequence comparison and phylogenetic analysis
Nucleotide sequences derived from different clones representing BAC isolates were assembled to construct complete genome. Sequences of the complete BAC genomes and their associated satellite molecules (this study) were compared with those available in NCBI GenBank database following BLASTn (https://blast.ncbi.nlm.nih.gov) program . A total of 54 full-length sequences of genomes of BAC, 44 betasatellite and 53 alphasatellite sequences originating from the Indian subcontinent and depicting high homology with the sequences determined under study, were selected and retrieved from GenBank (details of sequences used in this analysis are provided in Additional file 8: Table S4; Additional file 9: Table S5; Additional file 10: Table S6). Phylogenetic analysis following multiple sequence alignments was performed using MEGA-7.0 . Default parameter of one character-based and two distance-based algorithms were included. A consensus dendrogram was finally drawn for these algorithms employing bootstrap value of 1000 replicates. Pairwise sequence alignment was performed using SDT .
To investigate the extent of recombination, putative breakpoints and parents of CLCuMuV isolates, RDP-4 was employed . The full-length sequences of BAC, betasatellite and alphasatellite were used for recombination analysis (Additional file 11: Table S7 a-c). Nucleotide sequences were aligned using CLUSTAL-W in MEGA 7 . The resultant alignments were exported to the RDP4 and subjected to recombination analysis. Recombination analysis having cutoff of 0.05 was used as a P-value and the default settings was performed in RDP4 programme employing RDP, BOOTSCAN, GENCONV, SISCAN, MAXCHI, CHIMAERA and 3SEQ to estimate the recombination events. Recombination analysis were considered true, if they were supported by at least 3 methods.
Development of CLCuMuV/CLCuMB-derived infectious clones and agroinoculation into Nicotiana benthamiana plants
Infectious clones of the CLCuMuV genome (isolate CLCuMuV-SR13) and associated CLCuMB (CLCuMB-SR13) were cloned in pGreen0029 (pGR5.4) and pCAMBIA1302 (pCAMβ) binary vectors respectively, in accordance with Pratap et al., 2011 . The KpnI based partially digested RCA product generated a DNA fragment of ~ 2.7 kb which was cloned into pGreen0029 vector (pGR2.7I) following its nucleotide sequence determination (Xcelris Genomics, Ahmedabad, India). The RCA product obtained from CLCuD-infected cotton leaves was subjected to PCR amplification using oligo primers (viz. Cot For/Cot Rev. containing KpnI and HindIII restriction sites in Forward and Reverse primers, respectively) as described in above section. PCR amplicon (~ 2.7 kb), thus obtained, was cloned into pGEM-T Easy vector (pGEM-T-2.7), digested with KpnI and HindIII restriction enzymes (Fermentas, USA) and cloned in pGreen0029 vector (pGR2.7II). The DNA insert from the pGR2.7I vector was released following digestion with KpnI restriction enzyme and subsequently cloned in binary vector pGR2.7II. The complete head to tail dimer of CLCuMuV genome was cloned in pGreen0029 vector pGR5.4. The integration and orientation of dimeric clone of CLCuMuV genome was verified after restriction digestion with NcoI.
Similarly, full-length CLCuMB sequence (CLCuMB-SR13), which was previously cloned in pDrive (pDriveβ) vector, was subjected to restriction digestion by EcoRI and BglII. The digested DNA fragment (~ 500 bp) having hairpin loop, thus generated, was cloned into pCAMBIA1302 binary vector (pCAM0.5β). The complete CLCuMB sequence cloned into pDriveβ was released following digestion with EcoRI and cloned into pCAM0.5β vector (pCAMβ). Cloning and orientation of the partial dimer was confirmed by restriction digestion. The infectious clone viz. pGR5.4 (along with helper plasmid pSoup) and binary vector (pCAMβ) were mobilized in Agrobacterium tumefaciens strain LBA4404 by freeze thaw method . The mobilization of pGR5.4 and pCAMβ was confirmed by colony PCR employing primers specific for BAC (viz. Cot For/Cot Rev) and betasatellite (β01/β02) respectively, as described in above section. The agroinfiltration in N. benthamiana plants was done as described by Khan et al., 2015 .
First author is thankful to UGC for financial support as NON-NET fellowship for pursuing Ph.D.
No funding was obtained in this study.
Availability of data and materials
All the data analyzed or generated during this study are included in this published article.
RQ: performed the experiments viz., cloning, sequencing, phylogenetic and recombination analysis and wrote and revised manuscript; ZAK: performed experiements viz. preparation of infectious clones of CLCuMuV − 13 and CLCuMB-13, and infectivity testing in tobacco and wrote the corresponding part; DM: provided CLCuD samples, helpful discussion; JAK: designed and guided the study, revised and edited the draft of manuscript. All authors have read and approved the manuscript.
Ethics approval and consent to participate
The article does not contain any studies with human participants or animals performed by any of the authors.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
- 1.Barwale R. A perspective on cotton crop in India – opportunities and challenges. Cotton Statistics News. 2016;10:1–3.Google Scholar
- 5.Gutierrez C. Replication and gene expression of DNA viruses. In: Khan JA, editors DJ, editors. Handbook of plant virology. New York: Food Products; 2006. p. 75–89.Google Scholar
- 12.Hussain T, Ali M. A review of cotton diseases in Pakistan. Pak Cott. 1975;19:71–86.Google Scholar
- 22.Ashraf MA, Shahid AA, Mohamed BB, Dahab AA, Bajwa KS, Rao AQ, Khan MAU, Ilyas M, Haider MS, Husnain T. Molecular characterization and phylogenetic analysis of a variant of highly infectious cotton leaf curl Burewala virus associated with CLCuD from Pakistan. Aust J Crop Sci. 2013;7(8):1113–22.Google Scholar
- 30.Lozano G, Trenado HP, Fiallo-Olivé E, Chirinos D, Geraud-Pouey F, Briddon RW, Navas-Castillo J. Characterization of non-coding DNA satellites associated with sweepoviruses (genus Begomovirus, Geminiviridae) – definition of a distinct class of begomovirus-associated satellites. Front Microbiol. 2016;7:162.CrossRefGoogle Scholar
- 34.Silva FN, Lima ATM, Rocha CS, Castillo-Urquiza GP, Alves-Júnior M, Zerbini FM. Recombination and pseudorecombination driving the evolution of the begomoviruses Tomato severe rugose virus (ToSRV) and Tomato rugose mosaic virus (ToRMV): two recombinant DNA-A components sharing the same DNA-B. Virol J. 2014;11:66.CrossRefGoogle Scholar
- 46.Zubair M, Zaidi SS, Shakir S, Farooq M, Amin I, Scheffler JA, Scheffler BE, Mansoor S. Multiple begomoviruses found associated with cotton leaf curl disease in Pakistan in early 1990 are back in cultivated cotton. Sci Rep. 2017;7:680.Google Scholar
- 55.Kumar J, Kumar A, Gunapati S, Khan JA. Detection and characterization of begomovirus-associated alpha and beta satellites from infected tomato plants with leaf curl symptoms. J Plant Pathol. 2010;92(2):544.Google Scholar
- 59.Martin DP, Murrell B, Golden M, Khoosal A, Muhire B. RDP4: detection and analysis of recombination patterns in virus genomes. Virus Evol. 2015;1:–5.Google Scholar
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.