Complete Genome Sequence of Pantoea ananatis Strain NN08200, an Endophytic Bacterium Isolated from Sugarcane

Strain NN08200 was isolated from the surface-sterilized stem of sugarcane grown in Guangxi Province, China. The strain is a Gram-negative, facultatively anaerobic, non-spore-forming bacterium. Phylogenetic analysis based on complete-genome SNPs indicates that NN08200 is a member of the species Pantoea ananatis. Here, we summarize the features of strain NN08200 and describe its complete genome. The genome comprises one chromosome and two plasmids, totaling 5,176,640 nucleotides with 54.76% GC content. The chromosome contains 4598 protein-coding genes and 135 ncRNA genes (22 rRNA genes, 78 tRNA genes, and 35 sRNA genes); plasmid 1 contains 149 protein-coding genes and plasmid 2 contains 308 protein-coding genes. We identified 130 tandem repeats, 101 transposon genes, and 16 predicted genomic islands on the chromosome. We also found a gene encoding indole pyruvate decarboxylase, which is involved in the biosynthesis of the plant hormone indole-3-acetic acid; this may explain why strain NN08200 promotes the growth of sugarcane. Considering the pathogenic potential and versatility of species in the genus Pantoea, the genome information of strain NN08200 offers an opportunity to determine the genetic basis of interactions between endophytic enterobacteria and plants.


Introduction
The genus Pantoea comprises several species that are associated with plants, either as pathogens or as beneficial bacteria [1,2]. Some of the first identified members of Pantoea were plant pathogens, but many subsequent studies indicated that Pantoea strains exist in a multitude of environments and that most of them are beneficial for bioremediation and plant growth [3][4][5]. Many Pantoea strains have been isolated from plants, soil, and other environments and are currently being explored for agricultural applications [6,7]. Approximately 20 Pantoea species with diverse characteristics have been identified [8]. The ubiquity, versatility, and genetic tractability of Pantoea make it ideal for exploring niche-specific adaptation and opportunism, and for the development of agricultural and environmental products [9,10].
To obtain endophytes that promote the growth of host sugarcane plants and have potential for agricultural application, we set out to isolate and identify endophytic bacteria associated with sugarcane plants grown in Guangxi Province, the major sugarcane- and sugar-producing area of China. Bacterial strain NN08200 was isolated from surface-sterilized stems of a ROC22 sugarcane plant grown in Nanning, Guangxi, China. We previously determined the plant growth-promoting potential of strain NN08200 on sugarcane under greenhouse conditions [11]. Moreover, we observed colonization by strain NN08200 of the roots and aerial parts of micropropagated sugarcane plantlets using fluorescence microscopy and confocal microscopy. Sequencing and phylogenetic analysis of the 16S rRNA gene indicated that strain NN08200 is affiliated with the genus Pantoea, and the strain was deposited in the China General Microbiological Culture Collection Center under the preservation number CGMCC No. 5438. Here, we present a summary of the features of strain NN08200 and its complete genome sequence, which provides a reference for resolving the phylogeny and taxonomy of closely related strains and genetic information for studying the plant growth-promoting potential and plant-associated lifestyle of strain NN08200.

Quan Zeng and GuoYing Shi contributed equally to this work.
* ChunJin Hu chunjin-hu@126.com
1 Microbiology Research Institute, Guangxi Academy of Agricultural Sciences, Nanning 530007, People's Republic of China

Classification and General Features
Strain NN08200 is a Gram-negative, non-spore-forming, motile rod with peritrichous flagella (Fig. 1). The bacterium was able to grow anaerobically in cooked meat medium sealed with heat-melted vaseline and aerobically in beef extract medium, and it grew optimally between 28 and 32 °C (Table 1). It forms circular, convex, smooth colonies on nutrient agar; in addition, it grows well on Ashby nitrogen-free medium, forming round, transparent colonies. Strain NN08200 belongs to the genus Pantoea but shows several differences from the Pantoea species described so far. The strain is an endophyte from sugarcane. It is positive for indole production, nitrate reduction, and arginine decarboxylase and lysine decarboxylase activity.

Genome Sequencing Information

Genome Project History
Pantoea ananatis strain NN08200 was selected for sequencing based on its taxonomic significance and its potential use in promoting plant growth. The genome sequence is deposited in GenBank under the accession number CP035034. The genome sequencing information and its compliance with MIGS version 2.0 are summarized in Table 2.

Growth Conditions and DNA Isolation
P. ananatis strain NN08200 was grown in liquid Luria-Bertani medium at 28 °C until stationary phase. Genomic DNA was extracted using a TIANamp bacterial DNA kit (Tiangen Biotech, Beijing, China). The quantity and quality of DNA were assessed using a NanoDrop spectrophotometer (Thermo Scientific, USA).

Genome Sequencing and Assembly
Genomic DNA of P. ananatis strain NN08200 was used to construct a 10-kb SMRTbell library, which was sequenced on the PacBio RS II system. Low-quality reads were filtered with SMRT Portal (version 2.3.0), and the filtered reads were assembled into five contigs containing 5,176,640 bases [21,22]. The final assembly provided an average of 166-fold coverage. The five contigs were scaffolded into three circular sequences. The fully assembled P. ananatis strain NN08200 genome is composed of a 4.7-Mb chromosome and two plasmids of approximately 125 kb and 307 kb, respectively.
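As a rough illustration of what the reported coverage implies, the following sketch back-calculates the approximate number of read bases from the assembly size and the average coverage. It assumes that coverage is defined as total filtered read bases divided by assembly length; the read yield itself is not reported in the text, and the function name is hypothetical.

```python
def sequenced_bases(assembly_bp: int, mean_coverage: float) -> float:
    """Estimate total read bases, assuming coverage = read bases / assembly length."""
    return assembly_bp * mean_coverage


# Values reported in the text.
ASSEMBLY_BP = 5_176_640
MEAN_COVERAGE = 166

# Back-of-the-envelope estimate, not a reported figure.
estimated_read_bases = sequenced_bases(ASSEMBLY_BP, MEAN_COVERAGE)
print(f"Estimated read bases: {estimated_read_bases / 1e6:.0f} Mb")
```

The estimate is only as precise as the rounded coverage value, so it should be read as an order-of-magnitude check.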

Genome Properties
The genome of strain NN08200 contains a single chromosome of 4,743,568 nucleotides with 53.8% G+C content and two plasmids, one of 125,402 nucleotides with 56.47% G+C content and another of 307,670 nucleotides with 52.17% G+C content. The chromosome contains 4733 predicted genes: 4598 protein-coding genes and 135 RNA genes, including 78 tRNA genes, 35 sRNA genes, and 22 rRNA genes (Table 3; Fig. 3). Plasmid 1 contains 149 protein-coding genes and plasmid 2 contains 308 protein-coding genes. Circos was used to visualize the genome and the results of gene function annotation [29]. In total, 4369 genes were assigned to Clusters of Orthologous Groups of proteins (COG) functional categories; they are listed in Table 4.
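The replicon sizes and chromosomal gene counts above can be cross-checked with simple arithmetic. The sketch below uses only figures stated in the text; the variable names are illustrative.

```python
# Replicon sizes in nucleotides, as reported in the text.
replicon_bp = {
    "chromosome": 4_743_568,
    "plasmid 1": 125_402,
    "plasmid 2": 307_670,
}

# Chromosomal gene counts, as reported in the text.
protein_coding = 4598
rna_genes = {"tRNA": 78, "sRNA": 35, "rRNA": 22}

total_bp = sum(replicon_bp.values())        # total genome size
total_rna = sum(rna_genes.values())         # all RNA genes on the chromosome
total_genes = protein_coding + total_rna    # all predicted chromosomal genes
coding_pct = round(100 * protein_coding / total_genes, 2)

print(f"Total genome size: {total_bp:,} nt")
print(f"RNA genes: {total_rna}; predicted genes: {total_genes}")
print(f"Protein-coding share: {coding_pct}%")
```

The sums reproduce the reported totals of 5,176,640 nucleotides, 135 RNA genes, and 4733 predicted chromosomal genes.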

Insights from the Genome
Here we present the complete genome sequence of Pantoea ananatis strain NN08200. Protein-coding sequences accounted for 4598 (97.15%) of the 4733 genes identified on the chromosome. Fifty-four complete genomes of P. ananatis were downloaded from NCBI to perform an average nucleotide identity (ANI) analysis with strain NN08200 [30]. The results supported the conclusion of the phylogenetic analysis: comparisons of strain NN08200 with the other strains yielded high ANI values (> 95%).
These results suggest that strain NN08200 belongs to P. ananatis. NN08200 and P. ananatis S8 had the highest ANI (99.2%), indicating that NN08200 is more similar to S8 than to the other strains.
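The ANI-based reasoning can be sketched as a threshold check followed by a search for the closest reference. In the sketch below, only the 95% cutoff and the 99.2% value for strain S8 come from the text; the other two entries are hypothetical placeholders, not reported results, and the function names are illustrative. The sketch does not compute ANI itself.

```python
SPECIES_ANI_THRESHOLD = 95.0  # cutoff used in the text (ANI > 95%)


def same_species(ani_percent: float, threshold: float = SPECIES_ANI_THRESHOLD) -> bool:
    """Return True if a pairwise ANI value exceeds the species-level cutoff."""
    return ani_percent > threshold


def closest_reference(ani_by_strain: dict[str, float]) -> tuple[str, float]:
    """Return the (strain, ANI) pair with the highest ANI to the query genome."""
    return max(ani_by_strain.items(), key=lambda item: item[1])


# ANI of each reference genome against NN08200.
ani_to_nn08200 = {
    "S8": 99.2,                  # reported in the text
    "hypothetical_ref_A": 97.5,  # placeholder value, NOT a reported result
    "hypothetical_ref_B": 96.1,  # placeholder value, NOT a reported result
}

all_conspecific = all(same_species(v) for v in ani_to_nn08200.values())
best_strain, best_ani = closest_reference(ani_to_nn08200)
print(all_conspecific, best_strain, best_ani)
```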

Conclusion
In this study, we present the complete genome sequence of Pantoea ananatis strain NN08200, an endophyte from sugarcane. The genome of P. ananatis NN08200 consists of a 4,743,568-bp chromosome containing 4598 protein-coding genes, together with two plasmids. By analyzing the complete genome sequence, we found a gene encoding indole pyruvate decarboxylase, which is involved in the biosynthesis of the plant hormone indole-3-acetic acid [31]; the strain may therefore promote plant growth through the synthesis of indole-3-acetic acid. The new genomic data will facilitate future applications of this strain in agricultural production.