Genome-Scale Identification of Cell-Wall-Related Genes in Switchgrass through Comparative Genomics and Computational Analyses of Transcriptomic Data
Large numbers of plant cell-wall (CW)-related genes have been identified or predicted in several plant genomes, such as Arabidopsis thaliana, Oryza sativa (rice), and Zea mays (maize), as a result of intensive studies of these organisms over the past two decades. However, no such gene list has been compiled for switchgrass (Panicum virgatum), a key bioenergy crop. Here, we present a computational study for the prediction of CW genes in switchgrass using a two-step procedure: (i) homology mapping of all annotated CW genes in the aforementioned species to switchgrass, giving rise to a total of 991 genes, and (ii) prediction of candidate CW genes based on switchgrass genes co-expressed with the 991 genes under a large number of experimental conditions. Specifically, our co-expression analyses using the 991 genes as seeds led to the identification of 104 large clusters of co-expressed genes, each referred to as a co-expression module (CEM), covering 830 of the 991 genes plus 823 additional genes that are strongly co-expressed with some of the 104 CEMs. These 1653 genes represent our prediction of CW genes in switchgrass, 112 of which are homologous to predicted CW genes in Arabidopsis. We conduct functional inference on these genes to derive the possible functional relations among them. Overall, these data may offer a highly useful information source for cell-wall biologists working on switchgrass as well as on plants in general.
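The seed-based co-expression step (ii) can be illustrated with a minimal sketch. It is not the authors' pipeline: the abstract does not state the correlation measure, cutoff, or clustering method, so the Pearson correlation, the `threshold` and `min_partners` parameters, the function names, and the toy gene names and expression values below are all assumptions made for illustration.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length expression profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        return 0.0  # a flat profile is treated as uncorrelated
    return cov / (sd_x * sd_y)


def expand_seeds(expression, seeds, threshold=0.9, min_partners=2):
    """Return non-seed genes correlated (r >= threshold) with at least
    `min_partners` seed genes, mapped to the sorted list of those seeds.

    `threshold` and `min_partners` are hypothetical cutoffs, not values
    taken from the study.
    """
    candidates = {}
    for gene, profile in expression.items():
        if gene in seeds:
            continue
        partners = sorted(
            s for s in seeds
            if s in expression and pearson(profile, expression[s]) >= threshold
        )
        if len(partners) >= min_partners:
            candidates[gene] = partners
    return candidates


# Toy expression matrix: gene -> expression level under five conditions.
# All names and numbers are invented for this example.
expression = {
    "seed_A": [1.0, 2.0, 3.0, 4.0, 5.0],
    "seed_B": [2.0, 4.1, 6.0, 8.2, 10.0],
    "gene_X": [0.5, 1.1, 1.4, 2.1, 2.5],   # rises with both seeds
    "gene_Y": [5.0, 1.0, 4.0, 2.0, 3.0],   # unrelated pattern
}
seeds = {"seed_A", "seed_B"}

result = expand_seeds(expression, seeds)
print(result)
```

In this toy input only `gene_X` tracks both seed profiles closely enough to be kept as a candidate. A real analysis would work on many more conditions and would group genes into modules rather than testing them one at a time.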
Keywords: Switchgrass; Plant cell wall; Homology mapping; Co-expression analysis
This work was supported in part by the National Science Foundation (DEB-0830024 and DBI-0542119) and the DOE BioEnergy Science Center grant (DE-PS02-06ER64304), which is supported by the Office of Biological and Environmental Research in the Department of Energy Office of Science. This work was also supported in part by the Agriculture Experiment Station and the Biochemical Spatiotemporal Network Resource Center (3SP680) of South Dakota State University.
XC and QM participated in the coordination of the paper, carried out or participated in all the analyses of transcriptomic data and the comparative genomics framework, and drafted the manuscript; XM participated in framework design. YT provided the transcriptomic data along with relevant data details, XR offered biology guidance in co-expression analysis, and YW and GL proved the TF prediction results. CZ designed the network analysis part. RAD reviewed and edited the paper and assisted in interpretation of data, and YX conceived the study, participated in its design and coordination, and revised the manuscript. All authors read and approved the final manuscript.