Skip to main content
Log in

Conserved nucleotide sequences in highly expressed genes in plants

  • Published:
Journal of Genetics Aims and scope Submit manuscript

Abstract

Genes that code for proteins expressed at high and low levels in plants were classified into separate data sets. The two data sets were analysed to identify the conserved nucleotide sequences that may characterize genes with contrasting levels of expression. The AUG context that characterized the highly expressed genes is (A/C)N2AAN3(A/T)T(A/C) AACAATGGCTNCC(T/A)CNA(C/T)(A/C). The data set of highly expressed genes shows overrepresentation of codons for alanine at the second position and serine at the third and fourth positions after the translation initiation codon. The characteristic transcription initiation site in the highly expressed genes is CAN(A/C)(A/C)(C/A)C(C/A)N2A(C/A). The promoter region is characterized by two tandemly repeated TATA elements, sometimes with one and rarely with two point mutations in the highly expressed genes. Besides the two tandemly repeated TATA elements, the promoter context in the highly expressed genes is overrepresented by C, C and G at the -3, -1 and+9 positions respectively. The characteristic TATA motif in the highly expressed plant genes is (T/C)(T/A)N2TCACTATATATAG. Most of these features are not present in the genes ubiquitously expressed at low levels in plants.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Bachmair A., Finley D. and Varshavsky A. 1986 In vivo half-life of a protein is a function of its amino terminal residue.Science 234, 179–186.

    Article  PubMed  CAS  Google Scholar 

  • Baralle F E. and Brownlee G. G. 1978 AUG is the only recognizable signal sequence in the 5’ non-coding regions of eukaryotic mRNA.Nature 274, 84–87.

    Article  PubMed  CAS  Google Scholar 

  • Benzerra I. C., Luiz A. B., Neshich G. and Almeida E. R. 1995 A corn-specific gene encodes tarin, a major globulin of taro.Plant Mol. Biol. 28, 137–144.

    Article  Google Scholar 

  • Berry-Lowe S. L., McKnight T. O., Shah D. M. and Meagher R. B. 1982 The nucleotide sequence, expression and evolution of one member of a multigene family encoding the small subunit of ribulose-1,5-bisphosphate carboxylase in soybean.J. Mol. Appl. Genet. 1, 483–498.

    PubMed  CAS  Google Scholar 

  • Breathnach R. and Chambon P. 1981 Organization and expression of eukaryotic split genes coding for proteins.Annu. Rev. Biochem. 50, 349–383.

    Article  PubMed  CAS  Google Scholar 

  • Breen J. P. and Crouch M. L. 1992 Molecular analysis of a cruciferin storage protein gene family ofBrassica napus.Plant Mol. Biol. 19, 1049–1055.

    Article  PubMed  CAS  Google Scholar 

  • Breton C., Chaboud A. M., Rochon E., Bates E. M., Cock J. M., Formm H. and Dumas C. 1995 PCR-generated cDNA library of transition stage maize embryos: cloning and expression of calmodulin gene during early embryogenesis.Plant Mol. Biol. 27, 105–113.

    Article  PubMed  CAS  Google Scholar 

  • Cavener D. R. 1987 Comparison of the consensus sequence flanking translation start sites inDrosophila and vertebrates.Nucl. Acids Res. 15, 1353–1361.

    Article  PubMed  CAS  Google Scholar 

  • Chen W. and Struhl K. 1988 Saturation mutagenesis of a yeasthis3 TATA element: Genetic evidence for a specific TATA binding protein.Proc. Natl. Acad. Sci. USA 85, 2691–2695.

    Article  PubMed  CAS  Google Scholar 

  • Damme E. J. M., Barre A., Rouge P., Leuven E and Peumans W. J. 1995 The seed lectin of black locust(Robinia pseudoacacia) are encoded by two genes which differ from the bark lectin genes.Plant Mol. Biol. 29, 1197–1210.

    Article  PubMed  Google Scholar 

  • Gadner E. S., Holnstroem K. O., De Paiva G. R., De Castro L.-A. B., Carneiru M. and Grossi De Sa M. F. 1991 Isolation, characterization and expression of a gene coding for a 2S albumin fromBertholletia excelsa (Brazil nut).Plant Mol. Biol. 16, 437–448.

    Article  Google Scholar 

  • Gallie D. R. and Walbot V. 1992 Identification of motifs within the tobacco mosaic virus 5’ leader responsible for enhancing translocation.Nucl. Acids Res. 20, 4361–4368.

    Article  Google Scholar 

  • Hagenbuchle O., Santer M. and Steitz J. A. 1978 Conservation of the primary structure at the 3’ end of 18S rRNA from eukaryotic cells.Cell 13, 551–563.

    Article  PubMed  CAS  Google Scholar 

  • Hamilton R., Watanabe C. K. and Boer H. A. 1987 Compilation and comparison of the sequence context around the AUG start codons inSaccharomyces cerevisiae mRNAs.Nucl. Acids Res. 115, 3581–3593.

    Article  Google Scholar 

  • Heidecker G. and Messing J. 1986 Structure analysis of plant genes.Annu. Rev. Plant Physiol. 37, 439–466.

    Article  CAS  Google Scholar 

  • Hong J. C., Nagao R. T. and Key J. L. C. 1990 Characterization of a proline-rich cell wall protein gene family of soybean: A comparative analysis.J. Biol. Chem. 265, 2470–2475.

    PubMed  CAS  Google Scholar 

  • Hsing Y. C., Chen Z., Shih M., Hsieh J. and Chow T. 1995 Unusual sequence of group 3 LEA mRNA inducible by maturation or drying in soybean seeds.Plant Mol. Biol. 29, 863–868.

    Article  PubMed  CAS  Google Scholar 

  • Hua S., Dube S. K., Barnett N. M. and Kung S. B. 1991 Nucleotide sequence of gene Oef-2 and its cDNA encoding 23kDa polypeptide of oxygen-evolving complex in photosystem II from tobacco.Plant Mol. Biol. 17, 551–553.

    Article  PubMed  CAS  Google Scholar 

  • Joshi C. P. 1987 An inspection of the domain between putative TATA box and translation start site in 79 plant genes.Nucl. Acids Res. 15, 6643–6653.

    Article  PubMed  CAS  Google Scholar 

  • Joshi C. P., Zhou H., Huang X. and Chiang V. L. 1997 Context sequences of translation initiation codon in plants.Plant Mol. Biol. 35, 993–1001.

    Article  PubMed  CAS  Google Scholar 

  • Kozak M. 1980 Role of ATP in binding and migration of 40S ribosomal subunits.Cell 22, 7–8.

    Article  PubMed  CAS  Google Scholar 

  • Kozak M. 1984 Compilation and analysis of sequences upstream from the translational start site in eukaryotic mRNAs.Nucl. Acids Res. 12, 857–872.

    Article  PubMed  CAS  Google Scholar 

  • Kozak M. 1986 Point mutations define a sequence flanking the AUG initiator codon that modulate translation by eukaryotic ribosomes.Cell 44, 283–292.

    Article  PubMed  CAS  Google Scholar 

  • Kozak M. 1987a At least six nucleotides preceding the AUG initiator codon enhance translation in mammalian cells.J. Mol. Biol. 196, 947–950.

    Article  PubMed  CAS  Google Scholar 

  • Kozak M. 1987b An analysis of 5’ non coding sequence from 699 vertebrate messenger RNAs.Nucl. Acids Res. 15, 8125–8148.

    Article  PubMed  CAS  Google Scholar 

  • Kuster H., Schroder G., Fruhling M., Pich U., Rieping M., Schubert I., Perlick A. M. and Puhler A. 1995 The nodule specific V/ENOD-GRP3 gene encoding a glycine-rich early nodulin is located on chromosome 1 ofVicia faba L. and is predominantly expressed in the interzone II-III of root needles.Plant Mol. Biol. 28, 405–421.

    Article  PubMed  CAS  Google Scholar 

  • Lamppa G. and Jacks C. 1991 Analysis of two linked genes coding for acyl carrier protein (ACP) fromArabidopsis thaliana.Plant Mol. Biol. 16, 469–474.

    Article  PubMed  CAS  Google Scholar 

  • Leutwiller S., Meyerowitz M. and Tobin M. 1986 Structure and expression of three light-harvesting chlorophyll a/b binding genes inArabidopsis thaliana.Nucl. Acids Res. 14, 4051–4064.

    Article  Google Scholar 

  • Lindstrom J. T., Chu B. and Belanger E C. 1993 Isolation and characterization of anArabidopsis thaliana gene for the 54kDa subunit of the signal recognition particle.Plant Mol. Biol. 23, 1265–1272.

    Article  PubMed  CAS  Google Scholar 

  • Miao Z. H., Liu X. and Lam E. E. L. 1994 TGA3 is a distinct member of the TGA family of BZIP transcription factor inArabidopsis thaliana.Plant Mol. Biol. 25, 1–11.

    Article  PubMed  CAS  Google Scholar 

  • Mukumoto F., Hirose S., Imaeski H. and Yamazaki K. 1993 DNA sequence requirement of a TATA element-binding protein fromArabidopsis for transcriptionin vitro.Plant Mol. Biol. 23, 995–1003.

    Article  PubMed  CAS  Google Scholar 

  • Ohta M., Sugita M. and Sugiura M. 1995 Three types of nuclear genes encoding chloroplast RNA-binding proteins (cp29, cp31 and cp33) are present inArabidopsis thaliana: presence of cp31 in chloroplast and its homologue in nuclei/cytoplasm.Plant Mol. Biol. 25, 529–539.

    Article  Google Scholar 

  • Peña E., Lopez A. and Jimenez S. 1995 Synthesis of ribosomal proteins from stored mRNAs early in seed germination.Plant Mol. Biol. 28, 327–336.

    Article  Google Scholar 

  • Rocher A. O. and Vierling E. 1995 Cytoplasmic HSP70 homologues of pea: differential expression in vegetative and embryonic organs.Plant Mol. Biol. 27, 441–450.

    Article  Google Scholar 

  • Ruan Y., Gilmore J. and Conner T. 1998 TowardsArabidopsis genome analysis: monitoring expression profiles of 1400 genes using cDNA microarrays.Plant J. 15, 821–833.

    Article  PubMed  CAS  Google Scholar 

  • Sasaki T., Song J., Koga-Ban Y., Matsui E., Fang F., Higo H.et al. 1994 Towards cataloguing all rice genes: large-scale sequencing of randomly chosen rice cDNAs from a callus cDNA library.Plant J. 6, 615–624.

    Article  PubMed  CAS  Google Scholar 

  • Slabas A. R., Fordham-Skelton A. P., Fletcher D., Martinez-Rivas J. M., Swinhoe R., Croy R. D. and Evans T. M. 1994 Characterisation of cDNA and genomic clones encoding homologues of the 65kDa regulatory subunit of protein phosphatase 2A inArabidopsis thaliana.Plant Mol. Biol. 26, 1125–1138.

    Article  PubMed  CAS  Google Scholar 

  • Srinivasan R. and Oliver D. J. 1995 Light dependent and tissue specific expression of the H-protein of glycine decarboxylase complex.Plant Physiol. 109, 161–168.

    Article  PubMed  CAS  Google Scholar 

  • Stiles J. I., Szostak J. W., Young A. T., Wu R., Consaul S. and Sherman F. 1981 DNA sequence of a mutation in the leader region of the yeast iso-1-cytochrome c mRNA.Cell 25, 277–284.

    Article  PubMed  CAS  Google Scholar 

  • Szekeres M., Haizel T., Adam E. and Nagy R 1995 Molecular characterization and expression of a tobacco histone H1 cDNA.Plant Mol. Biol. 27, 597–605.

    Article  PubMed  CAS  Google Scholar 

  • Wanner L., Li G., Ware D., Somssich I. C. and Davis K. R. 1995 The phenylalanine ammonia-lyase gene family inArabidopsis thaliana.Plant Mol. Biol. 27, 327–338.

    Article  PubMed  CAS  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rakesh Tuli.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Sawant, S.V., Singh, P.K., Gupta, S.K. et al. Conserved nucleotide sequences in highly expressed genes in plants. J Genet 78, 123–131 (1999). https://doi.org/10.1007/BF02924562

Download citation

  • Received:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02924562

Keywords

Navigation