Skip to main content

Analyzing Protein Sequences Using Signal Analysis Techniques

  • Chapter

Abstract

This chapter discusses the use of frequency and time-frequency signal processing methods for the analysis of protein sequence data. The amino acid sequence of a protein may be considered as a twenty symbol alphabet sequence, or it may be considered as a sequence of numerical values reflecting various physicochemical aspects of the amino acids such as hydrophobicity, bulkiness, or electron-ion interaction potential. When primary protein sequence information is mapped into numerical values, it is possible to treat the sequence as a signal and apply well known signal processing methods for analysis. These methods allow proteins to be clustered into functional families and can also lead to the identification of biologically active sights. This chapter discusses frequency and time-frequency methods for protein sequence analysis and illustrates these concepts using various protein families. In addition, a method for selecting appropriate numerical mappings of amino acids is introduced.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Arce, G. R. and Hasan, S. R. (2000). “Elimination of Interference Terms of the Discrete Wigner Distribution Using Nonlinear Filtering.” IEEE Transactions on Signal Processing 48:2321–2331.

    Article  Google Scholar 

  • Auger, F. and Flandrin, P. (1995). “Improving the Readability of Time-frequency and Time-scale Representations by the Reassignment Method.” IEEE Transactions on Signal Processing 43:1068–1089.

    Article  Google Scholar 

  • Clarke, N. D. (1995). “Covariation of Residues in the Homeodomain Sequence Family.” Protein Science 4:2269–2278.

    Article  PubMed  CAS  Google Scholar 

  • Cosic, I. (1994). “Macromolecular Bioactivity: Is It Resonant Interaction Between Macromolecules?-Theory and Applications.” IEEE Transactions on Biomedical Engineering 41:1101–1114.

    Article  PubMed  CAS  Google Scholar 

  • Cosic, I. (1997). The Resonant Recognition Model of Macromolecular Bioactivity. Basel: Birkhauser Verlag.

    Google Scholar 

  • Eisenberg, D. (1984). “Three-dimensional Structure of Membrane and Surface Proteins.” Annual Reviews in Biochemistry 53:595–623.

    Article  CAS  Google Scholar 

  • Fang, Q. and Cosic, I. (1999). “Prediction of Active Sites of Fibroblast Growth Factors Using Continuous Wavelet Transforms and the Resonant Recognition Model.” In: Proceedings of The Inaugural Conference of the Victorian Chapter of the IEEE EMBS, pp. 211–214.

    Google Scholar 

  • Fauchere, J. L., Charton, M., Kier, L. B., Verloop, A., and Pliska, V. (1988). “Amino Acid Side Chain Parameters for Correlation Studies in Biology and Pharmacology.” International Journal of Peptide Protein Research 32:269–278.

    Article  CAS  Google Scholar 

  • GenomeNet. http://www.genome.ad.jp/dbget/aaindex.html . Kyoto University: Institute for Chemical Research.

  • Hunter, L. (1993). “Molecular Biology for Computer Scientists.” In: Hunter, L., ed. Artificial Intelligence and Molecular Biology, pp. 1–46. Menlo Park: AAAI Press.

    Google Scholar 

  • Kawashima, S. and Kanehisa, M. (2000). “Aaindex: Amino Acid Index Database.” Nucleic Acids Research 28:374–5.

    Article  PubMed  CAS  Google Scholar 

  • Kyte, J. and Doolittle, R. F. (1982). “A Simple Method for Displaying the Hydropathic Character of a Protein.” Journal of Molecular Biology 157: 105–132.

    Article  PubMed  CAS  Google Scholar 

  • Lambert, F. L. (2002). “Disorder-A Cracked Crutch for Supporting Entropy Discussions.” Journal of Chemical Education 79:187–192.

    Article  CAS  Google Scholar 

  • Mroczkowski, B. and Hall, R. (1990). “Epidermal Growth Factor: Biology and Properties of Its Gene and Protein Precursor.” In: Habenicht, A., ed. Growth Factors, Differentation Factors and Cytokines, Berlin: Springer-Verlag.

    Google Scholar 

  • Pennisi, E. (2000). “Genomics Comes of Age.” Science 290:2220–2221.

    Article  PubMed  CAS  Google Scholar 

  • Pierce, J. R. (1980). An Introduction to Information Theory, Symbols, Signals and Noise. New York: Dover Publications, Inc.

    Google Scholar 

  • Qian, S. and Chen, D. (1996). Joint Time-Frequency Analysis: Methods and Applications. Upper Saddle River, NJ: Prentice-Hall PTR.

    Google Scholar 

  • Shannon, C. E. (1948). “A Mathematical Theory of Communication.” Bell Systems Technical Report 27:379–423.

    Google Scholar 

  • Tomii, K. and Kanehisa, M. (1996). “Analysis of Amino Acids and Mutation Matrices for Sequence Comparison and Structure Prediction of Proteins.” Protein Engineering 9:27–36.

    Article  PubMed  CAS  Google Scholar 

  • Veljkovic, V., Cosic, I., Dimitrjevic, B., and Lalovic, D. (1985). “Is It Possible to Analyze DNA and Protein Sequences by the Methods of Digital Signal Processing?” IEEE Transactions on Biomedical Engineering 32:337–341.

    Article  PubMed  CAS  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer Science+Business Media, Inc.

About this chapter

Cite this chapter

Bloch, K.M., Arce, G.R. (2006). Analyzing Protein Sequences Using Signal Analysis Techniques. In: Zhang, W., Shmulevich, I. (eds) Computational and Statistical Approaches to Genomics. Springer, Boston, MA. https://doi.org/10.1007/0-387-26288-1_9

Download citation

Publish with us

Policies and ethics