Abstract
In everyday life, sentences and words convey meaning through symbols. In living cells, proteins play a role analogous to sentences and words. In this chapter, we present a semantic analysis of protein sequences in order to explain what protein structures may be telling the cell. The goals of this analysis are to find basic words and their meanings, and from there to analyze the structure of the grammar, syntax and semantics. Just as in human languages, there are words in protein sequences. Based on information theory and statistics, we define local words and give an algorithm to search for them. Using combinatorial graph theory, we describe key words and core words and analyze their properties. Furthermore, we present search algorithms for key words and core words. These words may play a significant role in protein sequences. Finally, we apply these concepts to the identification of homologous protein structures.
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
(2008). Semantic Analysis for Protein Primary Structure. In: Theory and Mathematical Methods for Bioinformatics. Biological and Medical Physics, Biomedical Engineering. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74891-5_15
DOI: https://doi.org/10.1007/978-3-540-74891-5_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74890-8
Online ISBN: 978-3-540-74891-5
eBook Packages: Biomedical and Life Sciences (R0)