Classifying G-Protein Coupled Receptors with Hydropathy Blocks and Support Vector Machines
This paper developes a new method for recognizing G-Protein Coupled Receptors (GPCRs) based on features generated from the hydropathy properties of the amino acid sequences. Using the hydropathy characteristics, namely hydropathy blocks, the protein sequences are converted into fixed-dimensional feature vectors. Subsequently, the Support Vector Machine (SVM) classifier is utilized to identify the GPCR proteins belonging to the same families or subfamilies. The experimental results on GPCR datasets show that the proteins belonging to the same family or subfamily can be identified using features generated based on the hydropathy blocks.
KeywordsSupport Vector Machine Protein Sequence Feature Vector Dipeptide Composition Amino Acid Alphabet
Unable to display preview. Download preview PDF.
- 6.Vapnik, V.: The Nature of Statistical Learning Theory, 2nd edn. Springer, Berlin, Heidelberg, New York (1996)Google Scholar
- 16.Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines. Cambridge University Press, Cambridge, United Kingdom (1986)Google Scholar