Abstract
G protein-coupled receptors (GPCRs) are a large and heterogeneous superfamily of receptors that play a key role in cells as transmitters of extracellular signals. Class C GPCRs, in particular, are of great interest in pharmacology. The lack of knowledge about their full 3-D structure motivates the use of their primary amino acid sequences to build robust classifiers capable of discriminating between their subtypes. In this paper, we describe the use of feature selection techniques to build Support Vector Machine (SVM)-based classification models from selected receptor subsequences described as n-grams. We show that this approach to classification is useful for finding class C GPCR subtype-specific motifs.
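As an illustration of what "subsequences described as n-grams" can mean in practice, the following minimal sketch counts overlapping n-grams in amino acid sequences and arranges the counts as feature vectors. It is not the authors' implementation: the function names `ngram_counts` and `build_feature_matrix`, the toy sequences, and the choice of raw counts as feature values are all assumptions made for the example.

```python
from collections import Counter


def ngram_counts(sequence, n):
    """Count all overlapping n-grams (length-n substrings) in a sequence."""
    return Counter(sequence[i:i + n] for i in range(len(sequence) - n + 1))


def build_feature_matrix(sequences, n_values=(1, 2)):
    """Map sequences to count vectors over the n-grams observed in them.

    Returns the sorted n-gram vocabulary and one row of counts per sequence.
    """
    per_sequence = []
    for seq in sequences:
        counts = Counter()
        for n in n_values:
            counts.update(ngram_counts(seq, n))
        per_sequence.append(counts)

    # The vocabulary is the union of all n-grams seen in any sequence.
    vocabulary = sorted(set().union(*per_sequence))
    # A Counter returns 0 for n-grams that a sequence does not contain.
    matrix = [[counts[gram] for gram in vocabulary] for counts in per_sequence]
    return vocabulary, matrix


# Hypothetical toy sequences, not real receptor data.
toy_sequences = ["MKTLLA", "MKALLV"]
vocabulary, matrix = build_feature_matrix(toy_sequences, n_values=(2,))
```

Under these assumptions, each column of `matrix` corresponds to one n-gram in `vocabulary`. A feature selection step would then keep a subset of columns before an SVM is trained on them, and the n-grams that survive selection are the candidates for subtype-specific motifs.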
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
König, C., Alquézar, R., Vellido, A., Giraldo, J. (2014). Finding Class C GPCR Subtype-Discriminating N-grams through Feature Selection. In: Saez-Rodriguez, J., Rocha, M., Fdez-Riverola, F., De Paz Santana, J. (eds) 8th International Conference on Practical Applications of Computational Biology & Bioinformatics (PACBB 2014). Advances in Intelligent Systems and Computing, vol 294. Springer, Cham. https://doi.org/10.1007/978-3-319-07581-5_11
DOI: https://doi.org/10.1007/978-3-319-07581-5_11
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-07580-8
Online ISBN: 978-3-319-07581-5
eBook Packages: Engineering, Engineering (R0)