Pattern Recognition for Subfamily Level Classification of GPCRs Using Motif Distillation and Distinguishing Power Evaluation
G protein-coupled receptors (GPCRs) are one of the most prominent and abundant families of membrane proteins in the human genome. Because they are the main targets of many drugs, GPCR research has grown significantly in recent years. However, only a few GPCR structures are known, which remains an important challenge. The classification of GPCRs is therefore a significant problem, driven by the widening gap between the many orphan GPCR sequences and the small number of annotated ones. This work combines motif distillation with defined parameters, a distinguishing power evaluation method, and the general weighted set cover problem to determine the minimum set of motifs that covers a particular GPCR subfamily. Our results indicate that, in the Family A Peptide subfamily, 91% of all proteins listed in GPCRdb can be covered using only 691 distinct motifs, which can later serve as a valuable resource for developing a third-level GPCR classification tool.
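The weighted set cover step mentioned above can be illustrated with a standard greedy approximation. This is a minimal sketch under stated assumptions, not the authors' implementation: the abstract does not say which solver or weighting was used, and the function name, motif identifiers, protein identifiers, and costs below are all hypothetical. Each motif is assumed to carry a cost (for example, one derived from its distinguishing power, with more discriminative motifs being cheaper), and the loop stops once a target fraction of the subfamily is covered.

```python
from typing import Dict, List, Set


def greedy_weighted_set_cover(
    motif_hits: Dict[str, Set[str]],
    motif_cost: Dict[str, float],
    target_coverage: float = 1.0,
) -> List[str]:
    """Greedily pick motifs with the lowest cost per newly covered protein.

    motif_hits maps a motif to the set of proteins it matches;
    motif_cost maps a motif to a positive cost (hypothetical weighting).
    """
    universe: Set[str] = set()
    for hits in motif_hits.values():
        universe |= hits
    needed = target_coverage * len(universe)

    covered: Set[str] = set()
    chosen: List[str] = []
    while len(covered) < needed:
        best, best_ratio = None, float("inf")
        for motif, hits in motif_hits.items():
            gain = len(hits - covered)  # proteins this motif would newly cover
            if gain == 0:
                continue
            ratio = motif_cost[motif] / gain
            if ratio < best_ratio:
                best, best_ratio = motif, ratio
        if best is None:  # no remaining motif adds coverage
            break
        chosen.append(best)
        covered |= motif_hits[best]
    return chosen


# Toy data (hypothetical motifs m1..m4 and proteins P1..P4).
example_hits = {
    "m1": {"P1", "P2", "P3"},
    "m2": {"P3", "P4"},
    "m3": {"P4"},
    "m4": {"P1", "P2", "P3", "P4"},
}
example_cost = {"m1": 1.0, "m2": 1.0, "m3": 0.5, "m4": 5.0}

selected = greedy_weighted_set_cover(example_hits, example_cost)
```

On the toy data, full coverage is reached with two motifs, while lowering `target_coverage` to 0.75 stops after the first. The greedy rule does not guarantee a minimum-cost cover; it is shown here only because it is a common, simple approximation for this problem.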
Keywords: G-protein coupled receptors, data mining, pattern recognition