Prediction of the O-Glycosylation with Secondary Structure Information by Support Vector Machines
Mucin-type O-glycosylation is one of the main types of mammalian protein glycosylation. It is specific to serine (Ser) or threonine (Thr) residues, but no consensus sequence has yet been identified. In this report, support vector machines (SVMs) are used to predict O-glycosylation at each Ser or Thr site in protein sequences. Twenty-nine mammalian protein sequences are selected from UniProt 8.0, and their structure information is obtained from the Protein Data Bank (PDB). A protein subsequence centered on the target Ser or Thr site is used as input to the SVM; its amino acid sequence, together with the secondary structure or accessibility calculated by DSSP from the PDB data, is encoded as the input data. Preliminary experiments show that adding local structure information to the sequence information is effective.
Keywords: Prediction SVM O-glycosylation Protein Secondary Structure
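The window encoding described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the window half-width, the one-hot scheme, the use of `-` for residues without a DSSP assignment, and the function names `encode_site` and `candidate_sites` are all assumptions made for the example. The abstract does not state the window size or the exact encoding.

```python
# Hypothetical sketch: one-hot encoding of a sequence window around a Ser/Thr site,
# with a per-residue secondary-structure label appended to each position.

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard residues
DSSP_STATES = "HBEGITS-"               # classic eight DSSP classes; '-' stands in for "no assignment"


def one_hot(symbol, alphabet):
    """Return a one-hot list for symbol; all zeros if symbol is not in alphabet."""
    return [1.0 if symbol == a else 0.0 for a in alphabet]


def candidate_sites(sequence):
    """Indices of every Ser (S) or Thr (T) residue, i.e. the prediction targets."""
    return [i for i, aa in enumerate(sequence) if aa in "ST"]


def encode_site(sequence, structure, center, half_window=5):
    """Encode the window [center - half_window, center + half_window].

    Each position contributes 20 sequence features plus 8 structure features.
    Positions that fall outside the protein are zero-padded.
    half_window=5 is an assumed value, not one taken from the paper.
    """
    if len(sequence) != len(structure):
        raise ValueError("sequence and structure must have the same length")
    if sequence[center] not in "ST":
        raise ValueError("center must be a Ser or Thr residue")

    per_position = len(AMINO_ACIDS) + len(DSSP_STATES)
    features = []
    for pos in range(center - half_window, center + half_window + 1):
        if 0 <= pos < len(sequence):
            features += one_hot(sequence[pos], AMINO_ACIDS)
            features += one_hot(structure[pos], DSSP_STATES)
        else:
            features += [0.0] * per_position
    return features


# Toy input (made up for illustration, not data from the paper).
toy_sequence = "MKTAS"
toy_structure = "--HHE"
toy_vectors = [encode_site(toy_sequence, toy_structure, i, half_window=1)
               for i in candidate_sites(toy_sequence)]
```

Each resulting vector has a fixed length of `(2 * half_window + 1) * 28`, so the vectors for all Ser/Thr sites can be stacked into a matrix and passed to any SVM implementation. Solvent accessibility could be appended per position in the same way as the structure label.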